Top Banner
Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality Piotr Indyk, Rajeev Motwa ni The 30 th annual ACM symposium on theory of computing 1998
13

Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Jan 01, 2016

Download

Documents

hoyt-mccray

Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality. Piotr Indyk, Rajeev Motwani. The 30 th annual ACM symposium on theory of computing 1998. Problems. Nearest neighbor (NN) problem: - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Approximate Nearest Neighbors: Towards Removing the Curse of

Dimensionality

Piotr Indyk, Rajeev Motwani

The 30th annual ACM symposium on theory of computing 1998

Page 2: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Problems

• Nearest neighbor (NN) problem:– Given a set of n points P={p1, …, pn} in some metric sp

ace X, preprocess P so as to efficiently answer queries which require finding the point in P closest to a query point qX.

• Approximate nearest neighbor (ANN) problem:– Find a point pP that is an –approximate nearest nei

ghbor of the query q in that for all p'P, d(p,q)(1+)d(p',q).

Page 3: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Motivation

• The nearest neighbors problem is of major importance to a variety of applications, usually involving similarity searching. – Data compression– Databases and data mining– Information retrieval– Image and video databases– Machine learning – Pattern recognition – Statistics and data analysis

• Curse of dimensionality– The curse of dimensionality is a term coined by Richard

Bellman to describe the problem caused by the exponential increase in volume associated with adding extra dimensions to a (mathematical) space.

Page 4: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Overview of results and techniques

• These results are obtained by reducing -NNS to a new problem: point location in equal balls.

Page 5: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

nearest neighbor search (NNS)-nearest neighbor search (NNS)

Ring-Cover Trees

Point location in equal balls (PLEB)- Point location in equal balls (PLEB)

Locality-Sensitive Hashing

Proposition 1 Proposition 2

The Bucketing method

Proposition 3

Random projections

Content

Page 6: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Definitions

Page 7: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Theorems

Page 8: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Constructing Ring-cover trees

Page 9: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Analysis of Ring-cover trees

Page 10: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Definitions

Page 11: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

Locality-Sensitive Hashing

Page 12: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

The Bucketing method

• We decompose each ball into a bounded number of cells and store them in a dictionary.

• The bucketing algorithm works for any lp norm.

Page 13: Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality

J. L. Lemma