On The Geometry Of Similarity Search Dimensionality Curse And Concentration Of Measure
Pestov Vladimir. Information Processing Letters 1999
[Paper]
NEURIPS
We suggest that the curse of dimensionality affecting the similarity-based
search in large datasets is a manifestation of the phenomenon of concentration
of measure on high-dimensional structures. We prove that, under certain
geometric assumptions on the query domain and the dataset , if
satisfies the so-called concentration property, then for most query
points the ball of radius centred at
contains either all points of or else at least of
them. Here is the distance from to the nearest neighbour
in and is the dimension of .
Similar Work