On Fast Bounded Locality Sensitive Hashing
Wygocki Piotr. Arxiv 2017
[Paper]
ARXIV
Independent
In this paper, we examine the hash functions expressed as scalar products,
i.e., , for some bounded random vector . Such hash functions
have numerous applications, but often there is a need to optimize the choice of
the distribution of . In the present work, we focus on so-called
anti-concentration bounds, i.e. the upper bounds of . In many applications, is a vector of independent random
variables with standard normal distribution. In such case, the distribution of
is also normal and it is easy to approximate . Here, we consider two bounded distributions in the context of
the anti-concentration bounds. Particularly, we analyze being a random
vector from the unit ball in and being a random vector from
the unit sphere in . We show optimal up to a constant anti-concentration
measures for functions .
As a consequence of our research, we obtain new best results for \newline
\textit{-approximate nearest neighbors without false negatives} for in
high dimensional space for all , for
. These results improve over those
presented in [16]. Finally, our paper reports progress on answering the open
problem by Pagh~[17], who considered the nearest neighbor search without false
negatives for the Hamming distance.
Similar Work