Coveringlsh Locality-sensitive Hashing Without False Negatives
Pagh Rasmus. Arxiv 2015
[Paper]
ARXIV
Independent
LSH
We consider a new construction of locality-sensitive hash functions for
Hamming space that is covering in the sense that is it guaranteed to
produce a collision for every pair of vectors within a given radius . The
construction is efficient in the sense that the expected number of hash
collisions between vectors at distance~, for a given , comes close to
that of the best possible data independent LSH without the covering guarantee,
namely, the seminal LSH construction of Indyk and Motwani (STOC ‘98). The
efficiency of the new construction essentially matches their bound when
the search radius is not too large — e.g., when ,
where is the number of points in the data set, and when
where is an integer constant. In general, it differs by at most a factor
in the exponent of the time bounds. As a consequence, LSH-based
similarity search in Hamming space can avoid the problem of false negatives at
little or no cost in efficiency.
Similar Work