Locality-sensitive Hashing Scheme Based On Longest Circular Co-substring | Awesome Learning to Hash Add your paper to Learning2Hash

Locality-sensitive Hashing Scheme Based On Longest Circular Co-substring

Yifan Lei, Qiang Huang, Mohan Kankanhalli, Anthony K. H. Tung . Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data 2020 – 27 citations

[Paper]   Search on Google Scholar   Search on Semantic Scholar
Datasets Distance Metric Learning Hashing Methods Locality-Sensitive-Hashing Tools & Libraries

Locality-Sensitive Hashing (LSH) is one of the most popular methods for (c)-Approximate Nearest Neighbor Search ((c)-ANNS) in high-dimensional spaces. In this paper, we propose a novel LSH scheme based on the Longest Circular Co-Substring (LCCS) search framework (LCCS-LSH) with a theoretical guarantee. We introduce a novel concept of LCCS and a new data structure named Circular Shift Array (CSA) for (k)-LCCS search. The insight of LCCS search framework is that close data objects will have a longer LCCS than the far-apart ones with high probability. LCCS-LSH is LSH-family-independent, and it supports (c)-ANNS with different kinds of distance metrics. We also introduce a multi-probe version of LCCS-LSH and conduct extensive experiments over five real-life datasets. The experimental results demonstrate that LCCS-LSH outperforms state-of-the-art LSH schemes.

Similar Work