Locality Sensitive Hashing For Efficient Similar Polygon Retrieval
Kaplan Haim, Tenenbaum Jay. Arxiv 2021
[Paper]
ARXIV
Independent
LSH
Locality Sensitive Hashing (LSH) is an effective method of indexing a set of
items to support efficient nearest neighbors queries in high-dimensional
spaces. The basic idea of LSH is that similar items should produce hash
collisions with higher probability than dissimilar items.
We study LSH for (not necessarily convex) polygons, and use it to give
efficient data structures for similar shape retrieval. Arkin et al. represent
polygons by their “turning function” - a function which follows the angle
between the polygon’s tangent and the -axis while traversing the perimeter
of the polygon. They define the distance between polygons to be variations of
the (for ) distance between their turning functions. This metric
is invariant under translation, rotation and scaling (and the selection of the
initial point on the perimeter) and therefore models well the intuitive notion
of shape resemblance.
We develop and analyze LSH near neighbor data structures for several
variations of the distance for functions (for ). By applying our
schemes to the turning functions of a collection of polygons we obtain
efficient near neighbor LSH-based structures for polygons. To tune our
structures to turning functions of polygons, we prove some new properties of
these turning functions that may be of independent interest.
As part of our analysis, we address the following problem which is of
independent interest. Find the vertical translation of a function that is
closest in distance to a function . We prove tight bounds on the
approximation guarantee obtained by the translation which is equal to the
difference between the averages of and .
Similar Work