Publication details

 

Similarity Join in Metric Spaces

Basic information
Original title:Similarity Join in Metric Spaces
Authors:Pavel Zezula, Vlastislav Dohnal, Claudio Gennaro, Pasquale Savino
Further information
Citation:ZEZULA, Pavel - DOHNAL, Vlastislav - GENNARO, Claudio - SAVINO, Pasquale. Similarity Join in Metric Spaces. In Proceedings of the European Conference on Information Retrieval Research. Vyd. LNCS 2633. Berlin : Springer-Verlag, 2003. ISBN 3-540-01274-5, pp. 452-467. 14.4.2003, Pisa, Italy.
Original language:English
Field:Computer hardware and software
Type:Article in Proceedings
Keywords:similarity join; index structures; performance; text management

Similarity join in distance spaces constrained by the metric postulates is the necessary complement of more famous similarity range and the nearest neighbors search primitives. However, the quadratic computational complexity of similarity joins prevents from applications on large data collections. We first study the underlying principles of such joins and suggest three categories of implementation strategies based on filtering, partitioning, or similarity range searching. Then we study an application of the D-index to implement the most promising alternative of range searching. Though also this approach is not able to eliminate the intrinsic quadratic complexity of similarity joins, significant performance improvements are confirmed by experiments.

Related projects: