Publication details

A Metric Index for Approximate Text Management

Basic information
Original title: A Metric Index for Approximate Text Management
Authors: Vlastislav Dohnal, Claudio Gennaro, Pavel Zezula
Further information
Citation: DOHNAL, Vlastislav - GENNARO, Claudio - ZEZULA, Pavel. A Metric Index for Approximate Text Management. In Information Systems and Databases. Anaheim - Calgary - Zurich : ACTA Press, 2002. ISBN 0-88986-362-8, pp. 37-42. September 25-27, 2002, Tokyo, Japan.
Original language: English
Field: Information theory
Type: Article in Proceedings
Keywords: metric data; similarity search; index structures; similarity join

Text collections need search support not only for identical objects; approximate matching is even more important. A suitable metric for this task is the edit distance. However, the quadratic cost of computing the edit distance rules out storage organizations such as sequential search. We have investigated the properties of the D-index for approximate searching and matching in text databases.
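
To illustrate the cost argument in the abstract, the following is a minimal sketch of the edit distance computed by dynamic programming, together with a naive sequential range search built on it. It assumes the standard Levenshtein variant with unit costs for insertion, deletion, and substitution; the paper may use a different weighting. The function names `edit_distance` and `range_search` are hypothetical, and the D-index itself is not shown.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance with unit costs (an assumption of this sketch).

    Runs in O(len(a) * len(b)) time, i.e. quadratic for strings of
    similar length, and keeps only two rows of the table in memory.
    """
    if len(a) < len(b):
        a, b = b, a  # the distance is symmetric; keep the rows short
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca by cb (or match)
            ))
        previous = current
    return previous[-1]


def range_search(collection, query, radius):
    """Naive sequential scan: one full distance computation per string."""
    return [s for s in collection if edit_distance(s, query) <= radius]
```

A sequential scan of this kind evaluates the distance once for every stored string, so each query costs one quadratic computation per object in the collection. A metric index aims to avoid most of those evaluations by pruning with the triangle inequality.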
