Publication details

 

Crawling, Indexing, and Similarity Searching Images on the Web

Basic information
Original title: Crawling, Indexing, and Similarity Searching Images on the Web
Authors: Michal Batko, Fabrizio Falchi, Claudio Lucchese, David Novák, Raffaele Perego, Fausto Rabitti, Jan Sedmidubský, Pavel Zezula
Further information
Citation: BATKO, Michal - FALCHI, Fabrizio - LUCCHESE, Claudio - NOVÁK, David - PEREGO, Raffaele - RABITTI, Fausto - SEDMIDUBSKÝ, Jan - ZEZULA, Pavel. Crawling, Indexing, and Similarity Searching Images on the Web. In Proceedings of the Sixteenth Italian Symposium on Advanced Database Systems. Mondello : Salvatore Gaglio, Ignazio Infantino, Domenico Sacca, 2008. ISBN 978-88-6122-154-3, pp. 382-389. 22.6.2008, Mondello.
Original language: English
Field: Informatics
Type: Article in Proceedings
Keywords: similarity search; content-based image retrieval; metric space; MPEG-7 descriptors; peer-to-peer search network

In this paper, we report on our experience in building an experimental similarity search system over a test collection of more than 50 million images, with the aim of showing that Content-Based Image Retrieval (CBIR) systems can scale towards Web size. First, we had to tackle the non-trivial process of image crawling and descriptive feature extraction, which we performed on the European EGEE computing grid. The resulting test collection, the first of such scale, will be opened to the research community for experiments and comparisons. Then, we had to develop indexing and searching mechanisms that can scale up to these volumes and answer similarity queries in real time. The results of our experiments are very encouraging for future applications.
