Data di Pubblicazione:
2009
Abstract:
Similarity search in metric spaces is a general paradigm that can be used in several application fields. One of them is content-based image retrieval systems. In order to become an effective complement to traditional Web-scale text-based image retrieval solutions, content-based image retrieval must be efficient and scalable. In this paper we investigate caching the answers to content-based image retrieval queries in metric space, with the aim of reducing the average cost of query processing, and boosting the overall system throughput. Our proposal allows the cache to return approximate answers with acceptable quality guarantee even if the query processed has never been encountered in the past. By conducting tests on a collection of one million high-quality digital photos, we show that the proposed caching techniques can have a significant impact on performance. Moreover, we show that our caching algorithm does not suffer of cache pollution problems due to near-duplicate query objects.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Similarity search; Caching
Elenco autori:
Orlando, Salvatore; Lucchese, Claudio; Falchi, Fabrizio; Rabitti, Fausto; Perego, Raffaele
Link alla scheda completa:
Titolo del libro:
17th Italian Symposium on Advanced Database Systems