Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Assigning identifiers to documents to enhance the clustering property of fulltext indexes

Contributo in Atti di convegno
Data di Pubblicazione:
2004
Abstract:
Web Search Engines provide a large-scale text document retrieval service by processing huge Inverted File indexes. Inverted File indexes allow fast query resolution and good memory utilization since their d-gaps representation can be effectively and efficiently compressed by using variable length encoding methods. This paper proposes and evaluates some algorithms aimed to find an assignment of the document identifiers which minimizes the average values of d-gaps, thus enhancing the effectiveness of traditional compression methods. We ran several tests over the Google contest collection in order to validate the techniques proposed. The experiments demonstrated the scalability and effectiveness of our algorithms. Using the proposed algorithms, we were able to sensibly improve (up to 20.81%) the compression ratios of several encoding schemes
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Elenco autori:
Orlando, Salvatore; Silvestri, Fabrizio; Perego, Raffaele
Autori di Ateneo:
PEREGO RAFFAELE
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/57513
Titolo del libro:
SIGIR '04 Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval ACM New York, NY, USA ©2004
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)