Publication date:
2019
Abstract:
Dictionary-based compression schemes provide fast decoding, typically at the expense of reduced compression effectiveness compared to statistical or probability-based approaches. In this work, we apply dictionary-based techniques to the compression of inverted lists, showing that the high degree of regularity that these integer sequences exhibit is a good match for certain types of dictionary methods, and that an important new trade-off between compression effectiveness and compression efficiency can be achieved. Our observations are supported by experiments using the document-level inverted index data for two large text collections, and a wide range of other index compression implementations as reference points. Those experiments demonstrate that the gap between efficiency and effectiveness can be substantially narrowed.
CRIS type:
04.01 Contribution in conference proceedings
Keywords:
Compression; Decoding; Efficiency; Inverted index
Authors:
Pibiri, GIULIO ERMANNO
Link to the full record: