Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Optimal space-time tradeoffs for inverted indexes

Contributo in Atti di convegno
Data di Pubblicazione:
2015
Abstract:
Inverted indexes are usually represented by dividing posting lists into constant-sized blocks and representing them with an encoder for sequences of integers. Different encoders yield a different point in the space-time trade-off curve, with the fastest being several times larger than the most space-efficient. An important design decision for an index is thus the choice of the fastest encoding method such that the index fits in the available memory. However, a better usage of the space budget could be obtained by using faster encoders for frequently accessed blocks, and more space-efficient ones those that are rarely accessed. To perform this choice optimally, we introduce a linear time algorithm that, given a query distribution and a set of encoders, selects the best encoder for each index block to obtain the lowest expected query processing time respecting a given space constraint. To demonstrate the effectiveness of this approach we perform an extensive experimental analysis, which shows that our algorithm produces indexes which are significantly faster than single-encoder indexes under several query processing strategies, while respecting the same space constraints.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Compression; Inverted indexes; Knapsack problems
Elenco autori:
Ottaviano, Giuseppe; Tonellotto, Nicola; Venturini, Rossano
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/302877
  • Dati Generali

Dati Generali

URL

http://dl.acm.org/citation.cfm?id=2685297&CFID=748200543&CFTOKEN=15193674
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)