Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Expansion via prediction of importance with contextualization

Contributo in Atti di convegno
Data di Pubblicazione:
2020
Abstract:
The identification of relevance with little textual context is a primary challenge in passage retrieval. We address this problem with a representation-based ranking approach that: (1) explicitly models the importance of each term using a contextualized language model; (2) performs passage expansion by propagating the importance to similar terms; and (3) grounds the representations in the lexicon, making them interpretable. Passage representations can be pre-computed at index time to reduce query-time latency. We call our approach EPIC (Expansion via Prediction of Importance with Contextualization). We show that EPIC significantly outperforms prior importance-modeling and document expansion approaches. We also observe that the performance is additive with the current leading first-stage retrieval methods, further narrowing the gap between inexpensive and cost-prohibitive passage ranking approaches. Specifically, EPIC achieves a MRR@10 of 0.304 on the MS-MARCO passage ranking dataset with 78ms average query latency on commodity hardware. We also find that the latency is further reduced to 68ms by pruning document representations, with virtually no difference in effectiveness.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Document representation; Query representation; Neural ranking; Efficient ranking
Elenco autori:
Nardini, FRANCO MARIA; Perego, Raffaele
Autori di Ateneo:
NARDINI FRANCO MARIA
PEREGO RAFFAELE
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/420623
Link al Full Text:
https://iris.cnr.it//retrieve/handle/20.500.14243/420623/137710/prod_440218-doc_157963.pdf
  • Dati Generali

Dati Generali

URL

https://doi.org/10.1145/3397271.3401262
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)