Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Improved risk minimization algorithms for technology-assisted review

Articolo
Data di Pubblicazione:
2023
Abstract:
MINECORE is a recently proposed decision-theoretic algorithm for technology-assisted review that attempts to minimise the expected costs of review for responsiveness and privilege in e-discovery. In MINECORE, two probabilistic classifiers that classify documents by responsiveness and by privilege, respectively, generate posterior probabilities. These latter are fed to an algorithm that returns as output, after applying risk minimization, two ranked lists, which indicate exactly which documents the annotators should review for responsiveness and which documents they should review for privilege. In this paper we attempt to find out if the performance of MINECORE can be improved (a) by using, for the purpose of training the two classifiers, active learning (implemented either via relevance sampling, or via uncertainty sampling, or via a combination of them) instead of passive learning, and (b) by using the Saerens-Latinne-Decaestecker algorithm to improve the quality of the posterior probabilities that MINECORE receives as input. We address these two research questions by carrying out extensive experiments on the RCV1-v2 benchmark. We make publicly available the code and data for reproducing all our experiments.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Machine learning; Technology assisted review; Prior probability shift
Elenco autori:
Molinari, Alessio; Esuli, Andrea; Sebastiani, Fabrizio
Autori di Ateneo:
ESULI ANDREA
SEBASTIANI FABRIZIO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/433939
Link al Full Text:
https://iris.cnr.it//retrieve/handle/20.500.14243/433939/148157/prod_481846-doc_198199.pdf
Pubblicato in:
INTELLIGENT SYSTEMS WITH APPLICATIONS
Journal
  • Dati Generali

Dati Generali

URL

https://www.sciencedirect.com/science/article/pii/S2667305323000340?via%3Dihub
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)