Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Optimizing text quantifiers for multivariate loss functions.

Articolo
Data di Pubblicazione:
2015
Abstract:
We address the problem of quantification, a supervised learning task whose goal is, given a class, to estimate the relative frequency (or prevalence) of the class in a dataset of unlabeled items. Quantification has several applications in data and text mining, such as estimating the prevalence of positive reviews in a set of reviews of a given product or estimating the prevalence of a given support issue in a dataset of transcripts of phone calls to tech support. So far, quantification has been addressed by learning a general-purpose classifier, counting the unlabeled items that have been assigned the class, and tuning the obtained counts according to some heuristics. In this article, we depart from the tradition of using general-purpose classifiers and use instead a supervised learning model for structured prediction, capable of generating classifiers directly optimized for the (multivariate and nonlinear) function used for evaluating quantification accuracy. The experiments that we have run on 5,500 binary high-dimensional datasets (averaging more than 14,000 documents each) show that this method is more accurate, more stable, and more efficient than existing state-of-the-art quantification methods.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Quantification
Elenco autori:
Esuli, Andrea
Autori di Ateneo:
ESULI ANDREA
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/291689
Link al Full Text:
https://iris.cnr.it//retrieve/handle/20.500.14243/291689/174521/prod_331333-doc_156682.pdf
Pubblicato in:
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA
Journal
  • Dati Generali

Dati Generali

URL

http://dl.acm.org/citation.cfm?doid=2786971.2700406
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)