Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

Semantically Aware Text Categorisation for Metadata Annotation

Chapter
Publication Date:
2019
abstract:
In this paper we illustrate a system aimed at solving a long-standing and challenging problem: acquiring a classifier to automatically annotate bibliographic records by starting from a huge set of unbalanced and unlabelled data. We illustrate the main features of the dataset, the learning algorithm adopted, and how it was used to discriminate philosophical documents from documents of other disciplines. One strength of our approach lies in the novel combination of a standard learning approach with a semantic one: the results of the acquired classifier are improved by accessing a semantic network containing conceptual information. We illustrate the experimentation by describing the construction rationale of training and test set, we report and discuss the obtained results and conclude by drawing future work.
Iris type:
02.01 Contributo in volume (Capitolo o Saggio)
Keywords:
Text categorization; Lexical resources; NLP; Language models; semantics
List of contributors:
Pasini, Enrico
Authors of the University:
PASINI ENRICO
Handle:
https://iris.cnr.it/handle/20.500.14243/382163
Book title:
Digital Libraries: Supporting Open Science. IRCDL 2019
  • Overview

Overview

URL

http://link.springer.com/10.1007/978-3-030-11226-4_25
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)