Exploit Multilingual Language Model at Scale for ICD-10 Clinical Text Classification

Conference Paper
Publication Date:
2020
Abstract:
The automatic ICD-10 classification of medical documents is still an unresolved issue, despite its crucial importance. The existence of machine learning approaches devoted to this task contrasts with the lack of annotated resources, especially for languages other than English. Recent Transformer-based multilingual neural language models at scale have provided an innovative approach for dealing with cross-lingual Natural Language Processing tasks. In this paper, we present a preliminary evaluation of the Cross-lingual Language Model (XLM) architecture, a recent multilingual Transformer-based model presented in the literature, tested on the cross-lingual ICD-10 multilabel classification of short medical notes. In detail, we analysed the performance obtained by fine-tuning the XLM model on English-language training data and testing it on ICD-10 code prediction for an Italian test set. The obtained results show that the novel XLM multilingual neural language architecture is very promising and can be very useful for low-resource languages.
Iris type:
04.01 Contribution in conference proceedings
Keywords:
Transformers; Multilingual Neural Language Model; XLM; Multilabel Text Classification; Cross-lingual Classification; ICD-10 Coding; Deep Learning
List of contributors:
Silvestri, Stefano; De Pietro, Giuseppe; Ciampi, Mario; Gargiulo, Francesco
Authors of the University:
CIAMPI MARIO
GARGIULO FRANCESCO
SILVESTRI STEFANO
Handle:
https://iris.cnr.it/handle/20.500.14243/381292
Published in:
PROCEEDINGS - IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS
Series
URL

https://ieeexplore.ieee.org/document/9219640