Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

Crosslingual named entity recognition for clinical de-identification applied to a COVID-19 Italian data set

Academic Article
Publication Date:
2020
abstract:
The COrona VIrus Disease 19 (COVID-19) pandemic required the work of all global experts to tackle it. Despite the abundance of new studies, privacy laws prevent their dissemination for medical investigations: through clinical de-identification, the Protected Health Information (PHI) contained therein can be anonymized so that medical records can be shared and published. The automation of clinical de-identification through deep learning techniques has proven to be less effective for languages other than English due to the scarcity of data sets. Hence a new Italian de-identification data set has been created from the COVID-19 clinical records made available by the Italian Society of Radiology (SIRM). Therefore, two multi-lingual deep learning systems have been developed for this low-resource language scenario: the objective is to investigate their ability to transfer knowledge between different languages while maintaining the necessary features to correctly perform the Named Entity Recognition task for de-identification. The systems were trained using four different strategies, using both the English Informatics for Integrating Biology & the Bedside (i2b2) 2014 and the new Italian SIRM COVID-19 data sets, then evaluated on the latter. These approaches have demonstrated the effectiveness of cross-lingual transfer learning to de-identify medical records written in a low resource language such as Italian, using one with high resources such as English.
Iris type:
01.01 Articolo in rivista
Keywords:
Annotated Italian data set; Clinical de-identification; COVID-19; Deep learning; Named entity recognition
List of contributors:
Catelli, Rosario; DE PIETRO, Giuseppe; Esposito, Massimo; Gargiulo, Francesco
Authors of the University:
ESPOSITO MASSIMO
GARGIULO FRANCESCO
Handle:
https://iris.cnr.it/handle/20.500.14243/381060
Published in:
APPLIED SOFT COMPUTING (PRINT)
Journal
  • Overview

Overview

URL

http://www.scopus.com/record/display.url?eid=2-s2.0-85092315597&origin=inward
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)