Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Supporting tabular data characterization in a large scale data infrastructure by lexical matching techniques

Capitolo di libro
Data di Pubblicazione:
2013
Abstract:
Digital Libraries continue to evolve towards research environments supporting access and management of multiform Information Objects spread across multiple data sources and organizational domains. This evolution has introduced the need to deal with Information Objects having traits different from those characterizing Digital Libraries at their early stages and to revise the services supporting their management. Tabular data represent a class of Information Objects that require to be efficiently managed because of their core role in many eScience scenarios. This paper discusses the tabular data characterization problem, i.e., the problem of identifying the reference dataset of any column of the dataset. In particular, the paper presents an approach based on lexical matching techniques to support users during the data curation phase by providing them with a ranked list of reference datasets suitable for a dataset column.
Tipologia CRIS:
02.01 Contributo in volume (Capitolo o Saggio)
Keywords:
Data curation; Large-scale data infrastructures; Lexical similarity
Elenco autori:
Pagano, Pasquale; Candela, Leonardo; Coro, Gianpaolo
Autori di Ateneo:
CANDELA LEONARDO
CORO GIANPAOLO
PAGANO PASQUALE
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/246161
Titolo del libro:
Digital Libraries and Archives. 8th Italian Research Conference. IRCDL 2012. Revised Selected Papers
Pubblicato in:
COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE (PRINT)
Series
  • Dati Generali

Dati Generali

URL

http://link.springer.com/chapter/10.1007%2F978-3-642-35834-0_5#
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)