Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

GDup: De-duplication of Scholarly Communication Big Graphs

Contributo in Atti di convegno
Data di Pubblicazione:
2018
Abstract:
Today, several online services offer functionalities to access information from big scholarly communication graphs, which interlink entities such as publications, authors, datasets, organizations, etc. Such graphs are often populated over time as aggregations of multiple sources and therefore suffer from entity duplication problems. Although deduplication of graphs is a known and actual problem, solutions tend to be dedicated and address a few of the underlying challenges. In this paper, we propose the GDup system, an integrated, scalable, general-purpose system for entity deduplication over big information graphs. GDup supports practitioners with the functionalities needed to realize a fully-fledged entity deduplication workflow over a generic input graph, inclusive of Ground Truth support, end-user feedback, and strategies for identifying and merging duplicates to obtain an output disambiguated graph. GDup is today one of the core components of the OpenAIRE infrastructure production system, monitoring Open Science trends on behalf of the European Commission.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
deduplication; information graphs; big data; scholarly communication
Elenco autori:
Manghi, Paolo; Bardi, Alessia; Atzori, Claudio
Autori di Ateneo:
ATZORI CLAUDIO
BARDI ALESSIA
MANGHI PAOLO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/359280
Link al Full Text:
https://iris.cnr.it//retrieve/handle/20.500.14243/359280/20597/prod_401241-doc_139819.pdf
https://iris.cnr.it//retrieve/handle/20.500.14243/359280/20598/prod_401241-doc_141095.pdf
  • Dati Generali

Dati Generali

URL

https://ieeexplore.ieee.org/document/8606645
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)