Data di Pubblicazione:
2018
Abstract:
The OpenAIRE infrastructure populates a scholarly communication big graph interlinking metadata objects of publications, datasets, software, organizations, funders, and projects. In order to de-duplicate this graph, OpenAIRE has developed GDup, an integrated, scalable, general-purpose system for entity deduplication over big information graphs. GDup offers functionalities to realize a hilly-fledged entity deduplication workflow over a generic input graph, inclusive of Ground Truth support, end-user feedback, and strategies for identifying and merging duplicates to obtain an output disambiguated graph.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Deduplication; Graph; Big data; Scholarly communication; OpenAIRE
Elenco autori:
Manghi, Paolo; Bardi, Alessia; Atzori, Claudio
Link alla scheda completa: