Data di Pubblicazione:
2019
Abstract:
Research in information science and scholarly communication strongly relies on the availability of openly accessible datasets of scholarly entities metadata and, where possible, their relative payloads. Since such metadata information is scattered across diverse, freely accessible, online resources (e.g. Crossref, ORCID), researchers in this domain are doomed to struggle with (meta)data integration problems, in order to produce custom datasets of often undocumented and rather obscure provenance. This practice leads to waste of time, duplication of efforts, and typically infringes open science best practices of transparency and reproducibility of science. In this article, we describe how to generate DOIBoost, a metadata collection that enriches Crossref with inputs from Microsoft Academic Graph, ORCID, and Unpaywall for the purpose of supporting high-quality and robust research experiments, saving times to researchers and enabling their comparison. To this end, we describe the dataset value and its schema, analyse its actual content, and share the software Toolkit and experimental workflow required to reproduce it. The DOIBoost dataset and Software Toolkit are made openly available via Zenodo.org. DOIBoost will become an input source to the OpenAIRE information graph.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Crossref; Data integration; Data science; Microsoft academic graph; Open science; ORCID; Scholarly communication; Unpaywall
Elenco autori:
Manghi, Paolo; LA BRUZZO, SANDRO FABRIZIO; Mannocci, Andrea
Link alla scheda completa:
Link al Full Text:
Titolo del libro:
Digital Libraries: Supporting Open Science