Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

PURE: A Dataset of Public Requirements Documents

Contributo in Atti di convegno
Data di Pubblicazione:
2017
Abstract:
This paper presents PURE (PUblic REquirements dataset), a dataset of 79 publicly available natural language requirements documents collected from the Web. The dataset includes 34,268 sentences and can be used for natural language processing tasks that are typical in requirements engineering, such as model synthesis, abstraction identification and document structure assessment. It can be further annotated to work as a benchmark for other tasks, such as ambiguity detection, requirements categorisation and identification of equivalent re-quirements. In the paper, we present the dataset and we compare its language with generic English texts, showing the peculiarities of the requirements jargon, made of a restricted vocabulary of domain-specific acronyms and words, and long sentences. We also present the common XML format to which we have manually ported a subset of the documents, with the goal of facilitating replication of NLP experiments.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Empirical Software Engine; Empirical Studies; Model Synthesis; Natural Language Requirements; NLP; NLP Tasks; Public Requirements; PURE; Requirements Abstraction; Requirements Ambiguity Detection; Requirements Categorisation; Requirements Dataset; XML
Elenco autori:
Gnesi, Stefania; Ferrari, Alessio; Spagnolo, GIORGIO ORONZO
Autori di Ateneo:
FERRARI ALESSIO
SPAGNOLO GIORGIO ORONZO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/335225
  • Dati Generali

Dati Generali

URL

https://ieeexplore.ieee.org/document/8049173/?reload=true
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)