Data di Pubblicazione:
2004
Abstract:
Linguistic Miner is a project carried out at ILC whose objective is the development of an integrated system to build, organise and
manage a corpus of Italian texts (of various origins and formats), and to design and constantly add new tools for the automatic
extraction of tiered linguistic knowledge to be made available for many teaching, publishing, and other cultural purposes. The project
is based on a notion that is preliminary to all the systems for corpus-based linguistic analysis: a language represented by the largest
possible collection of heterogeneous texts is the best source of linguistic information at any level of analysis considered. The first goals
of such a system are the semi-automated construction of an Italian data mine for the extraction of linguistic information, the validation
of linguistic patterns, the installation of useful tools and resources for a range of different categories of Italian language users. The
main feature of the project is its purpose of building large language reference corpora allowing for the creation and use of effective
tools for the handling and processing, as well as the automatic linguistic synthesis, of such corpora.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
linguistic analysis; information extraction
Elenco autori:
Sassolini, Eva; Ceccotti, MARIA LUIGIA; Cucurullo, Sebastiana; Picchi, Eugenio; Sassi, Manuela
Link alla scheda completa:
Titolo del libro:
Proceedings of the 4th International Conference on Language Resources and Evaluation