Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Genealogical Data Mining from Historical Archives: The Case of the Jewish Community in Pisa

Articolo
Data di Pubblicazione:
2023
Abstract:
The Jewish community archive in Pisa owns a vast collection of documents and manuscripts that date back centuries. These documents contain valuable genealogical information, including birth, marriage, and death records. This paper aims to describe the preliminary results of the Archivio Storico della Comunita Ebraica di Pisa (ASCEPI) project, with a focus on the extraction of data from the Nati, Morti e Ballottati (NMB) Registry document in the archive. The NMB Registry contains about 1900 records of births, deaths, and balloted individuals within the Jewish community in Pisa. The study uses a semiautomatic pipeline of digitization, transcription, and Natural Language Processing (NLP) techniques to extract personal data such as names, surnames, birth and death dates, and parental names from each record. The extracted data are then used to build a knowledge base and a genealogical tree for a representative family, Supino. This study demonstrates the potential of using NLP and rule-based techniques to extract valuable information from historical documents and to construct genealogical trees.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
entity extraction; digital manuscripts; digital humanities; genealogical tree
Elenco autori:
Marchetti, Andrea; Moretti, Manuela; D'Errico, Andrea; LO DUCA, Angelica
Autori di Ateneo:
D'ERRICO ANDREA
LO DUCA ANGELICA
MARCHETTI ANDREA
MORETTI MANUELA
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/452726
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)