A methodology for evaluating algorithms for table understanding in PDF documents
Conference proceedings contribution
Publication date:
2012
Abstract:
This paper presents a methodology for the evaluation of table understanding algorithms for PDF documents. The evaluation takes into account three major tasks: table detection, table structure recognition and functional analysis. We provide a general and flexible output model for each task along with corresponding evaluation metrics and methods. We also present a methodology for collecting and ground-truthing PDF documents based on consensus-reaching principles and provide a publicly available ground-truthed dataset. Copyright © 2012 by the Association for Computing Machinery, Inc. (ACM).
CRIS type:
04.01 Conference proceedings contribution
Keywords:
Document analysis; Document understanding; Ground-truth dataset; Metrics; Performance evaluation; Table processing
Authors:
Ruffolo, Massimo; Oro, Ermelinda
Link to full record: