Title
A Document layout analysis method based on morphological operators and connected components
Date Issued
01 October 2018
Access level
metadata only access
Resource Type
conference paper
Publisher(s)
Institute of Electrical and Electronics Engineers Inc.
Abstract
During the last decades, the interest in preserving digitally historical documents have gained considerable attention. To exploit all the advantages and opportunities offered by the digitized documents, it's necessary to understand their contents. The first step toward that understanding is to determine the locations of the entities of the document, such as figures, titles, and captions, text, etc. This paper presents a new hybrid approach to analyze the structure of documents that is founded on morphological operators and connected components. The proposed method is divided into two stages, preprocessing, in which the quality of the document images is enhanced; and layout analysis, in which, we identify three types of layout. We also include a fragmentation process, in which we divide the page image into sections. Finally, We conducted the experiments on a dataset containing ancient historical newspapers.
Start page
622
End page
631
Language
English
OCDE Knowledge area
Ciencias de la computación
Scopus EID
2-s2.0-85071101179
Resource of which it is part
Proceedings - 2018 44th Latin American Computing Conference, CLEI 2018
ISBN of the container
9781728104379
Conference
Proceedings - 2018 44th Latin American Computing Conference, CLEI 2018
Sources of information: Directorio de Producción Científica Scopus