Rule-based page segmentation for palm leaf manuscript on color image

Papangkorn Inkeaw, Jakramate Bootkrajang, Phasit Charoenkwan, Sanparith Marukatat, Shinn-Ying Ho, Jeerayut Chaijaruwanich*

*Corresponding author for this work

Research output: Conference contribution, peer-reviewed

1 citation (Scopus)

Abstract

Palm leaf manuscripts are an important source of history and ancient wisdom. A large number of manuscripts have already been digitized in the form of folio images. To extract useful information, optical character recognition (OCR) is often considered the first step towards text mining. Unfortunately, a folio image contains multiple unsegmented palm leaf images, which makes it difficult to handle in the OCR process. This motivates us to propose a new page segmentation method for palm leaf manuscripts. The method consists of two main steps. The first is the detection of objects in folio images using the Connected Component Labeling method in a transformed L*a*b* color space. The second is a rule-based selection that classifies each object as either palm leaf or not palm leaf. Experiments performed on 20 publicly available palm leaf manuscripts, comprising 384 folio images, demonstrated that the proposed method effectively segmented folio images into separate palm leaf images, with 99.86% precision and 96.67% recall.
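The two steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the folio image has already been reduced to a binary foreground mask (the L*a*b* transformation is skipped), it uses plain 4-connected labeling, and the function names, the selection rules (minimum area, minimum width-to-height ratio) and their thresholds are all hypothetical, since the abstract does not state the actual rules.

```python
from collections import deque


def label_components(mask):
    """Find 4-connected foreground regions; return one bounding box per region."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right, area = r, c, r, c, 0
            while queue:
                y, x = queue.popleft()
                area += 1
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append({"top": top, "left": left,
                          "bottom": bottom, "right": right, "area": area})
    return boxes


def select_palm_leaves(boxes, min_area=8, min_aspect=3.0):
    """Hypothetical rules: keep objects that are large and horizontally elongated."""
    kept = []
    for box in boxes:
        width = box["right"] - box["left"] + 1
        height = box["bottom"] - box["top"] + 1
        if box["area"] >= min_area and width / height >= min_aspect:
            kept.append(box)
    return kept


# Toy mask: two long strips (stand-ins for palm leaves) and one stray pixel.
mask = [
    [1, 1, 1, 1, 1, 1, 1, 1, 0, 0],
    [1, 1, 1, 1, 1, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 1],
    [1, 1, 1, 1, 1, 1, 1, 1, 0, 0],
    [1, 1, 1, 1, 1, 1, 1, 1, 0, 0],
]
boxes = label_components(mask)
leaves = select_palm_leaves(boxes)
```

On this toy mask the labeling step finds three objects and the rule step keeps the two strips, discarding the stray pixel. Real folio images would need thresholds tuned to the scan resolution.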

Original language: English
Title of host publication: Digital Libraries
Subtitle of host publication: Knowledge, Information, and Data in an Open Access Society - 18th International Conference on Asia-Pacific Digital Libraries, ICADL 2016, Proceedings
Editors: Atsuyuki Morishima, Andreas Rauber, Chern li Liew
Publisher: Springer Verlag
Pages: 127-136
Number of pages: 10
ISBN (Print): 9783319493039
Publication status: Published - 2016
Event: 18th International Conference on Asia-Pacific Digital Libraries, ICADL 2016 - Tsukuba, Japan
Duration: 7 Dec 2016 to 9 Dec 2016

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 10075 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 18th International Conference on Asia-Pacific Digital Libraries, ICADL 2016
Country/Territory: Japan
City: Tsukuba
Period: 7/12/16 to 9/12/16
