Recognition of handwritten Lanna Dhamma characters using a set of optimally designed moment features

Papangkorn Inkeaw, Phasit Charoenkwan, Hui Ling Huang, Sanparith Marukatat, Shinn-Ying Ho, Jeerayut Chaijaruwanich*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

Lanna Dhamma alphabet was used mainly for religious communication in the ancient Lanna Kingdom of Thailand. The old manuscripts using this alphabet are gradually decayed. It is desirable to preserve these valuable manuscripts in machine-encoded text files. Existing works used optical character recognition (OCR) methods based on wavelet transform for recognition of handwritten Lanna Dhamma characters. However, the test accuracy of writer-independent recognition is not satisfactory. This work proposes an OCR method, called LDIMS, for recognition of handwritten Lanna Dhamma characters using a set of optimally designed moment features. The LDIMS using an optimization approach to feature selection consists of three main phases: (1) determination of moment orders for each of eight effective moment descriptors, (2) the best combination of selected moment descriptors and (3) the optimized selection of moment features using an inheritable bi-objective genetic algorithm. The LDIMS has three individual feature sets for the recognition of handwritten Lanna Dhamma characters in upper, middle and lower levels. The character images gleaned from previous work were used as a training dataset. A new character image dataset from different writers was established for evaluating ability of writer-independent recognition. The experimental results show that the LDIMS using four moment descriptors, Meixner, Charlier, Tchebichef and Hahn, has test accuracies of 86.60, 74.38 and 85.82% for the characters in upper, middle and lower levels, respectively. The LDIMS with a mean accuracy of 82.27% performed well in recognizing the handwritten Lanna Dhamma characters from new writers, compared to existing methods using generic descriptors in terms of both accuracy and feature number used. Experimental results show that the generalized OCR method, LDIMS, is also effective for character recognition of digit and English alphabets, compared to existing methods.

Original languageEnglish
Pages (from-to)259-274
Number of pages16
JournalInternational Journal on Document Analysis and Recognition
Volume20
Issue number4
DOIs
StatePublished - 1 Dec 2017

Keywords

  • Image moments
  • Lanna Dhamma alphabet
  • Optical character recognition
  • Writer-independent recognition

Fingerprint

Dive into the research topics of 'Recognition of handwritten Lanna Dhamma characters using a set of optimally designed moment features'. Together they form a unique fingerprint.

Cite this