NCTU and NTUT's Entry to CLP-2014 Chinese Spelling Check Evaluation

Yih-Ru Wang, Yuan Fu Liao

Research output: Contribution to conference › Paper › peer-review

7 Scopus citations

Abstract

This paper describes our Chinese spelling check system submitted to the SIGHAN Bake-off 2014 evaluation. The system's main components are still the conditional random field (CRF)-based word segmentation/part-of-speech (POS) tagger and the tri-gram language model (LM) used last year. This year we refined the misspelling rules and the decision-making threshold to reduce the false alarm rate, and we improved the speed of LM rescoring. The Bake-off 2014 evaluation results show that one of our systems (Run2) achieved reasonable performance, with accuracies of about 0.485/0.468 and F1 scores of about 0.226/0.180 in the detection/correction metrics.

Original language: English
Pages: 216-219
Number of pages: 4
State: Published - 2014
Event: 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014 - Wuhan, China
Duration: 20 Oct 2014 to 21 Oct 2014

Conference

Conference: 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014
Country/Territory: China
City: Wuhan
Period: 20/10/14 to 21/10/14
