Abstract
This paper describes our Chinese spelling check system submitted to the SIGHAN Bake-off 2014 evaluation. The system's main components are still the conditional random field (CRF)-based word segmentation/part-of-speech (POS) tagger and the tri-gram language model (LM) used last year. This year, however, we tried to refine the misspelling rules and the decision-making threshold to reduce the false alarm rate, and to speed up LM rescoring. The Bake-off 2014 evaluation results show that one of our systems (Run2) achieved reasonable performance, with accuracies of about 0.485/0.468 and F1 scores of 0.226/0.180 on the detection/correction metrics.
Original language | English |
---|---|
Pages | 216-219 |
Number of pages | 4 |
Publication status | Published - 2014 |
Event | 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014 - Wuhan, China. Duration: 20 Oct 2014 → 21 Oct 2014 |
Conference
Conference | 3rd CIPS-SIGHAN Joint Conference on Chinese Language Processing, CLP 2014 |
---|---|
Country/Territory | China |
City | Wuhan |
Period | 20/10/14 → 21/10/14 |