Developing learner corpus annotation for Chinese grammatical errors

Lung Hao Lee, Li Ping Chang, Yuen Hsien Tseng

研究成果: Conference contribution同行評審

13 引文 斯高帕斯(Scopus)

摘要

This study describes the construction of the TOCFL (Test Of Chinese as a Foreign Language) learner corpus, including the collection and grammatical error annotation of 2,837 essays written by Chinese language learners originating from a total of 46 different mother-Tongue languages. We propose hierarchical tagging sets to manually annotate grammatical errors, resulting in 33,835 inappropriate usages. Our built corpus has been provided for the shared tasks on Chinese grammatical error diagnosis. These demonstrate the usability of our learner corpus annotation.

原文English
主出版物標題Proceedings of the 2016 International Conference on Asian Language Processing, IALP 2016
編輯Minghui Dong, Chung-Hsien Wu, Yanfeng Lu, Haizhou Li, Yuen-Hsien Tseng, Liang-Chih Yu, Lung-Hao Lee
發行者Institute of Electrical and Electronics Engineers Inc.
頁面254-257
頁數4
ISBN(電子)9781509009213
DOIs
出版狀態Published - 10 3月 2017
事件20th International Conference on Asian Language Processing, IALP 2016 - Tainan, 台灣
持續時間: 21 11月 201623 11月 2016

出版系列

名字Proceedings of the 2016 International Conference on Asian Language Processing, IALP 2016

Conference

Conference20th International Conference on Asian Language Processing, IALP 2016
國家/地區台灣
城市Tainan
期間21/11/1623/11/16

指紋

深入研究「Developing learner corpus annotation for Chinese grammatical errors」主題。共同形成了獨特的指紋。

引用此