Transfer of frames from english framenet to construct Chinese framenet: A bilingual corpus-based approach

Tsung Han Yang, Hen Hsen Huang, An Zi Yen, Hsin Hsi Chen

研究成果: Conference contribution同行評審

7 引文 斯高帕斯(Scopus)

摘要

Current publicly available Chinese FrameNet has a relatively low coverage of frames and lexical units compared with FrameNet in other languages. Frames are incompletely specified for some lexical units, and some critical lexical elements are even missing. That results in suitable frames cannot be triggered and filled in practical applications. This paper presents an automatic approach to constructing Chinese FrameNet. We first capture the mapping between English lexical entries and their Chinese counterparts in a large scale sentence-aligned English-Chinese bilingual corpus. Then, a semantic transfer approach is proposed based on word alignments applied to a large balanced bilingual corpus. The resource currently covers 779 frames and 36k lexical units. We apply it to annotate diary and tweet, and achieve overall 86% success rate to provide frame recommendations that are acceptable by annotators. The success rates in terms of source types are 95% and 80% for diaries and tweets respectively.

原文English
主出版物標題LREC 2018 - 11th International Conference on Language Resources and Evaluation
編輯Hitoshi Isahara, Bente Maegaard, Stelios Piperidis, Christopher Cieri, Thierry Declerck, Koiti Hasida, Helene Mazo, Khalid Choukri, Sara Goggi, Joseph Mariani, Asuncion Moreno, Nicoletta Calzolari, Jan Odijk, Takenobu Tokunaga
發行者European Language Resources Association (ELRA)
頁面868-872
頁數5
ISBN(電子)9791095546009
出版狀態Published - 2019
事件11th International Conference on Language Resources and Evaluation, LREC 2018 - Miyazaki, 日本
持續時間: 7 5月 201812 5月 2018

出版系列

名字LREC 2018 - 11th International Conference on Language Resources and Evaluation

Conference

Conference11th International Conference on Language Resources and Evaluation, LREC 2018
國家/地區日本
城市Miyazaki
期間7/05/1812/05/18

指紋

深入研究「Transfer of frames from english framenet to construct Chinese framenet: A bilingual corpus-based approach」主題。共同形成了獨特的指紋。

引用此