Ensemble Multi-Channel Neural Networks for Scientific Language Editing Evaluation

Lung Hao Lee, Yuh Shyang Wang, Chao Yi Chen, Liang Chih Yu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

A huge and growing number of scientific papers are authored by non-native English speakers, driving increased demand for effective computer-based writing tools to help writers composing scientific articles. The Automated Evaluation of Scientific Writing (AESW) shared task promotes the use of natural language processing tools to improve the quality of scientific writing in English by predicting whether a given sentence needs language editing or not. In this study, we propose an Ensemble Multi-Channel Neural Networks (called EMC-NN) model for scientific language editing evaluation, comprised of three main parts: a multi-channel word embedding representation, a combination of Bidirectional Long Short-Term Memory and Convolutional Neural Networks, and a majority voting ensemble. Experimental results on 143,804 testing sentences show that our proposed EMC-NN achieved an F1-score of 0.6367, outperforming the winner of the AESW-2016 competition task and the recent BERT transformers. Based on a series of in-depth analyses comparing the number of channels, ensemble size and network architectures, the proposed EMC-NN model is a relatively simple, but effective approach that offers significant performance improvements for scientific writing evaluation tasks.

Original languageEnglish
Pages (from-to)158540-158547
Number of pages8
JournalIEEE Access
Volume9
DOIs
StatePublished - 2021

Keywords

  • Automated writing evaluation
  • Ensemble learning
  • Multi-channel neural networks
  • Natural language processing
  • Scientific English

Fingerprint

Dive into the research topics of 'Ensemble Multi-Channel Neural Networks for Scientific Language Editing Evaluation'. Together they form a unique fingerprint.

Cite this