Named Entity Recognition for Chinese Healthcare Applications

Cheng Yen Lee*, Ming Hsiang Su, Matus Pleva, Daniel Hládek, Yuan Fu Liao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Named Entity Recognition is a fundamental task in information extraction, which locates and classifies defined named entities in unstructured text. Chinese NER is more difficult than English NER. Since there are no separators between Chinese characters, incorrectly segmented entity boundaries will cause error propagation in NER. In this study, named entity recognition is constructed and applied in the Chinese medical domain, where Chinese medical datasets are labeled in BIO format. The Chinese HealthNER Corpus contains 33,89 sentences, of which 2531 sentences are divided into the validation set and 3204 sentences are divided into the test set. This study uses PyTorch Embedding + BiLSTM + CRF, RoBERTa + BiLSTM + CRF, BERT Classifier, and BERT + BiLSTM + CRF for training and compares their model performance. Finally, the BERT + BiLSTM + CRF achieves the best prediction performance with a precision of 91.30%, recall of 89.46%, and F1 score of 90.53%

Original languageEnglish
Title of host publication2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages749-750
Number of pages2
ISBN (Electronic)9798350324174
DOIs
StatePublished - 2023
Event2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Pingtung, Taiwan
Duration: 17 Jul 202319 Jul 2023

Publication series

Name2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Proceedings

Conference

Conference2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023
Country/TerritoryTaiwan
CityPingtung
Period17/07/2319/07/23

Keywords

  • Chinese language processing
  • NLP
  • medical domain
  • named entity recognition
  • text corpus

Fingerprint

Dive into the research topics of 'Named Entity Recognition for Chinese Healthcare Applications'. Together they form a unique fingerprint.

Cite this