Authors
Xin Su, Chuang Cheng, Kuo Yang, Xuezhong Zhou
Publication date
2022/10/21
Book
China Health Information Processing Conference
Pages
29-40
Publisher
Springer Nature Singapore
Description
There are a lot of biomedical knowledge hidden in the massive scientific clinical literature. These knowledge exist in an unstructured form and is difficult to extract automatically. Natural language processing makes it possible to mine these knowledge automatically. At present, most information extraction models need enough data to achieve good performance. Due to the scarcity of high-quality biomedical labeled data, it is still difficult to extract biomedical literature accurately in the case of few samples. This paper describes our participation in the task 1 of the “China Health Information Processing Conference” (CHIP 2022). We proposes a knowledge-based data augmentation framework to achieve data expansion to overcome the scarcity of training data. The experimental results show that after data augmentation, the of named entity recognition using BioBERT-BiLSTM-CRF reaches 0.58 and the of …
Total citations
Scholar articles
X Su, C Cheng, K Yang, X Zhou - China Health Information Processing Conference, 2022