Authors
Shweta Yadav, Asif Ekbal, Sriparna Saha
Publication date
2018/10
Journal
Soft Computing
Volume
22
Issue
20
Pages
6881-6904
Publisher
Springer Berlin Heidelberg
Description
Entity extraction is an important step in biomedical text mining. Among many other challenges, there are two very crucial issues, viz. determining the most applicable feature set so that the model can be precise and less complex, and adapting the system across multiple benchmark corpora. In this paper, we propose a novel method for feature selection using the search capability of particle swarm optimization. The compact feature set used for training the classifier yields much better results when compared to the baseline model, which was developed with a complete set of features. A large number of features suitable for named entity recognition task from biomedical domain are also developed in the current paper. The complete set of features is implemented by studying the properties of datasets and from the domain knowledge. We have used conditional random field, a robust classifier as the underlying …
Total citations
20182019202020212022202320242977562