Named Entity Recognition in Medical Domain: A systematic Literature Review
DOI: http://dx.doi.org/10.62527/joiv.8.4.3111
Abstract
Biomedical Named Entity Recognition (BioNER) is essential to bioinformatics because it identifies and classifies biological entities in biomedical texts. With the increasing number of biomedical literature and the rapid progress of the BioNER approach, it is essential to conduct a systematic literature review (SLR) on BioNER. This SLR consolidates existing information and provides directions for future studies in the BioNER field. This review systematically explores scientific journals and conferences published from 2019 to 2024. This research uses PubMed and Scholar as reference search databases because of their affiliation with other well-known publishers such as IEEE, Elsevier, and Springer. The results show a transition from conventional machine learning to deep learning. Neural networks and transformers show better performance in deep learning methods. The datasets often used in BioNER development are BC2GM, BC5CDR, and NCBI-Disease. Precision, Recall, and F1-Score are used in most papers to evaluate model performance. The performance of these models mostly depends on the availability of big annotated datasets and significant computational tools. Therefore, it is vital for future research to address the issues of annotated data and resource availability to build accurate models. Researchers should investigate the creation of ideal designs that lower computing complexity without compromising performance. Overall, this SLR offers a thorough overview of the latest research on BioNER. It provides significant insights for academics and practitioners in bioinformatics and medical research, helping them understand the innovative aspects of BioNER research.
Keywords
Full Text:
PDFReferences
P. N. Ahmad, A. M. Shah, and K. Y. Lee, “A Review on Electronic Health Record Text-Mining for Biomedical Name Entity Recognition in Healthcare Domain,” May 01, 2023, MDPI. doi: 10.3390/healthcare11091268.
B. Song, F. Li, Y. Liu, and X. Zeng, “Deep learning methods for biomedical named entity recognition: a survey and qualitative comparison,” Brief Bioinform, vol. 22, no. 6, Nov. 2021, doi: 10.1093/bib/bbab282.
M. C. Durango, E. A. Torres-Silva, and A. Orozco-Duque, “Named Entity Recognition in Electronic Health Records: A Methodological Review,” Oct. 01, 2023, Korean Society of Medical Informatics. doi: 10.4258/hir.2023.29.4.286.
H. I. Purnama, I. Wilujeng, and A. Jabar, “Web-based E-learning in Elementary School:A Systematic Literature Review,” 2023. [Online]. Available: www.joiv.org/index.php/joiv
R. E. Ramos-Vargas, I. Román-Godínez, and S. Torres-Ramos, “Comparing general and specialized word embeddings for biomedical named entity recognition,” PeerJ Comput Sci, vol. 7, pp. 1–22, 2021, doi: 10.7717/peerj-cs.384.
V. K. Keloth et al., “Advancing entity recognition in biomedicine via instruction tuning of large language models,” Bioinformatics, vol. 40, no. 4, Apr. 2024, doi: 10.1093/bioinformatics/btae163.
Y. Tong et al., “Improving biomedical named entity recognition by dynamic caching inter-sentence information,” Bioinformatics, vol. 38, no. 16, pp. 3976–3983, Aug. 2022, doi: 10.1093/bioinformatics/btac422.
Z. Chai, H. Jin, S. Shi, S. Zhan, L. Zhuo, and Y. Yang, “Hierarchical shared transfer learning for biomedical named entity recognition,” BMC Bioinformatics, vol. 23, no. 1, Dec. 2022, doi: 10.1186/s12859-021-04551-4.
X. Zheng, H. Du, X. Luo, F. Tong, W. Song, and D. Zhao, “BioByGANS: biomedical named entity recognition by fusing contextual and syntactic features through graph attention network in node classification framework,” BMC Bioinformatics, vol. 23, no. 1, Dec. 2022, doi: 10.1186/s12859-022-05051-9.
Y. Tian, W. Shen, Y. Song, F. Xia, M. He, and K. Li, “Improving biomedical named entity recognition with syntactic information,” BMC Bioinformatics, vol. 21, no. 1, Dec. 2020, doi: 10.1186/s12859-020-03834-6.
S. K. Hong and J. G. Lee, “DTranNER: Biomedical named entity recognition with deep learning-based label-label transition model,” BMC Bioinformatics, vol. 21, no. 1, Feb. 2020, doi: 10.1186/s12859-020-3393-1.
Y. Lou, X. Zhu, and K. Tan, “Dictionary-based matching graph network for biomedical named entity recognition,” Sci Rep, vol. 13, no. 1, Dec. 2023, doi: 10.1038/s41598-023-48564-w.
Z. Guan and X. Zhou, “A prefix and attention map discrimination fusion guided attention for biomedical named entity recognition,” BMC Bioinformatics, vol. 24, no. 1, Dec. 2023, doi: 10.1186/s12859-023-05172-9.
G. Hou, Y. Jian, Q. Zhao, X. Quan, and H. Zhang, “Language model based on deep learning network for biomedical named entity recognition,” Methods, vol. 226, pp. 71–77, Jun. 2024, doi: 10.1016/j.ymeth.2024.04.013.
M. R. Khan, M. Ziyadi, and M. AbdelHady, “MT-BioNER: Multi-task Learning for Biomedical Named Entity Recognition using Deep Bidirectional Transformers,” Jan. 2020, [Online]. Available: http://arxiv.org/abs/2001.08904
A. Dash, M. Pandey, and S. S. Rautaray, “Novel deep neural network framework for biomedical named entity recognition,” Int J Health Sci (Qassim), pp. 4310–4324, Jun. 2022, doi: 10.53730/ijhs.v6ns5.9557.
V. Moscato, M. Postiglione, C. Sansone, and G. Sperli, “TaughtNet: Learning Multi-Task Biomedical Named Entity Recognition From Single-Task Teachers,” IEEE J Biomed Health Inform, vol. 27, no. 5, pp. 2512–2523, May 2023, doi: 10.1109/JBHI.2023.3244044.
T. Liang, C. Xia, Z. Zhao, Y. Jiang, Y. Yin, and P. S. Yu, “Transferring From Textual Entailment to Biomedical Named Entity Recognition,” IEEE/ACM Trans Comput Biol Bioinform, vol. 20, no. 4, pp. 2577–2586, Jul. 2023, doi: 10.1109/TCBB.2023.3236477.
L. Luo, C. H. Wei, P. T. Lai, R. Leaman, Q. Chen, and Z. Lu, “AIONER: all-in-one scheme-based biomedical named entity recognition using deep learning,” Bioinformatics, vol. 39, no. 5, May 2023, doi: 10.1093/bioinformatics/btad310.
X. Wang, J. Lyu, L. Dong, and K. Xu, “Multitask learning for biomedical named entity recognition with cross-sharing structure,” BMC Bioinformatics, vol. 20, no. 1, Aug. 2019, doi: 10.1186/s12859-019-3000-5.
C. Sun, Z. Yang, L. Wang, Y. Zhang, H. Lin, and J. Wang, “Biomedical named entity recognition using BERT in the machine reading comprehension framework,” J Biomed Inform, vol. 118, Jun. 2021, doi: 10.1016/j.jbi.2021.103799.
P. Chen, J. Wang, H. Lin, Y. Zhang, and Z. Yang, “Knowledge Adaptive Multi-Way Matching Network for Biomedical Named Entity Recognition via Machine Reading Comprehension,” IEEE/ACM Trans Comput Biol Bioinform, vol. 20, no. 3, pp. 2101–2111, May 2023, doi: 10.1109/TCBB.2022.3233856.
X. Wang, Y. Zhang, Q. Li, and X. Ren, Distantly Supervised Biomedical Named Entity Recognition with Dictionary Expansion. IEEE, 2019.
H. Alamro, T. Gojobori, M. Essack, and X. Gao, “BioBBC: a multi-feature model that enhances the detection of biomedical entities,” Sci Rep, vol. 14, no. 1, Dec. 2024, doi: 10.1038/s41598-024-58334-x.
M. Košprdić, N. Prodanović, A. Ljajić, B. Bašaragin, and N. Milošević, “From Zero to Hero: Harnessing Transformers for Biomedical Named Entity Recognition in Zero- and Few-shot Contexts,” May 2024, [Online]. Available: http://arxiv.org/abs/2305.04928
J. M. Giorgi and G. D. Bader, “Towards reliable named entity recognition in the biomedical domain,” Bioinformatics, vol. 36, no. 1, pp. 280–286, Jan. 2020, doi: 10.1093/bioinformatics/btz504.
C. Che, C. Zhou, H. Zhao, B. Jin, and Z. Gao, “Fast and effective biomedical named entity recognition using temporal convolutional network with conditional random field,” Mathematical Biosciences and Engineering, vol. 17, no. 4, pp. 3553–3566, May 2020, doi: 10.3934/MBE.2020200.
P. Chen, J. Wang, H. Lin, D. Zhao, and Z. Yang, “Few-shot biomedical named entity recognition via knowledge-guided instance generation and prompt contrastive learning,” Bioinformatics, vol. 39, no. 8, Aug. 2023, doi: 10.1093/bioinformatics/btad496.
P. Mishra, A. a, and A. Larhgotra, “Named Entity Recognition on Biomedical Text,” in Advancements in Communication and Systems, Soft Computing Research Society, 2024, pp. 197–207. doi: 10.56155/978-81-955020-7-3-18.
R. Phan, “Robust Biomedical Name Entity Recognition Based on Deep Machine Learning,” 2020.
R. Siddalingappa and K. Sekar, “Bi-directional long short term memory using recurrent neural network for biological entity recognition,” IAES International Journal of Artificial Intelligence, vol. 11, no. 1, pp. 89–101, Mar. 2022, doi: 10.11591/ijai.v11.i1.pp89-101.
P. Chen, M. Zhang, X. Yu, and S. Li, “Named entity recognition of Chinese electronic medical records based on a hybrid neural network and medical MC-BERT,” BMC Med Inform Decis Mak, vol. 22, no. 1, Dec. 2022, doi: 10.1186/s12911-022-02059-2.
Y. Li et al., “Towards Chinese clinical named entity recognition by dynamic embedding using domain-specific knowledge,” Jun. 01, 2020, Academic Press Inc. doi: 10.1016/j.jbi.2020.103435.
L. Fu, Z. Weng, J. Zhang, H. Xie, and Y. Cao, “MMBERT: a unified framework for biomedical named entity recognition,” Med Biol Eng Comput, vol. 62, no. 1, pp. 327–341, Jan. 2024, doi: 10.1007/s11517-023-02934-8.
S. Guo, W. Yang, L. Han, X. Song, and G. Wang, “A multi-layer soft lattice based model for Chinese clinical named entity recognition,” BMC Med Inform Decis Mak, vol. 22, no. 1, p. 201, Jul. 2022, doi: 10.1186/s12911-022-01924-4.
Y. Gao, Y. Wang, P. Wang, and L. Gu, “Medical named entity extraction from chinese resident admit notes using character and word attention-enhanced neural network,” Int J Environ Res Public Health, vol. 17, no. 5, Mar. 2020, doi: 10.3390/ijerph17051614.
B. Ji et al., “Research on Chinese medical named entity recognition based on collaborative cooperation of multiple neural network models,” J Biomed Inform, vol. 104, Apr. 2020, doi: 10.1016/j.jbi.2020.103395.
G. Wen, H. Chen, H. Li, Y. Hu, Y. Li, and C. Wang, “Cross domains adversarial learning for Chinese named entity recognition for online medical consultation,” J Biomed Inform, vol. 112, Dec. 2020, doi: 10.1016/j.jbi.2020.103608.
Q. Zou et al., “Phenonizer: A Fine-Grained Phenotypic Named Entity Recognizer for Chinese Clinical Texts,” Biomed Res Int, vol. 2022, 2022, doi: 10.1155/2022/3524090.
W. Guo, J. Lu, and F. Han, “Named Entity Recognition for Chinese Electronic Medical Records Based on Multitask and Transfer Learning,” IEEE Access, vol. 10, pp. 77375–77382, 2022, doi: 10.1109/ACCESS.2022.3192866.
Y. Zhou et al., “Clinical Named Entity Recognition using Contextualized Token Representations,” Jun. 2021, [Online]. Available: http://arxiv.org/abs/2106.12608
Y. Zheng et al., “An imConvNet-based deep learning model for Chinese medical named entity recognition,” BMC Med Inform Decis Mak, vol. 22, no. 1, Dec. 2022, doi: 10.1186/s12911-022-02049-4.
H. Zhou, Z. Liu, C. Lang, Y. Xu, Y. Lin, and J. Hou, “Improving the recall of biomedical named entity recognition with label re-correction and knowledge distillation,” BMC Bioinformatics, vol. 22, no. 1, Dec. 2021, doi: 10.1186/s12859-021-04200-w.
Y. Su, M. Wang, P. Wang, C. Zheng, Y. Liu, and X. Zeng, “Deep learning joint models for extracting entities and relations in biomedical: a survey and comparison,” Nov. 01, 2022, Oxford University Press. doi: 10.1093/bib/bbac342.
N. Perera, M. Dehmer, and F. Emmert-Streib, “Named Entity Recognition and Relation Detection for Biomedical Information Extraction,” Aug. 28, 2020, Frontiers Media S.A. doi: 10.3389/fcell.2020.00673.
L. Ding, Z. Zhang, and H. Liu, “A Bootstrapped Chinese Biomedical Named Entity Recognition Model Incorporating Lexicons ⋆,” 2022. [Online]. Available: http://openkg.cn/
Z. Zhang and A. L. P. Chen, “Biomedical named entity recognition with the combined feature attention and fully-shared multi-task learning,” BMC Bioinformatics, vol. 23, no. 1, Dec. 2022, doi: 10.1186/s12859-022-04994-3.
V. Suárez-Paniagua, R. M. Rivera Zavala, I. Segura-Bedmar, and P. Martínez, “A two-stage deep learning approach for extracting entities and relationships from medical texts,” J Biomed Inform, vol. 99, Nov. 2019, doi: 10.1016/j.jbi.2019.103285.
P. Savci and B. Das, “Structured Named Entity Recognition (NER) in Biomedical Texts Using Pre-Trained Language Models,” in 12th International Symposium on Digital Forensics and Security, ISDFS 2024, Institute of Electrical and Electronics Engineers Inc., 2024. doi: 10.1109/ISDFS60797.2024.10527329.
H. Sousa, A. Mario Jorge, A. Pasquali, C. Santos, and M. Lopes, “A Biomedical Entity Extraction Pipeline for Oncology Health Records in Portuguese,” in Proceedings of the ACM Symposium on Applied Computing, Association for Computing Machinery, Mar. 2023, pp. 950–956. doi: 10.1145/3555776.3578577.
V. Araujo, A. Carvallo, and D. Parra, “Adversarial evaluation of bert for biomedical named entity recognition,” Association for Computational Linguistics (ACL), 2019.
L. C. Seller, “Trabajo Fin de Máster Named Entity Recognition of Human Phenotype Ontology Concepts in Spanish Clinical Texts,” 2024.
C. Zhuoma, J. Cairang, D. Sangjie, Z. Yangmao, and Z. Zhuoma, “Tibetan Medical Named Entity Recognition Study for Tibetan Clinical Electronic Medical Records,” 2022. [Online]. Available: https://ssrn.com/abstract=4040676
D. Li, Y. Zhang, Y. Zhang, and H. Lu, A hybrid model based on deep convolutional neural network for medical named entity recognition. Institute of Electrical and Electronics Engineers (IEEE) Computer Society, 2022.
E. Qumsiyeh and R. Jayousi, “Biomedical Information Extraction Pipeline to Identify Disease-Gene Interactions from PubMed Breast Cancer Literature,” in Proceedings - 2021 International Conference on Promising Electronic Technologies, ICPET 2021, Institute of Electrical and Electronics Engineers Inc., 2021, pp. 1–6. doi: 10.1109/ICPET53277.2021.00007.
S. Sazzed, “BanglaBioMed: A Biomedical Named-Entity Annotated Corpus for Bangla (Bengali),” 2022. [Online]. Available: https://www.prothomalo.com/life/
M. A. Shaaban, A. Akkasi, A. Khan, M. Komeili, and M. Yaqub, “Fine-Tuned Large Language Models for Symptom Recognition from Spanish Clinical Text,” 2023.
F. A. Satti et al., “A semantic sequence similarity based approach for extracting medical entities from clinical conversations,” Inf Process Manag, vol. 60, no. 2, Mar. 2023, doi: 10.1016/j.ipm.2022.103213.
C. Crema, “Advancing Italian Biomedical Information Extraction with Transformers-based Models: Methodological Insights and Multicenter Practical Application,” 2023.
U. T. P. Phan, P. N. V Nguyen, and N. T. H. Nguyen, “A Named Entity Recognition Corpus for Vietnamese Biomedical Texts to Support Tuberculosis Treatment,” 2022. [Online]. Available: https://github.com/ptpuyen1511/VietBioNER.
G. Alisa, D. Dmitry, L. Anton, L. Alexey, M. Alexey, and K. Irina, “Method for Biomedical Information Extraction of Immunosuppressive Cell Properties,” in Proceedings - 2021 IEEE Ural-Siberian Conference on Computational Technologies in Cognitive Science, Genomics and Biomedicine, CSGB 2021, Institute of Electrical and Electronics Engineers Inc., May 2021, pp. 210–213. doi: 10.1109/CSGB53040.2021.9496030.
Q. Pan, C. Huang, and D. Chen, “A method based on multi-standard active learning to recognize entities in electronic medical record,” Mathematical Biosciences and Engineering, vol. 18, no. 2, pp. 1000–1021, Jan. 2021, doi: 10.3934/MBE.2021054.
C. Pearson, N. Seliya, and R. Dave, “Named Entity Recognition in Unstructured Medical Text Documents,” 2021.
A. Kumar and A. Sharaff, “ABEE: automated bio entity extraction from biomedical text documents,” Data Technologies and Applications, vol. 57, no. 2, pp. 222–244, Apr. 2023, doi: 10.1108/DTA-04-2022-0151.