
Abstract

To address the shortage of labeled data in the medical field, this article proposes a Named Entity Recognition (NER) model that introduces a counterfactual mechanism to enhance vocabulary. Following the idea of semi-supervised learning, the model starts from a small amount of labeled data and builds a counterfactual vocabulary generator, which captures more dependencies and augments medical data by introducing and improving the counterfactual mechanism of the structural causal model. Furthermore, a vocabulary information fusion recognizer is constructed to verify the effectiveness of the enhanced data. The recognizer integrates character feature embeddings, embeddings of vocabulary information from the training data, and position feature embeddings. In addition to enhancing medical vocabulary, the model addresses inaccurate entity recognition and improves recognition accuracy. Comparative and ablation experiments show that the proposed model achieves F1 scores of 84.67% and 86.15% on the CCKS2019 and CCKS2020 datasets, respectively. These scores are 0.22%–4.57% and 0.315–5.04% higher than those of other related models, and 3.82% and 3.86% higher than those of traditional counterfactual generators, respectively, demonstrating the effectiveness of the model.
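For readers unfamiliar with the metric, the following minimal sketch shows how an entity-level F1 score of the kind reported above is typically computed. The abstract does not state the scoring protocol, so exact-match span scoring is an assumption here, and the `Span` representation, the function name `entity_f1`, and the toy entities are hypothetical illustrations rather than the authors' evaluation code.

```python
from typing import Set, Tuple

# Hypothetical span representation: (start_index, end_index, entity_type).
Span = Tuple[int, int, str]


def entity_f1(gold: Set[Span], pred: Set[Span]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) under exact-match span scoring.

    Assumption: a predicted entity counts as correct only if its
    boundaries and its type both match a gold entity exactly.
    """
    true_positives = len(gold & pred)
    precision = true_positives / len(pred) if pred else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy data (invented for illustration): the second prediction has the
# right boundaries but the wrong type, so it is not a true positive.
gold = {(0, 2, "DISEASE"), (5, 7, "DRUG"), (9, 12, "SYMPTOM")}
pred = {(0, 2, "DISEASE"), (5, 7, "SYMPTOM"), (9, 12, "SYMPTOM")}
precision, recall, f1 = entity_f1(gold, pred)
```

In this toy case two of the three predictions match a gold entity, so precision, recall, and F1 all equal 2/3. Under this assumed protocol, an F1 of 84.67% would be the harmonic mean of precision and recall over all entities in the test set.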

Keywords

named entity recognition; vocabulary enhancement; counterfactual model; structural causal model



Named Entity Recognition Model with Counterfactual Mechanism to Enhance Vocabulary
