Abstract
Named entity recognition (NER) is a highly active research topic in natural language processing owing to its widespread applications, and it mainly involves three types of entities: nested, discontinuous, and flat named entities. Notably, nested named entities are characterized by one entity containing multiple sub-entities, ambiguous boundary definitions, and flexible structures. These features give rise to challenges such as semantic ambiguity, slow decoding, error propagation, and information loss. To address these issues and enhance classification performance, it is critical to integrate information sources such as internal markers, neighbouring word pairs, first and last word pairs, labels, and related spans. To this end, the present study proposes a nested named entity recognition model based on triple cross affine attention. The model encodes the input text with BERT and Bi-LSTM, then extracts relevant features with a DCNN. The extracted feature sequences are fed into the triple cross affine attention module, which computes span scores, and an MLP layer performs the final classification and prediction. Experimental results on the ACE2004, ACE2005, and GENIA standard datasets show that the proposed method outperforms existing benchmark models in precision, recall, and F1, demonstrating superior performance on nested named entity recognition.
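To make the scoring idea concrete, the following is a minimal NumPy sketch of a triaffine-style score, in which three span representations (here hypothetically: a first word, a last word, and an internal-context vector) are combined through a rank-3 weight tensor. The function name, dimensions, and bias-augmentation trick are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def triaffine_score(first, last, inner, W):
    """Hypothetical triaffine scoring sketch: combine first-word, last-word,
    and internal-span vectors through a rank-3 weight tensor W."""
    # Append a constant 1 to each vector so W also captures lower-order
    # (biaffine and linear) interaction terms, a common affine-attention trick.
    f = np.append(first, 1.0)
    l = np.append(last, 1.0)
    m = np.append(inner, 1.0)
    # s = sum_{a,b,c} W[a,b,c] * f[a] * l[b] * m[c]
    return np.einsum('abc,a,b,c->', W, f, l, m)

rng = np.random.default_rng(0)
d = 4
W = rng.standard_normal((d + 1, d + 1, d + 1))
score = triaffine_score(rng.standard_normal(d),
                        rng.standard_normal(d),
                        rng.standard_normal(d), W)
```

In a full model, one such score would be produced for every candidate span and label before the MLP classification step; this sketch only shows a single scalar interaction.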
Keywords
