PLM-PGHC: A novel de-biasing framework for robust question answering

Abstract

Reading Comprehension models have achieved superhuman performance on mainstream public datasets. However, many studies have shown that the models are likely to take advantage of biases in the datasets, which makes it difficult to efficiently reasoning when generalizing to out-of-distribution datasets with non-directional bias, resulting in serious accuracy loss. Therefore, this paper proposes a pre-trained language model based de-biasing framework with positional generalization and hierarchical combination. In this work, generalized positional embedding is proposed to replace the original word embedding to initially weaken the over-dependence of the model on answer distribution information. Secondly, in order to make up for the influence of regularization randomness on training stability, KL divergence term is introduced into the loss function to constrain the distribution difference between the two sub models. Finally, a hierarchical combination method is used to obtain classification outputs that fuse text features from different encoding layers, so as to comprehensively consider the semantic features at the multidimensional level. Experimental results show that PLM-PGHC helps learn a more robust QA model and effectively restores the F1 value on the biased distribution from 37.51% to 81.78%.

Keywords

Natural language processing machine reading comprehension pre-trained language model de-biasing framework

Get full access to this article

View all access options for this article.

References

Agushaka

J.O.

, Ezugwu

A.E.

and Abualigah

, Dwarf mongoose optimization algorithm[J], Computer Methods in Applied Mechanics and Engineering 391 (2022), 114570.

Bahdanau

, Cho

and Bengio

, Neural machine translation by jointly learning to align and translate[J], arXiv preprint arXiv:1409.0473, 2014.

Baradaran

, Ghiasi

and Amirkhani

, A survey on machine reading comprehension systems[J], Natural Language Engineering 28(6) (2022), 683–732.

Bezdan

, Stoean

, Naamany

A.A.

et al., Hybrid fruit-fly optimization algorithm with k-means for text document clustering[J], Mathematics 9(16) (2021), 1929.

Ezugwu

A.E.

, Agushaka

J.O.

, Abualigah

et al., Prairie dog optimization algorithm[J], Neural Computing and Applications 34(22) (2022), 20017–20065.

Qiu

, Liu

, Zhou

et al., Adversarial attack and defense technologies in natural language processing: A survey[J], Neurocomputing 492 (2022), 278–307.

Clark

, Yatskar

and Zettlemoyer

, Don’t take the easy way out: Ensemble based methods for avoiding known dataset biases[C], Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019:4069–4082.

Devlin

, Chang

M.W.

, Lee

et al., Bert: Pre-training of deep bidirectional transformers for language understanding[J], arXiv preprint arXiv:1810.04805, 2018.

Ganesh

, Chen

, Lou

et al., Compressing large-scale transformer-based models: A case study on bert[J], Transactions of the Association for Computational Linguistics 9 (2021), 1061–1080.

10.

Han

, Hsu

I.H.

, Sun

et al., ESTER: A machine reading comprehension dataset for reasoning about event semantic relations[C], Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021:7543–7559.

11.

Hermann

K.M.

, Kocisky

, Grefenstette

et al., Teaching machines to read and comprehend[J], Advances in Neural Information Processing Systems 28 (2015), 1693–1701.

12.

Hinton

G.E.

, Training products of experts by minimizing contrastive divergence[J], Neural Computation 14(8) (2002), 1771–1800.

13.

Hosseinalipour

and Ghanbarzadeh

, A novel metaheuristic optimisation approach for text sentiment analysis[J], International Journal of Machine Learning and Cybernetics 14(3) (2023), 889–909.

14.

Huq

S.M.

, Maskeliūnas

and Damasevicius

, Dialogue agents for artificial intelligence-based conversational systems for cognitively disabled: A systematic review[J], Disability and Rehabilitation: Assistive Technology, 2022:1–20.

15.

Joshi

, Chen

, Liu

et al., Spanbert: Improving pre-training by representing and predicting spans[J], Transactions of the Association for Computational Linguistics 8 (2020), 64–77.

16.

, Lee

, Kim

et al., Look at the First Sentence: Position Bias in Question Answering[C], Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020:1109–1121.

17.

Lai

, Xie

, Liu

et al., Race: Large-scale reading comprehension dataset from examinations[J], arXiv preprint arXiv:1704.04683, 2017.

18.

Lai

, Zhang

, Feng

et al., Why Machine Reading Comprehension Models Learn Shortcuts?[C], Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021:989–1002.

19.

Liu

, Zhou

, Zhao

et al., K-bert: Enabling language representation with knowledge graph[C], Proceedings of the AAAI Conference on Artificial Intelligence 34(03) (2020), 2901–2908.

20.

Nadeem

M.I.

, Ahmed

, Li

et al., SHO-CNN: A metaheuristic optimization of a convolutional neural network for multi-label news classification[J], Electronics 12(1) (2022), 113.

21.

Niu

and Zhang

, Introspective distillation for robust question answering[J], Advances in Neural Information Processing Systems 34 (2021), 16292–16304.

22.

Omoregbe

N.A.I.

, Ndaman

I.O.

, Misra

et al., Text messaging-based medical diagnosis using natural language processing and fuzzy logic[J], Journal of Healthcare Engineering 2020 (2020), 1–14.

23.

Qiu

, Liu

, Zhou

et al., Adversarial attack and defense technologies in natural language processing: A survey[J], Neurocomputing 492 (2022), 278–307.

24.

Rajpurkar

, Jia

and Liang

, Know What You Don’t Know: Unanswerable Questions for SQuAD[C], Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018:784–789.

25.

Rajpurkar

, Zhang

, Lopyrev

et al., Squad: 100,000+ questions for machine comprehension of text[J], arXiv preprint arXiv:1606.05250, 2016.

26.

Seo

, Kembhavi

, Farhadi

et al., Bidirectional attention flow for machine comprehension[J], arXiv preprint arXiv:1611.01603, 2016.

27.

Sugawara

, Inui

, Sekine

et al., What Makes Reading Comprehension Questions Easier?[C], Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018:4208–4219.

28.

Vaswani

, Shazeer

, Parmar

et al., Attention is all you need[J], Advances in Neural Information Processing Systems 2017, 30.

29.

Vinyals

, Fortunato

and Jaitly

, Pointer networks[J], Advances in Neural Information Processing Systems (2015), 28.

30.

Wang

and Jiang

, Machine comprehension using match-lstm and answer pointer[J], arXiv preprint arXiv:1608.07905, 2016.

31.

Wang

, Yang

, Wei

et al., Gated self-matching networks for reading comprehension and question answering, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017:189–198.

32.

, Li

, Wang

et al., R-drop: Regularized dropout for neural networks[J], Advances in Neural Information Processing Systems 34 (2021), 10890–10905.

33.

Yang

, Dai

, Yang

et al., Xlnet: Generalized autoregressive pretraining for language understanding[J], Advances in Neural Information Processing Systems (2019), 32.

34.

A.W.

, Dohan

, Luong

M.T.

et al., QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension[C], International Conference on Learning Representations.

35.

Zhu

, Wang

and Kong

, Counterfactual QA: Eliminating Bias in Question Answering, (2021).