Abstract
Question answering (QA) over tabular and textual data is a task proposed in recent years in the field of QA. At present, most QA systems return answers from a single data form, such as knowledge graphs, tables, or text. In real life, however, hybrid data combining structured and unstructured content is pervasive. Recent work on TAT-QA suffers from high error rates when extracting supporting evidence from both tabular and textual content. This paper addresses the problem of failed evidence extraction from such complex and realistic hybrid data. We first propose two metrics to evaluate evidence extraction performance on hybrid data: the wrong evidence ratio (WER) and the missing evidence ratio (MER). Second, we employ a candidate extractor to obtain supporting evidence related to the question. Third, we design an origin selector to determine where the question's answer comes from. Finally, the loss of the origin selector is fused into the final loss function, which improves evidence extraction performance. Experimental results on the TAT-QA dataset show that our proposed model outperforms the best baseline in terms of F1, WER, and MER, demonstrating the effectiveness of our model.
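The abstract does not give formal definitions of WER and MER; a minimal sketch of one plausible reading, in which WER is the fraction of predicted evidence items that are not in the gold set and MER is the fraction of gold evidence items missed by the prediction, is shown below. The function names and the span identifiers in the example are illustrative assumptions, not from the paper.

```python
# Assumed definitions based on the metric names only (not taken from the paper).
def wrong_evidence_ratio(predicted, gold):
    """Fraction of predicted evidence items that are not in the gold set."""
    if not predicted:
        return 0.0
    return len(set(predicted) - set(gold)) / len(set(predicted))

def missing_evidence_ratio(predicted, gold):
    """Fraction of gold evidence items absent from the predictions."""
    if not gold:
        return 0.0
    return len(set(gold) - set(predicted)) / len(set(gold))

# Hypothetical example: one wrong prediction and one missed gold item,
# where "row_*" are table cells and "sent_*" are text sentences.
pred = ["row_2", "row_3", "sent_1"]
gold = ["row_2", "row_3", "sent_4"]
print(wrong_evidence_ratio(pred, gold))    # 1/3
print(missing_evidence_ratio(pred, gold))  # 1/3
```

Under this reading, a lower WER means fewer spurious evidence items and a lower MER means fewer missed ones, so both metrics are minimized by a perfect extractor.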
