Abstract
The widespread application of Large Language Models (LLMs) now extends to education and assessment, underscoring their broadening scope. In this context, pretrained LLMs are employed to generate large volumes of questions from the linguistic patterns learned during pretraining. However, automatically generating multihop questions remains challenging: hallucinatory named entities and false-positive phrases appear when such questions are generated at scale. This challenge arises from the tendency of language models to capture co-occurrence within a given context rather than accurately identifying the relationships between named entities, a phenomenon known as knowledge gaps. Techniques that support named-entity expansion are therefore in high demand. Retrieval Augmented Models (RAMs) address this need by incorporating standard knowledge models, such as ontologies, for named entity-based expansion of the source text. However, the factual coverage of any single ontology is rarely adequate for text expansion, which is where Ontology Mapping (OM) requires attention. Two key objectives are pursued here: 1. extraction of overlapping entity mappings in the Basic Matching (BM) stage, and 2. ranking of intersectional entity mappings in the Final Alignment (FA) stage. With this motivation, experiments are conducted on OM approaches and on their integration with RAM and transformer models for the multihop question generation process. Analysis using benchmark datasets and evaluation metrics demonstrates that the proposed hybrid LLM model, which combines Ontology Mapping (OM), a RAG model (RAM), and a Large Language Model (LLM), achieves ROUGE-L scores ranging from 41% to 45% and BERTScore between 74% and 88%, indicating strong relevance to the entity context.
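The two OM stages named above can be illustrated with a minimal sketch. The entity labels, the string-similarity ranking, and the query entity are illustrative assumptions for exposition only, not the paper's actual matching or alignment implementation:

```python
from difflib import SequenceMatcher

def basic_matching(onto_a, onto_b):
    """BM stage: extract entity labels that overlap between two ontologies."""
    return set(onto_a) & set(onto_b)

def final_alignment(candidates, query):
    """FA stage: rank the intersectional entity mappings, here by simple
    string similarity to a query entity (an assumed scoring criterion)."""
    score = lambda e: SequenceMatcher(None, e.lower(), query.lower()).ratio()
    return sorted(candidates, key=score, reverse=True)

# Toy ontologies with hypothetical entity labels
onto_a = {"Barack Obama", "Hawaii", "Harvard Law School"}
onto_b = {"Hawaii", "Harvard Law School", "Chicago"}

overlap = basic_matching(onto_a, onto_b)          # BM output
ranked = final_alignment(overlap, "Harvard")      # FA output
```

The ranked mappings would then feed the retrieval-augmented expansion step that supplies entity context to the question generator.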
Additionally, the study introduces a new metric, RAG Assessment (RAGAS), and the results reveal that the proposed approach effectively balances ROUGE-L and RAGAS-Precision scores, which range from 39% to 43%, highlighting reduced hallucination in auto-generated multihop questions.
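For reference, the ROUGE-L scores reported above are based on the longest common subsequence (LCS) between a generated question and a reference question. A minimal sketch of the standard LCS-based F-measure follows (this is the textbook definition, not the authors' evaluation code; the beta value is a common default assumption):

```python
def lcs_len(a, b):
    """Length of the longest common subsequence of two token lists."""
    m, n = len(a), len(b)
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m):
        for j in range(n):
            dp[i + 1][j + 1] = (dp[i][j] + 1 if a[i] == b[j]
                                else max(dp[i][j + 1], dp[i + 1][j]))
    return dp[m][n]

def rouge_l_f1(candidate, reference, beta=1.2):
    """ROUGE-L F-measure from LCS-based precision and recall."""
    c, r = candidate.split(), reference.split()
    lcs = lcs_len(c, r)
    if lcs == 0:
        return 0.0
    p, rec = lcs / len(c), lcs / len(r)
    return (1 + beta**2) * p * rec / (rec + beta**2 * p)
```

BERTScore and RAGAS-Precision, by contrast, score semantic similarity and retrieval faithfulness respectively, which is why the paper reports them alongside the lexical ROUGE-L measure.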
