A thesaurus data model for an intelligent retrieval system

Abstract

This paper demonstrates the application of conventional database design techniques to thesaurus representation. The thesaurus is considered as a printed document, as a semantic net, and as a relational database to be used in conjunction with an intelligent information retrieval system. Some issues raised by analysis of two standard thesauri include the prevalence of compound terms and the representation of term structure; thesaurus redundancy and the extent to which it can be eliminated in machine-readable versions; the difficulty of exploiting thesaurus knowledge originally designed for human rather than automatic interpretation; deriving "strength of association" measures between terms in a thesaurus considered as a semantic net; facet representation; and the need for variations in the data model to cater for structural differences between thesauri. A complete schema of database tables is presented, with an outline suggestion for using the stored information when matching one or more thesaurus terms with a user's query.
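
To make the idea of a thesaurus held in relational tables concrete, the following is a minimal sketch and not the schema presented in the paper. The table names (term, relation), the column names, the sample terms, and the helper functions are all hypothetical. It assumes the usual BT (broader term), RT (related term) and USE links, stores each hierarchical link once as BT so that narrower terms are derived by reversing it rather than stored redundantly, and matches compound terms against a query by simple phrase lookup.

```python
import sqlite3

# Hypothetical schema: names are illustrative assumptions, not the paper's tables.
SCHEMA = """
CREATE TABLE term (
    term_id   INTEGER PRIMARY KEY,
    label     TEXT NOT NULL UNIQUE,
    preferred INTEGER NOT NULL DEFAULT 1   -- 0 marks a non-preferred entry term
);
CREATE TABLE relation (
    from_id INTEGER NOT NULL REFERENCES term(term_id),
    to_id   INTEGER NOT NULL REFERENCES term(term_id),
    kind    TEXT NOT NULL CHECK (kind IN ('BT', 'RT', 'USE')),
    PRIMARY KEY (from_id, to_id, kind)
);
"""


def build_demo() -> sqlite3.Connection:
    """Create an in-memory thesaurus with a few invented sample terms."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    labels = [("information retrieval", 1), ("document retrieval", 1),
              ("query expansion", 1), ("text retrieval", 0)]
    conn.executemany("INSERT INTO term(label, preferred) VALUES (?, ?)", labels)
    links = [("document retrieval", "BT", "information retrieval"),
             ("query expansion", "RT", "information retrieval"),
             ("text retrieval", "USE", "document retrieval")]
    for src, kind, dst in links:
        conn.execute(
            "INSERT INTO relation(from_id, to_id, kind) "
            "SELECT a.term_id, b.term_id, ? FROM term a, term b "
            "WHERE a.label = ? AND b.label = ?", (kind, src, dst))
    return conn


def narrower_terms(conn: sqlite3.Connection, label: str) -> list[str]:
    """Derive NT links by reversing the stored BT links."""
    rows = conn.execute(
        "SELECT n.label FROM relation r "
        "JOIN term n ON n.term_id = r.from_id "
        "JOIN term b ON b.term_id = r.to_id "
        "WHERE r.kind = 'BT' AND b.label = ? ORDER BY n.label", (label,))
    return [row[0] for row in rows]


def match_query(conn: sqlite3.Connection, query: str) -> list[str]:
    """Find thesaurus terms occurring as whole phrases in the query,
    replacing non-preferred terms by the term their USE link points to."""
    padded = " " + " ".join(query.lower().split()) + " "
    terms = conn.execute(
        "SELECT term_id, label FROM term ORDER BY label").fetchall()
    matched: list[str] = []
    for term_id, label in terms:
        if f" {label} " not in padded:
            continue
        use = conn.execute(
            "SELECT p.label FROM relation r "
            "JOIN term p ON p.term_id = r.to_id "
            "WHERE r.kind = 'USE' AND r.from_id = ?", (term_id,)).fetchone()
        target = use[0] if use else label
        if target not in matched:
            matched.append(target)
    return matched


if __name__ == "__main__":
    conn = build_demo()
    print(narrower_terms(conn, "information retrieval"))
    print(match_query(conn, "text retrieval with query expansion"))
```

Storing only the BT direction is one possible way to remove the redundancy of a printed thesaurus, where each hierarchical link appears under both terms; whether the paper makes the same choice is not stated in the abstract.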
