A hybrid approach to domain-independent taxonomy learning

Abstract

Creating domain ontologies is usually performed by teams of knowledge engineers and domain experts, and is considered to be a time-consuming and difficult task. As a result, scientists have started to develop automatic approaches to ontology learning and population. For the proposed research, we focus on the central subtask of ontology learning, being the hypernym detection task, where the system has to detect hierarchical semantic relationships, i.e. hypernym–hyponym relationships, between domain-specific terms, resulting in a domain-specific taxonomy.

We propose in this paper a hybrid approach to automatic taxonomy learning, which combines a data-driven and a knowledge-based component. The data-driven component is composed of a lexico-syntactic pattern-based module, a morpho-syntactic analyzer and a distributional model, whereas the knowledge-based component extracts structured semantic information from the Linked Open Data cloud (DBpedia) and WordNet. The proposed methodology has been applied to three different knowledge domains: viz. food, equipment and science. A thorough quantitative and qualitative evaluation has shown promising results for all considered test domains. In addition, the results show a clear contribution of all different modules to the automatic taxonomy learning task. Although there is still room for improvement for all different modules, our approach outperforms state-of-the-art systems that participated in the SemEval “Taxonomy Extraction Evaluation” task when it comes to comparing the automatically constructed taxonomy against a manually verified gold standard taxonomy. As all modules are run automatically, the system provides a flexible and domain-independent approach to automatic taxonomy learning and could be an important step in solving the knowledge acquisition bottleneck in ontology learning.

Keywords

Taxonomy construction taxonomy learning hypernym detection

Get full access to this article

View all access options for this article.

References

Alexopoulou, D., Andreopoulos, B., Dietze, H., Doms, A., Gandon, F., Hakenberg, J., Khelif, K., Schroeder, M. & Wächter, T. (2009). Biomedical word sense disambiguation with ontologies and metadata: Automation meets accuracy. BMC Bioinformatics, 10(1), 1–15. doi:10.1186/1471-2105-10-1.

Azevedo, C., Iacob, M.E., Almeida, J., van Sinderen, M., Ferreira Pires, L. & Guizzardi, G. (2015). Modeling resources and capabilities in enterprise architecture: A well-founded ontology-based proposal for ArchiMate. Information Systems, 235–262. doi:10.1016/j.is.2015.04.008.

Baroni, M., Bernardi, R., Do, N. & Chung-chieh, S. (2012). Entailment above the word level in distributional semantics. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France (pp. 23–32).

Baroni, M. & Bernardini, S. (2004). BootCaT: Bootstrapping corpora and terms from the web. In Proceedings of LREC 2004 (pp. 1313–1316).

Bernhard, D. (2006). Multilingual term extraction from domain-specific corpora using morphological structure. In Proceedings of EACL, The Association for Computer Linguistics (pp. 171–174).

Biemann, C. (2005). Ontology learning from text: A survey of methods. LDV Forum, 20(2), 75–93.

Bordea, G., Buitelaar, P., Faralli, S. & Navigli, R. (2015). Semeval-2015 task 17: Taxonomy Extraction Evaluation (TExEval). In Proceedings of the 9th International Workshop on Semantic Evaluation, Association for Computational Linguistics, Denver, Colorado (pp. 902–910).

Caraballo, S. (1999). Automatic acquisition of a hypernym-labeled noun hierarchy from text. In Proceedings of ACL-99, Baltimore, MD (pp. 120–126).

Chiarcos, C., McCrae, J., Cimiano, P. & Fellbaum, C. (2013). Towards open data for linguistics: Lexical Linked Data. New Trends of Research in Ontologies and Lexical Resources, 7–25. doi:10.1007/978-3-642-31782-8_2.

10.

Cross, V. & Bathijaa, V. (2010). Automatic ontology creation using adaptation. Artificial Intelligence for Engineering Design, Analysis and Manufacturing, 24(Special Issue 01), 127–141. doi:10.1017/S0890060409000183.

11.

Dastgheib, S., Mesbah, A. & Kochut, K. (2013). mOntage: Building Domain Ontologies from Linked Open Data. In 2013 IEEE Seventh International Conference on Semantic Computing (pp. 70–77). doi:10.1109/ICSC.2013.21.

12.

Faralli, S. & Navigli, R. (2013). A Java framework for multilingual definition and hypernym extraction. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria (pp. 103–108).

13.

Fellbaum, C. (1998). WordNet: An Electronic Lexical Database. MIT Press.

14.

Firth, J.R. (1957). A synopsis of linguistic theory 1930–1955. In

F.R.

Palmer (Ed.), Studies in Linguistic Analysis. Oxford: Philological Society. (Reprinted in F.R. Palmer (Ed.), Selected Papers of J.R. Firth 1952–1959, London: Longman, 1–32.)

15.

Flati, T., Vannella, D., Pasini, T. & Navigli, R. (2014). Two is bigger (and better) than one: The Wikipedia bitaxonomy project. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), Baltimore, Maryland, USA (pp. 22–27).

16.

Fu, R., Guo, J., Qin, B., Che, W., Wang, H. & Liu, T. (2014). Learning semantic hierarchies via word embeddings. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), Baltimore, Maryland, USA (pp. 1199–1209).

17.

Gale, W.A., Church, K. & Yarowsky, D. (1992). One sense per discourse. In Proceedings of the DARPA Speech and Natural Language Workshop, New York, USA (pp. 233–237). doi:10.3115/1075527.1075579.

18.

Grabar, N., Hamon, T. & Bodenreider, O. (2012). Ontologies and terminologies: Continuum or dichotomy? Applied Ontology, 7, 375–386.

19.

Grefenstette, G. (2015). NRIASAC: Simple hypernym extraction methods. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado, USA (pp. 911–914). doi:10.18653/v1/S15-2152.

20.

Gruber, T.R. (1993). A translation approach to portable ontology specifications. Knowledge Acquisition, 5(2), 199–220. doi:10.1006/knac.1993.1008.

21.

Harris, Z.S. (1968). Mathematical Structures of Language. New York: Interscience Publishers John Wiley & Sons.

22.

Hearst, M. (1992). Automatic acquisition of hyponyms from large text corpora. In Proceedings of the International Conference on Computational Linguistics (pp. 539–545).

23.

Hellmann, S., Lehmann, J., Sören, A. & Brümmer, M. (2013). Integrating NLP using linked data. In The Semantic Web – ISWC 2013: 12th International Semantic Web Conference (pp. 98–113). Berlin Heidelberg: Springer. doi:10.1007/978-3-642-41338-4_7.

24.

Hippisley, A., Cheng, D. & Ahmad, K. (2005). The head-modifier principle and multilingual term extraction. Natural Language Engineering, 11(2), 129–157. doi:10.1017/S1351324904003535.

25.

Kozareva, Z. & Hovy, E. (2010). Learning arguments and supertypes of semantic relations using recursive patterns. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL), Uppsala, Sweden (pp. 1482–1491).

26.

Kozareva, Z., Riloff, E. & Hovy, E. (2008). Semantic class learning from the web with hyponym pattern linkage graphs. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL), Columbus, Ohio, USA (pp. 1048–1056).

27.

Lefever, E. (2015). LT3: A multi-modular approach to automatic taxonomy construction. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado, USA (pp. 943–947).

28.

Lefever, E., Macken, L. & Hoste, V. (2009). Language-independent bilingual terminology extraction from a multilingual parallel corpus. In Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Athens, Greece (pp. 496–504).

29.

Lefever, E., Van de Kauter, M. & Hoste, V. (2014). HypoTerm: Detection of hypernym relations between domain-specific terms in Dutch and English. Terminology, 20(2), 250–278. doi:10.1075/term.20.2.06lef.

30.

Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P., Hellmann, S., Morsey, M., van Kleef, P., Auer, S. & Bizer, C. (2014). DBpedia – A large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web Journal, 6(2).

31.

Lenci, A. & Benotto, G. (2012). Identifying hypernyms in distributional semantic spaces. In Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM), Montréal, Canada (pp. 75–79).

32.

Liang, J., Nguyen, T., Koperski, K. & Marchisio, G. (2006). Ontology-based natural language query processing for the biological domain. In Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology (pp. 9–16). New York, New York: Association for Computational Linguistics. doi:10.3115/1654415.1654418.

33.

Macken, L., Lefever, E. & Hoste, V. (2013). TExSIS: Bilingual terminology extraction from parallel corpora using chunk-based alignment. Terminology, 19(1), 1–30. doi:10.1075/term.19.1.01mac.

34.

Malaisé, V., Zweigenbaum, P. & Bachimont, B. (2004). Repérage et exploitation d’énoncés définitoires en corpus pour l’aide à la construction d’ontologie. In

Blache, (Ed.), Proceedings of TALN 2004 (Traitement Automatique des Langues Naturelles), Fès, Maroc (pp. 269–278).

35.

McCrae, J., Aguado-de Cea, G., Buitelaar, P., Cimiano, P., Declerck, T., Gómez-Pérez, A., Gracia, J., Hollink, L., Montiel-Ponsoda, E., Spohr, D. & Wunner, T. (2012). Interchanging lexical resources on the Semantic Web. Language Resources and Evaluation, 46(6), 701–709. doi:10.1007/s10579-012-9182-3.

36.

Meij, E., Bron, M., Hollink, L., Huurnink, B. & de Rijke, M. (2011). Mapping queries to the Linking Open Data cloud: A case study using DBpedia. Web Semantics: Science, Services and Agents on the World Wide Web, 9(4), 418–433.

37.

Mendes, P., Jakob, M. & Bizer, C. (2012). DBpedia for NLP – A multilingual cross-domain knowledge base. In Proceedings of the International Conference on Language Resources and Evaluation (LREC), Istanbul, Turkey (pp. 1813–1817).

38.

Mendes, P.N., Max, J., García-Silva, A. & Bizer, C. (2011). DBpedia spotlight: Shedding light on the web of documents. In Proceedings of the 7th International Conference on Semantic Systems, I-Semantics ’11 (pp. 1–8). New York, NY, USA: ACM. doi:10.1145/2063518.2063519.

39.

Mikolov, T., Sutskever, I.n., Chen, K., Corrado, G. & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems (Vol. 26, pp. 3111–3119). Curran Associates, Inc.

40.

Mititelu, V. (2008). Hyponymy patterns. Semi-automatic extraction, evaluation and inter-lingual comparison. In Text, Speech and Dialogue. Lecture Notes in Computer Science (Vol. 5246, pp. 37–44). doi:10.1007/978-3-540-87391-4_7.

41.

Navigli, R. & Velardi, P. (2010). Learning word-class lattices for definition and hypernym extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden (pp. 1318–1327).

42.

Oakes, M. (2005). Using hearst’s rules for the automatic acquisition of hyponyms for mining a pharmaceutical corpus. In Proceedings of the Workshop Text Mining Research (pp. 63–67).

43.

Pantel, P. & Ravichandran, D. (2004). Automatically labeling semantic classes. In Proceedings of HLT/NAACL-04, Boston, MA (pp. 321–328).

44.

Paulheim, H., Ristoski, P., Mitichkin, E. & Bizer, C. (2014). Data Mining with Background Knowledge from the Web. RapidMiner World.

45.

Peters, W. (2013). Establishing interoperability between linguistic and terminological ontologies. In New Trends of Research in Ontologies and Lexical Resources (pp. 27–42). Berlin Heidelberg: Springer. doi:10.1007/978-3-642-31782-8_3.

46.

Peters, W. (2016). Tackling resource interoperability: Principles, strategies and models. In Proceedings of the LREC 2016 Workshop “Cross-Platform Text Mining and Natural Language Processing Interoperability”, Portoroẑ, Slovenia (pp. 34–37).

47.

Ponzetto, S. & Strube, M. (2011). Taxonomy induction based on a collaborative built knowledge repository. Artificial Intelligence, 175, 1737–1756. doi:10.1016/j.artint.2011.01.003.

48.

Prokofyev, R., Tonon, A., Luggen, M., Vouilloz, L., Djellel Eddine, D. & Cudré-Mauroux, P. (2015). In SANAPHOR: Ontology-Based Coreference Resolution, The Semantic Web – ISWC 2015: 14th International Semantic Web Conference, Bethlehem, PA, USA (pp. 458–473).

49.

Ray, S., Singh, S., Joshi, B.P., Tiwary, U.S., Siddiqui, T., Radhakrishna, M. & Tiwari, M.D. (2009). Exploring multiple ontologies and WordNet framework to expand query for question answering system. In Proceedings of the First International Conference on Intelligent Human Computer Interaction (IHCI 2009) (pp. 296–305). New Delhi: Springer. doi:10.1007/978-81-8489-203-1_29.

50.

Rei, M. & Briscoe, T. (2014). Looking for hyponyms in vector space. In Proceedings of the Eighteenth Conference on Computational Natural Language Learning (pp. 68–77).

51.

Ritter, A., Soderland, S. & Etzioni, O. (2009). What is this, anyway: Automatic hypernym discovery. In Proceedings of Association for Advancement of Artificial Intelligence Spring Symposium on Learning by Reading and Learning to Read (pp. 88–93).

52.

Roller, S., Erk, K. & Boleda, G. (2014). Inclusive yet selective: Supervised distributional hypernymy detection. In Proceedings of COLING 2014, The 25th International Conference on Computational Linguistics: Technical Papers, Dublin, Ireland (pp. 1025–1036).

53.

Santus, E., Lenci, A., Lu, Q. & Schulte Im Walde, S. (2014). Chasing hypernyms in vector spaces with entropy. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (pp. 38–42). Sweden: Gothenburg.

54.

Shwartz, V., Goldberg, Y. & Dagan, I. (2016). Improving hypernymy detection with an integrated path-based and distributional method. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany.

55.

Sowa, J.F. (2000). Knowledge Representation: Logical, Philosophical and Computational Foundations. Pacific Grove, CA, USA: Brooks/Cole Publishing Co.

56.

Sparck Jones, K. (1979). Experiments in relevance weighting of search terms. Information Processing and Management, 15, 133–144. doi:10.1016/0306-4573(79)90060-8.

57.

Tjong Kim Sang, E., Hofmann, K. & de Rijke, M. (2011). Extraction of Hypernymy Information from Text, Interactive Multi-Modal Question-Answering. Theory and Applications of Natural Language Processing (pp. 223–245). Berlin Heidelberg: Springer.

58.

Van de Kauter, M., Coorman, G., Lefever, E., Desmet, B., Macken, L. & Hoste, V. (2013). LeTs Preprocess: The Multilingual LT3 Linguistic Preprocessing Toolkit. Computational Linguistics in the Netherlands Journal.

59.

Van der Plas, L. & Bouma, G. (2005). Automatic acquisition of lexico-semantic knowledge for question answering. In Proceedings of the IJCNLP Workshop on Ontologies and Lexical Resources, Jeju Island, Korea.

60.

Velardi, P., Faralli, S. & Navigli, R. (2013). OntoLearn reloaded: A graph-based algorithm for taxonomy induction. Computational Linguistics, 39(3), 665–707. doi:10.1162/COLI_a_00146.

61.

Weeds, J. & Weir, D. (2003). A general framework for distributional similarity. In Proceedings of EMNLP-03, Sapporo, Japan (pp. 81–88).

62.

Wright, S.E. (1997). Term selection: The initial phase of terminology management. In Handbook of Terminology Management (pp. 13–23). John Benjamins. doi:10.1075/z.htm1.04wri.

63.

Wright, S.E. & Budin, G. (2001). Handbook of Terminology Management, Volume 2: Application-Oriented Terminology Management. John Benjamins Publishing Company.

64.

Zhang, C., Niu, Z., Jiang, P. & Fu, H. (2012). Domain-specific term extraction from free texts. In 9th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2012) (pp. 1290–1293). IEEE. doi:10.1109/FSKD.2012.6234350.