Sage Journals: Discover world-class research

Abstract

We live in a big data era of unstructured data expressed as natural language (NL) texts. As the volume of text-based information grows, effective methods for encoding and extracting meaningful knowledge from this corpus are of paramount relevance. A challenging task concerns transforming NL texts into structured and semantically rich data. Semantic web technologies have revolutionized how we represent and access structured knowledge. Resource description framework (RDF) triples serve as a fundamental building block for this purpose, enabling the integration of diverse data sources. This investigation examines methods for RDF triple generation and knowledge graphs (KGs) enhancement from NL texts. This study area presents wide-ranging applications encompassing knowledge representation, data integration, NL understanding, and information retrieval. Our systematic literature review addresses the understanding, characterization, and identification of challenges and limitations in existing approaches to RDF triple generation from NL texts and their inclusion into an existing KG. We retrieved, categorized, and analyzed $150$ articles from several scientific databases. We provide a comprehensive overview of the field, identify research gaps, and provide directions for future research. We found the most commonly available study categories, especially considering the domain, target language, the public availability of datasets, and real-world applications. Our results reveal a growing trend in this field in the last few years related to the use of transformer-based machine learning methods for triple generation. Our study also drives innovation by highlighting open research questions and providing a road map for future investigations.

Keywords

knowledge graph construction RDF triples text-to-triple natural language processing

1. Introduction

An unprecedented volume of text is generated daily in modern information systems, leading to large data sources from which meaningful knowledge can be derived. In particular, natural language (NL) texts (e.g., web pages, social network posts, and unstructured textual documents) have been generated substantially in the past few years (Kríž et al., 2014).

A great portion of such textual data remains largely unprocessed and untapped (Egger & Gokce, 2022), representing a reservoir of information that could be explored for deriving actionable insights. In this scenario, developing novel methodologies and software tools to convert large volumes of unstructured text into structured, computer-interpretable knowledge is of paramount relevance.

Unlocking the potential of unprocessed NL text can significantly contribute to informed decision-making grounded on knowledge representation. With resource description framework (RDF) (Candan et al., 2001) triples at their core, Semantic Web technologies offer an approach to organizing said information. In this context, knowledge graphs (KGs) (Ehrlinger & Wöß, 2016) play a key role in creating rich and interconnected RDF triples by structuring interlinked data meaningfully (Paulheim, 2017).

KGs serve as invaluable assets across diverse domains, ranging from applications related to improving search engine outcomes (Xiong et al., 2017) to those targeting the enhancement of results of artificial intelligence models and applications (Schneider et al., 2022). Our motivation extends to the critical role KGs play in powering various applications and services (Ji et al., 2022; Zou, 2020). These structured knowledge repositories find applications in diverse fields, including but not limited to information retrieval systems (Arnaout & Elbassuoni, 2018), query and answering questions (Yasunaga et al., 2021), and data integration platforms (Bytyçi et al., 2023; Suchanek & Weikum, 2013a).

The core issue addressed in this investigation is the effective generation of RDF triples from NL texts. This often encompasses tasks regarding entity recognition (Mansouri et al., 2008), relation extraction (Nasar et al., 2021), and challenges in transforming textual information into a structured, semantically rich format. Our study also investigates existing methods in the literature for incorporating generated RDF triples into an existing KG, respecting an underlying ontology (Gruber, 1995).

Studies present distinct approaches for generating RDF triples from NL texts based on natural language processing (NLP) techniques. Some consider rule-based methods like open information extraction (OpenIE) (Dutta et al., 2015) and semantic role labeling (SRL) (Rossanez et al., 2020b) which aim to identify the triple elements (subject, predicate, and object) from textual sentences based on their grammatical structures. More recent research employ transformers (Vaswani et al., 2017) and other types of neural network architectures in identifying and generating RDF triples to generate a KG from scratch.

We investigate additional challenges in enhancing an existing KG with new RDF triples. For instance, KGs require respecting existing intrinsic ontology statements. This often involves identifying already-existing triples and whether they follow such ontology definitions. This integration is even harder when constructing a KG relies on assimilating a fully comprehensive knowledge set at once. KGs are constructed based on the information available at a given moment, making it arduous to capture the entirety of knowledge in a single instantiation comprehensively. The constantly expanding repository of NL texts introduces a dynamic facet to this challenge. Textual data is not stagnant, as new knowledge is continuously generated, thus requiring, a mechanism for KGs to keep pace with this influx of information. In this sense, KGs need to constantly change and evolve automatically or manually to adhere to the evolution of knowledge in the domain they represent (Regino et al., 2021). Information “freshness” and “recency” in KGs of any size or domain is critical to their usefulness (Sendyk et al., 2022).

Existing survey articles on RDF triple generation emphasize the creation of new KGs (Bytyçi et al., 2023; Suchanek & Weikum, 2013a). We found studies describing solutions for similar problems, including transforming table-formatted texts in RDF triples (Auer et al., 2007), transforming relational databases in RDF triples (Thuy et al., 2014), and generating NL texts from RDF triples (Moussallem et al., 2018). Nevertheless, to the best of our knowledge, there is no available systematic literature review that combines research related to RDF triple generation and KG enhancement from NL texts.

This systematic literature review provides insights into how unstructured textual data are transformed into RDF triples and how the produced knowledge is aggregated into existing KGs linked to an underlying ontology. Our specific objectives are:

–
Investigate studies and techniques that identify relevant parts of NL-produced texts to transform it into RDF triples. Intent and entity discovery, relation extraction, and named entity recognition are examples of such techniques.
–
Investigate techniques that automatically or semi-automatically build RDF triples in a single domain (e.g., biology and medicine) or multi-domain.
–
Investigate methods that link the created triples to an existing set of classes and relations defined by one or more ontologies.
–
Investigate techniques that check the added knowledge’s for consistency, validity, and semantic coherence.

Our study employs a systematic approach organized into three phases: preparation, execution, and reporting. Each phase involves distinct steps, from formulating research questions to categorizing and analyzing relevant articles. Our methodology includes defining research questions, identifying query strings, defining inclusion/exclusion criteria, categorizing studies and methods, generating metadata, description, and analysis of results, and reporting on open challenges in the field.

Our study offer in-depth descriptions of categories, challenges, advancements, and triple-generation trends from NL texts and KG enhancement. We offer researchers and practitioners valuable insights for future investigations. For those entering the study of text-to-triple transformation, this review provides a structured overview, highlighting key considerations in the field. Our study also identifies the varied approaches and methodologies employed in the surveyed articles. Paper search identified $15$ articles on RDF triple generation from unstructured NL texts for KG enhancement, which were systematically categorized into $10$ distinct categories originally defined in this study.

The proposed categorization is a foundation for synthesizing information and discerning common trends, challenges, and advancements within each thematic group. This structured approach may be relevant to researchers, practitioners, and enthusiasts interested in extracting targeted knowledge based on their specific areas of interest. Readers, especially those new to the field, can navigate these categories to understand specific aspects, like technical methodologies, language specificity, and ontology construction. Follows: aside from this introduction, this article has seven other sections. Section 2 formalizes the problem and its components. Section 3 presents the methodology of how we systematically evaluated the state-of-the-art. Section 4 provides quantitative analyses of the retrieved articles. Section 5 categorizes and describes the key relevant articles filtered, and Section 6 discusses relevant aspects of our findings and addresses the research questions. Finally, Section 7 draws conclusion remarks of the article.
2. The Triple Generation Problem

This section defines the triple generation problem and highlights its inherent difficulties (Section 2.1), using a motivating example (Section 2.2).

2.1. Triple Generation: Formalization and Challenges

This section formalizes the key terms and concepts central to this research. Figure 1 presents the typical pipeline for triple generation and KG enhancement from NL texts.

Figure 1.

Typical pipeline for triple generation and KG enhancement based on NL texts. It comprises five elements (set of textual documents, ontology, generated triples, existing KG, and enhanced KG) and two processing components. Boxes $A$ and $B$ represent the RDF triple generation and the KG enhancement components, respectively. KG = knowledge graph; NL = natural language; RDF = resource description framework.

As core elements we have: Ontology ( $O$ ):

A shared and formal conceptualization and representation of knowledge (Gruber, 1995). Formally, an ontology $O$ is formed by the components $(C, P, R)$ , in which $C$ represents a set of classes, $P$ is a set of properties, and $R$ is a set of relationships. $O$ is also named T-Box.

Knowledge graph (KG):

A graph-based representation constructed by linking RDF triples according to the structure defined in ontologies. Formally, a KG is a connected directed graph formed by $V$ vertices (entities) and their directed $E$ edges (connections).

Class (

C_{i}

A category or type within the ontology representing a set of entities with common properties. Formally, $C_{i}$ is an element of the set of classes $C$ in the ontology.

Property (

p_{i}

A characteristic or attribute associated with entities in the ontology. Formally, $p_{i}$ is an element of the ontology’s set of properties $P$ .

Triple (

T_{i}

A statement in the form of a subject-predicate-object, representing a relation between entities in the KG. Formally, $T_{i}$ is the triple $(s, p, o)$ in which $s$ is a subject entity, $p$ is a predicate (property), and $o$ is an object entity.

Based on such formalization, the problem of generating RDF triples from NL texts and appending them to an existing KG can be formally defined as follows:

Given a set of NL texts $T = {t_{1}, t_{2}, \dots, t_{n}}$ and an existing knowledge graph $K G$ ( $K G \neq \emptyset$ ), the goal is to generate RDF triples $S_{T} = {(s_{1}, p_{1}, o_{1}), (s_{2}, p_{2}, o_{2}), \dots, (s_{m}, p_{m}, o_{m})}$ from $T$ (component A in Figure 1), such that generated triples are inserted into $K G$ leading to an enhanced knowledge graph $K G^{'}$ , that is, $K G^{'} = K G \cup S_{T}$ (component B).

Common challenging problems faced in triple generation refer to: –

Ambiguity in NL: NL is inherently ambiguous, hindering the precise identification of entities, relationships, and attributes. Resolving this ambiguity is critical for accurate triple generation.

–

Diverse sentence structures: NL exhibits a wide range of sentence structures, making it difficult to design an approach that fits all problems for extracting RDF triples. Handling diverse syntactic patterns adds complexity to the problem.

–

Contextual dependencies: Extracting triples often requires understanding the context in a document or a broader knowledge context. Capturing contextual dependencies is key to accurately generating triples.

–

Named entity recognition (NER): Identifying entities within the text is a subproblem that involves NER. Variability in entity types and expressions poses challenges in accurately recognizing entities in different contexts.

–

Entity linking (EL): Beyond simply recognizing entity mentions, EL is the process of disambiguating and grounding each mention to a unique URI in the target KG (or another reference knowledge base) (Möller et al., 2022). This step is used to avoid creating duplicate nodes for the same real-world entity and to ensure that newly generated triples correctly attach to existing graph elements.

Aggregating triples to an existing KG poses additional challenges, such as: –

T-Box alignment: A new RDF triple can be composed of entity and property types not described in the initial KG. This new structure must be aggregated into the T-Box portion of the KG and the triple itself in the A-Box portion. In some situations, however, that specific triple may not be incorporated into the KG, depending on its use context. If the KG’s T-Box is aligned with a third-party ontology, including such new knowledge representation may even affect the alignment.

–

A-Box alignment: The entities (subject, object, or both) of the new triple may already be present in the graph. In this case, adding a triple will require identifying whether the URIs representing it are already available in the KG. If so, this could lead to appending a new property (i.e., the new triple’s predicate) to existing entities or linking an existing entity to a new one. Also, neither of the elements of the new triple may already exist in the KG; in this case, those new elements may end up being disconnected. Once triples are generated, an entity linking module must determine whether each subject and object mention already exists in the KG (i.e., assign the correct URI). If a mention resolves to an existing node, the new property is appended; otherwise a new node is created and linked.

–

URI alignment: In case the URI creation for the new triples is automatic, adding such triples to an existing KG may require searching for existing URIs that represent the same entities. This would ensure consistency in knowledge representation.

2.2. Motivating Example

Let us consider the following sentences:

–
$t_{1}$ = “SpaceX, founded by Elon Musk, is known for pioneering space exploration and innovation”;
–
$t_{2}$ = “The engine is incompatible with Ford Mustang 1990.”

For the first example $t_{1}$ , relevant parts of it are identified. Step A of Figure 1 identifies entities, such as “SpaceX,” “Elon Musk,” “space exploration,” and “innovation” from the given text, transforming this information into RDF triples, producing relationships like:

–
<SpaceX, founded_by, Elon Musk>
–
<SpaceX, known_for, space_exploration>
–
<SpaceX, known_for, innovation>

The created triples should be integrated into an existing $K G$ , based on the application of ontology statements from $O$ . In this context, the rules identify “SpaceX” as a company and “Elon Musk” as a founder, ensuring these classes are present in $O$ . Given an existing $K G$ , which initially comprises nodes for “SpaceX,” “founder,” and general relations, integrating new triples enriches this graph. The updated KG—enhanced knowledge graph $K G^{'}$ —(step B of Figure 1) now includes nodes specific to “space exploration” and “innovation,” accompanied by their corresponding relations.

For $t_{2}$ , entities like “engine,” “incompatible,” and “Ford Mustang 1990” are identified in $T$ in step A of Figure 1. The generated RDF triples build relationships such as <engine, incompatible_with, Ford_Mustang_1990>. Exploring ontology elements helps ensure the meaningful integration of that into the $K G$ . For instance, identify “engine” as a component, “incompatible” as a property, and “Ford Mustang 1990” as a specific model of $O$ . Examining the state of $K G$ before the integration of new information reveals existing nodes for “engine” and “incompatible.” The subsequent integration of $S_{T}$ enhances this $K G$ by adding nodes specific to “Ford Mustang 1990” in step B of Figure 1 reaching knowledge graph $K G^{'}$ .

We list the following challenges for converting $t_{1}$ and $t_{2}$ into triples:

–
Ambiguity in NL: The phrase “founded by” may pose ambiguity. It could be interpreted as the founder of a company or the person behind a project;
–
Diverse sentence structures: The sentence structure includes both the subject “SpaceX” and the additional information “founded by Elon Musk” requiring a mechanism to handle diverse syntactic patterns.
–
Contextual dependencies: Understanding the context that associates Elon Musk with SpaceX is essential. What does “incompatible” mean? The whole engine? Without context awareness, incorrect triples may be generated.
–
Named entity recognition (NER): NER is required to identify “SpaceX,” “Elon Musk,” “Ford,” “Mustang,” “1990,” and “engine” as entities, introducing the challenge of recognizing entity types accurately.

3. Methodology

This section describes the methodology for systematically reviewing the literature on triple generation and KG enhancement from NL texts based on Budgen and Bereton (Budgen & Brereton, 2006). Figure 2 presents our methodology with three distinct phases: preparation, execution, and reporting. A series of ordered steps further delineate each of these phases. This phase-based structure provides a comprehensive overview of the entire procedure and highlights our experimental approach’s systematic and controlled nature.

Figure 2.

Systematic literature review methodology comprising three main phases: preparation, execution, and reporting. Each phase comprises steps represented by numbers from 1 to 15.

The “Preparation Phase” serves as the initial step, in which we lay the groundwork for our investigation. During this phase, we formulate research questions, identify relevant scientific databases, and establish rigorous inclusion and exclusion criteria (steps 1–6 in Figure 2). The preparation phase seeks to construct a process that guides this systematic literature review article.

We proceed to the “Execution Phase,” in which we pragmatically implement the process discussed in the preparation phase. This phase entails executing a sequence of planned tasks, including database queries, article retrieval, and applying inclusion and exclusion criteria. We synthesize data, categorize results, and perform preliminary analyses (steps 7–11 in Figure 2). The execution phase emphasizes data acquisition and initial assessments.

Finally, the research concludes with the “Reporting Phase.” We gather all our analyses, including bibliometric, quantitative, and qualitative assessments, into a cohesive and structured presentation. This phase is dedicated to synthesizing our findings, the systematic organization of results, and in-depth analytical examinations (steps 13–15 in Figure 2). We discuss our outcomes comprehensively, emphasizing their implications and significance within the context of our survey’s objectives.

Sections 3.1, 3.2, and 3.3 describe each phase and their steps.

3.1. Preparation Phase

1. Study the necessity: The initial step of the preparation phase was essential to thoroughly study the necessity of conducting this investigation. We understood the current landscape of RDF triple generation from NL texts and investigated the relevance of this topic within semantic web technologies and knowledge representation. On this basis, we justify the significance of our study. The thorough study of such literature gaps not only justified beginning our investigation but also guided the subsequent phases by providing a clear rationale for their objectives and methodologies.

2. Define the focus: After establishing the necessity, we defined the specific focus of our study. Here, we determined the boundaries and scope of our investigation. To guide this step, we answered three important questions (Table 1): What is the study? Where do we gather scientific data to answer the research questions? Why is the study important? We also made decisions regarding the scope of the study:

Table 1.
Study Definition. Initial Questions to Define the Reasons for Conducting This Study, as well as Its Targets.

Question Definition

Why? Investigate different methods of building knowledge graphs from natural language texts

Where? Via the existing literature related to the area of study

What? The survey explores various approaches, algorithms, and techniques used for converting natural language texts into RDF triples

Question	Definition
Why?	Investigate different methods of building knowledge graphs from natural language texts
Where?	Via the existing literature related to the area of study
What?	The survey explores various approaches, algorithms, and techniques used for converting natural language texts into RDF triples

RDF = resource description framework.

–

We would accept any RDF triple generation method (e.g., entity recognition and relation extraction). This choice sought to embrace the diversity of methodologies prevalent in the field, ensuring a comprehensive exploration of the topic.

–

We would not restrict the survey to scientific articles by the type of natural language texts (e.g., news articles, academic papers, and social media posts). This decision allows to capture the breadth of RDF triple generation applications across different textual domains.

The motivation to review the literature on triple generation and KG enhancement derived from the lack of this type of research in the state-of-the-art based on our initial observations of result analyses. Among all the articles found on the subject of generating triples (in this study), only 10% referred to adding these triples to an already existing KG.

3. Define research questions (RQs): We defined clear and concise RQs in addition to the focus steps. These questions were tailored to address the specific aspects of RDF triple generation and KG enhancement from NL texts and are addressed through quantitative and qualitative analyses in Sections 4 and 5. Those questions were defined to ensure that our study provides meaningful insights and answers to existing challenges or gaps identified (Table 2).

Table 2.

Overview of RQs. These Focused Inquiries Form the Basis for Exploring the Surveyed Literature.

RQ	Question
RQ-01	What are the benefits and drawbacks of a method that generates RDF triples from texts?
RQ-02	Are there any patterns for texts used as input and for the triples generated as output?
RQ-03	What are the most used techniques? What are the most accurate ones?
RQ-04	Are there fully automated approaches to generate knowledge graphs from text?
RQ-05	What are the main applications that benefits from the text-to-triple approaches?
RQ-06	How do the methods explore the T-Box and A-Box in terms of text-to-triple generation?

RQs = research questions; RDF = resource description framework.

The RQs address the benefits and drawbacks of the surveyed methods, patterns in input texts and generated triples, the most utilized and accurate techniques, the presence of fully automated approaches, applications benefiting from the methodologies; and the exploration of T-Box and A-Box in text-to-triple generation. These research questions not only serve as a guide for our analyses but also function as a means to benchmark the success of our survey.

4. Define query terms: To systematically retrieve relevant literature, we defined query strings used to search scientific databases. Effective search terms are essential for retrieving a comprehensive set of articles that align with our goals. We incorporated relevant keywords, terms, and Boolean operators to refine the search. Table 10 in the Appendix presents the employed query terms. These can be condensed in a unique query, as defined in equation (1).

\begin{aligned} (generation O R extraction O R construction O R population) A N D \\ (triples O R knowledge graph O R knowledge base) A N D \\ (natural language text O R unstructured text O R textual data) \end{aligned}

(1)

The query string represents a balance between completeness and specificity. The selection of terms reflects the multifaceted nature of the field, encompassing aspects such as entity recognition, relation extraction, and knowledge representation.

The iterative process of refining and validating the query string in equation (1) involved a continuous dialogue among the research team (authors in this article), ensuring that the queries encapsulate emerging trends and diverse perspectives within the domain. The final set of search terms, detailed in Appendix Table 10, represents a distilled synthesis of our collective understanding.

5. Define scientific databases: At this step, we define in which scientific databases we could execute the queries listed in the previous step. We chose databases most likely to contain relevant articles related to Computer Science, including the domains of Text classification/Mining and Semantic Web. The chosen databases were ACM DL,¹ IEEE Xplore,² Springer Link,³ Elsevier,⁴ Scopus,⁵ and ACL Anthology.⁶

We assume that these databases contain highly ranked and cited articles relevant to our study. The chosen databases collectively offer a broad spectrum of publications, including conference proceedings and journals, ensuring a sound exploration of the academic landscape.

6. Define inclusion and exclusion criteria: In this last step, we defined rigorous inclusion and exclusion criteria to determine which articles should be included in the survey. Inclusion criteria (Table 3) specify the characteristics an article must possess to be considered for the survey, whereas exclusion criteria (Table 4) identify reasons for excluding articles.

Table 3.

Inclusion Criteria (IC). Articles Must Present the Characteristics Here Defined to be Included in the Study.

EC	Type	Definition
IC-01	Application	Book chapters, conference papers (full and short articles), journals, and thesis
IC-02	Application	Papers of type as listed at IC-01 that create, use, or theoretically define a way to generate triples from any kind of textual data
IC-03	Language	Articles written in English
IC-04	Scope	Articles that uses the RDF specification to construct the triples

RDF = resource description framework.

Table 4.

Exclusion Criteria (ECs). Articles Wwith Such Characteristics Are Not Included in the Study.

EC	Type	Definition
EC-01	Date	Articles 10 years older than the execution of the queries in August 2023
EC-02	Duplicate	Articles’ duplicate found in the scientific databases
EC-03	Short articles	Articles with less than four pages
EC-04	Abstract	Articles without abstract
EC-05	Language	Articles not written in English
EC-06	Application	Articles that just mention a tool that generates triples from text, not explaining how and why it was used

In summary, we included articles that scientifically (IC-01) contribute to the understanding of RDF triple generation from NL texts (IC-02), written in English (IC-03) and that uses RDF specification (IC-04). By setting these standards, we sought to ensure that the articles chosen were not only recent but also provided comprehensive insights into the current state-of-the-art. We excluded articles that are too old (EC-01), duplicates (EC-02), short (EC-03), without abstract (EC-04), not written in English (EC-05), and without further explanation on how generating triples based on text (EC-06).

3.2. Execution Phase

7. Retrieve the papers: In the first step of the execution phase, we retrieved the relevant articles based on the query strings defined in the preparation phase. We collected all articles that matched the search criteria in the predefined databases. The title and abstract of each article were read for criteria verification. We retrieved a total of $150$ articles from our search queries. By relying on the query strings built in the Preparation Phase, we ensured that the retrieved articles aligned closely with our present study’s specific focus and objectives.

8. Read the papers: We read and reviewed each article systematically. This step involved an in-depth examination of the content of the selected articles which involved a collaborative effort among the researchers, leveraging diverse perspectives to interpret and extract insights from the selected papers.

While reading, we took detailed notes and highlighted key information extracted in the following steps: “Create Categories,” “Generate Metadata,” and “Describe Results.”

We employed a collaborative approach in the review process. To this end, we involved $15$ graduate students, among master’s and PhD, enrolled in the Semantic Web course (Graduate Program in Computer Science, Institute of Computing, University of Campinas, Brazil) to actively contribute to this step. Each student, possessing the ability and expertise in reading and interpreting scientific articles (topic taught in the course), analyzed 10 articles each. Their task extended beyond reading; they were invited to categorize the articles based on predefined categories outlined in the subsequent step. The authors of this study double-checked the final assessment of each article reviewed by the students to ensure their quality further.

The rationale behind involving master’s and PhD students in this collaborative review lies in many reasons. First, the students’ academic standing ensures an understanding and critical analysis necessary for interpreting the nuances present in scientific articles. Their familiarity with the Semantic Web and ontologies domain positions them as adept evaluators of the selected articles. Moreover, including these graduate students aligns with a commitment to diversity in perspectives. Drawing on their varied academic backgrounds and research interests within the Semantic Web, those students brought a breadth of insights that enriched our study’s analytical depth. The collaborative approach contributed to a shared understanding of the reviewed literature among the research team, reducing risks associated with using a possible biased categorization.

In addition to the categorization, the students identified valuable aspects of each article’s characteristics, including the availability of the data investigated in the article, the type of evaluation conducted, and an assessment of the solution’s applicability in real-world settings. These insights, derived from the students’ discerning analysis, provided a holistic view of the articles beyond their immediate contributions to RDF triple generation and KG enhancement. After the students categorized the articles, the authors of this study reviewed the provided results, correcting potential erroneous categorizations when necessary.

9. Create the categories: To organize the collected articles, we created categories that reflect the various aspects of RDF triple generation from NL texts. Aligned with the research questions and focus areas, we defined such categories based on our screening of this literature review and the authors’ background. The names were crafted to align with common terminology in the field, ensuring clarity and coherence in the categorization process.

Categorizing the articles facilitates the synthesis of information and helps identify common themes and trends in the literature. Table 5 presents the proposed 10-category set followed by a brief description.

Table 5.
Categories of the Retrieved Articles and Their Correspondent Description, According to This Study’s Focused Areas.

Category Definition

Neural network Approaches that use neural network architecture to build the triples

Multilingual Approaches that process and generate triples from and to more than one language

English specific Approaches that process and generate triples from texts written in English

Non-English Approaches that process and generate triples from texts not written in English

T-Box population Approaches that define new classes and properties

A-Box population Approaches that insert new instances

Lexical resources Approaches that use any external lexical resources (e.g., WordNet)

Specific transformation Approaches that generate triples of a single domain (e.g., biomedical ontologies)

Generic transformation Approaches that generate triples of any type of domain (e.g., DBpedia)

Link creation Approaches that create links of the newly added resources to existing well-known Linked Open Data (LOD) datasets (e.g., Wikidata)

Survey papers Papers focused on the analysis of the literature on text to triples solutions

Category	Definition
Neural network	Approaches that use neural network architecture to build the triples
Multilingual	Approaches that process and generate triples from and to more than one language
English specific	Approaches that process and generate triples from texts written in English
Non-English	Approaches that process and generate triples from texts not written in English
T-Box population	Approaches that define new classes and properties
A-Box population	Approaches that insert new instances
Lexical resources	Approaches that use any external lexical resources (e.g., WordNet)
Specific transformation	Approaches that generate triples of a single domain (e.g., biomedical ontologies)
Generic transformation	Approaches that generate triples of any type of domain (e.g., DBpedia)
Link creation	Approaches that create links of the newly added resources to existing well-known Linked Open Data (LOD) datasets (e.g., Wikidata)
Survey papers	Papers focused on the analysis of the literature on text to triples solutions

The categories for organizing the collected articles reflect a diverse landscape of methodologies in the context of RDF triple extraction and KG enhancement from NL texts. We grouped categories that describe similar aspects of the literature to improve comprehension:

–

Language specificity: represented by “Multilingual,” “English Specific,” and “Non-English” categories. The rationale for choosing this category group (along with its categories) is that understanding how language is used in the text-triple-KG transformation directly impacts solutions created and whether there is a tendency to process multilingual texts generating KGs that are also multilingual.

–

Technical methodology: represented by “Neural Network” category. The rationale for studying models based on deep learning reveals the authors’ interest in categorizing articles that follow the trend in using these techniques.

–

Ontology and KG enhancement: represented by “T-Box Population” and “A-Box Population” categories. This group of categories aimed to group articles based on how the KG enhancement stage occurs: only via instances (A-box), or also through their classes (T-Box).

–

Resource utilization: represented by “Lexical Resources” category. We understand that using external lexical resources can help create more rule-adherent triples and improve the overall quality of the final version of the KG.

–

Domain specificity: represented by “Specific Transformation” and “Generic Transformation” categories. This group of categories concerns understanding whether there is a preference for enhancing KGs based on text from a specific domain, such as biomedical, with a large set of ontologies (e.g., BioPortal,⁷ ) or whether it is not related to any specific domain.

10. Generate metadata: Metadata generation involves extracting essential data from each article to create a structured dataset, described in Table 6, for analysis. The collected metadata is further used in Section 4, generating relevant quantitative analyses.

Table 6.

List of All Metadata Used in the Survey, Their Types, and Descriptions.

Metadata	Type	Description
Year of publication	Quantitative	Respecting 10-year-old exclusion criteria from Table 4
Country	Quantitative	Nationality of the authors
Category	Quantitative	values found in Table 5
Objective	Quantitative	Three possible values: extract knowledge from text; create KG;and connect to KG with existing ontology rules
Degree of automatism	Quantitative	Three possible values: manual, automatic, and semi-automatic
Methodology	Qualitative	Article’s methodology
Domain	Quantitative	Ten possible values: health, technology, education, language, agriculture, commerce, arts, government, financial, and no domain
Used in real-world	Quantitative	Article generates a product used in real world (e.g., software) or not
Dataset available	Quantitative	Article generates a dataset publicly available or not
Evaluation type	Quantitative	Five possible values: quantitative, qualitative, user evaluation, gold standard comparison, and case study
Strengths	Qualitative	Strong points of the methodology, results, and evaluation
Weaknesses	Qualitative	Weak points of the methodology, results, and evaluation
Validation risks	Qualitative	Potential threats or problems that could compromise the validity and reliability of the results
Open challenges	Qualitative	Research challenges that are still open according to the article

11. Filter the results: As part of our systematic review process, we retrieved a comprehensive set of articles using well-defined inclusion and exclusion criteria. These criteria were intentionally broad to ensure high recall, allowing us to capture the full spectrum of research on generating RDF triples from natural language texts. At this stage, we prioritized inclusiveness to avoid overlooking potentially relevant work.

Following a detailed full-text analysis, we identified a smaller subset of studies that directly addressed our core research focus—namely, the generation of RDF triples from natural language and their integration into existing KGs. Although related to triple generation, most of the initially retrieved articles centered on constructing new KGs rather than updating existing ones. These were therefore not included in the in-depth methodological review presented in Section 5.

Nonetheless, the full collection of articles remains essential to our study. Section 4 presents a broader statistical and quantitative examination of the field, exploring aspects such as publication trends, venues, datasets, evaluation approaches, and the types of natural language inputs considered. This broader analysis provides important context and supports our efforts to understand the field holistically. It also plays a critical role in addressing RQ-05, as many applications are relevant regardless of whether the focus is on building a new KG or updating an existing one. Examining the entire dataset enables us to map out the real-world domains and use cases that drive research in this area, helping to identify gaps and opportunities.

Although several prior surveys have reviewed techniques for generating RDF triples from text (Masoud et al., 2021; Nismi Mol & Santosh Kumar, 2023; Suchanek & Weikum, 2013b), our study occupies a distinct space in the literature. To our knowledge, there has been no focused review examining methods for generating RDF triples that conform to ontological constraints and are suitable for insertion into pre-existing KGs. In this way, our work contributes a unique perspective that advances the understanding of RDF triple generation within this context.

In summary, the broader set of papers forms the foundation for a wide-ranging analysis of the field, while the focused subset enables a more detailed evaluation of approaches directly relevant to knowledge graph updates. These two analytical layers—one broad, one deep—are complementary and offer a well-rounded view of the current research landscape.

12. Describe the results: We described the results of our literature review based on the generated metadata (cf. Section 4) and categorized articles (cf. Section 5). We summarized the key findings and insights from each category, highlighting trends, challenges, and advancements in the context of this survey.

3.3. Reporting Phase

13. Report analyses: We reported analyses conducted on the collected data, creating quantitative insights into trends and patterns within the literature. We analyzed publication trends, research challenge distribution, and the prevalence of specific keywords. We present the findings using graphs, charts, and tables in Section 4.

14. Answer RQs: The first crucial step was systematically answering the RQs formulated in the preparation phase. Based on the findings from the execution phase, we provided clear and concise responses to each RQ (cf. Sections 4 and 5). We referenced the relevant articles and their key findings to support the answers. We ensured that the responses aligned with our study’s scope and provided insights into the state of the field. All RQs addressed in this study were answered exclusively through the systematic screening and analysis of the selected papers, without incorporating additional methods such as stakeholder interviews or expert consultations.

15. Report open challenges: We identified and reported open challenges and areas of future research within our study scope (cf. Section 6). We summarized the limitations and gaps in the existing literature and discussed the unresolved issues, methodological shortcomings, and opportunities for further investigation.

4. Statistical Analysis

This section presents a quantitative analysis of the literature on RDF triple generation based on NL texts. We aim to gain insights into the research landscape in this domain. In this analysis, we used all $150$ articles to better understand how the state of the art dealt with the generation of triples with or without adding them to an existing KG (KG enhancement). We conducted a literature analysis based on the metadata, described in Step 10 of our methodology (cf. Figure 2 and Section 3). The list of used metadata and analyses and why we used them is as follows:

–
The number of articles per year: provides insights into the temporal evolution of research, indicating trends over time (cf. Figure 3).
–
The number of articles per category: enables an exploration of diverse approaches and methodologies, categorizing articles based on specific themes and focus areas (cf. Figure 4).
–
The number of articles by domain: facilitates a domain-specific analysis of the literature, offering a deeper understanding of how RDF triple generation varies across different knowledge domains (cf. Figure 5).
–
The number of articles by scientific database: identifies the distribution of relevant research across reputable scientific databases, revealing the popularity and impact of different platforms (cf. Table 7).
–
The number of articles by evaluation type: identifies the methodologies used to assess proposed solutions, helping to discern the rigor and validity of the research within the surveyed literature (cf. Figure 6).
–
The number of articles by the automation level: classifies articles based on the degree of human intervention involved in the proposed solutions, providing insights into the level of automation achieved (cf. Figure 7).
–
The number of articles used in real-world applications: distinguishes between papers with immediate practical implications and those whose contribution is more theoretical or conceptual, highlighting the potential impact of the proposed solutions (cf. Figure 7).
–
The number of articles that make data available: promotes transparency and reproducibility in the scientific community by identifying articles whose study handled publicly available data (cf. Figure 7).

Figure 3.
Number of articles by publication year. We observe a growing trend in the later years, especially from 2019 to 2022. The numbers related to 2023 are from January to August.

Figure 4.
Number of articles by category. Lower number of studies in the multilingual or link creation categories present an opportunity for future research. On the other hand, studies focusing on the A-Box population or English-specific are more common.

Figure 5.
Number of articles within the $10$ most popular domains. It helps to answer RQ-05. As observed, most studies are not specific to a particular domain.

Figure 6.
Number of studies by each type of evaluation. Studies evaluated by quantitative analysis (especially focusing on using gold standards) are more commonly observed in the literature.

Figure 7.
Three plots describing how automatic the solution is, how applicable it is in the real-world settings, and how research data are publicly available. Most studies present a fully automated method, do not relate to a real-world usage scenarios, and do not disclose their data publicly.

Table 7.
Number of Studies Found, According to the Scientific Database. Elsevier Scopus and IEEE Xplore Have Yielded More Relevant Studies.

Database #

ACM DL 14

IEEE Xplore 43

SpringerLink 37

Elsevier Scopus 48

ACL Anthology 7

Figure 3 presents the number of articles published annually. One may notice a growing trend, especially considering the 2019–2022 interval. Such a finding helps to ensure the relevance of our work’s subject in recent years. We further analyze and discuss our findings from this analysis in Section 6 (discussion).

Figure 4 presents the number of articles per category. We included a category for survey-related studies, totaling $10$ categories. This plot helps understanding the distribution of research efforts across various subfields within the domain. This is crucial for identifying trends and emphasizing different aspects of the topic from the existing literature.

Additionaly the distribution of articles across categories and evaluation approaches can indicate more mature areas (e.g., A-Box population, generic transformation, and English-specific) or those which require further investigation (e.g., multilingual, link creation, and non-English). In our analysis, each article may belong to one or more categories.

Figure 5 presents the most common domains in which RDF triples are extracted from NL texts for posterior KG enhancement. Most of the studies found target no specific domain. As for studies focusing on specific domains, health is the most frequent, followed by technology, and science & education. We observe a relation between the domain of NL text; the KG produced from these triples, and the application that benefits from this knowledge. Figure 5 contributes to answer RQ-05, that is, “the main applications that benefit from the text-to-triple approaches.”

Figure 6 presents the number of articles grouped by the type of evaluation they employ. We identified five distinct types of evaluation, such as quantitative analysis and case studies. This figure highlights the diversity of methodologies used in assessing the proposed solutions.

We investigated the extent to which the solutions are capable of automation which is a fundamental factor in assessing these methods’ scalability and real-world applicability. The first plot in Figure 7 categorizes articles based on how they describe their solutions’ level of “automaticity.” We classify articles into three categories: automatic, semi-automatic, and manual. The level of “automaticity” in the proposed solutions can inform us about how ready for practical deployment they are.

We analyzed studies concerned with real-world applications and discussed data availability. Both of them have implications for the practicality and reproducibility of the research. Identifying how many articles have been applied in real-world scenarios and how many make their data publicly available guide researchers in selecting the most relevant and accessible resources for their work. In the second plot in Figure 7, we explore the practical relevance of the articles by counting how many of them describe applications of proposed methods in real-world settings. In the third plot of Figure 7, we show the number of articles that handled publicly available data. By observing the numbers concerning the automation aspect in Figure 7, we can positively answer RQ-04. The number of studies in which no human intervention is required in the process is greater than those in which some level of manual interaction is required.
5. RQs Analysis

Database	#
ACM DL	14
IEEE Xplore	43
SpringerLink	37
Elsevier Scopus	48
ACL Anthology	7

This section categorizes the identified studies related to RDF triple generation from NL texts aiming to enhance KGs. The categorization of articles provides a navigational guide to help understand the area under study. Additionally, this sets the stage for discussions, paving the way for advancements in this domain.

From the $150$ articles obtained after the first screening (cf. Section 3), we narrowed the sample to $15$ after applying filters described in Step 11 of Section 3 for the key aspects considered in this study. Section 4 presents the considered categories’ descriptions.

The upcoming sections describe in further detail the categories and the studies that best fit such categories (relying on our procedure conducted for this purpose). Table 8 presents our proposed organization of categories relying on our observations from the analyzed articles. Table 9 describes the learning methods used by each paper to transform text into RDF triples, the evaluation metrics and their respective values.

Table 8.
Categories of Each Article and Their Presence in Each Work.

Group 1: LS Group 2: OE Group 3: UD Group 4: TR Group 5: LR Group 6: LC

Paper English-specific Non-English Multilingual T-Box population A-Box population Specific transformation Generic transformation Neural network Lexical resources Link creation

Zhang and Nguyen (2021) X X X X X

Yu et al. (2020) X X X

Rios-Alvarado et al. (2023) X X

Stern and Sagot (2012) X X

Kertkeidkachorn and Ichise (2018) X X X

Sordo et al. (2015) X X X X

Dutta et al. (2015) X X

Rossanez et al. (2020b) X X X X X X

Lin et al. (2020) X X X

Sendyk et al. (2022) X X X X

Xu et al. (2022) X X X X

Fei et al. (2022) X X X X X

Liu et al. (2018) X X X X X X

Yan and Gao (2019) X X X

Li et al. (2020) X X

The columns represent each of the 10 categories. The rows represent each of the 15 articles. The columns are composed of five groups of categories: (1) language specificity (LS); (2) ontology and KG enhancement (OE); (3) underlying domain (UD); (4) technical resources (neural network); (5) lexical resources (LR); and (6) link creation (LC).

5.1. Language Specificity

We considered the language specificity addressed in the study. More specifically, we considered whether the method applies to English texts (cf. Section 5.1.1), other languages (cf. Section 5.1.2), or even if applicable to texts in multiple languages (cf. Section 5.1.3). Language specificity is represented by the Group 1: LS in Table 8.

Multilingual KGs created over multilingual texts provide a versatile solution by handling multiple languages within a single model, enabling broader language coverage. However, they may face challenges in achieving language-specific precision. Language-specific KGs, while resource-intensive, can offer superior accuracy and depth for individual languages. The choice between multilingual and language-specific depends on the application’s language requirements and the trade-off between broad coverage and language-specific proficiency.

5.1.1. English-Specific

Zhang and Nguyen (2021) focused on generating triples from a text in English. Their primary objective was to create KGs and connect them with ontology classes and properties. The methodology involved text preprocessing, including filtering by English, removal of special words, and capitalization. The following step was done by applying distinct relation and entity extraction methods, including rule-based, OpenNRE, and OpenIE (Fader et al., 2011). The extracted relations were inserted into a graph database, and a software tool was created to explore the results. The specific textual domain is health-related (COVID-19), and although the application has not been used in real-world settings, the dataset is public. The conducted evaluations are quantitative. The solution’s strengths lie in its automation, independence from domain experts, and generalizability. In turn, the solution may yield limited results because it sometimes employs models trained in unrelated domains. Open challenges include using advanced extraction techniques like BERT (Devlin et al., 2019) and connecting to external vocabularies for additional information, such as DBpedia (Auer et al., 2007).

Similarly, Yu et al. (2020) focused on generating triples from texts in English. Their primary objective was establishing connections with classes and properties in an existing ontology. The methodology lies in constructing a network encyclopedia classification system and KG. Domain experts defined the classification system as a KG with concepts and instances. A web crawler algorithm extracted classification information, capturing upper and lower concept relations. Co-occurrence analysis mined implicit associations between concepts, facilitating the extraction of upper and lower relations during KG construction. Their algorithm quantifies co-occurrence probabilities based on the occurrence frequency of classifications, identifying relations if certain conditions are met. Their solution utilized classification data from encyclopedia entries to ensure a reliable and rich KG, showing the interplay between web crawling, co-occurrence analysis, and semantic analysis in constructing a comprehensive classification system. Yu et al. (2020) specifically constructed a KG related to food. Their method was applied in a project using a public dataset. The conducted evaluation was quantitative. The solution’s strengths lie in its comprehensive approach to relation extraction in a domain KG, through various knowledge sources, including structured, semi-structured, and unstructured data. Their method employed a convolutional residual network for extracting lower relations from web texts and stored the resulting KG in Neo4j,⁸ showing practical applicability. Concerning their investigation weaknesses, the solution presented dependencies on available knowledge sources, and its effectiveness is contingent on the quality of training data, which is highly sensitive to extraction errors. Risks to the validation included the potential imperfections and errors in the Wikipedia data used for experiments and the necessity for parameter tuning in NLP and semantic analysis algorithms. Open challenges involved developing different relation extraction models for diverse forms of knowledge existence.

5.1.2. Non-English

Rios-Alvarado et al. (2023) investigated triple generation from documents written in Spanish. Their main goal was knowledge extraction from texts, the KG creation, and the connection with classes and properties from an existing ontology. The methodology involved text segmentation using NLP, word tagging, knowledge extraction through lexical analysis, and KG construction. The application domain was technology. Evaluations included quantitative analysis and comparison with a gold standard. The strength of this solution appears in enabling knowledge extraction from Spanish language text, which is not common in the literature. Potential weaknesses included the manual validation of extracted entities using a small sample, which can impact accuracy and reliability. Open challenges included exploring alternative methods for feature extraction and implementing the KG for question-answering applications.

The study by Stern and Sagot work (Stern & Sagot, 2012) belongs to the non-English category as they focused on extracting entities from French texts. The methodology employed NER with the support of well-known databases, such as Wikipedia,⁹ Aleda (Sagot & Stern, 2012), and Geonames,¹⁰ specifically tailored for news-related content. The automated process links these entities to ontologies and assigns URIs, subsequently compiling a database containing all identified entities and their occurrences in news articles. The solution’s strengths lie in its reliance on widely acknowledged databases for NER. Their work addressed texts in French exclusively. The investigation publicly evaluated its methodology through quantitative measures and comparison with a Gold Standard, emphasizing transparency in its assessment.

Table 9.
Learning Value and Evaluation Metrics of Each Article.

Paper Learning method ACC PRE REC F1 QLI

Zhang and Nguyen (2021) Basic pattern rules and neural RE 0.80

Yu et al. (2020) RE with GloVe, CNN and transformer 0.78

Rios-Alvarado et al. (2023) NER with lexical patterns identifier 0.53 0.67 0.59

Stern and Sagot (2012) EL and NER of GeoNames and Wikipedia 0.88 0.85 0.77 0.81

Kertkeidkachorn and Ichise (2018) Rule-based and similarity-based on OpenIE 0.46 0.56 0.51

Sordo et al. (2015) RE with dependency trees analysis, NER with DBpedia spotlight 0.74 0.72 X

Dutta et al. (2015) Markov clustering, IM with DBpedia and OpenIE 0.86 X

Rossanez et al. (2020b) SRL for main relations and DP for secondary relations X

Lin et al. (2020) RE using OpenIE and context knowledge 0.56 0.56 0.56

Sendyk et al. (2022) BERT and transformer to merge triples 0.98 0.99 0.99 X

Sendyk et al. (2022) BERT and transformer to acquire triples 0.98 0.98 0.99

Sendyk et al. (2022) BERT and transformer to classify triples 0.81 0.86 0.81

Xu et al. (2022) NER with BERT-CRF and RE with bootstrapping

Fei et al. (2022) Perspective transfer network, a transformer-based neural network 0.89

Liu et al. (2018) Encoder-decoder transformer-based framework 0.84

Yan and Gao (2019) Learning method not specified X

Li et al. (2020) Phrase mining with random forest classifier and BERT models 0.88

Li et al. (2020) NER with BERT models and dictionary knowledge 0.81

Paper	Learning method	ACC	PRE	REC	F1	QLI
Zhang and Nguyen (2021)	Basic pattern rules and neural RE	0.80
Yu et al. (2020)	RE with GloVe, CNN and transformer		0.78
Rios-Alvarado et al. (2023)	NER with lexical patterns identifier		0.53	0.67	0.59
Stern and Sagot (2012)	EL and NER of GeoNames and Wikipedia	0.88	0.85	0.77	0.81
Kertkeidkachorn and Ichise (2018)	Rule-based and similarity-based on OpenIE		0.46	0.56	0.51
Sordo et al. (2015)	RE with dependency trees analysis, NER with DBpedia spotlight		0.74	0.72		X
Dutta et al. (2015)	Markov clustering, IM with DBpedia and OpenIE		0.86			X
Rossanez et al. (2020b)	SRL for main relations and DP for secondary relations					X
Lin et al. (2020)	RE using OpenIE and context knowledge		0.56	0.56	0.56
Sendyk et al. (2022)	BERT and transformer to merge triples	0.98	0.99		0.99	X
Sendyk et al. (2022)	BERT and transformer to acquire triples	0.98	0.98		0.99
Sendyk et al. (2022)	BERT and transformer to classify triples	0.81	0.86		0.81
Xu et al. (2022)	NER with BERT-CRF and RE with bootstrapping
Fei et al. (2022)	Perspective transfer network, a transformer-based neural network				0.89
Liu et al. (2018)	Encoder-decoder transformer-based framework				0.84
Yan and Gao (2019)	Learning method not specified					X
Li et al. (2020)	Phrase mining with random forest classifier and BERT models		0.88
Li et al. (2020)	NER with BERT models and dictionary knowledge		0.81

The first shows the articles; the second column is the main method used to generate triples from a text; the following four columns present quantitative values: accuracy (ACC), precision (PRE), recall (REC), and F1. The last column shows if the authors perform a qualitative evaluation (QLI). The rows represent the papers. RE stands for relation extraction; NER stands for named entity recognition; OpenIE stands for open information extraction; IM stands for instance matching; SRL stands for semantic role labeling; DP stands for dependency parsing.

5.1.3. Multilingual

Kertkeidkachorn and Ichise (2018) constructed a multilingual framework to map predicates from NL texts to KG triples, named T2KG. The methodology combined rule-based and similarity-based approaches and includes five steps: (1) entity mapping, linking entities in the text to corresponding entities in the KG; (2) coreference resolution, detecting chains of entities and pronouns referring to the same entity; (3) triple extraction, extracting relation triples from the text using open information extraction techniques; (4) triple integration, generating a text triple by combining results from entity mapping, coreference resolution, and triple extraction; and (5) predicate mapping, mapping a predicate of a text triple to a predefined predicate in other KGs. Their solution extracts information from textual documents and aims to integrate it into a KG. The goal was to incorporate new information into preexisting knowledge structures (e.g., ontologies and KGs). The solution applies to various domains. The authors report achieving high precision, recall, and F1 scores, effectively mapping their generated triples to DBpedia’s, and integrating significant new knowledge into the existing KG. The major drawback is the difficulty in mapping complex predicates. Validation risks include potential errors in graph generation depending on the input data. Open challenges involve improving triple extraction accuracy, handling complex predicates, integrating multiple sources of information, and developing effective methods for assessing KG quality, completeness, correctness, and consistency, as well as error identification and correction.

5.2. Ontology and KG Enhancement

In this category, we investigate whether the study comprises the triple generation from NL texts either enhances the Ontology that describes an existing KG, that is, populating T-Box (cf. Section 5.2.1), or enhancing the instances portion of the knowledge representation, that is, populating A-Box (cf. Section 5.2.2). The ontology and KG enhancement are represented by Group 2: OE in Table 8.

5.2.1. T-Box Population

Sordo et al. (2015) focused on extracting knowledge from unstructured text sources to generate structured data for music recommendations. The methodology involved identifying relevant entities in texts, extracting meaningful relations between them, and connecting this knowledge to existing ontology classes and properties. The methodology of the knowledge graph construction comprised several key steps, beginning with text input preprocessing, segmenting the input text into sentences, and tokenizing. This work employed NER using DBpedia Spotlight (Mendes et al., 2011) to identify music-related entities, focusing on types like song, band, person, album, and music genre. Simultaneously, dependency parsing (DP) generates trees for sentences, aiming to find relations between multi-word music-related entities. These processes were integrated in a subsequent step, combining NER and DP results by merging nodes in the dependency tree corresponding to recognized entities. The subsequent stages involved relation extraction (RE), in which relations between recognized music-related entities are extracted based on paths in the dependency trees. Empirical rules were introduced to filter out relations irrelevant linguistically. Finally, a graph representation was constructed, encapsulating the music-related entities as nodes and their relations as edges. This graph, composed of five distinct entity types, provided a structured representation of the knowledge extracted from the input text, facilitating a comprehensive understanding of relations within the music domain.

The domain of their work was in the arts, particularly music. The application of their work relates to a real-world project, and the dataset used was publicly available. Evaluations include quantitative and qualitative analyses, user evaluations, and comparisons with a gold standard. The solution’s strengths include using NL for user recommendations, which enhances the user experience, and comparing the extracted knowledge with a gold standard to assess the quality of the relations. Its weaknesses include low recall in extracting relations, potentially leading to the loss of relevant information, and the need for a prior syntactic simplification step to handle potentially noisy relations. Potential validation risks include limited generalization due to a single dataset and the possibility of noisy relations extracted from text variability. Open challenges relates to improving relation extraction system recall and evaluating the method on larger, more diverse datasets for generalization assessment.

5.2.2. A-Box Population

Dutta et al. (2015) generated KGs and connected them to classes and properties of an existing ontology. The methodology involved data clustering before mapping the relations between phrases and clusters. The employed methodology concerns constructing a KG by converting open information extraction (OIE) (Fader et al., 2011) facts into assertions within a target knowledge base (KB). The process involved four key components: instance matching (IM), lookup (LU), clustering (CL), and property mapping (PM) modules. The domain of this work referred to language, and although it was not used in a real-world setting, the dataset is publicly available. Their investigation described an evaluation protocol that includes quantitative assessments and comparisons with a gold standard. One of their solution’s strengths is its focus on simplifying the mapping process for knowledge bases, with clustering being a valuable addition. No weaknesses were identified explicitly, but potential validation risks include dealing with identical phrases that might have different meanings. An open challenge is creating a T-box when certain relations present in OIE do not exist in the target knowledge base.

5.3. Underlying Domain

In this category, we considered studies that are either applicable to a specific domain, for example, biomedical (cf. Section 5.3.1), or those which are domain-agnostic (cf. Section 5.3.2). Domain specificity is represented by Group 3: UD in Table 8.

5.3.1. Specific Transformation

Specific transformation refers to methodologies for generating triples for a specific domain. The intersection emphasizes the alignment of domain-specific approaches with ontology construction, highlighting how these methodologies often tailor the generation process to the intricacies of a particular domain.

Rossanez et al. (2020b) presented a specialized method for generating KGs specifically in the biomedical domain. The methodology comprised four main steps: preprocessing, triple extraction, ontology linking, and graph generation. The focus was on extracting knowledge from biomedical texts, connecting it to existing ontologies, and ultimately generating a KG. While the application has not been used in a real-world application, the dataset used is public, and the evaluations included quantitative and qualitative assessments. The strengths of their solution include proposing a semi-automatic method for KG generation and relating it to existing ontologies. The method identifies primary and secondary relations. Validation relies on a ground truth defined by medical experts, albeit not domain specialists. The results show promise, especially regarding the Jaccard coefficient. Weaknesses include the method not being entirely automatic, domain limitations due to the vast internal vocabulary of various medical subfields, and the relatively small number of samples (e.g., articles in the medical domain) evaluated by medical experts. Identified potential validation risks include the involvement of non-specialist medical experts, which might have introduced bias, and the use of abstracts rather than full-text articles, which hinders the assessment of the generalizability of the results. Open challenges included the development of an automatic approach for generating RDF triples more akin to those created by domain specialists, leveraging logical inferences to capture implicit relations in texts, exploring the use of other KGs and ontologies to enrich the main set of triples, comparing knowledge graphs linked to different ontologies using UMLS CUIs (Bodenreider, 2004), and involving more domain-specific experts to establish a baseline for comparison.

5.3.2. Generic Transformation

Generic transformation represents methodologies that generate triples agnostic to a specific domain. This intersection shows how even domain-agnostic approaches often involve ontology construction, showcasing the relevance of aligning the generated triples with a predefined ontology.

Lin et al. (2020) introduced a method that generates triples across various domains. They aimed to extract knowledge from a text by generating KGs and their connection with existing ontology classes and properties. Their methodology included two key stages: knowledge extraction and knowledge linking. The first stage extracts information from documents, including entities and triples. The second stage constructs a graph from this data, and the linkage between entities and predicates is determined using similarity measures. Notably, their study was applied in real projects and leverages public datasets, making it suitable for generalized application. Their evaluation involved quantitative analysis and comparison with a gold standard. Lin et al. (2020) solution’s strengths included the population of incomplete and outdated knowledge bases, extraction and organization of information from unstructured documents, utilization of a semantic graph approach, efficient integration of entities and relations, and effectiveness demonstration compared with other reference techniques. Their study also handled the coherent creation of new entities. However, the proposed method may face limitations in scenarios with very large or complex datasets, and its linking efficacy can depend on the quality of reference KB data. Challenges include dealing with ambiguity and polysemy in extracted information, potential biases in datasets, erroneous input data, and matching failures between entities and relations in reference datasets. Open challenges involve improving entity and relation linking in lengthy and complex documents, adapting the method for different domains and languages, exploring techniques for handling noisy and ambiguous data, and investigating the scalability of the method for larger datasets.

“A Task-Agnostic Machine Learning Framework for Dynamic Knowledge Graphs,” authored by Sendyk et al. (2022), falls under the category of generalized transformations. Their study described the development of a generic framework capable of generating triples in any domain. Their methodology involved a series of steps, including web data extraction, training NLP models based on synthetic data, page classification, sentence segmentation, and graph generation. Notably, the framework was designed for a specific domain, allowing its application across various fields. While their defined process is semi-automatic, incorporating manual classification, the study addressed potential biases by implementing steps to avoid user biases during classification and introducing synthetic data to enhance the available dataset. The solution’s strengths lie in its attempt to mitigate user biases and augment data availability through synthetic data. Challenges include the need for a robust manual classification step and potential risks associated with biased classification and imbalanced base texts. Validation aspects included the framework assessment using quantitative measures and a case study, showing its real-world applicability and emphasizing the need for well-defined manual classification steps in open challenges.

5.4. Neural Networks

This category represents methodologies that uses neural network architecture for triple generation.

Transformers (Balabin et al., 2022) are neural network models that rely on the attention mechanism to draw global dependencies between their inputs and outputs. They are often used in text-to-text applications, such as translation. In the literature, several studies employ such models to identify entities and relations from text to build triples.

Graph neural networks (GNNs) (Scarselli et al., 2008) are used in relation extraction tasks and knowledge representation. We found no articles in the literature that describe solutions using GNNs for the complete pipeline of transforming text into triples and adding them to an existing KG.

Xu et al. (2022) described how they used BERT-based and bootstrapping methods to construct a constantly evolving KG in the domain of Traditional Chinese Medicine (TCM). BERT is a transformer-based deep learning model by Google used in various NLP tasks (Devlin et al., 2019). The authors state that the KGs in the TCM domain are static, not fully representing the evolving characteristics of the medicine domain. To overcome this problem, they proposed a methodology that generates a dynamic growth of the proposed KG. It starts by collecting data based on user input keywords and then employs the BERT-CRF (Wu & He, 2019) to identify entities and Bootstrapping to identify relations and obtain structured data. Finally, the structured data was integrated into an existing KG. Bootstrapping is applied to extract the relation between symptoms of diseases and their causes. The entity recognition and relation extraction result is merged into an existing KG. While the method allows the continuous and dynamic growth of the TCM KG through user interactions, its main difficulty lies in merging KGs, especially in handling equivalent entities and term ambiguity. Also, the authors Xu et al. (2022) do not describe how to assess the qualitative aspects of the added information to the KG. Additionally, the methodology was only applicable to English-based knowledge and the TCM domain (Xu et al., 2022).

Fei et al. (2022) relied on few-shot relational triple extraction (RTE) to construct triples. The authors state that traditional triple-construction approaches are not aware of the semantics and coherence of the generated triples. The proposed methodology, perspective-transfer network (PTN), included a multi-perspective approach to constructing a KG. It operates on episodes comprising support and query sets. The framework designed by Fei et al. (2022) used three perspectives: relation, entity, and triple. In the relation perspective, the query detects potential relations by marking entity pairs sentences. A binary classifier predicts if a relation exists among them. If a relation is identified, the entity perspective extracts entity spans in the query, combining relation and entity information into triples. The triple perspective then validates these triples, utilizing labeled query sentences and a binary classifier. Although it does not have a specific domain, the solution can be applied generally. It has not been used in a real project, but the dataset is public, and the evaluations involve quantitative analysis and a comparison with the gold standard. The main strength of their solution (Fei et al., 2022) is the ability to handle a few training examples, which is a common limitation in many NLP tasks. Additionally, their study explore the utilization of an efficient and scalable neural network architecture that can be trained on modern GPUs. Also, the solution is fully automatic and does not require human intervention. As a downside, the solution may be computationally intensive, requiring substantial hardware resources. Its performance can be sensitive to the quality of the input data, mainly affected by annotation errors and data noise (Fei et al., 2022).

Liu et al. (2018) proposed an application named Seq2RDF, which uses the transformer architecture to construct triples. Seq2RDF was one of the first applications of transformers to build KGs, in 2018. The methodology involves applying transformers and generating embeddings for triple generation. It uses DBpedia (Auer et al., 2007) as input to train models, with the encoder processing NL sentences and the decoder producing triples in the <subject, predicate, object> format. Their solution (Liu et al., 2018) is generic and not limited to a single domain. It was applied to a real-world project, but their dataset is public. Evaluations included quantitative analysis and comparison with a gold standard. The authors state that the differential of Seq2RDF is its simplicity and efficiency for generating triples from NL texts. The downside of Seq2RDF remains that it can only generate a single triple per sentence.

5.5. Lexical Resources

External lexical resources, such as WordNet (Miller, 1995), PropBank (Kingsbury & Palmer, 2003), or VerbNet (Palmer & Kipper, 2004), have been explored to assist RDF triple generation. Examples of their application include identifying verbs and their parameters in sentences as candidates for subjects or objects. Resources like Yago (Suchanek et al., 2007), BabelNet (Navigli & Ponzetto, 2012), or SpaCy (Honnibal & Montani, 2017), are often employed for identifying named entities in texts. This category emphasizes the reliance on linguistic resources in language-specific contexts, showcasing how the utilization of such resources plays a crucial role in these methodologies.

Yan and Gao (2019) fits this category. Using the Baidu Encyclopedia to aid in the KG construction. Their main objective was to generate KGs and connect them to existing ontology classes and properties, focusing on the domain of biology, specifically water. Their methodology involved information extraction, knowledge fusion, and knowledge processing. Information extraction encompasses extracting entities (concepts), attributes, and relations between entities from the data source, forming the foundation for ontological knowledge representation. After acquiring new knowledge, the following step, called knowledge fusion, integrates this new knowledge to eliminate contradictions and ambiguities. The method was applied in a project and the used dataset is public. The conducted evaluation (Yan & Gao, 2019) was based on a case study. The solution’s strengths lie in customizing the wrapper for extracting entity attributes and values from semi-structured water entry data in the encyclopedia. On the other hand, its weakness lies in the dependence on data sources, as the quality and quantity of data can impact the comprehensiveness of content extraction. The main validation risk is the lack of comparisons with other methods. Open challenges involve expanding the scope of extraction to acquire more knowledge and conducting further research to identify potentially better methods for KG construction in the water domain.

5.6. Link Creation

This category focuses on methodologies that create links to existing well-known Linked Open Data (LOD) datasets. LODs often adhere to standardized ontologies and vocabularies, such as RDF and OWL. This adherence ensures consistency and interoperability between different datasets, reducing ambiguity and enhancing the overall quality of information in the KG.

“AliMe KG: Domain Knowledge Graph Construction and Application in E-commerce,” by Li et al. (2020) was categorized under link creation due to its primary focus on connecting newly created triples with other existing datasets. Their proposal involved key components, such as phrase mining, named entity recognition, relation extraction, and knowledge fusion. The authors described a semi-automated process for knowledge acquisition and validation, incorporating human annotation and feedback cycles to enhance KG precision and completeness. Their study (Li et al., 2020) was applied in the e-commerce domain and showcases real-world applications of the AliMe KG in pre-sales conversation scenarios. The evaluation combines both quantitative and qualitative assessments. The solution’s strengths include providing a systematic methodology with semi-automated processes for mining structured knowledge from NL texts applicable to multiple languages and domains. Identified challenges include the significant need for human annotations and feedback cycles, which can be time-consuming and expensive. The solution also relies on external knowledge sources, introducing potential issues of reliability and currency, leading to biases or errors in the KG. Risks to validation involve biases in data sources and potential inaccuracies in external knowledge, impacting the representation and generalization of results. Open challenges include expanding the AliMe KG coverage in the Alibaba e-commerce platform and exploring its application in various domains beyond e-commerce.

6. Open Challenges

This section describes key findings (cf. Section 6.1) and common themes and revisits our research questions (cf. Section 6.2). We discuss open research challenges and potential paths for future investigations (cf. Section 6.3).

6.1. Findings and Limitations

Publication years analysis (Figure 3) reveals a growing trend, especially from 2019 to 2022, which suggests an increasing interest in and importance of the surveyed topic and justifies the relevance of our study. We believe that the numbers for 2023 did not follow this trend because many accepted articles have not yet entered the scientific databases (Table 7). In 2024, data related to 2023 will be better consolidated. Given the increasing adoption of generative models for various NL processing tasks, including those related to KGs, we expect an accelerated growth in the coming years. The increasing adoption of RAG (retrieval augmented generation) (Lewis et al., 2020) indicates that KGs can and should be used with LLMs for information retrieval tasks.

Study categorization served as a venue for understanding the diverse methodologies employed in generating RDF triples from NL texts and enhancing KGs. Each category addresses specific aspects, such as language specificity, ontology and KG enhancement, domain specificity, use of neural networks, lexical resources, and link creation.

Clustering of articles across categories (Figure 4) provides a comprehensive overview of research efforts. These results align with the subset of $15$ articles described in Section 5. Our findings highlight some facts that warrant reflection. Despite thousands of spoken and computationally represented languages, English still dominates as the language used for NL texts and the KGs resulting from text processing. A more in-depth future work is needed to identify how non-English languages use KGs generated in English for information representation.

Regarding transformation and domain categories (Section 5), we identified the proximity between specific single-domain transformations and generic transformations across multiple domains. WCreating domain-specific KGs can benefit companies, government agencies, or any other entities interested in using specific KGs, given that their maintenance involves domain experts to ensure information consistency and less maintenance. On the other hand, KGs spanning multiple domains may face maintenance and accelerated growth challenges due to the multiplicity of domains and, consequently, the continuous creation of RDF triples. As such, we believe that, for example, creating a specific and local KG about the health sector—the domain with the most articles based on our results (cf. Figure 5)—offers more benefits in terms of creation and maintenance than a domainless KG.

The less represented categories but of great relevance to this research are multilingual and link creation. More solutions should somehow support the creation of links with other ontologies or existing KGs. The rationale is that such solutions would facilitate information reuse, connecting nodes from local KGs with larger and more established KGs. In this sense, they would contribute to the LOD movement, transforming a Web with some disconnected data islands into an interconnected archipelago of data (Regino et al., 2021).

Article distribution based on evaluation types (Figure 6) highlights the methodological diversity in assessing existing proposed solutions. Quantitative methods were widely employed since this type of assessment can use a large set of texts and triples as input, which facilitates adapting the methodology and refactoring tests. With a large dataset, quantitative analysis enables researchers to iteratively adjust their methodologies and improve testing procedures, ensuring a more comprehensive and informed evaluation of proposed solutions.

Figure 7 shows that most of the $150$ articles present automatic or semi-automatic solutions, which, to some extent, streamlines the RDF triple production and improvements in existing KGs. As such, quantitative methods dealing with large amounts of data are needed for an overall evaluation of the results of these solutions. However, qualitative methods are valuable allies that help to identify semantically incoherent RDF triples. Even though the triples are generated in the <subject, predicate, object> pattern and can be connected to an existing KG, such triples may be redundant, repeated, or inconsistent with those in the KG. Qualitative methods can identify such issues. Despite the potential limitations in volume and speed associated with evaluation assessments using quantitative methods, we understand that combining both quantitative and qualitative approaches is crucial for generating more consistent and comprehensive evaluations. This is particularly important as it allows for a multi-faceted assessment of results from different perspectives.

Several proposed solutions used a gold standard to evaluate the generated triples. We found no standard used by the $15$ studies described in Section 5. Constructing a “generic gold standard” to evaluate the generation of KGs from texts would be a valuable asset, helping future solutions to be compared using a unified evaluation basis.

The surveyed articles across various categories exhibit several limitations that provide opportunities for future research and refinement of NL-based KG generation methodologies. First, the challenge of domain specificity. Many studies, particularly those in the specific transformation category, struggle with the adaptability of their approaches to diverse domains. Biomedical applications, for example, present specific requirements that may not be addressed adequately by generic NL-based KG generation methods. Moreover, several articles lack extensive evaluation in real-world scenarios, relying on relatively small datasets or limited case studies. This hinders a comprehensive understanding of the scalability and robustness of the proposed solutions, making it crucial for future work to address this gap by validating solutions in diverse and realistic settings.

Second, while transformer-based approaches present remarkable capabilities, as shown in the neural network category, they are not without limitations. Many studies employing neural network architectures focus on English-centric applications, raising concerns about the generalizability of these methods to non-English languages. Effectiveness of these neural network models can be contingent on the availability of large and high-quality training datasets, limiting their applicability to less-resourced languages. Additionally, the computational intensity associated with neural networks is a common challenge, potentially restricting their deployment in resource-constrained environments. These limitations demand research efforts to enhance the multilingual applicability, dataset diversity, and computational efficiency of transformer-based KG generation methods to foster broader advancements.

Third, Figure 7 indicates that fewer articles explicitly mention real-world applications, suggesting a potential gap between research and practical applications. The availability of publicly accessible data remains limited, impacting the reproducibility and accessibility of the proposed studies.

6.2. Answer Summary of RQs

Our literature review enables answering the defined RQs in Table 2. Each RQ was answered throughout this text, which are summarized below:

RQ-01—“What are the benefits and drawbacks of a method that generates RDF triples from texts?” As highlighted in Section 5, these approaches contribute to KG enrichment by extracting structured information from unstructured texts, fostering a more comprehensive understanding of various domains. Their versatility is evident across applications, including healthcare, e-commerce, and the Arts. Drawbacks include language dependency, which limits the applicability of some methods to specific languages, and potentially excludes valuable information from texts in other languages. Domain sensitivity is another challenge, as certain methods are tailored to specific domains, making them less adaptable across diverse knowledge areas. Dependency on external lexical resources introduces challenges related to the consistency, coverage, and real-time updates of those resources.

RQ-02—“Are there any patterns for texts used as input and for the triples generated as output?” We identified that methods preprocess texts to extract entities, relations, and contextual information, leading to diverse patterns in generated triples. Table 8 showcases the intersection of different categories, emphasizing the variety in approaches and potential patterns. Generally, texts undergo preprocessing by filtering, entity extraction, and relation extraction, resulting in patterns like <entity, relation, entity>.

RQ-03—“What are the most used techniques? What are the most accurate ones?” Most studies, especially recent ones, rely on transformers techniques in either one step of the process or the overall method (Fei et al., 2022; Liu et al., 2018; Sendyk et al., 2022; Xu et al., 2022; Yu et al., 2020). Previous research relied mostly on NLP techniques combined with rule-based approaches to identify and extract RDF triples from text (Paulheim, 2017; Schneider et al., 2022). The advent of transformers showed greater accuracy when considering those used in past research. Due to their ease of use, transformer-based solutions are becoming more popular in diverse applications (Casola et al., 2022). Based on Table 9, we note that articles related to transformers, in addition to being highly popular in recent years, also report impressive evaluation metrics, like precision, recall, F1-score, and accuracy, with values approaching 1. Although these studies do not utilize the same benchmarking datasets or evaluation methodologies, we interpret these high values as indicative of performance that surpasses the current state-of-the-art. However, a definitive conclusion regarding which method for generating triples from text achieves the highest accuracy can only be reached by adopting a unified and sufficiently comprehensive benchmarking framework.

RQ-04—“Are there fully automated approaches to generate knowledge graphs from text?” Few studies present a semi-automatic method (Dutta et al., 2015; Liu et al., 2018; Rossanez et al., 2020b; Sendyk et al., 2022; Sordo et al., 2015). Most research employs a fully automated method, that is, the KG is generated with no human intervention.

RQ-05—“What are the main applications that benefit from the text-to-triple approaches?” A portion of the studies in our survey are not currently employed in real-world applications. From the original 150 papers retrieved, only 27 (18%) were employed in live software. Of 15 papers detailed in Section 5, only five were used in real-world applications (Lin et al., 2020; Sendyk et al., 2022; Sordo et al., 2015; Yan & Gao, 2019; Yu et al., 2020). This suggests an existing gap between research advancements and practical implementation. Nonetheless, for the subset of studies that are applied in real-world scenarios, we verified a direct impact on information retrieval-based applications across diverse domains. Specifically, these applications span various sectors such as health, technology, education, and more. In healthcare, for instance, text-to-triple approaches contribute to the construction of KGs that aid in medical research, diagnosis, and treatment recommendations. In education, the application of text-to-triple techniques facilitates the creation of educational KGs, supporting adaptive learning systems and personalized content delivery.

RQ-06—“How do the methods explore the T-Box and A-Box in terms of text-to-triple generation?” We identified some relevant approaches. T-Box population methods enhance/enrich existing ontologies by extracting knowledge from unstructured texts and connecting it to ontology classes and properties. In turn, A-Box population methods focus on populating the instances portion of knowledge representation with relations derived from texts. The challenge lies in balancing both aspects for a comprehensive and accurate KG construction. As discussed in Sections 5.2.1 and 5.2.2, methods explore T-Box and A-Box differently, emphasizing the relevance of understanding and addressing ontology and instance-related aspects for effective text-to-triple generation. Furthermore, successful A-Box population hinges on robust entity linking by automatically matching mentions to existing URIs. Several of our 15 studies explicitly evaluate EL quality (Lin et al., 2020; Sendyk et al., 2022), even if they may not use the term “entity linking.”

6.3. Promising Research Directions

This section outlines research gaps and promising research directions arising from the analyses and findings described in Sections 4 and 5. We also discuss opportunities identified by us (the authors) and not identified in the reviewed articles in our study.

Multilingual NL-based KG generation: Developing methodologies that can effectively generate RDF triples from NL texts in multiple languages remains a significant challenge. The inherent linguistic variations, syntactic structures, and semantic nuances across different languages pose obstacles. Moreover, the availability of large and diverse training datasets and benchmarks (Gardent et al., 2017; Mihindukulasooriya et al., 2023) for less-resourced and less-spoken languages is limited, complicating the training of robust multilingual models.

To the best of our knowledge, no tools exist for adding triples to KG from texts written in NL exclusively for low-resource or low-comprehensive languages. We have found many tools for English Zhang and Nguyen (2021), Yu et al. (2020), Sordo et al. (2015), Dutta et al. (2015), Rossanez et al. (2020b), Spanish Rios-Alvarado et al. (2023), Xu et al. (2022), Fei et al. (2022), French Stern and Sagot (2012), multilingual Kertkeidkachorn and Ichise (2018), Lin et al. (2020), Sendyk et al. (2022), Liu et al. (2018) and other languages with many speakers (Li et al., 2020; Yan & Gao, 2019). Addressing this challenge requires innovative approaches to handle linguistic diversity, improve cross-language generalization, and explore techniques for effective knowledge transfer between languages.

Real-world applicability and scalability: NL-based KG generation methods may lack extensive validation in real-world scenarios, impacting their practical applicability and scalability. Validation methods applied to diverse domains and large-scale applications are challenging due to the complexity and variability of real-world data. Researchers face difficulties using comprehensive gold-standard datasets for various domains and ensuring their proposals can scale to handle large, dynamic datasets (Gardent et al., 2017; Mihindukulasooriya et al., 2023). Overcoming these challenges requires developing evaluation frameworks that simulate real-world conditions and exploring scalable NL processing techniques suitable for diverse application contexts.

Evaluation metrics for quality and completeness: The lack of a universal benchmark gold standard dataset for evaluating KG enhancement from NL texts remains a significant challenge. While some benchmarks, such as WebNLG (Gardent et al., 2017) and Text2KG (Mihindukulasooriya et al., 2023), are available, they are not widely or consistently adopted across studies. As shown in Table 9, many papers diverge in their choice of datasets and metrics, hindering direct comparisons between proposed methods. Existing evaluation metrics, often limited to quantitative measures like precision and recall, fail to fully capture essential aspects such as semantic accuracy, coherence, and relevance of the generated RDF triples. The subjective nature of these qualitative dimensions, combined with the lack of a unified evaluation framework, further complicates developing comprehensive and standardized metrics.

Handling ambiguity and polysemy: Ambiguity and polysemy in NL texts introduce complexities in accurately identifying entities and relations. Resolving the ambiguity arising from multiple interpretations of words or phrases and disambiguating between polysemous meanings is challenging. Existing NL-based KG generation methods often struggle in contexts where entities have diverse meanings or where words can be used in multiple contexts. Mitigating this challenge requires developing context-aware models, advanced disambiguation techniques, and strategies to incorporate contextual information for accurate entity and relation identification.

Several studies identified in our systematic literature review reported this challenge. Lin et al. (2020) reported difficulties with both ambiguity and polysemy in noisy data used as input in their method, hidering the process of relation definition between the entities to build the triples. Kertkeidkachorn and Ichise (2018) cited errors that arise with both polysemy and ambiguity, such as low NER accuracy, matching and inconsistent mappings with other KGs. Dutta et al. (2015) discussed errors in their method when dealing with identical phrases. Xu et al. (2022) reported difficulties with equivalent entities in when merging KGs.

Ethical and bias: NL-based KG generation methods are susceptible to biases present in training data, which may perpetuate biases. Addressing ethical considerations and mitigating biases in the generated knowledge is a pressing challenge. The difficulty lies in identifying and addressing implicit biases in training data, ensuring fair representation of diverse perspectives, and establishing guidelines for responsible NL-based KG generation. Creating KGs based on fairness is a topic of study that has been cited as future work in some articles within KG engineering (Chen et al., 2024; Franklin et al., 2022; Mihindukulasooriya et al., 2023), showing its importance for the scientific community. Overcoming this challenge requires interdisciplinary collaboration and ethical frameworks to promote fairness in knowledge representation.

Dynamic KG evolution: Ensuring the dynamic growth and temporal evolution of NL-based KGs, such as temporal KGs (TKGs) (Rossanez et al., 2020a) to capture real-time changes in knowledge domains is complex. Many existing methods focus on static KGs, limiting their ability to adapt to emerging information. Addressing this challenge involves designing frameworks that allow continuous knowledge acquisition, integration, and evolution while considering the computational complexity and potential risks associated with real-time updates.

User involvement: Incorporating user feedback, domain expert insights, and human-in-the-loop interactions in NL-based KG generation can be meaningful to the overall quality of the KG. Many solutions lack mechanisms for effectively involving users and domain experts in the generation process, leading to potential gaps in understanding contextual nuances. Overcoming this challenge requires developing interactive NL-based KG generation interfaces, feedback loops, and collaborative frameworks that help users contribute domain-specific knowledge, validate generated triples, and enhance the overall quality of knowledge representation.

Real-world applications: Creating KGs from texts has a wide range of practical applications. For example, in the e-commerce domain, KGs are used to enhance user experiences by providing personalized recommendations and ensuring product compatibility through semantic relations extracted from textual descriptions. In healthcare, KGs constructed from scientific articles and electronic health records facilitate identifying clinical patterns and support evidence-based decision-making. In the automotive industry, KGs derived from technical manuals and specifications enable precise queries regarding part compatibility and vehicle maintenance.

An emerging application involves integrating KGs with language models in graph-RAG (retrieval-augmented generation) systems (Dong et al., 2024). In this context, KGs serve as a structured knowledge layer, enhancing the ability of LLMs to generate more accurate and context-aware responses. For instance, in customer support systems, KGs can provide information directly related to a user’s interaction history or the company’s knowledge base, ensuring alignment with verified data.

7. Conclusion

A considerable portion of textual data remains unprocessed, representing substantial amount of information that holds the potential to provide actionable insights. This is driven by the need to develop novel methodologies and software tools capable of transforming these large volumes of unstructured text into structured, computer-interpretable knowledge. Semantic Web technologies, particularly with the core use of RDF triples and KGs, offer an approach to organize this information. This systematic literature review about studies addressing RDF triple generation from unstructured NL texts sought to enhance existing KGs. We identified the most prominent approaches in the literature for extracting RDF triples from text, especially concerning their inclusion in existing KGs. We provided a comprehensive overview of the domain pointing out the main challenges and the limitations on the current state-of-the-art on such research. Our study systematically surveyed a diverse set of $150$ articles from distinct scientific databases. From such, we identified $10$ categories and presented a detailed description of the most key studies from those, resulting in $15$ articles discussed in detail. We contributed by outlining opportunities for future research by identifying the categories, approaches, challenges, open research questions, and research gaps. Our contribution paves the way for further advancements in novel methods for the automatic RDF triple generation from texts.

Footnotes

Acknowledgments

This study was financed by the National Council for Scientific and Technological Development – Brazil (CNPq) process number 140213/2021-0. In addition, this research was partially funded by the São Paulo Research Foundation (FAPESP) (grants #2022/13694-0, #2022/15816-5, and #2024/07716-6). This work was also supported by Nederlandse Organisatie voor Wetenschappelijk Onderzoek [grant number Nwa.1332.20.002]. The authors thank Espaço da Escrita—Pró-Reitoria de Pesquisa—UNICAMP—for the language services provided. The opinions expressed in this work do not necessarily reflect those of the funding agencies.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.

ORCID iDs

André Gomes Regino

Anderson Rossanez

Ricardo da Silva Torres

Julio Cesar dos Reis

Notes

Appendix

Table 10 presents all the terms used to query the papers. It is an expanded version of equation (1).

Figure 8 shows the papers submitted by country. Figure 9 highlights the five countries with the highest contribution, namely China, India, the USA, Germany, and Canada.

Considering the institutions, Figure 10 shows the universities that contributed with at least two articles. Of 16 universities, the top five are from China.

References

Arnaout

Elbassuoni

(2018). Effective searching of RDF knowledge graphs. Journal of Web Semantics, 48, 66–84. https://doi.org/10.1016/j.websem.2017.12.001

Auer

Bizer

Kobilarov

Lehmann

Cyganiak

Ives

(2007). Dbpedia: A nucleus for a web of open data. In Proceedings of the 6th international semantic web conference (ISWC 2007) (pp. 722–735). Springer. https://doi.org/10.1007/978-3-540-76298-0_52

Balabin

Hoyt

C. T.

Birkenbihl

Gyori

B. M.

Bachman

Kodamullil

A. T.

Plöger

P. G.

Hofmann-Apitius

Domingo-Fernández

(2022). Stonkgs: A sophisticated transformer trained on biomedical text and knowledge graphs. Bioinformatics (Oxford, England), 38(6), 1648–1656. https://doi.org/10.1093/bioinformatics/btac001

Bodenreider

(2004). The unified medical language system (UMLS): Integrating biomedical terminology. Nucleic Acids Research, 32(Database issue), D267–70. https://doi.org/10.1093/nar/gkh061

Budgen

Brereton

(2006). Performing systematic literature reviews in software engineering. In Proceedings of the 28th international conference on software engineering, ICSE ’06 (pp. 1051–1052). New York, NY, USA: Association for Computing Machinery. ISBN 1595933751. https://doi.org/10.1145/1134285.1134500

Bytyçi

Ramosaj

Bytyçi

(2023). Review of automatic and semi-automatic creation of knowledge graphs from structured and unstructured data.

Candan

K. S.

Liu

Suvarna

(2001). Resource description framework: Metadata and its applications. SIGKDD Explorations: Newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining, 3(1), 6–19. https://doi.org/10.1145/507533.507536

Casola

Lauriola

Lavelli

(2022). Pre-trained transformers: An empirical comparison. Machine Learning with Applications, 9, 100334. https://doi.org/10.1016/j.mlwa.2022.100334

Chen

Rossi

R. A.

Park

Trivedi

Wang

Kim

Dernoncourt

Ahmed

N. K.

(2024). Fairness-aware graph neural networks: A survey. ACM Transactions on Knowledge Discovery from Data, 18(6), 1–23. https://doi.org/10.1145/3649142

10.

Devlin

Chang

M. W.

Lee

Toutanova

(2019). Bert: Pre-training of deep bidirectional transformers for language understanding. In North American chapter of the association for computational linguistics. https://api.semanticscholar.org/CorpusID:52967399

11.

Dong

Wang

Zheng

Chen

Zhang

Wang

(2024). Advanced rag models with graph structures: Optimizing complex knowledge reasoning and text generation. In 2024 5th international symposium on computer engineering and intelligent communications (ISCEIC) (pp. 626–630). IEEE. https://doi.org/10.1109/ISCEIC63613.2024.10810209

12.

Dutta

Meilicke

Stuckenschmidt

(2015). Enriching structured knowledge with open information. In Proceedings of the 24th international conference on world wide web, WWW ’15 (pp. 267–277). Republic and Canton of Geneva, CHE: International World Wide Web Conferences Steering Committee. ISBN 9781450334693. https://doi.org/10.1145/2736277.2741139

13.

Egger

Gokce

(2022). Natural language processing (NLP): An introduction. Cham: Springer International Publishing. ISBN 978-3-030-88389-8, pp. 307–334. https://doi.org/10.1007/978-3-030-88389-8_15

14.

Ehrlinger

Wöß

(2016). Towards a definition of knowledge graphs. In 12th international conference on semantic systems (SEMANTiCS2016) (pp. 14–17). http://ceur-ws.org/Vol-1695/paper4.pdf

15.

Fader

Soderland

Etzioni

(2011). Identifying relations for open information extraction. In Proceedings of the conference of empirical methods in natural language processing (EMNLP ’11). Edinburgh, Scotland, UK.

16.

Fei

Zeng

Zhao

Xiao

(2022). Few-shot relational triple extraction with perspective transfer network. In Proceedings of the 31st ACM international conference on information & knowledge management, CIKM ’22 (pp. 488–498). New York, NY, USA: Association for Computing Machinery. ISBN 9781450392365. https://doi.org/10.1145/3511808.3557323

17.

Franklin

J. S.

Bhanot

Ghalwash

Bennett

K. P.

McCusker

McGuinness

D. L.

(2022). An ontology for fairness metrics. In Proceedings of the 2022 AAAI/ACM conference on ai, ethics, and society (pp. 265–275). https://doi.org/10.1145/3514094.3534137

18.

Gardent

Shimorina

Narayan

Perez-Beltrachini

(2017). Creating training corpora for NLG micro-planners. In R. Barzilay & M. Y. Kan (Eds.), Proceedings of the 55th annual meeting of the association for computational linguistics (Volume 1: Long Papers) (pp. 179–188). Vancouver, Canada: Association for Computational Linguistics. https://doi.org/10.18653/v1/P17-1017

19.

Gruber

T. R.

(1995). Toward principles for the design of ontologies used for knowledge sharing. International Journal of Human-Computer Studies, 43(5), 907–928. https://doi.org/10.1006/ijhc.1995.1081

20.

Honnibal

Montani

(2017). spaCy 2: Natural language understanding with Bloom embeddings, convolutional neural networks and incremental parsing. https://doi.org/10.5281/zenodo.1212303

21.

Pan

Cambria

Marttinen

P. S.

(2022). A survey on knowledge graphs: Representation, acquisition, and applications. IEEE Transactions on Neural Networks and Learning Systems, 33(2), 494–514. https://doi.org/10.1109/tnnls.2021.3070843

22.

Kertkeidkachorn

Ichise

(2018). An automatic knowledge graph creation framework from natural language text. IEICE Transactions on Information and Systems, 101(1), 90–98. https://doi.org/10.1587/transinf.2017SWP0006

23.

Kingsbury

Palmer

(2003). Propbank: The next level of treebank. In Proceedings of treebanks and lexical theories (Vol. 3). Citeseer. https://api.semanticscholar.org/CorpusID:58251410

24.

Kríž

Hladká

Nečaskỳ

Knap

(2014). Data extraction using NLP techniques and its transformation to linked data. In Human-Inspired Computing and Its Applications: 13th Mexican International Conference on Artificial Intelligence, MICAI 2014, Tuxtla Gutiérrez, Mexico, November 16-22, 2014. Proceedings, Part I 13 (pp. 113–124). Springer. https://doi.org/10.1007/978-3-319-13647-9_13

25.

Lewis

Perez

Piktus

Petroni

Karpukhin

Goyal

Küttler

Lewis

Yih

Wt.

Rocktäschel

Riedel

Kiela

(2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems, 33, 9459–9474.

26.

F. L.

Chen

Qiu

Zhang

Chen

(2020). Alimekg: Domain knowledge graph construction and application in e-commerce. In Proceedings of the 29th ACM international conference on information & knowledge management (pp. 2581–2588). https://doi.org/10.1145/3340531.3412685

27.

Lin

Xin

Chen

(2020). Kbpearl: A knowledge base population system supported by joint entity and relation linking. Proceedings of the VLDB Endowment, 13(7), 1035–1049. https://doi.org/10.14778/3384345.3384352

28.

Liu

Zhang

Liang

McGuinness

D. L.

(2018). Seq2rdf: An end-to-end application for deriving triples from natural language text. In CEUR workshop proceedings (Vol. 2180). CEUR-WS. https://ceur-ws.org/Vol-2180/paper-37.pdf

29.

Mansouri

Affendey

L. S.

Mamat

(2008). Named entity recognition approaches. International Journal of Computer Science and Network Security, 8(2), 339–344.

30.

Masoud

Pereira

McCrae

Buitelaar

(2021). Automatic construction of knowledge graphs from text and structured data: A preliminary literature review. In 3rd Conference on Language, Data and Knowledge (LDK 2021). Schloss Dagstuhl-Leibniz-Zentrum für Informatik.

31.

Mendes

P. N.

Jakob

García-Silva

Bizer

(2011). Dbpedia spotlight: shedding light on the web of documents. In Proceedings of the 7th international conference on semantic systems, I-Semantics ’11 (pp. 1–8). New York, NY, USA: Association for Computing Machinery. ISBN 9781450306218. https://doi.org/10.1145/2063518.2063519

32.

Mihindukulasooriya

Tiwari

Enguix

C. F.

Lata

(2023). Text2kgbench: A benchmark for ontology-driven knowledge graph generation from text. In T. R. Payne, V. Presutti, G. Qi, M. Poveda-Villalón, G. Stoilos, L. Hollink, Z. Kaoudi, G. Cheng, & J. Li (Eds.), The semantic web – ISWC 2023 (pp. 247–265). Cham: Springer Nature Switzerland. ISBN 978-3-031-47243-5. https://doi.org/10.1007/978-3-031-47243-5_14

33.

Miller

G. A.

(1995). Wordnet: A lexical database for English. Communications of the ACM, 38(11), 39–41. https://doi.org/10.1145/219717.219748

34.

Möller

Lehmann

Usbeck

(2022). Survey on English entity linking on Wikidata: Datasets and approaches. Semantic Web, 13(6), 925–966. https://doi.org/10.3233/SW-212865

35.

Moussallem

Ferreira

T. C.

Zampieri

Cavalcanti

M. C.

Xexéo

G. B.

Neves

Ngomo

A. C. N.

(2018). Rdf2pt: Generating brazilian portuguese texts from RDF data. In Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018).

36.

Nasar

Jaffry

S. W.

Malik

M. K.

(2021). Named entity recognition and relation extraction: State-of-the-art. ACM Computing Surveys, 54(1), 1–39. https://doi.org/10.1145/3445965

37.

Navigli

Ponzetto

S. P.

(2012). Babelnet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence, 193, 217–250. https://doi.org/10.1016/j.artint.2012.07.001

38.

Nismi Mol

Santosh Kumar

(2023). Review on knowledge extraction from text and scope in agriculture domain. Artificial Intelligence Review, 56(5), 4403–4445. https://doi.org/10.1007/s10462-022-10239-9

39.

Palmer

Kipper

K. L.

(2004). Verbnet. The Oxford Handbook of Cognitive Science.

40.

Paulheim

(2017). Knowledge graph refinement: A survey of approaches and evaluation methods. Semantic Web, 8, 489–508. https://doi.org/10.3233/SW-160218

41.

Regino

A. G.

dos Reis

J. C.

Bonacin

Morshed

Sellis

(2021). Link maintenance for integrity in linked open data evolution: Literature survey and open challenges. Semantic Web Journal, 12(3), 517–541. https://doi.org/10.3233/SW-200398

42.

Rios-Alvarado

A. B.

Martinez-Rodriguez

J. L.

Garcia-Perez

A. G.

Guerrero-Melendez

T. Y.

Lopez-Arevalo

Gonzalez-Compean

J. L.

(2023). Exploiting lexical patterns for knowledge graph construction from unstructured text in Spanish. Complex & Intelligent Systems, 9(2), 1281–1297. https://doi.org/10.1007/s40747-022-00805-7

43.

Rossanez

dos Reis

J. C.

da Silva Torres

(2020a). Representing scientific literature evolution via temporal knowledge graphs. In 6th Managing the evolution and preservation of the data web (MEPDaW) Workshop, international semantic web conference (ISWC) (pp. 33–42). http://ceur-ws.org/Vol-2821/paper5.pdf

44.

Rossanez

Dos Reis

J. C.

de Ribaupierre

Torres

R. D. S.

(2020b). Kgen: A knowledge graph generator from biomedical scientific literature. BMC Medical Informatics and Decision Making, 20(4), 1–24. https://doi.org/10.1186/s12911-020-01341-5

45.

Sagot

Stern

(2012). Aleda, a free large-scale entity database for French. In LREC 2012 : eighth international conference on language resources and evaluation (p. 4 pages). Istanbul, Turkey. https://hal.science/hal-00699300.

46.

Scarselli

Gori

Tsoi

A. C.

Hagenbuchner

Monfardini

(2008). The graph neural network model. IEEE Transactions on Neural Networks, 20(1), 61–80. https://doi.org/10.1109/TNN.2008.2005605

47.

Schneider

Schopf

Vladika

Galkin

Simperl

Matthes

(2022). A decade of knowledge graphs in natural language processing: A survey. arXiv preprint arXiv:2210.00105.

48.

Sendyk

Davies

Priscu

Sutherland

Madi

Dick

Khalil

Abu Alkheir

Wainer

(2022). A task-agnostic machine learning framework for dynamic knowledge graphs. In Proceedings of the 32nd Annual international conference on computer science and software engineering (pp. 22–31). https://doi.org/10.5555/3566055.3566058

49.

Sordo

Oramas

Espinosa-Anke

(2015). Extracting relations from unstructured text sources for music recommendation. In Natural language processing and information systems: 20th International conference on applications of natural language to information systems, NLDB 2015, Passau, Germany, June 17–19, 2015, Proceedings 20 (pp. 369–382). Springer. https://doi.org/10.1007/978-3-319-19581-0_33

50.

Stern

Sagot

(2012). Population of a knowledge base for news metadata from unstructured text and web data. In AKBC-WEKEX 2012-The knowledge extraction workshop at NAACL-HLT 2012.

51.

Suchanek

Weikum

(2013a). Knowledge harvesting from text and web sources. In 2013 IEEE 29th international conference on data engineering (ICDE) (pp. 1250–1253). https://doi.org/10.1109/ICDE.2013.6544916

52.

Suchanek

Weikum

(2013b). Knowledge harvesting from text and web sources. In 2013 IEEE 29th international conference on data engineering (ICDE) (pp. 1250–1253). IEEE. https://doi.org/10.1109/ICDE.2013.6544916

53.

Suchanek

F. M.

Kasneci

Weikum

(2007). Yago: A core of semantic knowledge. In Proceedings of the 16th international conference on World Wide Web (pp. 697–706). https://doi.org/10.1145/1242572.1242667

54.

Thuy

P. T. T.

Thuan

N. D.

Han

Park

Lee

Y. K.

(2014). Rdb2rdf: Completed transformation from relational database into rdf ontology. In Proceedings of the 8th international conference on ubiquitous information management and communication (pp. 1–7). https://doi.org/10.1145/2557977.2558083

55.

Vaswani

Shazeer

Parmar

Uszkoreit

Jones

Gomez

A. N.

Kaiser

Polosukhin

(2017). Attention is all you need. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, & R. Garnett (Eds.), Advances in neural information processing systems (Vol. 30). Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf

56.

(2019). Enriching pre-trained language model with entity information for relation classification. (pp. 2361–2364). ISBN 978-1-4503-6976-3. https://doi.org/10.1145/3357384.3358119

57.

Xiong

Power

Callan

(2017). Explicit semantic ranking for academic search via knowledge graph embedding. In Proceedings of the 26th international conference on world wide web (pp. 1271–1279). https://doi.org/10.1145/3038912.3052558

58.

Guo

Zhang

Feng

(2022). A method for traditional chinese medicine knowledge graph dynamic construction. In Proceedings of the 5th international conference on big data technologies, ICBDT ’22 (pp. 196–202). New York, NY, USA: Association for Computing Machinery. ISBN 9781450396875. https://doi.org/10.1145/3565291.3565323

59.

Yan

Gao

(2019). Research and exploration on the construction method of knowledge graph of water field based on text. In 2019 2nd international conference on information systems and computer aided education (ICISCAE) (pp. 71–77). IEEE. https://doi.org/10.1109/ICISCAE48440.2019.221590

60.

Yasunaga

Ren

Bosselut

Liang

Leskovec

(2021). QA-GNN: Reasoning with language models and knowledge graphs for question answering. In K. Toutanova, A. Rumshisky, L. Zettlemoyer, D. Hakkani-Tur, I. Beltagy, S. Bethard, R. Cotterell, T. Chakraborty, & Y. Zhou (Eds.), Proceedings of the 2021 conference of the north american chapter of the association for computational linguistics: Human language technologies (pp. 535–546). Online: Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.naacl-main.45

61.

Mao

Cai

(2020). A relationship extraction method for domain knowledge graph construction. World Wide Web, 23, 735–753. https://doi.org/10.1007/s11280-019-00765-y

62.

Zhang

W. E.

Nguyen

(2021). Constructing Covid-19 knowledge graph from a large corpus of scientific articles. In 2021 IEEE international conference on big knowledge (ICBK) (pp. 237–244). IEEE. https://doi.org/10.1109/ICKG52313.2021.00040

63.

Zou

(2020). A survey on application of knowledge graph. Journal of Physics: Conference Series, 1487(1), 012016. https://doi.org/10.1088/1742-6596/1487/1/012016

	Group 1: LS			Group 2: OE		Group 3: UD		Group 4: TR	Group 5: LR	Group 6: LC
Paper	English-specific	Non-English	Multilingual	T-Box population	A-Box population	Specific transformation	Generic transformation	Neural network	Lexical resources	Link creation
Zhang and Nguyen (2021)	X				X		X		X	X
Yu et al. (2020)	X							X	X
Rios-Alvarado et al. (2023)		X					X
Stern and Sagot (2012)		X				X
Kertkeidkachorn and Ichise (2018)			X				X			X
Sordo et al. (2015)	X			X			X		X
Dutta et al. (2015)	X				X
Rossanez et al. (2020b)	X			X	X	X			X	X
Lin et al. (2020)			X				X			X
Sendyk et al. (2022)			X		X		X	X
Xu et al. (2022)		X				X		X		X
Fei et al. (2022)		X		X			X	X	X
Liu et al. (2018)			X	X	X		X	X	X
Yan and Gao (2019)		X				X			X
Li et al. (2020)		X								X

A Systematic Literature Review on RDF Triple Generation From Natural Language Texts

Abstract

Keywords

1. Introduction

2.1. Triple Generation: Formalization and Challenges

4. Statistical Analysis

5.1.1. English-Specific

5.1.2. Non-English

5.2. Ontology and KG Enhancement

5.2.1. T-Box Population

5.2.2. A-Box Population

5.3. Underlying Domain

5.3.1. Specific Transformation

5.3.2. Generic Transformation

5.4. Neural Networks

5.5. Lexical Resources

5.6. Link Creation

6. Open Challenges

6.1. Findings and Limitations

6.2. Answer Summary of RQs

6.3. Promising Research Directions

7. Conclusion

Footnotes

Acknowledgments

Funding

Declaration of Conflicting Interests

ORCID iDs

Notes

Appendix

References