Cheap, Quick, and Rigorous: Artificial Intelligence and the Systematic Literature Review

Abstract

The systematic literature review (SLR) is the gold standard in providing research a firm evidence foundation to support decision-making. Researchers seeking to increase the rigour, transparency, and replicability of their SLRs are provided a range of guidelines towards these ends. Artificial Intelligence (AI) and Machine Learning Techniques (MLTs) developed with computer programming languages can provide methods to increase the speed, rigour, transparency, and repeatability of SLRs. Aimed at researchers with coding experience who want to utilise AI and MLTs to synthesise and abstract data obtained through a SLR, this article sets out how computer languages can be used to facilitate unsupervised machine learning for synthesising and abstracting data sets extracted during a SLR. Utilising an already known qualitative method, Deductive Qualitative Analysis, this article illustrates the supportive role that AI and MLTs can play in the coding and categorisation of extracted SLR data, and in synthesising SLR data. Using a data set extracted during a SLR as a proof of concept, this article includes the code used to create a well-established MLT, Topic Modelling using Latent Dirichlet allocation. This technique provides a working example of how researchers can use AI and MLTs to automate the data synthesis and abstraction stage of their SLR, and aid in increasing the speed, frugality, and rigour of research projects.

Keywords

artificial intelligence, machine learning, systematic literature review, social science, transparency

Introduction

The growth of Artificial Intelligence (AI) is changing the way that research is conducted across a variety of fields and disciplines (Longo, 2020). Not since the industrial revolution have new technologies, such as the ones provided through AI and Machine Learning Techniques (MLTs), automated what were once labour-intensive tasks (Dwivedi et al., 2021). Simultaneously, research outputs are continuing to expand exponentially, making reviews that aim to systematically gather and synthesise all pertinent information related to a phenomenon increasingly difficult (Gusenbauer & Haddaway, 2020). There is therefore a gap where AI and MLTs can be utilised to respond to not only the increasing amount of evidence now available, but also to the demands of providing evidence in a timely, frugal, transparent, and rigorous manner. As such, it is not the purpose of this article to claim that AI and MLTs can or will produce superior research, or diminish the role of the researcher in any way, but that they are another tool for researchers to employ; in this context, researchers with coding experience who are conducting systematic literature reviews (SLRs).

The SLR is often considered the ‘gold standard’ of evidence gathering and evidence synthesis (Pati & Lorusso, 2018; Thomé et al., 2016). This article seeks to draw attention to a novel method that researchers conducting a SLR can utilise to reduce time and money on what is otherwise a laborious and time-consuming stage in a SLR, data synthesis and data abstraction. In addition to financial and time constraints, due to the very nature of the time taken to conduct them, SLRs can quickly become out of date (Marshall & Wallace, 2019; Sundaram & Berleant, 2022). Due to this time crunch, the automation of stages of an SLR with AI and MLTs can offer researchers an opportunity to speed up their reviews. Additional benefits of this method are the reduction of researcher bias, improved transparency and repeatability, all of which are intertwined within the SLR process.

An information technology-based computer system, AI has the ability to perform tasks that normally require human intelligence (Yang & Siau, 2018). Human intelligence is characterised by cognitive abilities, learning, adaptability, and decision-making, and emulating it through AI has allowed for the automation of tasks that would normally take a person significant time to complete (Chen et al., 2020). A subset of AI, Machine Learning (ML) allows software applications to make projections and produce more precise outcomes, and is one of the fastest growing fields within computer science (Osisanwo et al., 2017). The three main kinds of ML are reinforcement, supervised, and unsupervised learning (Mahalakshmi et al., 2022).

The method that this article advocates combines an unsupervised MLT with an already established form of qualitative analysis: Deductive Qualitative Analysis (DQA). This combination allows AI to play a supportive role in the coding and categorisation of extracted SLR data, and in synthesising then abstracting SLR data. There are other stages of an SLR that can benefit from AI and MLTs, and this article will briefly touch upon them; however, the main thrust of this work is on the data synthesis and abstraction stage. The data for this article was gathered utilising DQA methods (the process can be found and assessed in Left blank for blind review; Atkinson, 2022). Although the review was based on DQA, it is also important to briefly explain why this method was chosen over other methodologies, such as Grounded Theory Methods (GTMs).

Traditional GTMs are based on researchers collecting and concurrently analysing data (Charmaz & Belgrave, 2007), with the aim of generating new theories based on qualitative data (Glaser et al., 1968). This has traditionally meant that researchers inductively encode data from the beginning of research, even as evidence is still being gathered (Barberis Canonico et al., 2018). Conversely, DQA aims to find a middle ground between a priori theorising and facilitating the emergence of new theories during a research project (Gilgun, 2014; Gilgun, 2019). This allows for the pre-structuring of data encoding, for example, via preconceived data extraction templates, and for it to be updated as new information arises throughout a research project. As such, DQA is a form of concept-guided research that will benefit from MLTs. This is because the middle ground between inductive and deductive reasoning offered by DQA suits MLTs, especially probabilistic MLTs. As DQA facilitates the gathering and combining of information according to predetermined or preidentified themes (Gilgun, 2014), there is a unique opportunity to illustrate how it can be combined with MLTs to synthesise and abstract data gathered during a SLR. Again, it is important to highlight that AI is a supportive tool in this instance. It is to augment the data synthesis and abstraction stage of a SLR, not to replace the role of the researcher. Indeed, it is the role that abstraction, during the synthesising of data, plays within MLTs that places the role of the researcher firmly in the centre of the proposed method.

Involving the simplification of data, abstraction has been widely utilised in AI (see, Zucker (2003) for an overview). In essence, abstraction refers to the human ability to focus on simpler explanations of observation and conceptualisation, alongside reasoning (Goldstone & Barsalou, 1998). In the context of utilising MLTs for data synthesis of SLR data, abstraction involves the identification of key themes or concepts that are relevant to the deductive analysis and representing them in a more concise and organised way (Mohan & Kumar, 2022). As such, it allows for complex data sets, such as those extracted during a SLR, to be simplified to facilitate integration and analysis. This is achieved through filtering crucial elements while excluding unrelated or less significant details (Batra et al., 2020; Kallimani, 2018). Therefore, if a research project, such as this one, utilises data extraction based on preconceived templates, when combined with DQA, abstraction allows researchers to leverage the strengths of qualitative analysis methods with MLTs to synthesise and obtain meaningful insights from qualitative data in a more efficient and scalable manner. One way for researchers to achieve this is through unsupervised MLTs such as topic modelling programs utilising Latent Dirichlet allocation (LDA).

A popular and well-established probabilistic model, LDA facilitates latent topic identification within a collection of documents, such as data uncovered during a SLR. It is commonly applied in Natural Language Processing (NLP) and text mining to identify hidden thematic structures in textual data (Jacobi et al., 2016). LDA facilitates abstraction by automatically identifying latent topics across a corpus of data. Through analysing patterns of word co-occurrence, LDA assigns each document a probability distribution describing the degree to which it is associated with each topic (Rahimi et al., 2022). This then allows for higher level abstraction of the content as researchers can focus on the main themes and ideas uncovered in the data (Priyadharshini & Magesh, 2021). LDA also facilitates abstraction through topic summarisation. LDA generates a set of word-topic distributions representing the probability of each word occurring in each topic (Onah et al., 2022). Through examining the most probable words that are associated with each topic, researchers can gain an understanding of the main concepts and themes that are represented by the topics. This summarisation aids in distilling key information and abstracting the data (Onah et al., 2022). This then helps to accomplish a key outcome sought throughout an SLR: the identification of gaps in the research domain under investigation through a comprehensive summary of its pertinent research (Paul et al., 2021).

The paper is structured as follows. The next section will provide an overview of the literature pertinent to creating and running a SLR. The following section will then discuss the role for AI and automation for SLRs, and briefly how AI can be applied to each stage of an SLR. It will then discuss how MLTs such as LDA topic modelling can aid in increasing the speed, frugality, and strength of the data synthesis and abstraction stage of an SLR. Following this, the methods section will give an extensive overview of the code used to create a well-established MLT, LDA Topic Modelling. Next, the results and discussion section will present the synthesised and abstracted data from a SLR on the resilience and sustainability of energy infrastructures (Left Blank for Review). The data synthesised and abstracted are the ‘Tier One Policy Problems’ that were extracted during the review (Atkinson, 2022). For the purpose of the data, this article is to be considered as a proof of concept. The full explanation and analysis of the extracted data will be provided in future publications. The article will then finish with a section covering the limitations and biases of this method, followed by a conclusion.

Literature Review

SLRs

SLRs seek to gather, appraise, and synthesise evidence pertaining to a phenomenon under investigation (Petticrew & Roberts, 2008). In order to achieve this, they involve thorough searches of research databases aimed at uncovering all pertinent work in the review domain (Siddaway et al., 2019). There are a number of reasons to undertake a SLR: (1) to advertise research being undertaken and to avoid repeating research already conducted; (2) to guide the planning of new research; and (3) to underpin claims of originality when new research is contrasted against old (Paul et al., 2021).

SLRs are important because they allow for the incremental advancement of a field through building on previous research (Lame, 2019). Additionally, they provide a structured approach to gathering, appraising, and synthesising evidence; they aid in monitoring research practices in the review domain; and they are a means of bridging different domains (Lame, 2019). Not all SLRs follow the same guidelines; however, there are certain stages that are common to all SLRs.

The process of formulating and creating a SLR can be broken down into several stages. Framing a research question is generally considered the first stage (Khan et al., 2003). The next stage is where relevant works are identified, normally through the screening of titles and abstracts, followed by sourcing full texts of included studies and performing quality assessment on the included studies (Khan et al., 2003). Following this stage is data extraction, then data synthesis and data abstraction, followed with the analysis/interpretation of results (Beller et al., 2018; Tsafnat et al., 2014).

There are many guides available to help researchers with these stages. For instance, the 2020 Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) statement contains a 27 item guideline for conducting a systematic review (Page et al., 2021). Additionally, the Preferred Reporting Items for Systematic reviews and Meta-Analyses – Protocols (PRISMA-P) checklist is a 15 item guideline that aims at aiding the reporting of review protocols prior to their conduction (Moher et al., 2015). Table 1 provides a list of available guidelines for researchers seeking to conduct a SLR. There are different methods that researchers may employ to accomplish each stage of an SLR. Recently, AI and MLTs have begun to be applied to stages of SLRs.

Table 1.

Guidelines for SLRs.

Name	Discipline	Authors/Year
Preferred reporting items for systematic reviews and meta-analyses	Medical, but can be used elsewhere.	(Page et al., 2021)
Preferred reporting items for systematic reviews and meta-analyses – protocols	Medical, but can be used elsewhere.	(Moher et al., 2015)
Guidelines for performing systematic literature reviews in software engineering	Software engineering	(Keele, 2007)
A guide to conducting a standalone systematic literature review	Information systems	(Okoli, 2015)
How to do a systematic literature review in nursing: A step-by-step guide	Nursing	(Bettany-Saltikov & McSherry, 2016)
Conducting systematic literature review in operations management	Operations management	(Thomé et al., 2016)
How to do a systematic review: A best practice guide for conducting and reporting narrative reviews, meta-analyses, and meta-syntheses	Psychology	(Siddaway et al., 2019)
Guidance on conducting a systematic literature review	Education and research.	(Xiao & Watson, 2019)

Automating Stages of an SLR

The use of AI in SLRs is not a newfound occurrence. AI tools have been utilised in reviews and evidence synthesis since 2016 (see EPPI-Reviewer and Abstrackr for examples) (Nguyen-Trung et al., 2023). SLRs take a substantial amount of time, months to years in some cases (Larsen et al., 2019; Tsafnat et al., 2014), and are becoming increasingly complicated due to the amount of published evidence that needs to be incorporated into the review (Antons et al., 2023; Beller et al., 2018). Additional tools are therefore needed to increase their speed and frugality while maintaining the rigour and transparency expected in an SLR. A route to overcoming these issues is to integrate computational methods that combine the power of human comprehension and judgement with a computer's speed and effectiveness (Antons et al., 2023). These methods can be applied to individual or multiple stages of a review. As noted earlier, the first stage of a SLR is generally research question formulation. Following this stage, AI and MLTs can increase the speed and frugality of SLRs.

Search Strategy and Inclusion/Exclusion Criteria using Chat GPT

Normally a labour-intensive stage, formulating a comprehensive search strategy to answer a research question and then determining inclusion and exclusion criteria is a stage that can benefit from AI. A public tool created by OpenAI, Chat GPT is based on the Generative Pre-Trained language model (Kirmani, 2022). Chat GPT is a sophisticated chatbot capable of completing an array of text-based tasks that range from answering questions, to generating letters (Lund & Wang, 2023). By prompting Chat GPT with a research question, the tool can then aid in developing preliminary search strategies for researchers to examine. Wang et al. (2023) provide a comprehensive guide on how to prompt Chat GPT to formulate search strategies to be utilised in SLRs. The tool can then be prompted to suggest possible inclusion and exclusion criteria within the context of the research to be undertaken.

Title/Abstract Screening using ASReview

When conducting a SLR, researchers can face hundreds, if not thousands, of studies to screen by titles and abstracts for inclusion/exclusion in their reviews. This is traditionally a labour-intensive critical stage of a review that needs to be conducted as efficiently and as transparently as possible. A program that can drastically speed up this process is ASReview.

ASReview utilises machine learning techniques to overcome the manual and time-consuming process of screening titles and abstracts for systematic reviews. An open source and free tool, the source code of ASReview is available under an Apache 2.0 license and includes the relevant documentation (van de Schoot et al., 2021).

The process of using ASReview is simple: (1) researchers upload a file containing the metadata of articles identified in their literature searches; (2) the researcher trains the model. As the screening process is binary in nature, a researcher must select at a minimum one article that corresponds to their inclusion criteria and one that does not. The more articles selected for inclusion/exclusion at the beginning, the more efficient the active learning process utilised in ASReview; (3) the binary label (1 for relevant vs. 0 for irrelevant) is then used to train a new model, after which a new article is presented to the researcher based on the previous articles' inclusion or exclusion (van de Schoot et al., 2021). This process continues until a user-specified number is reached. The researcher then has a file of articles labelled relevant or irrelevant (van de Schoot et al., 2021). It is important to highlight that ASReview is not a bias-free tool and is only effective when inclusion/exclusion criteria and a stopping rule are predefined (Warren & Moustafa, 2023).
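The loop described above can be sketched in a few lines of Python. This is a toy illustration only: the word-count scorer, the function names (`screen`, `score`), the `stop_after` parameter, and the example records are all assumptions made for this sketch, and ASReview's own models and query strategies are more sophisticated than what is shown here.

```python
from collections import Counter

def tokenize(text):
    return text.lower().split()

def score(text, relevant_counts, irrelevant_counts):
    """Crude relevance score: words seen in included records count for, words in excluded records count against."""
    return sum(relevant_counts[w] - irrelevant_counts[w] for w in tokenize(text))

def screen(records, oracle, seed_relevant, seed_irrelevant, stop_after):
    """Toy active-learning loop: retrain, present the top-ranked unlabelled record, repeat."""
    labels = {seed_relevant: 1, seed_irrelevant: 0}
    while len(labels) < min(stop_after, len(records)):
        # "Retrain": rebuild the word counts from everything labelled so far.
        relevant_counts, irrelevant_counts = Counter(), Counter()
        for idx, label in labels.items():
            target = relevant_counts if label == 1 else irrelevant_counts
            target.update(tokenize(records[idx]))
        # Present the unlabelled record that the current model ranks highest.
        unlabelled = [i for i in range(len(records)) if i not in labels]
        best = max(unlabelled, key=lambda i: score(records[i], relevant_counts, irrelevant_counts))
        labels[best] = oracle(best)  # the researcher's include (1) / exclude (0) decision
    return labels

# Hypothetical titles; `truth` stands in for the researcher's judgement.
records = [
    "energy infrastructure resilience policy",
    "protein folding in yeast cells",
    "resilience of energy grids under climate change",
    "yeast cells gene expression",
    "sustainable energy policy for infrastructure",
]
truth = [1, 0, 1, 0, 1]
labels = screen(records, oracle=lambda i: truth[i],
                seed_relevant=0, seed_irrelevant=1, stop_after=4)
print(labels)
```

In this toy run the two unlabelled energy-related records are presented before the remaining irrelevant one, which is the behaviour that lets a reviewer stop early once the stopping rule is met.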

Data Extraction using ChatPDF

Utilising GPT 3.5, ChatPDF allows the user to converse with PDF files. Once the application is opened, PDF files can be uploaded, and the user can begin asking the article questions using the built-in interface. The application and how to set it up can be found at https://github.com/dotvignesh/PDFChat. This program allows researchers to scrape and extract data from PDF files using targeted questions. This can significantly increase the speed at which data is extracted, greatly reducing researcher time and speeding up the process of a review. The novelty and benefits of this method of data extraction are currently the subject of another methods paper.

As can be seen, each stage of a review provides an opportunity for automation. MLTs are being increasingly employed to analyse and automate steps in the SLR process. Outside of the tools already outlined, Sundaram and Berleant (2022) and Tsafnat et al. (2014) provide an overview of AI tools that can be used for steps in the review process, and Antons et al. (2023) provide a comprehensive review method for the creation of standalone computational literature reviews.

Data Synthesis and Abstraction: Machine Learning Techniques

As the focus of this article is on the data synthesis and abstraction stage, it is this stage that will be unpacked. According to Sundaram and Berleant (2022) there is a need for SLRs focusing on automation on data synthesis. Indeed, due to the amount of unstructured data uncovered during a review, new synthesis methods, or the integration of methods from other disciplines, have the ability to expand the understanding of the emergent world of data (Abram et al., 2020).

Automating Data Synthesis

Data synthesis is the approach to interpreting extracted data to satisfy an SLR's question(s) or hypothesis/hypotheses (Sundaram & Berleant, 2022). It is the organisation of the raw data obtained during a review into a format that is simple to comprehend and then examine. All researchers employ a coding scheme to synthesise and then abstract data, whether with index cards, highlighters, glue and scissors, or programs such as NVivo (Perrin, 2001). Data synthesis and abstraction has been identified as one of the most time-consuming components of conducting a SLR, but one that can be improved through automation (Sundaram & Berleant, 2022; Tsafnat et al., 2014). It is also in this stage where the subjective biases that researchers hold may pose several problems with respect to an SLR's validity. A lack of clear processes and techniques during synthesis can threaten result legitimacy (Evans, 2002). Also, owing to subjective decisions made by researchers, it is difficult to separate a researcher's own beliefs and world views from those of the subject matter (Evans, 2002). Furthermore, due to missing procedures and techniques, there is unlikely to be a trail of the decisions made for others to judge their worth (Evans, 2002). A way to address these issues is using AI and MLTs created with computer programming languages.

Natural Language Processing

As a broad field within AI, NLP is a MLT that straddles the spaces of computational linguistics, AI, and computer science, and covers the manipulation and computer understanding of human language (Millstein, 2020). At its simplest, NLP is the ‘ability for a computer/system to truly understand human language and process it in the same way that a human does’ (Goyal et al., 2018, p. 16). NLP has become increasingly prevalent in society. Predictive text and handwriting recognition, web search engines, and machine translation are all based on NLP technologies (Bird et al., 2009). It is also being increasingly adopted in the social sciences (Ungless et al., 2023). The use of MLTs in NLP is useful for researchers as it allows for the automation of tasks, making them more cost and time efficient (Le Glaz et al., 2021).

Topic Modelling

A statistical form of NLP, topic modelling utilises algorithms to summarise large quantities of texts into a range of topics (Leeson et al., 2019). A common way to model topics is using LDA. As a three-level Bayesian model, LDA is a generative probabilistic model for collections of data such as text corpora (Blei et al., 2003). The central thrust of the model is that texts are represented as random mixtures over latent topics, where each topic is characterised by a distribution over words (Blei et al., 2003). This means that each text is a collection of topics or themes, and that the existence of each word can be assigned to one of the text's topics or themes. It should be noted that LDA is not the only form of topic modelling available to researchers. Other topic models include: Non Negative Matrix Factorization, developed by Liu et al. (2006); Latent Semantic Analysis, developed by Landauer et al. (1998); Parallel Latent Dirichlet allocation, developed by Wang et al. (2009); and Pachinko Allocation Model, developed by Li and McCallum (2006).
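To make this generative view concrete, the following minimal sketch samples one toy document in the way the LDA model assumes documents arise: a topic mixture is drawn from a Dirichlet distribution, and each word is then produced by drawing a topic from that mixture and a word from that topic. The vocabulary, the two hand-written topic distributions, the alpha values, and the function names are hypothetical and chosen purely for illustration; they are not taken from the review data.

```python
import random

def sample_dirichlet(alpha, rng):
    """Draw one sample from a Dirichlet distribution by normalising Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]

def generate_document(topic_word, vocab, alpha, n_words, rng):
    """Generate one document following the LDA generative story."""
    theta = sample_dirichlet(alpha, rng)  # this document's mixture over topics
    words = []
    for _ in range(n_words):
        z = rng.choices(range(len(topic_word)), weights=theta)[0]  # pick a topic for this word
        w = rng.choices(vocab, weights=topic_word[z])[0]           # pick a word from that topic
        words.append(w)
    return theta, words

# Hypothetical vocabulary and per-topic word distributions (each row sums to 1).
vocab = ["climate", "demand", "supply", "local", "infrastructure", "fossil"]
topic_word = [
    [0.4, 0.3, 0.3, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.4, 0.3, 0.3],
]

rng = random.Random(0)
theta, words = generate_document(topic_word, vocab, alpha=[0.5, 0.5], n_words=8, rng=rng)
print(theta)
print(words)
```

Fitting an LDA model runs this story in reverse: given only the observed words, it infers the topic mixtures and the per-topic word distributions that most plausibly produced them.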

Synthesising and Abstracting With Topic Modelling

Topic Modelling using MLTs such as LDA can be employed to leverage data gathered during a SLR for synthesis. This is accomplished through the creation of visualisations of topic clusters from a corpus of extracted text. Indeed, Rahgozar and Inkpen (2019) have illustrated how such techniques can be utilised to produce useful results through LDA topic modelling, including identifying the most relevant terms within topics and mapping intertopic distances. The identification of latent topics within the synthesised text also provides strong abstraction of the data. As the LDA tool generates a set of topics that provide a representation of the included texts through topic scoring (counting the number of times a word appears within a topic cluster), the text is reduced to a simpler subset of the original text (Priyadharshini & Magesh, 2021). This text abstraction allows for easier analysis. It is also a tool that can be utilised across different reviews or research projects. Considering the time and money saved in employing a versatile tool that, once written, can be used repeatedly for reliable results, the benefits for researchers are significant. Furthermore, employing this tool also adds to the transparency of the research being conducted, as questions can be asked of the code utilised in this process.

The methods section will illustrate the code utilised for this article and for future works. As mentioned earlier, there is an expectation that those who will utilise this tool will have a certain level of experience in coding. This is not to say that, with some time and effort, less knowledgeable researchers will not be able to utilise this method; however, there is unfortunately insufficient space to tailor the approach to novices. There are many blogs and articles that explain the code, or variants of the code, used to create an LDA topic model. The time has been taken to include it in this article due to the important position that SLRs have within research.

Methods

Python version 3.11.3 was used for this project. Table 2 provides a list of the libraries used and their versions.

Table 2.

Python Libraries and Versions.

Library	Version
Pandas	1.5.3
Spacy	3.5.2
NLTK	3.8.1
Gensim	4.3.1
pyLDAvis	3.4.0
Random2	1.0.1
Pickle	Built-in library
Logging	Built-in library

Each library was installed with Pip3 using PowerShell. The code was written in a Jupyter Notebook using Microsoft Visual Studio Code. As Python is an open source and free program, there exists an extensive amount of online material to help create codes. Acquired by Microsoft in 2018, GitHub is a code hosting platform for version control and collaboration (GitHub, 2022b). If there is a particular topic of interest, GitHub facilitates browsing via topics (GitHub, 2022a). For example, ‘machine learning’ is a searchable topic in GitHub, as is ‘Topic Modelling’.

Code can be a challenge to cite as traditional bibliographic information is not always supplied by its creator. The use of online pseudonyms also makes this task difficult. To make it easier for researchers to cite code from a GitHub repository, GitHub has built-in support for CITATION.cff files. This feature allows academics and researchers to let people know how to correctly cite their work (Smith, 2021). Unfortunately, non-academics do not always share the same views regarding correct citations. This makes it difficult to provide comprehensive bibliographic data for the code used. Indeed, as noted at the International Collaboration for the Automation of Systematic Reviews and by O’Connor et al. (2018), there is a lack of transparency with respect to ML systems. The solution, according to Beller et al. (2018), is that every technique used for automation should be shared, in particular, by making code, evaluation data, and corpora available. In line with this, the following section contains the code used to create LDA Topic Modelling for this concept paper. The corpora will also be made available alongside the article.

The following sections have been delineated into separate parts. Part A is the importing of the libraries required for the model; Part B sets out how the data was cleaned; Part C creates the dictionary to be used; and finally Part D covers the creation of the LDA model. The references that are provided alongside the code go into more detail than is provided here. The reader is advised to go to the original code sources should they desire more information.

LDA Topic Modelling Code

A. Importing the libraries and file required for the LDA tool

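i. Import Libraries. The libraries and their versions can be seen in Table 2.

import spacy, nltk, gensim, pickle, random2, sklearn
import pandas as pd
import re
import en_core_web_sm
from nltk.corpus import stopwords

# Download the NLTK resources needed for lemmatisation (step v) and stop words (step vi).
nltk.download('wordnet')
nltk.download('stopwords')

# Load spaCy's small English pipeline. Adapted from Tavora (2018).
nlp = spacy.load("en_core_web_sm")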

ii. Load file.

# Read the extracted SLR text from a CSV file with no header row. Adapted from Tavora (2018).
df = pd.read_csv(r"Paste CSV file path here", header=None)
print(df.shape)
print(df.head())

iii. Convert to list.

# Take the first column of the data frame as a list of documents. Adapted from Tavora (2018).
doc_set = df.values.T.tolist()[0]
print(doc_set[:200])

B. Data Cleaning.

iv. Tokenise

from nltk.stem import WordNetLemmatizer
from nltk.tokenize import RegexpTokenizer

lemmatizer = WordNetLemmatizer()
tokenizer = RegexpTokenizer(r'\w+')  # keep runs of word characters, dropping punctuation

# Lower-case and tokenise each document. Adapted from Tavora (2018).
tokenined_docs = []
for doc in doc_set:
    tokens = tokenizer.tokenize(doc.lower())
    tokenined_docs.append(tokens)

print(tokenined_docs[0][0:10])

v. Lemmatisation

# Reduce each token to its dictionary form (lemma). Adapted from Tavora (2018).
lemmatized_tokens = []
for lst in tokenined_docs:
    tokens_lemma = [lemmatizer.lemmatize(i) for i in lst]
    lemmatized_tokens.append(tokens_lemma)

print(lemmatized_tokens[0][0:10])

vi. Stopwords

from nltk.corpus import stopwords

# NLTK's built-in English stop-word list. Adapted from Tavora (2018).
origional_stopwords = stopwords.words('english')
print(origional_stopwords)

vii. Custom Stopwords list

# Add any corpus-specific words that should be excluded to this list.
custom_stop_word_list = []
print(custom_stop_word_list)

final_stopword_list = custom_stop_word_list + origional_stopwords
print("Total number of final stop words:", len(final_stopword_list))

viii. Pass cleaned data, and apply word length limit (optional)

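n = 2  # word length limit: only tokens longer than n characters are kept

# Drop stop words and short tokens. Adapted from Tavora (2018).
cleaned_tokens = []
for lst in lemmatized_tokens:
    cleaned_tokens.append([i for i in lst if i not in final_stopword_list and len(i) > n])

print(cleaned_tokens[0][:200])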
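The effect of the cleaning steps can be checked on a single sentence. The sketch below is a standard-library approximation under stated assumptions: `STOP_WORDS` is a hypothetical seven-word list standing in for the NLTK list, the `clean` function and the example sentence are invented for illustration, and lemmatisation is omitted.

```python
import re

# Hypothetical mini stop-word list; the article uses NLTK's English list plus custom additions.
STOP_WORDS = {"the", "of", "and", "to", "is", "in", "on"}
MIN_LENGTH = 2  # mirrors n = 2: keep only tokens longer than two characters

def clean(text):
    """Lower-case, split into runs of word characters, then drop stop words and short tokens."""
    tokens = re.findall(r"\w+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS and len(t) > MIN_LENGTH]

print(clean("The balance of supply and demand is a problem in EU grids."))
# → ['balance', 'supply', 'demand', 'problem', 'grids']
```

Note that the length limit also removes short acronyms such as "EU", which is worth bearing in mind when choosing n for a policy-oriented corpus.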

ix. Bi and Trigrams

# Learn frequently co-occurring two- and three-word phrases. Adapted from Prabhakaran (2018).
bigram = gensim.models.Phrases(cleaned_tokens, min_count=1, threshold=1)
trigram = gensim.models.Phrases(bigram[cleaned_tokens], threshold=1)

# Freeze the trained phrase models for faster, lower-memory use.
bigram_mod = gensim.models.phrases.FrozenPhrases(bigram)
trigram_mod = gensim.models.phrases.FrozenPhrases(trigram)

def make_bigrams(texts):
    return [bigram_mod[doc] for doc in texts]

def make_trigrams(texts):
    # Use the function argument rather than the global cleaned_tokens list.
    return [trigram_mod[bigram_mod[doc]] for doc in texts]

x. Pass Bi and Trigrams through cleaned list

# Adapted from Prabhakaran (2018).
tokens_bigrams = make_bigrams(trigram_mod[bigram_mod[cleaned_tokens]])
print(tokens_bigrams[0][:200])

C. Create dictionary.

xi. Build dictionary.

from gensim import corpora, models

# Map each unique token to an integer id. Adapted from Tavora (2018).
dictionary = corpora.Dictionary(tokens_bigrams)

xii. Tokenize documents into document-term matrix.

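# Convert each document to a bag of words: a list of (token id, count) pairs.
corpus = [dictionary.doc2bow(text) for text in tokens_bigrams]

# Save the corpus and dictionary for later reuse. Adapted from Tavora (2018).
import pickle
with open('corpus.pkl', 'wb') as f:
    pickle.dump(corpus, f)
dictionary.save('dictionary.gensim')

print(corpus[0][:10])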
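For readers unfamiliar with the bag-of-words representation produced in this step, the following standard-library sketch shows the idea on two toy documents. The helper names and the documents are hypothetical, and gensim may assign different integer ids to the same tokens; only the shape of the output, a list of (token id, count) pairs per document, is the point.

```python
from collections import Counter

def build_dictionary(docs):
    """Assign an integer id to each distinct token, in order of first appearance."""
    token2id = {}
    for doc in docs:
        for token in doc:
            token2id.setdefault(token, len(token2id))
    return token2id

def doc2bow(doc, token2id):
    """Return sorted (token id, count) pairs for the tokens found in the dictionary."""
    counts = Counter(token2id[t] for t in doc if t in token2id)
    return sorted(counts.items())

# Hypothetical cleaned documents, with one phrase already joined by an underscore.
docs = [["climate_change", "demand", "supply", "demand"], ["local", "climate_change"]]
token2id = build_dictionary(docs)
bows = [doc2bow(d, token2id) for d in docs]
print(bows)
# → [[(0, 1), (1, 2), (2, 1)], [(0, 1), (3, 1)]]
```

Word order is discarded at this point, which is why phrases such as climate_change need to be joined before the dictionary is built.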

D. Creating the LDA model.

xiii. Create model.

# Train a five-topic LDA model. Adapted from Prabhakaran (2018) and Tavora (2018).
ldamodel = gensim.models.ldamodel.LdaModel(corpus, num_topics=5, id2word=dictionary, passes=20, alpha='auto', per_word_topics=True)
ldamodel.save('model.gensim')

# Print the ten highest-weighted terms for each topic. Adapted from Tavora (2018).
for el in ldamodel.print_topics(num_topics=5, num_words=10):
    print(el, '\n')
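The topic strings printed in this step join weight*"term" pairs with plus signs. Where the weights are needed as numbers, for example to build a table, a small parser such as the one below may help. The `parse_topic` helper is a hypothetical convenience written for this sketch and is not part of gensim; the example string uses the first three terms of the first topic reported in the results.

```python
import re

def parse_topic(topic_string):
    """Split a topic string of weight*"term" pairs into (term, weight) tuples."""
    pairs = re.findall(r'([0-9.]+)\*"([^"]+)"', topic_string)
    return [(term, float(weight)) for weight, term in pairs]

example = '0.012*"climate_change" + 0.007*"electricity" + 0.007*"demand"'
print(parse_topic(example))
# → [('climate_change', 0.012), ('electricity', 0.007), ('demand', 0.007)]
```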

xiv. Pass dictionary through model.

# Reload the saved dictionary, corpus, and model. Adapted from Tavora (2018).
dictionary = gensim.corpora.Dictionary.load('dictionary.gensim')
with open('corpus.pkl', 'rb') as f:
    corpus = pickle.load(f)
lda = gensim.models.ldamodel.LdaModel.load('model.gensim')

xv. Create visual display.

import pyLDAvis.gensim_models

lda = gensim.models.ldamodel.LdaModel.load('model.gensim')

# Build and display the interactive topic visualisation. Adapted from Tavora (2018).
lda_display = pyLDAvis.gensim_models.prepare(lda, corpus, dictionary, sort_topics=False)
pyLDAvis.display(lda_display)

This section has set out the code used to create an LDA topic model. The next section will present the results produced by this code and will include a discussion.

Results

This section presents the results from the above code used to create LDA Topic Models with Python. Synthesising and abstracting all the extracted data from the SLR under the preidentified theme ‘Tier One Policy Problems’, or the policy problems most common to each included study, helps to illustrate the largest issues impacting the resilience and sustainability of energy infrastructures in the investigated research domain. Due to space constraints, this article will only discuss the first two topics. The ten most common themes for each topic are presented in Table 3.

The abstractions of topics one and two are straightforward. The first topic, Topic 1 in Table 3, lists climate change alongside themes of supply and demand, electricity generation, global balance, and problem. What is identified here is that Topic 1 is grouped around the issue of climate change and how it relates to the problem of balancing global energy operations regarding the supply and demand of electricity. This topic and its related themes are supported by a great deal of international research. For example, Pfenninger et al. (2014) discuss energy system modelling in the context of uncertainty due to climate change and how to balance supply and demand. The International Renewable Energy Agency (IRENA) also supports this topic. Their 2019 report, Global Energy Transformation: A Roadmap to 2050, notes that the global CO2 energy budget is running out. The report points to how electrical grids can help balance supply and demand for electricity and address climate change (Gielen et al., 2019).

Table 3.

LDA Topic Modelling Topics and Themes.

Topic #1	Topic #2
(1, '0.012*"climate_change" + 0.007*"electricity" + 0.007*"demand" + 0.007*"global" + 0.007*"generation" + 0.007*"climate" + 0.005*"supply" + 0.005*"operation" + 0.005*"balance" + 0.005*"problem"')	(2, '0.016*"climate_change" + 0.013*"system" + 0.012*"infrastructure" + 0.007*"sector" + 0.007*"local" + 0.007*"fossil_fuel" + 0.005*"potential" + 0.005*"energy" + 0.005*"reliable" + 0.005*"electricity"')
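The topic strings printed by gensim take the form weight*"term" joined by plus signs, which is compact but awkward to read or tabulate. The following is a minimal sketch, using only the standard library, of how such a string might be split into (term, weight) pairs. The helper name parse_topic and the shortened example string are illustrative assumptions.

```python
import re

# Matches fragments such as 0.012*"climate_change" (a leading zero is optional).
TERM_PATTERN = re.compile(r'(\d*\.\d+)\*"([^"]+)"')


def parse_topic(topic_string):
    """Split a gensim-style topic string into a list of (term, weight) pairs."""
    return [(term, float(weight)) for weight, term in TERM_PATTERN.findall(topic_string)]


# Shortened example in the same format as the Topic #1 column of Table 3.
example = '0.012*"climate_change" + 0.007*"electricity" + 0.007*"demand"'

for term, weight in parse_topic(example):
    print(f"{term}\t{weight:.3f}")
```

Where the fitted model is still in memory, gensim's show_topic method returns term and weight pairs directly, so string parsing of this kind is mainly useful for output that has already been saved or published as text.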

The abstracted data seen in Topic two, in Table 3, has the themes of climate change and fossil fuel, the infrastructure sector, local energy system, and potential for reliable electricity. What is identified here is that the local energy system has potential for reliable energy generation; it could be assumed that this would address climate change by transitioning away from fossil fuels. Again, to support this statement, there has been considerable research devoted to investigating this topic. For example, Hori et al. (2020) have conducted research on how local energy systems that include participatory approaches have the potential to produce reliable and sustainable renewable energy and aid in mitigating the effects of climate change. Similarly, Dobravec et al. (2021) provide evidence on how a shift from top-down federal approaches towards increasing the involvement of local and regional areas can aid in achieving climate change related goals. The proposed multi-level governance framework will aid in the push towards a transition to low-carbon and 100% renewable energy systems (Dobravec et al., 2021).

As noted earlier, the LDA Topic Model also provides an Intertopic Distance Map. This map allows users to view the distances between the topics uncovered through the process set out in the methods section. Figures 1 and 2 provide the Intertopic Distance Maps for topics one and two, and they show an overlap between the two topics. This overlap suggests that Topic 1, on climate change and the problem of balancing the global supply and demand of electricity, is closely related to Topic 2, on the potential of local energy systems to generate reliable energy, reduce reliance on fossil fuels, and mitigate the effects of climate change.

Figure 1.

Intertopic distance map for topic 1.

Figure 2.

Intertopic distance map for topic 2.
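The distances in an Intertopic Distance Map are derived from how similar the topics' word distributions are. pyLDAvis is understood to use, by default, the Jensen-Shannon divergence between topic-term distributions, followed by a projection into two dimensions. The following is a minimal sketch of the divergence alone, under that assumption; the three toy distributions are invented for illustration and are not taken from the fitted model.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence in bits; terms with p_i == 0 contribute nothing."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def jensen_shannon_divergence(p, q):
    """Symmetric divergence between two distributions, bounded by 0 and 1 in base 2."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


# Toy topic-term distributions over a four-word vocabulary (hypothetical values).
topic_a = [0.5, 0.3, 0.2, 0.0]
topic_b = [0.4, 0.3, 0.2, 0.1]  # shares most of its probability mass with topic_a
topic_c = [0.0, 0.0, 0.1, 0.9]  # concentrates on a different word

print(jensen_shannon_divergence(topic_a, topic_b))
print(jensen_shannon_divergence(topic_a, topic_c))
```

Topics that place weight on the same words, as topics one and two both do with climate_change, yield a small divergence and are therefore drawn close together or overlapping on the map.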

Discussion

The use of AI and MLTs to automate SLRs is growing rapidly. Such automation tools make SLRs more efficient and timely, and reduce the number of researchers and the effort required (Beller et al., 2018). Furthermore, they can lessen researcher biases and mistakes made during the review process (Beller et al., 2018). As can be seen in the code presented above, the number of topics to present, whether to include bigrams or trigrams, how data is cleaned, and even how many times a word must appear before it is included are all value decisions that researchers will have to make when utilising this process. However, when code is presented alongside results, as it is here, there is a trail for other researchers to follow. Decisions can then be questioned, code can be changed to see what would occur differently, and researchers can better understand and defend their results. For researchers conducting SLRs, this method serves to make the process more rigorous, transparent, and, importantly, repeatable.
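One of the value decisions named above, how many times a word must appear before it is included, can be made explicit in a few lines. The following is a minimal sketch that keeps only tokens appearing in at least a chosen number of documents. The function names, the threshold, and the toy documents are illustrative assumptions; in gensim, the no_below argument of Dictionary.filter_extremes serves a comparable purpose.

```python
from collections import Counter


def document_frequencies(documents):
    """Count, for each token, the number of documents in which it appears."""
    frequencies = Counter()
    for tokens in documents:
        frequencies.update(set(tokens))
    return frequencies


def filter_rare_tokens(documents, min_docs):
    """Drop tokens that appear in fewer than min_docs documents."""
    frequencies = document_frequencies(documents)
    return [[t for t in tokens if frequencies[t] >= min_docs] for tokens in documents]


# Toy documents (hypothetical, not the SLR data set).
toy_documents = [
    ["climate_change", "supply", "demand"],
    ["climate_change", "fossil_fuel", "demand"],
    ["climate_change", "grid"],
]

filtered = filter_rare_tokens(toy_documents, min_docs=2)
print(filtered)
```

Changing min_docs and rerunning the model is one way another researcher could test how sensitive the reported topics are to this particular decision.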

Limitations and Future Research

MLTs and AI are further tools that researchers can employ during their SLRs. However, it is important to highlight that both AI and MLTs are limited by their input. If the input data is noisy, incomplete, or has been extracted through processes that have introduced selective biases, then the results themselves may be erroneous (Le Glaz et al., 2021). Domingos (2012) points out that algorithms themselves may introduce biases. Importantly, although the subjective decisions of the researcher are made visible, this only shifts biases from subjective to systematic. Furthermore, care needs to be taken when using topic modelling, as unsupervised learning can produce inconsistent results (Watanabe & Zhou, 2020). Nevertheless, this is a step forward, because subjective decisions that are made visible are decisions that can be questioned, defended, or further explained.
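One simple way to gauge the inconsistency noted above is to fit the model more than once and compare the top words of the resulting topics. The following is a minimal sketch of such a comparison using Jaccard overlap. The function names and the two toy runs are illustrative assumptions and do not reproduce results from this study.

```python
def jaccard(words_a, words_b):
    """Share of words common to both lists, relative to all words in either list."""
    a, b = set(words_a), set(words_b)
    return len(a & b) / len(a | b) if a | b else 1.0


def best_match_overlap(run_a, run_b):
    """For each topic in run_a, the highest Jaccard overlap with any topic in run_b."""
    return [max(jaccard(topic_a, topic_b) for topic_b in run_b) for topic_a in run_a]


# Top words per topic from two hypothetical runs; topic order may differ between runs.
run_one = [
    ["climate_change", "electricity", "demand"],
    ["system", "infrastructure", "local"],
]
run_two = [
    ["infrastructure", "system", "sector"],
    ["climate_change", "demand", "supply"],
]

scores = best_match_overlap(run_one, run_two)
print(scores)
```

Scores near 1 indicate topics that recur across runs, while low scores flag topics that depend on the random initialisation. Fixing the random_state argument when creating the gensim LdaModel makes a single run repeatable, although it does not by itself show that the topics are stable.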

This article is intended only as a proof of concept, and the results provided are limited by the space available. Future work will utilise this method to provide a comprehensive synthesis and abstraction of the data extracted from two SLRs investigating how governance settings can enhance the resilience and sustainability of energy and water infrastructures. This will provide further evidence of the value of the method set out in this article. Beyond SLRs, datasets obtained from semi-structured interviews may also benefit from this method. Should researchers wish to use MLTs on interview datasets, there are additional biases, such as language, that need to be taken into consideration (Ungless et al., 2023). Utilising the other AI and MLTs outlined in this article, two automated SLRs are currently being conducted on how governance settings can improve the resilience and sustainability of transport and communication infrastructures.

Conclusion

The significant increase in the amount of published research has made conducting SLRs more time-consuming and costly. Additionally, the demand for rigorous and transparent research further increases the costs of conducting an SLR, both monetarily and in time. This article has put forward a method for researchers who have coding experience to utilise AI and MLTs to reduce both the time and cost of synthesising and abstracting the data obtained through a SLR. In combining MLTs with DQA, this article has also provided a proof of concept of how researchers can utilise MLTs such as LDA Topic Modelling to synthesise and then abstract the data obtained during a SLR. Utilising this method, the data uncovered during a SLR on how governance settings can enhance the resilience and sustainability of energy infrastructures, specifically the extracted data regarding the ‘Tier One Policy Problems’, has been synthesised, abstracted, and then briefly analysed.

Supplemental Material

Supplemental Material - Cheap, Quick, and Rigorous: Artificial Intelligence and the Systematic Literature Review

Supplemental Material for Cheap, Quick, and Rigorous: Artificial Intelligence and the Systematic Literature Review by Cameron F. Atkinson in Social Science Computer Review

Footnotes

Acknowledgements

I would like to acknowledge that this research was made possible through the College of Arts, Law and Education Research Training Program Stipend provided by the Australian Federal Government and a top-up scholarship provided by Natural Hazards Research Australia.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Natural Hazards Research Australia (Stipend scholarship).

Data Accessibility

The author confirms that the data supporting the findings of this study is available with its .

ORCID iD

Cameron F. Atkinson

Supplemental Material

Supplemental material for this article is available online.

Author Biography

Cameron Atkinson is a PhD candidate in the School of Social Sciences at the University of Tasmania. Cameron is a public policy researcher with an interest in disaster resilience policy, focusing on how public policy can contribute to enhancing infrastructure resilience. His interests also lie in increasing the rigour and transparency of policy-centred research.

References

Abram, M. D., Mancini, K. T., & Parker, R. D. (2020). Methods to integrate natural language processing into qualitative research. International Journal of Qualitative Methods, 19, 1609406920984608. https://doi.org/10.1177/1609406920984608

Antons, Breidbach, C. F., Joshi, A. M., & Salge, T. O. (2023). Computational literature reviews: Method, algorithms, and roadmap. Organizational Research Methods, 26(1), 107–138. https://doi.org/10.1177/1094428121991230

Atkinson, C., Curnin, S., & Murphy-Gregory, H. (2022). Resilient and sustainable energy infrastructure: A systematic literature review protocol. Social Science Protocols, 5(1). https://doi.org/10.7565/ssp.v5.6608

Barberis Canonico, McNeese, N. J., & Duncan (2018). Machine learning as grounded theory: Human-centered interfaces for social network research through artificial intelligence. Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 62(1), 1252–1256. https://doi.org/10.1177/1541931218621287

Batra, Chaudhary, Bhatt, Varshney, & Verma (2020). A review: Abstractive text summarization techniques using NLP. In 2020 International Conference on Advances in Computing, Communication & Materials (ICACCM). Dehradun, India, August 21–22.

Beller, Clark, Tsafnat, Adams, Diehl, Lund, Ouzzani, Thayer, Thomas, Turner, Xia, Robinson, & Glasziou (2018). Making progress with the automation of systematic reviews: Principles of the international collaboration for the automation of systematic reviews (ICASR). Systematic Reviews, 7(1), 1–7. https://doi.org/10.1186/s13643-018-0740-7

Bettany-Saltikov & McSherry (2016). How to do a systematic literature review in Nursing: A step-by-step guide (2nd ed.). McGraw-Hill Education.

Bird, Klein, & Loper (2009). Natural language processing with Python: Analyzing text with the natural language toolkit (1st ed.). O’Reilly Media, Inc. http://www.foo.be/cours/dess-20122013/b/NaturalLanguageProcessingwithPython-O’Reilly2009.pdf

Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3(Jan), 993–1022. https://www.jmlr.org/papers/volume3/blei03a/blei03a.pdf?ref=https://githubhelp.com

Charmaz & Belgrave (2007). Grounded theory. The Blackwell encyclopedia of sociology. Wiley.

Chen & Lin (2020). Artificial intelligence in education: A review. In IEEE Access (Vol. 8, pp. 75264–75278). IEEE. https://doi.org/10.1109/ACCESS.2020.2988510

Dobravec, Matak, Sakulin, & Krajačić (2021). Multilevel governance energy planning and policy: A view on local energy initiatives. Energy, Sustainability and Society, 11(1), 2. https://doi.org/10.1186/s13705-020-00277-y

Domingos (2012). A few useful things to know about machine learning. Communications of the ACM, 55(10), 78–87. https://doi.org/10.1145/2347736.2347755

Dwivedi, Y. K., Hughes, Ismagilova, Aarts, Coombs, Crick, Duan, Dwivedi, Edwards, Eirug, Galanos, Ilavarasan, P. V., Janssen, Jones, Kar, A. K., Kizgin, Kronemann, Lal, Lucini, ... Williams, M. D. (2021). Artificial Intelligence (AI): Multidisciplinary perspectives on emerging challenges, opportunities, and agenda for research, practice and policy. International Journal of Information Management, 57, 101994. https://doi.org/10.1016/j.ijinfomgt.2019.08.002

Evans (2002). Systematic reviews of interpretive research: Interpretive data synthesis of processed data. Australian Journal of Advanced Nursing, 20(2). https://doi.org/10.3316/ielapa.405497388325103

Gielen, Gorini, Wagner, Leme, Gutierrez, Prakash, Asmelash, Janeiro, Gallina, & Vale (2019). Global energy transformation: A roadmap to 2050.

Gilgun (2014). Introduction to The Chicago School: Deductive qualitative analysis and grounded theory (Vol. 358). CreateSpace Independent Publishing Platform USA.

Gilgun, Jane (2019). Deductive qualitative analysis and grounded theory: Sensitizing concepts and hypothesis-testing. In The SAGE Handbook of Current Developments in Grounded Theory (pp. 107–122). SAGE Publications Ltd. https://doi.org/10.4135/9781526485656

GitHub. (2022a). Finding ways to contribute to open source on GitHub. GitHub Inc. https://docs.github.com/en/get-started/exploring-projects-on-github/finding-ways-to-contribute-to-open-source-on-github

GitHub. (2022b). Hello world. GitHub Inc. https://docs.github.com/en/get-started/quickstart/hello-world

Glaser, B. G., Strauss, A. L., & Strutzel (1968). The discovery of grounded theory; strategies for qualitative research. Nursing Research, 17(4), 364. https://doi.org/10.1097/00006199-196807000-00014

Goldstone, R. L., & Barsalou, L. W. (1998). Reuniting perception and conception. Cognition, 65(2-3), 231–262. https://doi.org/10.1016/S0010-0277(97)00047-4

Goyal, Pandey, & Jain (2018). Deep learning for natural language processing. Apress.

Gusenbauer & Haddaway, N. R. (2020). Which academic search systems are suitable for systematic reviews or meta-analyses? Evaluating retrieval qualities of Google Scholar, PubMed, and 26 other resources. Research Synthesis Methods, 11(2), 181–217. https://doi.org/10.1002/jrsm.1378

Hori, Kim, Kawase, Kimura, Matsui, & Machimura (2020). Local energy system design support using a renewable energy mix multi-objective optimization model and a co-creative optimization process. Renewable Energy, 156, 1278–1291. https://doi.org/10.1016/j.renene.2019.11.089

Jacobi, Van Atteveldt, & Welbers (2016). Quantitative analysis of large amounts of journalistic texts using topic modelling. Digital Journalism, 4(1), 89–106. https://doi.org/10.1080/21670811.2015.1093271

Kallimani, J. S. (2018). Survey on extractive text summarization methods with multi-document datasets. 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI). Bangalore, India, September 19–22.

Keele (2007). Guidelines for performing systematic literature reviews in software engineering. Technical report, ver. 2.3, EBSE technical report. EBSE.

Khan, K. S., Kunz, Kleijnen, & Antes (2003). Five steps to conducting a systematic review. Journal of the Royal Society of Medicine, 96(3), 118–121. https://doi.org/10.1177/014107680309600304

Kirmani, A. R. (2022). Artificial intelligence-enabled science poetry. ACS Energy Letters, 8(1), 574–576. https://doi.org/10.1021/acsenergylett.2c02758

Lame (2019). Systematic literature reviews: An introduction. In Proceedings of the Design Society: International Conference on Engineering Design. 22nd International Conference on Engineering Design, ICED 2019, Delft, Netherlands, August 5–8.

Landauer, T. K., Foltz, P. W., & Laham (1998). An introduction to latent semantic analysis. Discourse Processes, 25(2-3), 259–284. https://doi.org/10.1080/01638539809545028

Larsen, K. R., Hovorka, Dennis, & West, J. D. (2019). Understanding the elephant: The discourse approach to boundary identification and corpus construction for theory review articles. Journal of the Association for Information Systems, 20(7), 887–927. https://doi.org/10.17705/1jais.00556

Leeson, Resnick, Alexander, & Rovers (2019). Natural language processing (NLP) in qualitative public health research: A proof of concept study. International Journal of Qualitative Methods, 18, 160940691988702. https://doi.org/10.1177/1609406919887021

Le Glaz, Haralambous, Kim-Dufor, D.-H., Lenca, Billot, Ryan, T. C., Marsh, Devylder, Walter, Berrouiguet, & Lemey (2021). Machine learning and natural language processing in mental health: Systematic review. Journal of Medical Internet Research, 23(5), Article e15708. https://doi.org/10.2196/15708

Li & McCallum (2006). Pachinko allocation: DAG-structured mixture models of topic correlations. In Proceedings of the 23rd International Conference on Machine Learning. Pittsburgh, Pennsylvania, USA, June 25–29.

Liu, Zheng, & You (2006). Nonnegative matrix factorization and its applications in pattern recognition. Chinese Science Bulletin, 51(1), 7–18. https://doi.org/10.1007/s11434-005-1109-6

Longo (2020). Empowering qualitative research methods in education with artificial intelligence. Computer Supported Qualitative Research: New Trends on Qualitative Research (WCQR2019) 4.

Lund, B. D., & Wang (2023). Chatting about ChatGPT: How may AI and GPT impact academia and libraries? Library Hi Tech News. https://doi.org/10.2139/ssrn.4333415

Mahalakshmi, Kulkarni, Pradeep Kumar, Suresh Kumar, Nidhi Sree, & Durga (2022). The role of implementing artificial intelligence and machine learning technologies in the financial services industry for creating competitive intelligence. Materials Today: Proceedings, 56, 2252–2255. https://doi.org/10.1016/j.matpr.2021.11.577

Marshall, I. J., & Wallace, B. C. (2019). Toward systematic review automation: A practical guide to using machine learning tools in research synthesis. Systematic Reviews, 8(163), 1–10. https://doi.org/10.1186/s13643-019-1074-9

Millstein (2020). Natural language processing with Python: Natural language processing using NLTK. Frank Millstein.

Mohan, G. B., & Kumar, R. P. (2022). A comprehensive survey on topic modeling in text summarization. In Micro-Electronics and Telecommunication Engineering: Proceedings of 5th ICMETE 2021. Ghaziabad, India, September 24–25, pp. 231–240.

Moher, Shamseer, Clarke, Ghersi, Liberati, Petticrew, Shekelle, Stewart, L. A., & PRISMA-P Group. (2015). Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Systematic Reviews, 4(1), 1. https://doi.org/10.1186/2046-4053-4-1

Nguyen-Trung, Saeri, A. K., & Kaufman (2023). Applying ChatGPT and AI-powered tools to accelerate evidence reviews. https://doi.org/10.31219/osf.io/pcrqf

O’Connor, A. M., Tsafnat, Gilbert, S. B., Thayer, K. A., & Wolfe, M. S. (2018). Moving toward the automation of the systematic review process: A summary of discussions at the second meeting of international collaboration for the automation of systematic reviews (ICASR). Systematic Reviews, 7, 3–5. https://doi.org/10.1186/s13643-017-0667-4

Okoli (2015). A guide to conducting a standalone systematic literature review. Communications of the Association for Information Systems, 37(43). https://doi.org/10.17705/1CAIS.03743

Onah, D. F., Pang, E. L., & El-Haj (2022). A data-driven latent semantic analysis for automatic text summarization using LDA topic modelling. arXiv preprint arXiv:2207.14687.

Osisanwo, Jet, Awodele, Hinmikaiye, Olakanmi, & Akinjobi (2017). Supervised machine learning algorithms: Classification and comparison. International Journal of Computer Trends and Technology, 48(3), 128–138. https://doi.org/10.14445/22312803/ijctt-v48p126

Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, Hoffmann, T. C., Mulrow, C. D., Shamseer, Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, Glanville, Grimshaw, J. M., Hróbjartsson, Lalu, M. M., Li, Loder, E. W., Mayo-Wilson, McDonald, ... Moher (2021). The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ, n71. https://doi.org/10.1136/bmj.n71

Pati & Lorusso, L. N. (2018). How to write a systematic review of the literature. HERD: Health Environments Research & Design Journal, 11(1), 15–30. https://doi.org/10.1177/1937586717747384

Paul, Lim, W. M., O’Cass, Hao, A. W., & Bresciani (2021). Scientific procedures and rationales for systematic literature reviews (SPAR-4-SLR). International Journal of Consumer Studies, 45(4), O1–O16. https://doi.org/10.1111/ijcs.12695

Perrin, A. J. (2001). The CodeRead system: Using natural language processing to automate coding of qualitative data. Social Science Computer Review, 19(2), 213–220. https://doi.org/10.1177/089443930101900207

Petticrew & Roberts (2008). Systematic reviews in the social sciences: A practical guide. John Wiley & Sons.

Pfenninger, Hawkes, & Keirstead (2014). Energy systems modeling for twenty-first century energy challenges. Renewable and Sustainable Energy Reviews, 33, 74–86. https://doi.org/10.1016/j.rser.2014.02.003

Prabhakaran (2018). Topic modeling with gensim (Python). Machine Learning Plus. https://www.machinelearningplus.com/nlp/topic-modeling-gensim-python/#9createbigramandtrigrammodels

Priyadharshini & Magesh (2021). Improving text summarization using topic scoring and fuzzy logic approach.

Rahgozar & Inkpen (2019). Semantics and homothetic clustering of Hafez poetry. In Proceedings of the 3rd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature. Minneapolis, MN, USA, June 6–7.

Rahimi, Zahedi, & Mashayekhi (2022). A probabilistic topic model based on short distance co-occurrences. Expert Systems with Applications, 193, 116518. https://doi.org/10.1016/j.eswa.2022.116518

Siddaway, A. P., Wood, A. M., & Hedges, L. V. (2019). How to do a systematic review: A best practice guide for conducting and reporting narrative reviews, meta-analyses, and meta-syntheses. Annual Review of Psychology, 70(1), 747–770. https://doi.org/10.1146/annurev-psych-010418-102803

Smith (2021). Enhanced support for citations on GitHub. GitHub. https://github.blog/2021-08-19-enhanced-support-citations-github/

Sundaram & Berleant (2022). Automating systematic literature reviews with natural language processing and text mining: A systematic literature review. arXiv preprint arXiv:2211.15397. https://doi.org/10.48550/arXiv.2211.15397

Tavora (2018). Topic modeling. GitHub. https://github.com/marcotav/unsupervised-learning/tree/master/topic-modeling

Thomé, A. M. T., Scavarda, L. F., & Scavarda, A. J. (2016). Conducting systematic literature review in operations management. Production Planning & Control, 27(5), 408–420. https://doi.org/10.1080/09537287.2015.1129464

Tsafnat, Glasziou, Choong, M. K., Dunn, Galgani, & Coiera (2014). Systematic review automation technologies. Systematic Reviews, 3(74), 1–15. https://doi.org/10.1186/2046-4053-3-74

Ungless, E. L., Ross, & Belle (2023). Potential pitfalls with automatic sentiment analysis: The example of queerphobic bias. Social Science Computer Review, 1–9, Article 08944393231152946. https://doi.org/10.1177/08944393231152946

van de Schoot, de Bruin, Schram, Zahedi, de Boer, Weijdema, Kramer, Huijts, Hoogerwerf, Ferdinands, Harkema, Willemsen, Ma, Fang, Hindriks, Tummers, & Oberski, D. L. (2021). An open source machine learning framework for efficient and transparent systematic reviews. Nature Machine Intelligence, 3(2), 125–133. https://doi.org/10.1038/s42256-020-00287-7

Wang, Scells, Koopman, & Zuccon (2023). Can ChatGPT write a good Boolean query for systematic review literature search? arXiv preprint arXiv:2302.03495.

Wang, Bai, Stanton, Chen, W.-Y., & Chang, E. Y. (2009). PLDA: Parallel latent Dirichlet allocation for large-scale applications. In Algorithmic Aspects in Information and Management: 5th International Conference, AAIM 2009, San Francisco, CA, USA, June 15-17, 2009. Proceedings 5.

Warren, S. L., & Moustafa, A. A. (2023). Functional magnetic resonance imaging, deep learning, and Alzheimer’s disease: A systematic review. Journal of Neuroimaging: Official Journal of the American Society of Neuroimaging, 33(1), 5–18. https://doi.org/10.1111/jon.13063

Watanabe & Zhou (2020). Theory-driven analysis of large corpora: Semisupervised topic classification of the UN speeches. Social Science Computer Review, 40(2), 346–366. https://doi.org/10.1177/0894439320907027

Xiao & Watson (2019). Guidance on conducting a systematic literature review. Journal of Planning Education and Research, 39(1), 93–112. https://doi.org/10.1177/0739456x17723971

Yang & Siau, K. L. (2018). A qualitative research on marketing and sales in the artificial intelligence age. MWAIS 2018 Proceedings, 41. https://aisel.aisnet.org/mwais2018/41

Zucker, J.-D. (2003). A grounded theory of abstraction in artificial intelligence. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences, 358(1435), 1293–1309. https://doi.org/10.1098/rstb.2003.1308
