Revolutionising healthcare with artificial intelligence: A bibliometric analysis of 40 years of progress in health systems

Abstract

The development of artificial intelligence (AI) has revolutionised the medical system, empowering healthcare professionals to analyse complex nonlinear big data and identify hidden patterns, facilitating well-informed decisions. Over the last decade, there has been a notable trend of research in AI, machine learning (ML), and their associated algorithms in health and medical systems. These approaches have transformed the healthcare system, enhancing efficiency, accuracy, personalised treatment, and decision-making. Recognising the importance and growing trend of research in the topic area, this paper presents a bibliometric analysis of AI in health and medical systems. The paper utilises the Web of Science (WoS) Core Collection database, considering documents published in the topic area for the last four decades. A total of 64,063 papers were identified from 1983 to 2022. The paper evaluates the bibliometric data from various perspectives, such as annual papers published, annual citations, highly cited papers, and most productive institutions, and countries. The paper visualises the relationship among various scientific actors by presenting bibliographic coupling and co-occurrences of the author's keywords. The analysis indicates that the field began its significant growth in the late 1970s and early 1980s, with significant growth since 2019. The most influential institutions are in the USA and China. The study also reveals that the scientific community's top keywords include ‘ML’, ‘Deep Learning’, and ‘Artificial Intelligence’.

Keywords

Machine learning artificial intelligence in health health prediction medical systems bibliometric citation analysis web of Science AI in health

Introduction

Artificial intelligence (AI) is a potent tool widely used in medical healthcare systems during the past decade. AI segmentation techniques smartly identify targeted patients with a diagnosis or recognize any specific treatment that could work for them.¹ For example, popular AI-enabled image segmentation approaches – U-NET,² U-Net ++,³ VB-Net⁴ and Attention U-Net⁵ have been used for the segmentation of chest X-ray images to identify COVID-19. At the same time, AI techniques are embedded into medical devices to track the real-time behaviour of patients and enable health practitioners to have an efficient and informed decision-making process.⁶ AI synthesises patient data through sophisticated mathematical algorithms to identify hidden patterns and get optimal prediction results. Several AI methods apply to health management systems, from traditional machine learning (ML) to deep learning, Bayesian, evolutionary, neural network, and federated learning. Each approach performs differently depending on the dataset’s type, size, structure and sparsity.⁷ For example, a recurrent neural network (RNN) is an optimal method for predicting simple time series data related to postpartum depression.⁸ However, when the dimensionality of the time series data increases, the same approach becomes insufficient unless it incorporates an Induced Ordered Weighted Averaging Operator (IOWA)⁹ into RNN to address the nonlinearity of a dataset.^10,11

Over the last four decades, the healthcare sector has witnessed substantial growth in AI and its subsets, ML and deep learning (DL), as demonstrated in Table 1. This growth has been particularly pronounced in recent years, with AI revolutionising various aspects of medicine. AI applications now encompass,¹² remote patient monitoring,¹³ informed medical decision-making,¹⁴ and enhanced diagnostic processes.¹⁵ The increased adoption of AI can be attributed to the digitisation of medical records, technological advancements, and the refinement of AI algorithms, resulting in improved patient outcomes. Moreover, the emergence of the COVID-19 pandemic has further accelerated AI's integration into healthcare, with applications ranging from real-time monitoring of COVID cases to predictive analytics and symptom severity assessment using chest X-rays and CT scans.¹⁶ In line with this growing influence, researchers like Ding et al.¹⁷ have proposed innovative solutions such as a stroke prevention risk assessment model using ML and Logistic Regression. This model not only addresses the need for advanced healthcare solutions but also emphasises user-friendliness, particularly benefiting patients in less developed regions. Similarly, Cao et al.¹⁸ have developed a convolutional neural network (CNN) model tailored for predicting dose prescriptions in complex clinical scenarios like radiation therapy. Additionally, large language models (LLM) such as ChatGPT¹⁹ are now extensively employed in clinical and medical research, reshaping healthcare education, aiding in drug discovery, and refining personalised medicine.^20,21 Despite the promising results demonstrated by AI approaches in numerous medical studies, concerns raised by critics regarding the widespread adoption of AI in healthcare underscore the need for careful consideration and ethical implementation practices.^12–15

Table 1.

Annual evolution of the topic from 1983 to 2022.

Year	2022	2021	2020	2019	2018	2017	2016	2015	2014	2013
TP	15,828	15,804	10,676	7336	4247	2422	1454	1063	753	685
Year	2012	2011	2010	2009	2008	2007	2006	2005	2004	2003
TP	475	390	364	362	317	238	239	189	154	105
Year	2002	2001	2000	1999	1998	1997	1996	1995	1994	1993
TP	97	111	100	101	105	100	57	62	46	53
Year	1992	1991	1990	1989	1988	1987	1986	1985	1984	1983
TP	40	41	10	5	7	8	8	5	3	3

TP: total publications.

Challenges associated with integrating AI into healthcare can be classified into two primary domains: challenges in implementation and concerns regarding the credibility of the outcomes. Issues surrounding accountability, transparency, privacy, and permission²² underscore the complexities inherent in implementing AI technologies. Additionally, ethical considerations, including safety, accountability, security, and the scarcity of computational resources,^23–25 further compound the challenges. These factors collectively hinder the widespread adoption of AI in medical practice. Moreover, these advancements are met with significant challenges hindering the full integration of AI into medical practice. Algorithmic bias poses another challenge, influencing disease likelihood predictions, often tied to gender or race.²⁶ Furthermore, Beede et al.²⁷ identified that AI systems impose an extra workload on nurses to reassess false-positive results. Several studies have also highlighted the complexity of AI systems and the lack of suitable explainability, even in so-called explainable AI systems.^27,28 Weins et al.²⁹ found that, in reality, only a minimal number of AI tools out of several algorithms and approaches are actually implemented in medical practices, and most of them are less helpful than anticipated.³⁰ These challenges collectively impede the seamless integration of AI technologies into medical practice.

In the rapidly evolving landscape of AI in healthcare, the need to comprehend its vast impact necessitates robust methodologies. Bibliometric studies serve as vital tools, offering systematic analyses to unravel complexities, track trends, and quantify the influence of research efforts. By thoroughly examining scientific literature, bibliometric analysis provides invaluable insights into the dynamic interplay between AI and healthcare, guiding strategic decisions and fostering innovation in medical practice. These studies assist in understanding of any field, shedding lights on its historical development, current state, and emerging trends. Several bibliometric studies provide insight into the evaluation of specific topic across various domains. For instance, Khan et al.³¹ conducted a bibliometric analysis of finance papers, and Qiao et al.³² focused on the field of tourism and employed bibliometric indicators to analyse research trends in tourism spanning from 2008 to 2020. Some studies combined multiple domains, such as Wijewickrema³³ combined library, information sciences and information systems domains to analyse the current trend of combined multiple subjects. Some other research discusses a bibliometric study of particular topics, such as the evolution of blockchain,³⁴ the bibliometric study of the Particle Swarm Optimisation algorithm,³⁵ the bibliometric evaluation of Ordered Weighted Averaging Operator³⁶ and the bibliometric survey of analytic hierarchy process and technique for order preference by similarity to ideal solution methods.³⁷

Considering the significant impact of AI in medicine, bibliometric studies focusing on AI in healthcare have gained prominence.^38–40 Many studies have utilised bibliometric analysis in the field of health and medicine. For example, Tran et al.⁴¹ conducted a scientometric analysis to examine the role of AI in medicine, with a particular focus on diseases measured by disability-adjusted life years (DALYs). Similarly, Fosso Wamba and Queiroz³⁸ conducted a bibliometric study focusing on AI’s responsibility and ethical aspects in healthcare. Although both approaches^38,41 employed multiple bibliometric indicators to analyse the topic. However, the studies narrowed their focus by using specific keywords and focusing on particular healthcare issues, which led to the exclusion of several key papers. Similarly, in another study, Shaikh et al.⁴⁰ conducted a bibliometric survey to analyse the adoption of AI in the health sector and explored prominent entities contributing to the topic. However, it is worth noting that the timeframe spanned a mere 25 years, from 1996 to 2021. Moreover, the authors primarily focused their search on e-health related keywords, potentially overlooking a significant amount of research. Although the discussed approaches made valuable contributions, however, these studies needed to offer a comprehensive overview of AI's multifaceted and evolving role in the healthcare sector, spanning from its inception to the present.

Acknowledging the growing significance of AI in the healthcare sector and the proven effectiveness of bibliometric studies, covering quantitative insights into research trends, impact and productivity, this research employs the bibliometric analysis method to investigate the trajectory of the field over a four-decade period. The central aim of this paper is to dissect the evolution of AI and its subsets within the context of medical and health systems since their inception. Utilising bibliometric indicators, the study assesses both the topic's productivity and impact. The qualitative indicator measures the impact or quality of the work by measuring the citations received by different researchers. The quantitative indicator highlights the most productive institutions, journals, and countries. The study further evaluates the productivity, performance and impact of authors, journals, countries, and universities. Additionally, the paper analyses the bibliographic coupling of various parameters using VOSviewer software.

The rest of the paper is organised as follows: Bibliometric methods section discusses the methodology adopted for this study. Results section presents the findings of the study. This section first presents the publication and citation structure of the topic. The section then analyses the influential papers, most productive institutions, countries, and universities. Discussion and findings section presents the discussion, and Conclusion section concludes the paper with potential future research direction.

Bibliometric methods

Bibliometric analysis, a field within library and information sciences employs various indicators to assess scientific literature and analyse extensive bibliographic materials. The bibliometric indicators enable one to know the past, understand the current trend, and explore future research lines in a certain field. Bibliometric analysis provides valuable insight into various aspects of literature and research. The analysis enables one to identify the productivity of an individual, institutions, journals, countries and funding agencies using different key parameters.⁴² Additionally, the approach allows for identifying prevalent keywords within specific research domains, facilitating a deeper understanding of ongoing research trends and providing valuable guidance for researchers to steer their work in alignment with these trends. The use of bibliometric analysis has multiple folds. For potential contributors, the analysis helps understand the diffusion and impact of articles published in the journals and helps the researcher find a suitable journal for their paper submission. For funding agencies and institutions, the analysis serves as a valuable tool to assess the contribution of individual researchers and research groups.⁴³ The bibliometric study not only aids in understanding research trends but also assists researchers and funding agencies in making informed decisions about where to publish and how to proceed with decision-making regarding promotions, research priorities, collaborations and funding.⁴⁴

There are different methodologies to analyse bibliometric data and present the quantitative, qualitative and structural variation in a particular field. The quantitative indicator involves the measurement of productivity in the form of several publications. The qualitative indicator measures the impact or quality of research regarding the number of citations. The structural indicator evaluates the relationship among scientific actors through scientific mappings such as publications, authors, and institutions. These indicators are used to evaluate an individual’s performance, various groups of scientific actors, institutions, funding agencies, and countries.

This study uses bibliometric analysis to present an overview of AI in medical and health system trends. In this analysis, we examine multiple aspects of the journal using quantitative, qualitative, and structural indicators. The paper uses the number of publications and citations with different thresholds. The paper uses an h-index of contributing authors and institutions that effectively portray a scientific field's performance. Besides, to demonstrate the individual standing of contributing institutions, the paper evaluates their ranking using the Academic Ranking of World Universities (ARWU) and QS World University Ranking (QS). To further deepen the analysis and identify the various relationships among scientific actors, the paper presents a mapping analysis of the topic using VOSviewer software. The software combines bibliographic data to map bibliographic coupling and co-occurrences of the author’s keywords. Bibliographic coupling takes place when two articles cite the same third article. Co-occurrences of author keywords measure keywords that appear most frequently in research articles.

To achieve the objectives, the paper uses the web of science (WoS) core collection dataset for the previous four decades. We selected the WoS dataset for its distinguished reputation and several compelling reasons. Firstly, it is recognised globally as the oldest and most extensively utilised authoritative database for research publications and citations.⁴⁵ Secondly, gathering the necessary data for our analysis from this database is straightforward. Furthermore, the widespread acceptance and frequent usage of WoS within the scientific community⁴⁶ guarantee the reliability and consistency of our research findings. The dataset provides a large dataset with various bibliometric indicators and is widely used in different bibliometric analyses. The paper uses a Boolean equation to extract related records. The paper uses the keywords ‘Health’ OR ‘Medical’, OR ‘Medicine’ AND ‘Machine Learning’ OR ‘Deep Learning’ OR ‘Artificial Intelligence’, OR ‘AI’, OR ‘ML’. The publication periods are from 1983 to 2022, which include articles, editorials, reviews, notes, and letters. The comprehensive search has yielded a substantial corpus of 64,063 documents, forming the basis of an in-depth analysis of the evaluation and trends within the field.

Paper objectives

The paper provides a comprehensive overview of the role of AI in health and medical systems over the last four decades. It highlights the transformative impact of AI on healthcare, emphasising its contributions to efficiency, accuracy, personalised treatment, and decision-making.

The paper provides a bibliometric analysis of AI in health and medical systems, using the WoS core collection database. This analysis includes examining the productivity, impact, and trends in the field over the specified period.

The paper identifies the most influential institutions in the field, with a notable presence of the USA and China. This provides insights into the global distribution of contributions and key players driving research in AI for health and medical applications.

The paper explores the prominent keywords associated with AI in health, revealing that ‘Machine Learning,’ ‘Deep Learning,’ and ‘Artificial Intelligence’ are among the top keywords. This information reflects the focus and emphasis within the scientific community.

The paper discusses challenges associated with the adoption of AI in healthcare, including difficulties in implementation, concerns about credibility, accountability, transparency, privacy, and ethical issues. This discussion contributes to a more nuanced understanding of the barriers in integrating AI into medical practice.

The paper acknowledges and compares itself with previous bibliometric studies in the field, highlighting its unique contribution in offering a more comprehensive overview of AI's role in healthcare, spanning from its inception to the present.

The paper contributes methodologically by providing a detailed explanation of the bibliometric methods employed, including quantitative, qualitative, and structural indicators. It emphasises the significance of bibliometric analysis in understanding research trends and guiding decision-making for contributors, funding agencies, and institutions.

By presenting an extensive bibliometric analysis, the paper has the potential to identify research gaps and areas where further investigation or emphasis is needed in the evolving field of AI in health and medical systems.

Results

This section presents the analysis results of the bibliometric analysis, which is further divided into three subsections. First, the paper presents the quantitative indicators by giving the number of papers published in the topic area and citations received by those papers from 1983 to 2022. The paper then presents qualitative measures by showing influential papers regarding citation count, most productive journals, institutions and countries. The section also presents co-occurrences of the author’s keywords.

Publication and citation structure

This section provides an overview of the publication and citation trends within the topic area from 1983 to 2022. It is essential to note that the citation data presented in this study is based on information extracted from the WoS as of mid-2023. The chosen timeframe extends until 2022 to align with WoS's methodology, which computes citations, rankings, and other metrics midway through the year and finalises them by the middle of the following year. It is evident from Table 1 and Figure 1 that there has been a continuously growing trend in publications and citations for the last decade. However, since 2019, we have witnessed significant growth and research in this particular topic area. Over the last decade, from 2013 to 2022, 60,268 documents have been published on this topic, illustrating a notable increase in research activity in the area.

Figure 1.

Annual evolution of the topic.

This growth is due to technological advancements such as internet of things (IoT) enabled devices and natural language processing (NLP) techniques that made it easier to process large amounts of medical data.^7,47,48 Recent COVID-19 pandemics has accelerated the need for AI in healthcare, such as DL segmentation⁴⁹ for chest X-ray scans, Google–Apple partnership for AI-embedded contact tracing applications like COVIDSafe,⁵⁰ vaccine discovery,⁵¹ optimise the allocation of medical resources⁵² and many others. Personalised medicine's need is another factor in the growth of AI in healthcare such as CURATE.AI for liver transplant immunosuppression, etc.,⁵³ quadratic phenotypic optimization platform for multiple myeloma,⁵⁴ parabolic response surface for tuberculosis⁵⁵ and many others. AI algorithms can provide each patient with an optimised, personalised treatment plan.⁵⁶ Many current studies highlight the growing tendency of AI in personalised healthcare, such as combining the Phenotypic personalised medicine (PPM) platform⁵⁷ with the CURATE.AI platform⁵⁸ to give promising results and optimise treatment. In another study, AI-enabled human organ-chip systems create patient-specific medicine.⁵⁹ Likewise, the study⁶⁰ found that deep neural network (DNN) methods perform outstandingly in the personalised medication of Gilomas. Other factors, such as digitisation of data, efficiency, cost saving, and effective data processing with optimal results, are the reason for the growth of research in the field of AI in healthcare.

Table 2 highlights the citation structure of the topic over 40 years. Similar to research publications, there is a consistently growing trend in the number of citations from the inception. It is worth mentioning that after 2017, there was a remarkable increase in the number of citations. The number of citations after 2017 jumped from five to six figures, as shown in Figure 2.

Figure 2.

Annual citation structure of the topic: 1983–2022.

Table 2.

Annual citation structure of the topic from 1983 to 2022.

Year	2022	2021	2020	2019	2018	2017	2016	2015	2014	2013
TP	47,801	168,123	181,890	154,757	124,936	16,864	10,269	7210	4813	2982
Year	2012	2011	2010	2009	2008	2007	2006	2005	2004	2003
TP	1710	689	114	4022	3366	2491	2123	1632	1386	1225
Year	2002	2001	2000	1999	1998	1997	1996	1995	1994	1993
TP	961	984	836	600	472	367	276	220	147	112
Year	1992	1991	1990	1989	1988	1987	1986	1985	1984	1983
TP	46	31	26	27	21	10	4	0	1	0

TP: total publications.

In 2018, 4247 documents received 124,936 citations, a 740% and 1216% increase in citation rate to 2017 and 2016, respectively. Moreover, the analysis results show that 62,987 documents have received 719,645 citations in the last decade, with an average of around 12 citations per document. The complete structure of citations is presented in Table 2.

Table 3 presents the general citation statistics of the topic, comparing them with various benchmarks. The analysis results indicate that 49,048 documents received one or more citations, accounting for approximately 74% of the documents published in the topic area. Particularly, 18,826 documents received 10 or more citations, constituting about 28% of the published documents. Remarkably, 29 documents received 1000 or more citations, and two documents exceeded 5000 citations. Most of the uncited documents are recent publications that have not yet acquired any citations.

Table 3.

Citation structure of documents published in the topic area – AI in health.

Cited references	Number of publications	% of documents
≥01 citation	49,048	73.7963
≥10 citations	18,826	28.3251
≥50 citations	3745	5.6346
≥100 citations	1471	2.2132
≥300 citations	261	0.3927
≥400 citations	152	0.2287
≥600 citations	72	0.1083
≥1000 citations	29	0.0436
≥3000 citations	4	0.0060
≥5000 citations	2	0.0030

AI = artificial intelligence.

Influential papers

The second aspect of the study result is to highlight the qualitative measure of the bibliometric analysis. To conduct a comprehensive analysis of impactful publications on the topic, this study categorises the literature analysis into two distinct timeframes: from 1983 to 2002 and from 2003 to 2022. In the initial two decades (1983–2002), 962 papers were published within this domain. The most influential paper during this period was the research article ‘The Use of the Area Under the ROC Curve in the Evaluation of Machine Learning Algorithms’ by Bardley from the University of Queensland, acquiring 3913 citations.⁶¹ The paper discusses the receiver operating characteristics (ROC) curve in conjunction with ML methods for medical diagnostic problems. As depicted in Tables 5 and 6, the highest-cited article also focuses on the ROC approach. This suggests that the ROC benchmark to assess the performance of the classification model has not only sustained its popularity during the initial two decades but has also remained widely utilised in numerous medical diagnostic problems up to the year 2022. The top six most cited articles for 1983 to 2022 are presented in Table 4.

Table 4.

The top six most cited documents from 1983 to 2002.

R	Title	Authors	TC	DT	V	YR	C/Y
1	The use of the area under the ROC curve in the evaluation of machine learning algorithms	Bradley, AP	3913	J	30	1997	157
2	An introduction to multisensor data fusion	Hall, DL; Llinas, J	1521	J	85	1997	61
3	Logistic regression and artificial neural network classification models: a methodology review	Dreiseitl, S; Ohno-Machado, L	1122	J	35	2002	56
4	Allostasis and allostatic load: implications for neuropsychopharmacology	McEwen, BS	1066	J	22	2000	48
5	Machine learning for medical diagnosis: history, state of the art and perspective	Kononenko, I	788	J	23	2001	38
6	A new evolutionary system for evolving artificial neural networks	Yao, X and Liu, Y	591	J	08	1997	24

R: rank; TC: total citations; DT: document type; V: volume; YR: publication year; C/Y: citation per year; ROC: receiver operating characteristics.

Table 5.

The top six most cited documents from 2003 to 2022.

R	Title	Authors	TC	DT	V	YR	C/Y
1	An introduction to ROC analysis	Fawcett, T	11,521	J	27	2006	720
2	A survey on deep learning in medical image analysis	Litjens. G, et al.	5842	J	42	2017	1168
3	A survey on image data augmentation for deep learning	Shorten, C and Khoshgoftaar, TM	3521	J	6	2019	1174
4	Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs	Gulshan, V. et al.	3330	J	316	2016	555
5	Machine learning: trends, perspectives, and prospects	Jordan, MI and Mitchell, TM	3163	J	349	2015	452
6	Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning	Shin, HC, et al.	3025	J	35	2016	504

R: rank; TC: total citations; DT: document type; V: volume; YR: publication year; C/Y: citation per year; ROC: receiver operating characteristics; CNN: convolutional neural network.

Table 6.

The 30 most cited documents from 1983 to 2022.

R	Title	Authors	TC	DT	V	YR	C/Y
1	An introduction to ROC analysis	Fawcett, T	11,521	J	27	2006	720
2	A survey on deep learning in medical image analysis	Litjens, G, et al.	5842	J	42	2017	1168
3	The use of the area under the ROC curve in the evaluation of machine learning algorithms	Bradley, AP	3913	J	30	1997	157
4	A survey on image data augmentation for deep learning	Shorten, C; Khoshgoftaar, TM	3521	J	6	2019	1174
5	Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs	Gulshan, V, et al.	3330	J	316	2016	555
6	Machine learning: trends, perspectives, and prospects	Jordan, MI; Mitchell, TM	3163	J	349	2015	452
7	Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning	Shin, HC, et al.	3025	J	35	2016	504
8	Computational radiomics system to decode the radiographic phenotype	van Griethuysen, JJM, et al.	2573	J	77	2017	515
9	Deep learning in medical image analysis	Shen, DG; Wu, GR; Suk, HI	2127	S	19	2017	425
10	Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation	Kamnitsas, K, et al.	1866	J	36	2017	373
11	High-performance medicine: the convergence of human and artificial intelligence	Topol, EJ	1840	J	25	2019	613
12	Risk factors for suicidal thoughts and behaviours: a meta-analysis of 50 years of research	Franklin, JC, et al.	1748	J	143	2017	350
13	From local explanations to global understanding with explainable AI for trees	Lundberg, et al.	1676	J	2	2020	838
14	Prediction models for diagnosis and prognosis of COVID-19 infection: systematic review and critical appraisal	Wynants, L, et al.	1631	J	369	2020	816
15	Convolutional neural networks for medical image analysis: full training or fine tuning?	Tajbakhsh, et al.	1627	J	35	2016	271
16	An introduction to multisensor data fusion	Hall, DL; Llinas, J	1521	J	85	1997	61
17	A Survey on human activity recognition using wearable sensors	Lara, OD; Labrador, MA	1496	J	15	2013	166
18	Identifying medical diagnoses and treatable diseases by image-based deep learning	Kermany, DS, et al.	1428	J	172	2018	357
19	Predicting the future – big data, machine learning, and clinical medicine	Obermeyer, Z; Emanuel, EJ	1356	J	375	2016	226
20	nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation	Isensee, F, et al.	1347	J	18	2021	1347
21	Machine learning in medicine	Deo, RC	1340	J	132	2015	191
22	Convolutional neural networks: an overview and application in radiology	Yamashita, R, et al.	1303	J	9	2018	326
23	Deep learning and its applications to machine health monitoring	Zhao, R, et al.	1284	J	115	2019	428
24	Artificial intelligence in radiology	Hosny, A; Parmar, C; Quackenbush, J; Schwartz, LH; Aerts, HJWL	1227	J	18	2018	307
25	A guide to deep learning in healthcare	Esteva, A, et al.	1189	J	25	2019	396
26	Machine Learning in Medicine	Rajkomar, A; Dean, J; Kohane, I	1154	J	380	2019	385
27	Automated detection of COVID-19 cases using deep neural networks with X-ray images	Ozturk, T, et al.	1143	J	121	2020	572
28	Using AUC and accuracy in evaluating learning algorithms	Huang, J; Ling, CX	1142	J	17	2005	67
29	Breast cancer statistics, 2017, racial disparity in mortality by state	DeSantis, CE, et al.	1128	J	67	2017	226
30	Deep neural networks: a promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data	Jia, F, et al.	1124	J	72–73	2016	187

R: rank; TC: total citations; DT: document type; V: volume; YR: publication year; C/Y: citation per year; CNN: convolutional neural network; ROC: receiver operating characteristics; AI: artificial intelligence 1; CRF: conditional random fields; AUC: area under curve.

In the second timeframe from 2003 to 2022, the most cited article, published in 2006, was ‘An Introduction to ROC Analysis’, authored by Tom Fawcett from the Institute for the Study of Learning and Expertise, USA.⁶² It is noteworthy that this article has attracted a remarkable 11,521 citations. As earlier highlighted, the ROC graph remains a key topic in healthcare research and applications. In this article, the author highlighted various features and issues of the ROC graph when used in research and practice to visualise and evaluate classifiers.

Over the last decade, there has been a significant trend towards utilising DL algorithms in numerous healthcare problems, sparking a growing interest among researchers. Among the top 6 highly cited articles during the period, four articles directly discussed the application of the DL method. The success of DL algorithms is due to the advancements in graphic processing units (GPUs), the abundance of large-scale data, and continuous developments in learning algorithms.⁶³ Hence, the topic holds substantial potential for new researchers to apply the algorithm and explore and analyse its applications across diverse healthcare domains. The top six most cited article for the last two decades, 2003–2022, is presented in Table 5.

The study further highlights the most influential paper of all time from 1983 to 2022 and presents the top 30 most cited articles, as presented in Table 6. The analysis results show that the top article – ‘An introduction to ROC analysis’, has kept its position as the top cited paper since 2006. Similar is the case of the second top-cited article – ‘A survey on deep learning in medical image analysis’, which has kept its dominance since its publication. The article was published in 2017 by Geert Litjens et al.⁶⁴ from the Diagnostic Image Analysis Group, Radboud University Medical Center, the Netherlands. It is worth mentioning that although the article was published in 2017, while in a few years, the article has received 5842 citations with an impressive citation rate per year (C/Y) of 1168 citations. This is the second-highest C/Y among all articles published on the topic. The paper summarises various deep-learning algorithms in medical image analysis. The paper comprehensively summarised more than 300 articles discussing DL algorithms in various applications. The paper first identified challenges for various DL algorithms in medical imaging tasks and then highlighted contributions by various authors to address those challenges. Moreover, it is important to note that 36% of the top 30 most cited articles used the keyword ‘DL’. This statistic highlights the potency of a keyword for a researcher aiming to enhance the visibility of their work and publish impactful papers.

The analysis of impactful papers reveals another key finding. Within the list, a survey paper – ‘A survey on Image Data Augmentation for Deep Learning’, stands out, receiving a notable 3521 citations with a C/Y of 1174. The article was published in 2019 and authored by Connor Shorten and Taghi M. Khoshgoftaar from Florida Atlantic University, USA. The paper is one of the most influential and impactful papers from the field that has attracted significant attention among the community and has received many citations in just a few years. It comprehensively discusses image augmentation algorithms and explores various characteristics of Image Data Augmentation problems in numerous medical applications.

Productive journals

This section highlights the most productive venues that contributed to the topic area. Table 7 presents the top 20 productive venues that published articles in the topic area. These venues are categorised based on – ‘TP’, the total number of publications; ‘TC’, the total number of citations received to those papers; ‘H’ H- index of the journal; and ‘IF’ impact factor of a journal. To further deepen the quality of articles published in those journals, the paper analyses the citation structure of articles by highlighting the papers that have received citations more than equal to – 500, 300, 100 and 50 citations.

Table 7.

Highly productive journals published articles on AI in the medical system.

R	Publication titles	TP	TC	H	IF	≥500	≥300	≥100	≥50
1.	IEEE Access	1410	20,849	62	3.476	1	5	31	78
2.	Lecture Notes in Computer Science	1350	11,578	46	—	2	4	17	41
3.	Sensors	950	10,694	46	3.847	0	1	13	41
4.	Proceedings of SPIE	827	2386	21	—	0	0	1	4
5.	Scientific Reports	706	12,985	51	4.997	5	7	17	57
6.	Applied Sciences Basel	689	4532	30	2.838	0	0	3	15
7.	PLoS ONE	545	9423	48	3.752	0	0	14	46
8.	International Journal of Environmental Research and Public Health	455	4145	29	4.614	1	1	2	13
9.	Journal of Medical Internet Research	436	7782	43	7.077	0	2	14	37
10.	Lecture Notes in Artificial Intelligence	418	1509	18	—	0	0	0	1
11.	Computers in Biology and Medicine	395	6345	37	6.698	1	2	9	23
12.	Journal of Biomedical Informatics	367	8915	44	8.000	2	2	10	41
13.	Diagnostics	349	1457	19	3.992	0	0	0	5
14.	Journal of the American Medical Informatics Association	346	9254	48	7.942	1	3	16	46
15.	Computer Methods and Programs in Biomedicine	332	6132	40	7.027	0	0	11	31
16.	IEEE Journal of Biomedical and Health Informatics	326	6778	37	7.021	1	2	14	29
17.	Artificial Intelligence in Medicine	325	8404	47	7.011	1	2	9	40
18.	Multimedia Tools and Applications	323	2218	23	2.577	0	0	3	6
19.	JMIR Medical Informatics	322	2166	21	3.228	0	0	2	6
20.	BMC Medical Informatics and Decision-Making	312	3667	30	3.298	0	0	5	18

R: rank; TP: total publications; TC: total citations; H: H-index; IF: impact factor.

Table 7 shows that IEEE Access is at the top of the list, producing 1410 documents that attracted 20,849 citations. The journal publishes open-access articles. The journal has an impact factor of 3.476 with an H-index of 62. Notably, the journal has published six articles with over 300 citations and 115 documents with over 50 citations. The second most productive venue is Springer’s Lecture Notes in Computer Science, having an H-index of 46, which published 1350 documents on the topic. The documents received 11,578 citations, with 23 articles receiving more than 100 citations. The third productive journal in the list is Sensor, which has published 950 documents with 10,694 citations. The journal has an impact factor of 3.847 with an H-index of 21. Like IEEE Access, the journal publishes open-access articles. Notably, 706 articles in the journal Scientific Reports have received 12,985 citations, the second most cited articles in the topmost productive venues.

Most productive and influential institutions

This set of analysis – Table 8 presents the most productive and influential institutions in relation to their contributions to the topic. The comparative analysis is based on – the number of papers, number of citations, H-index, citation per paper, and number of papers having more than 100, 300, 500, 1000 and 3000 citations. The 2023 QS World University Ranking (QS) score and Shanghai ARWU is presented to further analyse the quality and ranking of each contributing institution.

Table 8.

Twenty-five most productive and influential institution's contributions.

R	Institution	CNT	TP	TC	≥100	≥300	≥500	≥1000	≥3000	H	TC/TP	QS Rank	ARWU
1	Harvard University	USA	1748	50,141	95	20	16	7	0	1527	29	5	1
2	University of California System	USA	1730	46,055	74	20	10	4	1	1644	27	27	5
3	University of London	UK	1094	25,204	56	15	5	0	0	1112	23	125	201–300
4	Harvard Medical School	USA	1041	34,444	64	17	13	6	1	1219	33	5	1
5	Chinese Academy of Sciences	CHN	942	18,539	39	10	3	0	0	800	20	51–100	-
6	UDICE French Research Universities	FRN	878	13,247	26	5	2	0	0	1187	15	52	40
7	University of Texas System	USA	871	18,513	23	5	3	2	1	1113	21	72	37
8	Stanford University	USA	851	25,021	48	18	6	0	0	1137	29	3	2
9	State University System of Florida	USA	723	17,132	24	6	4	3	0	818	24	188	94
10	Egyptian Knowledge Bank	EGY	708	8634	13	1	0	0	0	338	12	-	-
11	Pennsylvania Commonwealth System of Higher Education	USA	703	14,006	24	7	3	1	0	951	20	13	15
12	University of Toronto	CND	664	14,229	27	5	3	0	0	963	21	34	22
13	Johns Hopkins University	USA	630	12,477	26	1	1	0	0	1030	20	24	14
14	Massachusetts General Hospital	USA	568	13,801	34	5	3	0	0	916	24	5	1
15	University of Washington	USA	566	11,510	22	3	1	1	0	999	20	5	1
16	Centre National De La Recherche Scientifique	FRN	549	7840	15	3	1	0	0	990	14	-	-
17	University of Washington Seattle	USA	543	11,140	22	3	1	1	0	998	21	80	17
18	University of Michigan System	USA	541	13,036	29	5	2	0	0	960	24	25	28
19	University of Michigan	USA	540	13,035	29	5	2	0	0	960	24	25	28
20	University of Pennsylvania	USA	535	13,929	23	5	4	2	0	924	26	13	15
21	University College London	UK	522	15,978	40	10	3	0	0	880	31	8	18
22	Imperial College London	UK	511	15,276	25	9	4	2	0	824	30	6	23
23	University of Oxford	UK	511	12,147	23	4	1	1	0	965	24	4	7
24	Institut National De La Sante Et De La Recherche Medicale	FRN	509	7572	10	2	0	0	0	826	15	-	-
25	Zhejiang University	CHN	507	6302	9	1	0	0	0	411	12	42	36

R: rank; CNT: country; TP: total publications; TC: total citations; QS rank: QS World University Ranking score; ARWU: Academic Ranking of World Universities.

The analysis shows that American universities dominate the list by securing 56% of the top 25 most productive institutions in the topic area, followed by the UK. Top-ranked American institutions contributing to the topic are Harvard University, the University of California, Stanford University, the Pennsylvania Commonwealth System of Higher Education, Massachusetts University, the University of Washington, the University of Pennsylvania and the University of Michigan.

Harvard University is on the top of the list with total publications 1748 that attracted 50,141 citations with an average of 29 citations per paper. Notably, 95 publications have achieved more than 100 citations, including seven articles that attracted over 1000 citations. The university has an H-index of 1527 and is on top of the list in the ARWU ranking and fifth in the QS ranking. Moreover, from the Harvard Medical School, 1041 publications have achieved 34,444 citations. The second most productive institution from the list is the University of California, which produced 1730 documents in the topic area that attracted 46,055 citations with an average citation rate of 27 per paper. Notably, five documents have received over 1000 citations, with one paper reaching 3068 citations. The third most productive institution is the University of London in the UK. The institution has produced 1094 documents that received 25,204 citations with a citation rate of 23 per paper. Three French institutions – UDIC French Research Universities, French National Centre for Scientific Research and National Institute of Health and Medical Research were sixth, 16th and 24th, respectively. Chinese Academy of Science secured fifth place in the list with 942 publications that received 18,539 citations.

The paper now discusses the bibliographic coupling of institutions published in the topic area. The paper uses a minimum publication threshold of 200 documents of an organisation with a minimum number of 100 citations and 10 links. Of 38,429 organisations, 83 organisations meet the criteria.

Figure 3 shows four apparent clusters – yellow, red, blue and green. The yellow cluster is comprised of six nodes, and Harvard Medical School is the largest node among all nodes, with a total link strength of 1363. The red cluster consists of 35 nodes. Most of the nodes in this cluster are American institutions, with a prominent node of Stanford University with a total link strength of 758. Other obvious nodes in the cluster are the University of Michigan, the University of Washington, Duke University and the University of Toronto. The green cluster comprised the majority of Asian universities majority. Chinese universities lead the cluster, with the most obvious node from the Chinese Academy of Science having a link strength of 786 links with other universities. The cluster also shows a coupling among Australian universities such as – the University of Sydney, the University of Melbourne, and the University of New South Wales, Australia has an obvious coupling, and the University of Sydney Australia has a strong connection with Shanghai Jiao Tong University, China. Apparent nodes in the blue cluster are the European universities, with the most obvious node being the University of Oxford, with a total link strength of 398 links. The university strongly connects with the University of Edinburgh, Imperial College London and the University of California Los Angeles. Generally, we can see stronger connections among institutions from the same country, indicating that researchers collaborate more within the country than between countries.

Figure 3.

Bibliographic coupling of institutions published in the topic area: minimum publication threshold of 200 documents with 100 citations and minimum 10 links.

Most productive and influential countries

To analyse the most productive and influential countries that contributed to the topic area, the paper lists the top 10 most productive and influential countries as presented in Table 9. Overall, there has been an increasing trend in publications since 2018. Recent pandemics are one of the factors for an increased number of research in the topic area. A region-wise pictorial representation of the publications is presented in Figure 4. The paper considers a 40-year time frame, from 1983 to 2022 and compares countries based on the number of publications and their citations. To further deepen the quality and impact of publications, the paper analyses the number of published documents that received greater than or equal to 10,000, 5000, 1000 and 500 citations.

Figure 4.

Region-wise topic area publications.

Table 9.

The most productive and influential countries contributed to the topic.

R	Country	TP	TC	Annual evaluation of publication and citation structure							Citations benchmark
R	Country	TP	TC	2022	2021	2020	2019	2018	2017	1983 to 2016	≥ 10,000	≥5000	≥1000	≥500
1	USA	TP	19,292	3879	4481	3333	2386	1532	799	2882	1	0	21	59
1	USA	TC	380,429	5110	30,419	56,188	64,003	57,910	33,898	132,901	1	0	21	59
2	China	TP	11,964	3844	3197	2028	1337	689	311	558	0	0	3	16
2	China	TC	99,258	4618	40,145	16,741	5449	889	1766	29,650	0	0	3	16
3	India	TP	5753	1943	1564	891	531	303	188	333	0	0	1	3
3	India	TC	48,904	21,507	14,677	6291	2974	1388	620	1447	0	0	1	3
4	England	TP	4858	1121	1241	880	579	306	181	550	0	0	2	11
4	England	TC	94,703	32,986	27,543	13,760	6696	3390	1998	8330	0	0	2	11
5	Germany	TP	3340	846	837	595	394	213	95	360	0	0	2	7
5	Germany	TC	51,932	18,298	15,077	7788	3758	1722	1038	4251	0	0	2	7
6	Canada	TP	3131	712	727	430	277	168	85	732	0	0	1	7
6	Canada	TC	45,530	16,767	11,858	6012	3247	1673	1094	4879	0	0	1	7
7	Australia	TP	2699	712	727	430	277	168	85	300	0	0	1	2
7	Australia	TC	45,530	16,767	11,858	6012	3247	1673	1094	4879	0	0	1	2
8	Italy	TP	2658	765	698	487	240	132	74	262	0	0	0	1
8	Italy	TC	35,562	12,855	10,282	4940	2422	1265	809	2989	0	0	0	1
9	South Korea	TP	2652	805	709	507	284	145	56	146	0	0	1	2
9	South Korea	TC	35,526	13,657	10,535	5337	2498	1165	546	1788	0	0	1	2
10	Spain	TP	2247	534	552	373	247	137	82	322	0	0	0	0
10	Spain	TC	28,596	9963	7653	3763	2064	1240	925	2988

R: rank; TP: total publications; TC: total citations.

The analysis results show that the USA is on the top of the list with 19,292 publications that were cited 380,429 times. There has been a consistent increase since 2017. It is worth mentioning that 59 documents have received over 500 citations, 21 documents with over 1000 citations, and a document with 10,000 citations. China is second on the list, with a total publication of 11,964 that attracted 99,258 citations. Since 2018, there has been a good increase in the number of publications, particularly during 2020, 2021 and 2022. During these 3 years, the country produced 75% of all-time documents. Sixteen documents received over 500 citations. India is in third place with 5753 publications, followed by England, Germany, Canada and Australia.

To further analyse the dependencies and interrelationships among countries, the paper presents the bibliographic coupling of countries that contributed to the topic area, as presented in Figure 5.

Figure 5.

Bibliographic coupling of countries published in the topic area with a threshold of 100 documents with 10 citations with a minimum strength of 80.

The paper uses a minimum threshold of 100 publications per country, with a minimum of 10 citations and a minimum strength of 80 links. Out of 182 countries, 67 countries met the threshold. The USA dominated the figure with the most connections with other nodes, followed by China. Other nodes such as India, England, Australia, Canada, Germany, Italy and South Korea have many couplings with other countries. Moreover, Spain, France, Netherlands, Pakistan and Saudi Arabia are emerging in the topic area. Some small nodes, such as Turkey, Spain, Mexico, and Egypt, connect with other countries, including the USA, but not with China.

Co-occurrences of the author's keyword

This section examines the co-occurrences of the author's strong keywords, offering insights into their frequent use, as presented in Table 10 and Figure 6. This bibliometric indicator enables the researcher to identify the most frequent keywords used in the topic area and gain insight into the topic and trends that interest new researchers. The paper uses a threshold of minimum co-occurrences of 250 keywords. Of 90,187 keywords, 93 met the threshold with a minimum of 25 occurrences.

Figure 6.

Co-occurrences of author's keyword for 40 years – minimum occurrences threshold of 250 keywords.

Table 10.

Co-occurrences of the top 30 authors’ keywords.

Number	Keyword	Occurrences	Total link strength
1	Machine Learning	16,102	18,298
2	Deep Learning	11,235	15,406
3	Artificial Intelligence	7349	9412
4	Convolutional Neural Network	2789	2744
5	COVID-19	2468	4322
6	Classification	1871	3240
7	Feature Extraction	1241	3204
8	Transfer Learning	1025	2025
9	Big Data	1047	1979
10	Natural Language Processing	1204	1958
11	Medical Imaging	798	1650
12	Data Mining	938	1552
13	Neural Networks	869	1540
14	Prediction	882	1474
15	Training	454	1411
16	Image Segmentation	638	1391
17	Healthcare	754	1364
18	Segmentation	774	1349
19	Random Forest	793	1227
20	Internet of Things	624	1195
21	Diseases	329	1161
22	Breast Cancer	792	1129
23	Precision Medicine	676	1113
24	Data Models	325	1103
25	Medical Diagnostic Imaging	298	1103
26	Support Vector Machine	661	1096
27	CNN	638	1085
28	Electronic Health Records	700	1077
29	Feature Selection	727	1067
30	Computer Vision	549	1062

CNN: convolutional neural network.

The keyword ‘Machine Learning’ holds the leading position, appearing 16,102 times with a substantial link strength of 18,298. This term refers to a generic phrase that encompasses a wide range of algorithms employing an AI-based method to solve various problems. The keyword ‘Deep Learning’ took the second position with 11,235 occurrences and a total link strength of 15,406. As discussed in the above section, over the past decade, DL algorithms have received significant interest among researchers, as evidenced by their appearances in several top-cited literature. Using multiple layers of neurons with hierarchical feature abstraction makes DL an ideal choice for obtaining optimal results in complex predictions and decision-making processes. The keyword “Artificial Intelligence” draws attention with 7349 occurrences and a total link strength of 9412, highlighting its significance as a broad term covering diverse AI methodologies. CNN comes in the fourth position with 2789 occurrences. CNN, a deep neural network, is broadly used in medical image classification tasks due to its exceptional feature extraction capabilities. Some new keywords have emerged in the last few years, such as COVID-19, which appeared 2468 times with a link strength of 4322. This shows the impact of recent global events on research priorities, particularly in healthcare and epidemiology. Additionally, NLP, with 1204 occurrences, shows the ongoing relevance and exploration of language-related AI applications. Recent advancements in NLP algorithms, particularly in the form of Generative AI tools, have revolutionised many sectors, including healthcare. In future, there is great potential in the research area to further explore these tools and their applications in healthcare.

Discussion and findings

AI has been studied and applied in health and medical-care systems since the late 70s. In the 80s and mid-90s, the use of AI in the health system was minimal, as evident from Table 1. There was very limited research for the first 14 years – from 1983 to 1996, with a few publications. However, the research trend has changed with technological advancement, better sophistication of AI algorithms, and the capability of a system to analyse large amounts of clinical data. Likewise, the number of publications gradually increased to three and then four-digit figure numbers.

The development of NLP in the 2000s revitalised the use of AI in medicine by bridging communication between humans and machines. Those algorithms analyse clinical notes such as consultation reports, operative reports, and discharge instructions as analysis data and infer a personalised treatment plan for any individual patient based on existing records. During the last decade, we have seen a significant increase in the use of AI in medicine. Recent advancements in AI-based image processing algorithms such as – Generative Adversarial Networks,^65,66 RIMD⁶⁷ Style Transfer,^68,69 DeepLesion,⁷⁰ RetinaNet,⁷¹ CheXNet,⁷² lifelong machine learning,⁷³ and several other algorithms enable medical practitioners to scan and analyse medical images and enable them for early detection of any illness and an informed decision-making process.

The key findings from the analysis are presented as follows:

Evolution of AI research in healthcare:

In 1965, the pioneering DENDRAL project⁷⁴ laid the foundation for AI-based medical expert systems. It revolutionised problem-solving in chemistry and medicine through the innovative application of heuristic programming techniques. During the early 80 s, there was significant interest in adopting AI-based expert systems in healthcare. Therefore, the study in⁷⁴ chooses to commence its analysis from 1983. Since then, the study observed a gradual growth in research and utilisation of AI in healthcare. In the first 14 years, from 1983 to 1996, the number of publications in the topic area constantly remained in the double-digits. This was due to technological limitations, computational capacity, data availability and lack of awareness. Then, for the subsequent 7 years, spanning until 2003, the research volume in this domain remained relatively stable, averaging 102 publications per year. This was the time when the widely acclaimed AI algorithm, long short-term memory,⁷⁵ a type of RNN, was proposed. The approach has been widely used in several key medical issues, such as cancer prediction,⁷⁶ heart disease prediction⁷⁷ and many others. Over the last two decades, there has been consistent growth in the field, with a notable surge in the application of AI in healthcare during the past decade, resulting in increased research and publications, as evident from Table 1. The significant improvement can be largely attributed to advancements in technology, better computational capacity, improved performance, and AI algorithm sophistication in image recognition and prediction that are widely used in healthcare.⁷⁸ Over the past 3 years, starting from 2020, there has been a significant upswing in the number of publications, surpassing 10,000 articles annually. These years have seen a substantial uptick in applying AI methodologies for predicting COVID-19, with widespread adoption of Generative AI and other DL methods in the healthcare sector.

Citation trends – insight and research direction

The citation structure of the topic over 40 years shows that there has been a consistent, gradual increase in the number of citations from its inception. About 74% of documents published in the topic area have received at least a single citation, with 29 publications that have received more than 1000 citations. The analysis results show that since 2016, the annual number of citations has been at least 10,000. Particularly, since 2018, the topic received a total of 677,507 citations. This demonstrates the increasing interest of researchers and the growing trend of research in the field. A survey paper – An Introduction to ROC Analysis by Tom Fawcett, published in 2006, received 11,141 citations, the highest in the topic area. In this paper, the author discusses the ROC graph commonly used in medical decision-making. ROC analysis remains a key topic today that has been extensively used in classification and prediction problems across various domains. The second influential paper in the topic area is a survey paper – ‘A survey on deep learning in medical image analysis’ by Geert Litjens et al., published in 2017. The paper has made a notable scientific impact with 5017 citations and a citation rate of 1003 citations per year (C/Y), the highest among the top 50 cited papers. In this paper, the authors conducted a comprehensive survey of 300 papers, discussing DL algorithms used for medical image processing. The paper highlighted key aspects of successful deep-learning methods and discussed distinct challenges within medical image processing. It is worth mentioning that DL algorithm has been extensively used in healthcare in recent years due to their robust capabilities in medical image and predictive analysis. The citation structure shows that 38% of the top 50 cited papers discussed or applied DL methods. Additionally, the co-occurrences of the author's keywords corroborate this, with ‘DL’ being the second most prominent keyword, appearing 11,235 times. The evidence supports DL as the key algorithm for future research directions.

Prominent contributors – journals, institutions, and countries

The analysis results show that IEEE Access is the most productive journal contributed to the topic. The journal published 1410 articles that attracted 20,849 citations. Notably, 78 papers in the journal have received 50 or more citations. IEEE Access, being a multidisciplinary journal, covers a broad range of topics, including AI predictions in healthcare, diagnostics optimisation, and AI-based healthcare decision-making. Moreover, its open-access policy and rapid publication cycles enhance accessibility and dissemination. The second highly productive venue is the Lecture Notes in Computer Science (LNCS), which has published 1350 documents that have received 11,578 citations. LNCS primarily encompasses conference papers, contributing significantly to the field. The analysis also highlights another influential open-access journal – Scientific Reports. With 706 published documents and an impressive 12,985 citations, the journal holds a significant contributor to the topic. Notably, it contributes five papers with over 500 citations and 57 articles with more than 50 citations. Despite being relatively young in the field, since its inception in 2011, the journal consistently publishes top-quality research on the topic.

Several renowned and top-ranked institutions, such as Harvard University, the University of California, the University of London, Stanford University, the University of Washington, Massachusetts University and the University of Oxford, have contributed significantly to the topic. The major contribution is by Harvard University, including its Medical School, which has produced 2789 papers that attracted more than 84,585 citations. The bibliographic coupling of institutions revealed that institutions from the same country tend to have stronger connections or collaborations than institutions from different countries. This is due to common funding sources, geographic proximity and common research interests.

Country-wise annual analysis shows that the USA is at the top of the list and has been consistently contributing to the topic since its inception. The country has contributed 19,292 documents that have attracted 380,429 citations. It is worth mentioning that 21 documents have received 1000 citations, and 59 documents received over 500 citations. China is second on the list, with 11,964 publications that attracted 99,258 citations. China has contributed significantly to the topic in recent years, possibly due to the country’s ambitious goal of becoming a global AI superpower by 2030.⁷⁹ Furthermore, the bibliographic coupling analysis indicates that the USA and China are at the top of the list, exhibiting the highest number of connections with nearly every contributing country.

It is also important to note that companies such as Google (DeepMind), Microsoft, IBM, Apple and other big AI contributor companies start to have great impact in developing AI tools for healthcare.⁸⁰ For instance, Google's DeepMind has been working on several projects that aim to improve healthcare outcomes.⁸¹ One of the most significant breakthroughs is the development of AlphaFold, an AI program that predicts the shape of proteins by generating a 3D model. This has helped scientists to view protein structures and has actively helped to accelerate biological research and future drug discovery. DeepMind has also been working on developing AI-based solutions that can help diagnose eye diseases and detect breast cancer. Microsoft has been working on developing AI-based solutions that can help improve healthcare outcomes. One of the most significant breakthroughs is the development of an AI-based system that can help diagnose cervical cancer.⁸² The system uses ML algorithms to analyse images of the cervix and identify abnormal cells. IBM has been working on developing AI-based solutions that can help improve healthcare outcomes. One of the most significant breakthroughs is the development of Watson for Oncology, an AI-based system that can help oncologists to identify personalised cancer treatments.

Conclusion

This paper presents a bibliometric analysis of the literature on AI and its associated methods and algorithms in health and medical systems. The paper’s primary objective is to evaluate and observe the evolution and trends of this topic from a publication perspective. The analysis results reveal that research on this topic has been conducted since the early 1980s, with a gradual increase in publications and citations. Over the last decade, there has been a surge of interest in this research area due to advancements in technology, easy access to medical data, sophistication in AI algorithms, and the efficiency of making optimal informed decisions. The COVID-19 pandemic has significantly contributed to the increased attention on this topic among researchers. The analysis results revealed that the USA has been a major contributor to research on this topic since its inception, with many top-ranked American institutions involved. Open-source journals such as IEEE Access, Sensors, Scientific Reports, and others have significantly published research on this topic. Chinese researchers have emerged as the leading contributors to this area of study. The most commonly used keywords among researchers are ‘ML’ and ‘DL’.

Finally, the analysis shows a general overview of existing research till 2022, and future results may vary due to bibliometric data limitations, potentially yielding unexpected changes. While this work demonstrates an overall bibliometric analysis on a leading research track for digital health, it is important to conduct further review studies that discuss more technical details within narrower scope to provide a useful guidance to a fast-growing research field. This will be our aim for future research.

Footnotes

Acknowledgements

The authors express their gratitude to all the institutions that have provided support for this endeavour and are detailed in the funding section.

Contributorship

WH designed the review with input from HG and FR. All authors screened articles, analysed the data, and drafted the manuscript. Decisions regarding eligibility for inclusion, risk of bias and data extraction were verified by WH. All authors read and approved the final manuscript.

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

This research received partial support from ACU under grant number CAWGS – 905737-111. This work was partially funded by the Qatar Japan Research Collaboration Research Program under grant number M-QJRC-2023-313.

Gurantor

WH, EAR.

ORCID iD

Essam A. Rashed

References

Shi

Wang

Shi

, et al. Review of artificial intelligence techniques in imaging data acquisition, segmentation, and diagnosis for COVID-19. IEEE Rev Biomed Eng 2020; 14: 4–15.

Huang

Han

, et al. Serial quantitative chest CT assessment of COVID-19: a deep learning approach. Radiol Cardiothorac Imaging 2020; 2: e200075.

Chen

Zhang

, et al. Deep learning-based model for detecting 2019 novel coronavirus pneumonia on high-resolution computed tomography. Sci Rep 2020; 10: 19196.

Shan

Gao

Wang

, et al. Lung infection quantification of COVID-19 in CT images with deep learning. arXiv 2020; 2003.04655.

Gaál

Maga

Lukács

. Attention u-net based adversarial architectures for chest x-ray lung segmentation. arXiv 2020; 2003.10304.

Muehlematter

Daniore

Vokinger

. Approval of artificial intelligence and machine learning-based medical devices in the USA and Europe (2015–20): a comparative analysis. Lancet Digit Health 2021; 3: e195–e203.

Aldahiri

Alrashed

Hussain

. Trends in using IoT with machine learning in health prediction system. Forecasting 2021; 3: 181–206.

Wang

Pathak

Zhang

. Using electronic health records and machine learning to predict postpartum depression. Stud Health Technol Inform 2019; 264: 888–892.

Hussain

Gao

Raza

, et al. Assessing cloud QoS predictions using OWA in neural network methods. Neural Comput Appl 2022; 34: 1–18.

10.

Hussain

Merigó

Raza

, et al. A new QoS prediction model using hybrid IOWA-ANFIS with Fuzzy C-means, subtractive clustering and grid partitioning. Inf Sci 2022; 584: 280–300.

11.

Hussain

Merigó

Raza

. Predictive intelligence using ANFIS-induced OWAWA for complex stock market prediction. Int J Intell Syst 2022; 37: 4586–4611.

12.

Hunter

Hindocha

Lee

. The role of artificial intelligence in early cancer diagnosis. Cancers (Basel) 2022; 14: 1524.

13.

Shaik

Tao

Higgins

, et al. Remote patient monitoring using artificial intelligence: current state, applications, and challenges. Wiley Interdiscip Rev Data Min Knowl Discov 2023; 13: e1485.

14.

Reverberi

Rigon

Solari

, et al. Experimental evidence of effective human–AI collaboration in medical decision-making. Sci Rep 2022; 12: 14952.

15.

Lin

. A clinician’s guide to artificial intelligence (AI): why and how primary care should lead the health care AI revolution. J Am Board Fam Med 2022; 35: 175–184.

16.

Budd

Miller

Manning

, et al. Digital technologies in the public-health response to COVID-19. Nat Med 2020; 26: 1183–1192.

17.

Ding

Zhang

Niu

, et al. Stroke prevention in rural residents: development of a simplified risk assessment tool with artificial intelligence. Neurol Sci 2023; 44: 1687–1694.

18.

Cao

Kunaprayoon

, et al. AI-assisted clinical decision making (CDM) for dose prescription in radiosurgery of brain metastases using three-path three-dimensional CNN. Clin Transl Radiat Oncol 2023; 39: 100565.

19.

Thirunavukarasu

Ting

DSJ

Elangovan

, et al. Large language models in medicine. Nat Med 2023; 29: 1930–1940.

20.

Mijwil

Aljanabi

Ali

. ChatGPT: exploring the role of cybersecurity in the protection of medical information. Mesopotam J Cybersec 2023; 2023: 18–21.

21.

Cascella

Montomoli

Bellini

, et al. Evaluating the feasibility of ChatGPT in healthcare: an analysis of multiple clinical and research scenarios. J Med Syst 2023; 47: 33.

22.

Davenport

Kalakota

. The potential for artificial intelligence in healthcare. Future Healthc J 2019; 6: 94.

23.

Shaw

Rudzicz

Jamieson

, et al. Artificial intelligence and the implementation challenge. J Med Internet Research 2019; 21: e13659.

24.

Morley

Machado

CCV

Burr

, et al. The ethics of AI in health care: a mapping review. Soc Sci Med 2020; 260: 113172.

25.

Habli

Lawton

Porter

. Artificial intelligence in health care: accountability and safety. Bull World Health Organ 2020; 98: 251.

26.

Davenport

Dreyer

. AI will change radiology, but it won’t replace radiologists. Harv Bus Rev 2018; 27.

27.

Beede

Baylor

Hersch

, et al. A human-centered evaluation of a deep learning system deployed in clinics for the detection of diabetic retinopathy. In: 2020 CHI conference on human factors in computing systems, Honolulu, HI, April 25–30, 2020.

28.

Ghassemi

Oakden-Rayner

Beam

. The false hope of current approaches to explainable artificial intelligence in health care. Lancet Digi. Health 2021; 3: e745–e750.

29.

Wiens

Saria

Sendak

, et al. Do no harm: a roadmap for responsible machine learning for health care. Nat Med 2019; 25: 1337–1340.

30.

Kanagasingam

Xiao

Vignarajan

, et al. Evaluation of artificial intelligence–based grading of diabetic retinopathy in primary care. JAMA Netw Open 2018; 1: e182665–e182665.

31.

Khan

Goodell

Hassan

, et al. A bibliometric review of finance bibliometric papers. Financ Res Lett 2022; 47: 102520.

32.

Qiao

Ding

Zhang

, et al. Accessible tourism: a bibliometric review (2008–2020). Tou Rev 2022; 77: 713–730.

33.

Wijewickrema

. A bibliometric study on library and information science and information systems literature during 2010–2019. Libr Hi Tech 2023; 41: 595–621.

34.

Dabbagh

Sookhak

Safa

. The evolution of blockchain: a bibliometric study. IEEE Access 2019; 7: 19212–19221.

35.

Ajibade

SSM

Ojeniyi

. Bibliometric survey on particle swarm optimization algorithms (2001–2021). J Electr Cmomput Eng 2022; 2022: 3242949.

36.

, et al. Exploring the ordered weighted averaging operator knowledge domain: a bibliometric analysis. Int J Intell Syst 2017; 32: 1151–1166.

37.

Zyoud

Fuchs-Hanusch

. A bibliometric-based survey on AHP and TOPSIS techniques. Expert Syst Appl 2017; 78: 158–181.

38.

Wamba

Queiroz

. Responsible artificial intelligence as a secret ingredient for digital health: bibliometric analysis, insights, and research directions. Inf Syst Front 2021; 25: 2123–2138.

39.

Ismail

AFMF

Sam

MFM

Bakar

, et al. Artificial intelligence in healthcare business ecosystem: a bibliometric study. Int J Online Biomed Eng 2022; 18: 100–114.

40.

Shaikh

Alhashmi

Khedr

, et al. Bibliometric analysis on the adoption of artificial intelligence applications in the e-health sector. Digit. Health 2023; 9: 20552076221149296.

41.

Tran

, et al. Global evolution of research in artificial intelligence in health and medicine: a bibliometric study. J Clin Med 2019; 8: 360.

42.

Mukherjee

Lim

Kumar

, et al. Guidelines for advancing theory and practice through bibliometric research. J Bus Res 2022; 148: 101–115.

43.

Paul-Hus

Desrochers

Costas

. Characterization, description, and considerations for the use of funding acknowledgement data in Web of Science. Scientometrics 2016; 108: 167–182.

44.

Miao

Ding

, et al. Research priorities on one health: a bibliometric analysis. Front Public Health 2022; 10: 889854.

45.

Birkle

Pendlebury

Schnell

, et al. Web of science as a data source for research on scientific and scholarly activity. Quant Sci Stud 2020; 1: 363–376.

46.

İyibildiren

Eren

Ceran

. Bibliometric analysis of publications on web of science database related to accounting information system with mapping technique. Cogent Bus Manag 2023; 10: 2160584.

47.

Aminizadeh

Heidari

Toumaj

, et al. The applications of machine learning techniques in medical data processing based on distributed computing and the Internet of Things. Comput Methods Programs Biomed 2023; 241: 107745.

48.

Mohamed

Rashed

Gaber

, et al. Deep learning model for fully automated breast cancer detection system from thermograms. PloS one 2022; 17: e0262349.

49.

Sharma

Saba

Khanna

, et al. Segmentation-Based classification deep learning model embedded with explainable AI for COVID-19 detection in chest X-ray scans. Diagnostics 2022; 12: 2132.

50.

Goggin

. COVID-19 apps in Singapore and Australia: reimagining healthy nations with digital technology. Media Int Aust 2020; 177: 61–75.

51.

Bagabir

Ibrahim

Bagabir

, et al. COVID-19 and artificial intelligence: genome sequencing, drug development and vaccine discovery. J Infect Public Health 2022; 15: 289–296.

52.

Ordu

Demir

Tofallis

, et al. A novel healthcare resource allocation decision support tool: a forecasting-simulation-optimization approach. J Oper Res Soc 2021; 72: 485–500.

53.

Blasiak

Khong

Kee

. CURATE. AI: optimizing personalized medicine with artificial intelligence. SLAS Tech 2020; 25: 95–105.

54.

Rashid

MBMA

Toh

Hooi

, et al. Optimizing drug combinations against multiple myeloma using a quadratic phenotypic optimization platform (QPOP). Sci Transl Med 2018; 10: eaan0941.

55.

Lee

Clemens

Silva

, et al. Ultra-rapid near universal TB drug regimen identified via parabolic response surface platform cures mice of both conventional and high susceptibility. PLoS One 2018; 13: e0207469.

56.

Schork

. Artificial intelligence and personalized medicine. Precis Med Cancer Ther 2019; 178: 265–283.

57.

Blasiak

Khong

Kee

. CURATE. AI: optimizing personalized medicine with artificial intelligence. SLAS Tech 2020; 25: 95–105.

58.

Mukhopadhyay

Summer

Ling

, et al. Personalised dosing using the CURATE. AI algorithm: protocol for a feasibility study in patients with hypertension and type II diabetes Mellitus. Int J Environ Res Public Health 2022; 19: 8979.

59.

Ingber

. Human organs-on-chips for disease modelling, drug development and personalized medicine. Nat Rev Genet 2022; 23: 467–491.

60.

Sotoudeh

Shafaat

Bernstock

, et al. Artificial intelligence in the management of glioma: era of personalized medicine. Front Oncol 2019; 9: 768.

61.

Bradley

. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit 1997; 30: 1145–1159.

62.

Fawcett

. An introduction to ROC analysis. Pattern Recognit Lett 2006; 27: 861–874.

63.

Shen

Suk

. Deep learning in medical image analysis. Annu Rev Biomed Eng 2017; 19: 221–248.

64.

Litjens

Kooi

Bejnordi

, et al. A survey on deep learning in medical image analysis. Med Image Anal 2017; 42: 60–88.

65.

Goodfellow

Pouget-Abadie

Mirza

, et al. Generative adversarial networks. Commun ACM 2020; 63: 139–144.

66.

Pérez

Ventura

. Progressive growing of generative adversarial networks for improving data augmentation and skin cancer diagnosis. Artif Intell Med 2023; 141: 102556.

67.

Basnet

Parasteh

Manashty

, et al. RIMD: a novel method for clinical prediction. Artif Intell Med 2023; 140: 102526.

68.

Gatys

Ecker

Bethge

. Image style transfer using convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, June 26th–July 1st, 2016.

69.

Liao

Chen

, et al. Few-shot transfer learning for personalized atrial fibrillation detection using patient-based siamese network with single-lead ECG records. Artif Intell Med 2023; 144: 102644.

70.

Yan

Wang

, et al. DeepLesion: automated mining of large-scale lesion annotations and universal lesion detection with deep learning. J Med Imaging 2018; 5: 036501–036501.

71.

Lin

Goyal

Girshick

, et al. Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, Venice, Italy, October 22–29, 2017.

72.

Rajpurkar

Irvin

Zhu

, et al. Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv 2017; 1711.05225.

73.

Chi

Tian

Wang

, et al. A novel lifelong machine learning-based method to eliminate calibration drift in clinical prediction models. Artif Intell Med 2022; 125: 102256.

74.

Lederberg

. How DENDRAL was conceived and born. In: A history of medical informatics. ACM; 1990, pp.14–44.

75.

Hochreiter

Schmidhuber

. Long short-term memory. Neural Comput 1997; 9: 1735–1780.

76.

Edara

Vanukuri

Sistla

, et al. Sentiment analysis and text categorization of cancer medical records with LSTM. J Ambient Intell Humaniz Comput 2023; 14: 5309–5325.

77.

Maragatham

Devi

. LSTM model for prediction of heart failure in big data. J Med Syst 2019; 43: 1–13.

78.

Deng

Dong

Socher

, et al. Imagenet: a large-scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition, 20–25 June 2009, Miami, FL, 2009.

79.

Cheng

Zeng

. Shaping AI’s future? China in global AI governance. J Contemp China 2023; 32: 794–810.

80.

Chen

Decary

. Artificial intelligence in healthcare: an essential guide for health leaders. Healthc Manage Forum 2020; 33: 10–18.

81.

Powles

Hodson

. Google DeepMind and healthcare in an age of algorithms. Health Tech 2017; 7: 351–367.

82.

Wehde

. Healthcare 4.0. IEEE Eng Manag Rev 2019; 47: 24–28.