Sage Journals: Discover world-class research

Abstract

In the health care and medical domain, it has been proven challenging to diagnose correctly many diseases with complicated and interferential symptoms, including arrhythmia. However, with the evolution of artificial intelligence (AI) techniques, the diagnosis and prognosis of arrhythmia became easier for the physicians and practitioners using only an electrocardiogram (ECG) examination. This review presents a synthesis of the studies conducted in the last 12 years to predict arrhythmia’s occurrence by classifying automatically different heartbeat rhythms. From a variety of research academic databases, 40 studies were selected to analyze, among which 29 of them applied deep learning methods (72.5%), 9 of them addressed the problem with machine learning methods (22.5%), and 2 of them combined both deep learning and machine learning to predict arrhythmia (5%). Indeed, the use of AI for arrhythmia diagnosis is emerging in literature, although there are some challenging issues, such as the explicability of the Deep Learning methods and the computational resources needed to achieve high performance. However, with the continuous development of cloud platforms and quantum calculation for AI, we can achieve a breakthrough in arrhythmia diagnosis.

Keywords

Arrhythmia artificial intelligence deep learning diagnosis prediction health care

Introduction

Neuro-cardiovascular diseases are the leading cause of death in the world. Arrhythmias represent a category of these diseases associated with medical issues that can range from a minor inconvenience or discomfort to a fatal problem. An arrhythmia is an abnormality of the heart’s rhythm which is controlled by electrical signals. It may beat too slow, too quick, or irregular.¹ The electrocardiogram (ECG) is an effective tool for arrhythmia diagnosis, it measures the heart’s electrical activity. Other ambulatory devices can be used for the same aim, such as Holter Monitor.² However, the diagnosis of arrhythmias is not always obvious, especially for atrial fibrillation (AF) that can be related to asymptomatic and transient forms. Moreover, there are some limitations in the extraction methods and time series analysis from ECG singularities and their dynamics. To address these limitations, artificial intelligence (AI) is applied to the diagnosis and prognosis of diseases, such as arrhythmia. For this end, we focused in our previous works on the diagnosis of AF with machine learning (ML) methods. For instance, we conducted a multi-dynamics analysis of QRS complex with support vector machine (SVM) and multiple kernel learning (MKL) in Trardi et al,³ which reached respective sensitivity values of 96.54% and 95.47%. Other works were mainly based on the extraction of different features from R-wave derivatives for automatic medical decision-making, especially for AF detection as in literature.^4-6 In addition, the use of univariate and multivariate methods plays a major role in the analysis of the ECG time series.

On future work, the objective is to process ECG signals and classify different categories of heartbeat to detect different types of arrhythmia and thus help health care professionals. For this aim, we realized a literature review on ECG-based models for arrhythmia detection using AI techniques in the last 12 years.

This article is organized as follows: Section “Methods of Search and Selection” presents an overview on search strategy and the criteria of studies’ selection. Section “Results of Studies’ Exploration” emphasizes the exploration of the studies selected and the collected information from these studies. Section “Discussion and Interpretation” is dedicated to the interpretation and discussion of the obtained results, the contribution of this review, and comparison to other literature reviews. To sum up, the final section “Conclusions” contains a summary of the strengths and limitations of the used deep learning (DL) techniques.

Methods of Search and Selection

This section presents the search strategy, the criteria of selection, and the extraction of study characteristics.

Search strategy

To conduct this review, multiple academic research databases were selected to gather relevant articles that were published from January 2010 to September 2022. These open source databases are PubMed, IEEE Xplore, Springer, ScienceDirect, and ResearchGate. PubMed and IEEE Xplore are considered as 2 of the leading databases in biological sciences and engineering, respectively.⁷ Springer is one of the leading research publishers that provides a large number of resources for literature in different fields. ScienceDirect was used for its several and various peer-reviewed journals and articles. For ResearchGate, it allows access to a large number of free papers because it is the largest academic social network in terms of active users.⁸

To realize a targeted search, we identify articles from their titles and abstracts using the following keywords: “artificial intelligence for arrhythmia,” “arrhythmia diagnosis,” “arrhythmia classification,” “heartbeat classification,” “ECG classification.”

Collection of data sources was based on the process below:

Targeted web-based search.

Classification of sources by level of relevance.

Studies’ selection based on the inclusion criteria listed in Table 1.

Table 1.

Inclusion criteria.

Criterion	Description
Date	2010-2022
Type	Journal article and conference
Domain	Bioinformatics and computer science
Participants	With no other disease symptoms/medications effects
Review	Peer-reviewed
Evaluation metrics	Accuracy /sensitivity /specificity/F1 score /AUC /confusion matrix

Abbreviation: AUC, area under curve.

The selected databases provide peer-reviewed articles except ResearchGate that does not require a peer-review for the published articles.

Study selection

The scope of the studies selected comprises the following conditions:

➢ Studies conducted on the diagnosis of arrhythmia in the last 12 years;

➢ Studies addressing the classification of any type of arrhythmia to,

Present a variety of databases,

Identify a large number of arrhythmia in both subclasses and super-classes;

➢ Studies handling the diagnosis of arrhythmia with no other cardiovascular disease interference to encircle tightly the scope and circumstances of occurrence of this abnormality and thus realize a precise and accurate diagnosis and prognosis.

Moreover, the studies can handle either beat classification or category classification. The former contains many subclasses of heart rhythm as shown in Table A3 (Appendix 1) and the latter addresses the classification topic based on the main arrhythmia categories defined by the American National Standards Institute/Association for the Advancement of Medical Instrumentation (ANSI/AAMI). Different heartbeat categories will be found in Table A2 (Appendix 1).

Many criteria were considered in the first articles’ selection, but only articles with the inclusion criteria sorted in Table 1 were retained.

Study mining: extraction of study characteristics

With a view to exploring the selected publications, retrieving information, and identifying patterns, we extracted the following characteristics:

Study perimeter: defines year of publication and authors;

Input information: includes datasets used in the study, number of participants, and number of arrhythmia classes to predict;

ECG signal information: contains ECG recording format and signal duration;

Feature set: defines the extraction approach and the extracted features from each study. The extraction methodology depends on the learning structure, hand-crafted methods, or end-to-end learning (where the selection, extraction, and classification are embedded in one stage);

Methods: define the pre-processing and prediction methods used in each study to implement the AI algorithms;

Evaluation: presents the metrics and key performance to evaluate the prediction.

The objective is to extract a large number of characteristics to analyze deeply each study.

Results of Studies’ Exploration

Selection

To search for the adequate papers, we rely on the process shown in Figure 1.

Figure 1.

The process of study selection.

First, we realized a quick search for the topic by keywords which results in 730 records. Second, we removed duplicates given that we used 5 research databases, then a paper inventory was held by sorting publications by abstract.

When applying the inclusion criteria in Table 1, we focused on selecting papers that do not deal with other cardiovascular diseases. Although the study⁹ includes the treatment of myocardial ischemia, it was selected because there was no interference with arrhythmia diagnosis; the 2 diseases were independently addressed. Irfan et al¹⁰ used a dataset with 13 types of heartbeats, including arrhythmias and myocardial infarctions, which added more variety to the dataset without affecting the performance of the model for arrhythmia diagnosis.

We included the study¹¹ although they conducted the classification of congestive heart failure rhythm with the normal and arrhythmia rhythms because this allowed the classifier to have a higher recognition ability in classification.

In this stage, 100 records were selected. We adopted a diagonal-text reading and found that some papers focused on the analysis of the ECG signals without addressing the arrhythmia prediction, so we excluded 18 studies.

After a full-text review, we selected 40 studies that handle the arrhythmia classification with various methods of pre-processing and several approaches of diagnosis.

Study design

Various datasets were used for the selected studies. In total, 95% of the studies used open access datasets from PhysioNet repository¹² mainly the Massachusetts Institute of Technology-Boston’s Beth Israel Hospital (MIT-BIH) arrhythmia database. Besides, Li et al¹³ used data collected from Fluke ProSim2 vital sign simulator which is a paying portable solution that transforms physiological simulation by adding multi-parameter functionalities, including ECG simulation and arrhythmia waveform selections.¹⁴ Ribeiro et al¹⁵ used the Telehealth Network of Minas Gerais (TNMG) dataset which was obtained from one of the largest telehealth services in Brazil. Hannun et al¹⁶ used an own collected data recorded by Zio monitor which is a portable device described by physicians to diagnose irregular heart rhythms for up to 14 days¹⁷ unlike Holter monitor, used in Park and Kang,¹⁸ which can be worn only from 24 to 28 hours.

For the studies relying on the PhysioNet repository databases, some of them used small samples of individuals (between 14 and 78 participants). Hannun et al¹⁶ included 53 877 participants aged 69 ± 16 years and Ribeiro et al¹⁵ collected data from 1 676 384 individuals older than 16 years. The remainder of the studies did not report the exact number of participants (Table 2).

Table 2.

Input information.

Year	Study	Databases	No. of classes	No. of participants	ECG recording format	Signal duration	Type of data
2019	Chen et al¹⁹	MIT-BIH arrhythmia database,	6	47	2-lead	30 mn	Native
		QT database,	6	NR	2-lead	NR
		MIT-BIH supraventricular arrhythmia database,	4	NR	2-lead	30 mn
		MIT-BIH supraventricular arrhythmia database,
		INCART database	4	NR	12-lead	NR
2017	Acharya et al²⁰	MIT-BIH arrhythmia database	5	47	1-lead (lead II)	30 mn	Native,Augmented
2019	Yildirim et al²¹	MIT-BIH arrhythmia database	5	NR	1-lead (lead II)	30 mn	Native
2019	Yang et al²²	MIT-BIH arrhythmia database	15	NR	2-lead (II and V1)	30 mn	Native
2019	Yang et al²²	INCART database	7	NR	2-lead (II and V1)2-lead (V1 and V5)2-lead (II and V5)2-lead (II and V1)	30 mn	Native
2018	Yildirim²³	MIT-BIH arrhythmia database	5	47	NR	30 mn	Native
2014	Sumathi et al⁹	MIT-BIH arrhythmia Database, MIT-BIH AF database, MIT-BIH malignant ventricular ectopy database	5	NR	NR	NR	Native
2019	Gao et al²⁴	MIT-BIH arrhythmia database	8	47	NR	30 mn	Native
2013	Martis et al²⁵	MIT-BIH arrhythmia database	5	NR	NR	NR	Native
2016	Li et al¹³	MIT-BIH arrhythmia database,Fluke ProSim2 vital sign simulator	5	NR	NR	NR	Native,Simulated
2018	Anwar et al²⁶	MIT-BIH arrhythmia database,MIT-BIH supraventricular arrhythmia database	185	47NR	NR	30 mn	Native
2013	Liu²⁷	MIT-BIH arrhythmia database	5	NR	2-lead (II and VI)	30 mn	Native
2015	Elhaj et al²⁸	MIT-BIH arrhythmia database	5	NR	NR	30 mn	Native
2019	Kim et al²⁹	MIT-BIH arrhythmia database	5	44	NR	NR	Native
2018	Yıldırım³⁰	MIT-BIH arrhythmia database	131517	45(19 F, 26 M)	1-lead (MLII)	10 seconds	Native
2018	Oh et al³¹	MIT-BIH arrhythmia database	5	47	1-lead (MLII)	NR	Native
2018	Oh et al³²	MIT-BIH arrhythmia database	5	47	1-lead (MLII)	Variable length	Native
2018	Raj and Ray³³	MIT-BIH arrhythmia database	165	47	NR	30 mn	Native
2020	Ribeiro et al¹⁵	Telehealth Network of Minas Gerais (TNMG) dataset	6	1 676 384 (60.3% F, 39.7% M) (>16 years)	12-lead	7-10 seconds	Native
2017	Qin et al³⁴	MIT-BIH arrhythmia database	6	NR	1-lead (II)	30 mn	Native
2017	Rajagopal and Ranganathan³⁵	MIT-BIH arrhythmia database	5	47	NR	30 mn	Native
2010	Benali et al³⁶	MIT-BIH arrhythmia database	5	NR	NR	NR	Native
2017	Rajesh and Dhuli³⁷	MIT-BIH arrhythmia database, INCART	5	NR	NR	30:06 mn	Native
2018	Yang et al³⁸	MIT-BIH arrhythmia database	5	NR	2-lead (MLII and V5)	30 mn	Native
2019	Hannun et al¹⁶	Original ECG dataset recorded by Zio monitor	12	53 877 (43% F, 57% M) (69 ± 16 year old)	1-lead (MLII)	30 seconds	Native
2014	Park and Kang¹⁸	MIT-BIH arrhythmia database, Holter ECG monitoring data	17	47	1-lead (MLII)	30 mn	Native
2021	Ullah et al³⁹	MIT-BIH arrhythmia database,PTB diagnostic ECG database	52	NRNR	NR	NR	Native,Augmented
2022	Irfan et al¹⁰	MIT-BIH arrhythmia database,UCI arrhythmia dataset	513	47 (25 F, 22 M),(44.9% M, 55.1% F)	2-lead,NR	30 mnNR	AugmentedNative
2020	Shahin et al⁴⁰	MIT-BIH arrhythmia database	5	47	2-lead	30 mn	Augmented
2022	Ma et al⁴¹	MIT-BIH arrhythmia database	5	47	NR	NR	Augmented
2021	Sabut et al⁴²	CU ventricular tachyarrhythmia database,MIT-BIH malignant ventricular ectopy database	3,3	NR	NR	8 minutes,30 mn	Native
2020	Wang et al⁴³	China Physiological Signal Challenge 2018 database,Computing in Cardiology Challenge 2017 database	9,3	NR	12-lead,1-lead	[6,60] seconds,⩾ 9 seconds	Augmented
2022	Anbarasi et al¹¹	MIT-BIH arrhythmia database,MIT-BIH NSR database,BIDMC database	2,2,2	1018,15	NR,NR,2-lead	1 mn,NR,20 hours	Native
2022	Zubair and Yoon⁴⁴	MIT-BIH arrhythmia database	5	48	1-lead (MLII)	30 mn	Native
2022	Hu et al⁴⁵	MIT-BIH arrhythmia database,MIT-BIH AF database	4, 82	NR	1-lead (MLII),1-lead (MLII)	30 mn,10 hours	Native
2022	Feyisa et al⁴⁶	PTB-XL dataset	41, 20, 5	(52% M, 48% F)	12-lead	10 seconds	Native
2019	Ju et al⁴⁷	MIT-BIH arrhythmia database	13	NR	NR	NR	Native
2022	Wang et al⁴⁸	Computing in Cardiology Challenge 2017 database	4	NR	1-lead	⩾ 9 seconds	Augmented
2021	Wang⁴⁹	MIT-BIH arrhythmia database,China Physiological Signal Challenge 2018 database	2,2	NR	NR	30 mn,[6,60] seconds	Native
2021	Luo et al⁵⁰	MIT-BIH AF database	9	NR	2-lead	30 mn	Augmented
2022	Iftene et al⁵¹	MIT-BIH arrhythmia database,PTB diagnostic ECG database	2,2	47,290	2-lead,NR	30 mn,NR	Augmented

Abbreviations: BIDMC, Beth Israel Deaconess Medical Center; CU, Creighton University; INCART, St. Petersburg Institute of Cardiological Techniques; MIT-BIH, Massachusetts Institute of Technology-Boston’s Beth Israel Hospital; MLII, Modified Limb lead II; NR, Not Reported; NSR, Normal Sinus Rhythm; PTB, Physikalisch-Technische Bundesanstalt; UCI, University of California Irvine.

Figure 2 shows that 9 studies used augmented data to balance the datasets and enhance the AI models. It should be observed that most of the selected studies are published in the last 5 years, and this can be explained by the recent emergence of AI and the growth of literature sources lately.

Figure 2.

Selected publications from 2010 to 2022.

Ullah et al³⁹ used 2 types of datasets from PhysioNet: MIT-BIH arrhythmia database and PTB Diagnostic ECG Database. In addition, they used generative adversarial network (GAN) model to generate new artificial signal for classes with small amount of data. The same technique was used to augment data in Ma et al.⁴¹

Irfan et al¹⁰ used the publicly available MIT-BIH arrhythmia database and UCI arrhythmia dataset available in the University of California Irvine ML Repository.

Synthetic minority oversampling technique (SMOTE) is used in literature^10,50 to handle the problem of the imbalanced data in the MIT-BIH arrhythmia database and MIT-BIH AF database. SMOTE relies on a k-nearest neighbor algorithm to create new synthetic data. Whereas, Shahin et al⁴⁰ upsampled the training data by randomly duplicating the samples resulting in relatively equal classes but not as varied as it should be.

Two public databases from China Physiological Signal Challenge 2018 (CPSC-2018) and Computing in Cardiology Challenge 2017 (CinC-2017) were used in Wang et al;⁴³ the data were augmented by applying, respectively, flipping and random erasure techniques.

Hu et al⁴⁵ used MIT-BIH arrhythmia database for classifying 4 classes following the AAMI annotation and 8 classes following the widely used classification in literature. Different label classifications comprising 41, 20, and 5 classes are also reported in Feyisa et al⁴⁶ with the use of PTB-XL dataset which is a 12-lead database with various types of arrhythmia. Wang et al⁴⁸ used the CinC-2017 database and applied a data augmentation with the Mix-Up operation in the training stage to reduce the data imbalance and thus the overfitting; the method generates more training data without extra computational resources.

Table 2 shows in details the input information for each study.

The databases in the table above can contain a higher number of classes than what was reported in this review, but since there are some studies that focus on specific rhythms, we mentioned only the classes that were actually used for classification.

Feature set

Given that some studies used DL techniques, the end-to-end structure was implemented, in which selection, extraction, and classification are embedded in one stage. However, the hand-crafted methods process the extraction of features independently from other learning stages.

Raw ECG signal is fed as input to models where no feature extraction phase is required; Table 3 reports the hand-crafted studies where feature extraction was realized. In addition, the most employed technique for data extraction is discrete wavelet transform (DWT); 17% of the studies used this technique either separately as in literature^9,19 or with other extraction methods, as in literature,^{13,25,26,28,34,35} such as principal component analysis (PCA) that was used in 15% of the studies. Other methods were used, such as Fast Fourier Transform (FFT) in Chen et al¹⁹ and Higher-Order Spectra (HOS) in literature.^25,28 Other studies used personalized DL techniques for feature extraction. For instance, Yildirim et al²¹ used a convolutional auto-encoder (CAE) while Yang et al²² used a canonical correlation analysis network (CCANet) which combines canonical correlation analysis and cascaded convolution network, and Yang et al³⁸ used a principal component analysis network (PCANet) which is a convolutional neural network (CNN) with PCA filters. Wang et al⁴³ realized a multi-scale feature learning with CNN kernels to extract features from segments with different size.

Table 3.

Feature set.

Year	Study	Extraction approach/data input	Extracted features	Methodology
2019	Chen et al¹⁹	RR interval,DWT,FFT	2 time-domain3 frequency-domain5 space-domain	12-lead feature fusion
2019	Yildirim et al²¹	Raw ECG signal,Deep-coded ECG signals by CAE	NR	CAE for coded data
2019	Yang et al²²	Dual-lead raw ECG signal,Triple-lead raw ECG signal	NR	DL-CCANet,TL-CCANet
2014	Sumathi et al⁹	Wavelet transform	5 space-domain	NR
2013	Martis et al²⁵	HOS cumulants + PCA,DWT + HOS + PCA	12 nonlinear cumulants	NR
2016	Li et al¹³	PCA + KICA	20 nonlinear	NR
2016	Li et al¹³	Db 2 DWT + PCA + LDA	4 frequency-domain	NR
2018	Anwar et al²⁶	RR interval	4 time-domain	NR
		Discrete Meyer Wavelet Transform + ICA	12 time-frequency domain
		Teager energy operator	1 time-domain
2015	Elhaj et al²⁸	Meyer DWT + PCA	12 time-frequency domain	NR
2015	Elhaj et al²⁸	HOS cumulants + ICA	16 nonlinear	NR
2018	Raj and Ray³³	Sparse decomposition	5 time-domain	Composite dictionary (CD): DOST, DST, DCT dictionaries
2017	Qin et al³⁴	Biorthogonal 6.8 Wavelet multi-resolution analysis + PCA	12 time-frequency domain	NR
2017	Rajagopal and Ranganathan³⁵	Db 4 DWT + PCA	12 time-frequency domain	NR
2010	Benali et al³⁶	Raw data	1 space-domain1 frequency-domain3 time-domain	NR
2017	Rajesh and Dhuli³⁷	Intrinsic mode functions (IMFs) decomposition	4 nonlinear	Ensemble empirical mode decomposition (EEMD)
2017	Rajesh and Dhuli³⁷	Intrinsic mode functions (IMFs) decomposition	4 nonlinear	Empirical mode decomposition (EMD)
2018	Yang et al³⁸	PCANet (CNN with PCA filters)	NR	NR
2014	Park and Kang¹⁸	Raw data	2 space-domain4 time-domain	NR
2021	Sabut et al⁴²	Data decomposition	24 time-frequency domain	Db6 DWT, EMD,VMD

Abbreviations: CAE, Convolutional Auto-encoder; CNN, Convolutional Neural Network; DCT, Discrete Cosine Transform; DL-CCANet, Dual-Lead Canonical Correlation Analysis Network; DOST, Discrete Orthogonal Stockwell Transform; DST, Discrete Sine Transform; DWT, Discrete Wavelet Transform; HOS, Higher-Order Spectra; ICA, Independent Component Analysis; KICA, Kernel-Independent Component Analysis; LDA, Linear Discriminant Analysis; NR, Not Reported; PCA, Principal Component Analysis; PCANet, Principal Component Analysis Network.

Another type of signal decomposition is the intrinsic mode functions (IMFs) decomposition that can be characterized by empirical model decomposition (EMD) and ensemble empirical model decomposition (EEMD) as in Rajesh and Dhuli³⁷ or by variable model decomposition (VMD) as in Sabut et al.⁴²

Depending on the used approach, the features may be related to time-, frequency-, or space-domain and can be linear or nonlinear features (Table 3). In addition, 4 studies reported the use of MATLAB software to realize the feature extraction phase where the other studies did not fill in this information.

Pre-processing and prediction methods

All selected studies used pre-processing methods to handle the data except Hannun et al¹⁶ that proceeded directly to the classification. In the selected studies, we found out that some methods were used once for feature extraction and other times for data pre-processing. The pre-processing methods include noise removal, data segmentation, data normalization, data reduction, signal compression, and signal detection. Wavelet transform (WT) method, including different types of wavelets, was used for noise removal in literature^{20,24,28,35,38} and with improved versions in literature.^13,23 Sumathi et al⁹ used Symlet WT for QRS detection as shown in Table 4. The Pan–Tompkins algorithm proposed by Pan and Tompkins⁵² was used for segmentation and QRS detection in literature^18,25,28 and for R-peak detection in literature.^20,33,35,38 More than half of the studies used data normalization. Some studies used ML methods for data processing as in Yildirim et al²¹ and Liu²⁷ where they used, respectively, CAE for signal compression and SVM for QRS marking. Ullah et al³⁹ mentioned segmentation and pre-processing of data with no more details on the used techniques. Irfan et al¹⁰ applied standardization of data (standard scalar unit) and feature reduction with PCA on the UCI dataset, and noise removal with DWT and normalization on the MIT-BIH arrhythmia dataset.

Table 4.

Pre-processing and prediction methods in the selected studies.

Year	Study	Pre-processing methods	Prediction methods	Evaluation methods	Overall Accuracy (%)
2019	Chen et al¹⁹	PCA for dimensionality reduction	Cascaded classifier composed of random forest and multilayer perceptron	NR	99.80
2017	Acharya et al²⁰	Db 6 WT for noise removal, Pan–Tompkins algorithm for R-peak detection, ECG heartbeat segmentation, Z-score normalization	CNN	10-fold cross-validation	94.03
2019	Yildirim et al²¹	ECG heartbeat segmentation, CAE for ECG signal compression	Long short-term memory (LSTM)	NR	99.23
2019	Yildirim et al²¹	ECG heartbeat segmentation, CAE for ECG signal compression	CAE with LSTM	NR	99.11
2019	Yang et al²²	ECG heartbeat segmentation, min-max normalization	SVM with DL-CCANet (MIT-BIH)	10-fold cross-validation	99.40
			SVM with DL-CCANet (INCART lead II and V1)	5-fold cross-validation	98.31
			SVM with DL-CCANet (INCART lead V1 and V5)	5-fold cross-validation	98.26
			SVM with DL-CCANet (INCART lead II and V5)	5-fold cross-validation	98.31
			SVM with TL-CCANet (INCART lead II, V1, and V5)	5-fold cross-validation	98.76
2018	Yildirim²³	ECG heartbeat segmentation, Daubechies Wavelet sequence for multi-resolution analysis	Deep unidirectional LSTM-WS	NR	99.25
2018	Yildirim²³		Deep bidirectional LSTM-WS	NR	99.39
2014	Sumathi et al⁹	Noise removal, Symlet WT for QRS detection	Adaptive neuro-fuzzy inference system (ANFIS) model	NR	98.24
2019	Gao et al²⁴	Db 6 DWT for noise removal, ECG heartbeat segmentation, Z-score normalization	LSTM with focal loss (noise-free data)	NR	99.26
2019	Gao et al²⁴		LSTM with focal loss (noisy data)	NR	99.07
2013	Martis et al²⁵	Noise removal, Pan–Tompkins algorithm for QRS detection, ECG heartbeat segmentation	Feedforward NN(with HOS + PCA)	10-fold cross-validation	94.52
			Least-square SVM(with HOS + PCA)	10-fold cross-validation	94.30
			Feedforward NN(with DWT + HOS + PCA)	10-fold cross-validation	93.61
			Least-Square SVM(with DWT + HOS + PCA)	10-fold cross-validation	93.76
2016	Li et al¹³	Improved wavelet threshold method for noise removal	SVM + genetic algorithm (with MIT-BIH data)	NR	98.80
2016	Li et al¹³	Improved wavelet threshold method for noise removal	SVM + genetic algorithm (with personalized ECG acquisition platform)	NR	97.30
2018	Anwar et al²⁶	Noise removal, ECG heartbeat segmentation	Feedforward NN (18-class scheme)	3-fold cross-validation	99.75
2018	Anwar et al²⁶	Noise removal, ECG heartbeat segmentation	Feedforward NN (5-category scheme)	3-fold cross-validation	99.80
2013	Liu²⁷	Noise removal, data normalization,SVM for QRS detection and marking	Self-constructing neural-fuzzy inference network (SoNFIN)	NR	96.40
2015	Elhaj et al²⁸	Db 6 DWT for noise removal, Pan–Tompkins algorithm for QRS detection, ECG segmentation	Feed-forward NN	10-fold cross-validation	98.90
2015	Elhaj et al²⁸		SVM with RBF kernel	10-fold cross-validation	98.91
2019	Kim et al²⁹	ECG heartbeat segmentation	GoogleNet deep NN with 1-inception	NR	95.30
			GoogleNet deep NN with 2-inception	NR	96.30
			GoogleNet deep NN with 1-inception + CNN	NR	95.90
2018	Yıldırım³⁰	Data normalization	1-D CNN (13-class scheme)	NR	95.20
			1-D CNN (15-class scheme)	NR	92.51
			1-D CNN (17-class scheme)	NR	91.33
2018	Oh et al³¹	Heterogeneous ECG segmentation, Z-score normalization	Modified U-net architecture	10-fold cross-validation	97.32
2018	Oh et al³²	ECG segmentation, Z-score normalization	CNN + LSTM (without dropout regularization)	10-fold cross-validation	98.42
			CNN + LSTM (2-dropout)	10-fold cross-validation	97.88
			CNN + LSTM (3-dropout)	10-fold cross-validation	98.10
2018	Raj and Ray³³	Noise removal, Pan–Tompkins algorithm for R-peak detection, ECG heartbeat segmentation	ABC-DAG-LSTSVMs classifier with 16-class scheme	14-fold cross-validation	99.21
2018	Raj and Ray³³		ABC-DAG-LSTSVMs classifier with 5-class scheme	22-fold cross-validation	90.08
2020	Ribeiro et al¹⁵	Vector cardiogram linear transformation for dimensionality reduction	Unidimensional residual NN	NR	92.55
2017	Qin et al³⁴	ECG heartbeat segmentation	One-vs-one SVM (beat-based scheme)	10-fold cross-validation	99.70
2017	Qin et al³⁴	ECG heartbeat segmentation	One-vs-one SVM (record-based scheme)	10-fold cross-validation	81.47
2017	Rajagopal and Ranganathan³⁵	Db 8 DWT for noise removal, Pan–Tompkins algorithm for R-peak detection, ECG heartbeat segmentation	K-nearest neighbors + SVM	10-fold cross-validation	99.78
2010	Benali et al³⁶	Noise removal, QRS detection (original algorithm by GBM laboratory at Tlemcen university)	Wavelet neural network (WNN)	NR	98.78
2017	Rajesh and Dhuli³⁷	Noise removal, ECG heartbeat segmentation	Sequential minimal optimization (SMO)–SVM (cubic kernel) with EMD for MIT-BIH data	10-fold cross-validation	99.20
			SMO–SVM (RBF kernel) with EEMD for MIT-BIH data	10-fold cross-validation	96.45
			SMO–SVM (cubic kernel) with EEMD for INCART data	10-fold cross-validation	97.57
2018	Yang et al³⁸	Db 8 WT for noise removal, Pan–Tompkins algorithm for R-peak detection, ECG heartbeat segmentation, min-max normalization	Linear SVM,K-nearest neighbors,Random Forest,Backpropagation NN(Noisy data)	10-fold cross-validation	97.7797.1096.0196.95
2018	Yang et al³⁸		Linear SVM,K-nearest neighbors,Random Forest,Backpropagation NN(Noise-free data)	10-fold cross-validation	97.0896.2795.2295.89
2019	Hannun et al¹⁶	NR	Deep CNN with sequence level,Deep CNN with set level	NR	97.897.7
2014	Park and Kang¹⁸	Pan–Tompkins algorithm for QRS detection	Decision tree with J4.8 algorithm (personalized scheme)	10-fold cross-validation	85.26
2014	Park and Kang¹⁸	Pan–Tompkins algorithm for QRS detection	Decision tree with J4.8 algorithm (non-personalized scheme)	10-fold cross-validation	89.95
2021	Ullah et al³⁹	ECG heartbeat segmentation	CNN	NR	99.12
			CNN + LSTM	NR	99.3
			CNN + LSTM + attention method	NR	99.29
2022	Irfan et al¹⁰	DWT for noise removal, data standardization and normalization, PCA for feature reduction	CNN + LSTM with MIT-BIH database	NR	99.35
2022	Irfan et al¹⁰		CNN + LSTM with UCI arrhythmia dataset	NR	99.05
2020	Shahin et al⁴⁰	ECG segmentation, Z-score normalization	Multi-task adversarial network	NR	86
2022	Ma et al⁴¹	Db 6 WT for noise removal, Pan–Tompkins algorithm for R-peak detection, ECG heartbeat segmentation	ResNet + Bi-LSTM + attention method	NR	99.4
2021	Sabut et al⁴²	Noise removal, Z-score normalization, heartbeat segmentation with 5s window	Deep NN	NR	99.2
2020	Wang et al⁴³	Data normalization	Multi-scale fusion CNN	5-fold cross-validation	NR
2022	Anbarasi et al¹¹	CWT for noise removal, ECG segmentation	Combined CNN and LSTM	10-fold cross-validation	98.7
2022	Zubair and Yoon⁴⁴	Noise removal, Pan–Tompkins algorithm for peaks detection, ECG heartbeat segmentation	CNN for inter-patient classification	10-fold cross-validation	96.36
2022	Zubair and Yoon⁴⁴		CNN for intra-patient classification	10-fold cross-validation	99.81
2022	Hu et al⁴⁵	Z-score normalization, wavelet, and Pan–Tompkins for QRS detection, heartbeat segmentation	Transformer-based CNN for 8 classes	10-fold cross-validation	99.12
			Transformer-based CNN for 4 classes	10-fold cross-validation	99.49
			Transformer-based CNN for 2 classes	10-fold cross-validation	99.23
2022	Feyisa et al⁴⁶	Standard normalization, 2.5-s segmentation	Multi-receptive field CNN for 41 classes	NR	98
			Multi-receptive field CNN for 20 classes	NR	96.2
			Multi-receptive field CNN for 5 classes	NR	89.7
2019	Ju et al⁴⁷	Noise removal, dimensionality reduction with PCA	Deep bidirectional GRU network	5-fold cross-validation	99.51
2022	Wang et al⁴⁸	NR	Dual-path recurrent neural network (RNN)	5-fold cross-validation	84.5
2021	Wang⁴⁹	2-s segmentation	CNN + improved BGRU for MIT-BIH data	NR	97.9
2021	Wang⁴⁹	2-s segmentation	CNN + improved BGRU for CPSC data	NR	98.3
2021	Luo et al⁵⁰	0.65-s segmentation, Z-score normalization	Hybrid convolutional RNN	10-fold cross-validation	99.01
2022	Iftene et al⁵¹	PQRST detection, data segmentation, data normalization	1-D CNN with pre-processing	NR	98
			1-D CNN	NR	95
			Bayesian NN	NR	90
			GRU network	NR	94

Abbreviations: BGRU, Bidirectional Gated Recurrent Unit; CPSC, China Physiological Signal Challenge; CNN, Convolutional Neural Network; CWT, Continuous Wavelet Transform; DL-CCANet, Dual-Lead Canonical Correlation Analysis Network; DWT, Discrete Wavelet Transform; EEMD, Ensemble Empirical Mode Decomposition; EMD, Empirical Mode Decomposition; GBM, Génie Bio-médical; GRU, Gated Recurrent Unit; HOS, Higher-Order Spectra; INCART, St. Petersburg Institute of Cardiological Techniques; LSTM, Long Short-Term Memory; LSTM-WS, Long Short-Term Memory Wavelet Sequence; MIT-BIH, Massachusetts Institute of Technology-Boston’s Beth Israel Hospital; NN, Neural Network; NR, Not Reported; RBF, Radial Basis Function; SVM, Support Vector Machine; UCI, University of California Irvine.

In addition, data padding was reported in Wang et al⁴³ as a processing operation to fix the input length.

Using continuous WT, Anbarasi et al¹¹ transformed 1-D signal to 2-D colored images to feed the CNN network. The transfer learning was introduced in Hu et al⁴⁵ to overcome the imbalance data problem.

Ju et al⁴⁷ proposed a bidirectional gated recurrent unit (GRU) network where the output is linked to the forward and backward states resulting in a better fit than unidirectional GRU and simpler structure than LSTM. To alleviate the issue of redundancy in bidirectional GRU, Wang⁴⁹ used an improved version of the aforementioned technique by adding a scale parameter to the model and combining it with CNN for feature extraction.

As shown in Table 4, the selected studies used several AI methods:

ML methods: SVM, random forest, decision tree, feedforward NN, residual NN, K-nearest neighbors.

DL methods: CNN, long short-term memory (LSTM), GAN, GRU.

Statistical AI methods: CCA, linear discriminant analysis.

Artificial evolutionary algorithms: Genetic algorithm.

Mathematics algorithms: Fuzzy logic, directed acyclic graph.

Some studies used the methods above either separately, combined, or in personalized view adapted to the application context to enhance the model performance. For instance, Sumathi et al⁹ combined fuzzy logic with NN, and Ullah et al³⁹ combined CNN with LSTM, and Attention method which uses a weighted sum of all the encoder hidden states to flexibly focus the attention of the decoder to the most relevant parts of the input sequence. Feyisa et al⁴⁶ relied on a multi-receptive CNN where the receptive field can be obtained by either using multiple kernels of different sizes or using a fixed-size kernel with a varying dilation rate.

Most of the studies reported the use of k-fold cross-validation method for evaluation.

When there are various classes/categories in the dataset, the mentioned metrics refer to the overall performance on the ensemble of classes or databases. For more details, Table A1 (Appendix 1) shows different metrics for evaluation.

In the case of multi-class classification, we adopt averaging methods for some metrics calculation, resulting in a set of different average scores (macro, weighted, micro) in the classification report.

Discussion and Interpretation

In this review, we synthetize some literature studies addressing the ECG diagnostic approaches and the arrhythmia classification methods. We establish a comparison between the selected studies by discussing the following topics.

Used datasets and ECG signal information

The set of databases used in the selected studies is listed below. There are some studies that tested some of these databases separately or combined to provide high amount of data. The BIDMC database used in Anbarasi et al¹¹ for congestive heart failure was excluded from this analysis because we want to focus only on databases with arrhythmias.

Open access databases: MIT-BIH databases, QT database, INCART database, PTB diagnostic ECG database, CU ventricular tachyarrhythmia database, Computing in Cardiology Challenge 2017, China Physiological Signal Challenge 2018, PTB-XL dataset. More details about these databases are found on PhysioNet Bank.¹²

UCI arrhythmia dataset: An open access database available on the ML repository of the UCI university.⁵³

Telehealth Network of Minas Gerais (TNMG) dataset: Data collected under the scope of the CODE (Clinical Outcomes in Digital Electrocardiology) study in the Telehealth Network of Minas Gerais which is a public telehealth system in Minas Gerais, Brazil. Publicly available on TNMG dataset.⁵⁴

ProSim simulator dataset: An industry-leading patient simulator for monitoring and preventive testing, developed by Fluke Biomedical. It is a commercial paid solution.¹⁴

Zio monitor dataset: Non-free ambulatory monitoring solution developed by iRhythm Technologies Inc, San Francisco, CA. The solution provides FDA-cleared, single-lead, patch-based ECG monitor that continuously records data from a single vector, the recording can be up to 14 days.¹⁷

Holter monitor dataset: Private data collected from wearable device which records heartbeats for diagnosis. It is a noninvasive solution that can be worn up to 2 days.

As shown in Table 2, 33 out of 40 studies used the MIT-BIH arrhythmia database.

To diagnose arrhythmia, the studies relied on the multi-class prediction. Most of the studies predicted the occurrence of more than 2 types of the arrhythmic heartbeat, yet not all of the studies using 12-class prediction and more recorded the highest performances as in literature.^{16,18,22,26,30,33} This can be explained by the imbalance of datasets; some heartbeat types have a small number of records which affect negatively the classification rate.

However, 33 out of the 40 selected studies performed the classification with input signal equal to 30 minutes’ length (Table 2). Only 1 study used variable length duration,³¹ and a special U-net architecture was developed for this purpose to handle the variable-size data.

As of the ECG recording format, only literature^15,19,43,46 used 12-lead ECG signal which is the standard technique in the real clinical settings. Although literature^15,16 provided the largest datasets among all the selected studies which can improve the model ability of generalization, they did not reach the highest performances due to the imbalance of data. Also, Hannun et al¹⁶ did not apply any pre-processing methods on the data which can increase the error rate.

Data augmentation is used to tackle the issue of data imbalance. While some techniques help to mitigate the overfitting in the training stage, such as SMOTE technique and GAN network, other methods allow only to increase the volume of data without having a measurable effect on the performance and the variance of the dataset since they rely on a simple resampling or the addition of Gaussian noise and interpolation as in Iftene et al.⁵¹

It is logical to analyze the use of different datasets in the same study since it used the same pre-processing and prediction methods. The comparison of the same database used in different studies will not be relevant.

Figure 3 shows that INCART database in Chen et al¹⁹ reached the highest accuracy among the other databases, given that all of them were imbalanced. This good performance can be explained by the fact that INCART is 12-lead and the study combined features from all these leads to ensure classification. However, Yang et al²² showed better accuracy for MIT-BIH database than INCART, from which were extracted only 2 leads (II and V1). Irfan et al¹⁰ and Wang⁴⁹ recorded better results on MIT-BIH database because the other databases were highly imbalanced.

Figure 3.

Overall accuracy recorded in some datasets.

Taking everything discussed above into account, we assume that:

MIT-BIH still is one of the best and most complete databases used in arrhythmia classification as it provides annotations, signal characteristics, and different lead recordings.

The combination of 12-leads can help increasing the accuracy because the model will be fed with various information.

It is essential to tackle the imbalance data issue because it can hinder good pre-processing and prediction techniques from achieving higher performances. SMOTE technique is recommended for this end.

Feature selection and extraction

Table 5 indicates the types of some extracted features from the selected studies. The most used features are RR intervals, which represent the time-domain, and the amplitude of R wave, which represents the space-domain. Moreover, the WT and the PCA methods are the most used in the feature extraction stage, given that PCA provides low-dimension features while preserving as much of the data variation as possible and WT allows to capture both frequency and time information.

Table 5.

Types of extracted data.

Domain	Feature	Studies
Frequency	Signal phase angle of the FFT	^{Chen et al} ¹⁹
	Signal wave power spectrum of the FFT	^{Chen et al¹⁹}
	DWT frequency	^{Chen et al} ¹⁹
	Mean of wavelet coefficient	Li et al¹³
	Min of wavelet coefficient	Li et al¹³
	Max of wavelet coefficient	Li et al¹³
	SD of wavelet coefficient	Li et al¹³
	QRS duration	Benali et al³⁶
Time	Skewness of RR intervals	^{Chen et al¹⁹}
	Kurtosis of RR intervals	Chen et al¹⁹ and ^{Raj and Ray³³}
	SD of RR intervals	Raj and Ray³³
	Interval RR	Anwar et al²⁶,^{Raj and Ray³³}, Elhaj et al²⁸ and Park and Kang¹⁸
	Interval PR	Park and Kang¹⁸
	Position of R point	Park and Kang¹⁸
	Position of P point	Park and Kang¹⁸
	Local RR interval	Anwar et al²⁶ and Raj and Ray³³
	Average RR interval	Anwar et al²⁶ and Raj and Ray³³
	Energy	Anwar et al²⁶ and Raj and Ray³³
	Ratio between the distance RR following the previous one	Benali et al³⁶
Space	Amplitude of R wave	Chen et al¹⁹, Sumathi et al⁹, ^{Benali et al³⁶} and Park and Kang¹⁸
	Amplitude of Q wave	Sumathi et al⁹
	Amplitude of P wave	Park and Kang¹⁸
	Amplitude of S wave	Sumathi et al⁹
	Amplitude of K1	Sumathi et al⁹
	Amplitude of K2	Sumathi et al⁹
	Highest voltage value	Chen et al¹⁹
	Lowest voltage value	Chen et al¹⁹
	Average amplitude	Chen et al¹⁹
	Variance of amplitudes	Chen et al¹⁹
Nonlinear	Variance	Chen et al¹⁹
Nonlinear	Permutation entropy	Raj and Ray³³

Abbreviations: DWT, Discrete Wavelet Transform; SD, Standard Deviation.

Sabut et al⁴² extracted various features having temporal, statistical, and spectral information, such as filter leakage measure, covariance, kurtosis, skewness, threshold crossing interval, Shannon Entropy, etc to improve the accuracy of classification.

For the studies based on DL models, such as CNN as in literature,^39,40,41,45 the extraction is held by the DL model itself, by sliding multiple convolutional windows over the ECG and performing multiple convolutional operations on the local features.

There is no doubt that feature extraction allows a better understanding of the model as it helps setting an explicit feature design of the ML model but when it is embedded in the DL model, it decreases the consumption of resources and time. For instance, we can rely on the strength of CNN for dealing with the extraction stage even if CNN can be time-consuming when a high number of layers are used.

Pre-processing methods

According to Table 4, only 1 study was not subject to data pre-processing. The most used techniques are ECG heartbeat segmentation (17 studies), noise removal (13 studies), data normalization (8 studies), and QRS detection (6 studies). However, 4 studies relied on R-peak detection and this detection reached an accuracy of 99.3% in Oh et al.³¹ Furthermore, the most used algorithms in the pre-processing phase are the Pan–Tompkins algorithm to detect accurately R peaks and QRS complexes, and the WT to reduce the cost of continuous wavelet computation.

Table 6 below summarizes the pre-processing methods, their application, and the objective from their usage.

Table 6.

Pre-processing methods and their applications.

Pre-processing method	Application/type	Objective	Technique
Signal segmentation	[100, 140, 150, 200, 250, 252, 256, 260, 300, 360, 400, 500] sample long.	• Infer the hidden states of signal at each time,• Subsequent signal classification.	Annotated R peaks, annotated QRS complexes, cardiologists’ annotations.
Noise removal	Power line interference, muscle noise, motion artifact, baseline wander, high-frequency artifacts.	• Improve the interpretability and perception of multi-dimension information,• Reduce the probability of error in QRS detection,• Enhance the classification accuracy.	DWT with its different distributions, band-pass filter, median filter, low-pass filter, mathematics equations.
Data normalization	Signal rescaling	• Eliminate the offset effect,• Standardize the ECG signal amplitude,• Improve the backpropagation process by speeding up the convergence rate.	Min-max normalization, Z-score normalization
QRS detection	QRS mid-point, RR markers	• Subsequent rhythm classification,• Identify features characteristics.	Pan–Tompkins algorithm, SVMs, Symlet WT, original algorithm by GBM at Tlemcen University.
R-peak detection	R point recognition	• Facilitate features’ extraction	Pan–Tompkins algorithm
Signal compression	Segmented ECG data	• Reduce the signal size of beats with the minimum loss,• Reduce the storage cost of the large amount of data.	Deep CAE
Dimensionality reduction	Feature space reduction	• Reduce the overhead of computing,• Improve accuracy.	PCA, vector cardiogram linear transformation.

Abbreviation: GBM, Génie Bio-médical.

The ECG signal segmentation is applied with different sample-long segments that vary between 100- and 500-sample long. The samples are centered either around the detected R peaks or the detected QRS complexes. The segmentation can also rely on the extraction of T-to-T segments as in Zubair and Yoon,⁴⁴ or can simply rely on the database annotation files.

Noise removal method is applied to remove different types of noise that can result from patient motion or respiration, power line interference, muscle artifacts, baseline drift, electrode motion artifact or data-collecting device noise. To the fact that each noise source resides in a characteristic frequency band, different filters and techniques are used depending on the type of noise.

Data normalization can be considered one of the most interesting methods due to its important influence on the classification process. Namely, signal rescaling improves significantly the backpropagation process by speeding up the convergence rate.

Most of the studies where pre-processing was applied to data showed a better performance on the classification as in Iftene et al⁵¹ where CNN model reached an accuracy of 95% without pre-processing vs 98% when applying data augmentation and normalization.

The other pre-processing methods used in the selected studies are shown in Table 6.

To show the correlation between the use of pre-processing methods and the obtaining of better accuracies, we plot the performance corresponding to different pre-processing methods.

We compare between noisy data and noise-free data. Figure 4 shows that when cleaning data from noise, better accuracy can be obtained. Yet, Yang et al³⁸ demonstrated its ability to detect successfully noisy heartbeats with different ML methods. This is due to the use of PCA filters when extracting features, which can remove implicitly unwanted noise.

Figure 4.

Accuracy of noisy data vs noise-free data.

We notice that all the studies which recorded accuracies lower than 90% did not undergo noise removal.

As result, we affirm that the use of one or many combined pre-processing methods can decrease the error rate. It is highly recommended to realize data augmentation and noise removal to avoid misclassifications and improve the detection ability of arrhythmia.

Prediction methods

As shown in Table 4, CNN and LSTM network are the most used techniques, followed by the SVM in 22% of the studies. Indeed, more than half of the studies used DL techniques to improve the accuracy. CNNs are used with different variants in the convolutional blocks as in literature.^44-46

Some studies reported the computation time in the learning phase; Yang et al²² recorded the lowest training time with a value equal to 68.8 seconds using leads II and V1. The use of CCANet in the feature extraction phase has definitely reduced the computation cost and improved the accuracy and specificity which reached, respectively, 99.4% and 99.6%.

Shahin et al⁴⁰ reported a very interesting DL technique; the architecture of the adversarial multi-task model consists of 3 networks: the generator network, the heartbeat-type discriminator which discriminates between 5 types of heartbeats, and the subject discriminator which discriminates between 39 different subjects. This design has increased the performance allowing double discrimination and forcing the system to take into account only the heartbeat variations. Yet, it can be improved by changing the method of synthetic data generation; generating new data with GAN network or SMOTE technique instead of upsampling which generates duplicated data.

Sabut et al⁴² used a fusion of 2 CNN branches with different scales and an Attention module to mine the discriminative features. In fact, the attention mechanism boosts the classification performance as shown in Hu et al⁴⁵ where the attention helped to capture the inter-beat dependencies.

The combination of residual convolutional blocks and bidirectional LSTM model with Attention method in Ma et al⁴¹ seems to be effective since it allows a local and global feature extraction, and high accuracy that reached 99.4%. Zubair and Yoon⁴⁴ mitigated the problem of imbalanced data in CNNs by designing a novel cost-sensitive loss function in the network. This learning strategy is based on training efficiently the model without changing the distribution of the data. Besides, the aforementioned study highlighted the use of 2 different paradigms: the intra-patient and inter-patient classifications to show how the latter achieves better generalization capability.

Luo et al⁵⁰ used a hybrid model combining CNN layers, LSTM, and GRU networks. Indeed, the authors took advantage of every network’s strength: the high ability of temporal and spatial information extraction of CNN, acquiring sequential information by LSTM, retaining only relevant information by the GRU, and avoiding the gradient disappearance issue.

As for the development tools, Python was used with its different ML libraries, such as TensorFlow, PyTorch, Scikit-learn, and Keras. MATLAB is also employed in some studies. Iftene et al⁵¹ developed the prediction technique in the Amazon Web Services platform using an integrated DL model.

We gather the prediction methods used in this review in the scheme below (Figure 5).

Figure 5.

Used AI methods.

General AI can be divided into 2 categories:

Symbolic AI which is based on a system of “rules,” the machine therefore does not improvise by itself, it acts according to the rules it has received. One of the most important algorithms in symbolic AI is the genetic algorithm used in Li et al.¹³

ML is a form of AI where based on more data and computers can learn without being explicitly programmed to do so instead of programmers teaching the machines what tasks they need to perform.

We visualize the accuracy of CNN and SVM networks in Figure 6.

Figure 6.

Accuracy of CNN and SVM models.

As it can be shown, the average accuracy over all the studies that used CNN is 97.82% vs 98.41% for SVM. When running through literature, we find that SVMs when preceded with feature extraction stage can achieve promising results. The selected studies in this review used PCA filters, DWT, and convolutional layers for the extraction which definitely have boosted the SVM performance.

Taking everything into consideration:

○ The CNN and LSTM are the most used techniques in the last years; they allow the extraction of temporal, spatial, and sequential information from the ECG signal and they analyze deeply the extracted features which result in high accuracy.

○ The attention method can boost the classification performance and the generalization ability.

○ Pre-processing combined with DL techniques can help achieving promising results.

○ To achieve high performance, DL methods need high-performance computing.

○ When high computational resources are not provided, the use of SVM can be a good alternative for arrhythmia detection but should be preceded with relevant feature extraction methods which can be time-consuming.

Evaluation and performances

First, we compare the studies that used the same datasets. We sort 7 different datasets available on the PhysioNet repository:²³ MIT-BIH arrhythmia database, QT database, MIT-BIH supraventricular arrhythmia database (SV), INCART, MIT-BIH atrial arrhythmia database, Malignant ventricular arrhythmia database, and PTB diagnostic ECG database, besides 4 non-open datasets collected during studies^13,15,16,18 either from simulation devices or Holter monitor or ECG recorder as indicated in Table 2. However, 2 studies^19,26 used SV database, Anwar et al²⁶ combined the data from both MIT-BIH arrhythmia database (MIT-BIH) and SV database to apply class-oriented prediction (based on different sub-categories of beats depending on the used datasets) with 18 classes and subject-oriented prediction (main category classification with beat’s annotation according to ANSI/AAMI standard) with 5 classes while Chen et al¹⁹ applied class-oriented scheme with 4 classes. As shown in Figure 4, the model combining MIT-BIH and SV databases achieved a high accuracy of 99.8%, whereas the model relying only on SV database achieved an accuracy of 97.6% (Figure 7). Nevertheless, both the results were promising.

Figure 7.

Average classification accuracy on SV database.

Studies^19,22,37 enrolled the learning phase using INCART database for 4-class, 7-class, and 5-class prediction, respectively. The highest accuracy of 99.8% was achieved by the first study where they used 12-lead ECG recording format vs an accuracy of 98.76% with 2-lead format and 97.57% for the 2 other studies. For the rest of the studies where they used MIT-BIH arrhythmia database, all of them reached an accuracy above 90%.

Second, we carry on a comparison between studies using the same prediction methods. Regarding the studies applying SVM, they used different kernel functions and some of them were combined with other ML algorithms, but most of them yielded an accuracy greater than 94%. This can be explained by the powerful methods used for feature extraction and data pre-processing, including the use of DL techniques,^20,22,38 in these studies. When comparing the studies that applied CNN, all of them attained high accuracy rates above 94%. The lowest metrics (accuracy, specificity, and sensitivity) were obtained by Qin et al³⁴ that performed SVM on record-based training scheme where the classifier was trained and tested on separate records from different individuals.

Regarding the studies with smaller signal durations (between 7 and 30 seconds), they achieved good F1-score values but the highest scores were obtained by 30-minute duration studies. And yet, the increase in ECG signal length does not guarantee the highest accuracy rates. Indeed, in this review, the studies with the lowest signal duration^15,16,30 could perform better, especially when they applied deep CNNs; however, they either did not proceed data pre-processing¹⁶ or they used imbalanced data³⁰ for the classification.

In Irfan et al,¹⁰ the DL model achieved better results on the second dataset, this is due to the highly imbalanced data in the first dataset. Only the accuracy of the best model was reported in Table 4 (an overall accuracy of 99.35% for balanced data vs 93.33% for imbalanced data).

For Shahin et al,⁴⁰ the adversarial multi-task model achieved an overall accuracy of 86% on the validation set and 87% on the test set, which are lower comparing to other techniques, due to the imbalanced data.

Zubair and Yoon⁴⁴ achieved a high accuracy of 99.81% in the intra-patient paradigm with CNN with different size kernels and cost-sensitive function. Hu et al⁴⁵ reached an accuracy of 99.49% for 4 class-categorization with transformer encoder–decoder network with CNN layers and attention mechanism. The use of CNN with different kernel sizes (to capture different segment and interval lengths) allowed to obtain an accuracy of 98% for 41 arrhythmias classification.

Wang⁴⁹ used a novel method of premature ventricular contraction (PVC) detection where they modified a GRU network to avoid the redundancy of information in the forward and backward connections. This improved version of GRU yielded an accuracy of 97.9% on MIT-BIH data and 98.3% on CPDB.

Most of methods relying on DL, ML, statistical AI techniques, or a combination of them had performed high accuracies because all of the selected studies in this review realized rigorously the feature extraction phase and the pre-processing phase.

Among the studies selected, there are many that have used variety of approaches/databases/methods. Depending on each criterion, we linked the use of pre-processing and prediction methods to the accuracies which they are shown in Table 4.

Contributions and comparison to other literature reviews results

We compare our review to other review papers in literature that focus on reviewing studies with ML methods for arrhythmia classification.

Some papers focused only on describing the DL techniques and neglected the effect of the pre-processing stage and the type of datasets on the performance as in review⁵⁵ which conducted a shallow description of the papers. Unlike Ebrahimi et al,⁵⁶ where they realized a well-organized overview to the existing papers in literature starting from 2017. Yet, they basically selected papers using the public PhysioNet databases which can be useful when producing and comparing works between researchers, but it neglects the DL performance that can be recorded on wearable monitoring devices. In the same review, they presented papers that used variants of GRU, RNN, and CNN: models with very promising results in the literature.

One of the strong points of Annam et al⁵⁷ is the presentation of the inter-patient vs the intra-patient paradigms in heartbeat classification with both DL and ML techniques. However, they did not discuss the pre-processing methods used in the selected papers. Jensen et al⁵⁸ and Tamariz et al⁵⁹ focused on the study of papers handling the validation of, respectively, AF occurrence and ventricular arrhythmias while focusing on the validation metrics and the used datasets without dedicating special attention to the classification methods which were reported as administrative codes. Jensen et al⁵⁸ reported only 16 studies which can question the relevance of this review. Sanamdikar⁶⁰ reported feature extraction, pre-processing, and prediction techniques for arrhythmia classification with description of the used databases. However, the review was limited in terms of the reported techniques, especially for the pre-processing where they mentioned only noise removal.

One of the most interesting reviews in literature is Luz et al.⁶¹ They relied on a good search strategy and succeeded to report information about used databases, feature extraction, pre-processing, and prediction methods.

Parvaneh et al⁶² presented an overview on arrhythmia detection with respect to the following aspects: used datasets, type of input data, model architecture, and evaluation metrics. Due to the shallow analysis of the selected papers, this review is considered to be conceptual. The DL architecture and feature extraction were briefly stated. Another gap is the absence of pre-processing methods which should be discussed because they affect the performance of the DL model. However, Houssein et al⁶³ focused on the studies related to arrhythmia classification by artificial neural network (ANN) and SVMs. The review presented the 3 main stages prior to classification: pre-processing, feature extraction, and feature selection. Detailed information about every phase was given while relating to the used methods in every study. Thus, this can be an interesting review for reference but since they focused only on 2 models, ANN and SVM, more papers should have been included to the analysis.

The strengths of our review can be mentioned as follows:

We presented the search method and the inclusion criteria that we rely on, to select the studies analyzed in our review.

We reported and analyzed papers using either DL or ML or both, to emphasize the good performance that can be reached when combining different techniques. Moreover, we want to provide the reader with other alternatives when high computing resources are not provided.

We established a deep description of the papers; exploration of used datasets, feature extraction, pre-processing, prediction methods, and performance evaluation.

We pointed out the advantages and limitations of the used methods.

We analyzed the relationship between the high performance and the use of pre-processing methods, especially noise removal and data augmentation which help avoiding misclassifications.

We believe that this review can help in defining the scope of future research work when planning to apply ML or DL techniques for arrhythmia classification to given datasets.

In the future, we plan to follow up this literature by the developing an AI model to classify ECG heartbeats and predict the occurrence of arrhythmia.

Conclusions

This review synthetizes and interprets some of the papers in the literature that deal with arrhythmia detection using ECG-based models.

Taking everything into account, we summarize the findings of this review as follows:

➢ The selected studies relied on a multi-class prediction of arrhythmia with no other cardiovascular disease diagnosis, to keep the focus on the irregularity of heartbeat types related to the arrhythmic aspect. Most of the studies used a 30-minute signal length and a single- or dual-lead ECG recording format.

➢ ECG heartbeat segmentation relies on the signal sliding based on the position of R peaks with equal-size segment. Therefore, variable-size segments should be used more frequently, especially when detecting arrhythmias to capture the intra-beat and inter-beat irregularities.

➢ Most of databases contain imbalanced data which result in heartbeat misclassification for the minority classes. Therefore, strong methods for data augmentation should be used as SMOTE or GAN network.

➢ It is found that the use of data augmentation technique is proportional to the use of DL techniques which need balanced data to emphasize their performance.

➢The most performing models used arrhythmia databases from the PhysioNet repository mainly the MIT-BIH databases because they are properly annotated and organized. Moreover, the most used features were RR intervals and the amplitude of R waves which indicate the importance of these time-domain and space-domain features, respectively, in the prediction of arrhythmia.

➢Overall, 96% of the selected studies applied pre-processing methods among which there are noise removal, normalization, and QRS detection. These methods demonstrated their efficiency in decreasing the computing cost and increasing the accuracy rate.

➢ All selected studies used either ML techniques or DL techniques, indicating that AI is becoming an important twist in the health care and telemedicine field. The most used technique is CNNs followed by SVM and the combination of CNN and LSTM. The use of SVM, with the combination of DL techniques in the feature extraction and the pre-processing phases, recorded very important results.

In Table 7, shown all techniques that are used in arrhythmia classification in this review. We present the advantages and limitations of each classification method as they are identified by the authors of each study.

Table 7.

Advantages and limitations of arrhythmia classification methods.

Study	Classification method	Advantages	Drawbacks/ limitations	Perspectives
Chen et al¹⁹	Cascaded classifier composed of random forest (RF) and multilayer perceptron (MLP)	• Time-saving method• No need for signal pre-processing	The absence of the dynamic property extraction from 1-D signal	LSTM network will be applied to extract more features from ECG dynamic
Acharya et al²⁰	CNN	• Self-removal of the unwanted noise• Fully automatic algorithm: No need for additional feature extraction or selection• Insensitive to the ECG signal quality	• Training is computationally expensive and time-consuming (≈hours)• Require specialized hardware to efficiently train (GPU)Require very large number of images	Training the CNN to recognize temporal sequences: normal, abnormal, and potentially life-threatening conditions of heart electrical activity
Yildirim et al²¹	CAE and LSTM	• Reduce training time cost• Secure transmission of patient data due to coded structureLow feature loss during compression	• Complex DL model for compression	The use of coded features with traditional classifiers
Yang et al²²	CCANet and SVM	• High overall accuracyClassification of detailed classes (15 classes on MIT-BIH database and 7 classes on INCART database)• Use of the correlation of multi-lead ECG signalsA small number of parameters to be adjusted	• Lower recognition performance for classes with minimal heartbeatsThe size of ECG matrix needs to be adjusted for different databases due to the different sample rates	Using more ECG leads and adopting a resampling algorithm
Yildirim²³	Unidirectional and bidirectional LSTM with wavelet-based layer	• Providing more distinguishing and size-reduced featuresThe input length does not affect the network storage requirements since LSTMs are located in space and time	Extra time cost due to the wavelet layer	Combining LSTMs with other DL methods
Sumathi et al⁹	ANFIS: combination of NN and fuzzy logic	• The advantage of a fuzzy set is the depiction of prior knowledge into a set of constraints to reduce the optimization research spaceEasy to implement	NR	NR
Gao et al²⁴	LSTM with focal loss	• Addressing the issue of imbalanced dataset classification• Extracting the timing characteristics of ECGRobust model with noisy data	• Time cost of the training phase	Incorporating more beat types and adding different types of noise
Martis et al²⁵	NN and SVM	Obtaining high accuracies due to the use of nonlinear features	NR	Evaluating a combination of several nonlinear features
Li et al¹³	SVM with genetic algorithm	Improving the classifier performance by the GA algorithm	NR	NR
Anwar et al²⁶	Feedforward NN	Computationally efficient for arrhythmia classification: 18 types	NR	The use of automatic patient customization scheme allowing the heartbeat classification method to be able to adjust to individual physiological features using wearable sensors
Liu²⁷	SoNFIN: NN with fuzzy logic	• Obtaining high performance with the fuzzy logicSuitable model for a portable system	Recognition and classification are more difficult for 12-lead signal	NR
Elhaj et al²⁸	Feedforward NN and SVM	High capability for generalization	NR	NR
Kim et al²⁹	GoogleNet Deep NN	Accuracy enhanced by inception structure	The computational complexity increases exponentially as the layer becomes deeper	NR
Yıldırım³⁰	1-D CNN	• Efficient and non-complex• Highly accurate and fast (real-time classification)• End-to-end structure• Reducing computational complexity	• Small number of ECG signal fragments are analyzed.No possibility for classifying fragments of ECG signal containing more than 1 class	Testing the efficiency of developed 1D-CNN using other physiological signals and classifying fragments of the ECG signal that containing more than 1 class
Oh et al³¹	Modified U-net architecture: CAE with skipped connections	• End-to-end solution, requires minimal processing• High capacity of handling the heterogeneity of beats (ECG segments with mixed arrhythmia types)• Producing localized outputs of higher resolutionGood generalization ability without overfitting	• The training phase is computationally intensive and slow• The model is trained and tested using an imbalanced dataset• Predictions for R peak are not precise	The model can be tested on variable length signals for analysis
Oh et al³²	Combination of CNN and LSTM	• CNN is good at picking up spatial features while the LSTM is better at learning temporal features• Predictions made by the system are reproducible with no inter- and intra-observer biases• Noise filtering, feature extraction and selection techniques are not required	• Model is computationally intensive and learning is slow• Algorithm is not robust in detecting the atrial premature beat from normal ECG segment• The system is developed using imbalanced dataset	The use of auto-encoder network on the ECG data for element-wise analysis, by associating each pixel with a class labelHigh-end graphic cards to accelerate the training process of the modelData augmentation to balance data
Raj and Ray³³	Least-square twin SVM	• The method is fast since the classification time is very low• The implementation of cross-validation scheme makes the method highly robust and reliableThe proposed method is highly efficient and completely automatic	• More memory is required for implementation• More optimization time is taken by the optimization technique to tune the classifierThe use of fixed window for the heartbeats	More classes of cardiac arrhythmias can be detected using the proposed method
Ribeiro et al¹⁵	Unidimensional residual NN	• Model more robust to noiseRecognize accurately ECG rhythm and morphological abnormalities in clinical examinations	The presence of relatively infrequent classes leading to few erroneous classifications	Diagnosis of less common forms of arrhythmiaTesting the algorithm in a controlled real-life situation
Qin et al³⁴	One-vs-one SVM	Beats accurately recognized in beat-based scheme	Ineffective recognition of record-based scheme due to the lack of comprehensive knowledge over the training beats	Using the state-of-the-art DL method for the classification of more types of heartbeats
Rajagopal and Ranganathan³⁵	Combination of KNN and SVM	• Successfully trained to classify overlapped classesComplex tasks can be learned using simple procedures by local approximation for the KNN	Limitation in speed and size during both training and testing for SVM	NR
Benali et al³⁶	WNN	• Provide faster training times and multi-resolution analysis capabilitiesHigh discrimination between cardiac rhythms	NR	NR
Rajesh and Dhuli³⁷	SMO–SVM	High ability to discriminate different types of heartbeats even under noisy conditions	The probability to obtain biased results	Feature-based disease identification
Yang et al³⁸	Linear SVM	Robust to noise and skewed data	The imbalance of data	The use of dimension reduction and data augmentation techniques
Hannun et al¹⁶	CNN	• High diagnostic performance similar to that of cardiologistsHigh ability of recognizing patterns from raw ECG signal	Some prevalence-dependent metrics, such as the F1 score would not be expected to generalize to the broader population since the extraction of rhythms was enrolled in targeted patients	NR
Park and Kang¹⁸	Decision tree	• The model can accommodate complicated patterns, and considers many more types of beats.Accurate classification method that reduces the number of false alarms and missing events by considering more types of heartbeats	Low specificity due to the overfitting problem associated with decision tree	• The use of harmonic classification forestAdding the “live mode” to the system for the use in mobile Holter monitoring
Ullah et al³⁹	Combined CNN and LSTM and Attention method	High accuracy due to combination of CNN and LSTM	Possibility of overfitting the data because the design of the proposed model includes 10 residual blocks	• Application of the model in binding domains, such as cloud and mobile systemsDevelop wearable technologies with integrated low-power consumption model
Irfan et al¹⁰	Combined CNN and LSTM	• Impervious to the time-step order• Cost-effective time	The addition of more DL networks will increase computational cost	• Deployment on embedded systemsDeal with real-time data
Shahin et al⁴⁰	Adversarial multi-task model	Improve system generalization by forcing the model to discriminate based only on variations between heartbeat types and not on variations between subjects	High rate of misclassification in test set	• Extend the experiments to other types of heartbeatsUse CNN and LSTM techniques
Ma et al⁴¹	Fusion of ResNet and Bi-LSTM with Attention method	• Approach the periodicity of ECG by the fusion of ResNet spatial information and Bi-LSTM temporal informationHigh recognition performance	NR	Extend the model to other arrhythmia databases
Sabut et al⁴²	Deep NN	Detect shockable ventricular tachycardia	NR	• Use CNN to improve the computational complexityUse the model in AED devices for automatic delivery of shocks
Wang et al⁴³	Multi-scale fusion CNN	Employ correlations among features of different scales	NR	Apply the model to other physiological signal
Anbarasi et al¹¹	Combined CNN and LSTM	Effective model with high extraction techniques	Computationally very expensive	Apply the model to disorders, such as gastrointestinal ailments, the differentiation of neoplastic and non-neoplastic tissues
Zubair and Yoon⁴⁴	CNN	• Incorporate both the short-term and long-term morphological characteristics of ECGIncreased classification rate of minority classes due to cost-sensitive function	NR	Use the cost-sensitive learning in “Point of Sale” systems
Hu et al⁴⁵	Transformer-based CNN	• Competitive heartbeat positioning and classification due to inter-beat dependenciesHigh ability of generalization	NR	• Detection of special signalsDeploy the algorithm on wearable devices
Feyisa et al⁴⁶	Multi-receptive field CNN	Facilitate the ability to look into multiple fields simultaneously and capture various features to discriminate the ECG classes	NR	• Tackle the data imbalance issue with GAN networkUse WT for feature representation
Ju et al⁴⁷	Deep bidirectional GRU Network	Provide more powerful expression and learning ability	NR	NR
Wang et al⁴⁸	Dual-path RNN	Realize both intra-segmental and inter-segmental modeling	Perform poorly for longer sequences	Develop multi-channel dual-path RNN
Wang⁴⁹	Improved bidirectional GRU	Alleviate the problem of information redundancy	Only devoted to PVC detection	Detect other signal types using more diverse data
Luo et al⁵⁰	Hybrid convolutional RNN	Combine the strengths of 3 DL methodsStrong ability to extract deep features	Time-consuming	Extend the model to other disease detection
Iftene et al⁵¹	CNN, BNN, RNN	High accuracy when using pre-processing	Lack of balanced databases	Develop an automated solution for AF detection on mobile devices

Abbreviations: AED, Automatic External Defibrillator; BNN, Bayesian Neural Network; GA, Genetic Algorithm; GPU, Graphics Processing Unit; INCART, St. Petersburg Institute of Cardiological Techniques; KNN, k-nearest neighbors; LSTM, Long Short-Term Memory; MIT-BIH, Massachusetts Institute of Technology-Boston’s Beth Israel Hospital; NR, Not Reported; SVM, Support Vector Machine.

We notice that the most common limitations for the use of DL methods are that they are time-consuming and computationally expensive and require very efficient hardware resources. Otherwise, they can perform accurately the classification of heartbeats with the end-to-end learning besides they can be robust to noise. For the traditional ML methods, they can be simply implemented and are computationally efficient and provide faster training time.

To sum up, we cannot give a decisive recommendation of the best model, based on the analysis made in the “Discussion” section because none of the 40 selected studies applied the exact same feature extraction, pre-processing, or prediction techniques in all the stages. Besides, the input information, the ECG signal information, the development tools, and the computing capacities vary from one study to another. However, when taking into consideration all these variants and the results of the studies’ analysis, we can presume that the usage of DL solely or the usage of ML combined with DL techniques can achieve very promising results.

Footnotes

Appendix 1

Table A3.

Beat annotations by PhysioBank.

Code	Description
N	Normal beat (displayed as “·” by the PhysioBank ATM, LightWAVE, pschart, and psfd)
L	Left bundle branch block beat
R	Right bundle branch block beat
B	Bundle branch block beat (unspecified)
A	Atrial premature beat
a	Aberrated atrial premature beat
J	Nodal (junctional) premature beat
S	Supraventricular premature or ectopic beat (atrial or nodal)
V	Premature ventricular contraction
r	R-on-T premature ventricular contraction
F	Fusion of ventricular and normal beat
e	Atrial escape beat
j	Nodal (junctional) escape beat
n	Supraventricular escape beat (atrial or nodal)
E	Ventricular escape beat
/	Paced beat
f	Fusion of paced and normal beat
Q	Unclassifiable beat
?	Beat not classified during learning

Appendix 2

Table of acronyms.

Acronym	Signification
ABC	Artificial bee colony
AI	Artificial intelligence
AUC	Area under curve
CAE	Convolutional auto-encoder
CNN	Convolutional neural network
DAG	Directed acyclic graph
Db	Daubechies
DL-CCANet	Dual-lead canonical correlation analysis network
DCT	Discrete cosine transform
DOST	Discrete orthogonal stockwell transform
DST	Discrete sine transform
DWT	Discrete wavelet transform
FFT	Fast Fourier transform
GBM	Génie Bio-médical
HOS	Higher-order spectrum
ICA	Independent component analysis
INCART	St. Petersburg Institute of Cardiological Techniques
KICA	Kernel-independent component analysis
LDA	Linear discriminant analysis
LSTM	Long short-term memory
LSTM-WS	Long short-term memory wavelet sequence
LS-TSVM	Least-square twin support vector machine
MIT-BIH	Massachusetts Institute of Technology-Boston’s Beth Israel Hospital
MLP	Multilayer perceptron
NN	Neural network
PCA	Principal component analysis
PCANet	Principal component analysis network
RBF	Radial basis function
RF	Random forest
SD	Standard deviation
SV	Supraventricular
SVM	Support vector machine
TL-CCANet	Triple-lead canonical correlation analysis network

Funding:

The author(s) received no financial support for the research, authorship, and/or publication of this article.

Declaration Of Conflicting Interests:

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Author Contributions

BA, MO, and SD contributed to conceptualization, resources, and supervision.

References

Arrhythmia—NHS. Accessed January 13, 2021. https://www.nhs.uk/conditions/arrhythmia/

Mubarik

Iqbal

AM.

Holter Monitor. In: StatPearls [Internet]. Treasure Island, FL: StatPearls Publishing; Updated July 25, 2022. https://www.ncbi.nlm.nih.gov/books/NBK538203/

Trardi

Ananou

Haddi

Ouladsine

. Multi-dynamics analysis of QRS complex for atrial fibrillation diagnosis. Paper presented at: 2018 5th International Conference on Control, Decision and Information Technologies (CoDIT); April 10-13, 2018; Thessaloniki, Greece. doi:10.1109/CoDIT.2018.8394935

Trardi

Ananou

Haddi

Ouladsine

. A novel method to identify relevant features for automatic detection of atrial fibrillation. Paper presented at: 2018 26th Mediterranean Conference on Control and Automation (MED); June 19-22, 2018; Zadar, Croatia. doi:10.1109/MED.2018.8442479

Trardi

Ananou

Ouladsine

. An advanced arrhythmia recognition methodology based on R-waves time-series derivatives and benchmarking machine-learning algorithms. Paper presented at: 2020 European Control Conference (ECC); May 12-15, 2020, St. Petersburg, Russia. doi:10.23919/ECC51009.2020.9143678

Trardi

Ananou

Ouladsine

. Computationally efficient algorithm for atrial fibrillation detection using linear and geometric features of RR time-series derivatives. Paper presented at: 2022 International Conference on Control, Automation and Diagnosis (ICCAD); July 13-15, 2022; Lisbon, Portugal. doi:10.1109/ICCAD55197.2022.9853910

The best academic research databases [2019 update]—Paperpile. Accessed January 13, 2021. https://paperpile.com/g/academic-research-databases/

ResearchGate. In: Wikipedia. 2020. https://en.wikipedia.org/wiki/ResearchGate

Sumathi

Beaulah

Vanithamani

A wavelet transform based feature extraction and classification of cardiac disorder. J Med Syst. 2014;38:98. doi:10.1007/s10916-014-0098-x

10.

Irfan

Anjum

Althobaiti

Alotaibi

Siddiqui

Ramzan

Heartbeat classification and arrhythmia detection using a multi-model deep-learning technique. Sensors. 2022;22:5606. doi:10.3390/s22155606

11.

Anbarasi

Ravi

Manjula

, et al. A modified deep learning framework for arrhythmia disease analysis in medical imaging using electrocardiogram signal. Biomed Res Int. 2022;2022:5203401. doi:10.1155/2022/5203401

12.

PhysioNet Databases. Accessed January 13, 2021. https://physionet.org/about/database/#open

13.

Yuan

Wang

Cui

Cao

Arrhythmia classification based on multi-domain feature extraction for an ECG recognition system. Sensors. 2016;16:1744. doi:10.3390/s16101744

14.

Patient Monitor Simulators Fluke Biomedical. Accessed January 13, 2021. https://www.flukebiomedical.com/products/biomedical-test-equipment/patient-monitor-simulators

15.

Ribeiro

Paixão

GMM

, et al. Automatic diagnosis of the 12-lead ECG using a deep neural network. Nat Commun. 2020;11:1760. doi:10.1038/s41467-020-15432-4

16.

Hannun

Rajpurkar

Haghpanahi

, et al. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nat Med. 2019;25:65-69. doi:10.1038/s41591-018-0268-3

17.

Why Zio iRhythm. Accessed January 13, 2021. https://www.irhythmtech.com/patients/why-zio

18.

Park

Kang

PcHD: personalized classification of heartbeat types using a decision tree. Comput Biol Med. 2014;54:79-88. doi:10.1016/j.compbiomed.2014.08.013

19.

Chen

Hong

Guo

Pang

A cascaded classifier for multi-lead ECG based on feature fusion. Comput Methods Programs Biomed. 2019;178:135-143. doi:10.1016/j.cmpb.2019.06.021

20.

Acharya

Hagiwara

, et al. A deep convolutional neural network model to classify heartbeats. Comput Biol Med. 2017;89:389-396. doi:10.1016/j.compbiomed.2017.08.022

21.

Yildirim

Baloglu

Tan

Ciaccio

Acharya

UR.

A new approach for arrhythmia classification using deep coded features and LSTM networks. Comput Methods Programs Biomed. 2019;176:121-133. doi:10.1016/j.cmpb.2019.05.004

22.

Yang

Wang

Zhang

A novel approach for multi-lead ECG classification using DL-CCANet and TL-CCANet. Sensors. 2019;19:3214. doi:10.3390/s19143214

23.

Yildirim

. A novel wavelet sequence based on deep bidirectional LSTM network model for ECG signal classification. Comput Biol Med. 2018;96:189-202. doi:10.1016/j.compbiomed.2018.03.016

24.

Gao

Zhang

Wang

An effective LSTM recurrent network to detect arrhythmia on imbalanced ECG dataset. J Healthc Eng. 2019;2019:6320651. doi:10.1155/2019/6320651

25.

Martis

Acharya

Lim

Mandana

Ray

Chakraborty

Application of higher order cumulant features for cardiac health diagnosis using ECG signals. Int J Neural Syst. 2013;23:1350014. doi:10.1142/S0129065713500147

26.

Anwar

Gul

Majid

Alnowami

Arrhythmia classification of ECG signals using hybrid features. Comput Math Methods Med. 2018;2018:1380348. doi:10.1155/2018/1380348

27.

Liu

S-H.

Arrhythmia identification with two-lead electrocardiograms using artificial neural networks and support vector machines for a portable ECG monitor system. Sensors. 2013;13:813-828.

28.

Elhaj

Salim

Harris

Swee

Ahmed

Arrhythmia recognition and classification using combined linear and nonlinear features of ECG signals. Comput Methods Programs Biomed. 2016;127:52-63. doi:10.1016/j.cmpb.2015.12.024

29.

Kim

Seo

Song

Kim

KS.

Assessment of electrocardiogram rhythms by GoogLeNet deep neural network architecture. J Healthc Eng. 2019;2019:2826901. doi:10.1155/2019/2826901

30.

Yıldırım Ö. Arrhythmia detection using deep convolutional neural network with long duration ECG signals. Comput Biol Med. 2018;10:411-420.

31.

EYK

Tan

Acharya

UR.

Automated beat-wise arrhythmia diagnosis using modified U-net on extended electrocardiographic recordings with heterogeneous arrhythmia types. Comput Biol Med. 2019;105:92-101. doi:10.1016/j.compbiomed.2018.12.012

32.

EYK

Tan

Acharya

UR.

Automated diagnosis of arrhythmia using combination of CNN and LSTM techniques with variable length heart beats. Comput Biol Med. 2018;102:278-287. doi:10.1016/j.compbiomed.2018.06.002

33.

Raj

Ray

KC.

Automated recognition of cardiac arrhythmias using sparse decomposition over composite dictionary. Comput Methods Programs Biomed. 2018;165:175-186. doi:10.1016/j.cmpb.2018.08.008

34.

Qin

Zhang

Yue

Liu

Combining low-dimensional wavelet features and support vector machine for arrhythmia beat classification. Sci Rep. 2017;7:6067. doi:10.1038/s41598-017-06596-z

35.

Rajagopal

Ranganathan

Design of a hybrid model for cardiac arrhythmia classification based on Daubechies wavelet transform. Adv Clin Exp Med. 2018;27:727-734. doi:10.17219/acem/68982

36.

Benali

Bereksi Reguig

Hadj Slimane

Automatic classification of heartbeats using wavelet neural network. J Med Syst. 2012;36:883-892. doi:10.1007/s10916-010-9551-7

37.

Rajesh

KNVPS

Dhuli

Classification of ECG heartbeats using nonlinear decomposition methods and support vector machine. Comput Biol Med. 2017;87:271-284. doi:10.1016/j.compbiomed.2017.06.006

38.

Yang

Wang

Guo

Automatic recognition of arrhythmia based on principal component analysis network and linear support vector machine. Comput Biol Med. 2018;101:22-32. doi:10.1016/j.compbiomed.2018.08.003

39.

Ullah

Siddique

Zulqarnain

Alam

Ahmad

Raza

UA.

Classification of arrhythmia in heartbeat detection using deep learning. Comput Intell Neurosci. 2021;2021:2195922. doi:10.1155/2021/2195922

40.

Shahin

Ahmed

. Adversarial multi-task learning for robust end-to-end ECG-based heartbeat classification. Paper presented at: 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC); July 20-24, 2020; Montreal, QC, Canada. doi:10.1109/EMBC44109.2020.9175640

41.

Cui

Xiao

Liu

Deep learning-based data augmentation and model fusion for automatic arrhythmia identification and classification algorithms. Comput Intell Neurosci. 2022;2022:1577778. doi:10.1155/2022/1577778

42.

Sabut

Pandey

Mishra

BSP

Mohanty

Detection of ventricular arrhythmia using hybrid time–frequency-based features and deep neural network. Phys Eng Sci Med. 2021;44:135-145. doi:10.1007/s13246-020-00964-2

43.

Wang

Fan

Deep multi-scale fusion neural network for multi-class arrhythmia detection. IEEE J Biomed Health Inform. 2020;24:2461-2472. doi:10.1109/JBHI.2020.2981526

44.

Zubair

Yoon

Cost-sensitive learning for anomaly detection in imbalanced ECG data using convolutional neural networks. Sensors. 2022;22:4075. doi:10.3390/s22114075

45.

Chen

Zhou

A transformer-based deep neural network for arrhythmia detection using continuous ECG signals. Comput Biol Med. 2022;144:105325. doi:10.1016/j.compbiomed.2022.105325

46.

Feyisa

Debelee

Ayano

Kebede

Assore

TF.

Lightweight multireceptive field CNN for 12-lead ECG signal classification. Comput Intell Neurosci. 2022;2022:8413294. doi:10.1155/2022/8413294

47.

Zhang

Zhu

Study on a new deep bidirectional GRU network for electrocardiogram signals classification. Paper presented at: Proceedings of the 3rd International Conference on Computer Engineering, Information Science & Application Technology (ICCIA 2019); May 30-31, 2019; Chongqing, China. doi:10.2991/iccia-19.2019.54

48.

Wang

Rahardja

Fränti

Rahardja

Single-lead ECG recordings modeling for end-to-end recognition of atrial fibrillation with dual-path RNN. Biomed Signal Process Control. 2022;79:104067. doi:10.1016/j.bspc.2022.104067

49.

Wang

Automated detection of premature ventricular contraction based on the improved gated recurrent unit network. Comput Methods Programs Biomed. 2021;208:106284. doi:10.1016/j.cmpb.2021.106284

50.

Luo

Yang

Cai

Tang

Chen

Multi-classification of arrhythmias using a HCRNet on imbalanced ECG datasets. Comput Methods Programs Biomed. 2021;208:106258. doi:10.1016/j.cmpb.2021.106258

51.

Iftene

Burlacu

Gifu

Atrial fibrillation detection based on deep learning models. Procedia Comput Sci. 2022;207:3752-3760. doi:10.1016/j.procs.2022.09.436

52.

Pan

Tompkins

WJ.

A real-time QRS detection algorithm. IEEE T Biomed Eng. 1985; BME-32:230-236. doi:10.1109/TBME.1985.325532

53.

Guvenir

Acar

Muderrisoglu

Arrhythmia. UCI Machine Learning Repository; 1998. https://archive.ics.uci.edu/ml/datasets/arrhythmia

54.

TNMG dataset. Accessed October 20, 2022. https://zenodo.org/record/3765780

55.

Gupta

Bajpai

Dhiman

Soni

Gomathi

Mane

. Review of ECG arrhythmia classification using deep neural network [published online ahead of print May 22, 2021]. Mater Today Proc. doi:10.1016/j.matpr.2021.05.249

56.

Ebrahimi

Loni

Daneshtalab

Gharehbaghi

A review on deep learning methods for ECG arrhythmia classification. Expert Syst Appl. 2020;7:100033. doi:10.1016/j.eswax.2020.100033

57.

Annam

Kalyanapu

Somala

Raju

SB.

Classification of ECG heartbeat arrhythmia: a review. Procedia Comput Sci. 2020;171:679-688. doi:10.1016/j.procs.2020.04.074

58.

Jensen

Johnson

Floyd

Heckbert

Carnahan

Dublin

A systematic review of validated methods for identifying atrial fibrillation using administrative data: detection of atrial fibrillation in claims. Pharmacoepidemiol Drug Saf. 2012;21:141-147. doi:10.1002/pds.2317

59.

Tamariz

Harkins

Nair

A systematic review of validated methods for identifying ventricular arrhythmias using administrative and claims data: detection of ventricular arrhythmia in claims. Pharmacoepidemiol Drug Saf. 2012;21:148-153. doi:10.1002/pds.2340

60.

Sanamdikar

ST.

A literature review on arrhythmia analysis of ECG signal. IRJET. 2015;2(3):1-6.

61.

Luz

Schwartz

Cámara-Chávez

Menotti

ECG-based heartbeat classification for arrhythmia detection: a survey. Comput Methods Programs Biomed. 2016;127:144-164. doi:10.1016/j.cmpb.2015.12.008

62.

Parvaneh

Rubin

Babaeizadeh

Xu-Wilson

Cardiac arrhythmia detection using deep learning: a review. J Electrocardiol. 2019;57S:S70-S74. doi:10.1016/j.jelectrocard.2019.08.004

63.

Houssein

Kilany

Hassanien

AE.

ECG signals classification: a review. Int J Intell Eng Inform. 2017;5:376. doi:10.1504/IJIEI.2017.087944

A Literature Review: ECG-Based Models for Arrhythmia Diagnosis Using Artificial Intelligence Techniques

Abstract

Keywords

Introduction

Methods of Search and Selection

Search strategy

Study selection

Study mining: extraction of study characteristics

Results of Studies’ Exploration

Selection

Study design

Feature set

Pre-processing and prediction methods

Discussion and Interpretation

Used datasets and ECG signal information

Feature selection and extraction

Pre-processing methods

Prediction methods

Evaluation and performances

Contributions and comparison to other literature reviews results

Conclusions

Footnotes

Appendix 1

Appendix 2

Funding:

Declaration Of Conflicting Interests:

Author Contributions

References