IMR Press / RCM / Volume 25 / Issue 1 / DOI: 10.31083/j.rcm2501008
Open Access Systematic Review
Machine Learning for Detecting Atrial Fibrillation from ECGs: Systematic Review and Meta-Analysis
Show Less
1 Hunan Provincial Key Laboratory of TCM Diagnostics, Hunan University of Chinese Medicine, 410208 Changsha, Hunan, China
2 School of Acupuncture and Tui-na and Rehabilitation, Hunan University of Chinese Medicine, 410208 Changsha, Hunan, China
3 School of Chinese Medicine, Hunan University of Chinese Medicine, 410208 Changsha, Hunan, China
4 Cardiovascular Department, the First Hospital of Hunan University of Chinese Medicine, 410021 Changsha, Hunan, China
*Correspondence: ljhtcm1@163.com (Jianhe Liu); lianghao@hnucm.edu.cn (Hao Liang)
Rev. Cardiovasc. Med. 2024, 25(1), 8; https://doi.org/10.31083/j.rcm2501008
Submitted: 14 May 2023 | Revised: 11 August 2023 | Accepted: 29 August 2023 | Published: 8 January 2024
Copyright: © 2024 The Author(s). Published by IMR Press.
This is an open access article under the CC BY 4.0 license.
Abstract

Background: Atrial fibrillation (AF) is a common arrhythmia that can result in adverse cardiovascular outcomes but is often difficult to detect. The use of machine learning (ML) algorithms for detecting AF has become increasingly prevalent in recent years. This study aims to systematically evaluate and summarize the overall diagnostic accuracy of the ML algorithms in detecting AF in electrocardiogram (ECG) signals. Methods: The searched databases included PubMed, Web of Science, Embase, and Google Scholar. The selected studies were subjected to a meta-analysis of diagnostic accuracy to synthesize the sensitivity and specificity. Results: A total of 14 studies were included, and the forest plot of the meta-analysis showed that the pooled sensitivity and specificity were 97% (95% confidence interval [CI]: 0.94–0.99) and 97% (95% CI: 0.95–0.99), respectively. Compared to traditional machine learning (TML) algorithms (sensitivity: 91.5%), deep learning (DL) algorithms (sensitivity: 98.1%) showed superior performance. Using multiple datasets and public datasets alone or in combination demonstrated slightly better performance than using a single dataset and proprietary datasets. Conclusions: ML algorithms are effective for detecting AF from ECGs. DL algorithms, particularly those based on convolutional neural networks (CNN), demonstrate superior performance in AF detection compared to TML algorithms. The integration of ML algorithms can help wearable devices diagnose AF earlier.

Keywords
machine learning
atrial fibrillation
ECG
meta-analysis
1. Introduction

Atrial fibrillation (AF) is one of the most common arrhythmias with increasing prevalence, and it is often difficult to detect early due to its asymptomatic and paroxysmal presentation [1]. Detection of AF is mainly based on the electrocardiogram (ECG), which is challenging due to the risk of missed detection. AF is characterized by the disappearance of the P wave in each ECG lead, the replacement of the P wave by the F wave, and the atrial frequency of 350 to 600 times per minute with an irregular R-R interval. Cardiovascular adverse events such as heart failure and thromboembolism caused by AF can lead to increased morbidity and mortality [2]. Detecting paroxysmal AF is difficult with intermittent ECGs. Analyzing a large number of ECG signals generated by 24-hour Holter monitoring to detect AF requires significant human resources. How to timely detect AF and reduce the risk of missed detection while lowering medical costs remains challenging [3].

Machine learning (ML) is a subset of artificial intelligence (AI) that enables computers to learn from data and make predictions. Traditional machine learning (TML) algorithms, such as decision trees and support vector machines (SVM), require manual feature engineering. In contrast, deep learning (DL) algorithms, a new type of ML, use neural networks to automatically learn from data, making it ideal for unstructured data such as medical images and clinical text. DL has become a dominant approach within AI, revolutionizing various medical applications [4]. However, DL has drawbacks such as high computational complexity, poor portability, high hardware requirements, and difficulty in meeting the requirements of ordinary CPUs (Central-Processing-Units), making it challenging to apply DL algorithms on wearable devices.

ML has been used in recent years to analyze ECG signals for the detection of AF, with a focus on improving accuracy, and is now been better applied and developed in medical diagnosis and treatment [5]. With the continuous updating of technology and the continuous expansion of public datasets, various ML algorithms have been gradually applied for the clinical diagnosis of arrhythmias. For example, TML algorithms represented by SVM and DL algorithms represented by convolution neural network (CNN) and long short-term memory (LSTM) have been widely used to analyze ECG signals to detect AF [6]. Some new algorithms that combine DL and other methods have achieved better performance, such as Deepaware algorithms and DeepBeat [7, 8]. In addition, wearable devices such as smartwatches equipped with ECG sensors offer a convenient and non-invasive means of collecting ECG signals for AF detection. These devices provide a wealth of data that can be used to train and improve ML algorithms for better performance. Overall, ML has great potential to improve the efficiency and accuracy of AF detection.

However, the results of several studies have been inconsistent because various factors may affect the performance of the algorithms [7, 9, 10, 11, 12, 13], such as the selection of signals, the use of TML or DL algorithms, and the included datasets. It is, therefore, necessary to systematically review and meta-analyze ML in the detection of AF in ECGs, summarize sensitivity, specificity, and area under curve (AUC) of various algorithms, and investigate whether the results are related to the above factors so as to evaluate the application of ML in the detection of AF in ECGs.

2. Methods

This study was performed in accordance with the guidelines of Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA-DTA) [14] and was part of work registered in INPLASY (No. 202310047). All analyses were based on previously published studies; thus, no ethical approval and patient consent was required.

2.1 Search Strategy

A comprehensive search strategy was designed and executed within PubMed, Web of Science, Embase, and Google Scholar databases from their inception until January 30, 2023. A combination of subject words and keywords was used to formulate this search strategy, with adjustments made to account for differences in the various databases. As an example, the fewer keywords or corresponding Medical Subject Headings (MeSH) were used for searching ((Atrial Fibrillation [MeSH Terms]) or (Auricular Fibrillation [Title/Abstract])) and ((Machine Learning [MeSH Terms]) or (Algorithms [Title/Abstract]) or (Artificial Intelligence [Title/Abstract]) or (Deep Learning [Title/Abstract])) and ((Electrocardiography [MeSH Terms]) or (Electrocardiography (EKG) [Title/Abstract]) or (ECG [Title/Abstract])). The complete search strategy is shown in Supplementary Fig. 1.

One investigator (ZW) designed and conducted the search strategy using input from the study’s principal investigator (CGX). The literature search process is shown in Fig. 1. From the initial pool of 902 studies, 140 studies were screened based on the assessment of their titles and abstracts. A full-text review was then conducted, resulting in the exclusion of 762 studies. Ultimately, 14 studies were included in the quality assessment and meta-analysis [15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28].

Fig. 1.

Flow chart of the selection process.

2.2 Study Selection

The two primary reviewers (ZW and CGX) screened abstracts and titles of the studies to assess their eligibility for inclusion in the meta-analysis. The criteria for eligibility were: (1) studies that focused on the detection of AF, (2) studies that developed AI models using ML algorithms, and (3) studies that utilized ECG signals obtained from AF patients or available AF datasets. The exclusion criteria were: (1) studies that only evaluated cardiovascular parameters without considering the disease status, (2) studies with a small sample size of patients (10) or ECG signals (100), (3) studies that did not report the quantitative performance metrics such as sensitivity and specificity, (4) studies of AF complicated with other diseases, (5) studies that used incomplete ECG signals and only analyzed specific ECG characteristics, and (6) the dataset used for the algorithm in studies must come from multiple different databases or patients or be reasonably split between training and validation datasets. Finally, 14 studies were deemed eligible for inclusion in the quality assessment and meta-analysis.

2.3 Data Extraction and Quality Assessment

Data parameters were determined before the literature search was extracted in the selected studies. We extracted the following information, if possible, from each study: authors, year of publication, study name, study tasks, datasets and their availability (proprietary datasets or public AF datasets), ML algorithm types, and performance measures (AUC, sensitivity, specificity). We evaluated the risk of bias in individual studies using the quality assessment of diagnostic accuracy studies-2 (QUADAS-2) tool [29].

2.4 Data Synthesis

We aimed to summarize the performance of ML algorithms that could be used to detect AF and present the results in the form of summary statistics. We extracted the number of true positive (TP), false positive (FP), false negative (FN), and true negative (TN) data from these studies. If these parameters were unavailable, they were calculated based on sample size and performance indicators such as sensitivity and specificity. In the event that these parameters could not be calculated, the study was excluded.

To mitigate the heterogeneity among the studies in the meta-analysis, meta-regression, and subgroup analysis were performed according to the type of algorithm (TML or DL), datasets and their availability (proprietary datasets or public datasets), and the number of used datasets (using a single dataset and multiple datasets).

2.5 Statistical Analysis

Bivariate and hierarchical summary receiver operating characteristic (HSROC) models were performed to jointly estimate sensitivity, specificity, and AUC. Meta-regression analysis and subgroup analysis were performed according to the above subgroups. All statistical analyses were performed using Revman version 5.4 (Cochrane Collaboration, London, UK), R version 4.2.1 (mada, lme4, lmtest, and msm packages, Vienna, Austria), and Stata version 14.1 (metandi and midas packages, College Station, TX, USA).

3. Results
3.1 Characteristics of Included Studies

Fourteen studies were included in the quality assessment and meta-analysis [15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28]. The characteristics of the studies are shown in Supplementary Table 1.

Ten studies [16, 18, 19, 21, 23, 24, 25, 26, 27, 28] used DL, while only 4 studies [15, 17, 20, 22] used TML. Among the included studies, the CNN algorithm was the most commonly used. In addition, a few studies [23, 28] used a combination of multiple algorithms.

Ten studies [15, 16, 17, 19, 20, 22, 23, 24, 25, 27] used public datasets alone or in combination with proprietary datasets, and many of them used the MIT-BIH AF database. The MIT-BIH AF database was provided by the Massachusetts Institute of Technology, which contained long-term ECG records of 25 patients with paroxysmal AF for use by researchers. The remaining 4 studies [18, 21, 26, 28] only used proprietary datasets, which usually came from ECG signals collected by medical institutions or smart wearable devices at home.

Eight studies [15, 17, 20, 22, 24, 25, 26, 27] used a single dataset as a data source, and only 6 studies [16, 18, 19, 21, 23, 28] used multiple datasets as a data source. While the use of a single dataset reduced the differences among data, it could also limit the generalizability of ML performance indicators to other datasets.

3.2 Quality Assessment

Regarding the assessment of bias risk using QUADAS-2, 14 relevant studies were evaluated across 4 domains: patient selection, index test, reference standard, and flow and timing [15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28].

For each domain assessed by the QUADAS-2 tool, the number of studies classified as having a high, unclear, or low risk of bias are as follows: patient selection (4 studies with high risk of bias, 4 with unclear risk, and 6 with low risk), index test (0 with high risk, 6 with unclear risk, and 8 with low risk), reference standard (0 with high risk, 7 with unclear risk, and 7 with low risk), and flow and timing (1 with high risk, 2 with unclear risk, and 11 with low risk). The detailed quality assessment of the included studies using the QUADAS-2 tool is shown in Supplementary Fig. 2.

3.3 ML Algorithms and Detection of Atrial Fibrillation in ECG

In total, 14 studies were included in the quantitative meta-analysis, and the algorithms had a high performance in the detection of AF in ECGs [15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28]. The forest plot of meta-analysis revealed high diagnostic performance for AF detection, with a sensitivity of 97% (95% confidence interval [CI]: 0.94–0.99) and a specificity of 97% (95% CI: 0.95–0.99) (Fig. 2).

Fig. 2.

Forest plot of machine learning for atrial fibrillation detection in electrocardiogram. CI, confidence interval.

The receiver operating characteristics (ROC) curves of meta-analysis showed that the majority of the studies had a high level of sensitivity and specificity, demonstrating their potential to effectively detect AF in ECG signals (Fig. 3).

Fig. 3.

Receiver operating characteristics curves of machine learning for atrial fibrillation detection in electrocardiogram. HSROC, hierarchical summary receiver operating characteristic.

High heterogeneity was observed among the included studies, evident from the Q-value of 9.6 × 1016 for sensitivity and 7.3 × 1016 for specificity. A meta-regression analysis and subgroup analyses were conducted to explore potential sources of heterogeneity.

3.4 Meta-Regression

According to the characteristics of the studies, we selected 3 factors (ML algorithm types, datasets, their availability, and number of used datasets) to conduct meta-regression and subgroup analysis for the included studies. The results showed that the heterogeneity of ML algorithm types (deep learning in Fig. 4) sensitivity and specificity was not significant (p > 0.05). In addition, the heterogeneity of datasets and their availability (Public Datasets in Fig. 4) and number of used datasets (Single Datasets in Fig. 4) specificity was not significant (p > 0.05). Therefore, ML algorithm types, datasets, and their availability and number of used datasets could not be the reasons for the high heterogeneity of the meta-analysis.

Fig. 4.

Meta-regression and subgroup analysis of machine learning for atrial fibrillation detection in electrocardiogram. CI, confidence interval.

3.5 Comparison of ML Algorithm Types

In the quantitative meta-analysis of included studies, DL algorithms achieved an AUC of 100% (95% CI: 0.99–1.00), while TML algorithms attained an AUC of 0.98 (95% CI: 0.97–0.99). DL algorithms exhibited higher performance in the detection of AF in ECGs as compared to TML algorithms. The results of the meta-analysis showed that DL algorithms had a higher diagnostic sensitivity as compared to TML algorithms.

Although the difference in specificity between them was not significant (p > 0.05), DL algorithms demonstrated a better overall performance (Fig. 5).

Fig. 5.

Summary of receiver operating characteristics curves of traditional machine learning and deep learning.

3.6 Comparison of the Datasets Types and Their Availability

The performance of using public datasets yielded an AUC of 100% (95% CI: 0.99–1.00), sensitivity of 97% (95% CI: 0.94–0.99), and specificity of 98% (95% CI: 0.96–0.99). In contrast, using proprietary datasets achieved an AUC of 0.99 (95% CI: 0.97–0.99), sensitivity of 96% (95% CI: 0.87–0.99), and specificity of 95% (95% CI: 0.83–0.99). When considering the number of datasets used, using single datasets had an AUC of 97% (95% CI: 0.96–0.98), sensitivity of 92% (95% CI: 0.83–0.97), and specificity of 93% (95% CI: 0.81–0.98). In contrast, using multiple datasets achieved an AUC of 100% (95% CI: 0.98–1.00), sensitivity of 98% (95% CI: 0.93–0.99), and specificity of 97% (95% CI: 0.90–0.99).

The analysis of the impact of dataset types and their availability on algorithm performance showed that the use of multiple datasets and public datasets demonstrated slightly better performance. However, there was no significant difference (p > 0.05) between using single datasets or multiple datasets and between using public datasets alone or in combination with proprietary datasets (Fig. 6A,B). These findings suggest that the availability and number of datasets are important factors in testing ML algorithms for AF detection and that the performance benefits of using multiple or public datasets may be relatively better.

Fig. 6.

Comparison: single vs. multiple datasets and public vs. proprietary datasets. (A) Summary of receiver operating characteristics curves of single datasets vs. multiple datasets. (B) Summary of receiver operating characteristics curves of using public datasets alone or in combination with proprietary datasets.

3.7 Publication Bias

The p-value, obtained by the Deeks’ Funnel Plot Asymmetry Test method, was 0.28, so there was no publication bias.

4. Discussion

In this systematic review and meta-analysis, the performance of ML designed for AF detection in ECGs was comprehensively evaluated. This research aimed to provide insights into the development of these algorithms through the synergistic synthesizing of sensitivity and specificity of each study and the conduct of meta-regression and subgroup analysis.

Three main sources of data for ML algorithms were identified: (1) public ECG databases, such as the MIT AF database [24, 27, 30], (2) ECG signals collected in medical institutions [21, 26], and (3) ECG signals collected by wearable or implantable devices [28]. The first two methods have lower interference with the ECG signal acquisition environment and are easier to obtain high-quality ECG signals, resulting in generally excellent accuracy of the algorithms. The third method collects ECG or photoelectric signals often influenced by other factors such as electromyography (EMG) signals. However, while implementing a particular algorithm is largely convenient for non-medical workers, the accuracy of the relevant algorithms involved needs to be improved [31].

In terms of dataset usage, ML algorithms can be trained using single or multiple datasets. The use of single datasets reduces performance differences caused by various data sources but lacks external validation, leading to a lack of credibility in performance. In many studies, the use of the same ML algorithm in different datasets has been showed to produce performance variations [16, 18, 32]. Data preprocessing is one of the methods to obtain high-quality ECG signals, which can transform the data into a form that is more accessible to ML, such as filtering, noise removal, and feature extraction [3]. Specific operations in each study can greatly affect the final performance indicators of ML algorithms.

ECG signals typically consist of the P wave, QRS wave group, and T wave, among others. While the extraction of specific waves or wave groups, such as the P wave, has been explored as a way to detect AF through ML, the performance of this approach has not been found to be optimal [33, 34]. Further optimization of algorithms and improvement of data extraction methods are required to enhance the accuracy.

Similarly, the performance of different ML algorithms differs. DL algorithms are generally superior to TML algorithms and are widely used in the detection of AF. DL algorithms represented by CNN are the most applicable. In some studies, CNN algorithms have been combined with other DL algorithms and even combined with TML algorithms in order to produce a new model [23, 35]. The combination of multiple ML algorithms can potentially become a novel and effective ML method for AF detection. The interpretability of DL is a major challenge to application in clinical practice. Despite the high accuracy of DL algorithms, medical workers could not explain the results to patients without an adequate explanation [36]. In clinical practice, the interpretability of DL algorithms remains a challenge as they are considered black boxes, and the results generated by them cannot be fully explained [37].

ML algorithms are primarily aimed at distinguishing AF ECG signals from non-AF signals and further classifying AF signals into different subtypes, such as paroxysmal atrial fibrillation (PAF) and persistent atrial fibrillation (PeAF). Furthermore, as clinical AF patients may also have other types of arrhythmia diseases or other types of cardiovascular diseases, the use of ML to distinguish AF from other types of arrhythmias, and even from other types of cardiovascular diseases, is a challenging task [38].

The other important goal of ML algorithms is to actively predict the onset time of AF. For healthy individuals, AF has not yet occurred, and they need to rely on ML to predict the risk of AF. In the case of a risk, it is necessary to continue to predict the time of future AF episodes. For individuals who have experienced AF, they need to rely on ML to predict the time of the next AF episode [13]. Due to the high similarity between pre-attack ECG signals and normal ECG signals, it is still difficult to actively predict the time of onset of AF, which requires research.

In order to address the limitations of large equipment in medical institutions, the use of portable and wearable or implantable ECG equipment has become a popular research area in medical and AI fields. Portable and wearable ECG devices play a significant role in the detection of AF, as they can effectively reduce the burden on the medical system by providing early screening for AF and its potential patients and providing internet data support for sharing relevant data between medical professionals and patients [39]. Our research results indicate that AI-based algorithms have the potential to revolutionize the field of AF detection and monitoring, especially when combined with portable and wearable ECG devices. The key advantage of these devices is their ability to continuously monitor patients in real-time and provide timely alerts for potential AF episodes [40]. With DL algorithms such as CNN, AF detection in these devices has reached a high level of accuracy and reliability, making them an ideal choice for large-scale applications. With the widespread use of intelligent wearable devices such as smartwatches in daily life, there is a great opportunity for improving the detection of AF with relevant algorithms [41].

The use of ML for AF detection on wearable devices presents several challenges. First, the limited processing power and memory capacity of wearable devices may hinder the deployment of complex ML algorithms, especially DL algorithms, which typically require significant resources for training and testing. Embedding ML algorithms into electronic health records (EHR) or developing ML algorithms with lower computational requirements that can be used directly for wearable devices may be an effective way to solve this problem [42]. In addition, the presence of motion artifacts and noise in ECG signals or photoplethysmogram (PPG) signals acquired by wearable devices may negatively impact the performance of ML algorithms, requiring robust signal preprocessing and noise reduction techniques [9]. Furthermore, ensuring the interpretability of wearable devices helps users and researchers to better accept relevant algorithms. Overcoming these challenges will be essential to apply ML in AF detection on wearable devices and enable widespread adoption for improved healthcare outcomes.

Real-time results can be obtained by intelligent wearable devices processing signals collected from individuals in their daily lives, but the performance may be reduced by data acquisition, preprocessing, and noise factors. The solution to this problem is the development of accurate data acquisition equipment, the invention of a lightweight data preprocessing mode, and the reduction of noise interference [31]. Currently, with continuous improvements in medical devices, the use of wearable ECG devices to monitor health can help identify potential AF patients [40]. However, existing studies demonstrated the need to improve the accuracy of AI models for wearable devices, and the development of ML algorithms that can be applied to wearable devices has become an area of active research issue.

Mobilenet is a lightweight DL model for arrhythmia classification in embedded wearable devices. This model uses multi-sensor units for data processing and classification [43]. Compared to the resnet model, mobilenet demonstrates higher efficiency, with a remarkable reduction in size from 743 MB to 76 KB (1/10,000) using model compression (TensorFlow Lite) while maintaining similar levels of accuracy [44]. Its model compression capability significantly reduces weight, making it an ideal choice for real-time AF detection in wearable devices.

A limitation of this systematic review and meta-analysis is the high heterogeneity of the included studies, which was partially addressed through meta-regression and subgroup analysis. However, inappropriate statistical analysis may have been generated during the quantitative comprehensive research.

5. Conclusions

In conclusion, ML algorithms perform effectively for AF detection. In terms of the type of ML algorithms, number of datasets, and dataset availability, using DL algorithms, multiple datasets, and public datasets enable better performance. DL algorithms, such as CNN, can be applied in clinical practice for AF detection in view of their superior performance. The integration of ML algorithms with wearable devices has the potential to transform the method of AF detection, enabling more accurate and personalized diagnosis, risk stratification, and therapy selection.

Availability of Data and Materials

All data generated or analyzed during this study are included in this published article.

Author Contributions

CGX and HL contributed to the original draft and designed the research. CGX, ZW, and HL performed the research. CLY and JHL provided assistance and advice on the data synthesis and statistical analysis. CGX and HL analyzed the data. All authors participated in reviewing and editing the manuscript, and all authors read and approved the final manuscript. All authors have participated sufficiently in the work and agreed to be accountable for all aspects of the work.

Ethics Approval and Consent to Participate

Not applicable.

Acknowledgment

Not applicable.

Funding

This research study was supported by the grants of National Natural Science Foundation of China (No. 82274411), Science and Technology Innovation Program of Hunan Province (No. 2022RC1021), Natural Science Foundation of Hunan Province (No. 2022JJ40300), National College Student Innovation and Entrepreneurship Training Program Project (No.202210541033), Hunan Provincial College Student Innovation and Entrepreneurship Training Program Project (No. S202310541016).

Conflict of Interest

The authors declare no conflict of interest.

References
[1]
Engdahl J, Rosenqvist M. Large-scale screening studies for atrial fibrillation - is it worth the effort? Journal of Internal Medicine. 2021; 289: 474–492.
[2]
Staerk L, Sherer JA, Ko D, Benjamin EJ, Helm RH. Atrial Fibrillation: Epidemiology, Pathophysiology, and Clinical Outcomes. Circulation Research. 2017; 120: 1501–1517.
[3]
Rizwan A, Zoha A, Mabrouk IB, Sabbour HM, Al-Sumaiti AS, Alomainy A, et al. A Review on the State of the Art in Atrial Fibrillation Detection Enabled by Machine Learning. IEEE Reviews in Biomedical Engineering. 2021; 14: 219–239.
[4]
Akay A, Hess H. Deep Learning: Current and Emerging Applications in Medicine and Technology. IEEE Journal of Biomedical and Health Informatics. 2019; 23: 906–920.
[5]
Goecks J, Jalili V, Heiser LM, Gray JW. How Machine Learning Will Transform Biomedicine. Cell. 2020; 181: 92–101.
[6]
Murat F, Sadak F, Yildirim O, Talo M, Murat E, Karabatak M, et al. Review of Deep Learning-Based Atrial Fibrillation Detection Studies. International Journal of Environmental Research and Public Health. 2021; 18: 11302.
[7]
Kumar D, Peimankar A, Sharma K, Domínguez H, Puthusserypady S, Bardram JE. Deepaware: A hybrid deep learning and context-aware heuristics-based model for atrial fibrillation detection. Computer Methods and Programs in Biomedicine. 2022; 221: 106899.
[8]
Torres-Soto J, Ashley EA. Multi-task deep learning for cardiac rhythm detection in wearable devices. NPJ Digital Medicine. 2020; 3: 116.
[9]
Kwon S, Hong J, Choi EK, Lee B, Baik C, Lee E, et al. Detection of Atrial Fibrillation Using a Ring-Type Wearable Device (CardioTracker) and Deep Learning Analysis of Photoplethysmography Signals: Prospective Observational Proof-of-Concept Study. Journal of Medical Internet Research. 2020; 22: e16443.
[10]
Ebrahimzadeh E, Kalantari M, Joulani M, Shahraki RS, Fayaz F, Ahmadi F. Prediction of paroxysmal Atrial Fibrillation: A machine learning based approach using combined feature vector and mixture of expert classification on HRV signal. Computer Methods and Programs in Biomedicine. 2018; 165: 53–67.
[11]
Alhusseini MI, Abuzaid F, Rogers AJ, Zaman JAB, Baykaner T, Clopton P, et al. Machine Learning to Classify Intracardiac Electrical Patterns During Atrial Fibrillation: Machine Learning of Atrial Fibrillation. Circulation. Arrhythmia and Electrophysiology. 2020; 13: e008160.
[12]
Wasserlauf J, You C, Patel R, Valys A, Albert D, Passman R. Smartwatch Performance for the Detection and Quantification of Atrial Fibrillation. Circulation. Arrhythmia and Electrophysiology. 2019; 12: e006834.
[13]
Attia ZI, Noseworthy PA, Lopez-Jimenez F, Asirvatham SJ, Deshmukh AJ, Gersh BJ, et al. An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: a retrospective analysis of outcome prediction. The Lancet. 2019; 394: 861–867.
[14]
McInnes MDF, Moher D, Thombs BD, McGrath TA, Bossuyt PM, Clifford T, et al. Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies: The PRISMA-DTA Statement. JAMA. 2018; 319: 388–396.
[15]
Mei Z, Gu X, Chen H, Chen W. Automatic atrial fibrillation detection based on heart rate variability and spectral features. IEEE Access. 2018; 6: 53566–53575.
[16]
Erdenebayar U, Kim H, Park JU, Kang D, Lee KJ. Automatic Prediction of Atrial Fibrillation Based on Convolutional Neural Network Using a Short-term Normal Electrocardiogram Signal. Journal of Korean Medical Science. 2019; 34: e64.
[17]
Czabanski R, Horoba K, Wrobel J, Matonia A, Martinek R, Kupka T, et al. Detection of Atrial Fibrillation Episodes in Long-Term Heart Rhythm Signals Using a Support Vector Machine. Sensors. 2020; 20: 765.
[18]
Baek YS, Lee SC, Choi W, Kim DH. A new deep learning algorithm of 12-lead electrocardiogram for identifying atrial fibrillation during sinus rhythm. Scientific Reports. 2021; 11: 12818.
[19]
Seo HC, Oh S, Kim H, Joo S. ECG data dependency for atrial fibrillation detection based on residual networks. Scientific Reports. 2021; 11: 18256.
[20]
Shi J, Chen C, Liu H, Wang Y, Shu M, Zhu Q. Automated Atrial Fibrillation Detection Based on Feature Fusion Using Discriminant Canonical Correlation Analysis. Computational and Mathematical Methods in Medicine. 2021; 2021: 6691177.
[21]
Taniguchi H, Takata T, Takechi M, Furukawa A, Iwasawa J, Kawamura A, et al. Explainable Artificial Intelligence Model for Diagnosis of Atrial Fibrillation Using Holter Electrocardiogram Waveforms. International Heart Journal. 2021; 62: 534–539.
[22]
Wu C, Hwang M, Huang TH, Chen YMJ, Chang YJ, Ho TH, et al. Application of artificial intelligence ensemble learning model in early prediction of atrial fibrillation. BMC Bioinformatics. 2021; 22: 93.
[23]
Tutuko B, Rachmatullah MN, Darmawahyuni A, Nurmaini S, Tondas AE, Passarella R, et al. Short Single-Lead ECG Signal Delineation-Based Deep Learning: Implementation in Automatic Atrial Fibrillation Identification. Sensors. 2022; 22: 2329.
[24]
Faust O, Shenfield A, Kareem M, San TR, Fujita H, Acharya UR. Automated detection of atrial fibrillation using long short-term memory network with RR interval signals. Computers in Biology and Medicine. 2018; 102: 327–335.
[25]
Buscema PM, Grossi E, Massini G, Breda M, Della Torre F. Computer Aided Diagnosis for atrial fibrillation based on new artificial adaptive systems. Computer Methods and Programs in Biomedicine. 2020; 191: 105401.
[26]
Cai W, Chen Y, Guo J, Han B, Shi Y, Ji L, et al. Accurate detection of atrial fibrillation from 12-lead ECG using deep neural network. Computers in Biology and Medicine. 2020; 116: 103378.
[27]
Jin Y, Qin C, Huang Y, Zhao W, Liu C. Multi-domain modeling of atrial fibrillation detection with twin attentional convolutional long short-term memory neural networks. Knowledge-Based Systems, 2020; 193: 105460.
[28]
Lai D, Bu Y, Su Y, Zhang X, Ma CS. Non-Standardized Patch-Based ECG Lead Together With Deep Learning Based Algorithm for Automatic Screening of Atrial Fibrillation. IEEE Journal of Biomedical and Health Informatics. 2020; 24: 1569–1578.
[29]
Whiting PF, Rutjes AWS, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Annals of Internal Medicine. 2011; 155: 529–536.
[30]
Yue Y, Chen C, Liu P, Xing Y, Zhou X. Automatic Detection of Short-Term Atrial Fibrillation Segments Based on Frequency Slice Wavelet Transform and Machine Learning Techniques. Sensors. 2021; 21: 5302.
[31]
Lee S, Chu Y, Ryu J, Park YJ, Yang S, Koh SB. Artificial Intelligence for Detection of Cardiovascular-Related Diseases from Wearable Devices: A Systematic Review and Meta-Analysis. Yonsei Medical Journal. 2022; 63: S93–S107.
[32]
Herraiz ÁH, Martínez-Rodrigo A, Bertomeu-González V, Quesada A, Rieta JJ, Alcaraz R. A Deep Learning Approach for Featureless Robust Quality Assessment of Intermittent Atrial Fibrillation Recordings from Portable and Wearable Devices. Entropy. 2020; 22: 733.
[33]
Yang HW, Hsiao CY, Peng YQ, Lin TY, Tsai LW, Lin C, et al. Identification of Patients with Potential Atrial Fibrillation during Sinus Rhythm Using Isolated P Wave Characteristics from 12-Lead ECGs. Journal of Personalized Medicine. 2022; 12: 1608.
[34]
Tzou HA, Lin SF, Chen PS. Paroxysmal atrial fibrillation prediction based on morphological variant P-wave analysis with wideband ECG and deep learning. Computer Methods and Programs in Biomedicine. 2021; 211: 106396.
[35]
Nguyen QH, Nguyen BP, Nguyen TB, Do TT, Mbinta JF, Simpson CR. Stacking segment-based CNN with SVM for recognition of atrial fibrillation from single-lead ECG recordings. Biomedical Signal Processing and Control. 2021; 68: 102672.
[36]
Mousavi S, Afghah F, Acharya UR. HAN-ECG: An interpretable atrial fibrillation detection model using hierarchical attention networks. Computers in Biology and Medicine. 2020; 127: 104057.
[37]
Rosenberg MA. Trusting Magic: Interpretability of Predictions From Machine Learning Algorithms. Circulation. 2021; 143: 1299–1301.
[38]
Fuadah YN, Lim KM. Optimal Classification of Atrial Fibrillation and Congestive Heart Failure Using Machine Learning. Frontiers in Physiology. 2022; 12: 761013.
[39]
Mäkynen M, Schlindwein FS. Wearable Devices Combined with Artificial Intelligence-A Future Technology for Atrial Fibrillation Detection? Sensors. 2022; 22: 8588.
[40]
Lubitz SA, Faranesh A, Selvaggi C, Atlas S, McManus DD, Singer DE, et al. Detection of Atrial Fibrillation in a Large Population Using Wearable Devices: The Fitbit Heart Study. Circulation. 2022; 146: 1415–1424.
[41]
Papaccioli G, Bassi G, Lugi C, Parente E, D’Andrea A, Proietti R, et al. Smartphone and new tools for atrial fibrillation diagnosis: evidence for clinical applicability. Minerva Cardiology and Angiology. 2022; 70: 616–627.
[42]
Duncker D, Ding WY, Etheridge S, Noseworthy PA, Veltmann C, Yao X, et al. Smart Wearables for Cardiac Monitoring-Real-World Use beyond Atrial Fibrillation. Sensors. 2021; 21: 2539.
[43]
Huang W, Song A, Wan B, Guo J. Wearable health monitoring system based on layered 3D-Mobilenet. Procedia Computer Science. 2022; 202: 373–378.
[44]
Lee KS, Park HJ, Kim JE, Kim HJ, Chon S, Kim S, et al. Compressed Deep Learning to Classify Arrhythmia in an Embedded Wearable Device. Sensors. 2022; 22: 1776.

Publisher’s Note: IMR Press stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share
Back to top