Abstract
Objective
Early diagnosis of atrial fibrillation (AFib) is a priority for stroke prevention. We sought to test four commercial pulse detection systems (CPDSs) for ability to distinguish AFib from normal sinus rhythm using a published algorithm (Zhou et al., PLoS One 2015;10:e0136544), compared with visual diagnosis by electrocardiogram inspection.
Methods
BAYathlon was a prospective, non-interventional, single-centre study. Adult cardiology patients with documented AFib or sinus rhythm who were due to have a routine 5-min electrocardiogram were randomized to undergo a parallel 5-min pulse assessment with a Polar V800, eMotion Faros 360, TomTom heart rate monitor, or Adidas miCoach Smart Run.
Results
144 patients (73 with AFib, 71 with sinus rhythm (based on electrocardiograms); median age: 73 years; 53.5% male) were analysed. Algorithm sensitivities (primary endpoint) and specificities for AFib when applied to CPDS recordings were 93.3% and 94.1% with the Polar V800, 90.0% and 84.2% with the eMotion Faros 360, and 0% and 100% with the other CPDSs (analysis period: 127 heart rate signals + 2 min). When applied to routine electrocardiograms, the algorithm correctly detected AFib in 71/73 patients. Different analysis periods (127 heart rate signals +1 or 3 min) only slightly changed the sensitivities with the Polar V800 and eMotion Faros 360 and had no effect on the sensitivities with the other CPDSs.
Conclusion
AFib screening using the applied algorithm is feasible with the Polar V800 and eMotion Faros 360 (which provide RR interval data) but not with the other CPDSs (which provide pre-processed heart rate time series).
ClinicalTrials.gov identifier: NCT02875106
Introduction
Atrial fibrillation (AFib) is common, with an estimated age-dependent prevalence of ∼3% in adults aged >20 years, and is a major cause of stroke, heart failure, sudden death and cardiovascular morbidity. 1 Early diagnosis of AFib is a recognized priority for the prevention of strokes. AFib is detected as irregular RR intervals and absence of distinct P waves in an electrocardiogram (ECG). 1 However, AFib generally begins with paroxysmal episodes 2 which can be missed by short-term monitoring during office visits.3,4 Moreover, about one-third of patients with AFib are asymptomatic. 5 The consequences of paroxysmal and asymptomatic AFib remain severe. The risk of thromboembolic events associated with paroxysmal AFib was found to be as high as that associated with persistent/permanent AFib in the Euro Heart Survey 6 and the AFib Clopidogrel Trial With Irbesartan for Prevention of Vascular Events (ACTIVE W) trial. 7 Outcomes in asymptomatic AFib have been reported to be similar to or worse than outcomes in symptomatic AFib. 5
Long-term monitoring methods are therefore needed to improve the detection of asymptomatic and paroxysmal AFib in populations at risk of developing AFib or stroke (e.g. the elderly, those with hypertension, heart failure or diabetes mellitus, and especially those with ischaemic stroke1,5). There is likely to be an optimal screening duration and frequency for the detection of paroxysmal AFib in at-risk populations. Implanted devices are used to achieve continuous monitoring in patients with cryptogenic stroke, 8 but such invasive monitoring would not be suitable for AFib screening in broader populations. Many non-invasive mobile health devices have been evaluated as AFib screening tools, with results showing variable accuracy depending on the technology used, the application setting and the study population. 9
An AFib screening algorithm that is compatible with multiple commercial pulse detection systems (CPDSs) may help to increase the accessibility of AFib screening by addressing different consumer/researcher needs and budgets. The aim of the BAYathlon study was to test four CPDSs (Polar V800 with a Polar H7 chest strap monitor (Polar Electro Oy, Kempele, Finland), eMotion Faros 360 set with belt (Mega Electronics Ltd, Kuopio, Finland; now marketed as Bittium Faros 360 by Bittium Corporation, Oulu, Finland), TomTom heart rate monitor (HRM; TomTom, Amsterdam, the Netherlands)) and Adidas miCoach Smart Run (Adidas AG, Herzogenaurach, Germany)) for their ability to distinguish AFib from normal sinus rhythm using a previously published AFib detection algorithm. 10 The CPDSs were chosen for their ease of use, lossless data recording during a wide range of physical activities, open access to the data, and exported data quality for use with AFib detection algorithms.
Methods
Study design and patients
BAYathlon was a prospective, non-interventional, company-sponsored, feasibility study conducted in a single private cardiac practice (the Cardio Centrum Düsseldorf, Düsseldorf, Germany) which routinely conducts ECG recordings. Consecutive adult female and male patients were eligible for inclusion if their treating cardiologist had decided that they should have an ECG recorded as part of routine clinical practice, and if they had AFib or sinus rhythm documented in their medical history. Patients were excluded if they had a pacemaker or any comorbidities or abnormalities of heart function or rhythm which might, in the opinion of the investigator, interfere with the evaluation of study data. Participation in an investigational program with interventions outside of routine clinical practice was also an exclusion criterion.
The investigator collected demographic data from medical records if available, or by interviewing the patient. Enrolled patients were randomly assigned to undergo assessment with one of four CPDSs (Polar V800 with a Polar H7 chest strap monitor, eMotion Faros 360 set with belt, TomTom HRM and Adidas miCoach Smart Run) in parallel with a routine ECG recording using a certified medical device (Schiller CS-200 or AT-104; Schiller AG, Baar, Switzerland) (Figure 1). Randomization was stratified by patient population (AFib or normal sinus rhythm in medical history). The CPDS and routine ECG data were captured and exported via an open interface into the study database over a recording period of 5 min, during which the patients were at rest. As a reference, the routine ECG data were visually reviewed by the investigator to determine a diagnosis of AFib or sinus rhythm (without automated input).

Parallel assessment by routine electrocardiogram and a commercial pulse detection system. Patients underwent parallel assessment by routine electrocardiogram and a commercial pulse detection system (the Polar V800 in this example) for 5 minutes.
The study protocol was approved by the Independent Ethics Committee (Ethik-Kommission der Ärztekammer Nordrhein, Düsseldorf; file reference: 2016/333). All participating patients provided signed informed consent. The study is registered on ClinicalTrials.gov (identifier: NCT02875106).
Data processing for automatic detection of AFib by the algorithm
Automatic classification of CPDS and ECG data as AFib or non-AFib required four processing steps, described below and summarized in Figure 2.

Overview of data processing steps. Patients with AFib or sinus rhythm underwent parallel assessment by routine ECG and one of four commercial pulse detection systems. Data were analysed using the AFib detection algorithm of Zhou et al. 10
1. Conversion of raw data to heart rate (HR) time series
The TomTom HRM and Adidas miCoach Smart Run deliver pre-processed pulse detection data as a HR time series, bypassing the need for an initial processing step.
The Polar V800 provides pulse detection data as a list of consecutive RR intervals (tRR). This list was converted to a HR time series with the following formula:
The eMotion Faros 360 provides pulse detection data as a one-channel ECG record with a sampling rate of 1000 entries per second. RR intervals were extracted from the raw data using the Java software library OSEA-4-Java (Version 1.0.0, https://github.com/MEDEVIT/OSEA-4-Java) 11 and converted to a HR time series using the formula described above for the Polar V800.
The Schiller CS-200 and AT-104 provide a 12-channel ECG record. For automatic classification via the algorithm, channel number 1 was analysed as described above for the eMotion Faros 360, using the Java software library OSEA-4-Java (for diagnosis by visual ECG inspection, the investigator reviewed all 12 channels).
To explore the effect of data pre-processing on the function of the algorithm, a moving average of the RR intervals from the Polar V800 and the eMotion Faros 360 was calculated for different window sizes (2–30 RR intervals) starting at the data entry with index w/2 and ending at index N − w/2, where w is the window size and N is the number of entries in the data sample. The moving average values were then converted to a HR time series as described above.
2. Definition of analysis periods
Due to the design and working mechanism of the algorithm, analysis periods were constructed by taking the first 127 HR signals plus the following 1, 2 and 3 min of the HR time series.
The effect of data pre-processing on the function of the algorithm was evaluated using data from the whole 5-min recording period (N − w moving average values).
3. Detection of AFibepisodes
The data were searched for AFib episodes using the algorithm developed and published by Zhou et al. 10 Briefly, the algorithm identifies AFib episodes in a moving window containing 127 consecutive HR signals, which roughly corresponds to 2 min if an average HR of 64 beats/min is assumed. Zhou et al. convert the symbol sequence into a word sequence using a novel operator. The probability distribution of the word sequence in the specified space is obtained, and a coarser version of Shannon entropy (SE) is next employed to discriminate the AFib arrhythmias. The novelty of this proposed method is based on the HR which was firstly introduced in the field of AFib detection. 10
4. Determination of overall classification (AFib/non-AFib)
If the recording interval contained ≥5 consecutive AFib episodes, the interval was classified as AFib. Otherwise, it was classified as non-AFib (sinus rhythm).
Endpoints
Taking the investigator’s diagnosis based on visual assessment of the routine ECG data as the reference, the primary endpoint was the rate of correctly detected AFib using the CPDS based on an analysis period of 127 HR signals + 2 min (sensitivity of the CPDS for AFib).
The secondary endpoints were the rate of correctly detected non-AFib cases using the CPDS based on the analysis period of 127 HR signals + 2 min (specificity of the CPDS for AFib), the rate of correctly detected AFib when applying the AFib algorithm to the ECG data over the whole 5-min recording period (sensitivity of the algorithm for AFib), and the rate of correctly detected AFib using the CPDS based on analysis periods of 127 HR signals + 1 min and + 3 min (sensitivity analysis in different time frames).
Statistical analysis
Statistical analyses were exploratory and descriptive. The script language Python 3.6.7 (https://www.python.org/; Python Software Foundation, Beaverton, Oregon, USA) was used with the packages NumPy 1.13.3 (https://numpy.org/) and Pandas 0.23.4 (https://pandas.pydata.org/). All analyses were performed in the total study population. Missing data were not replaced. Sensitivity and specificity for AFib were defined as
Sensitivity and specificity are presented with 95% confidence intervals (Wilson confidence limits). Cohen’s κ 12 was calculated to judge the level of agreement with the investigator’s diagnosis based on the routine ECG data.
For an expected sensitivity and specificity of 100.0% for the diagnosis of AFib, the sensitivity and specificity can be estimated with a precision of 16.1% (95% Wilson confidence limits). Based on the expected sensitivity and specificity and the width of the confidence interval in the PULSE-SMART study, 13 a sample size of 20 patients with AFib and 20 patients with sinus rhythm per CPDS was chosen.
The effect of data pre-processing on the algorithm output was evaluated descriptively in patients with AFib (diagnosed by the investigator based on the routine ECG) who were assessed using the Polar V800 or the eMotion Faros 360.
Results
Patients
In total, 163 patients were screened, 161 patients were enrolled, and 144 had full data available for analysis (Figure 3). In each of the four CPDS groups, the median age was higher in patients with a medical history of AFib than in those with sinus rhythm (Table 1).

Patient disposition. Of 163 patients who were screened, 161 were enrolled and 144 were analysed.
Patient demographics.
Age is reported as median (minimum–maximum).
HRM: heart rate monitor.
In three of 74 patients with AFib documented in their medical history, AFib was detected in the ECG data by the investigator but not by the algorithm. In a post hoc analysis, the ECGs from these three patients were re-evaluated by the investigators and one was found to have been incorrectly classified by them. This patient was re-classified as having sinus rhythm, giving a total of 73 patients with AFib and 71 with sinus rhythm diagnosed by the investigator based on the routine ECG.
Sensitivity and specificity of CPDSs and algorithm for AFib
The sensitivity (primary endpoint) and specificity of the algorithm for correctly detected AFib when applied to CPDS recordings for an analysis period of 127 HR signals + 2 min were 93.3% and 94.1%, respectively, with the Polar V800, 90.0% and 84.2%, respectively, with the eMotion Faros 360, and 0% and 100%, respectively, with the other CPDSs (Table 2).
Sensitivity and specificity of AFib detection algorithm when applied to recordings from CPDSs and ECGs.
Reference is investigator’s diagnosis of AFib/sinus rhythm based on manual review of the ECG data. Analysis period is 5 min for the routine ECG and 127 HR signals + 2 min for the CPDSs, except where otherwise specified.
AFib: atrial fibrillation; CPDSs: commercial pulse detection systems; ECG: electrocardiogram; HR: heart rate; HRM: heart rate monitor.
Five recordings with the Polar V800 (one in a patient with AFib and four in patients with sinus rhythm) provided HR rather than RR intervals because of technical issues during data acquisition.
Pre-processing of the RR interval data affected the algorithm output. As the window size for averaging of RR intervals increased, the rate of correct detection of AFib decreased, falling to zero at window sizes of 15 and 20 RR intervals for the Polar V800 and the eMotion Faros 360, respectively (Supplemental material online, Table S1). In one patient with AFib, the Polar V800 provided HR rather than RR interval data because of technical issues during data acquisition – this patient was also not correctly identified by the algorithm.
The sensitivity of the algorithm for correctly detected AFib based on 5 min of ECG data was 97.3% (Table 2). The two undetected cases showed, according to the investigators, pseudo-arrhythmic AFib (Figure 4(a)) and AFib with partly regular transition (Figure 4(b)). A Poincaré plot from a patient with correctly detected AFib is shown in Figure 4(c). The associated Poincaré plots for the undetected cases are presented in Figure 4(d) (pseudo-arrhythmic AFib) and 4(e) (partly regular transition).

(a)–(b) ECGs from two patients with AFib who were incorrectly classified by the algorithm based on their routine ECG data, and (c) a Poincaré plot of correctly detected AFib. The ECGs and Poincaré plots of the two incorrectly classified patients showed (a) and (d) pseudo-arrhythmic AFib and (b) and (e) AFib with partly regular transition.
The sensitivities and specificities of the algorithm for correctly detected AFib based on recordings from the Polar V800 and eMotion Faros 360 showed only slight changes when different analysis periods were used (Table 2). The use of different analysis periods had no effect on the sensitivity of the algorithm for AFib when applied to recordings from the other CPDSs. The level of agreement with the investigator’s diagnosis (Cohen’s κ) also showed little change when different analysis periods were used (Table 3).
Cohen’s κa for agreement of AFib detection algorithm (applied to recordings from CPDS and ECG) with investigator’s diagnosis of AFib/sinus rhythm (based on visual assessment of the ECG data).
AFib: atrial fibrillation; CPDSs: commercial pulse detection systems; ECG: electrocardiogram; HR: heart rate; HRM: heart rate monitor.
The strength of κ > 0.60 and ≤0.80 is considered ‘substantial’, and κ > 0.80 is considered ‘almost perfect’. 14
Five of the 32 recordings with the Polar V800 (one in patients with AFib and four in patients with sinus rhythm) provided HR rather than RR intervals because of technical issues during data acquisition.
Discussion
In the BAYathlon study, the AFib screening algorithm of Zhou et al. 10 showed good sensitivity for AFib when used with CPDSs that provided raw RR interval data (Polar V800 and eMotion Faros 360) but not when used with CPDSs that provided pre-processed HR time series (TomTom HRM and Adidas miCoach Smart Run). The algorithm also correctly identified 71/73 patients with AFib when applied to routinely collected ECG data. As the algorithm used a moving time window of 127 consecutive HR signals, it showed no large differences in sensitivity for AFib when used over different time intervals.
Algorithms for automated AFib detection have mostly been based on RR interval irregularity in ECGs. 10 These algorithms have limitations and in some cases their complexity renders them unsuitable for use in wearable devices. The algorithm developed by Zhou et al. and used in the BAYathlon study provides a real-time and low-complexity method for discriminating AFib episodes based on HR instead of RR interval irregularity. 10 However, the performance of the algorithm (similar to other algorithms for AFib screening) is dependent on the training data set and may be susceptible to errors in certain circumstances, for example in the presence of extra systoles, intracardiac devices and/or atrial flutter.
In the BAYathlon study, the algorithm missed two cases of AFib when applied to routine ECG data. After post hoc review of the ECGs, these cases were considered to be closer to atrial flutter than AFib. Irregular RR intervals are common in atrial flutter, 15 but the lack of a well-defined QRS complex in atrial flutter may prevent the correct detection of QRS complexes by the OSEA-4-Java library, which may in turn prevent the algorithm of Zhou et al. from detecting the irregularities. Re-optimizing the complexity threshold of the algorithm by training on AFib and atrial flutter series may also increase the specificity of AFib detection in the presence of other arrhythmias.
When the algorithm was applied to read-outs from the Adidas miCoach Smart Run and the TomTom HRM, no patients with AFib could be detected. These results suggest that the algorithm does not detect AFib when used with devices that provide pre-processed HR time series rather than RR intervals. Although it is unclear precisely how the Adidas miCoach Smart Run and the TomTom HRM pre-processed their data, we found that averaging RR intervals over a window of 15 or 20 RR intervals prevented the algorithm from detecting AFib. In the single case in which the Polar V800 provided HR rather than RR interval data (resulting in incorrect classification as non-AFib (sinus rhythm)), the data output was also most likely averages of consecutive HR values.
According to a European Heart Rhythm Association consensus document, screening for AFib is advised in high risk populations because of its cost-effectiveness. 5 The use of consumer devices in combination with an AFib detection app may provide new insights that could improve the efficiency of health service provision. For example, the recent Huawei Heart study in China showed geographical variation in the proportion of detected AFib (using Huawei wristbands/watches with a photoplethysmography algorithm), suggesting a need for different prevention approaches in different regions. 16 To offer clinicians an alternative for AFib screening, Fraunhofer and Bayer have developed the BAYathlon app (Supplemental material online, Figure S1), which uses the same source data (RR intervals) and algorithm as described in the current study (but was not used as a data collection tool in this study). The app performs ongoing HR and activity plotting and alerts the user if and to what extent AFib is detected. In contrast to the Apple and Huawei AFib detection systems which are dedicated to specific devices, the BAYathlon app aims to have broad compatibility with devices with open interfaces that provide raw RR interval data. The BAYathlon app may therefore facilitate use of the AFib screening algorithm of Zhou et al. 10 in a real-life clinical setting.
While further studies are required to validate the use of the algorithm with other CPDSs, our results suggest that the algorithm may be used with the Polar V800 and the eMotion Faros 360 to screen for AFib. Both devices used electrode-containing chest strap monitors. The Polar V800 chest strap monitor (Polar H7) was previously shown to have high sensitivity (96.3% (95% confidence interval: 89.7–99.2%)) and specificity (98.2% (95% confidence interval: 96.2–99.3%)) for the detection of AFib (using an algorithm that employed turning points, clustering within Lorentz plots and Shannon entropy) when compared with a 12-lead ECG interpreted by a panel of cardiologists.17,18 The Polar H7 also had better agreement with ECG data during exercise than eight different optically-based forearm- or wrist-worn HR monitors across two studies, suggesting that electrode-containing chest monitors may be more suitable than optical sensors for accurate assessment of HR during exercise.19,20 Athletes might therefore prefer chest-worn ECG devices such as the Polar V800 and the eMotion Faros 360 over optical sensors for HR measurement. As athletes are at significantly increased risk of AFib compared with the general population,21,22 addition of an AFib screening algorithm to chest-worn ECG devices may help to target AFib screening to an at-risk population.
Study limitations
Limitations of the BAYathlon study include its single-site design and limited sample size. There was an imbalance in age distribution across the device groups, particularly in the subset with sinus rhythm. The use of medications which may have influenced HR was not recorded. As BAYathlon was a feasibility study, further real-life studies are needed to provide more generalizable results. For example, all included patients had either AFib or sinus rhythm; the ability of the algorithm to distinguish AFib from other arrythmias was not investigated. ECG and CPDS recordings were taken at rest, so the influence of exercise on the results is not known. The complexity threshold of the algorithm (originally optimized for unprocessed RR series) was not adapted to the different device data evaluated in the current study. Two devices were not able to detect AFib; to compare this with the devices which were able to detect AFib, we calculated the moving average RR intervals over different window sizes (2–30 RR intervals; the window size of 1 indicates raw RR interval data without averaging) for the Polar V800 and the eMotion Faros 360 devices. The averaging of four RR intervals reduced the ability of AFib detection by half (Supplemental material online, Table S1), therefore the unprocessed recordings can be used for AFib detection with that algorithm and processed (averaged values) are not sufficient. Finally, the results of this study apply only to the four tested CPDSs; further studies are required to validate the use of the algorithm with other CPDSs that generate raw RR interval data.
Conclusions
The results of BAYathlon suggest that screening for AFib using the applied algorithm in combination with the Polar V800 and the eMotion Faros 360 devices is feasible when performed in resting individuals for a short interval of 127 HR signals + 1 to 3 min. Further studies are needed to evaluate the performance of the algorithm with other CPDSs that provide raw RR interval data and with long-term continuously collected CPDS data from active individuals. The BAYathlon app may facilitate such studies. Use of averaged rather than individual RR intervals reduces the ability of the algorithm to detect AFib, and the algorithm does not detect AFib when used with devices that provide pre-processed HR time series rather than RR intervals (TomTom HRM and Adidas miCoach Smart Run).
Supplemental Material
sj-pdf-1-dhj-10.1177_20552076211019620 - Supplemental material for Distinguishing atrial fibrillation from sinus rhythm using commercial pulse detection systems: The non-interventional BAYathlon study
Supplemental material, sj-pdf-1-dhj-10.1177_20552076211019620 for Distinguishing atrial fibrillation from sinus rhythm using commercial pulse detection systems: The non-interventional BAYathlon study by Christian Müller, Ulf Hengstmann, Michael Fuchs, Martin Kirchner, Frank Kleinjung, Harald Mathis, Stephan Martin, Ingo Bläse and Stefan Perings in Digital Health
Footnotes
Acknowledgements
The authors thank Daniel Wolff (Bayer AG, Wuppertal, Germany) for data management and Miriam Tamm (Bayer AG, Wuppertal, Germany) for statistical support. Editorial assistance was provided by Dr Claire Mulligan and Dr Paul Overton (Beacon Medical Communications Ltd, Brighton, UK), funded by Bayer Vital GmbH (Leverkusen, Germany), in accordance with Good Publication Practice (GPP3) guidelines (
).
Contributorship
CM and UH are responsible for the study design and drafting of this article, and contributed to the protocol, set-up, and conduct of the study. MF is responsible for the design and development of the data acquisition software for the study. MF contributed to the statistical analysis of the study, the study database administration and drafting of this article. MK is responsible for administration and drafting of this article and contributed to the conduct of the study. FK is responsible for statistical analysis and drafting of this article and contributed to conduct of the study. HM contributed to the concept and design of the study protocol, analysis and interpretation of data, and discussion of the manuscript content. SM contributed to the concept and design of the study protocol, analysis and interpretation of data, and discussion of the manuscript content. IB contributed to the conduct of the study, analysis and interpretation of data, and discussion of the manuscript content. SP contributed to the conduct of the study, analysis and interpretation of data, and discussion of the manuscript content.
Declaration of conflicting interests
CM and MK are employees of Bayer Vital GmbH (Leverkusen, Germany). UH is an employee of Bayer AG (Leverkusen, Germany). CM and UH are inventors for a patent application of the BAYathlon app on behalf of Bayer AG (not yet granted). MF has no relationships relevant to the content of this paper to disclose. FK is an employee of Bayer AG (Berlin, Germany). SM received consultancy fees from Bayer Vital GmbH. HM and IB have no relationships with industry that might pose a conflict of interest in connection with the submitted article. SP is Editor-in-Chief of
which is partly sponsored by Bayer Vital GmbH.
Ethical approval
The study protocol was approved by the Independent Ethics Committee (Ethik-Kommission der Ärztekammer Nordrhein, Düsseldorf; file reference: 2016/333). All participating patients provided signed informed consent. Written consent was provided by the patient for publication of the photograph shown in
.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by Bayer Vital GmbH, Leverkusen, Germany.
Guarantor
CM.
Data availability statement
The data underlying this article will be shared on reasonable request to the corresponding author.
Peer review
This manuscript was reviewed by reviewers who have chosen to remain anonymous.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
