Abstract
We studied digital stethoscope recordings in children undergoing simultaneous catheterization of the pulmonary artery (PA) to determine whether time-domain analysis of heart sound intensity would aid in the diagnosis of PA hypertension (PAH). Heart sounds were recorded and stored in .wav mono audio format. We performed recordings for 20 seconds with sampling frequencies of 4,000 Hz at the second left intercostal space and the cardiac apex. We used programs written in the MATLAB 2010b environment to analyze signals. We annotated events representing the first (S1) and second (S2) heart sounds and the aortic (A2) and pulmonary (P2) components of S2. We calculated the intensity (I) of the extracted event area (x) as
INTRODUCTION
Pulmonary artery hypertension (PAH) is a serious condition that imposes a global disease burden. If untreated, PAH has a high mortality, whether the cause of the disease is idiopathic, genetic mutation, or a complication of cardiac or pulmonary disease.1,2 PAH is often diagnosed late because early clinical recognition is difficult, even after the onset of symptoms. There is, therefore, a need to explore or reevaluate the clinical diagnosis of PAH.
The auscultatory and phonocardiographic indicators of PAH have been described well, with plausible biological explanations for the findings.3,4 Clinical indicators of PAH include increased loudness of the pulmonary component (P2) of the second heart sound (S2) and increased transmission of P2 to the cardiac apex. However, for the most part these descriptions predated the use of new digital stethoscopes, which are readily available and have the capability of recording and transmitting to a computer an acoustic tracing that can be optimized and analyzed later. There have been a few approaches to the noninvasive diagnosis of PAH combining phonocardiography and mathematical concepts. These investigators, for the most part, concentrated on the difficult task of identifying reliably and precisely S2, A2 (the aortic component of S2), P2, and the splitting interval between A2 and P2.5–8 We undertook a pilot study to characterize the acoustic recordings of the heart sounds in children with and without PAH. Our concern was not precise localization of S2, A2, and P2 but rather a general exploration of the extracted features of the heart sounds that could provide relevant diagnostic information.
Sound signals, such as the heart sounds, may be analyzed in two modes, either the time domain or the frequency domain. Time-domain analysis quantifies the signal behavior over time and is plotted as voltage, current, or, in the case of heart sounds, intensity against time (seconds). In frequency-domain analysis, the component frequencies are spread across a frequency spectrum and are represented as peaks in the frequency spectrogram in units of hertz. We have reported the characteristics of the heart sounds in the frequency domain in children with and without pulmonary hypertensive vascular disease. 9 We hypothesized that, using recordings from a digital stethoscope, we might extract features of the heart sounds in the time domain around A2 and P2 that might differ between subjects with and those without PAH, rather than localizing A2 and P2 precisely. Therefore, we sought to extract features of the heart sounds obtained by digital stethoscope recordings from subjects with and without PAH, in an attempt to correlate these features with the simultaneously and directly measured pulmonary artery (PA) pressure at cardiac catheterization.
METHODS
The Research Ethics Board of the University of Alberta approved the study. All subjects or their parents gave informed and written consent to participate in the study. Informed assent was obtained from children who had reached sufficient developmental ability.
Clinical data collection
We approached pediatric-aged subjects undergoing right heart cardiac catheterization that was required for management of their underlying condition for inclusion in the study. We excluded subjects with congenitally abnormal aortic, pulmonary, or prosthetic valves.
The heart sounds were recorded with a 3M Littmann 3200 digital stethoscope, which works in conjunction with Zargis Cardioscan software (Zargis Medical, Princeton, NJ) to store recorded heart sounds in .wav mono audio format. Heart sound recordings were obtained over 20 seconds, with sampling frequencies of 4,000 Hz. We recorded the heart sounds sequentially at the second left intercostal space (2nd LICS) and the cardiac apex for 20 seconds. We used the MATLAB 2010b (MathWorks, Natick, MA) programming environment to implement methods for signal analysis and optimization. Heart sounds were recorded simultaneously with the direct PA pressure measurements obtained during right heart catheterization in a standard manner. Other hemodynamic data, including heart rate, PA wedge pressure (PAWp) or left atrial pressure (LAp), right atrial pressure, oxygen consumption (
Annotation of S1 and S2
We demarcated events containing S1 (the first heart sound) and from those containing S2 by identifying, from the acoustic recordings, events that were separated by intervals compatible with the relative duration of systole and diastole. Three cardiologists, blinded to the subjects' PA pressure, identified the timing of S1 and S2 independently. They listened to acoustic recordings and marked S1 and S2 on the phonocardiographic tracings. The cardiologists' interpretation was in agreement with our observation that in all of the subjects studied, the duration of diastole was longer than that of systole. Thus, the events identified as S1 and S2 occurred such that the interval between S1 and S2 was shorter than the interval between S2 and S1. Within these demarcated events, we identified the maximal positive and negative normalized amplitudes and annotated these as reference points for S1 and S2 events (Fig. 1). It is important to note that we did not identify S1 and S2 as precise points in time but rather as sound events surrounding S1 and S2.

Annotation of S1 and S2. Normalized amplitudes (Y-axis) of surface acoustic recordings at the second left intercostal space (2LICS) and apex are plotted against time in seconds (X-axis). Black plus signs represent S1; red diamonds represent S2. PAp: pulmonary artery pressure; PH: pulmonary hypertension; S1: first heart sound; S2: second heart sound.
Annotation of A2 and P2
PAH is defined as an mPAp of at least 25 mmHg and a PAWp or LAp no higher than 15 mmHg, measured at heart catheterization in subjects at rest.11–13 Therefore, we divided the S1 and S2 events into groups, depending on whether the recording originated from subjects with mPAP of less than 25 mmHg or subjects with mPAp of at least 25 mmHg. In all subjects, the mean PAWp or LAp was less than 15 mmHg. In order to find a reference point for A2 and P2 events, we superimposed all the events containing S2 recorded from the 2nd LICS and the apex (Fig. 2). We chose time 0 to coincide with the normalized amplitude peak. In subjects with mPAp of less than 25 mmHg, we observed another event occurring after the main or reference peak. In subjects with mPAp of at least 25 mmHg, we observed a secondary normalized amplitude event occurring before the main or reference peak. Thus, in subjects with mPAp of less than 25 mmHg and on the basis of timing and the assumption that A2 occurs before P2, we annotated the reference peak as A2 and the following event as P2. In subjects with mPAp of at least 25 mmHg, on the basis of timing and the assumption that A2 occurs before P2 (none of the subjects had a left bundle branch block), we designated the reference peak as P2 and the preceding event as A2.

Analysis of A2 and P2 durations. The normalized amplitude (Y-axis) is plotted against time in seconds (X-axis). Time 0 is defined as the peak of S2. In subjects with mPAp < 25 mmHg, the taller peak is considered to be A2. In subjects with mPAp ≥ 25 mmHg, the taller peak is considered to be P2. a, Recordings from second left intercostal space (2nd LICS) in subjects with mPAp < 25 mmHg. b, Recordings from the apex in subjects with mPAp ≥ 25 mmHg. c, Recordings from the 2nd LICS in subjects with mPAp < 25 mmHg. d, Recordings from apex from subjects with mPAp ≥ 25 mmHg. A2: aortic component of S2; mPAp: mean pulmonary artery pressure; P2: pulmonary component of S2; S2: second heart sound. A color version of this figure is available online.
Intensity calculation of S2, A2, P2, and S1
By visual inspection of the superimposed S2 waves (Fig. 2), we observed that the duration of the events designated A2 and P2 was 0.04 seconds, that is, 0.02 seconds on either side of the respective event peaks or reference point. Therefore, the duration of A2 and P2 events was considered to be 0.04 seconds, and the total duration of S2 events was considered to be 0.08 seconds. Similarly, the S1 event was extracted from heart sound data within 0.04 seconds on either side of the reference point or peak amplitude of S1 events.
We calculated the intensity (I) of the extracted event area (x), as

Boxplot of 3 time-domain features calculated from auscultation at the 2nd LICS and apex. The ratios of the intensity of the heart sounds P2 ? A2 and P2 ? S2 can be used to discriminate between mean PAp ≥ 25 mmHg (P = .0001) at 2nd LICS and that at apex. S2 ? S1 ratios do not discriminate between subjects with and those without PAH. A2: aortic component of S2; PAH: pulmonary arterial hypertension; PAp: pulmonary artery pressure; P2: pulmonary component of S2; S1: first heart sound; S2: second heart sound; 2nd LICS: second left intercostal space.

Individual patients mean PAp (Y-axis) plotted against the median of the heart sound intensity ratios of P2 ? S2 and P2 ? A2 (X-axis). a, P2? S2 heart sound intensity ratio at the second left intercostal space (2L). b, P2? S2 heart sound intensity ratio at the apex. c, P2? A2 heart sound intensity ratio at 2L. d, P2? A2 heart sound ratio intensity at the apex. Plus signs indicate subjects with mean PAp < 25 mmHg. Diamonds indicate subjects with mean PAp ≥ 25 mmHg. The dot-dashed line represents the mean PAp of 25 mmHg. Values on or above this line indicate pulmonary artery hypertension, and values below it indicate normal PAp, according to the internationally recognized definition of pulmonary artery hypertension. The correlations of the heart sound intensity ratio at 2L for P2 ? A2 with mean PAp (c) were poor in comparison with those of the P2 ? A2 intensity ratio at the apex (d) and the S2 ? P2 intensity ratios at 2L (a) and the cardiac apex (b). A2: aortic component of S2; PAp: pulmonary artery pressure; P2: pulmonary component of S2; S2: second heart sound.

Intensity of P2 plotted against intensity of S2 or A2 for each heartbeat contained in each acoustic recording. a, Intensity of P2 against that of S2 recorded at the second left intercostal space (2L). b, Intensity of P2 against that of S2 recorded at the apex. c, Intensity of P2 against that of A2 at 2L. d, Intensity of P2 against that of A2 at the apex. Blue plus signs indicate heartbeats recorded in subjects with mean PAp < 25 mmHg. Red diamonds indicate heartbeats recorded in subjects with mean PAp ≥ 25 mmHg. A2: aortic component of S2; PAp: pulmonary artery pressure; P2: pulmonary component of S2; SE: sensitivity; SP: specificity; S2: second heart sound.
Statistical methods
The clinical and hemodynamic data from subjects with mPAp of less than 25 mmHg and mPAp of at least 25 mmHg were compared using the Mann-Whitney U test for two independent groups, since the data were not normally distributed. A P value of less than .05 was considered significant. We used Pearson's correlation coefficient to define the correlation between the heart sound intensity ratios P2 ? S2 or P2 ? A2 and the mPAp. We used Fisher's linear discriminant to test the separability of the extracted features between subjects with and those without PAH.
RESULTS
We collected recordings from 26 subjects. In 22 subjects, recordings were sufficiently free of background noise, artifacts, and low-amplitude signals to analyze at least one complete 20-second recording from either the 2nd LICS or the cardiac apex. Thus, we analyzed heart sound recordings from 22 children (9 males and 13 females). Twenty-two recordings obtained at the apex were suitable for analysis (11 subjects with mPAp of less than 25 mmHg and 11 subjects with mPAp of at least 25 mmHg). In 17 of the 22 subjects, the heart sound recordings from 2nd LICS were of sufficient quality to be analyzed. These included 10 subjects with PAH and 7 subjects with normal PA pressure. The clinical and hemodynamic details of the subjects are included in Tables 1–6. The only statistically significant differences between the two groups were hemodynamic measurements that reflected the presence or absence of PAH. Of note, there was no difference in the LAp (or PAWp) or QPI between the two groups. The two groups did not differ by age, weight, height, body surface area (BSA), or body mass index (BMI; Table 7).
Subjects 1–11, with pulmonary artery hypertension (mean PAp ≥ 25 mmHg)
Note: There were 5 male and 6 female subjects in this group. BMI: body mass index; BSA: body surface area; CDH: congenital diaphragmatic hernia; CHD: congenital heart disease; IPAH: idiopathic pulmonary artery hypertension; PAp: pulmonary artery pressure.
Subjects 12–22, with normal PAp (mean PAp < 25 mmHg)
Note: There were 4 male and 7 female subjects in this group. BMI: body mass index; BSA: body surface area; CHD: congenital heart disease; PAp: pulmonary artery pressure.
Pulmonary vascular hemodynamic data for subjects 1–11, with pulmonary artery hypertension (mean PAp ≥ 25 mmHg)
Note: LAp: left atrial pressure; PAp: pulmonary artery pressure; PAWp: pulmonary artery wedge pressure; PVRI: pulmonary vascular resistance index; QPI: pulmonary blood flow index; WU: Wood units.
PVRI calculated from pressure at the time of QPI measurement, not acoustic recording.
Pulmonary vascular hemodynamic data for subjects 12–22, with normal pulmonary artery pressure (mean PAp < 25 mmHg)
Note: LAp: left atrial pressure; NA: not available; PAp: pulmonary artery pressure; PAWp: pulmonary artery wedge pressure; PVRI: pulmonary vascular resistance index; QPI: pulmonary blood flow index; WU: Wood units.
PVRI calculated from pressure at the time of QPI measurement, not acoustic recording.
Systemic vascular hemodynamic and electrocardiographic data for subjects 1–11, with pulmonary artery hypertension (mean PAp ≥ 25 mmHg)
Note: BP: systemic blood pressure; PAp: pulmonary artery pressure; RAp: right atrial pressure.
Systemic vascular hemodynamic and electrocardiographic data for subjects 12–22, with normal PAp (mean PAp < 25 mmHg)
Note: BP: systemic blood pressure; PAp: pulmonary artery pressure; RAp: right atrial pressure.
Comparison of clinical and hemodynamic data between subjects with pulmonary artery hypertension (mean PAp ≥ 25 mmHg) and those with normal PAp (mean PAp < 25 mmHg)
Note: PAp: pulmonary artery pressure.
Significant result (P < .05).
Extraction of A2 and P2
We found that in subjects with normal PA pressure, the normalized amplitude of the recordings obtained from the cardiac apex position clearly differentiated a defined peak before a more diffuse and lower-amplitude peak, which we annotated as A2 and P2, respectively (Fig. 2). In contrast, in subjects with PAH, 2 peaks are seen. The earlier and lower-amplitude peak we annotated as A2, and the later and higher-amplitude signal we annotated as P2.
Comparison of the intensity of S2, A2, and P2
We constructed box plots of 3 time-domain features from both groups of subjects by comparing the ratios of the normalized intensity of the heart sounds S2 ? S1, P2 ? A2, and P2 ? S2. In Figure 3, it can be seen that S2 ? S1 discriminates less well between subjects with PAH and those with normal PA pressure, particularly at the apex. Statistically significant differences in the heart sound intensity ratios P2 ? A2 (P = .0001) and P2 ? S2 (P = .0001) are found between the two groups from sounds recorded at both the 2nd LICS and the apex.
In Figure 4, we have plotted the medians of the ratios of the intensities of the extracted features, P2 ? S2 and P2 ? A2, recorded from the 2nd LICS and the apex against mPAp. There is a linear correlation (r > 0.7) between the ratio P2 ? S2 and mPAp from recordings at the 2nd LICS and the apex. However, for the intensity ratio P2 ? A2, a linear correlation is seen only in the apical recordings. In Figure 5a and 5b, we have plotted the intensity of P2 against that of S2 for each heartbeat for all subjects at the 2nd LICS and the apex. In Figure 5c and 5d, we have plotted the intensity of P2 against that of A2 for each heart beat for all subjects at the 2nd LICS and the apex.
DISCUSSION
The main finding of our investigation is that we can discriminate between the intensity of heart sound events that we annotated as P2 and A2 in children with PAH and that in children without PAH in the recordings taken simultaneously with direct PA pressure measurements (Figs. 3, 5). We found that P2 events have increased intensity in children with PAH. In addition, in children with PAH, the increased intensity of P2 events is transmitted to the apex and overshadows A2 events, in contrast to children without PAH. Experienced clinicians diagnose PAH by auscultation if they hear a loud P2 at the 2nd LICS and hear the transmission of P2 to the apex, where usually only A2 is heard in the absence of PAH.3,4,14 These observations add credibility and biological plausibility to our findings in children and raise the potential for the interpretation of digitally recorded heart sounds by medical personnel without years of experience in auscultation.
We found that the heart sound intensity ratios P2 ? S2 and P2 ? A2 were useful for discriminating subjects with PAH. In our study, we were interested in discovering whether extracted sound features around S1 and S2 would be useful in the diagnosis of PAH and whether it would be possible to demarcate these events without a simultaneous ECG, echocardiogram, or pressure tracing on the same platform as the heart sound recordings. The finding that extracted features that make up S2 in the time domain distinguish subjects with PAH from those with normal PA pressure is encouraging and is further substantiated by the finding that the ratios of the intensity of the components that make up S2, that is, P2 and A2, are also discriminatory for PAH. The precise origin of the extracted features of S2, P2, and A2 cannot be identified in our study, but there is sufficient evidence to suggest that PAH may be identified quantitatively by S2.
We divided the subjects into two groups, based on whether mPAp was less than 25 mmHg or not. In addition, all of our subjects in both groups had an LAp or PAWp of less than 15 mmHg. This is in keeping with the current definition of and recommendations for the diagnosis of PAH.11,13,15 The only statistically significant differences in the hemodynamic measurements between the two groups were mPAp (55 vs. 15 mmHg; P < .001) and PVRI (10.7 vs. 2 Wood units/m 2 ; P < .001). This supports our finding that the S2 (A2 and P2) events were largely influenced by pulmonary vascular hemodynamic measurements; see Tables 3–7.
The QRS complex duration was shorter in subjects with PAH (77 vs. 103 ms; P = .02). The children without PAH had repaired congenital heart disease or had undergone heart transplants and may often have had a widened QRS postoperatively. A longer QRS or prolonged right-heart conduction time might influence the splitting interval between A2 and P2 but not the intensity of S2.
There were no statistically significant differences in age, body weight, BSA, or BMI between the two groups, suggesting that the differences in the intensity of P2 events were unrelated to age, body size, or habitus. However, further studies are needed to reproduce our findings in subjects with an increased BMI, as increased body fat or muscle may impair the transmission of heart sounds from body surface microphones.
In our cohort of children, we had 2 subjects with a mean PA pressure of 25 mmHg and 1 with a mean PA pressure of 24 mmHg. In Figure 4, the dashed line represents an mPAp of 25 mmHg, which demarcates normal from abnormal mPAp. It can be seen that the P2 ? S2 and P2 ? A2 intensity ratios are less discriminatory for mPAp close to 25 mmHg. Differentiating subjects with borderline signs of PAH will be challenging. Nevertheless, Figure 5 demonstrates that if all recorded heartbeats are included, then the estimation of the intensity of S2, A2, and P2 events reveals an easily visible separation between recordings from children with PAH and those from children without PAH. The sensitivity and specificity to discriminate between PAH and normal PA pressure are both high.
In contrast to a recent report, 16 we did not find that the intensity of the ratio S2 ? S1 discriminated between PAH and non-PAH. We found the intensity ratios of the heart sounds P2 ? S2 and P2 ? A2 to be more indicative of PAH. However, Chan et al. 16 used the term “complexity” of S1 and S2, which may differ from intensity. In addition, we studied children, who may have acoustic precordial characteristics different from those of adults.
We did not assess the effect of right versus left ventricular function or right-left ventricular interactions in PAH on the acoustic recordings. One of the aims of our study was to determine whether one could separate A2 from P2 and S1 from S2 without a simultaneous electrocardiographic or pressure tracing on the same platform as the heart sound recording. We did, however, measure the PA pressure simultaneously with the acoustic recordings.
The fidelity of the recordings at the cardiac apex was sufficient for analysis in only 22 of 26 subjects and that at the 2nd LICS in 17 of those 22. This drawback will have to be resolved in future studies.
We made assumptions about S1 and S2 that, although biologically plausible and consistent with the known descriptions of S1 and S2, remain to be validated. Future studies may improve the accuracy by including simultaneous recordings of pressure, ECG, and echocardiogram on the same platform as the acoustic recordings.
Conclusion. The main finding of our investigation is that there is a clear separation between the intensities of P2 and A2 in children with PAH and those in children without PAH in acoustic recordings collected with a handheld digital stethoscope simultaneously with direct PA pressure measurements. The ratios of the heart sound intensities P2 ? A2 and P2 ? S2 discriminate between subjects with and those without PAH. These findings may be of value in the development of an acoustic device to diagnose PAH with more accuracy than conventional auscultation.
