Abstract
We studied digital stethoscope recordings in children undergoing simultaneous catheterization of the pulmonary artery (PA) to determine whether time-domain analysis of heart sound intensity would aid in the diagnosis of PA hypertension (PAH). Heart sounds were recorded and stored in .wav mono audio format. We performed recordings for 20 seconds with sampling frequencies of 4,000 Hz at the second left intercostal space and the cardiac apex. We used programs written in the MATLAB 2010b environment to analyze signals. We annotated events representing the first (S1) and second (S2) heart sounds and the aortic (A2) and pulmonary (P2) components of S2. We calculated the intensity (
INTRODUCTION
Pulmonary artery hypertension (PAH) is a serious condition that imposes a global disease burden. If untreated, PAH has a high mortality, whether the cause of the disease is idiopathic, genetic mutation, or a complication of cardiac or pulmonary disease.1,2 PAH is often diagnosed late because early clinical recognition is difficult, even after the onset of symptoms. There is, therefore, a need to explore or reevaluate the clinical diagnosis of PAH.
The auscultatory and phonocardiographic indicators of PAH have been described well, with plausible biological explanations for the findings.3,4 Clinical indicators of PAH include increased loudness of the pulmonary component (P2) of the second heart sound (S2) and increased transmission of P2 to the cardiac apex. However, for the most part these descriptions predated the use of new digital stethoscopes, which are readily available and have the capability of recording and transmitting to a computer an acoustic tracing that can be optimized and analyzed later. There have been a few approaches to the noninvasive diagnosis of PAH combining phonocardiography and mathematical concepts. These investigators, for the most part, concentrated on the difficult task of identifying reliably and precisely S2, A2 (the aortic component of S2), P2, and the splitting interval between A2 and P2.5–8 We undertook a pilot study to characterize the acoustic recordings of the heart sounds in children with and without PAH. Our concern was not precise localization of S2, A2, and P2 but rather a general exploration of the extracted features of the heart sounds that could provide relevant diagnostic information.
Sound signals, such as the heart sounds, may be analyzed in two modes, either the time domain or the frequency domain. Time-domain analysis quantifies the signal behavior over time and is plotted as voltage, current, or, in the case of heart sounds, intensity against time (seconds). In frequency-domain analysis, the component frequencies are spread across a frequency spectrum and are represented as peaks in the frequency spectrogram in units of hertz. We have reported the characteristics of the heart sounds in the frequency domain in children with and without pulmonary hypertensive vascular disease. 9 We hypothesized that, using recordings from a digital stethoscope, we might extract features of the heart sounds in the time domain around A2 and P2 that might differ between subjects with and those without PAH, rather than localizing A2 and P2 precisely. Therefore, we sought to extract features of the heart sounds obtained by digital stethoscope recordings from subjects with and without PAH, in an attempt to correlate these features with the simultaneously and directly measured pulmonary artery (PA) pressure at cardiac catheterization.
METHODS
The Research Ethics Board of the University of Alberta approved the study. All subjects or their parents gave informed and written consent to participate in the study. Informed assent was obtained from children who had reached sufficient developmental ability.
Clinical data collection
We approached pediatric-aged subjects undergoing right heart cardiac catheterization that was required for management of their underlying condition for inclusion in the study. We excluded subjects with congenitally abnormal aortic, pulmonary, or prosthetic valves.
The heart sounds were recorded with a 3M Littmann 3200 digital stethoscope, which works in conjunction with Zargis Cardioscan software (Zargis Medical, Princeton, NJ) to store recorded heart sounds in .wav mono audio format. Heart sound recordings were obtained over 20 seconds, with sampling frequencies of 4,000 Hz. We recorded the heart sounds sequentially at the second left intercostal space (2nd LICS) and the cardiac apex for 20 seconds. We used the MATLAB 2010b (MathWorks, Natick, MA) programming environment to implement methods for signal analysis and optimization. Heart sounds were recorded simultaneously with the direct PA pressure measurements obtained during right heart catheterization in a standard manner. Other hemodynamic data, including heart rate, PA wedge pressure (PAWp) or left atrial pressure (LAp), right atrial pressure, oxygen consumption (
Annotation of S1 and S2
We demarcated events containing S1 (the first heart sound) and from those containing S2 by identifying, from the acoustic recordings, events that were separated by intervals compatible with the relative duration of systole and diastole. Three cardiologists, blinded to the subjects' PA pressure, identified the timing of S1 and S2 independently. They listened to acoustic recordings and marked S1 and S2 on the phonocardiographic tracings. The cardiologists' interpretation was in agreement with our observation that in all of the subjects studied, the duration of diastole was longer than that of systole. Thus, the events identified as S1 and S2 occurred such that the interval between S1 and S2 was shorter than the interval between S2 and S1. Within these demarcated events, we identified the maximal positive and negative normalized amplitudes and annotated these as reference points for S1 and S2 events (Fig. 1). It is important to note that we did not identify S1 and S2 as precise points in time but rather as sound events surrounding S1 and S2.

Annotation of S1 and S2. Normalized amplitudes (
Annotation of A2 and P2
PAH is defined as an mPAp of at least 25 mmHg and a PAWp or LAp no higher than 15 mmHg, measured at heart catheterization in subjects at rest.11–13 Therefore, we divided the S1 and S2 events into groups, depending on whether the recording originated from subjects with mPAP of less than 25 mmHg or subjects with mPAp of at least 25 mmHg. In all subjects, the mean PAWp or LAp was less than 15 mmHg. In order to find a reference point for A2 and P2 events, we superimposed all the events containing S2 recorded from the 2nd LICS and the apex (Fig. 2). We chose time 0 to coincide with the normalized amplitude peak. In subjects with mPAp of less than 25 mmHg, we observed another event occurring after the main or reference peak. In subjects with mPAp of at least 25 mmHg, we observed a secondary normalized amplitude event occurring before the main or reference peak. Thus, in subjects with mPAp of less than 25 mmHg and on the basis of timing and the assumption that A2 occurs before P2, we annotated the reference peak as A2 and the following event as P2. In subjects with mPAp of at least 25 mmHg, on the basis of timing and the assumption that A2 occurs before P2 (none of the subjects had a left bundle branch block), we designated the reference peak as P2 and the preceding event as A2.

Analysis of A2 and P2 durations. The normalized amplitude (
Intensity calculation of S2, A2, P2, and S1
By visual inspection of the superimposed S2 waves (Fig. 2), we observed that the duration of the events designated A2 and P2 was 0.04 seconds, that is, 0.02 seconds on either side of the respective event peaks or reference point. Therefore, the duration of A2 and P2 events was considered to be 0.04 seconds, and the total duration of S2 events was considered to be 0.08 seconds. Similarly, the S1 event was extracted from heart sound data within 0.04 seconds on either side of the reference point or peak amplitude of S1 events.
We calculated the intensity (

Boxplot of 3 time-domain features calculated from auscultation at the 2nd LICS and apex. The ratios of the intensity of the heart sounds P2 ? A2 and P2 ? S2 can be used to discriminate between mean PAp ≥ 25 mmHg (

Individual patients mean PAp (

Intensity of P2 plotted against intensity of S2 or A2 for each heartbeat contained in each acoustic recording.
Statistical methods
The clinical and hemodynamic data from subjects with mPAp of less than 25 mmHg and mPAp of at least 25 mmHg were compared using the Mann-Whitney
RESULTS
We collected recordings from 26 subjects. In 22 subjects, recordings were sufficiently free of background noise, artifacts, and low-amplitude signals to analyze at least one complete 20-second recording from either the 2nd LICS or the cardiac apex. Thus, we analyzed heart sound recordings from 22 children (9 males and 13 females). Twenty-two recordings obtained at the apex were suitable for analysis (11 subjects with mPAp of less than 25 mmHg and 11 subjects with mPAp of at least 25 mmHg). In 17 of the 22 subjects, the heart sound recordings from 2nd LICS were of sufficient quality to be analyzed. These included 10 subjects with PAH and 7 subjects with normal PA pressure. The clinical and hemodynamic details of the subjects are included in Tables 1–6. The only statistically significant differences between the two groups were hemodynamic measurements that reflected the presence or absence of PAH. Of note, there was no difference in the LAp (or PAWp) or QPI between the two groups. The two groups did not differ by age, weight, height, body surface area (BSA), or body mass index (BMI; Table 7).
Subjects 1–11, with pulmonary artery hypertension (mean PAp ≥ 25 mmHg)
Note: There were 5 male and 6 female subjects in this group. BMI: body mass index; BSA: body surface area; CDH: congenital diaphragmatic hernia; CHD: congenital heart disease; IPAH: idiopathic pulmonary artery hypertension; PAp: pulmonary artery pressure.
Subjects 12–22, with normal PAp (mean PAp < 25 mmHg)
Note: There were 4 male and 7 female subjects in this group. BMI: body mass index; BSA: body surface area; CHD: congenital heart disease; PAp: pulmonary artery pressure.
Pulmonary vascular hemodynamic data for subjects 1–11, with pulmonary artery hypertension (mean PAp ≥ 25 mmHg)
Note: LAp: left atrial pressure; PAp: pulmonary artery pressure; PAWp: pulmonary artery wedge pressure; PVRI: pulmonary vascular resistance index; QPI: pulmonary blood flow index; WU: Wood units.
PVRI calculated from pressure at the time of QPI measurement, not acoustic recording.
Pulmonary vascular hemodynamic data for subjects 12–22, with normal pulmonary artery pressure (mean PAp < 25 mmHg)
Note: LAp: left atrial pressure; NA: not available; PAp: pulmonary artery pressure; PAWp: pulmonary artery wedge pressure; PVRI: pulmonary vascular resistance index; QPI: pulmonary blood flow index; WU: Wood units.
PVRI calculated from pressure at the time of QPI measurement, not acoustic recording.
Systemic vascular hemodynamic and electrocardiographic data for subjects 1–11, with pulmonary artery hypertension (mean PAp ≥ 25 mmHg)
Note: BP: systemic blood pressure; PAp: pulmonary artery pressure; RAp: right atrial pressure.
Systemic vascular hemodynamic and electrocardiographic data for subjects 12–22, with normal PAp (mean PAp < 25 mmHg)
Note: BP: systemic blood pressure; PAp: pulmonary artery pressure; RAp: right atrial pressure.
Comparison of clinical and hemodynamic data between subjects with pulmonary artery hypertension (mean PAp ≥ 25 mmHg) and those with normal PAp (mean PAp < 25 mmHg)
Note: PAp: pulmonary artery pressure.
Significant result (
Extraction of A2 and P2
We found that in subjects with normal PA pressure, the normalized amplitude of the recordings obtained from the cardiac apex position clearly differentiated a defined peak before a more diffuse and lower-amplitude peak, which we annotated as A2 and P2, respectively (Fig. 2). In contrast, in subjects with PAH, 2 peaks are seen. The earlier and lower-amplitude peak we annotated as A2, and the later and higher-amplitude signal we annotated as P2.
Comparison of the intensity of S2, A2, and P2
We constructed box plots of 3 time-domain features from both groups of subjects by comparing the ratios of the normalized intensity of the heart sounds S2 ? S1, P2 ? A2, and P2 ? S2. In Figure 3, it can be seen that S2 ? S1 discriminates less well between subjects with PAH and those with normal PA pressure, particularly at the apex. Statistically significant differences in the heart sound intensity ratios P2 ? A2 (
In Figure 4, we have plotted the medians of the ratios of the intensities of the extracted features, P2 ? S2 and P2 ? A2, recorded from the 2nd LICS and the apex against mPAp. There is a linear correlation (
DISCUSSION
The main finding of our investigation is that we can discriminate between the intensity of heart sound events that we annotated as P2 and A2 in children with PAH and that in children without PAH in the recordings taken simultaneously with direct PA pressure measurements (Figs. 3, 5). We found that P2 events have increased intensity in children with PAH. In addition, in children with PAH, the increased intensity of P2 events is transmitted to the apex and overshadows A2 events, in contrast to children without PAH. Experienced clinicians diagnose PAH by auscultation if they hear a loud P2 at the 2nd LICS and hear the transmission of P2 to the apex, where usually only A2 is heard in the absence of PAH.3,4,14 These observations add credibility and biological plausibility to our findings in children and raise the potential for the interpretation of digitally recorded heart sounds by medical personnel without years of experience in auscultation.
We found that the heart sound intensity ratios P2 ? S2 and P2 ? A2 were useful for discriminating subjects with PAH. In our study, we were interested in discovering whether extracted sound features around S1 and S2 would be useful in the diagnosis of PAH and whether it would be possible to demarcate these events without a simultaneous ECG, echocardiogram, or pressure tracing on the same platform as the heart sound recordings. The finding that extracted features that make up S2 in the time domain distinguish subjects with PAH from those with normal PA pressure is encouraging and is further substantiated by the finding that the ratios of the intensity of the components that make up S2, that is, P2 and A2, are also discriminatory for PAH. The precise origin of the extracted features of S2, P2, and A2 cannot be identified in our study, but there is sufficient evidence to suggest that PAH may be identified quantitatively by S2.
We divided the subjects into two groups, based on whether mPAp was less than 25 mmHg or not. In addition, all of our subjects in both groups had an LAp or PAWp of less than 15 mmHg. This is in keeping with the current definition of and recommendations for the diagnosis of PAH.11,13,15 The only statistically significant differences in the hemodynamic measurements between the two groups were mPAp (55 vs. 15 mmHg;
The QRS complex duration was shorter in subjects with PAH (77 vs. 103 ms;
There were no statistically significant differences in age, body weight, BSA, or BMI between the two groups, suggesting that the differences in the intensity of P2 events were unrelated to age, body size, or habitus. However, further studies are needed to reproduce our findings in subjects with an increased BMI, as increased body fat or muscle may impair the transmission of heart sounds from body surface microphones.
In our cohort of children, we had 2 subjects with a mean PA pressure of 25 mmHg and 1 with a mean PA pressure of 24 mmHg. In Figure 4, the dashed line represents an mPAp of 25 mmHg, which demarcates normal from abnormal mPAp. It can be seen that the P2 ? S2 and P2 ? A2 intensity ratios are less discriminatory for mPAp close to 25 mmHg. Differentiating subjects with borderline signs of PAH will be challenging. Nevertheless, Figure 5 demonstrates that if all recorded heartbeats are included, then the estimation of the intensity of S2, A2, and P2 events reveals an easily visible separation between recordings from children with PAH and those from children without PAH. The sensitivity and specificity to discriminate between PAH and normal PA pressure are both high.
In contrast to a recent report, 16 we did not find that the intensity of the ratio S2 ? S1 discriminated between PAH and non-PAH. We found the intensity ratios of the heart sounds P2 ? S2 and P2 ? A2 to be more indicative of PAH. However, Chan et al. 16 used the term “complexity” of S1 and S2, which may differ from intensity. In addition, we studied children, who may have acoustic precordial characteristics different from those of adults.
We did not assess the effect of right versus left ventricular function or right-left ventricular interactions in PAH on the acoustic recordings. One of the aims of our study was to determine whether one could separate A2 from P2 and S1 from S2 without a simultaneous electrocardiographic or pressure tracing on the same platform as the heart sound recording. We did, however, measure the PA pressure simultaneously with the acoustic recordings.
The fidelity of the recordings at the cardiac apex was sufficient for analysis in only 22 of 26 subjects and that at the 2nd LICS in 17 of those 22. This drawback will have to be resolved in future studies.
We made assumptions about S1 and S2 that, although biologically plausible and consistent with the known descriptions of S1 and S2, remain to be validated. Future studies may improve the accuracy by including simultaneous recordings of pressure, ECG, and echocardiogram on the same platform as the acoustic recordings.
