
Abstract

Background: Chronic venous insufficiency (CVI) can be evaluated using duplex ultrasonography (US) and air plethysmography (PG), yet their comparative performance remains unclear. Methods: We retrospectively identified patients who underwent venous insufficiency US and PG within a 90-day interval from March 1, 2015 through July 31, 2024. CVI severity was classified by the clinical (C) component of the Clinical, Etiologic, Anatomic, and Pathophysiologic (CEAP) classification system and categorized as mild (C0–2), moderate (C3), or severe (C4–6). PG and insufficiency US findings were compared across categories, and machine learning models were trained to predict severe CVI. Results: We analyzed 1478 limbs from 839 patients, who were predominantly women (62%) with a mean age of 61 years (± 14 years). Severe CVI was present in 32.3% of limbs. US detected venous incompetence in 67.4% of limbs overall and in 75.3% of severe CVI limbs. Abnormal PG findings occurred in 72.9% of limbs overall and in 88.5% of severe CVI limbs. In machine learning models, PG parameters outperformed US parameters in predicting severe CVI: the top PG model (multilayer perceptron [MLP]) achieved an area under the receiver operating characteristic curve (AUROC) of 0.82, versus 0.65 for the best US model (gradient boosting). Limiting PG data to incompetence and obstruction parameters modestly reduced performance, which remained higher than that of US (logistic regression AUROC 0.72). Conclusion: Our findings indicate that PG assessment in CVI offers superior performance compared to US. Overall, these results validate the quantitative whole-limb hemodynamic approach using PG, which provides a more complete understanding of the pathology behind CVI.

Keywords

air plethysmography, chronic venous insufficiency, duplex ultrasound, vascular imaging/diagnostics

Background

Chronic venous insufficiency (CVI) is a prevalent vascular disorder associated with substantial healthcare costs and reduced quality of life.¹ Conventional clinical evaluation, although valuable for initial assessment, primarily yields qualitative data that may not reliably confirm the diagnosis or characterize disease severity. Quantitative assessment of venous reflux is essential for accurate diagnosis and effective management.

Ambulatory venous pressure (AVP) reflects the combined influence of hemodynamic factors in CVI, such as valvular reflux and calf muscle pump function. Although previously considered the gold standard for quantitative evaluation, its clinical use is limited by its invasive nature.² Duplex ultrasonography (DUS) and air plethysmography (PG) are noninvasive methods for evaluating CVI. DUS, introduced in the early 1980s for deep vein thrombosis diagnosis, is now also employed to assess reflux and its anatomical location and severity.³⁻⁶ Introduced in the early 1960s, PG was initially used to measure relative lower-limb volume changes during postural shifts and muscular activity.⁷ It provides a quantitative assessment of reflux and is valuable for monitoring surgical outcomes.⁸,⁹ PG has been widely applied in evaluating CVI, particularly calf muscle pump function and venous reflux.¹⁰⁻¹²

There is limited evidence directly comparing the diagnostic accuracy of US and PG in CVI. This study aimed to evaluate the accuracy of US and PG as initial screening tools at the two extremes of venous function defined by Clinical, Etiologic, Anatomic, and Pathophysiologic (CEAP) classification criteria.

Methods

Study design and patients

The Mayo Clinic Gonda Vascular Laboratory database was queried for patients aged 18 years or older with complete venous physiological testing by PG from March 1, 2015 through July 31, 2024. For patients with more than one study, the initial study was used for this analysis. Next, insufficiency US studies were identified if performed within 90 days before or after a PG study. Data from the insufficiency US were extracted, and right and left leg findings were merged with the appropriate limb findings from the PG study. For patients with unilateral insufficiency US, the findings from PG on the contralateral limb could not be used and were discarded. The Mayo Clinic institutional review board reviewed and approved this study.
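The pairing rules described above can be sketched in a few lines of Python. This is only an illustration, not the authors' extraction code: the record layout (`patient`, `date`, `limbs`) and the function name `pair_limbs` are hypothetical, and choosing the US study closest in time when several fall inside the window is an assumption the text does not state.

```python
from datetime import date

WINDOW_DAYS = 90  # from the text: US within 90 days before or after the PG study


def pair_limbs(pg_studies, us_studies):
    """Return one merged record per limb that has both PG and US findings.

    Both inputs are lists of dicts with the hypothetical keys
    "patient", "date", and "limbs" (a dict keyed by side, "L" or "R").
    """
    # Keep only the initial (earliest) PG study for each patient.
    first_pg = {}
    for pg in sorted(pg_studies, key=lambda s: s["date"]):
        first_pg.setdefault(pg["patient"], pg)

    merged = []
    for patient, pg in first_pg.items():
        # US studies for this patient that fall inside the 90-day window.
        candidates = [
            us for us in us_studies
            if us["patient"] == patient
            and abs((us["date"] - pg["date"]).days) <= WINDOW_DAYS
        ]
        if not candidates:
            continue
        # Assumption: if several US studies qualify, use the closest in time.
        us = min(candidates, key=lambda s: abs((s["date"] - pg["date"]).days))
        # Only limbs present in both studies are kept, so the contralateral
        # PG limb of a unilateral US study is discarded.
        for side in sorted(pg["limbs"].keys() & us["limbs"].keys()):
            merged.append({
                "patient": patient,
                "side": side,
                "pg": pg["limbs"][side],
                "us": us["limbs"][side],
                "gap_days": abs((us["date"] - pg["date"]).days),
            })
    return merged


# Made-up example records (not study data).
pg = [
    {"patient": 1, "date": date(2020, 1, 10), "limbs": {"L": "pg-L", "R": "pg-R"}},
    {"patient": 1, "date": date(2021, 5, 1), "limbs": {"L": "pg-L2", "R": "pg-R2"}},
    {"patient": 2, "date": date(2020, 3, 1), "limbs": {"L": "pg-L", "R": "pg-R"}},
]
us = [
    # Unilateral US study 5 days after the first PG study.
    {"patient": 1, "date": date(2020, 1, 15), "limbs": {"L": "us-L"}},
    # Bilateral US study outside the 90-day window.
    {"patient": 2, "date": date(2020, 9, 1), "limbs": {"L": "us-L", "R": "us-R"}},
]
rows = pair_limbs(pg, us)
```

In this toy example only the left limb of patient 1 survives: the right limb has no US data, and the US study of patient 2 falls outside the window.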

Insufficiency ultrasound

Patients with suspected venous insufficiency were first imaged for deep venous thrombus from the common femoral to the popliteal vein level. Evaluation for insufficiency was performed with the patient in the upright position at a 50–60° angle or greater with the leg externally rotated. The common femoral vein (CFV), femoral vein (FV), and popliteal vein (PV) were evaluated with spectral Doppler and distal augmentation for reflux. Deep veins are considered competent if the reflux duration is < 1.0 second. Mild to moderate deep incompetence in our practice is considered to be 1.0–3.0 seconds, with mild incompetence having a lower amplitude than moderate incompetence. Severe deep incompetence is defined as reflux > 3.0 seconds.

The superficial venous system was evaluated, including the saphenofemoral junction (SFJ), anterior accessory saphenous vein (AASV), posterior accessory vein if associated with a varix, great saphenous vein (GSV; upper thigh, at the knee and in the calf), saphenopopliteal junction (SPJ; if present), and small saphenous vein (SSV; at the midcalf). When varices are noted, insufficiency is evaluated from the originating vein, and additional sites of testing for insufficiency may be added. Superficial veins are considered competent when reflux duration is < 0.5 seconds. The degree of incompetence is determined based on the duration of reflux, and when borderline, it can be adjusted into a category based on amplitude. Mild incompetence is defined as 0.5–1.0 second of reflux, moderate incompetence as 1.0–3.0 seconds of reflux, and severe incompetence as > 3.0 seconds of reflux.
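The duration thresholds above can be written as two small grading functions. This is a minimal sketch of the stated cutoffs only: the function names are hypothetical, the amplitude-based adjustment is not quantified in the text and is therefore left out, and assigning a reflux of exactly 1.0 second to the moderate superficial category is an assumption, since the stated ranges share that boundary.

```python
def grade_deep(reflux_s):
    """Grade a deep vein segment from reflux duration in seconds."""
    if reflux_s < 1.0:
        return "competent"
    if reflux_s <= 3.0:
        # The text separates mild from moderate by amplitude, which is
        # not quantified, so the two are not distinguished here.
        return "mild-moderate"
    return "severe"


def grade_superficial(reflux_s):
    """Grade a superficial vein segment from reflux duration in seconds."""
    if reflux_s < 0.5:
        return "competent"
    if reflux_s < 1.0:  # assumption: exactly 1.0 s is graded as moderate
        return "mild"
    if reflux_s <= 3.0:
        return "moderate"
    return "severe"


deep_example = [grade_deep(t) for t in (0.8, 2.0, 3.5)]
superficial_example = [grade_superficial(t) for t in (0.4, 0.7, 2.0, 4.0)]
```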

Plethysmography

Venous hemodynamics (obstruction, incompetence, calf pump function [CPF], and postexercise venous refilling time [P-EVRT]) were assessed in ambulatory outpatients by standard PG methods (Supplemental material) using the VenView chair (AdvanzeCardio, Fountain Hills, AZ, USA), as previously described.¹³⁻¹⁵ Briefly, testing included three different phases: venous outflow to assess venous obstruction, passive drainage and refilling (PDR) to assess and grade valvular incompetence, and then ankle flexes followed by passive refilling to calculate CPF and P-EVRT. CPF was measured as an ejection fraction (EF) and was analyzed as a continuous variable. Venous outflow was classified as obstructed or patent, and venous incompetence per extremity was categorized into normal, mild, moderate, or severe, based on flow and volume, using established laboratory criteria.¹² P-EVRT represents the time taken for venous refilling following exercise-induced emptying; it is measured in seconds and treated as a continuous variable for analysis.¹⁶

CEAP classification

This system evaluates CVI based on clinical presentation, etiology, anatomy, and pathophysiology.¹⁷ The clinical classification was performed and/or supervised by a certified vascular technician at the time of the PG study for each limb and classified into seven classes: C0 (no signs of venous disease), C1 (telangiectasias or reticular veins), C2 (visible varicose veins), C3 (edema), C4 (A: hyperpigmentation, B: lipodermatosclerosis, or atrophie blanche), C5 (healed venous leg ulcer), and C6 (active venous leg ulcer). For this study, the maximum clinical (C) class from each limb was categorized into three groups: C0–C2 was categorized as mild disease, C3 as moderate disease, and C4–6 as severe disease.
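The grouping of C classes into the three study categories amounts to a simple lookup, sketched below. The function names are hypothetical; the mapping itself (C0–2 mild, C3 moderate, C4–6 severe, applied to the maximum class per limb) is taken from the text.

```python
def severity_group(c_class):
    """Map a CEAP clinical class (0 to 6) to the study's severity group."""
    if not 0 <= c_class <= 6:
        raise ValueError("CEAP clinical class must be between 0 and 6")
    if c_class <= 2:
        return "mild"
    if c_class == 3:
        return "moderate"
    return "severe"


def limb_severity(c_classes):
    """Classify a limb by the maximum clinical class recorded for it."""
    return severity_group(max(c_classes))


# A limb with varicose veins (C2), edema (C3), and a healed ulcer (C5).
example_group = limb_severity([2, 3, 5])
```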

Statistical analysis

As each limb contained independent PG, venous insufficiency US, and CEAP categories, the analysis was performed per limb. The primary outcome of interest was the ability of venous insufficiency US parameters compared to PG parameters to predict severe CVI (C4–6). Descriptive characteristics were presented as means and SDs for continuous variables, and numbers and percentages for categorical variables. Pearson’s chi-squared test was used to evaluate the relationship between categorical variables. The Kolmogorov–Smirnov test was used to test the distribution of continuous variables. For comparisons involving more than two groups, one-way ANOVA was used for normally distributed variables, and the Kruskal–Wallis test was applied otherwise. All statistical tests were two-sided, with significance defined as p < 0.05. Statistical analyses were performed using Python with the SciPy¹⁶ and statsmodels libraries.¹⁸
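As an illustration of the categorical comparison, the sketch below computes Pearson's chi-squared statistic by hand for the limb-side counts in Table 1. It is not the authors' code, which used SciPy and statsmodels; the function name is hypothetical. With two degrees of freedom the chi-squared survival function reduces to exp(−x/2), so no statistics library is needed here, and the resulting p-value of about 0.89 is in line with the value reported for limb side.

```python
import math


def chi2_independence(table):
    """Pearson chi-squared statistic and degrees of freedom for a 2D table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


# Limb side by CVI severity, counts from Table 1.
# Rows: left, right. Columns: severe, moderate, mild.
side_by_severity = [
    [244, 256, 253],
    [233, 240, 252],
]
stat, dof = chi2_independence(side_by_severity)

# Valid only for dof == 2: the chi-squared survival function is exp(-x/2).
p_value = math.exp(-stat / 2)
```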

Machine learning analyses

Parameters from venous insufficiency US and PG were analyzed using multiple machine learning models to predict the presence of severe CVI (C4–6) as compared to mild CVI (C0–2). Calf ejection fraction (continuous), P-EVRT (continuous), obstruction (present or absent), and category of incompetence (none, mild, moderate, severe) were used for the PG data, and classification of incompetence (none, mild, moderate, severe) was used per deep and superficial vein segment for venous insufficiency US data inputs. Multiple machine learning models were fitted to account for potential differences in the underlying structure of venous insufficiency US versus PG data, with the two most successful models reported alongside traditional logistic regression models. To address class imbalance, the majority class was randomly down-sampled without replacement to match the number of observations in the minority class. The dataset was then partitioned into training and testing subsets using an 80:20 split. Multiple classification algorithms were implemented using Python (version 3.10) and the scikit-learn library (version 1.6.1),¹⁸ including random forest, support vector classifier (SVC), AdaBoost, k-nearest neighbors (KNN), logistic regression, gradient boosting, decision tree, and multilayer perceptron (MLP). XGBoost (version 3.0.2)¹⁹ was also included as a high-performance gradient boosting framework. Each model (for US or PG parameters) was trained on the training set and evaluated on the test set using accuracy, precision, recall, F1-score, and area under the receiver operating characteristic curve (AUROC). Confusion matrices were computed to derive true/false positive and negative rates. For models lacking native probability outputs, decision function scores were normalized to estimate class probabilities for AUROC analysis.
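The down-sampling and 80:20 partition can be sketched with the standard library alone. This is an illustration under stated assumptions rather than the authors' pipeline, which used scikit-learn: the function name, the seed, and the simple unstratified shuffle are all assumptions. Starting from the 477 severe and 505 mild limbs in Table 1, the sketch produces 763 training and 191 testing observations, which coincides with the sizes reported for the US models in Table 4.

```python
import math
import random


def downsample_and_split(labels, test_fraction=0.2, seed=0):
    """Balance classes by random down-sampling, then split train/test.

    Returns two lists of indices into `labels` (training, testing).
    """
    rng = random.Random(seed)  # hypothetical seed, for reproducibility only

    # Group observation indices by class label.
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    # Down-sample every class to the minority-class size, without replacement.
    n_min = min(len(idxs) for idxs in by_class.values())
    balanced = []
    for idxs in by_class.values():
        balanced.extend(rng.sample(idxs, n_min))

    # Shuffle, then hold out the test fraction (rounded up).
    rng.shuffle(balanced)
    n_test = math.ceil(test_fraction * len(balanced))
    return balanced[n_test:], balanced[:n_test]


# 1 = severe CVI (C4-6), 0 = mild CVI (C0-2); counts from Table 1.
labels = [1] * 477 + [0] * 505
train_idx, test_idx = downsample_and_split(labels)
```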

Results

A total of 1478 limbs in 839 patients had both venous insufficiency US and PG measurements and were included in the analysis. Most patients were women (62%) and White (91%), and the mean age was 61 years (SD 14.9). A total of 753 left and 725 right limbs were analyzed. Most patients (80%) underwent venous insufficiency US and PG within 10 days of each other. The clinical severity of CVI by the CEAP classification was severe in 477 limbs (32.3%), moderate in 496 limbs (33.6%), and mild in 505 limbs (34.2%) (Table 1). There was no difference in the severity of CVI by limb side (p = 0.89). Limbs with severe CVI had a higher mean ankle circumference than those with mild CVI (259 ± 35 vs 232 ± 21 mm; p < 0.001).

Table 1.

Demographics, plethysmography, and venous insufficiency ultrasound findings by severity of chronic venous insufficiency.

Characteristics		Severe CVI (C4–6)	Moderate CVI (C3)	Mild CVI (C0–2)	p-value
	N	n = 477	n = 496	n = 505
Age (years), mean (SD)	1478	66.2 (13.9)	61.0 (14.3)	56.3 (14.8)	< 0.001
Ankle circumference (mm), mean (SD)	1443	259 (35)	259 (32)	232 (21)	< 0.001
Leg side, n (%)
Left	753	244 (51.2)	256 (51.6)	253 (50.1)	0.89
Right	725	233 (48.8)	240 (48.4)	252 (49.9)
Female sex, n (%)	910	230 (48.2)	349 (70.4)	331 (65.5)	< 0.001
Race, n (%)
Other	130	39 (8.2)	39 (7.9)	52 (10.3)	0.34
White	1348	438 (91.8)	457 (92.1)	453 (89.7)
Major testing findings, n (%)
Any incompetence by US	997	359 (75.3)	321 (64.7)	317 (62.8)	< 0.001
Deep incompetence by US	326	159 (33.3)	98 (19.8)	69 (13.7)	< 0.001
Superficial incompetence by US	906	317 (66.5)	291 (58.7)	298 (59.0)	0.02
History of phlebitis by US	149	86 (18.0)	39 (7.9)	24 (4.8)	< 0.001
Abnormal PG study	1078	422 (88.5)	388 (78.2)	268 (53.1)	< 0.001

CVI, chronic venous insufficiency; PG, plethysmography; US, ultrasound.

Plethysmography data

An abnormal PG study (abnormal obstruction, incompetence, CPF, or P-EVRT findings) was observed in 1078 limbs (72.9% of the total cohort), with the highest prevalence in the severe CVI group (88.5%), decreasing to 78.2% in the moderate and 53.1% in the mild CVI groups (Table 1; p < 0.001). No venous incompetence was observed in 652 limbs (44.1%), with the highest proportion in the mild CVI group (62.8%), compared to 44.6% in the moderate and 23.9% in the severe CVI groups (Table 2). Severe incompetence was noted in 171 limbs (11.6%), predominantly in the severe CVI group (22.4%), compared to 8.7% in moderate and 4.2% in mild CVI. Hemodynamic obstruction was observed in 47 limbs (3.2%) and was most prevalent with severe CVI (4.4%, 21 limbs), followed by 3.8% (19 limbs) with moderate CVI, and 1.4% (7 limbs) with mild CVI (p = 0.002). The mean EF was lowest in the severe CVI group at 40.2% (SD 22.2), increased to 44.9% (22.0) in the moderate group, and was highest in the mild group at 56.5% (21.7; p < 0.001). The mean P-EVRT was lowest in the severe CVI group at 13.0 seconds (SD 11.0), increasing to 18.8 seconds (14.5) in the moderate group and 28.5 seconds (18.3) in the mild group (p < 0.001).

Table 2.

Venous air plethysmography results across chronic venous insufficiency categories.

		Severe CVI (C4–6)	Moderate CVI (C3)	Mild CVI (C0–2)	p-value
	N	n = 477	n = 496	n = 505
Calf ejection fraction (%), mean (SD)	1478	40.2 (22.2)	44.9 (22.0)	56.5 (21.7)	< 0.001
P-EVRT (seconds), mean (SD)	1478	13.0 (11.0)	18.8 (14.5)	28.5 (18.3)	< 0.001
Obstructed, n (%)	47	21 (4.4)	19 (3.8)	7 (1.4)	0.002
Venous incompetence classification, n (%)
None	652	114 (23.9)	221 (44.6)	317 (62.8)	< 0.001
Mild	353	119 (24.9)	120 (24.2)	114 (22.6)
Moderate	302	137 (28.7)	112 (22.6)	53 (10.5)
Severe	171	107 (22.4)	43 (8.7)	21 (4.2)

CVI, chronic venous insufficiency; P-EVRT, postexercise venous refilling time.

Venous ultrasound data

Among the 1478 limbs, 997 (67.4%) exhibited valvular incompetence on venous insufficiency US in at least one location (deep or superficial vein segment), most frequently in the severe CVI group (75.3%), followed by 64.7% in the moderate and 62.8% in the mild CVI groups (p < 0.001; Table 1). Deep venous incompetence at one or more vein segments was associated with CVI severity (p < 0.001): it was present on US in 33.3% of limbs with severe CVI, 19.8% of limbs with moderate CVI, and 13.7% of limbs with mild CVI. Superficial venous incompetence on venous US was also significantly associated with disease severity (p = 0.02). The severity of venous incompetence, graded as none, mild, moderate, or severe for each deep and superficial vein segment tested, is shown in Table 3. Ultrasound evidence of prior phlebitis was found in 18.0% of limbs with severe CVI, 7.9% of limbs with moderate CVI, and 4.8% of limbs with mild CVI (p < 0.001).

Table 3.

Ultrasound findings across chronic venous insufficiency categories.

		Severe CVI (C4–6)	Moderate CVI (C3)	Mild CVI (C0–2)	p-value
	N	n = 477	n = 496	n = 505
History of phlebitis by US^a, n (%)	149	86 (18.0)	39 (7.9)	24 (4.8)	< 0.001
Classification of incompetence severity by vein segment
CFV, n (%)
None	1331	397 (83.2)	453 (91.3)	481 (95.2)	< 0.001
Mild	36	18 (3.8)	11 (2.2)	7 (1.4)
Moderate	49	30 (6.3)	15 (3.0)	4 (0.8)
Severe	62	32 (6.7)	17 (3.4)	13 (2.6)
FV, n (%)
None	1305	372 (78.0)	451 (90.9)	482 (95.4)	< 0.001
Mild	16	10 (2.1)	4 (0.8)	2 (0.4)
Moderate	50	27 (5.7)	18 (3.6)	5 (1.0)
Severe	107	68 (14.3)	23 (4.6)	16 (3.2)
PV, n (%)
None	1223	347 (72.7)	428 (86.3)	448 (88.7)	< 0.001
Mild	27	9 (1.9)	8 (1.6)	10 (2.0)
Moderate	59	25 (5.2)	20 (4.0)	14 (2.8)
Severe	169	96 (20.1)	40 (8.1)	33 (6.5)
GSV calf, n (%)
None	844	244 (51.2)	301 (60.7)	299 (59.2)	0.001
Mild	84	39 (8.2)	25 (5.0)	20 (4.0)
Moderate	153	68 (14.3)	42 (8.5)	43 (8.5)
Severe	397	126 (26.4)	128 (25.8)	143 (28.3)
GSV knee, n (%)
None	1084	341 (71.5)	364 (73.4)	379 (75.0)	0.87
Mild	26	11 (2.3)	7 (1.4)	8 (1.6)
Moderate	64	22 (4.6)	22 (4.4)	20 (4.0)
Severe	304	103 (21.6)	103 (20.8)	98 (19.4)
GSV thigh, n (%)
None	1118	348 (73.0)	385 (77.6)	385 (76.2)	0.13
Mild	13	2 (0.4)	3 (0.6)	8 (1.6)
Moderate	50	22 (4.6)	15 (3.0)	13 (2.6)
Severe	297	105 (22.0)	93 (18.8)	99 (19.6)
AASV, n (%)
None	1407	448 (93.9)	475 (95.8)	484 (95.8)	0.64
Mild	4	2 (0.4)	2 (0.4)	0 (0.0)
Moderate	12	5 (1.0)	3 (0.6)	4 (0.8)
Severe	55	22 (4.6)	16 (3.2)	17 (3.4)
SFJ, n (%)
None	1204	367 (76.9)	403 (81.2)	434 (85.9)	0.002
Mild	25	10 (2.1)	10 (2.0)	5 (1.0)
Moderate	30	18 (3.8)	9 (1.8)	3 (0.6)
Severe	219	82 (17.2)	74 (14.9)	63 (12.5)
SPJ, n (%)
None	1386	438 (91.8)	468 (94.4)	480 (95.0)	0.19
Mild	11	5 (1.0)	4 (0.8)	2 (0.4)
Moderate	23	11 (2.3)	9 (1.8)	3 (0.6)
Severe	58	23 (4.8)	15 (3.0)	20 (4.0)
SSV, n (%)
None	1170	354 (74.2)	396 (79.8)	420 (83.2)	0.03
Mild	57	25 (5.2)	21 (4.2)	11 (2.2)
Moderate	90	36 (7.5)	29 (5.8)	25 (5.0)
Severe	161	62 (13.0)	50 (10.1)	49 (9.7)

^a Phlebitis in deep or superficial veins was defined as postthrombotic vein thickening or partial or complete thrombotic occlusion of a vein segment.

AASV, anterior accessory saphenous vein; CFV, common femoral vein; CVI, chronic venous insufficiency; FV, femoral vein; GSV, great saphenous vein; PV, popliteal vein; SFJ, saphenofemoral junction; SPJ, saphenopopliteal junction; SSV, small saphenous vein; US, ultrasound.

Prediction of chronic venous disease severity

Table 4 summarizes the performance of machine learning models trained to predict severe CVI as defined by CEAP classification scores of 4, 5, or 6. Models developed using insufficiency US data (parameters from Table 3) demonstrated the lowest predictive performance across all classifiers. The gradient boosting model, despite being the best-performing US-based approach, reached an AUROC of 0.65. The SVC followed closely with an AUROC of 0.63. Traditional logistic regression had the lowest predictive capacity, with an AUROC of 0.61. For PG-based data models using the complete set of parameters (from Table 2), MLP yielded the highest overall performance with an AUROC of 0.82. Similarly, AdaBoost and logistic regression classifiers using the full PG dataset also demonstrated strong predictive capabilities, with AUROC values of 0.81–0.82. These results indicate that models utilizing comprehensive PG data were highly effective in predicting the presence of severe CVI. As PG contains measurements not performed by US (CPF, P-EVRT), we then limited the PG input to only the incompetence and obstruction data; the model performance declined moderately but remained higher than insufficiency US results. For instance, logistic regression on this dataset achieved an AUROC of 0.72 (as compared to 0.81 before). Both MLP and KNN models showed similar outcomes (AUROC of 0.72).

Table 4.

Top performing machine learning models for the prediction of severe chronic venous insufficiency (CEAP classes 4–6) by venous study type and components.

Study	Data^a	Classifier	Training	Testing	PPV	Sensitivity	Accuracy	AUROC
PG	All	AdaBoost	750	188	0.73	0.73	0.73	0.82
PG	All	MLP	750	188	0.76	0.76	0.76	0.82
PG	All	Logistic regression	750	188	0.74	0.74	0.74	0.81
PG	Incompetence + obstruction	Logistic regression	750	188	0.66	0.66	0.66	0.72
PG	Incompetence + obstruction	MLP	750	188	0.69	0.69	0.69	0.72
PG	Incompetence + obstruction	K-nearest neighbors	750	188	0.69	0.69	0.69	0.72
US	All	GradBoost	763	191	0.61	0.61	0.61	0.65
US	All	SVC	763	191	0.60	0.60	0.60	0.63
US	All	Logistic regression	763	191	0.58	0.58	0.58	0.61

^a All machine learning models were analyzed on the same sampling; the top two performing models and logistic regression were included for each subset.

AUROC, area under the receiver operating characteristic curve; CEAP, Clinical, Etiologic, Anatomic, and Pathophysiologic; KNN, k-nearest neighbors; MLP, multilayer perceptron; PG, air plethysmography; PPV, positive predictive value; SVC, support vector classifier; US, ultrasound.
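The metrics reported in Table 4 can be illustrated on a toy example. The sketch below is not the authors' evaluation code, which used scikit-learn: the function names and the six made-up scores are hypothetical, AUROC is computed directly as the fraction of positive/negative pairs ranked correctly (ties count as one half), and min-max scaling is only one plausible reading of how decision function scores were "normalized". Because AUROC depends only on the ranking of scores, any monotonic normalization leaves it unchanged.

```python
def auroc(y_true, scores):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def min_max(scores):
    """Rescale raw decision scores to [0, 1] (assumed normalization)."""
    lo, hi = min(scores), max(scores)
    return [(s - lo) / (hi - lo) for s in scores]


def confusion_metrics(y_true, y_pred):
    """PPV, sensitivity, and accuracy from binary labels and predictions."""
    tp = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 0)
    tn = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 0)
    return {
        "ppv": tp / (tp + fp),
        "sensitivity": tp / (tp + fn),
        "accuracy": (tp + tn) / len(y_true),
    }


# Made-up labels (1 = severe CVI) and raw decision scores, not study data.
y_true = [1, 1, 1, 0, 0, 0]
raw_scores = [2.0, 0.5, -0.2, 0.1, -1.0, -2.0]

probs = min_max(raw_scores)
auc_raw = auroc(y_true, raw_scores)
auc_scaled = auroc(y_true, probs)

# Classify as severe when the scaled score is at least 0.5.
y_pred = [1 if p >= 0.5 else 0 for p in probs]
metrics = confusion_metrics(y_true, y_pred)
```

On a balanced test set, PPV, sensitivity, and accuracy coincide whenever the numbers of false positives and false negatives are equal, as happens in this toy example.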

Discussion

This study evaluated PG and insufficiency US in assessing the severity of CVI across 1478 limbs in a large cohort of patients. Using machine learning models, our analysis demonstrates the superior predictive performance of PG results compared to US-derived results for the prediction of severe CVI. Even when the PG input was limited to the incompetence and obstruction data, PG models still outperformed US models (Figure 1). The superior performance of PG models indicates that quantitative hemodynamic measures of the overall limb perform better than assessments of individual deep and superficial vein segments, even when the segments are analyzed together with sophisticated analysis techniques. Approximately 25% of limbs with clinically severe CVI had a completely normal venous insufficiency US, compared with only 11% with a normal PG study. This finding supports the superiority of PG for determining the severity of CVI and indicates that PG has higher sensitivity in more advanced CVI, likely in part because it includes CPF and P-EVRT measurements.
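The proportions of normal studies among severe CVI limbs follow directly from the counts in Table 1, as the short calculation below shows. The variable and function names are hypothetical; the counts are those reported for the severe CVI column.

```python
SEVERE_LIMBS = 477          # limbs with C4-6 disease (Table 1)
ANY_INCOMPETENCE_US = 359   # severe limbs with any incompetence by US
ABNORMAL_PG = 422           # severe limbs with an abnormal PG study


def pct(count, total):
    """Percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


abnormal_us_pct = pct(ANY_INCOMPETENCE_US, SEVERE_LIMBS)
abnormal_pg_pct = pct(ABNORMAL_PG, SEVERE_LIMBS)

# Severe limbs in which the test found nothing abnormal.
normal_us_pct = pct(SEVERE_LIMBS - ANY_INCOMPETENCE_US, SEVERE_LIMBS)
normal_pg_pct = pct(SEVERE_LIMBS - ABNORMAL_PG, SEVERE_LIMBS)
```

The resulting values (about 24.7% with normal US and 11.5% with normal PG) correspond to the rounded figures quoted in the text.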

Figure 1.

Receiver operating characteristic curves for machine learning models predicting severe chronic venous insufficiency. Plethysmography all, using all parameters (AdaBoost), achieved the highest performance (AUC = 0.82), followed by Plethysmography limited, restricted to incompetence and obstruction parameters (logistic regression, AUC = 0.72), and ultrasound (GradBoost, AUC = 0.65).

We have previously examined how venous hemodynamic parameters, which can only be assessed via PG, correlate with the diagnosis and classification of CVI. Notably, CPF was independently correlated with CVI severity based on the CEAP classification.²⁰ A stepwise relationship was observed between CPF and the occurrence of active or past ulcers. This relationship was most pronounced when CPF was severely impaired, with ejection fractions between 0% and 9%. Conversely, limbs with CPF in the 40–49% range did not show a statistically significant association with active or prior ulceration when compared with those displaying higher EF values. Reduced CPF has also been shown to be a risk factor for venous thromboembolism¹⁴,²¹ and has been independently associated with higher mortality.¹³,²² Furthermore, reduced CPF cannot be explained by generalized muscle weakness as measured by handgrip strength.²³ P-EVRT has also been shown to be independently associated with the severity of CVI classified by CEAP.¹⁵ Rapid P-EVRT (less than 20 seconds) correlated significantly with increased clinical severity, as indicated by CEAP classes. For every 10-second reduction in P-EVRT below 40 seconds, there was a notable rise in CEAP classes. Refilling times under 10 seconds were strongly associated with a higher prevalence of skin changes and healed or active venous leg ulcers. In this analysis, PG outperformed US in predicting CVI severity, in part because of these quantitative hemodynamic measurements.

Ambulatory venous pressure was historically regarded as the gold standard for assessing the hemodynamic burden of CVI.² However, due to its invasive nature, it is not suitable for routine screening or repeated measurements. DUS and PG are the two most frequently used noninvasive modalities for evaluating venous function. DUS has become the first-line imaging technique for detecting the presence, extent, and anatomical distribution of venous reflux; in part, this is because of the discontinuation of specific billing codes for PG in the United States, which has led to its decline. DUS does not, however, provide an overall assessment of the limb, but rather a detailed anatomic map that can be helpful for surgical decision-making. Notably, no studies have demonstrated a strong correlation between DUS findings and ambulatory venous pressure. In contrast, hemodynamic parameters obtained from PG have shown a consistent association with invasively measured ambulatory venous pressures.¹⁰

Direct comparisons of US and PG in CVI exist but are limited. Bays et al.,²⁴ in a study of 20 patients, demonstrated that PG accurately distinguishes limbs with and without venous reflux when compared with DUS, which aligns with our findings. They observed that PG is as valuable as DUS for evaluating patients undergoing ligation and stripping of varicose veins. Moreover, PG allows for the quantitative assessment of superficial venous incompetence, detects any outflow obstruction, and helps determine whether varicosities significantly contribute to elevated venous pressure or serve as collateral outflow pathways from obstructed deep veins. Current guidelines offer only limited recommendations for PG in clinical practice.²⁵

Our work is the largest study comparing US to PG in the assessment of CVI, but it has several limitations. The dataset was restricted to patients who ultimately underwent both PG and insufficiency US testing, which likely enriched the cohort with patients who had more advanced venous disease than would be expected in the general population. Although this does not bias the direct comparison of machine learning models for PG and US in our cohort, it does prevent a complete understanding of how the two tests compare in a more representative population-based sample. CEAP classification was performed at the time of PG testing (rather than at the time of the US study), which may have influenced the strength of its correlation with PG. CEAP classification was performed and recorded before the PG results were known; however, the US studies could have been performed before or after the PG study (and CEAP recording).

The CEAP classification was used as the reference standard in this study because it is a clinically relevant and well-validated system; however, the clinical (C) component, as a clinical assessment, lacks specificity for venous pathology. This may explain why, in some limbs with C4, C5, or C6 disease (which we classified as severe CVI), no evidence of venous pathology was found on either US or PG. Such cases may in fact represent patients with lymphedema without venous pathology. Unfortunately, additional quantitative scoring tools, such as the Venous Clinical Severity Score, were not uniformly available in the electronic health record. To limit any time-based changes in CVI, we ensured that studies were performed in close proximity (< 90 days), and most (80%) were done within 10 days. Because the study population consisted of patients with signs or symptoms potentially consistent with CVI who were referred for diagnostic testing, the severity of CVI should not be generalized to the broader population or be used to estimate the true prevalence and characteristics of CVI in the community.

Conclusions

This analysis directly compares US and PG in a large sample of patients being evaluated for CVI and demonstrates the superior performance of PG for predicting severe CVI. Unfortunately, PG is unavailable to many patients and clinicians, in part due to poor reimbursement and the decommissioning of specific billing codes. This analysis, alongside recent publications firmly establishing CPF and P-EVRT as important parameters in CVI assessment, reaffirms the clinical value of PG in venous testing. Streamlined PG technology, updated billing codes, and modernized venous testing guidelines are essential to re-establish the importance of PG in venous disease. Although PG outperformed insufficiency US in this analysis, we believe both tests offer important insights into venous disease and are likely complementary in the assessment and ultimate treatment decisions for patients with CVI.²⁶

Footnotes

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

ORCID iDs

Hossam Elbenawi

Ana I Casanegra

Robert McBane

Damon E Houghton

Supplemental material

Supplemental material for this article is available online.

References

1. Da Silva, Navarro, Batalheiro. [The importance of chronic venous insufficiency: Various 10 preliminary data on its medico-social consequences] [Article in French]. Phlebologie 1992; 45: 439–443.

2. Criado. Laboratory evaluation of the patient with chronic venous insufficiency. In: Rutherford (ed). Vascular surgery, 4th ed. Philadelphia, PA: WB Saunders Company, 1995, 1771–1784.

3. Talbot. Use of real time imaging in identifying deep venous obstruction: A preliminary report. Bruit 1982; 6: 41–45.

4. Sullivan, Peter, Cranley JJ. Real time B-mode venous ultrasound. J Vasc Surg 1984; 1: 465–471.

5. Hannan, Stedje, Skorcz, Caprini JA. Venous imaging of the extremities: Our first twenty-five hundred cases. Bruit 1986; 10: 29–32.

6. Kalodiki, Matson, Volteas, et al. The combination of liquid crystal thermography and duplex scanning in the diagnosis of deep vein thrombosis. Eur J Vasc Surg 1992; 6: 311–316.

7. Allan JC. Volume changes in the lower limb in response to postural alterations and muscular exercise. S Afr J Surg 1964; 2: 75–90.

8. Christopoulos, Nicolaides AN. Haemodynamic factors responsible for venous ulceration. Angiologie 1988; 39: 401–403.

9. Gillespie, Cordts, Hartono, et al. The role of air plethysmography in monitoring results of venous surgery. J Vasc Surg 1992; 16: 674–678.

10. Christopoulos, Nicolaides, Szendo, et al. Air plethysmography and the effect of elastic compression on venous hemodynamics of the leg. J Vasc Surg 1987; 5: 148–159.

11. Cordts, Hartono, Lamorte, Menzoian JO. Physiological similarities between extremities with varicose veins and with chronic venous insufficiency utilizing air plethysmography. Am J Surg 1992; 164: 260–264.

12. Rooke, Heser, Osmundson PJ. Exercise strain-gauge venous plethysmography: Evaluation of a “new” device for assessing lower limb venous incompetence. Angiology 1992; 43: 219–228.

13. Halkar, Medina Inojosa, Liedl, et al. Calf muscle pump function as a predictor of all-cause mortality. Vasc Med 2020; 25: 519–526.

14. Houghton, Ashrani, Liedl, et al. Reduced calf muscle pump function is a risk factor for venous thromboembolism: A population-based cohort study. Blood 2021; 137: 3284–3290.

15. Elbenawi, Ghorbanzadeh, Liedl, et al. Rapid post exercise venous refilling time is an independent contributor to chronic venous insufficiency. J Vasc Surg Venous Lymphat Disord 2025; 13: 102292.

16. Seabold, Perktold. Statsmodels: Econometric and statistical modeling with Python. In: Proceedings of the 9th Python in Science Conference (SciPy 2010), Austin, TX, June 28–July 3, 2010; 92–96.

17. Passman, Meisner, Dalsing, et al. The 2020 update of the CEAP classification system and reporting standards. J Vasc Surg Venous Lymphat Disord 2020; 8: 342–352.

18. Pedregosa, Varoquaux, Gramfort, et al. Scikit-learn: Machine learning in Python. J Mach Learn Res 2011; 12: 2825–2830.

19. Chen, Guestrin. XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, Aug 13–17, 2016, 785–794. Association for Computing Machinery, New York, NY.

20. Ghorbanzadeh, Liedl, Elbenawi, et al. Relationship between calf muscle pump function and severity of chronic venous disease. Vasc Med 2025; 30: 473–479.

21. Houghton, Ashrani, Liedl, et al. Reduced calf pump function and proximal deep vein incompetence are predictors for ipsilateral deep vein thrombosis. Vasc Med 2022; 28: 57–58.

22. McBane, Liedl, Wysokinski, et al. Calf rEF: Impact of calf muscle pump dysfunction with reduced ejection fraction on all-cause mortality. Mayo Clin Proc 2024; 99: 902–912.

23. Ghorbanzadeh, Abud, Liedl, et al. Reduced calf muscle pump function is not explained by handgrip strength measurements. J Vasc Surg Venous Lymphat Disord 2024; 12: 101869.

24. Bays, Healy, Atnip, et al. Validation of air plethysmography, photoplethysmography, and duplex ultrasonography in the evaluation of severe venous stasis. J Vasc Surg 1994; 20: 721–727.

25. De Maeseneer, Kakkos, Aherne, et al. Editor’s Choice – European Society for Vascular Surgery (ESVS) 2022 clinical practice guidelines on the management of chronic venous disease of the lower limbs. Eur J Vasc Endovasc Surg 2022; 63: 184–267.

26. Melvin, Wakefield, Obi AT. Air plethysmography: A phoenix or destined for ashes? Vasc Med 2025; 30: 480–481.

Plethysmography compared to ultrasound for the evaluation of chronic venous insufficiency
