Abstract
Background
Orthopaedic infections are difficult to eradicate because biofilm and poor local vascularity limit antibiotic exposure. Continuous local antibiotic perfusion (CLAP) delivers sustained, titratable antibiotics directly into infected compartments. We used harmonised individual participant data (IPD) to quantify early effectiveness, longer-term control, safety, and patient-level modifiers.
Methods
We performed an IPD review of observational reports using CLAP as primary or adjunctive therapy (January–May 2025). The primary outcome was 30-days early response (C-reactive protein ≤3 mg/L or earliest sustained clinical/wound improvement). Secondary outcomes were durable infection control at ≥6 and ≥12 months using evaluable denominators with best–worst bounds, infection-free days) and safety. One-stage analyses used mixed-effects logistic regression; Restricted Mean Survival Time (RMST) was preferred when proportional hazards were violated. Multiple imputation supported inferences.
Results
Eighty-one studies (n = 256) were included; 164 patients had observed time-to-response. Fifty-nine percent achieved a 30-days response; median time-to-response was 26 days. Implant involvement was associated with lower odds of 30-days response; trajectories were slower with implants and higher organism burden (polymicrobial ≥3), while osteomyelitis responded faster than fracture-related infection. RMST (30) showed delays with implants (+4.43 days) and polymicrobial infection (+6.74 days), and faster response for osteomyelitis versus fracture-related infection (−9.06 days). Durable control among evaluable patients was 88.4% at ≥6 months and 90.2% at ≥12 months, with best–worst bounds of 89.2–82.2% and 90.9–83.5%, respectively. Infection-free-day RMST supported substantial time free of recurrent infection within the first year. Adverse events were uncommon; renal events were generally reversible.
Conclusions
CLAP achieved encouraging early response and high durability among evaluable patients, with slower trajectories when implants were retained or pathogen burden was high and faster responses in osteomyelitis. Safety appeared acceptable with monitoring. Prospective comparative studies using standardised endpoints, with RMST for non-proportional hazards, are warranted.
Keywords
Introduction
Orthopaedic infections—including osteomyelitis, septic arthritis, fracture-related infection (FRI), and periprosthetic joint infection (PJI)—remain difficult to eradicate and often require repeated debridement, prolonged systemic antibiotics, and complex implant decisions. The burden is substantial and rising with population ageing, higher arthroplasty volumes, complex trauma, and multimorbidity. Population-level data show a sustained and projected increase in musculoskeletal infectious morbidity and related service demand.1,2 Even with contemporary strategies such as debridement, antibiotics and implant retention (DAIR), outcomes are limited by biofilm biology and compromised local vascularity, which reduce effective concentration–time exposure at the nidus and permit persistent or recrudescent infection.3–6
These constraints motivate local strategies that deliver high antimicrobial exposure in situ while minimising systemic toxicity. 7 Continuous local antibiotic perfusion (CLAP) aims to meet that need: catheters are placed within the infected bone, joint, or soft tissues to allow continuous, titratable antibiotic ingress to the target space.7,8 Pharmacologically, prolonged high local exposure can increase biofilm susceptibility and overcome time-dependent tolerance.9,10 Critically, CLAP is typically designed with controlled egress by pairing perfusion with negative-pressure wound therapy (NPWT) or dedicated drainage, which helps avoid pooling, promotes clearance of exudate, and enables monitoring of effluent; this ingress–egress circuit distinguishes CLAP from instillation-only approaches.11–13
Despite these practical advantages, the evidence base is largely case reports and small series with heterogeneity in techniques, denominators, and outcome definitions. An individual-participant-data (IPD) approach allows harmonised diagnostic frameworks, uniform outcome timing/definitions, and prespecified exploration of patient-level modifiers while accounting for study-level heterogeneity. 14 Because early decision points matter clinically, we prioritised a 30-days early-response endpoint (C-Reactive Protein (CRP) ≤3 mg/L or earliest sustained clinical/wound improvement when CRP is unavailable), supported by laboratory reference ranges and prior CLAP series using similar thresholds.15–18 A 30-days horizon is also aligned with surgical site infection (SSI) surveillance windows, trial feasibility, and health-system quality metrics (e.g., 30-days readmission).19–23 Longer-term control (“durability”) was evaluated at prespecified windows with transparent denominators and bounds.
We sought to (i) quantify the proportion achieving early response by 30 days and characterise the distribution of time-to-early-response; (ii) estimate longer-term control at prespecified windows using clear evaluable denominators and best–worst bounds; (iii) explore prespecified patient-level modifiers (implant involvement, organism groups, organism counts, infection type/site/phase, selected comorbidities) for the primary endpoint within a one-stage IPD framework; and (iv) summarise recurrence/relapse and complications (major/minor, including antibiotic-related adverse events). Protocol features (site of antibiotic perfusion, infusion duration/dose/concentration) were not primary analytic targets.
Methods
Protocol and reporting standards
We followed the Preferred Reporting Items for Systematic Review–Individual Participant Data (PRISMA-IPD) guidelines 14 and PRISMA 2020 (Table S1). 24 The protocol was registered with PROSPERO (CRD42025635194; 21 Jan 2025). Two prospectively recorded amendments (25 Mar 2025; pre-extraction, and 20 September 2025) formalized IPD methods and clarified the outcome hierarchy (primary: 30-days early response; secondary: durable infection control and infection-free days).
Patient cohort and eligibility criteria
We included case reports, case series, and cohort studies reporting continuous local antibiotic perfusion (CLAP) as part of an infection management pathway in orthopaedic settings. CLAP was defined as continuous intralesional antibiotic ingress via catheters into bone, joint, or soft tissues with controlled egress through negative-pressure wound therapy (NPWT) or dedicated drainage/aspiration. Variants included intramedullary (iMAP), intra–soft-tissue (iSAP), and intra-articular (iJAP) approaches. We excluded studies using intermittent instillation without controlled egress and studies focused only on other local carriers (e.g., beads/spacers) without continuous perfusion. Populations included adults and children with osteomyelitis, periprosthetic joint infection (PJI), fracture-related infection (FRI), surgical-site infection involving musculoskeletal compartments, or septic arthritis. Non-orthopaedic infections, animal models, and in-vitro reports were excluded. Screening occurred at the study level; during IPD extraction, any individual patients who did not actually receive CLAP were removed while retaining eligible cases from the same report.
Outcomes and definitions
Primary outcome
Early response within 30 days of CLAP initiation, defined as C-reactive protein (CRP) ≤3 mg/L or, where CRP was unavailable, the earliest sustained clinical or wound improvement within 30 days15–18 The binary indicator was derived from observed response times when available; cases described only as ‘response by 30 days’ without an exact response time contributed to the 30-days endpoint but were treated as censored in time-to-event analyses.
Secondary outcomes
(i) Durability of infection control at prespecified windows (≥6 and ≥12 months) using evaluable denominators (patients with follow-up at/after the window and determinate status), with best–worst bounds for unknowns. By “best–worst bounds” we mean the best case assigns all unknown-status patients to success and the worst-case assigns all to failure, yielding a transparent range when follow-up is incomplete. (ii) Infection-free days (IFD) up to fixed horizons (τ = 180 and 365 days) via restricted mean survival time (RMST) using interval-censored methods. (iii) Safety, including major/minor complications and antibiotic-related adverse events (renal, otologic). (iv) Recurrence/relapse and infection-related mortality. (v) Functional recovery when available (descriptive).
A patient was counted as having their infection durably controlled if, by last observation at/after the window, there was no documented recurrence/relapse and no infection-related death. Patients with follow-up at/after the window but indeterminate status (e.g., missing recurrence documentation) were treated as unknown and excluded from the evaluable denominator; we report best–worst bounds by assigning all unknowns to success or failure. In a strict sensitivity definition, durable control additionally required no re-operation for infection and no chronic suppressive antibiotics (CSA) when explicitly reported; lack of reporting was not penalized.
Rationale for the 30-days endpoint
The first month captures the decisive postoperative window when inflammatory markers and wound status typically declare trajectory and when escalation or de-escalation decisions are made. Within our IPD, the 30-days mark lay between the cohort’s median and upper-quartile treatment response times, supporting a threshold that reflects meaningful early treatment response without being overly permissive. Because any single threshold can obscure distributional detail, we paired the binary 30-days analysis with time-to-event and restricted mean survival time (RMST) approaches, preserving information about the full clearance-time distribution and enabling interpretation in absolute days gained or lost over a fixed horizon.
Information sources and search strategy
We searched MEDLINE (PubMed and Ovid), EMBASE (ProQuest), Scopus, J-STAGE, Isho. jp Web, CiNii Research, and the Cochrane Library from inception to 1 May 2025, without language or publication-status restrictions. Trial registries (ClinicalTrials.gov, WHO ICTRP) were searched but yielded no eligible records. We screened reference lists of included studies and prior reviews, scanned citation indices, and searched grey literature (Google Scholar, ResearchGate). Authors were contacted for restricted-access publications and to clarify eligibility or data elements. Database search start dates were: PubMed 10 Jan 2025; Scopus 10 Jan 2025; Ovid MEDLINE 11 Jan 2025; J-STAGE 11 Jan 2025; Cochrane Library 12 Jan 2025; EMBASE (ProQuest) 12 Jan 2025; Isho. jp Web and CiNii Research 13 Jan 2025. Full strategies (one verbatim, others adapted) are in the Supplemental Appendix.
Study selection, data collection, and management
Two reviewers screened titles/abstracts and full texts independently; disagreements were resolved by discussion or a third reviewer. To minimise assessor bias, screening and IPD extraction/harmonisation were performed independently using a prespecified codebook; disagreements were resolved by consensus or third-party adjudication. Where feasible, reconstructed IPD were cross-checked against source tables/figures and verified via author queries. Overlapping series were checked for duplicate patients by centre, time frame, and key characteristics; duplicates were removed. For eligible studies, we requested individual participant data (IPD) covering demographics; infection type, anatomical site, and phase; implant management; microbiology; comorbidities; CLAP modality and antibiotic details (drug, concentration, dose, flow/dwell, duration; NPWT parameters); biomarkers; clinical course; follow-up; and outcomes. Where author-provided IPD was unavailable, we reconstructed patient-level data from published tables/figures/narratives when feasible. Study-level variables (country, centre, design, year, sample size, follow-up) were recorded. Data were stored in a secure repository (Google Drive) with restricted access. Integrity checks included range and plausibility limits for continuous variables; temporal logic checks (procedure → response/failure → last follow-up); and cross-field consistency (e.g., infection type vs anatomical site). Values were reconciled against published tables/figures/narratives; queries to authors resolved discrepancies where possible.
Study-versus patient-level eligibility
Screening at the study level maximized capture of CLAP cohorts across designs and languages. During IPD harmonization, we then applied patient-level verification: individuals misclassified as receiving CLAP or treated exclusively with other local strategies were excluded while eligible co-patients from the same study were retained. This two-step approach reduces selection bias from excluding otherwise informative reports, preserves per-study clustering for one-stage modelling, and ensures that effectiveness estimates are derived strictly from participants who actually received CLAP. It also maintains fidelity to the real-world heterogeneity of CLAP deployment while allowing standardized outcome definitions and covariate structures at the individual level.
Risk of bias assessments
Within studies, we used the CARE Case Report Checklist and JBI Case Report/Case Series Critical Appraisal Tool, which are presented as study-level summaries and traffic-light plots (Figure 1).25,26 IPD integrity checks (sequence plausibility, completeness, cross-field consistency) corroborated these ratings. Across studies, small-study effects/publication bias were evaluated visually (funnel plots) and statistically via Egger’s regression (p < 0.05 indicating asymmetry), which results (interpret cautiously) are provided in the Supplementary Appendix (Figures S1 and S2).
27
Risk-of-bias assessments utilizing CARE and JBI guidelines: (A) case reports and (B) case series, displayed as traffic-light plots. Abbreviations: CARE: CAse REport guidelines; JBI: Joanna Briggs institute.
Clinical and microbiological classifications
Infection types followed consensus frameworks: PJI by International Consensus Meetings (ICM)/Musculoskeletal Infection Society (MSIS)28,29; FRI by the FRI Consensus Group 30 ; soft-tissue and septic arthritis by Centers for Disease Control and Prevention (CDC)/Infectious Diseases Society of America (IDSA) and StatPearls.31,32 The European Bone and Joint Infection Society (EBJIS) definition of PJI was also referenced during harmonization. 33 We defined phase as acute (≤6 weeks), chronic (>6 weeks), or acute-on-chronic based on consensus/diagnostic guidance.33,34 Anatomy was grouped as bone, joint, or soft tissue. Isolates were curated into 11 clinically relevant groups with “negative culture” as reference; organism count was monomicrobial, dual, or polymicrobial (≥3). Low-frequency/collinear organism/comorbidity categories were collapsed (e.g., Gram class) to satisfy events-per-variable (EPV) and variance-inflation (VIF) constraints. Because patients could have >1 organism or comorbidity, these were coded as binary dummies (presence/absence). Complications were classified by Clavien–Dindo (grades ≥III as major). 35 Comorbidities were standardised into a few clinically relevant ones (diabetes, chronic kidney disease, end-stage renal disease, rheumatoid arthritis, other autoimmune conditions), with composites (renal impairment; autoimmune-any). For durability, we constructed an interval-censored dataset with left/right bounds from event dates (failure), last observation, and follow-up.
Statistical analysis
Descriptive statistics and hypothesis testing
Continuous summary data (observed only, non-imputed) are presented as medians with interquartile ranges (IQR); categorical summaries use counts (n/N) and percentages (%).
Categorical comparisons employed Pearson’s χ2 when expected counts were ≥5 and simulated Fisher’s exact testing for sparser data. One-vs-all contrasts (each subgroup vs all remaining patients) were analysed via two-sample Z-tests where both margins exceeded 30. Adjustments for multiple comparisons were made using Bonferroni and false discovery rate (FDR) corrections. These analyses were repeated stratified by infection type and endpoint, with missing data excluded from hypothesis tests but tallied separately for transparency. Hypothesis tests were exploratory/descriptive given the case report/series design and heterogeneous ascertainment.
We additionally extracted study-level CLAP implementation variables to address heterogeneity in clinical application, including CLAP variant (iMAP/iSAP/iJAP), perfusate antibiotic(s), reported concentration and dose, infusion rate, and intended duration. Where reported, we also captured systemic antibiotic co-therapy (presence, agents, duration) and serum antibiotic level monitoring (maximum reported blood concentration). These protocol features were summarized descriptively and were not treated as analytic targets. Categorical variables are presented as counts/percentages (study-level; each study counted once), and numeric parameters are summarized as medians with interquartile ranges where available and comparable across reports. Full study-level CLAP implementation details are provided in the “Supplementary CLAP Implementation Table”.
Multiple imputation for missing data handling
Missing data were handled with multiple imputation by chained equations (MICE) with m = 20 imputations to reduce Monte-Carlo error. The same imputations were used for descriptive Kaplan–Meier/RMST displays and for the primary regression analysis. Categorical predictors used logistic or polytomous logistic imputation; continuous predictors used predictive mean matching; estimates and standard errors were pooled using Rubin’s rules. Time-to-response event times were not imputed for time-to-event analyses; when a clearance/response time was not reported, observations were right-censored at the last observed follow-up time. The 30-days endpoint was computed directly from reported information (response time when available and follow-up/status otherwise) and was not re-derived from imputed event times. Robustness checks (Supplemental Appendix) included leave-one-study-out analyses for the primary model, exclusion of very short follow-up (<90 days), and the strict durability definition noted above.
Kaplan–Meier and restricted mean survival time (RMST)
For time-to-response displays, an event was defined as a reported time to infection clearance/response. If a clearance time was not reported, the observation was treated as right-censored at the last observed follow-up time. Cases described only as “cleared by 30 days” without a clearance time contributed to the 30-days binary endpoint but were not assigned an event time in time-to-event analyses. Time to treatment response was summarised using Kaplan–Meier curves, pooled across multiply imputed datasets using a complementary log–log transformation with 95% confidence intervals, and RMST at 30 days to characterise trajectory.
For subgroup comparisons, we additionally reported RMST differences (ΔRMST) up to τ = 30 days, estimated within each imputed dataset. For displayed KM subgroup contrasts, log-rank tests, single-variable Cox model Wald tests, and Schoenfeld-residual proportional hazards tests were computed within each imputed dataset and combined across imputations. For figure readability, curves were truncated at 365 days (or earlier if maximum observed follow-up was shorter), while inferential tests were computed using the full available follow-up; censoring was indicated by tick marks. Additional details are shown in the Supplemental Appendix.
Regression analyses
For the primary analysis, we fitted a one-stage, multivariable mixed-effects logistic regression with a random intercept for study (patients nested within studies) across the multiply imputed IPD (m = 20), pooling estimates by Rubin’s rules. The IPD comprised individual patients extracted from multiple primary studies (case reports/series). Each patient record was linked to a study identifier (study_id), and patients were therefore nested within studies. In the regression dataset, after restricting to patients with covariate availability for the prespecified model set, 67 studies contributed patients; cluster size ranged from 1 to 15 patients per study (median 1, IQR 1–2), and ∼61% of studies contributed a single patient. Patients from the same study share study-level features (e.g., CLAP protocol variant, antibiotic choice/dosing, surgical approach, outcome ascertainment, and follow-up intensity), which can induce within-study correlation and between-study heterogeneity. We therefore modelled study as a clustering level using a random intercept for study.
Prespecified covariates were implant involvement, bacteria class (Gram-positive, Gram-negative, anaerobes/fungi/miscellaneous, and culture-negative), mono-versus polymicrobial status (≥2 organisms), infection site/phase, age group, and comorbidity indicators (diabetes, renal impairment, and autoimmune conditions). We report adjusted odds ratios (aORs) with 95% confidence intervals; p-values are descriptive. As robustness checks (Table S7), we fitted a pooled fixed-effects logistic model and a Firth bias-reduced logistic model using the same covariates to address potential separation.
Durable infection control
For durable infection control, we estimated success proportions at 180 and ≥365 days (approximately ≥6 and ≥12 months) using evaluable denominators with exact confidence intervals, and we presented best–worst bounds by allocating unknowns entirely to success or failure. Failure (durability event) was defined as recurrence/relapse, re-operation for infection, or infection-related death. Interval-censored endpoints were constructed as: success (no failure) → (follow-up, ∞), known-time failure → [t, t], and failures without a known event time but with follow-up → (0, follow-up). We also estimated IFD using the non-parametric maximum-likelihood estimator for interval-censored data (Turnbull NPMLE) and integrated the survival curve to τ = 180 and 365 days to obtain RMST. 36 Computation used icenReg. 37 RMST provides an interpretable summary of average infection-free time and is robust when proportional hazards do not hold.38,39 Uncertainty was quantified with cluster bootstrap resampling at the study level (studies resampled with replacement, including all patients within each selected study; details in the Supplement).
Results
Study selection and IPD obtained
83 studies (including one District General Hospital series contributing eight additional patients) met inclusion. We requested IPD from all corresponding authors. Where author-provided IPD was unavailable, we reconstructed patient-level data from published tables/figures/narratives when feasible. Two studies lacked usable patient-level detail and one were excluded (another study had one salvageable representative case); one additional study was excluded for duplicate IPD. No study was excluded solely because author-provided IPD was unavailable if sufficient patient-level data could be reconstructed. Across the remaining 81 studies, published/provided IPD yielded 256 patients. For descriptive time summaries, 164 patients had observed time-to-early-response; model-based analyses used multiply imputed datasets. The PRISMA-IPD flowchart is shown in Figure 2. PRISMA-IPD flow diagram of study screening, eligibility, IPD retrieval, and inclusion. Abbreviations: IPD: individual participant data; DGH: district general hospital.
Study and patient characteristics
Baseline characteristics (observed only) by treatment response within 30 days.
aOf 256 total patients, 164 had evaluable 30-days early-response status (CRP ≤3 mg/L or clinical/wound improvement within 30 days) and were included in Table 2. The remaining 92 were unevaluable for 30-days response due to missing outcome documentation (and/or insufficient follow-up where available).
bOnly clinically relevant comorbidities are shown. Full list of comorbidities are shown in the Supplementary Appendix (Table S2).
cFor brevity, detailed “Infection Location (Anatomic)” tables are not included in this table but provided in the Supplementary Appendix (Table S2).
IQR = interquartile range; FDR = false discovery rate; NA = not applicable; MRSA = Methicillin-resistant Staphylococcus aureus; MSSA = Methicillin-susceptible Staphylococcus aureus; ESBL = Extended-spectrum beta-lactamase; CoNS = coagulase-negative Staphylococci.

Pooled summary mosaic plot of demographics, comorbidities, infection characteristics, microbiology, outcomes, and follow-up in CLAP-treated patients; Patient IDs: (A) 1A to 5J, (B) 6 to 12N, (C) 13A to 15J, (D) 16 to 24H, (E) 25 to 44, (F) 45A to 53N, (G) 53.1A to 57B, (H) 58A to 69, (I) 70A to 73C, (J) 74A to 82. Refer to (J) for color and abbreviation legends.
Implementation of CLAP across studies
Summary of CLAP protocol implementation across included studies (study-level, n = 81).
Values are study-level summaries; each study contributes once. Parsed fields standardize concentration (mg/mL), rate (mL/h), and durations (days) when extractable. iMAP = intramedullary antibiotic perfusion; iSAP = intra soft-tissue antibiotic perfusion; iJAP = intra-articular antibiotic perfusion.
Primary outcome—30-days early response
Mixed-effects multivariable logistic analysis (random study intercept) for predictors of 30-days early treatment response.
To maintain adequate events-per-variable (EPV) and model stability, predictors were collapsed/remapped; infection type and sparse/collinear variables were excluded. Specifications are detailed in the Supplementary Appendix.
CI = confidence interval; Ref = reference group.
aComposite of chronic kidney disease & end stage renal failure on dialysis.
bComposite of rheumatoid arthritis & other autoimmune diseases.
cComposited of “2 organisms” and “polymicrobial (≥3)”.
Kaplan–Meier—Cumulative probability of treatment response
In Kaplan–Meier analysis, the MI-pooled estimated 30-days cumulative probability of treatment response was 45.6% (95% CI 38.9–53.0), which is notably lower compared to the non-MI-pooled subset (13.5% difference). This KM-based estimate is lower than the binary 30-days response proportion because cases described only as ‘cleared by 30 days’ contribute to the binary endpoint but are not assigned an event time and are treated as censored in time-to-event analyses.
RMST up to 30 days was 23.41 days (95% CI 22.19–24.63) (Figure 4(a)–(d)). Across subgroup curves, statistically significant differences in time to response were observed by age group (pooled log-rank p ≤ 0.001), sex (p ≤ 0.001), pathoanatomic site (p ≤ 0.001), anatomical region (p ≤ 0.001), and infection phase (p ≤ 0.001). Cumulative response increased rapidly early after treatment initiation and then plateaued, with progressively fewer patients remaining at risk later in follow-up. Kaplan–Meier curves for time to treatment response stratified by: (A) age; (B) sex; (C) pathoanatomic site; (D) implant involvement; (E) anatomical region; (F) infection phase; (G) microbial class; (H) organism count. Shaded bands, 95% CI; Curves are truncated at 365 days for display (max follow-up 1826 days); the red dotted line marks the 30-day cutoff. Tick marks indicate censoring; numbers at risk are shown.
Regression analysis
In the one-stage mixed-effects multivariable logistic model (random intercept for study), implant involvement remained independently associated with lower odds of 30-days early response (aOR 0.18, 95% CI 0.04–0.79; p = 0.023). Other candidate predictors (age group, sex, infection type contrasts, and polymicrobial infection) were imprecise and were not statistically significant after adjustment. (Table 3).
RMST for early-response trajectory (τ = 30 days)
Cluster-bootstrap adjusted restricted mean time to treatment response differences (ΔRMST, τ = 30 days). Positive values indicate delay to clearance.
CI = confidence interval; FRI = Fracture-Related Infection; IAI = Implant-Associated Infection; PJI = Periprosthetic Joint Infection; MRSA = Methicillin-resistant Staphylococcus aureus; MSSA = Methicillin-susceptible Staphylococcus aureus; ESBL = Extended-spectrum beta-lactamase; CoNS = Coagulase-negative Staphylococci.
aOnly clinically relevant comorbidities are shown.
Secondary outcome—Durable infection control & infection-free days
Durable infection control (windows) and infection-free days (Turnbull RMST).
CI = confidence interval; FU = follow-up; RMST = restricted mean survival time.
Evaluable n are cases with sufficient follow-up to determine status at τ; unknown n lack sufficient follow-up, so best/worst success bounds treat unknowns as all success/all failure. Turnbull interval-censored RMST estimates mean infection-free days up to τ (days and % of τ) with clustered bootstrap 95% CIs.
Safety
Adverse events were infrequent, occurring in 13/175 patients (7.43%): four major (2.29%) and nine minor (5.14%). Where reported, renal adverse events were generally reversible with dose adjustment or cessation, and no ototoxic events were described. Major complications (Clavien–Dindo ≥ III) were uncommon. Reporting was heterogeneous across studies; detailed counts and denominators are provided in Table S6 (Supplement).
Discussion
What these analyses tell us—and do not tell us
These results synthesise observational reports and describe early trajectories and durability patterns within the limits of the available data; they are hypothesis-generating rather than causal treatment effects. Although we extracted perfusion parameters where reported, they were inconsistently documented and strongly confounded by indication/anatomy and local program design; therefore, modelling them as dose–response predictors risked spurious inference and we summarised them descriptively.
Summary of results
In this IPD synthesis of 256 CLAP-treated patients (n = 164 evaluable for the 30-days endpoint), 97 (59.1%) achieved early response within 30 days. In the one-stage mixed-effects logistic model (random intercept for study), retained implant involvement was associated with lower odds of 30-days response (aOR 0.18, 95% CI 0.04–0.79). Time-scale summaries using RMST to 30 days were directionally consistent, showing slower trajectories with retained implants (ΔRMST +4.43 days) and polymicrobial ≥3 (ΔRMST +6.74 days), and faster trajectories for osteomyelitis versus FRI (ΔRMST −9.06 days). Durability among evaluable patients remained high at 6 and 12 months: at 180 days, success was 88.4% (129/146) with worst-case bounds 82.2%, and at 365 days, success was 90.2% (101/112) with worst-case bounds 83.5%. Infection-free-day RMST indicated substantial time spent infection-free within the first year (170.5 days to 180 days; 333.9 days to 365 days). Early CRP response does not guarantee eradication, but these durability summaries suggest that for many patients it marks a favourable early trajectory rather than a definitive cure. Major adverse events were uncommon; renal events were uncommon and generally reversible where reported. Overall, CLAP appears may be most useful in contexts where anatomy and microbiology are favourable and less so when hardware is retained, organism burden is high, or difficult organisms are present.3,4,7,8,10–13,40
Clinical and research implications
For patients with retained implants or polymicrobial infection, anticipate a slower early response and plan for ∼4–7 additional days without response (RMST) within the 30-days window. Early escalation of monitoring is appropriate, including renal function and drug levels when aminoglycosides are used, and source control should be pursued without jeopardizing essential hardware. Where anatomy and microbiology are favourable—for example shoulder infection with C. acnes—CLAP with a disciplined ingress–egress circuit may accelerate early improvement and support discharge planning when paired with effective debridement and systemic antibiotics.41–43 Programs adopting CLAP should standardize catheter maps by infection pattern, provide drug-preparation guides with concentration ranges and flow or aspiration targets, use securement and monitoring checklists, and establish laboratory schedules, especially for older adults or longer courses. Embedding these elements in order sets and maintaining a minimal prospective dataset that tracks time to early response, recurrence, complications, unplanned returns to theatre, and simple function flags will support continuous quality improvement and value analyses. Prospective registries or trials should standardize CLAP protocols, collect pharmacokinetics, and power organism-level contrasts, with pre-specified RMST endpoints to accommodate non-proportional hazards and harmonized CRP-based definitions to improve comparability.15–17,19,36–39,44
Implementation heterogeneity and a common operating profile
Across included studies, CLAP implementation was variably reported but clustered around a recognizable operating profile: configuration was reported in 92.6% of studies, perfusate choice in 93.8%, with gentamicin predominating (82.7%). Where available, regimens commonly reported a concentration of ∼1.20 mg/mL infused at ∼2.0 mL/h for ∼14 days, and most studies reported concurrent systemic antibiotics (80.2%) with a median reported duration of ∼28 days. Although the cohort remains clinically heterogeneous across indications, anatomy, and institutional contexts, we observed a recognizable convergence in CLAP implementation across reports. This relative consistency of the intervention exposure improves the interpretability of pooled descriptive estimates by reducing heterogeneity attributable to protocol variation. These summaries help contextualize external validity and stewardship, but protocol fields were not consistently extractable and were not modelled as dose–response predictors because reporting was inconsistent and confounded by indication and program design, so implementation–outcome relationships remain uncertain and are best addressed in prospective registries/trials.
Biology and burden of complexity
The penalties associated with retained implants and polymicrobial communities are biologically plausible. Foreign material supports mature biofilm; mixed consortia produce quorum-mediated protection and metabolic layering that raise eradication thresholds above planktonic MICs.3,4,10,40 CLAP is designed to confront these barriers by sustaining high local concentration–time exposure at the nidus and evacuating inhibitory exudate through controlled egress (typically via NPWT).7,8,11–13 Debridement remains foundational; CLAP appears to preserve that surgical gain by bathing poorly perfused interfaces where systemic therapy struggles to deliver adequate exposure. The ∼4–9-days RMST differences we observed are modest in absolute terms yet clinically meaningful in the early window when escalation, re-operation, or discharge decisions are made.
Infection type and anatomy
Osteomyelitis improved comparatively quickly, plausibly because intramedullary or perilesional lines access canal and dead-space surfaces for sustained exposure.7,8 Given osteomyelitis can harbour indolent infection, a 30-days response should be interpreted cautiously; however, our 6- and 12-months follow-up suggests that many early responders did achieve sustained control. Periprosthetic joint infection was more refractory, echoing multicentre cohorts that underscore the difficulty of eradicating biofilm on prosthetic surfaces and the importance of decisive source control. 21 Recent chronic knee PJI case series also report feasibility of CLAP during DAIR. 45 Early successes in the shoulder and upper limb may reflect organism spectrum (including Cutibacterium acnes), thinner soft-tissue envelopes, and the feasibility of intra-articular perfusion with active aspiration.41–43,46 In contrast, large lower-limb constructs and cement mantles create diffusion barriers and potential micro-reservoirs that slow early response.
Safety, stewardship, and monitoring
Adverse events were infrequent, but nephrotoxicity warrants vigilance. An independent cohort linked early renal decline to older age and longer CLAP duration, supporting conservative course lengths, good hydration, avoidance of nephrotoxins, and routine creatinine monitoring in older adults. 47 Drug levels can be considered for aminoglycosides when infusions extend beyond about a week or renal reserve is limited, with predefined rules for adjustment or cessation. Routine screening for otologic and vestibular symptoms is prudent. Concerns that local antibiotics drive resistance are not supported by the best available comparative evidence in extremity fracture-related infection, although stewardship still applies: narrow spectrum once cultures finalize, avoid unnecessary dual coverage, and culture any recurrences to surveil for resistance. 48
Positioning against established modalities
Selected contemporary series/trials report two-stage exchange for chronic hip and knee PJI can reach about 95% eradication in selected series, and one-stage programs report approximately 90–95% mid-term control.49–51 On earlier horizons, optimized very-early DAIR within 1 week can achieve about 93% control, and repeated DAIR can salvage 78–83% of selected failures.52,53 For systemic-therapy context, OVIVA showed oral therapy was non-inferior to intravenous therapy with roughly 85–87% 1-year success. 54 In osteomyelitis, PMMA beads after thorough debridement report about 91% cure; evidence for calcium-sulphate beads is mixed, with several matched or comparative analyses showing no consistent improvement over DAIR alone and raising cost or complication considerations, including occasional bead- or cement-related reactions.55–59 Against our 30-days profile, CLAP appears competitive when anatomy and microbiology are favourable (e.g., osteomyelitis and the shoulder) but less attractive when implants are retained or pathogens are multiple. Exchange strategies target longer-horizon eradication that CLAP does not attempt to replace.
Relation to endpoint choice and statistical approach
Focusing on a 30-days early-response endpoint is clinically congruent with early DAIR windows, surveillance timeframes, and quality metrics, and it avoids conflating short-term trajectories with longer-term eradication.19–23 The Kaplan–Meier curves showed frequent crossings and non-proportional hazards; RMST was therefore used to summarize time-scale differences in a way that remains interpretable when hazards are not proportional.38,39 For durability, interval-censored NPMLE with explicit evaluable denominators and best–worst bounds kept inference transparent when status was unknown, and infection-free-day RMST captured the lived patient experience of time spent free of recurrent infection.36,37 These complementary analyses situate CLAP’s role: a technique that can accelerate early improvement without claiming parity with exchange strategies on late eradication.
Strengths
This appears to be the first CLAP IPD synthesis at scale, retrieving and harmonizing patient-level data across 81 studies and applying unified diagnostic frameworks from ICM/MSIS, EBJIS, and the FRI Consensus with standardized outcome timing and definitions.28–34 The primary endpoint was analysed in a one-stage mixed-effects framework that accounted for study-level clustering. We paired a clinically interpretable 30-days endpoint with interval-censored durability and infection-free-day RMST, methods that are robust to non-proportional hazards and intuitive for clinicians.36–39,44 Denominators for durability were explicit, and best–worst bounds were reported rather than imputed. IPD integrity checks, multiple imputation with re-derived binary indicators to avoid leakage, and study-cluster bootstrap addressed missingness and between-study heterogeneity. Our outcome framework and analytic approach provide a practical template for future prospective studies and registries evaluating CLAP, particularly where non-proportional hazards and incomplete data are expected.
Limitations
This single-arm synthesis supports contextual, hypothesis-generating inference rather than causal or comparative conclusions versus systemic therapy, beads/spacers, or exchange strategies. Because randomized evidence for CLAP is scarce, most source reports were small observational case reports/series with heterogeneous protocols and variable reporting, increasing clinical heterogeneity and the risk of information bias. Publication and selective reporting biases are likely (e.g., preferential reporting of notable successes or atypical cases), and key clinical variables were inconsistently captured across studies. Outcome ascertainment was heterogeneous (timing and thresholds of CRP testing; variability in how “sustained clinical/wound improvement” was documented), and follow-up was variable, raising the possibility of informative missingness/censoring that cannot be fully corrected statistically. Residual confounding is unavoidable, including confounding by indication (e.g., debridement quality, surgical timing, host factors, and implant involvement thresholds). Events per variable were modest, so subgroup estimates—particularly organism-specific contrasts (e.g., Enterococcus sp.)—are imprecise and should be interpreted as hypothesis-generating prognostic associations rather than treatment effects. Missingness in observed time to response required multiple imputation; sensitivity analyses preserved directionality but precision remained limited. Implementation parameters (e.g., catheter configuration, drug/concentration, flow or aspiration targets, and NPWT settings) varied across reports, although several core elements showed convergence; these were therefore summarised descriptively rather than modelled as dose–response predictors. Chronic suppressive antibiotic use was rarely reported in the IPD and was handled descriptively; conclusions about suppression-dependent durability are limited. Adverse-event ascertainment was variably reported, so rare or delayed toxicities may be under-captured. Generalisability may vary across settings, particularly where devices and monitoring differ. 60
Conclusion
CLAP can accelerate early response in selected patients when combined with sound debridement and systemic antibiotics. Durability among evaluable patients at six and 12 months was high, and safety events were infrequent and largely reversible, although renal monitoring is essential with aminoglycosides, particularly in older adults and longer courses. 47 Ultimately, because our analysis is based on uncontrolled data, estimates should be interpreted as hypothesis-generating, and definitive guidance will require prospective trials or registries. Our findings can serve as a benchmark and inform the design of those future studies. Future work should standardize protocols, capture pharmacokinetics, and evaluate comparative effectiveness using RMST-friendly endpoints that acknowledge the common absence of proportional hazards in this field.38,39
Supplemental material
Supplementary Appendix - Continuous local antibiotic perfusion for orthopaedic infections: A systematic review and pooled individual participant data analysis of observational reports
Supplementary CLAP Implementation Table - Continuous local antibiotic perfusion for orthopaedic infections: A systematic review and pooled individual participant data analysis of observational reports
Supplemental materials for Continuous local antibiotic perfusion for orthopaedic infections: A systematic review and pooled individual participant data analysis of observational reports by Patrick Ze-En Ng, Norio Yamamoto, Mari Yamamoto, Ke Wei Hiew, Wei Ching Chong, Jia Shen Goh, Akihiro Saitsu, Glenn Xin-Zhang Lee, and Naoya Inagaki in Journal of Orthopaedic Surgery
Footnotes
Acknowledgements
We thank Jun Watanabe for his assistance with the EMBASE (ProQuest) database literature search; Satoshi Yamaguchi, Nobuyasu Ochiai, Yukichi Zenke, Keisuke Shimbo, Hideharu Nakamura, Hyonmin Choe, Kenji Kosugi, Kohei Iwamoto, Shuhei Ohyama, Noriaki Yokogawa, Yoshiaki Miyake, Naoyuki Horie, Noriyuki Hattori and Hirotaka Oishi for unpublished IPD upon request.
ORCID iDs
Ethical considerations
This synthesis used published and author-provided de-identified individual participant data from previously reported clinical studies. No new human or animal subjects were recruited. Institutional review board (IRB) approval and informed consent were obtained in the original studies as reported by their authors; additional ethics approval for this IPD synthesis was not required. The protocol was registered with PROSPERO (CRD42025635194) and the review followed PRISMA-IPD.
Author contributions
Conception and Design: Patrick Ng Ze En, Norio Yamamoto. Database search and Acquisition of articles: Patrick Ng Ze En, Hiew Ke Wei, Cheong Wei Ching, Norio Yamamoto, Mari Yamamoto. Risk of Bias Assessment: Hiew Ke Wei, Cheong Wei Ching, Goh Jia Shen, Glenn Lee Xin Zhang, Mari Yamamoto, Akihiro Saitsu, Naoya Inagaki. Data Extraction: Patrick Ng Ze En, Hiew Ke Wei, Cheong Wei Ching, Goh Jia Shen, Glenn Lee Xin Zhang, Mari Yamamoto, Akihiro Saitsu, Naoya Inagaki. Acquisition of Unpublished Data: Norio Yamamoto. Statistical Analysis: Patrick Ng Ze En. Manuscript Draft: Patrick Ng Ze En. Critically Revising the Article: Patrick Ng Ze En, Norio Yamamoto, Hiew Ke Wei, Cheong Wei Ching, Goh Jia Shen, Glenn Lee Xin Zhang, Mari Yamamoto, Akihiro Saitsu. Clinical Expertise: Norio Yamamoto, Mari Yamamoto, Akihiro Saitsu.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
Datasets were generated and may be available upon request from the corresponding author.
Declaration of generative AI and AI-assisted technologies in the writing process
During the preparation of this work the author(s) used ChatGPT 4o and 5 (Open AI) as a tool for data analysis assistance, grammar correction and formatting compliance. After using this tool/service, the author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the publication.
Supplemental material
Supplemental material for this article is available online.
