Abstract
Study Design:
Systematic review.
Objectives:
To review, critically appraise, and synthesize evidence on the use of allogenic stem cell products for spine fusion compared with other bone graft materials.
Methods:
Systematic searches of PubMed/MEDLINE, through October 31, 2018 and of EMBASE and ClinicalTrials.gov through April 13, 2018 were conducted for literature comparing allogenic stem cell sources for fusion in the lumbar or cervical spine with other fusion methods. In the absence of comparative studies, case series of ≥10 patients were considered.
Results:
From 382 potentially relevant citations identified, 6 publications on lumbar fusion and 5 on cervical fusion met the inclusion criteria. For lumbar arthrodesis, mean Oswestry Disability Index (ODI), visual analogue scale (VAS) pain score, and fusion rates were similar for anterior lumbar interbody fusion (ALIF) using allogenic multipotent adult progenitor cells (Map3) versus recombinant human bone morphogenetic protein–2 (rhBMP-2) in the one comparative lumbar study (90% vs 92%). Across case series of allogenic stem cell products, function and pain were improved relative to baseline and fusion occurred in ≥90% of patients at ≥12 months. For cervical arthrodesis across case series, stem cell products improved function and pain compared with baseline at various time frames. In a retrospective cohort study fusion rates were not statistically different for Osteocel compared with Vertigraft allograft (88% vs 95%). Fusion rates varied across time frames and intervention products in case series.
Conclusions:
The overall quality (strength) of evidence of effectiveness and safety of allogenic stem cells products for lumbar and cervical arthrodesis was very low, meaning that we have very little confidence that the effects seen are reflective of the true effects.
Introduction
Spine fusions is an accepted and commonly performed procedure for treatment of various spinal conditions, including degenerative disc disease, degenerative spinal deformity, traumatic spine fractures, and spinal tumors. Achieving solid bony arthrodesis is one of the primary goals of spine fusion surgery. Without it, patients can have poor postsurgical outcomes, develop painful pseudoarthrosis, and may require revision surgeries. 1 -4 Nevertheless, achieving solid arthrodesis in complex spine surgeries can be a daunting task as pseudoarthrosis rate for adult spinal deformity surgery can range from 0% to 25% 2,3,5 -8 and pseudoarthrosis for spine tumors can range from 0% to 43%. 9 -12
The use of autologous iliac crest bone graft (ICBG) has been considered the “gold standard” for spine fusion. 13 -15 ICBG is believed to possess the 3 essential properties of osteogenesis, osteoinductivity, and osteoconductivity to optimize spine arthrodesis. However, ICBG may have limited quantity and quality dependent on the age and biology of the patients. 14,16 -18 Moreover, harvest of ICBG is also associated with increased rate of morbidities such as chronic pain at the harvest site, infection, and hematoma. 14,15,19 -21 As a result, surgeons and researchers are constantly seeking for the best substitute for ICBG that can help achieve bony arthrodesis in complex spine fusions.
Over the past couple of decades, commonly used alternative bone graft substitutes includes allograft, demineralized bone matrix, ceramics, synthetic bone substitutes, and bone morphogenetic proteins. In recent years, there has been a rising interest in the use of stem cells products as bone graft substitute for spine fusions and other orthopedic procedures. 22 -28 Stem cells or progenitor cells are undifferentiated cells that are uncommitted in their cellular differentiation that also possess the property of self-renewal, and they can be driven toward specific tissue or cellular differentiation with growth factors and mitogens. 16,17,29 -32 For spine procedures, mesenchymal stem cells and osteogenic progenitor cells may be differentiated into osteoblasts that can help achieve fusion by providing cells that participate in bone formation and may also produce osteoinductive molecules. 16,17,29 -32 They can be isolated from autologous and allogeneic sources then expanded to provide a constant and viable source of bone graft substitute alternative to ICBG. Unlike other bone graft substitutes available in the market today, mesenchymal stem cells and progenitor cells are believed to possess the osteogenic and osteoinductive properties of ICBG. 17,32 In addition, when mix with other bone graft extenders as carriers or fillers, they can add osteoconductivity to the final stem cell products. 17,32 While the potential of mesenchymal stem cells and osteogenic progenitor cells to provide the clinical results in spine fusion as ICBG is quite appealing, we need supportive data to confirm that assertion. The aim of this study was to systematically review, critically appraise, and synthesize evidence on use of stem cells from allogenic bone marrow aspirate, adipose, or any other allogenic sources for spine fusion compared with other graft materials.
Key Questions
With regard to use of allogenic stem cells for arthrodesis (fusion) in adult patients with degenerative conditions of the lumbar or cervical spine: Key Question 1: Is use of allogenic stem cells for fusion as effective as fusion with standard autograft or other graft materials in the thoracolumbar or cervical spine? Key Question 2: What complications are associated with allogenic stem cell use in fusion? Is use of stem cells safer than fusion with standard allograft or autograft in the thoracolumbar or cervical spine? Key Question 3: Is there evidence that patient factors (eg, age, smoking, comorbid conditions, revision status, presence of deformity), number of levels treated, cell type, or preparation modify the association between allogenic stem cell–based fusion and the primary outcomes? Key Question 4: Is allogenic stem cell use for fusion cost-effective compared with other graft materials?
Figure 1 provides an overview of the patients, interventions and outcomes considered for these questions.

Analytic framework.
Materials and Methods
The methods for this systematic review followed accepted standards for systematic review/comparative effectiveness reviews for rigor, quality, and transparency, including those described by the Agency for Healthcare Research and Quality (AHRQ), Institute of Medicine (IOM) Standards for Systematic Reviews, and the Patient-Centered Outcomes Research Institute (PCORI) Methods Guide. 33 -35
Electronic Literature Search
Systematic searches of PubMed/MEDLINE, through October 31, 2018 and EMBASE and ClinicalTrials.gov through April 13, 2018 were conducted. Only studies with abstracts in humans, written in English were considered for inclusion, with no other limits were placed on the search. A priori inclusion and exclusion criteria are detailed in Appendix A (available in the online supplement of the journal). Briefly we sought to identify comparative studies of allogenic stem cells use versus more commonly used methods of fusion (eg, autograft) in persons with degenerative disease of the cervical or thoracolumbar spine. In the absence of comparative studies, case series were considered. The search strategy included use of controlled vocabulary (MeSH terms) as well as keywords. (Appendix A). Bibliographies of included studies and relevant systematic reviews were reviewed to identify pertinent studies. Citations were dual reviewed for inclusion at both title/abstract and full-text stages. ClincalTrials.gov was searched to identify studies which may have new publications. (Appendix G).
Data Extraction
In addition to results for primary and secondary outcomes, data abstraction included patient characteristics, demographics, lifestyle factors (eg, smoking), comorbidities (eg, obesity), cointerventions (eg, pharmaceutical, physical therapy), intervention, and comparator details (eg, spinal levels treated, cell preparation and concentration, delivery) if reported.
Study Quality
Each included study was independently assessed for risk of bias and methodological quality by 2 reviewers (ACS, AF) using pre-set criteria based on methods delineated in the Cochrane Handbook for Systematic Reviews of Interventions, The Journal of Bone and Joint Surgery, and the Agency for Healthcare Research and Quality with adaptations focusing on criteria associated with methodological quality (Appendix C). 33,36,37 Economic studies were evaluated according to The Quality of Health Economic Studies (QHES) instrument developed by Ofman et al. 38 Where feasible, the focus was on comparative studies with the least potential for bias and the fewest limitations. Risks of bias assessments are detailed in Appendix C.
Data Analysis
Statistical testing was generally not performed for observational studies except for complications. Risk ratios were calculated for dichotomous outcomes for complications from comparative studies if differences between groups were or approached statistical significance using the Rothman EpiSheet; for continuous outcomes, mean differences were calculated and 2-sample t tests if appropriate using GraphPad. 39,40 Study design, heterogeneity across studies and variation in reporting precluded the pooling of data.
Overall Strength of Body of Evidence
For function, pain, fusion and adverse events, the overall strength of evidence across included studies was assessed using the Grading of Recommendations Assessment, Development and Evaluation (GRADE) working group tenets and recommendations made by the AHRQ and is further described in Appendix D.
33,41
-43
The overall quality of evidence was based on studies at least risk for bias. In determining the quality (strength) of a body of evidence regarding a given outcome, the overall quality may be downgraded 1 or 2 levels based on the following domains: (1) risk of bias due to study limitations, (2) consistency (heterogeneity) of results, (3) directness of evidence (eg, hard clinical outcomes), (4) precision of effect size estimates (eg, width of confidence intervals), and (5) publication or reporting bias. Publication and reporting bias are difficult to assess, particularly with fewer than 10 randomized controlled trials.
33
Publication bias was unknown in all studies and thus this domain was eliminated from the strength of evidence tables. The initial quality of the overall body of evidence begins as high for randomized controlled trials and low for observational studies. The body of evidence for methodologically strong observational studies may be upgraded 1 or 2 levels if there are no downgrades in the primary domains above and one or more of the following are met: (1) large magnitude of effect, (2) dose-response gradient, and (3) all plausible biases would decrease the magnitude of an apparent effect. The final overall quality (strength) of the body of literature expresses the confidence in the estimate of effect and the impact that further research may have on the results as follows: High: High confidence that the evidence reflects the true effect. Further research is very unlikely to change our confidence in the estimate of effect. Moderate: Moderate confidence that the evidence reflects the true effect. Further research may change our confidence in the estimate of effect and may change the estimate. Low: Low confidence that the evidence reflects the true effect. Further research is likely to change the confidence in the estimate of effect and likely to change the estimate. Very low: Very little confidence that the evidence reflects the true effect; the true effect is likely to be substantially different than the estimated effect. In addition, this rating may be used if there is no evidence or it is not possible to estimate an effect.
Results
Study Selection and Risk of Bias
From 382 potentially relevant citations identified, eleven studies (from all sources) met the inclusion criteria (Figure 2). Studies excluded at full-text review are listed in Appendix B. Nine ongoing clinical trials were identified (Appendix G). The 2 retrospective cohort studies, one on lumbar fusion 44 and one on cervical fusion 45 were identified and were considered at moderately high risk of bias. Methodological concerns included selection bias, lack of blinded outcomes assessment and failure to control for confounding. The 9 case series were considered at high risk of bias (Appendix C). Across studies, there was substantial heterogeneity with regard to stem cell products and preparations used. This, combined with poor study quality, precluded pooled analysis. No studies formally evaluated differential efficacy or safety. No full economic studies were identified.

Flowchart showing results of literature search.
Key Question 1: Effectiveness of Allogenic Cells for Arthrodesis
Lumbar Arthrodesis
Six studies of lumbar fusion met the inclusion criteria including one retrospective cohort comparing allogenic multipotent adult progenitor cells (Map3) versus recombinant human bone morphogenetic protein–2 (rhBMP-2), 44 3 case series using Osteocel Plus (mesenchymal stem cells with demineralized bone matrix) alone 46,47 or in combination with local autograft, 48 one case series of using Trinity Evolution cellular bone allograft combined with local autograft, 49 and one case series of VIA Graft cellular bone matrix. 50 Across studies, mean ages were 50 to 64 years and 37% to 53% of participants were male. Diabetes was a common comorbidity (∼20% of participants). Smoking frequency ranged from 9% to 42%. Surgical approaches varied across studies. Patient and intervention characteristics are summarized in Table 1. Details of patient populations and procedures are provided in Appendix F.
Summary of Patient Demographics and Intervention Characteristics for Studies Evaluating the Use of Allogenic Stem Cells for Lumbar Fusion
Abbreviations: β-TCP, β-tricalcium phosphate; ALIF, anterior lumbar interbody fusion; BMA, bone marrow aspirate; DDD, degenerative disc disease; HNP, herniated nucleus pulposus; Map3, cellular allogenic bone graft, which includes cortical cancellous bone chips, demineralized bone matrix, and cryogenically preserved, viable multipotent adult progenitor cells; MITLIF, minimally invasive transforaminal lumbar interbody fusion; NR, not reported; rhBMP-2, recombinant human bone morphogenetic protein–2; RoB, risk of bias; PEEK, polyetheretherketone; PLF, posterolateral fusion; TLIF, transforaminal lumbar interbody fusion; XLIF, extreme lateral interbody fusion.
The small retrospective cohort (N = 41) comparing anterior lumbar interbody fusion (ALIF) using Map3 with ALIF using rhBMP-2 44 found no difference between them for function based on Oswestry Disability Index (ODI) or in visual analogue scale (VAS) pain at 12 months. (Table 2) Across levels, both treatments yielded similar fusion rates (90% vs 92%). While some differences in fusion rate were noted based on the number of levels fused, there was no distinct pattern favoring one treatment over the other; sample size was however small.
Summary of Nonrandomized, Retrospective Comparative Data (Lee, 2017): Allogenic Multipotent Adult Progenitor Cells (Map3) for Lumbar Fusion
Abbreviations: ALIF, anterior lumbar interbody fusion; CI, confidence interval; Map3 = cellular allogenic bone graft which includes cortical cancellous bone chips, demineralized bone matrix, and cryogenically preserved, viable multipotent adult progenitor cells; N/A, not applicable; NR, not reported; ODI, Oswestry Disability Index; rhBMP-2, recombinant human bone morphogenetic protein–2; RR, risk ratio; SD, standard deviation; VAS, visual analogue scale.
aAs reported by authors for comparison between groups.
bFusion was defined as evidence of bridging across endplates or bridging from endplates to interspace disc plugs.
cRR calculated by Aggregate Analytics, Inc. (AAI) using the Rothman Episheet.
dPatients could have more than 1 event.
Only one small series of XLIF using Osteocel Plus (N = 35) provided data on both function and pain, reporting improvement in both at 12 months relative to baseline. 47 Similarly, the case series of posterolateral fusion (PLF) using Trinity Evolution with local autograft (N = 43) reported significant improvement in both VAS back pain and leg pain at 12 months compared with baseline 49 (Table 3).
Summary of Case Series Outcomes: Allogenic Cells for Lumbar Fusion.
Abbreviations: ALIF, anterior lumbar interbody fusion; F/U, follow-up; ICBG, iliac crest bone graft; MITLIF, minimally invasive transforaminal lumbar interbody fusion; N/A, not applicable; NR, not reported; ODI, Oswestry Disability Index; PLF, posterolateral fusion; SD, standard deviation; SF-36, Short Form–36 questionnaire; VAS, visual analogue scale; XLIF, extreme lateral interbody fusion.
aAs reported by authors.
bODI is rated on a scale of 0 to 100 with higher scores indicating worse disability. VAS was rated on a scale of 0 to 10 with higher scores indicated worse pain.
One small series of Osteoplus 47 (N = 35) found 56% improvement from baseline in total Short Form–36 (SF-36) scores at 12 months and reported that the majority of patients were at least somewhat satisfied with their outcomes (Appendix E).
Across 5 case series of various allogenic cell products and fusion approaches (N = 158), fusion occurred in ≥90% of patients and levels by 12 to 14 months 46 -50 (Table 3).
Cervical Arthrodesis
Five studies of cervical fusion in patients with degenerative disease met the inclusion criteria, including 1 retrospective cohort comparing Osteocel with allograft and 3 case series. The case series include 1 case series using Osteocel Plus, 51 2 case series using Trinity Evolution with local autograft, 52,53 and 1 using Vivigen allograft. 54 Anterior cervical discectomy and fusion (ACDF) was performed in 3 of the 4 series; a variety of procedures was used in the fourth case series. Across studies, ages ranged from 47 to 64 years and proportion of males ranged from 28% to 56%. Reported tobacco use ranged from 16% to 37% with 1 series reporting 45% of patients were current or former smokers (Table 4).
Summary of Patient Demographics and Intervention Characteristics for Comparative Studies and Case Series Evaluating the Use of Allogenic Stem Cells for Cervical Fusion.
Abbreviations: ACCF, anterior cervical corpectomy and fusion; ACDF, anterior cervical discectomy and fusion; CSM, cervical spondylotic myelopathy; HNP, HNP, herniated nucleus pulposus; NR, not reported; PCF, posterior cervical fusion; PEEK, polyetheretherketone; RoB, risk of bias; OPLL, ossification of posterior longitudinal ligament.
Across the case series, functional improvement based on neck disability index (NDI) and improvement in VAS pain relative to baseline were reported at multiple time frames. Two series (N = 69) using Trinity Evolution reported significant improvement in NDI and VAS neck pain at 6 and 12 months. 52,53
A small series (N = 21) using Vivigen reported an 85.9% decreased in NDI from baseline and 81.4% decrease in VAS pain at 12 months. 54 The largest series, using Osteocel (N = 135 patients with 24-month follow-up) reported significant improvement at 24 months 51 for both NDI and VAS neck pain and arm pain. However, the authors did not provide data for verification (Table 5). In addition, SF-12/36 physical component summary (PCS) scores improved relative to baseline at 6 and 12 months in one series using Trinity Evolution 52 and at 24 months in one series of Osteocel Plus. 47 (Appendix E).
In the retrospective cohort study (N = 114), overall fusion rates were somewhat lower in the Osteocel group compared with the Verigraft allograft controls (88% vs 95%), but statistical significance was not achieved; a similar pattern was seen for those receiving intervention at one level (86.2% vs 96.6%) only. Fusion between the treatment groups was similar for 2-level procedures (Table 6). Authors do not report measures of function or pain.
Across case series, criteria and definitions of fusion varied and fusion frequency varied across time frames and intervention products. At 6 months, they varied with rates in 2 series of Trinity Evolution 52,53 of 66% and 79%, respectively, and of 100% in the Vivigen series 54 ; by 12 months, fusion was seen in 89% and 94% of Trinity Evolution recipients. At 24 months, the series of Osteocel reported fusion frequency of 87% 51 (Table 5).
Summary of Function, Pain, and Fusion Outcomes From Case Series: Allogenic Cells for Cervical Fusion.
Abbreviations: ACDF, anterior cervical discectomy and fusion; F/U, follow-up; MSCs, mesenchymal stem cells; N/A, not applicable; NDI, neck disability index; NR, not reported; PCF, posterior cervical fusion; RoB = risk of bias; SD, standard deviation; VAS, visual analogue scale.
aAs reported by authors.
bNDI and VAS scores are estimated from graphs for Peppers 2017 and Vanichkachorn 2016.
cNDI and VAS were reported on a 0-100 scale with higher scores indicating worse disability and pain.
dAuthors do not clearly report on numbers of subjects at baseline and 24 months; n’s of 182 and 135 respectively are assumed for this report.
eAuthors reported 24-month fusion results in 2 different ways. We reported “24-month minimum results” that included patients who fused at 12 months and did not receive radiographic follow-up at 24 months. Results for those received 24-month radiographic follow-up only were 82% (138/169).
Key Question 2: Safety, Adverse Events, Harms
Adverse events and harms were poorly reported across studies of both lumbar and cervical arthrodesis and sample sizes were likely inadequate to detect rare events; details are provided in Appendices E and F.
Lumbar Arthrodesis
The small retrospective cohort (N = 41) reported that complications overall were less common following ALIF using Map3 (30%) compared with rhBMP-2 44 (57%), but statistical significance was not reached due in part to the small number of patients (Table 2). Postoperative radiculitis in particular was less common in Map3 versus rhBMP-2 recipients (10% vs 38%). No differences in nonunion, posterior infection or epidural hematoma were seen. Across the 4 case series (Appendices E and F), nonunion was reported in 8% to 9% of patients. 46 -49 Reporting of other complications was sparse. No intraoperative complications were seen in three studies 47,49,50 and none related to stem cell products were seen in 2 series 48,49 ; however, small samples sizes limit evaluation of complications particularly for rare events. Additional surgical procedures were done in 7% of patients in one series 49 and may not be related to stem cell product use. Pseudarthrosis leading to revision occurred in 4% (3/75) in one series. 50 Surgical intervention at an adjacent level (1 patient) was reported in another small series (N = 40). 47
Cervical Arthrodesis
The comparative retrospective cohort study reported more nonunion for Map3 recipients compared with allograft recipients (12.3% vs 5.3%) at 12 months, but results did not reach statistical significance. 45 Among patients with failed fusion 4/7 in the Osteocel group versus 3/3 in the allograft group required revision. No treatment-related adverse events were reported, but small sample size may preclude identification of rare events (Table 6). Across case series, rates of nonunion varied substantially (Appendix E). At 6 months, the 2 series of Trinity Evolution reported nonunion rates of 34.4% 52 and 21.4%, 53 with lower rates by 12 months (10.6% and 6.5%). While the small series of Vivigen reported no nonunions, 54 the series of Osteocel reported 18% at 24 months with 13% at >24 months. 51 No revision surgeries were reported across 3 series. 51 -53
Summary Retrospective Comparative Study (McAnany, 2016): Allogenic Cells for Cervical Fusion.
Abbreviations: CI, confidence interval; IC, incalculable; RoB, risk of bias; RR, risk ratio.
aRisk ratios calculated by Aggregate Analytics, Inc. (AAI). P values are reported by authors.
bThe other 3 failed fusion patients in the Osteocel group were asymptomatic and followed clinically but did not receive revision surgery.
Key Questions 3 (Modification of Treatment Effect) and 4 (Economic Studies)
Included studies were not designed to evaluation modification of treatment effect by demographic or other factors. Subgroup analysis in one small lumbar arthrodesis case series found no difference in back pain, leg pain, or fusion frequency between those <65 years old and those ≥65 years old 49 ; however, firm conclusions are not possible given the potential for high risk of bias in this study and lack of comparison with another treatment (Appendices E and F).
No full economic studies comparing allogenic stem cells products with other methods of fusion were identified.
Evidence Summary, Overall Quality (Strength) of Evidence
For both lumbar and cervical arthrodesis, the overall quality of evidence is very low with regard to the benefits and safety of allogenic cell sources for fusion, meaning we have very little confidence that the effects seen reflect the true effects (Tables 7 and 8). Case series comprise the bulk of the evidence. Only 2 cohort studies, one evaluating lumbar arthrodesis and the other cervical arthrodesis were identified. High risk of bias, lack of precision due to small sample sizes, and inability to evaluate consistency across studies resulted in grading the overall evidence as very low.
Evidence Summary: Overall Quality (Strength) of Evidence for Effectiveness and Safety Allogenic Cells for Lumbar Fusion.a
Abbreviations: ALIF, anterior lumbar interbody fusion; NR, not reported; ODI, Oswestry Disability Index (1-100 scale, higher score, worse disability); rhBMP-2, recombinant human bone morphogenetic protein–2; VAS, visual analogue scale (0-10 scale, higher is worse).
aReasons for downgrading quality of evidence (general):
*Serious risk of bias: the majority of studies did not meet one or more criteria of a good-quality randomized controlled trial (see Appendix for details).
†Serious risk of bias: the majority of studies did not meet 2 or more criteria of a good-quality cohort or are case series (see Appendix for details).
‡Small sample size/insufficient power and/or significant variation in estimates (eg, wide confidence intervals [CIs], large standard deviations). Imprecise effect estimate for an outcome: small sample size and/or confidence interval includes both negligible effect and appreciable benefit or harm with the intervention. If sample size is likely too small to detect rare outcomes, evidence may be downgraded twice. If the estimate is statistically significant, it is imprecise if the CI ranges from “mild” to “substantial.” If the estimate is not statistically significant, it is imprecise if the CI crosses the threshold for “mild/small” effects. Wide (or unknown) CI and/or small sample size may result in downgrade.
§Unknown consistency; single study or different measures used across studies. Inconsistency: differing estimates of effects across trials. If point estimates across trials are in the same direction, do not vary substantially, or heterogeneity can be explained, results may not be downgraded for inconsistency.
**Indirect, intermediate, or surrogate outcomes may be downgraded.
††Reporting bias suspected: limited assessment or limited reporting of specific outcomes.
b Fusion was defined as evidence of bridging across endplates, or bridging from endplates to interspace disc plugs (Lee). For fusion for Tohmeh 2012: 97.5% (39/40) were assessed with fluoroscopy-guided, level-by-level, radiography (FGX) and 2.5% (1/40) with computed tomography.
Evidence Summary: Overall Quality (Strength) of Evidence for Effectiveness and Safety of Allogenic Cells for Cervical Fusion.a
NDI = neck disability index (1-100 scale, higher score, worse disability); VAS, visual analogue scale (0-10 scale, higher is worse).
aReasons for downgrading quality of evidence (general):
*Serious risk of bias: the majority of studies did not meet one or more criteria of a good-quality randomized controlled trial (see Appendix for details).
†Serious risk of bias: the majority of studies did not meet 2 or more criteria of a good-quality cohort or are case series (see Appendix for details).
‡Small sample size/insufficient power and/or significant variation in estimates (eg, wide confidence intervals [CIs], large standard deviations). Imprecise effect estimate for an outcome: small sample size and/or confidence interval includes both negligible effect and appreciable benefit or harm with the intervention. If sample size is likely too small to detect rare outcomes, evidence may be downgraded twice. If the estimate is statistically significant, it is imprecise if the CI ranges from “mild” to “substantial.” If the estimate is not statistically significant, it is imprecise if the CI crosses the threshold for “mild/small” effects. Wide (or unknown) confidence interval and/or small sample size may result in downgrade.
§Unknown consistency; single study or different measures used across studies. Inconsistency: differing estimates of effects across trials; if point estimates across trials are in the same direction, do not vary substantially or heterogeneity can be explained, results may not be downgraded for inconsistency.
**Indirect, intermediate or surrogate outcomes may be downgraded.
††Reporting bias suspected: limited assessment or limited reporting of specific outcomes.
Discussion
Achieving solid arthrodesis is an important goal of spine fusion surgery when there are structural instability or compromise from degenerative spine disease, spinal deformity, spine trauma, infections, or tumors. However, it is also one of the major challenges faced by spine surgeons in complex spine surgery. Pseudoarthrosis can lead to poor clinical outcomes, instrumentation failure, or revision surgery. 1 -8 As a result, surgeons and spine researchers are in constant search for the best biological solution or material to help achieve solid arthrodesis in spine fusion cases. While ICBG has been long accepted as the “gold standard” graft material for spine fusion as it possesses osteogenic, osteoinductive, and osteoconductive properties that are critical for spine fusions, 13 -18 the use of ICBG is limited by concerns of harvest morbidity, cost, and lack of efficacy in elderly or osteoporotic patients. 14,15,19 -21 A variety of bone graft substitutes are available for utilization in spine surgery by spine surgeons today with variable fusion results. 13,15,32 Mesenchymal stem cells and osteogenic progenitor cells are one of the many bone graft substitutes available in the market. 17,29,30,32 Unlike other bone graft extenders that mainly possesses osteoinductive or osteoconductive properties, stem cell products are touted to contain the osteogenic property that is critical for the ideal bone graft material but elusive to all bone graft substitutes except for ICBG. 17,29,30,32 With that claim, it is not surprising that many are enthusiastic about the use of stem cell products in spine fusion surgery to achieve arthrodesis.
Mesenchymal stem cells and osteogenic progenitor cells can be derived from patient’s own bone marrows or adipose tissues. However, the quantity and quality of the autogenic stem cells may be limited by the patients’ age and biology similar to ICBG. 15,16,18 On the other hand, allogenic stem cells can be derived from a donor and expanded in cultures then optimized for osteogenic differentiations in a controlled process. 17,29,30,32 This process may circumvent the concerns about quantity and quality of the cells that can implanted in spine fusion surgeries. A number of animal studies have examined the efficacy of stem cells in spine fusions, and they showed variable fusion rates in those studies. However, the preclinical animal studies indicate that stem cells can achieve fusion rate similar to autograft controls. While animal studies indicate efficacy with stem cells in spine fusion to achieve fusion similar to autograft, it’s unclear if similar outcomes and effectiveness can be seen in the clinical setting.
In this systematic review, we examined the currently available literature reports and studies to determine if allogenic stem cells indeed can achieve similar fusion results and clinical outcomes compared to autograft and other widely studied bone graft products available in the market today. In addition, we researched the data on complications and cost consideration associated with allogenic stem cells in spine surgery. Based on our systematic review, we identified only 11 studies on allogenic stem cells in spine fusions that met our study criteria. The majority of the studies were retrospective case series and only two retrospective cohort studies were identified. Overall, the evidence for the efficacy and safety of allogenic cell sources for lumbar and cervical arthrodesis in the clinical setting was very low, primarily due to the high risk of bias and lack of precision of included studies, most of which were case series. No full economic studies were identified and no evidence on the impact of patient or intervention characteristics on effectiveness or safety was available.
In the lumbar spine, findings from the only retrospective comparative study identified suggests that allogenic Map3 versus rhBMP-2 44 are similar with regard to improvement in function and pain; however, complications were more common with rhBMB-2. While case series report improvement in function and pain compared with baseline and fusion occurred in ≥90% of patients across case series of allogenic cell products, comparative studies are needed to truly evaluate the effectiveness.
In the cervical spine, one retrospective cohort study of patients with clinical (radiculopathy or myelopathy) and radiographic evidence of degenerative cervical spine disease comparing Osteocel with cadaveric allograft (Vertigraft) 45 reported somewhat lower fusion rates in the Osteocel group compared with the allograft controls (88% vs 95%), but statistical significance was not achieved and no data on function or pain were reported. Across case series, allogenic stem cell products appeared to be associated with improved pain and function, however in the absence of methodologically sound comparative studies, conclusions regarding effectiveness or safety are problematic.
Our study suggest that clinicians and spine surgeons should maintain our enthusiasm for allogenic stem cells in spine fusion with caution. We currently lack high-level clinical evidence that is derived from well-designed randomized controlled or prospective cohort studies on the use of allogenic stem cells in spine fusion surgery. With the global spine biologics market was valued at $1.9 billion in 2014 and projected to reach $2.4 billion by 2020, the cost burden from spine biologic product should be a major concern in the spine community. Currently, allogenic stem cell products are heavily marketed for their osteogenic potential and perceived benefits of achieving fusion, and surgeons are increasing their utilization of allogenic stem cell products. As surgeons increase our usage of these allogenic stem cell products for spine fusions, we need to insist on obtaining the supporting data for its outcomes, efficacy, and safety from quality and well-designed clinical study.
Conclusions
The overall quality (strength) of evidence of effectiveness and safety of allogenic cell for lumbar and cervical arthrodesis was very low, meaning that we have very little confidence that the effects seen are reflective of the true effects. Firm conclusions regarding effectiveness or safety of allogenic stem cells for arthrodesis are not possible without methodologically sound studies comparing such therapies to other treatments such as standard autograft. In addition to methodologically sound comparative studies, there is a need for formal economic evaluation of stem cell use for fusion.
Footnotes
Acknowledgments
The authors gratefully acknowledge Aaron John Robarts Ferguson for his contributions in performing literature searches, managing citations, data abstraction, and results table editing.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Aggregate Analytics, Inc, received funding from AO Foundation to perform the methodological and analytical aspects of this review. This study was organized and funded by AOSpine International through the AOSpine Knowledge Forum Degenerative, a focused group of international spine oncology experts acting on behalf of AOSpine. Study support was provided directly through the AOSpine Research Department.
Supplemental Material
The supplemental material is available in the online version of the article.
