Abstract
Keywords
Introduction
Soft tissue sarcomas (STS) are a rare, heterogeneous group of mesenchymal cancers representing approximately 1% of adult solid tumour cancers. 1 They arise from connective tissue and include over 100 different histological subtypes which can vary in their presentation, biological behaviour, and outcome. 2
One of the most common sarcoma subtypes, representing ∼20% of cases, is liposarcoma (LPS).3,4 It develops in fat tissues and can occur in any part of the body, most commonly in the retroperitoneum and deep soft tissue of proximal extremities. 5 LPS is made up of several histological subtypes (dedifferentiated (DD), well-differentiated (WD), myxoid/round cell and pleomorphic) with less common mixed and fibroblastic variants also reported. 6 LPS subtypes are associated with different prognoses and responses to treatment 7 with DDLPS being aggressive and treatment resistant.5,7
Population-based incidence and survival data derived from current standard clinical practice are critical for improving patient care and outcomes. 8 Availability of population-based data for sarcomas, specifically LPS, remains scarce due to the rarity and heterogeneity of the disease and how the different subtypes are classified and recorded. There is need for further research into the epidemiology of LPS to inform the management of patients as well as future research, including new treatments.
In the context of assessing treatment outcomes, and further enabling continuous assessment of the risk/benefit ratio of newly approved medicines, population based real world data (RWD) may aid benchmarking in the evaluation of new therapies, where large scale randomised trials may be challenging.9,10 This is particularly true for STS (and LPS specifically) where histology specific treatment decisions have been driven by RWD studies in the absence of randomised trial data, particularly in the frontline setting.7,11,12
Regulators across the globe are recognising the value of real-world evidence (RWE) in supporting new drug development and approvals, especially in rare disease areas.10,13 In the United Kingdom (UK), the Medicines and Healthcare products Regulatory Agency (MHRA) delivers the Clinical Practice Research Datalink (CPRD) research data service, which provides access to one of the largest, longitudinal research databases in the UK. It contains anonymised primary care patient data for approximately 60 million patients from a network of general practices (GPs) across the country and has been linked to several other health related datasets.14–16 CPRD data has not only been used to support patient recruitment to trials, but also to provide RWD virtual control arms as disease course or standard of care treatment comparators, and for pharmacovigilance post-approval.15,17
Study objectives
The primary objectives of this study were to describe a) the demographic and clinical characteristics of LPS patients in England, b) the characteristics of LPS tumours based on patient clinical and demographic characteristics, and c) the overall and stratified survival rates for LPS patients. Where possible, we interrogated these parameters for DDLPS as this subtype is aggressive and treatment resistant.5,7
For the secondary objectives, the study described the proportion of LPS patients by treatment groups, treatment duration and survival time from first treatment initiation.
Methods
Study design and setting
We conducted a retrospective cohort study (using data available through the CPRD 18 ) to describe the demographics, tumour characteristics (at diagnosis), treatments and survival of patients with LPS. In doing so, we aimed to present a detailed picture of the treatment and survival of patients with this rare form of cancer in England.
Data sources
CPRD Aurum contains anonymised primary care electronic healthcare records (EHR) in the form of clinical codes (e.g. SNOMED) from GP practices using EMIS® GP software. 19 It is the largest of CPRD’s primary care databases, including over 60 million patients (over 40 million of which are research acceptable).19,20 The January 2022 release of the CPRD Aurum database 20 was used for cohort definition and determining patient demographics. The coverage of the data extracted was from 1st January 1987–31st December 2021.
CPRD Aurum is also routinely linked to several other datasets to provide a more comprehensive picture of patient care, thus helping to enhance public health and clinical research. 16 Information regarding how CPRD undertake the linkage (including the joining criteria) is provided in a paper by Padmanabhan et al. 16
The ONS Death Registration dataset (containing information on date and causes of death 21 ) was used to more accurately define the date of death for the survival analysis. Copyright © (2022), re-used with the permission of The Health and Social Care Information Centre, all rights reserved.
The National Cancer Registration and Analysis (NCRAS) Cancer Registration (CR) data include diagnostic and treatment information for registered tumours diagnosed or treated within England 22 and was used to determine tumour characteristics, treatment patterns and apply certain inclusion/exclusion criteria for cohort definition. The NCRAS SACT (which includes information on systemic treatments 23 ) and the NCRAS RTDS (which contains records of radiotherapy treatments) 24 were used to ascertain receipt of relevant treatment and treatment duration.
Composite measures of the Index of multiple deprivation (IMD) at the Lower Super Output Areas (LSOA) level in England for 2015 have been linked with CPRD data at the patient (using the patient’s postcode 25 ) and practice level (using the GP postcode 26 ). Patient level IMD data were used to describe the cohort by IMD quintile (as a proxy for socio-economic status). Practice level IMD data were used if patient level information was unavailable.
Study population
Male and female patients newly diagnosed with LPS between 1st January 1998–31st December 2018 (to coincide with the end of the coverage period for the NCRAS datasets) were included in the study (Figure 1). Flow diagram of the patient counts following the application of the inclusion and exclusion criteria. *Research acceptable male and female patients in the January 2022 version of the CPRD Aurum database who had at least 1 day of follow up during the study period. **All patients in the study population. ***Patients in sub-cohort 1 whose index date was recorded in the NCRAS cancer registration data. ****Patients in sub-cohort 2 whose index date fell within the coverage periods for NCRAS SACT and RTDS data (1st January 2014–31st December 2018).
The index date was the first ever (incident) diagnosis event recorded within the study period (Appendix A), defined as a medical code recorded in the CPRD Aurum observation table (Appendix B.1) OR an ICD-10 code (Appendix B.2) AND specified morphology code (see Appendix B.3) in the NCRAS CR data. Using primary and secondary care data for defining the study cohort ensured capture of as many relevant cases as possible and reduced the risk of bias. 27
Where a patient had a first LPS record in CPRD Aurum up to 30 days before a LPS record in the NCRAS CR data, the NCRAS date and diagnosis record was used as the index record as these were deemed different records for the same cancer. Additionally, patients had data deemed “acceptable” for research (based on the acceptability flag generated by CPRD 19 ) and were registered at their GP practice for at least 6 months before their index date (which had to fall within the period they were registered at their GP practice). Prior registration was required to reduce the risk of bias due to misclassification of incident LPS diagnoses. Patients also had to be aged 18 years or older on their index date.
From the above study population, patients were assigned to Sub-cohorts (Figure 1). Sub-cohort 1 (for the first primary objective) included all patients in the study population while Sub-cohort 2 (second and third primary objectives) was limited to patients whose index diagnosis was recorded in the NCRAS CR data. Lastly, Sub-cohort 3 (for all secondary objectives) comprised of patients within Sub-cohort 2 who also had their index date within the coverage periods for NCRAS SACT and RTDS data (1st January 2014–31st December 2018). All patients in the cohorts were registered at practices in England.
Patients were also flagged if they had DDLPS, which was defined as a tumour record in the NCRAS CR data (see Appendix B.2) with the morphology code “8858” (if the index date was recorded in the NCRAS CR data) OR a CPRD Aurum medical code specifying DDLPS (if the index date was recorded in CPRD Aurum- see Appendix B.1).
Patients were followed up from their index date until the study end (defined as the earliest of the patient’s end of registration, the CPRD derived death date, practice last collection date and the end of the study period (31/12/2018)).
Definitions of the patient demographic, tumour and treatment characteristics.
As this was a descriptive study (which in part aimed to described patients with this rare form of cancer), the number of cases who met the inclusion and exclusion criteria determined the sample size.
Statistical analysis
All data (except the NCRAS datasets) were extracted using tools developed by CPRD or Stata 17 as these are held in house. Extracts from the NCRAS datasets were requested from National Health Service (NHS) England who provided data specific to this study. All data management and analyses were conducted using Stata 17.
Demographic statistics were presented for all patients, by first treatment after index date (for those whose index date was recorded in the NCRAS CR data) and for patients with DDLPS. For body mass index (BMI), we required the data to be recorded within 2 years prior to the index date as this measurement could be impacted by the disease in the form of weight loss. 28 Only treatment records after the index date were included (to prevent misclassification of treatment) and patients in the cohort who did not have a first treatment recorded were grouped as “No treatment”.
For Sub-cohort 2, summary statistics of tumour characteristics were calculated overall, for patients with DDLPS and by first treatment. Additionally, the median (with interquartile range [IQR]), one-, three- and 5-year survival rates (with 95% confidence intervals [CIs]) were calculated overall and by first treatment. Patients with less than 1 day of survival time (e.g., their index date was the same as or after their ONS death date) were excluded from the analysis. Kaplan-Meier curves were generated to describe survival overall, for patients with DDLPS, by sex and by 10-years age groups.
For Sub-cohort 3, duration of first treatment (for patients receiving chemotherapy or radiotherapy) and survival rates based on treatment initiation (with 95% CIs) were also calculated. Patients with less than 1 day of survival time (e.g., their treatment initiation date was the same as or after their ONS death date) were excluded from this analysis. The maximum of 4 years was calculated as no patient in Sub-cohort 3 had follow-up time of 5 years (due to the coverage period of NCRAS SACT and RTDS data). Kaplan-Meier curves were generated to describe survival overall, for patients with DDLPS, by sex and by 10-years age groups.
Where present, missing data were reported as counts and percentages. Imputation was only applied when generating the end of treatment date for patients who had this information missing. This was set to 42 days after the start of treatment date.
Due to CPRD’s policy, the results were presented in a way to prevent deductive disclosure by small cells.
This study was conducted by a multidisciplinary team comprising of experts in CPRD data, epidemiology, oncology, and data science. Collaborators were therefore able to draw on the expertise of fellow team members in the planning and execution of the study as well as the interpretation of the results.
Results
The source population contained 33.8 million male and female patients with research acceptable data, who had at least 1 day of follow-up during the study period. Of those patients, 2.7 million had a record in the NCRAS CR dataset.
During the study period (1st January 1998–31st December 2018) 1,315 patients qualified for inclusion in the study (Figure 1) with an average follow-up time (from index date) of 5.6 years (5.1 SD). The cohort contributed 7,424.5 patient years.
Of those patients, 611 (46%) were in Sub-cohort 2 (patients whose index date was recorded in the NCRAS CR data) and 169 (13%) were in Sub-cohort 3 (patients within Sub-cohort 2 who also had their index date within the coverage periods for NCRAS SACT and RTDS data).
Descriptive demographics for patients with liposarcoma record in NCRAS cancer registration data or CPRD Aurum primary care data- overall and by dedifferentiated tumour subgroup a .
SD: standard deviation; IQR: interquartile range.
aDue to CPRD’s policy, the results were presented in a way to prevent deductive disclosure by small cells.
bAge is calculated as year of diagnosis minus year of birth as recorded in CPRD Aurum.
cArea based deprivation is measured using patient-level IMD scores for England. Where patient level measures were missing, practice level IMD scores were used.
Across all cohorts, BMI (recorded within 2 years prior and most proximal to the index date) was missing for approximately half of the patients.
Characteristics of index liposarcoma tumours based on patient clinical data in the NCRAS Cancer Registration (CR) record a .
aDue to CPRD’s policy, the results were presented in a way to prevent deductive disclosure by small cells. This included not presenting values for certain stratifications.
bOne patient received two treatments on the same day and appears in two treatment subgroups.
In patients with DDLPS, information on grade and stage of the disease was missing (including unknown/not appropriate) in a large proportion of cases and 19% of patients had prior cancer diagnosis. The majority (75%) of DDLPS patients had CCI of 0 whilst this information was missing for 12.5% of patients (Table 3).
Following diagnosis, the median waiting time to first treatment was 52 days (IQR 84, 26–110 days) in the overall cohort, and 67 days (IQR 78, 38–116 days) for patients with DDLPS (Appendix C.3). Surgery had the lowest median waiting time (44 days: IQR 75, 23–98 days). The mean duration of treatment (for patients receiving chemotherapy or radiotherapy) was 36.6 days overall (Appendix C.4).
The survival rates of patients with liposarcoma from diagnosis record.
IQR: interquartile range.
aThis analysis includes patients who have at least 1 day of follow-up from index date. Six patients were excluded as they died on or before their recorded index date (diagnosis date) based on the ONS Death registration date.
The survival rates of treated patients with liposarcoma from treatment start date.
Kaplan-Meier plots for the survival rates are provided in Appendix D.
Discussion
This study is the first report exploring the epidemiology, treatments and outcomes of LPS patients based on data from one of the largest linked primary care databases in the UK.
This study primarily aimed to describe the demographics, tumour characteristics and survival of patients newly diagnosed with LPS. The majority of LPS patients were male and over the age of 60 years. LPS was the first cancer a patient was diagnosed with and the tumours were of the lower grade and stage when reported (though the tumour stage and grade information was missing for a high proportion of patients). Patients with DDLPS had similar characteristics as patients with other LPS subtypes, including myxoid liposarcoma. The likelihood of survival following diagnosis was worse for those with Grade 4 tumours (where this information was recorded).
For the secondary aims, we described the proportion of LPS patients by treatment, quantified first treatment duration following diagnosis and described the survival time of LPS patients following first treatment. Most patients did not have a first treatment recorded, and whilst patient age did not appear to be a driver for the high proportion of patients receiving no treatment (as reported by Tirotta et al. 29 ), our results may be impacted by missing data. For those who did receive treatment, the most common first treatment was surgery, in keeping with surgical resection being the mainstay of localized disease management. 12 DDLPS patients had a longer waiting time to first treatment (compared to other subtypes) and a greater proportion of patients with this type of tumour were treated with chemotherapy (Table 3). Those with Grade 3/4 tumours (where this information was recorded) had the lowest likelihood of survival following first treatment.
To date, only one other study in England (for data collected between 2013 and 2017) has reported epidemiology and survival data for STS and its main types. 8 Over the 5-year period, the reported 1-year and 5-years net survival rates for LPS were 90% and 81.5%, respectively. 8 Another report from the same authors investigated treatment patterns for sarcomas that occurred in the retroperitoneum, which included several STS types (but more commonly LPS and leiomyosarcoma), 30 and outcomes associated specifically with surgery in England. 29 The study found that types of treatment received (including surgery, radiation, chemotherapy and other care) varied considerably by age group and that surgery survival outcomes were much improved if treatment was received in a high-volume sarcoma centre. 29
Our results were similar to those reported by Bacon et al. (in terms of age and survival rates 8 ) and we were able to additionally present further information on LPS patients including BMI and prior cancer history.
Strengths and limitations
There were several strengths to this study. The large sample size, general representativeness of the English population and longitudinal coverage 19 makes it a robust source for gathering RWE. Additionally, the data are based on real world practice and contain a wealth of clinical information that can enhance research. We also incorporated data from several sources in order to build a more comprehensive picture of the characteristics and outcomes for LPS patients. Lastly, the study is authored by experts in fields of LPS and conducting studies using RWD.
However, limitations in the study should also be acknowledged. Despite the increased recognition of the utility of RWD studies and RWE in supporting drug development and approvals, there are specific requirements on data reliability and relevance. 9 Several examples (including our study) have highlighted the challenges associated with conducting large RWD studies in rare diseases and in sarcomas specifically. In the main, these were attributed to disease heterogeneity, variability in coding and data completeness and quality. This underlines the need for improved data collection and reporting.8,9,31 For example, while there was much information on the characteristics of LPS patients in this study, there were areas of limited data (e.g., stage, grade) which are vital in improving knowledge of the disease and the development of new effective treatments.
Additionally, we may have missed eligible patients from the study due to the cohort definition. In this study we used the CPRD database and the ICD-10 C49 code for identifying patients with LPS, regardless of anatomical location. In contrast, the Bacon et al. study extracted disease classification data from the larger but unlinked national NCRAS database and ICDO-3 disease classification, which includes the original disease anatomical as well as morphology and behaviour codes. 8 It is also noted that identifying STS patients (specifically liposarcoma patients) using ICD coding is limited due to the inconsistency in the coding used for diagnosis and treatment for this form of cancer. 32 Future studies should aim to use a definition which permits a much broader capture of patients. Additional analyses stratified according to anatomical site and histological type should also be considered to better understand outcomes in patients whose disease occurs in complex anatomical regions. However, when using real world data, these analyses have to be performed in compliance with local regulations surrounding data protection to prevent re-identification of patients with rare characteristics or conditions.
There may also have been misclassification in terms of LPS subtypes or confirmation of a LPS diagnosis (e.g., erroneous coding, missing information). The incomplete data from stage and grade, though consistent with variations in sarcoma data capture as previously reported, 8 also limited our ability to report on these characteristics. There was also incomplete information on other characteristics (e.g., BMI). Furthermore, the study results will be specific to England and therefore may not be generalisable to populations based in other countries due to differences in demographics or healthcare systems.
Conclusion
LPS is a challenging type of cancer that, until recently, has had limited epidemiological data, particularly in England. Our research indicates that LPS is a rare cancer with a significant treatment burden. We observed a decrease in survival rates with increasing patient age and advancing disease grade and/or stage. Furthermore, our study underscores the complexities inherent in collecting and reporting data for rare diseases such as LPS. Consequently, the enhancement and standardisation of data reporting, especially for rare heterogeneous cancers such as STS and LPS, is of paramount importance. This will optimise the use of patient data registries and RWE, ultimately leading to improved patient outcomes.
Supplemental Material
Supplemental Material - Epidemiology and survival outcomes for liposarcoma patients in England: An observational cohort study using real world data
Supplemental Material for Epidemiology and survival outcomes for liposarcoma patients in England: An observational cohort study using real world data by Jessie O. Oyinlola, Mounia Beloueche-Babari, Monika Frysz, Eleanor Yelland, Amy Walker, Rachael Williams and Robin L. Jones in Rare Tumors
Footnotes
Authors’ note
The authors did not receive payment related to the development of the manuscript and meet the criteria for authorship as recommended by the International Committee of Medical Journal Editors (ICMJE).
Acknowledgements
This study was supported and funded by Boehringer-Ingelheim who was given the opportunity to review the manuscript for medical and scientific accuracy as well as intellectual property considerations. This study is based on data from CPRD (obtained under licence from the UK MHRA), ONS and NHS England. The data is provided by patients and collected by the NHS as part of their care and support. The interpretation and conclusions contained in this study are those of the authors alone. We acknowledge NHS England for their support in provisioning the data for this project and the MHRA CPRD CPRD Research Data Governance (RDG) who reviewed the study to ensure it was methodologically robust, of public health benefit and maintained public trust. Additionally, we would like to acknowledge the support of Helen Booth who also provided oversight for delivery of the project and quality assurance for some of the analysis. The authors would like to thank Dr Ruth Farmer of Boehringer Ingelheim Ltd. for her advisory input into the study results interpretation and her assistance in reviewing the manuscript. We also thank Smit Patel who was employed by Boehringer Ingelheim Ltd when this study was initiated for his input into the project management and study protocol. Lastly, we would like to acknowledge the patients who contribute their data to CPRD and the other datasets used in this study. Without their support, such research would not be possible.
Ethical statement
Author contributions
Mounia Beloueche-Babari (MBB), Monika Frysz (MF) and Amy Walker (AW) developed the design of the study, edited the study protocol, and drafted parts of and edited the manuscript. Jessie O. Oyinlola (JO) developed the statistical analysis plan (SAP), performed the data management and analysis and drafted the manuscript. Eleanor Yelland (EY) drafted the study protocol and project managed the research at CPRD including developing and monitoring the project plan, providing analysis quality assurance, and acting as the key contact to Boehringer-Ingelheim throughout the duration of the project. She also edited the manuscript. Rachael Williams (RW) provided oversight of the delivery of the project as well as edited the manuscript. Robin L Jones (RLJ) developed the design of the study, edited the study protocol, provided clinical input and interpretation, and edited the manuscript.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported and funded by Boehringer-Ingelheim.
Declaration of conflicting interests
The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: This study was supported and funded by Boehringer-Ingelheim. MBB, MF and AW are employees of Boehringer Ingelheim. JO, EY and RW are employees of CPRD. CPRD is jointly sponsored by the UK MHRA and the National Institute for Health Research (NIHR). As a not-for-profit UK government body, CPRD seeks to recoup the cost of delivering its research services to academic, industry and government through fees. This work was conducted as part of a commissioned research agreement between CPRD and Boehringer-Ingelheim.
Data Availability Statement
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
