Abstract

Introduction
The term ‘big data’ is used to describe large, complex datasets and the associated advances in technology and analytics.1,2 Within healthcare, this covers many different types of data including, for example: routinely collected clinic data in electronic health records; healthcare claims data, genomics data; public data releases, such as aggregated population health data or clinical trials results; data from automated sensors and smart devices; and information from healthcare forums and social media. In late 2013, the Association of the British Pharmaceutical Industry published its ‘Big data road map’, which sets out an action plan to direct progress in the use of big data over the next four years. 1 The road map highlights the need to increase awareness and understanding of big data, build capability and capacity, create sustainable data ecosystems and accelerate high-value opportunities. These suggestions do not just apply to pharma, however, but also to healthcare more generally as clinicians, patients and policy-makers look to take advantage of the potential benefits of big data. 3
There is, however, a risk that patient-reported outcome (PRO) data may be underrepresented in the big data ‘revolution’. PRO data are important in providing patients’ perspective on their own health. However, in contrast to countries such as the US and Netherlands, 4 PROs are not routinely collected in UK clinical settings (other than for a small number of procedures, via the NHS PROMs programme). 5 If this situation is not addressed, we risk continuing our overreliance on routinely collected clinical indicators to provide evidence for health policy: a potential backwards step in the move towards patient-centred care.
Potential benefits of big data
Big data analytics offer a range of exciting opportunities, including the facilitation and acceleration of targeted drug design, the development of diagnostic algorithms, a shift towards stratified medicine. 1 Big data can provide ‘real-world’ post-licensing safety data, assist comparative effectiveness research and inform predictive impact modelling. All of these initiatives have the potential to lead to significant patient benefits, through newer and more targeted treatments and provision of information to promote shared decision-making. Increasing amounts and improved access to electronic healthcare data, coupled with advances in analytic methods and computational power and an increased need for ‘real-world’ evidence of effectiveness, are driving the rapid growth in this field. 6 Collaboration between healthcare providers, academics and the pharmaceutical industry will be essential to maximise the use of big data to inform healthcare policy and delivery. 7
PROs and big data
PROs are collected using questionnaires providing a systematic way of measuring patients’ subjective views about their own health. In trials, PROs provide additional ‘patient-centred’ data which are unique in capturing the patient’s own opinion on the impact of their disease, and its treatment, on their life.
8
This is important: evidence suggests clinical indicators, when used in isolation, can underestimate the impact of a disease upon the individual and overestimate the effectiveness of healthcare interventions.
9
PRO trial data may be used to inform clinical care and decision-making, predict long-term outcomes and influence health policy such as the appraisal of new health technologies by the National Institute for Health and Care Excellence.
10
The use of PROs in clinical trials is well established, but widespread use in routine UK clinical practice is relatively uncommon; where it does take place, data collection and reporting tends to have a focus on provider performance rather than individual patient care.
11
Routinely collected secondary healthcare data have traditionally concentrated on clinical outcomes such as rates of postsurgical mortality, readmission to hospital, reoperation, infection and serious adverse events. Thus, we have readily accessible big data on these clinically relevant, but relatively rare, safety outcomes.
12
In addition, we have at our fingertips millions of linked primary and secondary care records including information on medical history, diagnostic test results and treatment, however, these records generally lack PRO information. It is important that this situation be addressed, in order that we unlock the potential benefits of big PRO data in the future. These benefits may be realised in enhanced clinical audit activity, in the delivery of tailored patient care and in the facilitation of clinically important pragmatic trials (Box 1). For instance, routine assessment of PROs may be used to guide patient care:
At a population level. Where PRO data may be used as an outcome in prognostic models to predict those patients most likely to obtain health gains and at an individual level to identify those who are a priority for care.
13
At an individual level. Where real-time monitoring of PRO data may help with early identification of problems requiring a prompt response (PRO-Alerts), facilitate improved communication between patients and their clinical team and allow rapid referral to appropriate specialist care when necessary.
14
Potential uses of PRO data.
What are the opportunities for ‘Big PRO Data’ and how does this work in practice?
PROs provide us with a unique and far-reaching opportunity to drive healthcare reform (Box 1); however, this requires timely integration into electronic systems and a thoughtful evidence-based approach to their use. In the first instance, we should ensure that big datasets include PROs that are acceptable to a range of stakeholders, the most important of which are patients. This may necessitate the development of ‘core outcome sets including PROs’ for routine clinical care. 15 PROs should also be rigorously selected based on the rationale for their use, ensuring that they are valid and reliable for the population of interest. Appropriately selected instruments have potential to provide data for more than one purpose but need to be piloted and reviewed to ensure that they are feasible, do not adversely affect the clinical workflow and are cost-effective. 16 For example, routine electronic capture of the EQ-5D across a range of settings may help facilitate cost-effectiveness analyses both pre- and post-marketing launch and provide auditable data. Large-scale implementation needs to be supported with appropriate guidance and formal evaluation. While traditional paper-based questionnaires continue to be used in a number of settings the future undoubtedly will lie with electronic data capture. Technological advancement such as patient portals and mobile phone applications may facilitate advancement in the integration of PROs with electronic health records.17,18 For instance, a current UK-led initiative is working to link routinely collected electronic PRO data provided by cancer survivors, to cancer registry data, in order to inform risk stratification. 19
In addition, social media and health forums have been identified as potential sources of evidence regarding patient views that may be used to inform commercial strategies. 20 The use of these complex unstructured data to inform health policy is the subject of considerable debate. 21 Further research will be necessary to develop and refine methods to utilise social media data effectively.
Challenges for big PRO data
The current fragmented approach to PRO data collection in routine care represents a major challenge to the use of PRO data in big datasets. 22 Standardised collection would facilitate pooling of data across providers and facilitate evidence synthesis. It would also allow comparison across healthcare providers, giving valuable information on patient outcomes both to patients and referring GPs (who may use this data to inform decisions when choosing healthcare providers) and commissioners. Harmonisation is required both within and across institutions with collaboration between academia and industry, patient advocates, regulatory bodies, commissioners and policy-makers. 23
The NHS made laudable progress towards the routine collection of PRO data in healthcare settings with the introduction of the PROMs programme in 2009 – but subsequent progress rolling this out beyond the four elective surgical procedures which were its initial focus has been limited. Furthermore, the means by which data are collected and analysed do not facilitate the real-time use of individual patients’ responses in shared decision-making with clinicians. There are technical and other challenges in further developing the PROMs programme so that the data can be used both by and for patients as well as, in more aggregated forms, by budget holders and policy-makers and others making decisions that affect patients. This includes selection of measures with appropriate psychometric properties for use at an individual patient and group level that have cultural validity and are available in a range of languages. Acceptability of measures that are not over-burdensome will also be crucial. Attempts to standardise measures for use across patient groups (e.g. Patient-reported Outcomes Measurement Information System), use of appropriate methods for instrument development (item response theory) and computerised adaptive testing may go some way to address this. 24
A key challenge to be faced in the use of PROs for research, audit and to inform individualised care is ensuring that patients understand who has access to their data and, crucially, how this information will be acted upon. 25 Patients’ PRO data can sometimes reveal ‘worrying levels of psychological distress or physical symptoms that may require an immediate response’, known as a ‘PRO alert’. 25 For PRO data collected in the clinical care setting, there should be a plan in place to monitor for alerts and an outline of the response options available to healthcare staff. PRO alerts may lead to increased healthcare utilisation; however, they also have the potential to bring cost savings if care can be tailored to those in need. Where PRO data are collected for research purposes, there is debate over whether the responsibility for PRO alert management rests with the research team or with the participant’s regular clinical care providers. Such issues should be addressed during the research design phase and require close collaboration between researchers and clinicians. Whatever the PRO alert management plan in operation, both patients and study participants should be fully informed of how their PRO data will be handled.
Increased assessment of PRO data may risk over-medicalisation. Working in partnership with patients and having a clearly defined rationale for assessment will be key to successful integration of PROs in big data. Expansion in the use of PROs will also require the active support and involvement of doctors, nurses, pharmacists, funders and others involved. Convincing examples of the impact of bringing PROs into routine consultations and care, together with standard approaches and technology, will be needed to persuade healthcare professionals of the overall merit of bringing in PROs.
Conclusion
The rapid acceleration in electronic data capture and analytic techniques means that we are at the brink of healthcare revolution. Big PROs data offer exciting possibilities, but we need to do more to facilitate increased routine PRO collection in clinical care for the benefits to be realised. In order to maintain and further develop a patient-centred approach to care, we require the rapid standardised introduction of PROs within electronic health records. This will necessitate a coordinated approach, guided by input from a range of stakeholders, to ensure a clear rationale for PRO assessment utilising optimal instruments, aimed at facilitating high-quality data capture and appropriate governance. 16 Adding PROs to the heart of the big data agenda will open up an important new opportunity in the progress to patient-centred care.
