Abstract
Purpose:
The purpose of this systematic literature review was to examine whether different assessment methods contribute to the variance in delirium incidence detected in populations of patients with acute stroke. Specifically, the aim was to address the influence of (1) choice of assessment tool, (2) frequency of assessment, and (3) type of health professional doing the assessment.
Methods:
We searched MEDLINE, EMBASE, and PsycINFO and included pro- and retrospective cohort studies assessing delirium during hospitalization of adult acute stroke patients.
Results:
In 30 articles, 24 unique populations were identified and included in the review. Delirium incidence ranged from 1.4% to 75.6% in total and a chi-square test showed a significant heterogeneity across studies (χ2 = 536.5, df = 23, P < .0001). No studies had an assessment for delirium before a patient entered the study. No specific patterns regarding the influence of tool, assessment frequency or health professional were discernible.
Discussion:
Subgroups analyses were not conducted due to the heterogeneity across studies. Studies comparing delirium assessment tools directly with each other are needed.
Conclusions:
Delirium is a common complication in acute stroke. No firm conclusions about a possible correlation of choice of tool, assessment frequency, and delirium incidence could be made due to the great heterogeneity of the study populations. Only 1 study compared 2 tools directly with each other. Further studies comparing delirium assessment tools directly with each other are needed.
Introduction
Development of acute delirium is a serious complication in patients hospitalized with acute stroke. Delirium interferes with initiation of stroke rehabilitation and causes prolonged stay in hospital, worse functional outcome after stroke, and increased mortality. 1 In 2012, 2 reviews on the occurrence of delirium in acute stroke reported incidences in the ranges of 10% to 48% 1 and 2.3% to 66%, 2 respectively.
The DSM-V 3 is the newest edition of the Diagnostic and Statistical Manual of Mental Disorders from the American Psychiatric Association. Previous editions also relevant to this review include the DSM-IV-TR, the DSM-IV, and the DSM-III-R.4-6 The definition of delirium is the same in DSM-IV-TR and the DSM-IV and differs only slightly from the definition in the DSM-V.
The DRS (Delirium Rating Scale) is a 10-item scale made to rate the severity of delirium symptoms. 7 The original version is from 1988 with a later revision in 1998 (the Delirium Rating Scale Revised 1998 [DRS-R-98]). The DRS-R-98 consists of a 13-item symptom severity scale and 3 additional diagnostic items. 8 Both scales are intended to be completed by a clinician using all available information about the patient under assessment.
The Confusion Assessment Method (CAM) was developed to provide an easy-to-use tool to health staff members not specifically trained in psychiatry. 9 The Confusion Assessment Method for the Intensive Care Unit (CAM-ICU) is an edition of the CAM designed for use in the intensive care setting. 10
In patients with stroke, the diagnosis of delirium is complicated by the fact that acute stroke itself as well as dementia caused by previous vascular events may cause delirium-like symptoms differing only in time of onset and development. In addition, the symptoms and signs of delirium have a fluctuating course over hours and days. It may therefore be difficult for unexperienced observers to correctly identify delirium. Some classes of health professionals may also be better equipped than others in the assessment of delirium and this may affect a given tool’s ability to identify delirium. Furthermore, it is possible that the frequency of assessments will influence the number of delirious patients found.
The purpose of this systematic literature review was to examine whether different assessment methods contribute to the variance in delirium incidence. Specifically, the aim was to address the influence of (1) choice of assessment tool, (2) frequency of assessment, and (3) type of health professional doing the assessment. It was not the purpose of this review to evaluate studies directly comparing 2 or more tools with each other.
Methods
A protocol for this review was published online at the PROSPERO register (ID CRD42017068360). 11 The protocol was based on the PRISMA-P guidelines. 12 The article has been written in accordance with the PRISMA guidelines. 13
As described in detail in the systematic review protocol, we searched the databases MEDLINE, EMBASE, and PsycINFO. A health science librarian was consulted on the search strategy. The Medical Subject Headings (MeSH terms) ‘stroke’, ‘cerebrovascular disorders’, ‘delirium’, ‘confusion’ were searched in combination with the words ‘cerebrovascular syndrome’, ‘brain ischemia’, ‘brain vascular accident’, ‘cerebrovascular accident’, ‘confusional state’, ‘psychosyndrome’, ‘brain syndrome’, ‘metabolic encephalopathy’. See review protocol for the exact MEDLINE search string. 11 The searches at EMBASE and PsycINFO were built on the same search words. No date restrictions were imposed on any of the searches and all searches were finalized in August 2017.
Screening, data extraction, and risk of bias assessments were done independently by 2 reviewers (J.S. and J.V.S.) and consensus established. Disagreement was sought to be resolved through discussion between J.S. and J.V.S. If consensus could not be reached, a third reviewer (T.C.) resolved the disagreement.
Study designs eligible for the review were prospective and retrospective observational cohort studies. Study designs excluded from the review were interventional studies, observational studies other than cohort studies, studies with 3 or less patients, and reviews.
To be included, studies had to estimate the delirium incidence, that is, new cases of delirium found during the study period, in a population of patients ⩾18 years of age admitted with acute stroke as hospital inpatients. Stroke populations with any combination of ischaemic stroke, intracerebral haemorrhages and subarachnoid haemorrhages were eligible. So were populations of patients with symptoms lasting less than 24 hours if they had relevant acute lesions on cerebral imaging. Patients with transient ischaemic attacks without demonstrable acute brain damage on brain imaging were excluded as they had full remission of symptoms and constituted a group which was treated heterogeneously, for example, in some institutions, these patients were admitted to an acute stroke wards and at other places they were handled via outpatient clinics. If a study encompassed patients both fulfilling and not fulfilling the inclusion criteria and the data from the patients fulfilling the inclusion criteria could be extracted for separate analysis, the study was considered eligible.
A 95% confidence interval (CI) for each study’s delirium incidence was calculated. The chi square test of heterogeneity was performed and due to significant heterogeneity the DerSimonian and Laird Random Effects Model 14 was used to synthesize the overall delirium incidence estimate. Note that in the DerSimonian and Laird Random Effects Model each study is given a weight based on the number of patients, but the range of weights are compressed as compared with a fixed effects model. Data storage, analyses, and figures were done in Microsoft Excel and RStudio (©2009-2017 RStudio, Inc, Boston, MA, USA).
Results
A total of 3748 unique titles were identified. A total of 134 articles were read in full length and 30 were included in the review.15-44 In these 30 articles, 24 unique populations were identified (Figure 1). The discrepancy between the number of articles and the number of unique populations was caused by the fact that the same population was described in more than one article. Thus, 2 populations were described in 2 separate articles and 2 populations were reported on in 3 separate articles. The included studies were published during the years 1987 to 2017 and consisted of 2 retrospective and 22 prospective cohort studies. Table 1 is an overview of the included studies. A total of 21 studies reported a mean for age which was in the range of 51 to 79.2 years. The median age (range of 68-74 years) was available for those 3 studies which did not report a mean age.

Flowchart of the search and article selection process.
Summary of included studies.
Abbreviations: AAD, acute agitated delirium; ACS, acute confusional state; AIS, acute ischaemic stroke; CAM: Confusion Assessment Method; CAM-ICU, Confusion Assessment Method for the intensive care unit; DRS, Delirium Rating Scale; DRS-R-98, Delirium Rating Scale Revised 1998; DSM-III-R, Diagnostic and Statistical Manual of Mental Disorders (Third Edition Revised); DSM-IV, Diagnostic and Statistical Manual of Mental Disorders (Fourth Edition), DSM-IV-TR, Diagnostic and Statistical Manual of Mental Disorders (Fourth Edition, Text Revision); DSM-V, Diagnostic and Statistical Manual of Mental Disorders (Fifth Edition); ICH, intracerebral haemorrhage; ICU, intensive care unit; MD, medical doctor; SAH, subarachnoid haemorrhage.
The studies marked with green are the 11 studies in which the tool use was (nearly) daily or more often.
Only data on age of the ACS patients were reported.
ACS was defined as (a) a sudden disturbance of consciousness, manifested by a reduced clarity of awareness, together with (b) a change in cognition, including memory impairment, language disturbances, and disorientation to time or place.
Not specified if any specific score triggered a delirium diagnosis or if the 3 additional diagnostic items were used to indicate whether delirium was present or not.
Only 101 patients were ever delirium assessed but the reported ages are from the total 111 patients (10 patients were never assessed for nondisclosed reasons).
Only 98 patients were ever delirium assessed but the reported ages are from the in total 114 patients (16 patients were never assessed because of persistent coma).
Only 150 were ever delirium assessed but the reported ages are from the in total 174 patients (24 patients were not assessed due to ‘neurologic injury’).
ACS is failure to maintain a coherent stream of thought or action with inattention and distractibility and ACS causes characteristic cognitive impairments such as disorientation, memory disturbance, unconcern of illness, agraphia, anomia, and constructional disability. AAD have the same features plus vivid hallucinations, delusion, and affective and autonomic excitement.
Delirium incidence
The delirium incidence range was 1.4% to 75.6% (see Figure 2). Figure 3 shows a graphic representation of the data synthesis for the delirium incidence. The chi-square test showed a significant heterogeneity across studies (χ2 = 536.5, df = 23, P < 0.0001). Cochrane Q = 743.5 (P < .0001) and I2 = 96.9% and the random effects model showed a delirium incidence of 22.8%, 95% CI: 18.2-27.4 (23.9%, 95% CI: 19.5-28.4%, with the 2 retrospective studies removed from the model).

Figure illustrating the delirium incidences categorized according to which assessment tools were used in each of the studies. Note that there are 4 studies using solely the CAM, 2 studies found incidences of 27.4% and 28.1%, respectively, and are almost nondiscernible from each other on the figure. Please note that the study by McManus et al is depicted twice, once for the CAM (delirium incidence 28.1%) and once for the DRS (delirium incidence 26.8%).

Forest plot. The size of each square visualizes a study’s percentile weight, the horizontal bars indicate the 95% confidence interval (CI), the centre of the diamond indicates the overall delirium incidence estimate, and the width of the diamond represents the 95% CI of the overall estimate.
Assessment tools
In 16 studies, the tools used were the CAM, CAM-ICU, DRS, and DRS-R-98 (exclusively, in combination with each other or in combination with DSM criteria). The study by McManus et al 40 reported 2 delirium incidences for the same population; one for the DRS (delirium incidence 26.8%) and one for the CAM (delirium incidence 28.1%). Six studies solely used some version of the DSM criteria (1 used DSM-V, 3 used DSM-IV or DSM-IV-TR, and 2 used DSM-III-R criteria) and 2 studies used some other specified criteria for delirium (see Table 1). Looking only at the studies using the CAM, CAM-ICU, DRS, and DRS-R-98 (alone or in combinations with each other or in combination with DSM criteria), the delirium incidences were within a narrower range of 6.6% to 35.3% (Figure 2).
Frequency and timing of assessment tool use
In the prospective studies, delirium assessment was done once a day or more in 6 studies. The remaining 16 prospective studies assessed with a smaller frequency, for example, twice a week. The retrospective studies used patient records, either covering the entire admission or limited to days 1 through 7.
Marked in green in Table 1 are the 11 prospective studies that assessed roughly once a day (eg, daily except Sundays) or more often. Looking at the table, comparing these 11 studies with those that assessed with a frequency less than roughly daily, there is no discernible pattern with respect to delirium incidence.
Type of health professional
Information about the type of health professional(s) using a specific tool was available from 20 studies (see Table 1). Study authors were contacted on 8 occasions to clarify the information about which type of health professional had done the delirium assessments in their studies. Nine prospective studies specifically stated that the persons using the tools had been trained to use their specific tool or to apply the delirium criteria. No specific pattern as to whether the type of health professional doing the assessments had an influence on the delirium incidence was discernible.
Quality assessment
A panel of questions was designed to perform a quality assessment of the included studies. The questions were taken from Watt et al 45 and were, for some questions, modified slightly. Questions on exclusion of aphasic patients, delirium assessment before inclusion into a study, and the timeframes used for delirium assessments were added. Table 2 shows the questions and the answers for each study in this review. Of the 24 studies, 5 did not have clearly described characteristics regarding age, disease status, and preexisting cognitive impairment. The recruitment strategy was nonconsecutive in one population and unclear in another three. The inclusion and exclusion criteria were judged not to allow for appropriate selection of the respective studies’ target populations in 2 cases (unclear in 3 cases) and aphasic patients were excluded in 9 studies. There were missing data on delirium assessments in 11 studies and 2 of these did not document reasons for this missing data. The delirium assessment was not performed by an independent assessor in 8 studies (unclear in an additional 6 studies) but only in 2 studies were the delirium assessments not done in a standard manner. No studies had an assessment negative for delirium before a patient entered the study and only 2 studies described the time frame used for a given tool’s application.
Quality assessment table.
Abbreviation: NA, not applicable.
Discussion
In our review, we found a large range in delirium incidences from 1.4% to 75.6%. It must be emphasized that none of the studies tested a certain screening method against a predefined golden standard. The study by Mitasova et al 38 was a study comparing blinded assessments between a Czech version of the CAM-ICU and the DSM-IV; however, they reported delirium incidence for the DSM-IV only and therefore only this was included into this review. Based on the data in the included studies, we were able to calculate an overall estimate of delirium incidence of approximately 23%.
Even though all populations are acute stroke populations, heterogeneity across populations should be assumed. We did not do any test for a potential statistically significant difference between subgroups (ie, studies using the CAM, DRS, or DRS-R-98 vs the rest) as the studies were of observational design and the heterogeneity was, as mentioned, significant. This means that the subgroup pooled estimates and I2 values might very well had turned out to be different for other reasons than the chosen subgrouping because of unknown confounders and any confident conclusions of such tests would therefore be impossible to make.
However, the narrower range of incidences from studies using the CAM, CAM-ICU, DRS, or DRS-R-98 generates the hypothesis that using standardized and validated delirium assessment tools such as the CAM, CAM-ICU, DRS, or DRS-R-98, a more precise detection of acute delirium may be obtained in patients with acute stroke. We emphasize that results from this review do not allow actual comparison of groups of studies with each other due to the heterogeneity of the studies. Only one study in this review compared 2 tools directly with each other. Further studies comparing delirium assessment tools directly with each other are needed.
A study by Infante et al 46 was not included into this review, because it had patients with transient ischaemic attack (symptoms lasting less than 24 hours and no acute cerebral lesions demonstrated on imaging) intermixed in its population but is still worth mentioning. The study found a delirium incidence of 52% when using the 4AT instrument but a lower incidence of 32% when the DSM-V criteria were applied to the same patients by the same neurologist. The 4AT was available for the first time in 2011 and is designed as an instrument for rapid delirium screening. 47
Concerning the types of health professionals using the tools and the delirium incidence found, no discernible pattern was evident from our review. As mentioned above, the heterogeneity among the studies was considerable and no subgroup analyses were therefore done. To our knowledge, no previous studies have specifically addressed the possible influence of the type of health professional on the detection rate of delirium. Yet, this question must remain unanswered.
Due to the fluctuating symptoms of delirium, it could be speculated that the frequency of assessments will have an impact on the detection rate of delirium. However, due to the multitude of different tools in use across these 22 prospective studies, no firm conclusions can be made for any of the tools as to whether there is a correlation between the frequency and delirium detection.
Several sources for bias may exist. An important bias is the fact that the studies had clearly different types of settings and spanned a rather wide range of publication years. For example, some were conducted in modern stroke wards, others in neurological semi-intensive care units and others again in older settings. The types of stroke also differed across studies. Some reported both on ischaemic and haemorrhagic strokes, others only one type. The underlying stroke etiology itself might also in some situations contribute to the development of delirium. Another potential bias may come in- or exclusion of aphasic patients. The exclusion of aphasic patients may result in a lower incidence of delirium. Thought content and attention may be harder to judge when language is impaired, and this makes severely aphasic patients more difficult to assess for delirium. A validated tool for assessing severely aphasic patients is obviously needed. The CAM-ICU 10 does not require the patient to speak but severe aphasia might still interfere with patient’s ability to understand what is being said without confusion being present. In addition, perceptual deficits other than aphasia might also make a delirium assessment difficult. Data on perceptual deficits other than aphasia were not collected in this review.
Assessment for delirium before the patients entered a given study was not done in any of the included studies. It is quite conceivable that some patients might have been delirious before inclusion (and before admission or before their stroke). Another bias might come from the time frames used when evaluating patients for the presence of delirium. For a given tool, some studies might consistently have used longer time frames (eg, looking back 24 hours instead of 8 hours) than others. The effect this might have on the incidence of delirium is unknown.
Conclusions
Delirium is a common complication in acute stroke. The wide ranges of delirium incidence reported in the different stroke studies imply that delirium can be difficult to recognize in patients with stroke, leading to both under- and overdiagnosis.
No firm conclusions about a possible correlation of choice of tool, assessment frequency, and delirium incidence could be made due to the great heterogeneity of the study populations. Only 1 study compared 2 tools directly with each other. Further studies comparing delirium assessment tools directly with each other are needed.
Footnotes
Acknowledgements
The authors wish to thank Jette Meelby, health science librarian at Nordsjællands Hospital, for her help constructing the search strings and procuring articles. They also wish to thank Jacob Mesot Liljehult, RN, cand.scient.san., PhD fellow at Nordsjællands Hospital, for help with the forest plot.
Funding:
The authors disclose receipt of the following financial support for the research, authorship, and/or publication of this article: JS is a PhD-student with a salary supported by research grants from ‘Nordsjællands Hospital’ and ‘Familien Hede Nielsens Fond’.
Declaration of conflicting interests:
The authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Author Contributions
JS and JVS researched literature. JS, JVS, and TC conceived the study. All authors were involved in protocol development and data analysis. JS wrote the first draft of the manuscript. All authors reviewed and edited the manuscript and approved the final version of the manuscript.
Ethical approval
Ethical approval was not sought for this article because there was no indication that any of the studies in review posed any ethical dilemma.
Informed consent
Informed consent was not sought for this article because no patient identifiable data were obtained for this review from any of the studies reviewed. All studies included into the review featured anonymized patient data.
Guarantor
J.S. is the guarantor of this study.
