Abstract
Background:
The electrocardiograph has been integral to the diagnosis of acute coronary syndromes since the mid-20th Century and is an important initial investigation that chest pain patients undergo on presentation to the Emergency Department. The Fourth Universal Definition of Myocardial Infarction recommends using dichotomous cut-offs to identify ischaemic electrocardiographs.
Objectives:
We aimed to summarise the existing knowledge to inform emergency clinicians about the diagnostic accuracy of the new guidelines.
Methods:
We performed a systematic review and a narrative analysis due to the heterogeneity of the studies.
Results:
We were able to obtain diagnostic characteristics for 10 papers. The ST-depression criteria were highly specific but poorly sensitive in five papers, with a specificity of 97.2%–99.3% and a sensitivity of 16.6%–20.0%. The remaining papers reported a higher sensitivity of 25.7%–58.6% but a lower specificity of 86.0%–91.2%. T wave inversion demonstrated poor specificity; the papers that looked at 0.1 mV T wave inversion demonstrated a sensitivity of 26.9%–46.8% and a specificity of 68.6%–86.4%.
Conclusion:
The heterogeneous evidence database demonstrates that the Fourth universal definition’s diagnostic performance varies wildly. Apart from two outlying papers, ST-depression has suboptimal sensitivity but high specificity. T wave inversion appears to be more sensitive yet less specific.
Keywords
Introduction
Acute chest pain is the second most common cause of unplanned admission to hospital in England. 1 The majority of these patients present to the Emergency Department (ED), and accurate recognition of acute coronary syndromes (ACS) in this cohort is integral to emergency medicine diagnostics. The mortality rate of patients with missed acute myocardial infarction (AMI) is twice that of patients who are diagnosed accurately. 2
A possible solution would be to admit all patients for serial electrocardiograph (ECG) measurements and serial biomarker studies but this comes with its own pitfalls such as hospital acquired infections and an extraordinary demand on finite resources. With a prevalence of ACS in previous cohorts being as low as 7%, 3 increasing focus has been devoted to early rule-out strategies that combine elements from the history, clinical examination, an ECG and the initial cardiac biomarker.4–7
Recently, in August 2018, the Fourth Universal Definition of Myocardial Infarction was published and recommended the use of objective cut-offs in much the same way as the ST-elevation myocardial infarction guidelines. 8 Thygesen and colleagues attempted to provide strict criteria for the diagnosis of myocardial injury based on changes in troponin concentration and their findings can be summarised in the ‘Ten Commandments’. 9 One can make a diagnosis of non-ST-elevation myocardial infarction (NSTEMI) if the cardiac troponin value is above the 99th percentile of the upper reference limit and there is evidence of cardiac symptoms or a culprit lesion on angiography or ECG changes. The guidelines define ECG signs of non-ST-elevation acute coronary syndrome (NSTEACS) to be 0.05 mV or more of ST-depression in two contiguous leads and T wave inversion of 0.1 mV or more in two contiguous leads. The overall effect of these new guidelines in the acute care setting is not clear; this is an important question given the clinical importance in recognising AMI, and low prevalence of the condition in chest pain patients. To this end, we have performed a systematic review of the literature to demonstrate how these guidelines perform in an ED setting.
Methods
Search strategy and eligibility criteria
This systematic review adhered to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines for diagnostic test accuracy reviews. 10 A three-part search strategy was used, combining free type, database-specific terms and Boolean operators, focusing on the ECG, the ED and suspected ACS. There was no time limit and all the references of relevant papers were examined. The search strategy explored the libraries of Medline, Embase and all Evidence-Based Medicine reviews, including Cochrane, using the Ovid interface. A grey literature search was also performed. Authors were contacted to seek further diagnostic raw data or further clarification regarding a papers method.
All diagnostic studies, whether they are retrospective or prospective, observational or interventional, were eligible if they were based in the ED. To be included in the analysis, studies had to report the diagnostic characteristics of the ECG in identifying NSTEMI or major adverse cardiac events (MACE). If the diagnostic characteristics were not available, we contacted the authors. We restricted our search to English language papers. A full breakdown of the search strategy is available on Prospero (CRD42018111749).
Index test and outcome measures
The ECG is a cornerstone of ACS diagnosis and its use has remained relatively static. ST-depression and T wave inversion are the classic signs of ischaemia on an ECG in the NSTEACS cohort. 8 The ECG has varying diagnostic performance based on the dichotomous cut-off chosen and the place where it is measured. We have reported all the various cut-offs used by the studies alongside their diagnostic characteristics.
The primary outcome of interest was a diagnosis of AMI. The third universal definition of myocardial infarction (which was in use until very recently) was considered to be the optimal reference standard. 11 This definition requires a rise and or/fall in the cardiac biomarker, with one value above the 99th percentile, and signs or symptoms suggestive of myocardial ischaemia. If diagnostic accuracy for AMI was not reported, we also considered studies that evaluated the incidence of MACE at 30 days. MACE is a composite diagnosis of AMI, revascularisation and all-cause mortality, in an attempt to take account of unstable angina (UA). It is customary to report this as a 30-day outcome but we did not require the incidence of MACE to be reported as part of our inclusion criteria.
Data extraction
Database-specific syntax was used to search the electronic databases. All of the papers abstracts were searched and screened independently by two investigators (N.M. and C.R.) for eligibility. The investigators were middle grade emergency physicians with 8 and 4 years of experience, respectively, and both are doctoral emergency medicine research fellows with numerous ACS publications. Both investigators then undertook a full-text review, including a review of the references, to establish which papers were suitable for data extraction. After this was performed, a meeting was held to discuss findings and resolve discrepancies. Authors were contacted for missing data and possible new material. We independently analysed the papers using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool. 12 This was performed over a 2-month period. Discrepancies were resolved by consensus between the investigators. In the event of any irreconcilable differences in opinion, a third investigator who is a professor of emergency medicine (R.B.) was set to provide a final adjudication. There were no disagreements.
Statistical analysis
After extracting data, we considered the appropriateness of pooling the results in a meta-analysis. Unfortunately, there was no consistency between studies with regard to the measurement of the ECG (index test), the adjudication of the primary outcome varied widely, and the patient selection was strikingly different. We had planned to formally assess the heterogeneity of the papers using the Cochrane Q chi-square test and the I 2 statistic but this was also deemed unnecessary as there was overt evidence of clinical heterogeneity between studies, rendering meta-analysis inappropriate.
Results
The literature review identified 19 potential papers from 979 eligible articles. A search of the grey literature and manual inspection of references managed to find five more articles.13–17 We are incredibly grateful for those who provided further information on request.4,13,18–20 We were able to obtain diagnostic characteristics for 10 papers.4,3,7,13,14,18–22 We summarise the review process in Figure 1 and the characteristics of the included studies are described in Tables 1–4.

A flow diagram summarising the paper selection process.
QUADAS-2.
ECG: electrocardiograph; QUADAS-2: Quality Assessment of Diagnostic Accuracy Studies-2.
Characteristics of included studies.
STEMI: ST-elevation myocardial infarction; ACS: acute coronary syndromes; NSTEMI: non-ST-elevation myocardial infarction; UA: unstable angina; ECG: electrocardiograph; ED: Emergency Department LBBB: left bundle branch block; LVH: left ventricular hypertrophy; EM: Emergency Medicine; TWI: T wave inversion.
The diagnostic characteristics of studies that looked at ST-depression.
ACS: acute coronary syndromes; PPV: positive predictive value; NPV: negative predictive value.
The diagnostic characteristics of studies that looked at T wave inversion.
ACS: acute coronary syndromes; PPV: positive predictive value; NPV: negative predictive value.
There were some concerns regarding patient selection in the included studies. Hess and Ong excluded patients under the age of 24 and 25, respectively, while Liu and Ngako went further, excluding those under the age of 30 and 65, respectively.7,13,18,20 Fleischmann and Ong also excluded patients that had more than 1 mm of ST-depression in two or more contiguous leads.20,21 There was also concern regarding how representative the patient populations were with some papers having a prevalence between 24% and 34% or as low as 7%.3,13,14,18,21,22
The method for analysing the index test, the ECG, was unclear in several papers but the majority of queries were cleared up thanks to beneficial responses via email.4,13,14,18–20 Only four papers had the treating clinician reviewing the ECG; the remainder were evaluated by independent investigators affecting the applicability of their findings.13,19,20,22
A number of the papers used low sensitivity troponins or failed to define AMI using the universal definition of myocardial infarction, meaning that the reference standard may incorrectly diagnose the target condition, although once again most queries were resolved via email. The use of different assays affected the applicability of the reference standard in several papers. The ST-depression was measured in many ways, using different criteria. We outlined the exact cut-offs in the diagnostic characteristics table, describing how certain researchers used the J point while other used 60 ms past the J point. Other papers left it up to the treating clinician’s discretion. This variability was similar in the T wave inversion articles, with only two papers using T wave inversion of 0.1 mV in two contiguous leads.18,20
The ST-depression criteria appear to be highly specific but poorly sensitive in 4 papers, with a specificity of 97.2%–99.3% and a sensitivity of 16.6%–20.0%.3,7,13, 22 The other papers reported a higher sensitivity of 25.7%–58.6% but a lower specificity of 86%–91.2%.3,4,14,18–20 One article had an impressive combination of sensitivity and specificity, but this was a small study with a remarkably high prevalence. 21 The vast disparities may be due to certain papers with high sensitivity using Cardiologist assessment with less stringent diagnostic criteria.4,14,19 Apart from the paper from Hess et al., T wave inversion demonstrates poor specificity; this is likely due to Hess et al. 7 using 0.2 mV of T wave inversion. The papers that looked at 0.1 mV demonstrated a sensitivity of 26.9%–46.8% and a specificity of 68.6%–86.4%.13,18,20
We do not feel that the Fourth Universal Definition will be the final iteration. As Sambola et al. note there are likely to be future changes in the ECG diagnostic criteria: global ST-depression with ST-elevation in aVR may be used to diagnose a left main coronary artery stenosis. 23 Furthermore, the way clinician’s judge ST-depression may change: Valentine et al. note that tachydysrhythmias may develop diffuse ST-segment depression resembling myocardial ischaemia that in fact represents non-ischaemic repolarization changes that the Fourth Universal Definition term ‘cardiac memory’. 24 The inclusion of this term in the new Universal Definition may affect the diagnostic performance of clinicians in the future. In light of these possible changes, we aim to prospectively assess the diagnostic accuracy of these guidelines in a multi-centre study using explicit inclusion and exclusion criteria.
Limitations
The primary outcome for this study was NSTEMI, and we considered that the optimal reference standard was adjudication according to the universal definition, but we also reviewed studies that reported the composite result of MACE. We made an a priori plan to synthesise these outcomes in two separate analyses. Unfortunately, we were unable to produce a meta-analysis due to unclear definitions and different troponin assays. The issues with synthesis were further compounded by the studies’ different ways of measuring ST-depression and T wave inversion. It appears that as one digs deeper into the literature, there is no standardised measurement of ST-depression despite the universal definition for NSTEACS being in its current incarnation for the past 6 years.8,13 This finding has concerning implications for the utility of existing clinical prediction models: while there is no consensus in the literature for what defines ischaemic ST-changes, one should not suppose that clinicians will be able to use these models accurately and reliably. This proposition is given credence by the variability in the diagnostic performance of the History, ECG, Age, Risk factors, and initial Troponin (HEART) Pathway. 25
Conclusion
Current Emergency Department research into the diagnostic accuracy of the ECG for NSTEACS demonstrates a sensitivity and specificity that varies greatly between studies. The main issue appears to be inconsistent definitions of the primary outcome and the index test. The extent of between-study heterogeneity in the approach to interpreting ST-changes on the ECG clearly identifies a pressing need for greater standardisation of the approach. This also emphasises the potential value of a prediction model using objective parameters to optimise the approach to interpreting ECGs for evidence of ischaemia.
Footnotes
Authorship
All three authors contributed to all the phases of this research.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Ethical approval
This article was a systematic review of previously published literature and did not require ethical approval.
Human rights
This article was a systematic review of previously published literature and did not infringe anyone’s human rights.
Informed consent
This paper did not use patient data and thus no consent was required.
