Concordance Rate between HER2 Immunohistochemistry and in Situ Hybridization in Gastric Carcinoma: Systematic Review and Meta-Analysis

Abstract

Purpose

The aim of this study was to investigate the diagnostic accuracy of HER2 immunohistochemistry (IHC) in gastric carcinoma (GC) through a systematic review, meta-analysis and diagnostic test accuracy review.

Method

The current study included 12,679 GC cases and 181 subsets in 45 eligible studies. We performed concordance analysis between HER2 IHC and in situ hybridization (ISH) in GC. Diagnostic test accuracy was analyzed and the area under the curve (AUC) on the summary receiver operating characteristic (SROC) curve was calculated.

Results

HER2 amplification rates were 3.0%, 31.8%, and 93.0% in the IHC score 0/1+, 2+, and 3+ groups, respectively. The concordance rates between IHC and ISH were 0.969 (95% confidence interval [CI] 0.962-0.975), 0.393 (95% CI 0.331-0.458) and 0.915 (95% CI 0.882-0.939) in the HER2 IHC score 0/1+, 2+, and 3+ groups, respectively. For all the HER2 IHC score groups, the positive rates were higher in the silver ISH (SISH) subgroup than in the fluorescence ISH (FISH) and chromogenic ISH (CISH) subgroups. In diagnostic test accuracy review, the pooled sensitivity and specificity were 0.86 (95% CI 0.84-0.87) and 0.91 (95% CI 0.90-0.91). The AUC on SROC curve was 0.958. However, there was no significant difference in the values of AUC between the ISH methods.

Conclusions

Our results showed that HER2 IHC was well concordant with ISH in HER2 IHC score 0/1+ or 3+. Although this meta-analysis showed higher diagnostic accuracy of HER2 IHC, more detailed criteria for HER2 IHC score 2+ cases will be required to predict HER2 status.

Keywords

Diagnostic test accuracy review Gastric carcinoma HER2 Immunohistochemistry In situ hybridization Meta-analysis

Introduction

Gastric carcinoma (GC) is a biologically and genetically heterogeneous tumor and one of the most common malignant tumors worldwide (1). In previous studies, several biological and molecular biomarkers have been introduced in GC. Among these markers, HER2 status is an important marker for the application of treatment modalities using the HER2 monoclonal antibody in GC (2). The HER2oncogene, also known as ERBB2 or neu, encodes a transmembrane receptor glycoprotein with tyrosine kinase activity. This protein is associated with adhesion, migration, and differentiation of tumor cells and is a well-known marker of poor prognosis in a variety of tumors (3). HER2 overexpression and/or gene amplification was found in 7%-34% of GC cases (4). A survival benefit from trastuzumab has been reported in patients with immunohistochemistry (IHC) 2+/in situ hybridization (ISH)+ or IHC 3+ (4). Therefore, the accurate evaluation of HER2 status is very important in the application of targeted HER2 monoclonal antibody therapy.

Diagnostic tests for the evaluation of HER2 status have used HER2 IHC and ISH in daily practice. For HER2 IHC score 2+ cases, an additional ISH should be conducted. Previous studies have reported the correlation between HER2 IHC and ISH (5-6-7-8-9-10-11-12-13-14-15-16-17-18-19-20-21-22-23-24-25-26-27-28-29-30-31-32-33-34-35-36-37-38-39-40-41-42-43-44-45-46-47-48-49). The studies used various IHC methods with different fixation, processing protocols, and antibody clones (5-6-7-8-9-10-11-12-13-14-15-16-17-18-19-20-21-22-23-24-25-26-27-28-29-30-31-32-33-34-35-36-37-38-39-40-41-42-43-44-45-46-47-48-49). In addition, HER2 IHC as an initial tool to evaluate HER2 status may show less agreement between pathologists in daily practice. However, whether these discrepancies influence diagnostic accuracy and concordance has not yet been fully elucidated. Cumulative studies or systematic reviews on these discrepancies are needed to elucidate the correlation between HER2 IHC and ISH.

In the present study, we performed a systematic review and meta-analysis in an attempt to investigate the concordance between HER2 IHC and ISH in GC. Subsequently, diagnostic test accuracy review was conducted for the diagnostic role of HER2 IHC in GC.

Materials and methods

Published Study Search and Selection Criteria

Relevant articles were obtained by searching the PubMed and MEDLINE databases through January 31, 2015. These databases were searched using the following keywords: “gastric carcinoma”, “HER2” or “HER-2”, “immunohistochemistry” and “in situ hybridization”. The titles and abstracts of all searched articles were screened for exclusion. Review articles were also screened to find additional eligible studies. The search results were then scanned according to the following inclusion and exclusion criteria: 1) HER2 status was investigated in human GC tissue; 2) the correlation between HER2 IHC and ISH was included; 3) case reports or non-original articles were excluded; and 4) all were English-language publications.

Data Extraction

Data from all eligible studies were extracted by 2 independent authors. The included data were extracted from each of the eligible studies (5-6-7-8-9-10-11-12-13-14-15-16-17-18-19-20-21-22-23-24-25-26-27-28-29-30-31-32-33-34-35-36-37-38-39-40-41-42-43-44-45-46-47-48-49): the first author's name, year of publication, antibody clone and manufacturer, study location, dilution ratio of antibody, cutoff value, number of patients analyzed, and ISH method. For meta-analysis, we extracted all data associated with IHC results (scores 0, 1+, 2+ and 3+) and ISH results (number of amplification and non-amplification).

Statistical Analysis

To perform the meta-analysis, all data were analyzed using the Comprehensive Meta-Analysis software package (Biostat, Englewood, NJ, USA). We investigated the rates of HER2-amplified cases according to HER2 IHC scores for meta-analysis. The concordance rates were measured by agreement rates between HER2 IHC and ISH. In addition, subgroup analyses based on ISH methods were performed. Heterogeneity between the studies was checked by the Q and I² statistics and demonstrated as p values. Additionally, sensitivity analysis was conducted to assess the heterogeneity of eligible studies and the impact of each study on the combined effect. For the assessment of publication bias, Begg's funnel plot and Egger's test were used. If significant publication bias was found, the fail-safe N and trim-and-fill tests were additionally conducted to confirm the degree of publication bias. The results were considered statistically significant at p<0.05.

Diagnostic test accuracy review was carried out using the Meta-Disc program (version 1.4) (50). Forest plots and the summary receiver operating characteristic (SROC) curve were obtained. The area under the curve on SROC and the diagnostic odds ratio (OR) were calculated by the Meta-Disc program.

Results

Selection and Characteristics of the Studies

In this systematic review and meta-analysis, 461 reports were identified in the database search. Among them, 249 were excluded because of insufficient or no information on the correlation between HER2 IHC and ISH. In addition, 90 reports were excluded because they reported the results of nongastric cancer and other diseases, or used animals or cell lines, and 77 were excluded because they were non-English or non-original articles. Finally, 45 studies were included in this systematic review and meta-analysis (Fig. 1 and Tab. I). There were 181 subsets from 45 eligible studies in terms of separate counting of IHC scores, IHC antibodies, IHC scoring system, and ISH methods. The total number of patients was 12,679. Eligible studies used various manufacturers’ antibodies and ISH methods, as shown in Table I. The overall HER2-IHC-positive rate and HER2-ISH amplification rate were 20.4% and 14.6%, respectively.

Table I

Main characteristics of the eligible studies

Author, Year	Location	Antibody	Corp.	Type of specimen	ISH	IHC 0/1+		IHC 2+		IHC 3+
Author, Year	Location	Antibody	Corp.	Type of specimen	ISH	N	ISH GA	N	ISH GA	N	ISH GA
Park 2006 (5)	Korea	human HER-2/neu	Zymed	Whole section	FISH	10	0	18	2	11	5
				Whole section	CISH	10	0	18	2	11	5
Yano 2006 (6)	Japan	HercepTest	Dako	Whole section	FISH	153	17	12	7	34	30
Kim 2007 (7)	Korea	A0485	Dako	TMA	FISH	192	4	46	5	10	10
Hofmann 2008 (8)	Germany, China, Mexico	HercepTest	Dako	Whole section	FISH	136	6	14	5	18	18
Liang 2008 (9)	China	HercepTest	Dako	Whole section	FISH	87	2	7	3	6	6
Barros-Silva 2009 (10)	Portugal	CB11	Novocastra	Whole section	FISH	25	1	18	12	25	25
Yan 2010 (11)	Singapore	4B5	Ventana	TMA	FISH	115	2	1	1	12	12
				TMA	CISH	115	2	1	1	12	12
Boers 2011 (12)	Netherlands	SP3	Labvision	Whole section	SISH	129	5	6	6	11	11
		4B5	Ventana	Whole section	SISH	123	1	6	4	17	17
		SP3	Labvision	Whole section	FISH	25	5	6	6	11	11
		4B5	Ventana	Whole section	FISH	19	1	6	4	17	17
Cidon 2011 (13)	Spain	HercepTest	Dako	Whole section	FISH	106	0	6	1	8	8
Im 2011 (14)	Korea	A0485	Dako	Whole section	FISH	118	0	10	3	12	10
Kim 2011 (15)	Korea	A0485	Dako	TMA	FISH	1080	3	289	51	106	106
Lee 2011 (16)	Australia	HercepTest	Dako	Whole section	SISH	49	2	17	6	31	30
Moelans 2011 (17)	Netherlands, Poland, Finland, USA	HercepTest	Dako	TMA	CISH	180	7	9	4	6	6
Tafe 2011 (18)	USA	4B5 (ASCO/CAP)	Ventana	Whole section	FISH	104	1	8	3	16	16
		4B5 (ToGA)	Ventana	Whole section	FISH	104	3	8	4	16	16
Wang 2011 (19)	China	HercepTest	Dako	Whole section	FISH	56	3	9	7	4	4
Yan 2011 (20)	China	HercepTest	Dako	Whole section	FISH	123	3	12	5	10	10
Cho 2012 (21)	Korea	HercepTest	Dako	TMA	FISH	249	8	18	8	22	22
		A0485	Dako	TMA	FISH	243	6	22	9	24	23
		4B5	Ventana	TMA	FISH	249	9	14	5	26	24
		CB11	Novocastra	TMA	FISH	262	15	6	2	21	21
Kim 2012 (22)	Korea	A0485	Dako	TMA	FISH	98	3	7	1	6	6
Kiyose 2012 (23)	Japan	HercepTest	Dako	TMA	FISH	148	2	13	11	37	37
				TMA	CISH	148	3	13	12	37	37
Kunz 2012 (24)	USA	A0485 (ASCO/CAP)	Dako	TMA	FISH	155	9	5	2	5	5
		A0485 (ToGA)	Dako	TMA	FISH	142	3	12	3	10	10
Liu 2012 (25)	China	SP3	Labvision	Whole section	FISH	120	3	43	6	86	60
Park 2012 (26)	Korea	HercepTest	Dako	Whole section	SISH	923	38	51	17	68	68
Yang 2012 (27)	China	Ultra Sensitive	Maixin-Bio	Whole section (Surgical)	FISH	93	1	5	3	19	18
		Ultra Sensitive	Maixin-Bio	Whole section (Biopsy)	FISH	125	2	7	2	16	14
Zhou 2012 (28)	China	MAB-0198	Maixin-Bio	Whole section	FISH	193	0	11	4	23	23
Cho 2013 (29)	Korea	4B5	Ventana	Whole section	SISH	-	-	44	21	-	-
Cruz-Reyes 2013 (30)	Mexico	HercepTest	Dako	Whole section	CISH	258	0	6	5	5	5
				Whole section	FISH	258	0	6	5	5	5
Gasljevic 2013 (31)	Slovenia	HercepTest	Dako	TMA	FISH	336	14	122	14	41	41
Gordon 2013 (32)	USA	R60	CLIA-approved clinical laboratory	Whole section	FISH	130	7	10	3	8	7
Grin 2013 (33)	Canada	4B5	Ventana	Whole section	FISH	33	0	10	1	7	5
		4B5	Ventana	Whole section	SISH	33	0	10	2	7	5
He 2013 (34)	China	HercepTest	Dako	Whole section	FISH	153	7	25	10	19	14
Kataoka 2013 (35)	Japan	4B5	Ventana	Whole section	DISH	182	1	13	7	18	18
Lee 2013 (36)	Korea	A0485	Dako	Whole section	FISH	122	17	98	54	50	48
Pala 2013 (37)	Turkey	A0485	Dako	Whole section	SISH	70	3	6	3	12	12
		HercepTest	Dako	Whole section	SISH	73	3	3	3	12	12
		A0485	Dako	Whole section	FISH	70	1	6	3	11	11
		HercepTest	Dako	Whole section	FISH	73	1	2	2	12	12
Pirrelli 2013 (38)	Italy	A0485	Dako	Whole section (Surgical)	CISH	47	0	7	1	7	6
		A0485	Dako	Whole section (Biopsy)	CISH	45	0	9	0	7	6
Shan 2013 (39)	China	4B5	Ventana	Whole section	FISH	230	3	38	11	40	39
Xu 2013 (40)	China	HercepTest	Dako	Whole section	FISH	96	0	23	8	7	7
Aizawa 2014 (41)	Japan	4B5	Ventana	TMA	FISH	815	43	94	21	-	-
Chen 2014 (42)	China	Ultra Sensitive	Maixin-Bio	Whole section	FISH	73	5	16	7	29	26
Kandel 2014 (43)	France	HercepTest	Dako	TMA	FISH	72	0	3	3	7	7
Kimura 2014 (44)	Japan	HercepTest	Dako	Whole section	FISH	-	-	21	0	18	12
Selcukbiricik 2014 (45)	Turkey	HercepTest	Dako	Whole section	SISH	62	1	8	5	11	11
Tang 2014 (46)	China	HercepTest	Dako	Whole section	FISH	87	3	19	6	15	10
Werner 2014 (47)	Germany	4B5	Ventana	TMA	SISH	78	3	9	4	9	7
Yoshida 2014 (48)	Japan	HercepTest	Dako	Whole section	FISH	140	13	25	14	35	33
Zhu 2014 (49)	China	HercepTest	Dako	Whole section	FISH	359	4	-	-	-	-

ISH = in situ hybridization; GA = gene amplification; FISH = fluorescence in situ hybridization; CISH = chromogenic in situ hybridization; SISH = silver in situ hybridization; TMA = tissue microarray; ASCO/CAP = the american society of clinical oncology/college of american pathologists; ToGA = the trastuzumab for gastric cancer.

Fig. 1

Flow chart of study search and selection methods.

Systematic Review and Meta-Analysis

Correlation between HER2 IHC and ISH

In 45 eligible studies, concordance analysis between HER2 IHC and ISH was conducted in 61, 60, and 60 subsets for the HER2 IHC score 0/1+, 2+, and 3+ groups. In the HER2 IHC score 0/1+ group, the concordance rates were 0.957 (95% confidence interval [CI] 0.952-0.962) and 0.969 (95% CI 0.962-0.975) in the fixed-effect and random-effect models, respectively (Tab. II). In the HER2 IHC score 2+ group, the concordance rates were 0.333 (95% CI 0.306-0.362) and 0.393 (95% CI 0.331-0.458) in the fixed-effect and random-effect models, respectively. For the HER2 IHC score 3+ group, the concordance rates were 0.855 (95% CI 0.825-0.880) and 0.915 (95% CI 0.882-0.939) in the fixed-effect and random-effect models, respectively. The HER2 score 0/1+, 2+, and 3+ groups showed significant heterogeneity (p<0.001, p<0.001, and p<0.001, respectively). Sensitivity analysis was conducted, and none of the eligible studies influenced the pooled concordance rate. The ranges of concordance rates in the sensitivity analysis were 0.969-0.970, 0.384-0.402, and 0.911-0.919 in the HER2 IHC score 0/1+, 2+, and 3+ groups, respectively. For the assessment of publication bias, we used Begg's funnel plot and Egger's test. In Egger's test, significant publication biases were identified in the IHC score 0/1+, 2+ and 3+ groups (p<0.001, p = 0.006, and p<0.001, respectively), and asymmetry was identified in Begg's funnel plots. The fail-safe N and trim-and-fill tests were additionally conducted to confirm the degree of publication bias. However, publication biases were not significant in the fail-safe N test or the trim-and-fill test.

Table II

Concordance analysis between HER2 immunohistochemistry and in situ hybridization

	Number of subsets	Number of patients	Fixed-effect model (95% CI)	Heterogeneity (p value)	Random-effect model (95% CI)
IHC score 0 / 1+	61	10098	0.957 (0.952, 0.962)	<0.001	0.969 (0.962, 0.975)
FISH	45	7963	0.956 (0.950, 0.961)	<0.001	0.969 (0.960, 0.976)
SISH	9	1332	0.956 (0.943, 0.966)	0.040	0.958 (0.925, 0.977)
CISH	7	803	0.986 (0.973, 0.993)	0.816	0.986 (0.973, 0.993)
IHC score 2+	60	1385	0.333 (0.306, 0.362)	<0.001	0.393 (0.331, 0.458)
FISH	44	1163	0.316 (0.286, 0.346)	<0.001	0.375 (0.305, 0.449)
SISH	10	160	0.430 (0.352, 0.512)	0.154	0.453 (0.344, 0.566)
CISH	6	62	0.379 (0.226, 0.561)	0.001	0.391 (0.118, 0.755)
IHC score 3+	60	1196	0.855 (0.825, 0.880)	<0.001	0.915 (0.882, 0.939)
FISH	44	933	0.851 (0.818, 0.879)	<0.001	0.916 (0.878, 0.943)
SISH	9	178	0.921 (0.846, 0.961)	0.196	0.933 (0.848, 0.972)
CISH	7	85	0.794 (0.639, 0.893)	0.030	0.880 (0.666, 0.964)

IHC = immunohistochemistry; CI = confidence interval; ISH = in situ hybridization; FISH = fluorescence in situ hybridization; SISH = silver in situ hybridization; CISH = chromogenic in situ hybridization.

Subgroup Analysis

We performed subgroup analysis to evaluate the low concordance rate of the HER2 IHC score 2+ group. Eligible studies in the study groups were subdivided into 3 categories according to the ISH methods: the fluorescence ISH (FISH), silver ISH (SISH), and chromogenic ISH (CISH) subgroups (Tab. II). In the random-effect model, the concordance rates were 0.375 (95% CI 0.305-0.449), 0.453 (95% CI 0.344-0.566), and 0.391 (95% CI 0.118-0.755) in the FISH, SISH, and CISH subgroups, respectively. In addition, for the HER2 IHC 0/1+ and 3+ groups, the positive rates were higher in the SISH subgroup than in the FISH and CISH subgroups. In subgroup analysis, there were no significant differences in concordance rates between the study locations.

The eligible studies used biopsied and/or surgical specimens to evaluate HER2 expression. To elucidate the impact of specimen type on the interpretation of HER2 expression, we performed subgroup analysis based on specimen types. In the IHC 2+ group, the concordance rate of the biopsied specimen subgroup was higher than that of the surgical specimen subgroup (0.588, 95% CI 0.299-0.789 vs. 0.344, 95% CI 0.280-0.415), but there was no statistical significance (p = 0.101). In addition, there was no difference in the concordance rates between biopsied and surgical specimen types in the IHC 0/1+ (0.968, 95% CI 0.917-0.988 vs. 0.965, 95% CI 0.957-0.971) and IHC 3+ groups (0.925, 95% CI 0.838-0.968 vs. 0.910, 95% CI 0.864-0.941).

Diagnostic Test Accuracy Review

To evaluate the diagnostic accuracy of HER2 IHC, we performed a diagnostic test accuracy review. In this review, HER2 IHC scores 2+ and 3+ were considered IHC positive. The sensitivity and specificity ranges of studies with HER2 IHC scores 2+ and 3+ were from 0.44 to 1.00 and from 0.31 to 1.00, respectively, and their pooled sensitivity and specificity were 0.86 (95% CI 0.84-0.87) and 0.91 (95% CI 0.90-0.91), respectively (Fig. 2). The SROC curve is shown in Figure 3, and the AUC on the SROC curve was 0.958, very close to 1. In addition, the diagnostic accuracy was investigated according to the ISH methods. The values of AUC were 0.955, 0.961, and 0.976 in the FISH, SISH, and CISH methods, respectively. Next, subgroup analysis was conducted according to the manufacturer's antibodies, which were divided into HercepTest, A0485, 4B5, and others. As shown in Table III, the pooled sensitivity and specificity were highest in the 4B5 subgroup: 0.90 (95% CI 0.86-0.93) and 0.94 (95% CI 0.93-0.95), respectively. Also the diagnostic OR and the AUC on the SROC curve were highest in the 4B5 subgroup.

Table III

Sensitivity, specificity, diagnostic odds ratio and area under the curve of summary receiver operating characteristic curve according to the HER2 antibody

Antibody	Sensitivity (95% CI)	Specificity (95% CI)	Diagnostic OR (95% CI)	AUC
HercepTest	0.82 (0.79, 0.84)	0.93 (0.92, 0.94)	102.52 (58.27, 180.37)	0.9524
A0485	0.89 (0.85, 0.92)	0.86 (0.84, 0.87)	62.44 (29.66, 131.45)	0.9489
4B5	0.90 (0.86, 0.93)	0.94 (0.93, 0.95)	164.6 (83.32, 325.18)	0.9715
Others	0.86 (0.82, 0.90)	0.88 (0.86, 0.90)	76.44 (36.09, 161.92)	0.9531

CI = confidence interval; AUC = area under the curve; OR = odds ratio.

Fig. 2

Forest plot diagram of the pooled sensitivity (A) and specificity (B) of HER2 immunohistochemistry.

Fig. 3

Summary receiver operating characteristic (SROC) curve of HER2 immunohistochemistry. AUC = area under the curve; SE = standard error.

Discussion

HER2 overexpression/gene amplification is an important factor for prognosis and treatment in GC and breast cancer. However, the concordance between HER2 IHC and ISH has not yet been fully elucidated in GC. This is the first meta-analysis of the concordance between HER2 IHC and ISH and the diagnostic accuracy of HER2 IHC in GC.

HER2 overexpression/gene amplification has been reported in up to 34% of all GC cases (4). In our systematic review, it varied from 4.2% to 54.4% in the eligible studies. In addition, the ranges of HER2 IHC positivity and gene amplification rates were 7.7%-74.4% and 3.7%-55.9%, respectively. Previous studies have shown that HER2 overexpression/gene amplification is significantly associated with poor prognosis (10, 51, 52). Furthermore, targeted therapy using the HER2 monoclonal antibody has been shown to produce survival benefit, and this treatment modality in advanced GC patients is one of the important treatment options (4). Some previous studies have reported that patients with HER2 IHC negativity and HER2 gene amplification have no therapeutic benefit from trastuzumab (4, 53). Therefore, the accurate evaluation of HER2 status could be of prime importance in the management of GC patients.

For the evaluation of HER2 status, IHC is used as an initial test and cases with IHC scores of 0/1+ or 3+ are confirmed without additional ISH tests. If the HER2 IHC score is 2+, an additional HER2 ISH test should be performed to confirm HER2 gene amplification. Although many studies have reported the concordance between HER2 IHC and ISH, we have not yet reached a conclusion on the concordance rate. In our systematic review, the concordance rate ranges were 0.800-1.000 and 0.455-1.000 in the HER2 IHC score 0/1+ and 3+ groups, respectively. However, because each eligible study used different IHC antibodies in the ISH methods, concordance analysis via a systematic review and meta-analysis is needed to elucidate the concordance between HER2 IHC and ISH. Also, diagnostic test accuracy review, which can determine why results vary between studies and compare the performances of alternative tests, should be done to confirm the diagnostic accuracy of HER2 IHC (54).

In this meta-analysis, the concordance rates between HER2 IHC and ISH were 96%, 36%, and 91% in the HER2 IHC score 0/1+, 2+, and 3+ groups, respectively, for GC, which is similar to the rates for breast cancer (55). As shown in Table II, the concordance rates were 0.969 and 0.915 in the HER2 IHC score 0/1+ and 3+ groups, respectively. HER2 IHC is useful for the evaluation of HER2 status as an initial test. However, because the discordance rates were 3.1% and 8.5% in the HER2 IHC 0/1+ and 3+ groups, respectively, the possibility of opposite results in ISH should be considered in cases with IHC scores of 0/1+ or 3+. If the false positive rate is high, this accounts for the failure to benefit from administered trastuzumab. In daily practice, precise evaluation of the HER2 status may be more important in the HER2 IHC 3+ group without additional ISH tests than in the HER2 IHC 2+ group. In addition, the choice of an ISH method or concurrent usage of IHC antibody could be important to elevate the sensitivity of the HER2 test. In the subgroup analysis in terms of ISH methods, the concordance rate was higher in the SISH subgroup than in the FISH and CISH subgroups for each of the HER2 IHC score groups. In our diagnostic test accuracy review, however, there was no significant difference in the AUC on the SROC curve between the ISH methods.

The concordance rate of the HER2 IHC 2+ group was 0.393 (95% CI 0.331-0.458) and more heterogeneous than in the HER2 IHC 0/1+ and 3+ groups. Although the HER2 IHC 2+ group is basically heterogeneous compared with the other HER2 IHC score groups, the heterogeneity of the concordance rate may have been caused by variable factors including ISH methods, IHC antibodies/methods, type of specimens, and examining pathologists. Asioli et al (56) showed that the sensitivity and accuracy of HER2 IHC were increased by additional block analysis. Thus, this heterogeneity might be influenced by tumor size and sampling for investigation of HER2 expression. Also, the intratumoral heterogeneity of HER2 expression could have contributed to the lower concordance rate in the HER2 IHC 2+ group (36). To confirm the cause of the heterogeneity, subgroup analysis was additionally performed based on specimen types, ISH methods, and IHC antibodies. Unlike breast cancer, the quantitative criteria of GC for HER2 overexpression are different between biopsied and surgical specimens. To elucidate the impact of specimen type on the interpretation of HER2 expression, we performed subgroup analysis based on specimen types, i.e., surgical and biopsied specimens. However, there was no difference in the concordance rates between biopsied and surgical specimens in the subgroup analysis. For the HER2 IHC 2+ group, the concordance rates were 0.375, 0.453, and 0.391 in the FISH, SISH, and CISH subgroups, respectively (Tab. II). Although the SISH technique was developed to overcome disadvantages of the FISH technique, whether the discrepancy between SISH and FISH may have influenced the heterogeneity of the concordance rate could not be explained in our meta-analysis. Moreover, since a HER2 IHC score of 2+ or 3+ was considered positive for IHC, we conducted a diagnostic test accuracy review. In the review, the AUCs on the SROCs were 0.955, 0.961, and 0.976 in the FISH, SISH, and CISH subgroups, respectively. Although the diagnostic accuracy was highest in CISH, the difference in the AUC on SROC between the groups was not statistically significant.

In daily practice, various HER2 antibodies, such as HercepTest, A0485, 4B5, CB11, and SP3, are used as an initial HER2 test; however, the same criteria are applied for the evaluation of HER2 IHC regardless of the antibody. Cho et al (21) reported that the rates of HER2 overexpression were different between antibodies, and that among the antibodies tested A0485 showed the highest positive rate. Furthermore, the HER2 test using CB11 had lower sensitivity and higher specificity than HER2 tests using other antibodies. In the present meta-analysis, however, the pooled sensitivity and specificity, the diagnostic OR, and the AUC value were highest in the HER2 IHC test using the 4B5 antibody. This discrepancy could have been caused by various factors including patient characteristics, IHC methods, and study location. In our meta-analysis we were unable to ascertain whether differences between antibodies affect the concordance rates. Bahreini et al (55) reported that in breast cancer, the HER2 test with CB11 and the HercepTest had higher sensitivity and specificity than tests using other antibodies. Further cumulative studies are needed to identify the ideal antibody for the HER2 test in GC.

There are some limitations to the present analysis. First, some eligible studies with a small number of patients showed low positive rates and concordance rates, which may have influenced the pooled values. Although ISH tests are expensive, multi-step and time-consuming tests compared to IHC, whether HER2 ISH as a first-line test for the evaluation of HER2 status is more advantageous could not be confirmed in our meta-analysis. In addition, because GC can show considerable intratumoral heterogeneity of HER2 expression (unlike breast cancer, which shows less heterogeneity), IHC is usually performed first to locate the HER2-positive region as well as the positivity of HER2 expression. Thus, it could be difficult to perform ISH before IHC because ISH can only be performed in a limited area. In addition, the impact of tumor size and sampling on the heterogeneity of HER2 expression could not be established in our systematic review and meta-analysis.

In conclusion, the present meta-analysis showed higher concordance rates between HER2 ISH and IHC scores 0/1+ and 3+, and higher diagnostic accuracy of HER2 IHC. The results of this meta-analysis suggest that HER2 IHC may be useful for evaluating HER2 status as an initial test and that more detailed criteria for HER2 IHC score 2+ cases will be required to predict HER2 gene amplification.

Footnotes

Financial support: Supported by grant No. 23-2015-0170 from the SNUH Research Fund.

Conflict of interest: The authors declare that they have no conflict of interest.

References

Bosman

Carneiro

Hruban

Theise

WHO classification of tumours of the digestive system. 4^th ed. Lyon, France International Agency for Research on Cancer 2010.

Gravalos

Jimeno

HER2 in gastric cancer: a new prognostic factor and a novel therapeutic target. Ann Oncol 2008 19 9 1523–1529

Kim

Jung

Lee

Yang

Kim

In situ analysis of HER2 mRNA in gastric carcinoma: comparison with fluorescence in situ hybridization, dual-color silver in situ hybridization, and immunohistochemistry. Hum Pathol 2013 44 4 487–494

Bang

Van Cutsem

Feyereislova

et al; ToGA Trial Investigators. Trastuzumab in combination with chemotherapy versus chemotherapy alone for treatment of HER2-positive advanced gastric or gastro-oesophageal junction cancer (ToGA): a phase 3, open-label, randomised controlled trial. Lancet 2010 376 9742 687–697

Park

Yun

Park

. HER-2/neu amplification is an independent prognostic factor in gastric cancer. Dig Dis Sci 2006 51 8 1371–1379

Yano

Doi

Ohtsu

. Comparison of HER2 gene amplification assessed by fluorescence in situ hybridization and HER2 protein expression assessed by immunohistochemistry in gastric cancer. Oncol Rep 2006 15 1 65–71

Kim

Jung

Lee

. Evaluation of HER-2 gene status in gastric carcinoma using immunohistochemistry, fluorescence in situ hybridization, and real-time quantitative polymerase chain reaction. Hum Pathol 2007 38 9 1386–1393

Hofmann

Stoss

Shi

. Assessment of a HER2 scoring system for gastric cancer: results from a validation study. Histopathology 2008 52 7 797–805

Liang

Zeng

Gao

. Analysis of EGFR, HER2, and TOP2A gene status and chromosomal polysomy in gastric adenocarcinoma from Chinese patients. BMC Cancer 2008 8 1 363

10.

Barros-Silva

Leitão

Afonso

. Association of ERBB2 gene status with histopathological parameters and disease-specific survival in gastric carcinoma patients. Br J Cancer 2009 100 3 487–493

11.

Yan

Yau

Bte Omar

. A study of HER2 gene amplification and protein expression in gastric cancer. J Clin Pathol 2010 63 9 839–842

12.

Boers

Meeuwissen

Methorst

HER2 status in gastro-oesophageal adenocarcinomas assessed by two rabbit monoclonal antibodies (SP3 and 4B5) and two in situ hybridization methods (FISH and SISH). Histopathology 2011 58 3 383–394

13.

Cidon

Centeno

Lagarto

Peral

HER-2 evaluation in a specific gastric cancer population with the highest rate of mortality in Spain. J Oncol 2011 2011391564

14.

Kim

. Clinicopathologic characteristics of patients with stage III/IV (M(0)) advanced gastric cancer, according to HER2 status assessed by immunohistochemistry and fluorescence in situ hybridization. Diagn Mol Pathol 2011 20 2 94–100

15.

Kim

Lee

Yang

Bang

Kim

Heterogeneous amplification of ERBB2 in primary lesions is responsible for the discordant ERBB2 status of primary and metastatic lesions in gastric carcinoma. Histopathology 2011 59 5 822–831

16.

Lee

de Boer

Fermoyle

Platten

Kumarasinghe

Human epidermal growth factor receptor 2 testing in gastric carcinoma: issues related to heterogeneity in biopsies and resections. Histopathology 2011 59 5 832–840

17.

Moelans

Milne

Morsink

Offerhaus

van Diest

Low frequency of HER2 amplification and overexpression in early onset gastric cancer. Cell Oncol (Dordr) 2011 34 2 89–95

18.

Tafe

Janjigian

Zaidinski

. Human epidermal growth factor receptor 2 testing in gastroesophageal cancer: correlation between immunohistochemistry and fluorescence in situ hybridization. Arch Pathol Lab Med 2011 135 11 1460–1465

19.

Wang

Gao

Yun

. Assessment of ERBB2 and EGFR gene amplification and protein expression in gastric carcinoma by immunohistochemistry and fluorescence in situ hybridization. Mol Cytogenet 2011 4 1 14

20.

Yan

Yau

Choo

. Dual-colour HER2/chromosome 17 chromogenic in situ hybridisation assay enables accurate assessment of HER2 genomic status in gastric cancer and has potential utility in HER2 testing of biopsy samples. J Clin Pathol 2011 64 10 880–883

21.

Cho

Srivastava

Park

. Comparison of four immunohistochemical tests and FISH for measuring HER2 expression in gastric carcinomas. Pathology 2012 44 3 216–220

22.

Kim

. The prognostic significance of HER2 positivity for advanced gastric cancer patients undergoing first-line modified FOLFOX-6 regimen. Anticancer Res 2012 32 4 1547–1553

23.

Kiyose

Igarashi

Nagura

. Chromogenic in situ hybridization (CISH) to detect HER2 gene amplification in breast and gastric cancer: comparison with immunohistochemistry (IHC) and fluorescence in situ hybridization (FISH). Pathol Int 2012 62 11 728–734

24.

Kunz

Mojtahed

Fisher

. HER2 expression in gastric and gastroesophageal junction adenocarcinoma in a US population: clinicopathologic analysis with proposed approach to HER2 assessment. Appl Immunohistochem Mol Morphol 2012 20 1 13–24

25.

Liu

Zhong

Chen

HER-2/neu overexpression is an independent prognostic factor for intestinal-type and early-stage gastric cancer patients. J Clin Gastroenterol 2012 46 4 e31–e37

26.

Park

Hwang

Park

. Comprehensive analysis of HER2 expression and gene amplification in gastric cancers using immunohistochemistry and in situ hybridization: which scoring system should we use? Hum Pathol 2012 43 3 413–422

27.

Yang

Luo

. Intratumoral heterogeneity determines discordant results of diagnostic tests for human epidermal growth factor receptor (HER) 2 in gastric cancer specimens. Cell Biochem Biophys 2012 62 1 221–228

28.

Zhou

Jiang

. Prognosis significance of HER-2/neu overexpression/amplification in Chinese patients with curatively resected gastric cancer after the ToGA clinical trial. World J Surg Oncol 2012 10 1 274

29.

Cho

Jeong

Sung

. A large cohort of consecutive patients confirmed frequent HER2 positivity in gastric carcinomas with advanced stages. Ann Surg Oncol 2013 20 Suppl 3 S477–S484

30.

Cruz-Reyes

Gamboa-Dominguez

HER2 amplification in gastric cancer is a rare event restricted to the intestinal phenotype. Int J Surg Pathol 2013 21 3 240–246

31.

Gasljevic

Lamovec

Contreras

Zadnik

Blas

Gasparov

HER2 in gastric cancer: an immunohistochemical study on tissue microarrays and the corresponding whole-tissue sections with a supplemental FISH study. Pathol Oncol Res 2013 19 4 855–865

32.

Gordon

Gundacker

Benedetti

. Assessment of HER2 gene amplification in adenocarcinomas of the stomach or gastroesophageal junction in the INT-0116/SWOG9008 clinical trial. Ann Oncol 2013 24 7 1754–1761

33.

Grin

Brezden-Masley

Bauer

Streutker

HER2 in situ hybridization in gastric and gastroesophageal adenocarcinoma: comparison of automated dual ISH to FISH. Appl Immunohistochem Mol Morphol 2013 21 6 561–566

34.

Yang

. Combined evaluation of centromere protein H and Ki-67 as prognostic biomarker for patients with gastric carcinoma. Eur J Surg Oncol 2013 39 2 141–149

35.

Kataoka

Okabe

Yoshizawa

. HER2 expression and its clinicopathological features in resectable gastric cancer. Gastric Cancer 2013 16 1 84–93

36.

Lee

Park

Yoo

. Clinical significance of intratumoral HER2 heterogeneity in gastric cancer. Eur J Cancer 2013 49 6 1448–1457

37.

Pala

Bayol

Ozguzer

Akman

HER2 status in gastric cancer: a comparison of two novel in situ hybridization methods (IQ FISH and dual color SISH) and two immunohistochemistry methods (A0485 and HercepTest™). Pathol Res Pract 2013 209 9 548–554

38.

Pirrelli

Caruso

Di Maggio

Armentano

Valentini

Are biopsy specimens predictive of HER2 status in gastric cancer patients?

Dig Dis Sci 2013 58 2 397–404

39.

Shan

Ying

HER2 expression and relevant clinicopathological features in gastric and gastroesophageal junction adenocarcinoma in a Chinese population. Diagn Pathol 2013 8 1 76

40.

Yue

Wei

. Significance of TFF3 protein and Her-2/neu status in patients with gastric adenocarcinoma. Pathol Res Pract 2013 209 8 479–485

41.

Aizawa

Nagatsuma

Kitada

. Evaluation of HER2-based biology in 1,006 cases of gastric cancer in a Japanese population. Gastric Cancer 2014 17 1 34–42

42.

Chen

Ming

Biao

Zheng

Comparison of HER2 status by fluorescence in situ hybridisation and immunohistochemistry in gastric cancer. Contemp Oncol (Pozn) 2014 18 2 95–99

43.

Kandel

Leclair

Bou-Hanna

Laboisse

Mosnier

Association of HER1 amplification with poor prognosis in well differentiated gastric carcinomas. J Clin Pathol 2014 67 4 307–312

44.

Kimura

Oki

Yoshida

. Significance of accurate human epidermal growth factor receptor-2 (HER2) evaluation as a new biomarker in gastric cancer. Anticancer Res 2014 34 8 4207–4212

45.

Selcukbiricik

Erdamar

Buyukunal

Serrdengecti

Demirelli

Is her-2 status in the primary tumor correlated with matched lymph node metastases in patients with gastric cancer undergoing curative gastrectomy?

Asian Pac J Cancer Prev 2014 15 24 10607–10611

46.

Tang

Liu

Shen

. Assessment and prognostic analysis of EGFR, HER2, and HER3 protein expression in surgically resected gastric adenocarcinomas. Onco Targets Ther 2015 87–14

47.

Werner

Battmann

Steinmetz

. The validation of a novel method combining both HER2 immunohistochemistry and HER2 dual-colour silver in situ hybridization on one slide for gastric carcinoma testing. J Transl Med 2014 12 1 160

48.

Yoshida

Yamamoto

Taniguchi

. Comparison of HER2 status between surgically resected specimens and matched biopsy specimens of gastric intestinal-type adenocarcinoma. Virchows Arch 2014 465 2 145–154

49.

Zhu

Fang

Zhang

Detection of Her-2/neu expression in gastric cancer: quantitative PCR versus immunohistochemistry. Exp Ther Med 2014 8 5 1501–1507

50.

Zamora

Abraira

Muriel

Khan

Coomarasamy

Meta-DiSc: a software for meta-analysis of test accuracy data. BMC Med Res Methodol 2006 6 1 31

51.

Wang

Zheng

Chen

Xiong

Effect of HER-2/neu over-expression on prognosis in gastric cancer: a meta-analysis. Asian Pac J Cancer Prev 2011 12 6 1417–1423

52.

Chen

Yang

. Prognostic role of human epidermal growth factor receptor in gastric cancer: a systematic review and meta-analysis. Arch Med Res 2013 44 5 380–389

53.

Van Cutsem

Bang

Feng-Yi

. HER2 screening data from ToGA: targeting HER2 in gastric and gastroesophageal junction cancer. Gastric Cancer 2015 18 3 476–484

54.

Leeflang

Deeks

Takwoingi

Macaskill

Cochrane diagnostic test accuracy reviews. Syst Rev 2013 2 1 82

55.

Bahreini

Soltanian

Mehdipour

A meta-analysis on concordance between immunohistochemistry (IHC) and fluorescence in situ hybridization (FISH) to detect HER2 gene overexpression in breast cancer. Breast Cancer 2014 Apr 10 [Epub ahead of print]

56.

Asioli

Maletta

Verdun di Cantogno

. Approaching heterogeneity of human epidermal growth factor receptor 2 in surgical specimens of gastric cancer. Hum Pathol 2012 43 11 2070–2079