Establishment and Validation of a Nomogram for Tonsil Squamous Cell Carcinoma: A Retrospective Study Based on the SEER Database

Abstract

This study aimed to establish and validate a comprehensive nomogram for predicting the cause-specific survival (CSS) probability in tonsillar squamous cell carcinoma (TSCC). We screened and extracted data from the SEER (Surveillance, Epidemiology, and End Results) database for the period 2004 to 2016. We randomly divided the 7243 identified patients into a training cohort (70%) for constructing the model and a validation cohort (30%) for evaluating the model using R software. Multivariate Cox stepwise regression was used to select predictive variables. The concordance index (C-index), the area under the time-dependent receiver operating characteristics curve (AUC), the net reclassification improvement (NRI), the integrated discrimination improvement (IDI), calibration plotting, and decision-curve analysis (DCA) were used to evaluate the model. The multivariate Cox stepwise regression analysis successfully established a nomogram for the 1-, 3-, and 5-year CSS probabilities for TSCC patients. The C-index, AUC, NRI, and IDI were all showed that the model has good discrimination. The calibration plots were very close to the standard lines, indicating that the model has a good degree of calibration, and the DCA curve further illustrated that the model has good clinical validity. We have established the first nomogram for predicting the 1-, 3-, and 5-year CSS probabilities for TSCC based on a large retrospective sample. Our rigorous validation and evaluation indicated that the model can provide useful guidance to clinical workers making clinical decisions about individual patients.

Keywords

tonsillar squamous cell carcinoma nomogram AJCC cause-specific survival seer prognostic

Introduction

Head and neck cancer includes many cancers of the oral cavity, oropharynx, and larynx. Tonsil cancer is a type of oropharyngeal cancer, with squamous cell carcinoma (SCC) being the most common histological type, as is also the case for other cancers of the head and neck.¹ According to an epidemiological study, tonsillar squamous cell carcinoma (TSCC) represents about 15 to 20% of all intraoral and oropharyngeal SCCs in the United States.² The incidence of oral cancer has declined in most parts of the world in recent years, whereas the incidence of oropharyngeal cancer has increased in several countries.³ One study found that TSCC represented the largest proportion of cancers at pharyngeal sites.⁴ However, the current understanding of tonsil cancer is insufficient, and its increasing incidence and different characteristics from other oropharyngeal cancers make it necessary to analyze it as an entity.

Some researchers have proposed that the etiology of TSCC differs from those of other oropharyngeal cancers, with TSCC patients also having a better prognosis.^5,6 Smoking and heavy drinking are recognized risk factors for head and neck cancer, but the prognostic factors for tonsil cancer remain unclear. Hammarstedt et al. found that while the incidence of lung cancer has decreased in males, the incidence of tonsil cancer is increasing by 2.6% annually.⁷ Demographic characteristics such as age, race, and sex have also been identified as prognostic factors.^8,9 Surgery, radiation, and chemotherapy in various combinations are utilized in the management of head and neck squamous cell carcinoma. Limited or early-stage disease usually treated with surgery or radiation alone. For most patients with locally advanced disease, the treatment is multimodal, with either surgery followed by adjuvant radiation or chemoradiation as indicated by pathologic features or definitive chemoradiation.^10,11

The traditional American Joint Committee on Cancer (AJCC) staging system has always been an important reference for cancer treatment. However, the AJCC staging system lacks certain demographic and pathological characteristics, and has specific limitations when applied to the prognosis of TSCC. Therefore, a more-comprehensive and detailed prediction model is needed to provide comprehensive guidance to clinical workers in a convenient manner.

Nomograms are accurate but simple tools that are widely used in tumor prediction models. A nomogram can be used to calculate the survival probability in individual patients.¹² Many researchers have established nomograms of different cancers, such as lung cancer,¹³ prostate cancer,¹⁴ and bladder cancer,¹⁵ but a nomogram specifically designed for TSCC has not been reported previously. In order to further explore the prognostic factors for TSCC and individualized treatments, we used relevant data from the Surveillance, Epidemiology, and End Results (SEER) database to establish and evaluate a TSCC nomogram.

This study analyzed some basic characteristics of TSSC patients in the SEER database and the treatment methods applied to them with the aim of establishing a comprehensive nomogram that incorporates the important demographic factors, clinicopathological characteristics, and treatment methods. Our novel nomogram can provide clinical workers with the survival probabilities of patients more comprehensively and on an individual basis, which makes it clinically superior to previous methods.

Patients and Methods

Data Sources and Research Factors

We screened and extracted data from the SEER database using the SEER*Stat software. Part of the SEER database is available to the public, and we additionally applied for access to the specific SEER chemotherapy database.¹⁶ We extracted TSCC patients from the SEER database by selecting the primary sites of TSCC using the terms “C09.0 Tonsillar fossa,” “C09.1 Tonsillar pillar,” “C09.8 Overlapping lesion of tonsil,” and “C09.9 Tonsil, NOS.” Additionally, the following ICD-O-3 (third revision of the International Classification of Diseases for Oncology) histology/behavior codes for TSCC were selected: “8070/3: Squamous cell carcinoma, NOS,” “8071/3: Squamous cell carcinoma, keratinizing, NOS,” “8072/3: Squamous cell carcinoma, large cell, nonkeratinizing, NOS,” and “8083/3: Basaloid squamous cell carcinoma.”

We selected several factors that may be associated with the disease prognosis, including age at diagnosis, race, sex, marital status, tumor grade, tumor size, laterality, AJCC stage, surgery status, radiotherapy status, and chemotherapy status. The AJCC staging system is determined by the TNM staging system, so it includes tumor extension, lymph nodes metastasis, and distant metastasis. If these variables are included in the analysis together, it will lead to severe multicollinearity, so this study only included the AJCC staging system. The outcome variable was cancer-specific survival (CSS). The data obtained in this study from the SEER database do not include personally identifiable information, and so it was not necessary to obtain informed patient consents.

Date Sorting

We selected data on patients for whom complete basic information and information on survival time were available. The tumor grade is divided into 4 levels according to the SEER database. The 4-grade system describes the tumor as Grade I: well-differentiated; Grade II: moderately differentiated; Grade III: poorly differentiated; Grade IV: undifferentiated or anaplastic.¹⁷ We employed the seventh edition of the AJCC staging system. Tumor size was divided into 3 grades: <2 cm, 2 to 4 cm, and >4 cm. Applying the above methods initially identified 9811 TSCC patients for the period 2004 to 2016. After excluding 2100 patients with unclear pathological grading, 38 with unknown AJCC stage, and 430 with unknown tumor size, the study finally included 7243 patients with TSCC. We randomly divided these patients into a training cohort (70%) to construct the model and a validation cohort (30%) to evaluate the model using R software (version 3.4.1, http://www.r-project.org). The data screening process is shown in Figure 1.

Figure 1.

Flowchart of sample selection.

Nomogram and Statistical Analysis

A log-rank test performed after allocating all of the subjects to the 2 study groups demonstrated that there were no statistically significant intergroup differences. We then used SPSS Statistics software (version 23.0, IBM SPSS, Chicago, IL, USA) to describe the basic characteristics of all factors for the 2 study cohorts. The age at diagnosis was expressed as median and interquartile range (IQR) values, while other categorical variables were represented as frequencies and percentages. Cox regression was used to identify factors associated with CSS from TSCC (p = 0.05), and these factors were used to establish a nomogram for predicting the 1-, 3-, and 5-year CSS probabilities for TSCC. After establishing the nomogram, we employed a series of indicators to evaluate the model. The concordance index (C-index) and the area under the time-dependent receiver operating characteristics (ROC) curve (AUC) were used to evaluate the discrimination ability of the nomogram. The AUC and C-index are widely used, but their increment is not obvious when comparing 2 present models. Therefore, in order to determine whether the new model was advantageous, we also applied 2 relatively new indicators: the net reclassification improvement (NRI) and integrated discrimination improvement (IDI). The NRI is mainly used to compare the predictive powers of new and old models at a set tangent level, while the IDI considers different tangent lines, which can be used to assess the overall improvement of the model.^18,19 These 2 indicators are easy to calculate and understand in practical clinical applications.

We drew a calibration plot to visually reflect the difference between 2 values. The degree of calibration of a model reflects the degree of consistency between its predicted and actual values. The consistency of a model is better when its calibration curve is closer to the 45-degree standard line. Finally, we used the decision-curve analysis (DCA) curve to evaluate the clinical validity of the model. The abscissa and ordinate of a DCA curve are the threshold probability and net benefit, respectively, of the model. A model with a higher DCA curve provides a greater net benefit.²⁰

All of the statistical analyses were conducted with the SPSS Statistics and R software packages. SPSS Statistics software was used to describe the basic characteristics of the cohorts, while R software was used to randomly divide the data into training and validation cohorts, and perform the log-rank test. The Cox regression analysis, proportional-risk construction test, and the establishment and evaluation of the nomogram were completed using R software with the following R packages: survival, rms, foreign, survival, survival ROC, nricens, and DCA packages. A bilateral probability value of p < 0.05 was considered to be indicative of statistical significance.

Results

General Characteristics

After randomly dividing 7243 patients into 2 cohorts, we applied the log-rank test, and the obtained probability value (p = 0.8) indicated that there was no significant difference between these cohorts. We then used SPSS to describe the basic demographic and clinical characteristics of 2 cohorts, as listed in Table 1.

Table 1.

Demographic and Clinical Characteristics of the 2 Cohorts of Patients.

Variable	Training Cohort	Validation Cohort
Number of Patients n (%)	5070(70)	2173(30)
Age of diagnosis	59(53-66)	58(52-65)
Sex n (%)
Male	4176(82.4)	1775(81.7)
Female	894(17.6)	398(18.3)
Race n (%)
White	4472(88.2)	1927(88.7)
Black	419(8.3)	162(7.5)
Other	179(3.5)	84(3.9)
Marital status n (%)
Married	3848(75.9)	1636(75.3)
Unmarried	924(18.2)	400(18.4)
Other	298(5.9)	137(6.3)
Site n (%)
C09.0	612(12.1)	244(11.2)
C09.1	296(5.8)	132(6.1)
C09.8	42(0.8)	21(1.0)
C09.9	4120(81.3)	1776(81.7)
ICD n (%)
8070	3320(65.5)	1483(68.2)
8071	621(12.2)	253(11.6)
8072	811(16.0)	313(14.4)
8083	318(6.3)	124(5.7)
Grade n (%)
I	216(4.3)	95(4.4)
II	2019(39.8)	891(41.0)
III	2764(54.5)	1161(53.4)
IV	71(1.4)	26(1.2)
Size n (%)
<2	1399(27.6)	569(26.2)
[2,4)	2567(50.6)	1098(50.5)
≥4	1104(21.8)	506(23.3)
Laterality n (%)
Left	2478(48.9)	1108(51.0)
Right	2558(50.5)	1052(48.4)
Bilateral	25(0.5)	10(0.5)
Other	9(0.2)	3(0.1)
AJCC stage n (%)
I	323(6.4)	111(5.1)
II	402(7.9)	203(9.3)
III	1105(21.8)	467(21.5)
IVA	2763(54.5)	1180(54.3)
IVB	372(7.3)	152(7.0)
IVC	105(2.1)	60(2.8)
Surgery n (%)
Yes	2841(56.0)	1197(55.1)
NO/Unknown	2229(44.0)	976(44.9)
Radiotherapy n (%)
Yes	4280(84.4)	1854(85.3)
NO/Unknown	790(15.6)	319(14.7)
Chemotherapy n (%)
Yes	3373(66.5)	1478(68.0)
NO/Unknown	1697(33.5)	695(32.0)

ICD = International Classification of Diseases.

The median age at diagnosis was 59 years (IQR = 53-65 years) in the training cohort and 58 years (IQR = 52-65 years) in the validation cohort. Most of the patients were male (82.4% and 81.7% in the training and validation cohorts, respectively), white (88.2% and 88.7%), and married (75.9% and 75.3%). The primary tumor site in most patients was C09.9, and the predominant histological type was 8070/3. Most patients had tumors of grade II (about 40%) and grade III (about 54%). About half of the patients (50.6% and 50.5% in the training and validation cohorts, respectively) had tumor diameters of 2 to 4 cm. The TSCC was on the left in 48.9% and 51.0% of those in the training and validation cohorts, respectively, and on the right in 50.5% and 48.4%. The AJCC stage for most patients was stage IVA. Most patients had received surgery, radiotherapy, or chemotherapy. The median survival time was 35 years (IQR = 19-55 months) in the training cohort and 34 years (IQR = 18-55 months) in the validation cohort.

Constructing a Nomogram Using the Training Cohort

After performing a multivariate Cox stepwise regression analysis, we screened out the following 8 factors related to CSS (p < 0.05): age at diagnosis, race, marital status, tumor grade, tumor size, AJCC stage, surgery status, and radiotherapy status. Table 2 details the variables that were significant after the multivariate Cox regression analysis, which were age at diagnosis (hazard ratio [HR] = 1.027, p < 0.001), black (HR = 1.549, p < 0.001 versus white), unmarried (HR = 1.340, p < 0.01 versus married), grade III (HR = 0.554, p < 0.001 versus grade I), grade IV (HR = 0.238, p < 0.05 versus grade I), size of 2 to 4 cm (HR = 1.377, p < 0.05 versus <2 cm), size >4 cm (HR = 1.988, p < 0.001 versus <2 cm), AJCC stage III (HR = 1.889, p < 0.05 versus AJCC stage I), AJCC stage IVA (HR = 2.946, p < 0.001 versus AJCC stage I), AJCC stage IVB (HR = 5.268, p < 0.001 versus AJCC stage I), AJCC stage IVC (HR = 14.319, p < 0.001 versus AJCC stage I), no/unknown surgery (HR = 2.460, p < 0.001 versus surgery), and no/unknown radiotherapy (HR = 2.646, p < 0.001 versus radiotherapy).

Table 2.

Selected Variables by Multivariate Cox Stepwise Regression Analysis.

Variable	Multivariate analysis
Variable	HR	95%CI	p-value
Age of diagnosis	1.027	1.019-1.035	0.000***
Race
White	Reference
Black	1.549	1.244-1.930	0.000***
Other	1.073	0.718-1.604	0.730
Marital status
Married	Reference
Unmarried	1.340	1.117-1.608	0.001**
Other	0.945	0.677-1.317	0.737
Grade
I	Reference
II	0.751	0.534-1.055	0.099
III	0.554	0.394-0.778	0.000***
IV	0.238	0.073-0.774	0.017*
Size
<2	Reference
2-4	1.377	1.077-1.761	0.011*
≥4	1.988	1.530-2.583	0.000***
AJCC stage
I	Reference
II	1.151	0.584-2.268	0.684
III	1.889	1.051-3.393	0.033*
IVA	2.946	1.673-5.188	0.000***
IVB	5.268	2.912-9.529	0.000***
IVC	14.319	7.800-26.286	0.000***
Surgery
Yes	Reference
NO/Unknown	2.460	2.049-2.954	0.000***
Radiotherapy
Yes	Reference
NO/Unknown	2.646	2.172-3.222	0.000***

HR = hazard ratio; ∗p < 0.05; ∗∗p < 0.01; ∗∗∗p < 0.001.

Figure 2 shows the nomogram that we finally constructed, which is a simple graph based on the multiple regression model that can be used to comprehensively predict the probability of CSS based on the above related indicators. Figure 2 shows that the AJCC stage has the greatest impact on survival rate, followed by age of diagnosis, surgery status, radiotherapy status, tumor size, and finally tumor grade, marital status, and race. Each factor is included as a line segment on the nomogram, and the numerical scale on the line segment indicates the degree of risk contributed by this factor. Adding the scores for all of the factors for an individual patient yields the total scores corresponding to the 1-, 3-, and 5-year CSS probabilities for that patient.

Figure 2.

Nomogram predicting 1-, 3-, and 5-years CSS probability. Mari-marital status; Surg –surgery status; Rad – radiotherapy status.

Evaluating the Nomogram Using the Validation Cohort

The C-index of the nomogram model is 0.766 in the training cohort and 0.751 in the validation cohort. We then plotted the 1-, 3-, and 5-year ROC curves, and calculated the corresponding AUCs. The 1-, 3-, and 5-year AUCs were 0.837, 0.781, and 0.768, respectively, in the training cohort, and 0.788, 0.769, and 0.758 in the validation cohort (Figure 3).

Figure 3.

ROC curves. The area under the ROC curve (AUC) for 1-, 3-, and 5-years CSS probability of the training cohort (A) and validation cohort (B).

We used NRI and IDI to evaluate the discrimination ability of the nomogram. The NRI values for 1-, 3-, and 5-year CSS probabilities were 0.370 (95% confidence interval [CI] = 0.306-0.464), 0.511 (95% CI = 0.426-0.599), and 0.487 (95% CI = 0.430-0.627), respectively, in the training cohort, and 0.357 (95% CI = 0.245-0.496), 0.545 (95% CI = 0.419-0.688), and 0.515 (95% CI = 0.345-0.637) in the validation cohort. The IDI values for 1-, 3-, and 5-year CSS probabilities were 0.050, 0.087, and 0.098, respectively (p < 0.001), in the training cohort, and 0.041, 0.071, and 0.082 (p < 0.001) in the validation cohort.

Calibration plots were used to verify the consistency between the actual and ideal values of the model after verifying its discrimination ability. As shown in Figure 4, the calibration plots for 1-, 3-, and 5-year CSS probabilities for the model are very close to the standard lines, indicating that the model has a good degree of calibration.

Figure 4.

Calibration curves. Calibration curves for 1-, 3-, and 5-years CSS probability depict the calibration of each model in terms of the agreement between the predicted probabilities and observed outcomes of the training cohort (A, B, C) and validation cohort (D, E, F).

Finally, we plotted DCA curves to illustrate the clinical effectiveness of the nomogram. The survival probability curves for the new model in Figure 5 are all higher than those for the AJCC model, which means that the net benefits in using the model to predict the 1-, 3-, and 5-year CSS probabilities are significantly greater than those obtained when using the AJCC model.

Figure 5.

Decision curve analysis curves. Decision curve analysis of the training cohort (A, B, C) and validation cohort (D, E, F) for 1-, 3-, and 5-years CSS probability.

Discussion

Head and neck cancer constitutes a complex system of tumors that can occur in many locations. Most of the published research studies have considered it as a single system, but tonsil cancer has characteristics that differ from those of other head and neck tumors,²¹ and its incidence has increased recently in some countries.^22,23 In addition, tonsil cancer is more sensitive to radiotherapy and has a better prognosis than some other head and neck cancers.²⁴ These characteristics indicate the need to establish a specific clinical prediction nomogram for tonsil cancer in order to help clinicians to make better decisions. We therefore used the SEER database to successfully construct a prognosis nomogram based on a comprehensive analysis of the demographic characteristics and clinicopathological features. This study compared our novel model with the AJCC staging system to determine whether it is superior.

The results from the Cox regression as included in the nomogram show that the AJCC stage is the factor that has the greatest influence on the CSS probability, which is mainly due to the AJCC staging system containing information about the regional lymph node metastasis and distant metastasis, both of which are very important prognostic factors for TSCC.^25,26 Among demographic characteristics, age has always been an important prognostic factor for tumors, and the present results are no exception. In addition, being of black race presents a worse prognosis than being white or another race, which is consistent with the findings of a previous study.²⁷ The incidence of tonsil cancer was previously found to be higher in males than in females,⁴ but sex was not a prognostic factor in the present study. A particularly interesting aspect of the present study is that few previous studies have explored the influence of marital status on the prognosis of TSCC, whereas this study found that being unmarried is a risk factor for the prognosis. In terms of clinicopathological features, there was no difference in prognosis between TSCC on the left and right sides. The size of the primary tumor is known to affect the choice of treatment, outcome, and prognosis.²⁸ Our study found that the tumor size significantly affected the TSCC CSS probability, as did the tumor grade. As can be seen from Figure 2, the prognosis is poor for well-differentiated tumors and good for poorly differentiated tumors, which might be due to poorly differentiated cells being more sensitive to chemotherapy or radiotherapy.

Surgery and radiotherapy treatment were also significant prognostic factors. Radiotherapy exerts different effects on head and neck tumors in different locations.^29,30 Radiotherapy is currently the preferred treatment modality in clinical practice, while the role of chemotherapy has not been report previously. The present study found that chemotherapy was not a prognostic factor for TSCC.

After constructing the nomogram and considering the identified prognostic factors, we performed a series of evaluations on the model, which are essential for any clinical prediction model before it is used in practice. We first verified the discrimination power of the model. The traditional ROC curve is a relatively intuitive method,³¹ and Figure 3 shows that the AUC was >0.75 for the nomogram. This indicates that the nomogram has good overall discrimination performance. In addition, for survival data, the C-index is a more-general indicator for predicting the model discrimination ability.³² The present results also show that the new model has a good discrimination ability. Compared with the AUC and C-index, the NRI focuses more on changes in the number of research objects correctly classified by the 2 models at a certain set of cutoff points, which are often used to compare the accuracies of the prediction abilities of 2 models.³³ The NRI shows that the proportions of correct classifications for the 1-, 3-, and 5-year CSS probabilities increased by 37.0%, 51.1%, and 48.7%, respectively, in the training cohort, and by 35.7%, 54.5%, and 51.5% in the validation cohort (p < 0.001). The IDI is another indicator that considers the situation of different cutoff points, which can be used to reflect the overall improvement of the model, and this to some extent complements the NRI.³⁴ The IDI values revealed that the new model has an improved prediction ability compared with the AJCC model for the 1-, 3-, and 5-year CSS probabilities, by 5.0%, 8.7%, and 9.8%, respectively, in the training cohort, and by 4.1%, 7.1%, and 8.2% in the validation cohort (p < 0.001).

The above-4 indicators clearly show that the nomogram has a good discrimination ability, and provides preliminarily evidence that the model has the ability to correctly classify the survival probability in TSCC patients. We further verified the calibration degree of the model by drawing a calibration plot. As can be seen in Figure 4, the calibration curve of the model is very close to the standard line and shows an even distribution, indicating that the incidence rates predicted by the model are very close to the actual incidence rates; that is, the model exhibits good consistency. Combined with the evaluation of the discrimination ability and calibration, the good overall performance of the model has been demonstrated, indicating that it can be used to predict the 1-, 3-, and 5-year CSS probabilities for TSCC patients.

Finally, we assessed the clinical effectiveness of the model. DCA is being employed by an increasing number of researchers to assess the net benefit to patients of receiving clinical treatment. The horizontal line in Figure 5 represents the net benefit of treating no males, while the oblique line represents the net benefit of a strategy of treating all males.^19,35 It can be seen from the figure that the overall net benefit of the new model is higher than that of the AJCC staging system, and that the threshold of the survival probability is higher. This indicates that the new model can bring greater net benefits to patients and help clinicians to make better clinical decisions.

This study naturally has some limitations. First of all, it had a retrospective design and analyzed data obtained from the SEER database, which may have resulted in information bias. The second limitation is that the study factors were not sufficiently comprehensive, with some genetic markers, biological markers, behavioral habits, and other factors not being included in the study. A future cohort study is needed to more-accurately identify the significant prognostic factors, especially HPV status or expression of P16. Incorporating more prognostic factors and validating the model with an external cohort to obtain the most-accurate results will be a focus of our future research.

Conclusion

In summary, we have established the first nomogram for predicting the 1-, 3-, and 5-year CSS probabilities for TSCC patients based on a large retrospective population. This nomogram contains both demographic and clinicopathological factors, and the rigorous validation and evaluation indicate that the model can provide useful and straightforward guidance to clinical workers making clinical decisions for individual patients. We look forward to building a more-comprehensive nomogram based on a wider range of data sources in the future.

Footnotes

Authors’ Contributions

Chengzhuo Li and Jin Yang contributed equally to this work. JL and CZL designed the study; JY, SZ, and FSX collected and analyzed the data; YJ collected important background information; CZL drafted the initial manuscript; SPW and JL revised the article critically; DDH, LB, and YLW reviewed and edited the article; CZL and JY are co-first authors; SPW and JL are correspondence authors. All authors approved the final manuscript.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Ethics Statement

SEER collects cancer incidence data from population-based cancer registries covering approximately 34.6 percent of the U.S. population. SEER releases a standard set of research data every spring based on the previous November’s submission of data from the registries. Because the database used contains publicly available information and no personal identifiers, the study did not require approval of the Institutional Reviewer Board. We accessed these through the SEER*Stat software with additional approvals.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The study was supported by The National Social Science Foundation of China (grant no. 16BGL183).

ORCID iD

Jun Lyu

References

Marur

D’Souza

Westra

Forastiere

. HPV-associated head and neck cancer: a virus-related cancer epidemic. Lancet Oncol. 2010;11(8):781–789.

Frisch

Hjalgrim

Jæger

Biggar

. Changing patterns of tonsillar squamous cell carcinoma in the United States. Cancer Causes and Control. 2000;11(0957-5243):489–495.

Chaturvedi

Anderson

Lortet-Tieulent

, et al. Worldwide trends in incidence rates for oral cavity and oropharyngeal cancers. J Clin Oncol. 2013;31(36):4550–4559.

Shiboski

Schmidt

Jordan

. Tongue and tonsil carcinoma: increasing trends in the U.S. population ages 20-44 years. Cancer. 2005;103(9):1843–1849.

D’Souza

Kreimer

Viscidi

, et al. Case–control study of human papillomavirus and oropharyngeal cancer. N Engl J Med. 2007;356(19):1944–1956.

Haeggblom

Attoff

Hammarstedt-Nordenvall

Nasman

. Human papillomavirus and survival of patients per histological subsite of tonsillar squamous cell carcinoma. Cancer Med. 2018;7(5):1717–1722.

Hammarstedt

Dahlstrand

Lindquist

, et al. The incidence of tonsillar cancer in Sweden is increasing. Acta Oto-Laryngologica. 2007;127(9):988–992.

Micheli

Ciampichini

Oberaigner

, et al. The advantage of women in cancer survival: an analysis of EUROCARE-4 data. Eur J Cancer. 2009;45(6):1017–1027.

Ildstad

Tollerud

Bigelow

Remensnyder

. A multivariate analysis of determinants of survival for patients with squamous cell carcinoma of the head and neck. Annal Surg. 1989;209(2):237–241.

10.

Marur

Forastiere

. Head and neck squamous cell carcinoma: update on epidemiology, diagnosis, and treatment. Mayo Clin Proc. 2016;91(3):386–396.

11.

Denaro

Russi

Adamo

Colantonio

Merlano

. Postoperative therapy in head and neck cancer: state of the art, risk subset, prognosis and unsolved questions. Oncology 2011;81(1):21–29.

12.

Balachandran

Gonen

Smith

DeMatteo

. Nomograms in oncology: more than meets the eye. Lancet Oncol. 2015;16(4):e173–e180.

13.

Liang

Zhang

Jiang

, et al. Development and validation of a nomogram for predicting survival in patients with resected non-small-cell lung cancer. J Clin Oncol. 2015;33(8):861–869.

14.

Brockman

Alanee

Vickers

, et al. Nomogram predicting prostate cancer-specific mortality for men with biochemical recurrence after radical prostatectomy. Eur Urol. 2015;67(6):1160–1167.

15.

Kluth

Black

Bochner

, et al. Prognostic and prediction tools in bladder cancer: a comprehensive review of the literature. Eur Urol. 2015;68(2):238–253.

16.

Yang

Liu

, et al. Brief introduction of medical database and data mining technology in big data era. J Evid Based Med. 2020;13(1):57–69.

17.

Mehta

Schantz

. Population-based analysis of oral and oropharyngeal carcinoma: changing trends of histopathologic differentiation, survival and patient demographics. Laryngoscope. 2010;120(11):2203–2212.

18.

Parikh

Coca

Thiessen-Philbrook

, et al. Postoperative biomarkers predict acute kidney injury and poor outcomes after adult cardiac surgery. J Am Soc Nephrol. 2011;22(9):1748–1757.

19.

Steyerberg

Vickers

Cook

, et al. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology. 2010;21(1):128–138.

20.

Vickers

Cronin

Elkin

Gonen

. Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers. BMC Med Inform Decis Mak. 2008;8(1):53.

21.

Kreimer

Clifford

Boyle

Franceschi

. Human papillomavirus types in head and neck squamous cell carcinomas worldwide: a systematic review. Cancer Epidem Biomar. 2005;14(2):467–475.

22.

Braakhuis

Visser

Leemans

. Oral and oropharyngeal cancer in the Netherlands between 1989 and 2006: Increasing incidence, but not in young adults. Oral Oncol. 2009;45(9): e85–89.

23.

Conway

Stockton

Warnakulasuriya

Ogden

Macpherson

. Incidence of oral and oropharyngeal cancer in United Kingdom (1990-1999)—recent trends and regional variation. Oral Oncol. 2006;42(6):586–592.

24.

Pulte

Brenner

. Changes in survival in head and neck cancers in the late 20th and early 21st century: a period analysis. Oncologist. 2010;15(9):994–1001.

25.

Goldenberg

Begum

Westra

, et al. Cystic lymph node metastasis in patients with head and neck cancer: an HPV-associated phenomenon. Head Neck. 2008;30(7):898–903.

26.

Vainshtein

Spector

Ibrahim

, et al. Matted nodes: high distant-metastasis risk and a potential indication for intensification of systemic therapy in human papillomavirus-related oropharyngeal cancer. Head Neck. 2016;38(1):E805–814.

27.

Albert

Giri

Kanakamedala

, et al. Racial disparities in tumor features and outcomes of patients with squamous cell carcinoma of the tonsil. Laryngoscope. 2019;129(3):643–654.

28.

Woolgar

. Histopathological prognosticators in oral and oropharyngeal squamous cell carcinoma. Oral Oncol. 2006;42(3):229–239.

29.

Parsons

Mendenhall

Stringer

, et al. Squamous cell carcinoma of the oropharynx: surgery, radiation therapy, or both. Cancer. 2002;94(11):2967–2980.

30.

Yao

Dornfeld

Buatti

, et al. Intensity-modulated radiation treatment for head-and-neck squamous cell carcinoma—the University of Iowa experience. Int J Radiat Oncol Biol Phys. 2005;63(2):410–421.

31.

Hanley

McNeil

. A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology. 1983;148(3):839–843.

32.

Wang

Xia

, et al. Prognostic nomogram for intrahepatic cholangiocarcinoma after partial hepatectomy. J Clin Oncol. 2013;31(9):1188–1195.

33.

Iki

Fujita

Tamaki

, et al. Trabecular bone score may improve FRAX(R) prediction accuracy for major osteoporotic fractures in elderly Japanese men: the Fujiwara-kyo Osteoporosis Risk in Men (FORMEN) Cohort Study. Osteoporos Int. 2015;26(6):1841–1848.

34.

Chambless

Cummiskey

Cui

. Several methods to assess improvement in risk prediction models: extension to survival analysis. Stat Med. 2011;30(1):22–38.

35.

Vickers

van Calster

Steyerberg

. A simple, step-by-step guide to interpreting decision curve analysis. Diagn Progn Res. 2019;3(1):18.