Indolizine Compound Selection for HPV Anticancer Active Prediction Using CNN Classifier with ADME Descriptors

Abstract

Despite the significant progress made in developing different in silico methodology for structure activity research over the past few decades. The ability to predict correlation structure activity (CSA) from absorption distribution metabolism excretion (ADME) descriptors to select indolizine compounds for human papilloma virus (HPV) anticancer activity continues to pose a challenge. This study employed five machine learning (ML) algorithms for classification, viz., stochastic gradient descent (SGD), random forest (RF), support vector machine (SVM), convolutional neural network (CNN), and logistic regression (LR), to perform the classification based on ADME-related physiochemical descriptors of 8,900 indolizine compounds to predict the CSA. The present study focuses on 26 well-known parameters to optimize the results, which are utilized for ML models SGD, RF, SVM, CNN, and LR for classification. The CNN achieved the best results with the highest overall accuracy and average loss values of 98.33% and 0.16, respectively. On the other hand, the SGD, RF, SVM, and LR recorded the accuracy values of 95.32%, 93.23%, 96.03%, 94.03%, and loss values of 0.046, 0.067, 0.039, and 0.059, respectively. It is stated that from the obtained results, the CNN is performing better compared to other methods. The cross-validation and results are done with the relationship of descriptors, viz., accuracy, correlation, distribution, area under the receiver operating characteristic, area under the precision recall curve, and bootstrap error analysis. This study demonstrated the utility of ML to facilitate early prediction of indolizine compounds for HPV anticancer activity in preclinical development.

Keywords

ADME SGD RF SVM CNN LR indolizine cancer HPV

Get full access to this article

View all access options for this article.

References

Siegel

, Giaquinto

, Jemal

. Cancer statistics, 2024. CA Cancer J Clin, 2024; 74(1):12–49.

Zur Hausen

. Papillomaviruses and cancer: From basic studies to clinical application. Nat Rev Cancer, 2002; 2(5):342–350; doi: 10.1038/nrc798

McLaughlin-Drubin

and, Münger

. Oncogenic activities of human papillomaviruses. Virus Res, 2009; 143(2):195–208.

Moody

and, Laimins

. Human papillomavirus oncoproteins: Pathways to transformation. Nat Rev Cancer, 2010; 10(8):550–560.

Schelhaas

. Come in and take your coat off–how host cells provide endocytosis for virus entry. Cell Microbiol, 2010; 12(10):1378–1388.

Doorbar

. The papillomavirus life cycle. Journal of Clinical Virology, 2005; 32:7–15.

Graham

. The human papillomavirus replication cycle, and its links to cancer progression: A comprehensive review. Clin Sci (Lond), 2017; 131(17):2201–2221.

Aksoy

, Gottschalk

, Meneses

. HPV entry into cells. Mutat Res Rev Mutat Res, 2017; 772:13–22.

Kono

, Ozawa

, Laimins

. The roles of DNA damage repair and innate immune surveillance pathways in HPV pathogenesis. Virology, 2024; 600:110266.

10.

Ciurlă-Lucescu

, Bîcu

, Belei

, et al. New indazole-indolizine-triazine hybrid molecules with farnesyltransferase inhibitory activity. Results Chem, 2024; 7:101451.

11.

Sandeep

, Venugopala

, Mohammed

, et al. Review on chemistry of natural and synthetic indolizines with their chemical and pharmacological properties. J Basic Clin Pharm, 2016; 8:49–61.

12.

Dawood

, Abbas

. Inhibitory activities of indolizine derivatives: A patent review. Expert Opin Ther Pat, 2020; 30(9):695–714.

13.

Keri

, et al. Indolizine derivatives as potential anticancer agents: Design, synthesis and biological evaluation. Eur J Med Chem, 2017; 127:107–120.

14.

Kumar

, et al. Tubulin-targeting indolizine scaffolds as potent antiproliferative agents: NCI-60 screening and SAR analysis. Bioorg Chem, 2023; 132:106357.

15.

Bhosle

, et al. Design and synthesis of indolizine–phenothiazine hybrids as dual tubulin and farnesyltransferase inhibitors. Eur J Med Chem, 2020; 186:111877.

16.

Gulevich

, et al. Synthesis and antiproliferative activity of indolizine-fused lactones. RSC Adv, 2023; 13:21567–21577.

17.

Zhang

, et al. Novel indolizine derivatives as tubulin destabilizers with potent anticancer activity. J Med Chem, 2024; 67(4):3125–3140.

18.

Haddad

, et al. Anticancer activity of pyrido[2,3-b]indolizine derivatives in colorectal cancer models. Bioorganic & Medicinal Chemistry Letters, 2014; 24(9):2137–2141.

19.

Wang

, et al. Synthesis and anticancer evaluation of indolizine–chalcone hybrids as apoptosis inducers. Bioorganic & Medicinal Chemistry Letters, 2018; 28(23):3645–3649.

20.

, et al. Indolizine-based small molecules trigger mitochondrial apoptosis in hepatocellular carcinoma cells. Bioorganic & Medicinal Chemistry, 2019; 27(15):3300–3310.

21.

Di Francesco

, et al. Pyrimidopyrrolizine derivatives as P-glycoprotein inhibitors and anticancer agents. Eur J Med Chem, 2021; 210:112987.

22.

Chen

, et al. Indolizine derivatives induce ER stress-mediated apoptosis through PI3K/Akt pathway inhibition in lung cancer cells. Bioorg Chem, 2020; 99:103827.

23.

Doorbar

, Quint

, Banks

, et al. The biology and life-cycle of human papillomaviruses. Vaccine, 2012; 30(Suppl 5):F55–F70.

24.

Hoppe-Seyler

, Bossler

, Braun

, Herrmann

, Hoppe-Seyler

. The HPV E6/E7 oncogenes: Key factors for viral carcinogenesis and therapeutic targets. Trends Microbiol, 2018; 26(2):158–168.

25.

Mesri

, Feitelson

, Munger

. Human viral oncogenesis: A cancer hallmarks analysis. Cell Host Microbe, 2014; 15(3):266–282.

26.

Wang

, Wang

H-K

, Li

, et al. MicroRNAs are biomarkers of oncogenic human papillomavirus infections. Proc Natl Acad Sci USA, 2014; 111(11):4262–4267.

27.

Yuan

, Filippova

, Monks

, et al. Small-molecule inhibitors of the HPV16–E6 interaction with caspase 8. Proceedings of the National Academy of Sciences, 2012; 109(10):3789–3794.

28.

Leemans

, Snijders

PJF

, Brakenhoff

. The molecular landscape of head and neck cancer. Nat Rev Cancer, 2018; 18(5):269–282.

29.

Gillison

, Trotti

, Harris

, et al. Radiotherapy plus cetuximab or cisplatin for human papillomavirus–positive oropharyngeal cancer: A randomized, multicentre, non-inferiority trial. N Engl J Med, 2019; 381(2):189–199.

30.

Mayr

, Klambauer

, Unterthiner

, Hochreiter

. DeepTox: Toxicity prediction using deep learning. Front Environ Sci, 2016; 3:80.

31.

Lusci

, Pollastri

, Baldi

. Deep architectures and deep learning in chemoinformatics: The prediction of aqueous solubility for drug-like molecules. J Chem Inf Model, 2013; 53(7):1563–1575.

32.

Wijnhoven

RGJ

, With

PHN

. Fast training of object detection using stochastic gradient descent. In: 2010 10th International Conference on Pattern Recognition. IEEE; 2010. p. 112.

33.

Deepa

, Prabadevi

, Maddikunta

, et al. An AI-based intelligent system for healthcare analysis using Ridge-Adaline stochastic gradient descent classifier. J Supercomput, 2021; 77(2):1998–2017.

34.

Bouke

, Alramli

, Abdullah

. XAIRF-WFP: A novel XAI-based random forest classifier for advanced email spam detection. Int J Inf Secur, 2025; 24(1).

35.

Zhang

, Chen

, Xiang

, et al. In silico prediction of mitochondrial toxicity by using GA-CG-SVM approach. Toxicol In Vitro, 2009; 23(1):134–140.

36.

Kumari

, Akhtar

, Tanveer

, et al. Diagnosis of breast cancer using a flexible pinball loss support vector machine. Appl Soft Comput, 2024; 157:111454.

37.

Yang

, Huang

, Li

, et al. An integrated scheme for feature selection and parameter setting in the support vector machine modelling and its application to the prediction of pharmacokinetic properties of drugs. Artif Intell Med, 2009; 46(2):155–163.

38.

Kaur

, Gupta

, et al. High-accuracy lung disease classification via logistic regression and advanced feature extraction techniques. Egypt Inform J, 2025; 29:100596.

39.

Manica

, Oskooei

, Born

, Subramanian

, Sáez-Rodríguez

, Martínez

. Towards explainable anticancer compound sensitivity prediction via multimodal attention-based convolutional encoders. Bioinformatics, 2019; 35(14):i313–i321.

40.

Cherkasov

, Muratov

, Fourches

, et al. QSAR modeling: Where have you been? Where are you going? J Med Chem, 2014; 57(12):4977–5010.

41.

Mendez

, Gaulton

, Bento

, et al. ChEMBL: Towards direct deposition of bioassay data. Nucleic Acids Res, 2019; 47(D1):D930–D940.

42.

Gomes

, Cabral

, Silva

, et al. Functionalized indolizines with potent anticancer activity. Eur J Med Chem, 2016; 121:1–11.

43.

Chávez

, de la Cruz

, López

, Mena

. Novel indolizine derivatives with cytotoxic activity against human cancer cells. Bioorganic & Medicinal Chemistry Letters, 2020; 30(9):127021.

44.

Hewitt

, Tighe

, Santaguida

, et al. Sustained Mps1 activity is required in mitosis to recruit O-Mad2 to the Mad1–C-Mad2 core complex. J Med Chem, 2010; 53(22):8477–8480.

45.

Maia

ARR

, de Man

, Boon

, et al. Inhibition of Mps1 promotes chromosome missegregation and suppresses tumor growth. Cancer Cell, 2015; 27(5):762–775.

46.

Koch

, Maia

ARR

, Janssen

, et al. Selective inhibitor of the spindle assembly checkpoint kinase TTK (BAY 1217389) in cancer therapy. Mol Cancer Ther, 2021; 20(5):859–870.

47.

Tannous

, Wong

, Tang

, Kamps

. Structural basis of Mps1 kinase inhibition. Nat Chem Biol, 2013; 9:576–582.

48.

Cao

, Zheng

, Yang

. Anticancer properties of Amaryllidaceae alkaloids: A review. Phytochemistry Reviews, 2020; 19:327–361.

49.

Lamoral-Theys

, Andolfi

, Van Goietsenoven

, et al. Lycorine induces apoptosis in cancer cells. Biochem Pharmacol, 2009; 78(3):319–328.

50.

Liao

, Hsu

, Lin

, et al. Anticancer activities of phenanthroindolizidine and tylophorine derivatives. Cancer Res, 2005; 65(15):6712–6719.

51.

Liang

, Li

, Zheng

. DCB-3503 inhibits translation and suppresses breast cancer progression. Oncotarget, 2016; 7(4):4599–4613.

52.

Sotto

, Valipour

, Azari

, et al. Benzoindolizidine alkaloids tylophorine and lycorine and their analogues with antiviral, anti-inflammatory, and anticancer properties: Promises and challenges. Biomedicines, 2023; 11:2619.

53.

Zhou

, Xu

, Fu

, Zhang

, Kang

. Antiproliferative and pro-apoptotic effects of oxindole alkaloids (mitraphylline, uncarine E, isorhynchophylline) in cancer cells. J Ethnopharmacol, 2017; 195:232–240.

54.

Blumenschein

, Anand

, Lin

. Capmatinib in MET-altered advanced cancer. Lancet Oncol, 2019; 20(5):692–703.

55.

Kavanaugh

, Genovese

, Smolen

. Filgotinib: A selective JAK1 inhibitor in clinical development. Arthritis & Rheumatology, 2017; 69(4):599–609.

56.

Jordan

, Wilson

. Microtubules as a target for anticancer drugs. Nat Rev Cancer, 2004; 4(4):253–265.

57.

Mahaur

, Upadhyay

. Indolizine: In-silico identification of inhibitors against mutated BCR-ABL protein of chronic myeloid leukemia. Research Journal of Pharmacology, 2021:2321–5836.

58.

Daina

, Michielin

, Zoete

. SwissADME: A free web tool to evaluate pharmacokinetics, drug likeness and medicinal chemistry friendliness of small molecules. Sci Rep, 2017; 7:42717.

59.

Daina

, Michielin

, Zoete

. iLOGP: A simple, robust and efficient description of n-octanol/water partition coefficient for drug-design using the GB/SA approach. J Chem Inf Model, 2014; 54(12):3284–3301.

60.

Daina

, Michielin

, Zoete

. A BOILED-egg to predict gastrointestinal absorption and brain penetration of small molecules. Chem Med Chem, 2016; 11(11):1117–1121.

61.

Jannuzzi

, Goler

AMY

, Biswas

, et al. Prospects for prostate cancer chemotherapy: Cytotoxic evaluation and mechanistic insights of quinoline quinones with ADME/PK profile. Biomedicines, 2024; 12(6):1241.

62.

Maltarollo

VGC

, Gertrudes

, Oliveira

, et al. Applying machine learning techniques for ADME-tox prediction: A review. Expert Opin Drug Metab Toxicol, 2015; 11(2):259–271.

63.

Banerjee

, Halder

, Ghosh

. Computational approaches for anticancer drug design. Expert Opin Drug Discov, 2018; 13(9):809–823.

64.

Cortés-Ciriano

, Murrell

, van Westen

GJP

, Bender

, Malliavin

, Glen

. Machine-learning models for kinase inhibitor bioactivity prediction. J Cheminform, 2016; 8:27.

65.

Lipinski

, Lombardo

, Dominy

, et al. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev, 1997; 23(1–3):3–25.

66.

Shwartz-Ziv

, Armon

. (2022). Tabular data: Deep learning is not all you need. In Proceedings of the NeurIPS 2022 Workshop on Data-Centric Machine Learning.

67.

Arik

SÖ

, Pfister

. TabNet: Attentive interpretable tabular learning. arXiv Preprint, 2019arXiv:1908.07442.

68.

Liu

, Zhang

, Li

. Convolutional neural networks for structured tabular data analysis. Pattern Recognit, 2021; 117:107994.

69.

Zhao

, Yang

, Yi

. 1D-CNN models for numerical sequence classification. Applied Intelligence, 2020; 50:1738–1751.

70.

Jain

, Singh

. Deep 1D convolutional networks for QSAR modeling. J Cheminform, 2021; 13(1):55.

71.

Fukushima

, Tanaka

, Yamada

. 1D convolutional neural networks for physicochemical descriptor-based prediction. J Chem Inf Model, 2020; 60(12):6234–6245.

72.

Fang

, Wu

, Li

. CNN-based QSAR modeling for drug discovery. J Cheminform, 2022; 14:15.

73.

Chen

, Guestrin

. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD , 785–794.

74.

, Meng

, Finley

, et al. (2017). LightGBM: A highly efficient gradient boosting decision tree. NeurIPS , 3146–3154.

75.

, Ramsundar

, Feinberg

, et al. MoleculeNet: A benchmark for molecular machine learning. Chem Sci, 2018; 9(2):513–530.

76.

LeCun

, Bottou

, Bengio

, Haffner

. Gradient-based learning applied to document recognition. Proc IEEE, 1998; 86(11):2278–2324.

77.

Basith

, Manavalan

, Govindaraj

, Lee

. SDAC-DeepPred: Convolutional neural networks for drug activity prediction. Brief Bioinform, 2021; 22(4):bbaa379.

78.

Zhang

, Tan

, Han

, Zhu

. From machine learning to deep learning: Progress in drug discovery. Drug Discov Today, 2020; 25(7):1307–1315.