Sage Journals: Discover world-class research

Abstract

Multilocus variable number tandem repeat analysis (MLVA) is a molecular subtyping technique that remains useful for those without the resources to access whole genome sequencing for the tracking and tracing of bacterial contaminants. Unlike techniques such as multilocus sequence typing (MLST) and pulsed-field gel electrophoresis, MLVA did not emerge as a standardized subtyping method for Listeria monocytogenes, and as a result, there is no reference database of virulent or food-associated MLVA subtypes as there is for MLST-based clonal complexes (CCs). Having previously shown the close congruence of a 5-loci MLVA scheme with MLST, a predictive model was created using the XGBoost machine learning (ML) technique, which enabled the prediction of CCs from MLVA patterns with ∼85% (±4%) accuracy. As well as validating the model on existing data, a straightforward update protocol was simulated for if and when previously unseen subtypes might arise. This article illustrates how ML techniques can be applied with elementary coding skills to add value to previous-generation molecular subtyping data in-built food processing environments.

Get full access to this article

View all access options for this article.

References

Andrews

, McCabe

, Wall

, et al. Validating the Utility of Multilocus Variable Number Tandem-repeat Analysis (MLVA) as a Subtyping Strategy to Monitor Listeria monocytogenes in built food processing environments. J Food Prot, 2023; 86(10):100147; doi: 10.1016/j.jfp.2023.100147

Anonymous. Ascent of machine learning in medicine. Nat Mater, 2019; 18(5):407; doi: 10.1038/s41563-019-0360-1

Bandoy

DDR

, Weimer

. Biological machine learning combined with campylobacter population genomics reveals virulence gene allelic variants cause disease. Microorganisms, 2020; 8(4):549; doi: 10.3390/microorganisms8040549

Behravan

, Hartikainen

, Tengstrom

, et al. Machine learning identifies interacting genetic variants contributing to breast cancer risk: A case study in Finnish cases and controls. Sci Rep, 2018; 8(1):13149; doi: 10.1038/s41598-018-31573-5

Carpentier

, Cerf

. Review – Persistence of Listeria monocytogenes in food industry equipment and premises. Int J Food Microbiol, 2011; 145(1):1–8; doi: 10.1016/j.ijfoodmicro.2011.01.005

Carrico

, Silva-Costa

, Melo-Cristino

, et al. Illustration of a common framework for relating multiple typing methods by application to macrolide-resistant Streptococcus pyogenes . J Clin Microbiol, 2006; 44(7):2524–2532; doi: 10.1128/jcm.02536-05

Chen

, Guestrin

. XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, 2016:785-794; doi: 10.1145/2939672.2939785

Chenal-Francisque

, Diancourt

, Cantinelli

, et al. Optimized Multilocus variable-number tandem-repeat analysis assay and its complementarity with pulsed-field gel electrophoresis and multilocus sequence typing for Listeria monocytogenes clone identification and surveillance. J Clin Microbiol, 2013; 51(6):1868–1880; doi: 10.1128/jcm.00606-13

FAO/WHO. Listeria monocytogenes in ready-to-eat (RTE) foods: attribution, characterization and monitoring – Meeting report. In: Microbiological Risk Assessment Series No. 38. WHO: Rome; 2022.

10.

Ferreira

, Wiedmann

, Teixeira

, et al. Listeria monocytogenes persistence in food-associated environments: Epidemiology, strain characteristics, and implications for public health. J Food Prot, 2014; 77(1):150–170; doi: 10.4315/0362-028X.JFP-13-150

11.

Friedman

, Hastie

, Tibshirani

. Special invited paper. Additive logistic regression: A statistical view of boosting. Ann Statist, 2000; 28(2):337–407; doi: 10.1214/aos/1016218223

12.

HPSC. Listeriosis in Ireland, 2018. In: Annual Epidemiological Report. HSE Health Protection Surveillance Centre: Dublin; 2019.

13.

Hurley

, Luque-Sastre

, Parker

, et al. Whole-Genome sequencing-based characterization of 100 listeria monocytogenes isolates collected from food processing environments over a four-year period. mSphere, 2019; 4(4):e00252-19; doi: 10.1128/mSphere.00252-19

14.

Johnstone

, Titterington

. Statistical challenges of high-dimensional data. Philos Trans A Math Phys Eng Sci, 2009; 367(1906):4237–4253; doi: 10.1098/rsta.2009.0159

15.

Jordan

, Mitchell

. Machine learning: Trends, perspectives, and prospects. Science, 2015; 349(6245):255–260; doi: 10.1126/science.aaa8415

16.

Kapoor

, Narayanan

. Leakage and the reproducibility crisis in ML-based science. arXiv, 2022; doi: 10.4855/0/arXiv.2207.07048

17.

Kirkwood

BRS

, Jonathan

. Medical Statistics. Blackwell Science: Massachusetts, USA; 2003.

18.

Koopmans

, Brouwer

, Vázquez-Boland

, et al. Human Listeriosis. Clin Microbiol Rev, 2023; 36(1):e0006019; doi: 10.1128/cmr.00060-19

19.

Kotsiantis

. Supervised machine learning: A review of classification techniques. Informatica (Ljubljana), 2007; 31:249–268; doi: 10.5555/1566770.1566773

20.

, Zhang

, Wang

, et al. Genomic prediction of breeding values using a subset of SNPs identified by three machine learning methods. Front Genet, 2018; 9:237; doi: 10.3389/fgene.2018.00237

21.

Malley

, Butts

, Wiedmann

. Seek and destroy process: Listeria monocytogenes process controls in the ready-to-eat meat and poultry industry. J Food Prot, 2015; 78(2):436–445; doi: 10.4315/0362-028X.JFP-13-507

22.

Martin

, Bover-Cid

, Aymerich

. MLVA subtyping of Listeria monocytogenes isolates from meat products and meat processing plants. Food Res Int, 2018; 106:225–232; doi: 10.1016/j.foodres.2017.12.052

23.

Maury

, Bracq-Dieye

, Huang

, et al. Hypervirulent Listeria monocytogenes clones' adaption to mammalian gut accounts for their association with dairy products. Nat Commun, 2019; 10(1):2488; doi: 10.1038/s41467-019-10380-0

24.

Maury

, Tsai

, Charlier

, et al. Uncovering Listeria monocytogenes hypervirulence by harnessing its biodiversity. Nat Genet, 2016; 48(3):308–313; doi: 10.1038/ng.3501

25.

Nguyen

, Long

, McDermott

, et al. Using machine learning to predict antimicrobial MICs and associated genomic features for nontyphoidal salmonella . J Clin Microbiol, 2019; 57(2):e01260-18; doi: 10.1128/jcm.01260-18

26.

Painset

, Bjorkman

, Kiil

, et al. LiSEQ – Whole-genome sequencing of a cross-sectional survey of Listeria monocytogenes in ready-to-eat foods and human clinical cases in Europe. Microb Genom, 2019; 5(2):e000257; doi: 10.1099/mgen.0.000257

27.

Parisi

, Kemker

, Part

, et al. Continual lifelong learning with neural networks: A review. Neural Netw, 2019; 113:54–71; doi: 10.1016/j.neunet.2019.01.012

28.

Teschendorff

. Avoiding common pitfalls in machine learning omic data science. Nat Mater, 2019; 18(5):422–427; doi: 10.1038/s41563-018-0241-z

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.25 MB

0.44 MB

0.21 MB

0.38 MB

Prediction of Listeria monocytogenes Clonal Complexes from Multilocus Variable Number Tandem Repeat Analysis Patterns Using a Machine Learning Approach

Abstract

Get full access to this article

References

Supplementary Material