Skip to content
1887

Abstract

Infections caused by multidrug-resistant organisms (MDROs) are difficult to treat and often life threatening and place a burden on the healthcare system. Minimizing the transmission of MDROs in hospitals is a global priority with genomics proving to be a powerful tool for identifying the transmission of MDROs. To optimize the utility of genomics for prospective infection control surveillance, results must be available in real time, reproducible and simple to communicate to clinicians. Traditional reference-based approaches suffer from several limitations for prospective genomic surveillance. Whilst reference-free or pairwise genome comparisons avoid some of these limitations, they can be computationally intensive and time consuming. Split -mer analysis (SKA) offers a viable alternative facilitating rapid reference-free pairwise comparisons of genomic data, but the optimum SKA parameters for the detection of transmission have not been determined. Additionally, the accuracy of SKA-based inferences has not been measured, nor whether modified quality control parameters are required. Here, we explore the performance of 60 SKA parameter combinations across 50 simulations to quantify the false negative and positive SNP proportions for , , and . Using the optimum parameter combination, we explore concordance between SKA, multilocus sequence typing (MLST), core genome MLST (cgMLST) and Snippy in a real-world dataset. Lastly, we investigate whether simulated plasmid gain or loss could impact SNP detection with SKA. This work identifies that the use of SKA with sequencing reads, a -mer length of 19 and a minor allele frequency filter of 0.01 is optimal for MDRO transmission detection. Whilst SNP detection with SKA (when used with sequencing reads) undercalls SNPs compared to Snippy, it is significantly faster, especially with larger datasets. SKA has excellent concordance with MLST and cgMLST and is not impacted by simulated plasmid movement. We propose that the use of SKA for the detection of bacterial pathogen transmission is superior to traditional methodologies, capable of providing results in a much shorter timeframe.

Funding
This study was supported by the:
  • National Health and Medical Research Council (Award GNT1105905)
    • Principal Award Recipient: NotApplicable
  • Melbourne Genomics Health Alliance
    • Principal Award Recipient: NotApplicable
  • This is an open-access article distributed under the terms of the Creative Commons Attribution License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.
Loading

Article metrics loading...

/content/journal/mgen/10.1099/mgen.0.001347
2025-03-06
2026-01-15

Metrics

Loading full text...

Full text loading...

/deliver/fulltext/mgen/11/3/mgen001347.html?itemId=/content/journal/mgen/10.1099/mgen.0.001347&mimeType=html&fmt=ahah

References

  1. World Health Organization WHO publishes list of bacteria for which new antibiotics are urgently needed; 2017
  2. World Health Organization WHO bacterial priority pathogens list, 2024: bacterial pathogens of public health importance to guide research, development and strategies to prevent and control antimicrobial resistance; 2024
  3. Cassini A, Högberg LD, Plachouras D, Quattrocchi A, Hoxha A et al. Attributable deaths and disability-adjusted life-years caused by infections with antibiotic-resistant bacteria in the EU and the European Economic Area in 2015: a population-level modelling analysis. Lancet Infect Dis 2019; 19:56–66 [View Article] [PubMed]
    [Google Scholar]
  4. Murray CJL, Ikuta KS, Sharara F, Swetschinski L, Robles Aguilar G et al. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. The Lancet 2022; 399:629–655 [View Article]
    [Google Scholar]
  5. Quainoo S, Coolen JPM, van Hijum SAFT, Huynen MA, Melchers WJG et al. Whole-genome sequencing of bacterial pathogens: the future of nosocomial outbreak analysis. Clin Microbiol Rev 2017; 30:1015–1063 [View Article] [PubMed]
    [Google Scholar]
  6. Jauneikaite E, Baker KS, Nunn JG, Midega JT, Hsu LY et al. Genomics for antimicrobial resistance surveillance to support infection prevention and control in health-care facilities. Lancet Microbe 2023; 4:e1040–e1046 [View Article] [PubMed]
    [Google Scholar]
  7. Gorrie CL, Da Silva AG, Ingle DJ, Higgs C, Seemann T et al. Key parameters for genomics-based real-time detection and tracking of multidrug-resistant bacteria: a systematic analysis. Lancet Microbe 2021; 2:e575–e583 [View Article] [PubMed]
    [Google Scholar]
  8. Harris SR. Ska: split kmer analysis toolkit for bacterial genomic epidemiology. BioRxiv 2018; 453142 [View Article]
    [Google Scholar]
  9. Higgs C, Sherry NL, Seemann T, Horan K, Walpola H et al. Optimising genomic approaches for identifying vancomycin-resistant Enterococcus faecium transmission in healthcare settings. Nat Commun 2022; 13:509 [View Article] [PubMed]
    [Google Scholar]
  10. Maechler F, Weber A, Schwengers O, Schwab F, Denkel L et al. Split k-mer analysis compared to cgMLST and SNP-based core genome analysis for detecting transmission of vancomycin-resistant enterococci: results from routine outbreak analyses across different hospitals and hospitals networks in Berlin, Germany. Microb Genom 2023; 9:mgen000937 [View Article] [PubMed]
    [Google Scholar]
  11. Kühl MA, Stich B, Ries DC. Mutation-Simulator: fine-grained simulation of random mutations in any genome. Bioinformatics 2021; 37:568–569 [View Article] [PubMed]
    [Google Scholar]
  12. Huang W, Li L, Myers JR, Marth GT. ART: A next-generation sequencing read simulator. Bioinformatics 2012; 28:593–594 [View Article] [PubMed]
    [Google Scholar]
  13. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018; 34:i884–i890 [View Article]
    [Google Scholar]
  14. Dale RK, Pedersen BS, Quinlan AR. Pybedtools: a flexible Python library for manipulating genomic datasets and annotations. Bioinformatics 2011; 27:3423–3424 [View Article] [PubMed]
    [Google Scholar]
  15. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 2010; 26:841–842 [View Article] [PubMed]
    [Google Scholar]
  16. Sherry NL, Gorrie CL, Kwong JC, Higgs C, Stuart RL et al. Multi-site implementation of whole genome sequencing for hospital infection control: a prospective genomic epidemiological analysis. Lancet Reg Health West Pac 2022; 23:100446 [View Article] [PubMed]
    [Google Scholar]
  17. Jolley KA, Maiden MCJ. BIGSdb: Scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics 2010; 11:595 [View Article] [PubMed]
    [Google Scholar]
  18. Silva M, Machado MP, Silva DN, Rossi M, Moran-Gilad J et al. chewBBACA: a complete suite for gene-by-gene schema creation and strain identification. Microb Genom 2018; 4:e000166 [View Article] [PubMed]
    [Google Scholar]
  19. Piazza A, Comandatore F, Romeri F, Brilli M, Dichirico B et al. Identification of bla VIM-1 gene in ST307 and ST661 klebsiella pneumoniae clones in italy: old acquaintances for new combinations. Microbial Drug Resistance 2019; 25:787–790 [View Article]
    [Google Scholar]
  20. Weber RE, Pietsch M, Frühauf A, Pfeifer Y, Martin M et al. Is26-mediated transfer of blaNDM-1 as the main route of resistance transmission during a polyclonal, multispecies outbreak in a German hospital. Front Microbiol 2019; 10: [View Article]
    [Google Scholar]
  21. de Been M, Pinholt M, Top J, Bletz S, Mellmann A et al. Core genome multilocus sequence typing scheme for high-resolution typing of Enterococcus faecium. J Clin Microbiol 2015; 53:3788–3797 [View Article] [PubMed]
    [Google Scholar]
  22. Leopold SR, Goering RV, Witten A, Harmsen D, Mellmann A. Bacterial whole-genome sequencing revisited: portable, scalable, and standardized analysis for typing and detection of virulence and antibiotic resistance genes. J Clin Microbiol 2014; 52:2365–2370 [View Article] [PubMed]
    [Google Scholar]
  23. Senthakumaran T, Brandal LT, Lindstedt BA, Jørgensen SB, Charnock C et al. Implications of stx loss for clinical diagnostics of Shiga toxin-producing Escherichia coli. Eur J Clin Microbiol Infect Dis 2018; 37:2361–2370 [View Article] [PubMed]
    [Google Scholar]
  24. Zhou Z, Alikhan NF, Mohamed K, Fan Y, Achtman M. The enterobase user’s guide, with case studies on salmonella transmissions, yersinia pestis phylogeny, and Escherichia core genomic diversity. Genome Res 2020; 30:138–152 [View Article]
    [Google Scholar]
  25. Gorrie CL, Mirceta M, Wick RR, Edwards DJ, Thomson NR et al. Gastrointestinal carriage is a major reservoir of Klebsiella pneumoniae infection in intensive care patients. Clin Infect Dis 2017; 65:208–215 [View Article] [PubMed]
    [Google Scholar]
  26. Harris SR, Cartwright EJP, Török ME, Holden MTG, Brown NM et al. Whole-genome sequencing for analysis of an outbreak of meticillin-resistant Staphylococcus aureus: a descriptive study. Lancet Infect Dis 2013; 13:130–136 [View Article] [PubMed]
    [Google Scholar]
  27. Raven KE, Gouliouris T, Brodrick H, Coll F, Brown NM et al. Complex routes of nosocomial vancomycin-resistant enterococcus faecium transmission revealed by genome sequencing. Clin Infect Dis 2017; 64:886–893 [View Article] [PubMed]
    [Google Scholar]
  28. Sherry NL, Lane CR, Kwong JC, Schultz M, Sait M et al. Genomics for molecular epidemiology and detecting transmission of carbapenemase-producing enterobacterales in Victoria, Australia, 2012 to 2016. J Clin Microbiol 2019; 57:e00573-19 [View Article] [PubMed]
    [Google Scholar]
  29. Pak TR, Kasarskis A. How next-generation sequencing and multiscale data analysis will transform infectious disease management. Clin Infect Dis 2015; 61:1695–1702 [View Article] [PubMed]
    [Google Scholar]
  30. Bussi Y, Kapon R, Reich Z. Large-scale k-mer-based analysis of the informational properties of genomes, comparative genomics and taxonomy. PLoS One 2021; 16:e0258693 [View Article] [PubMed]
    [Google Scholar]
  31. Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol 2016; 17:132 [View Article] [PubMed]
    [Google Scholar]
  32. Pornputtapong N, Acheampong DA, Patumcharoenpol P, Jenjaroenpun P, Wongsurawat T et al. Kitsune: a tool for identifying empirically optimal k-mer length for alignment-free phylogenomic analysis. Front Bioeng Biotechnol 2020; 8:556413 [View Article] [PubMed]
    [Google Scholar]
  33. Health Laboratories Network P. Ensuring national capacity in genomics-guided public health laboratory surveillance ensuring national capacity in genomics-guided public health laboratory surveillance A report of the public health laboratory network (PHLN) expert advisory group on whole genome sequencing PHLN recommendations background; 2014
  34. Kwong JC, McCallum N, Sintchenko V, Howden BP. Whole genome sequencing in clinical and public health microbiology. Pathology 2015; 47:199–210 [View Article] [PubMed]
    [Google Scholar]
  35. Roosaare M, Puustusmaa M, Möls M, Vaher M, Remm M. PlasmidSeeker: identification of known plasmids from bacterial whole genome sequencing reads. PeerJ 2018; 6:e4588 [View Article] [PubMed]
    [Google Scholar]
  36. Derelle R, von Wachsmann J, Mäklin T, Hellewell J, Russell T et al. Seamless, rapid and accurate analyses of outbreak genomic data using Split K-mer Analysis (SKA). Bioinformatics 2024 [View Article]
    [Google Scholar]
  37. Didelot X, Ribeca P. Didelot and Ribeca KPop: Accurate, assembly-free, and scalable comparative analysis of microbial genomes https://github.com/PaoloRibeca/KPop
  38. Lees JA, Harris SR, Tonkin-Hill G, Gladstone RA, Lo SW et al. Fast and flexible bacterial genomic epidemiology with PopPUNK. Genome Res 2019; 29:304–316 [View Article] [PubMed]
    [Google Scholar]
  39. Moore MP, Laager M, Ribeca P, Didelot X. KmerAperture: retaining k-mer synteny for alignment-free extraction of core and accessory differences between bacterial genomes. PLoS Genet 2024; 20:e1011184 [View Article] [PubMed]
    [Google Scholar]
/content/journal/mgen/10.1099/mgen.0.001347
Loading
/content/journal/mgen/10.1099/mgen.0.001347
Loading

Data & Media loading...

Supplements

Supplementary material 1

PDF

Supplementary material 2

EXCEL
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An error occurred
Approval was partially successful, following selected items could not be processed due to error