Skip to content
1887

Graphical Abstract

Graphical abstract

Abstract

The most common approach to sampling the bacterial populations within an infected or colonized host is to sequence genomes from a single colony obtained from a culture plate. However, it is recognized that this method does not capture the genetic diversity in the population. Sequencing a mixture of several colonies (pool-seq) is a better approach to detect population heterogeneity, but it is more complex to analyse due to different types of heterogeneity, such as within-clone polymorphisms, multi-strain mixtures, multi-species mixtures and contamination. Here, we compared 8 single-colony isolates (singles) and pool-seq on a set of 2286 culture samples to identify features that can distinguish pure samples, samples undergoing intraclonal variation and mixed strain samples. The samples were obtained by swabbing 3 body sites on 85 human participants quarterly for a year, who initially presented with a methicillin-resistant skin and soft-tissue infection (SSTI). We compared parameters such as sequence quality, contamination, allele frequency, nucleotide diversity and pangenome diversity in each pool to those for the corresponding singles. Comparing singles from the same culture plate, we found that 18% of sample collections contained mixtures of multiple multilocus sequence types (MLSTs or STs). We showed that pool-seq data alone could predict the presence of multi-ST populations with 95% accuracy. We also showed that pool-seq could be used to estimate the number of intra-clonal polymorphic sites in the population. Additionally, we found that the pool may contain clinically relevant genes such as antimicrobial resistance markers that may be missed when only examining singles. These results highlight the potential advantage of analysing genome sequences of total populations obtained from clinical cultures rather than single colonies.

Funding
This study was supported by the:
  • National Institutes of Health (US) (Award AI139188)
    • Principle Award Recipient: TimothyD Read
  • Division of Microbiology and Infectious Diseases, National Institute of Allergy and Infectious Diseases (Award AI158452-01A1)
    • Principle Award Recipient: MichaelZ. David
  • Division of Microbiology and Infectious Diseases, National Institute of Allergy and Infectious Diseases (Award AI158452-01A1)
    • Principle Award Recipient: TimothyD Read
  • This is an open-access article distributed under the terms of the Creative Commons Attribution License.
Loading

Article metrics loading...

/content/journal/mgen/10.1099/mgen.0.001111
2023-11-07
2025-06-19
Loading full text...

Full text loading...

/deliver/fulltext/mgen/9/11/mgen001111.html?itemId=/content/journal/mgen/10.1099/mgen.0.001111&mimeType=html&fmt=ahah

References

  1. Raghuram V. Code for analysis available in github: github.com/VishnuRaghuram94/GASP. figshare. Dataset. Raw data for the Genomic Analysis of Singles and Pools (GASP) project; 2023 https://doi.org/10.6084/m9.figshare.24161634.v1
  2. Giulieri SG, Guérillot R, Duchene S, Hachani A, Daniel D et al. Niche-specific genome degradation and convergent evolution shaping Staphylococcus aureus adaptation during severe infections. Elife 2022; 11:e77195 [View Article] [PubMed]
    [Google Scholar]
  3. Talbot BM, Jacko NF, Petit RA, Pegues DA, Shumaker MJ et al. Unsuspected clonal spread of methicillin-resistant Staphylococcus aureus causing bloodstream infections in hospitalized adults detected using whole genome sequencing. Clin Infect Dis 2022; 75:2104–2112 [View Article] [PubMed]
    [Google Scholar]
  4. Armstrong GL, MacCannell DR, Taylor J, Carleton HA, Neuhaus EB et al. Pathogen genomics in public health. N Engl J Med 2019; 381:2569–2580 [View Article] [PubMed]
    [Google Scholar]
  5. Chaguza C, Senghore M, Bojang E, Gladstone RA, Lo SW et al. Within-host microevolution of Streptococcus pneumoniae is rapid and adaptive during natural colonisation. Nat Commun 2020; 11:3442 [View Article] [PubMed]
    [Google Scholar]
  6. Golubchik T, Batty EM, Miller RR, Farr H, Young BC et al. Within-host evolution of Staphylococcus aureus during asymptomatic carriage. PLoS ONE 2013; 8:e61319 [View Article] [PubMed]
    [Google Scholar]
  7. Marvig RL, Sommer LM, Molin S, Johansen HK. Convergent evolution and adaptation of Pseudomonas aeruginosa within patients with cystic fibrosis. Nat Genet 2015; 47:57–64 [View Article] [PubMed]
    [Google Scholar]
  8. Azimi S, Roberts AEL, Peng S, Weitz JS, McNally A et al. Allelic polymorphism shapes community function in evolving Pseudomonas aeruginosa populations. ISME J 2020; 14:1929–1942 [View Article] [PubMed]
    [Google Scholar]
  9. Didelot X, Walker AS, Peto TE, Crook DW, Wilson DJ. Within-host evolution of bacterial pathogens. Nat Rev Microbiol 2016; 14:150–162 [View Article] [PubMed]
    [Google Scholar]
  10. Paterson GK, Harrison EM, Murray GGR, Welch JJ, Warland JH et al. Capturing the cloud of diversity reveals complexity and heterogeneity of MRSA carriage, infection and transmission. Nat Commun 2015; 6:6560 [View Article] [PubMed]
    [Google Scholar]
  11. Markussen T, Marvig RL, Gómez-Lozano M, Aanæs K, Burleigh AE et al. Environmental heterogeneity drives within-host diversification and evolution of Pseudomonas aeruginosa. mBio 2014; 5:e01592-14 [View Article] [PubMed]
    [Google Scholar]
  12. Lieberman TD, Flett KB, Yelin I, Martin TR, McAdam AJ et al. Genetic variation of a bacterial pathogen within individuals with cystic fibrosis provides a record of selective pressures. Nat Genet 2014; 46:82–87 [View Article] [PubMed]
    [Google Scholar]
  13. Tonkin-Hill G, Ling C, Chaguza C, Salter SJ, Hinfonthong P et al. Pneumococcal within-host diversity during colonization, transmission and treatment. Nat Microbiol 2022; 7:1791–1804 [View Article] [PubMed]
    [Google Scholar]
  14. Bryant JM, Brown KP, Burbaud S, Everall I, Belardinelli JM et al. Stepwise pathogenic evolution of Mycobacterium abscessus. Science 2021; 372:eabb8699 [View Article] [PubMed]
    [Google Scholar]
  15. Wilkinson DJ, Dickins B, Robinson K, Winter JA. Genomic diversity of Helicobacter pylori populations from different regions of the human stomach. Gut Microbes 2022; 14:2152306 [View Article] [PubMed]
    [Google Scholar]
  16. Liu Q, Via LE, Luo T, Liang L, Liu X et al. Within patient microevolution of Mycobacterium tuberculosis correlates with heterogeneous responses to treatment. Sci Rep 2015; 5:17507 [View Article] [PubMed]
    [Google Scholar]
  17. Holt KE, Teo YY, Li H, Nair S, Dougan G et al. Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA. Bioinformatics 2009; 25:2074–2075 [View Article] [PubMed]
    [Google Scholar]
  18. Rossen JWA, Friedrich AW, Moran-Gilad J. Practical issues in implementing whole-genome-sequencing in routine diagnostic microbiology. Clin Microbiol Infect 2018; 24:355–360 [View Article] [PubMed]
    [Google Scholar]
  19. Brodrick HJ, Raven KE, Kallonen T, Jamrozy D, Blane B et al. Longitudinal genomic surveillance of multidrug-resistant Escherichia coli carriage in a long-term care facility in the United Kingdom. Genome Med 2017; 9:70 [View Article] [PubMed]
    [Google Scholar]
  20. Jorth P, Durfey S, Rezayat A, Garudathri J, Ratjen A et al. Cystic fibrosis lung function decline after within-host evolution increases virulence of infecting Pseudomonas aeruginosa. Am J Respir Crit Care Med 2021; 203:637–640 [View Article] [PubMed]
    [Google Scholar]
  21. Joseph SJ, Bommana S, Ziklo N, Kama M, Dean D et al. Patterns of within-host spread of Chlamydia trachomatis between vagina, endocervix and rectum revealed by comparative genomic analysis. bioRxiv 20232023 [View Article] [PubMed]
    [Google Scholar]
  22. Octavia S, Wang Q, Tanaka MM, Sintchenko V, Lan R. Genomic heterogeneity of Salmonella enterica serovar Typhimurium bacteriuria from chronic infection. Infect Genet Evol 2017; 51:17–20 [View Article] [PubMed]
    [Google Scholar]
  23. David MZ, Daum RS. Treatment of Staphylococcus aureus Infections. Curr Top Microbiol Immunol 2017; 409:325–383 [View Article] [PubMed]
    [Google Scholar]
  24. Kourtis AP, Hatfield K, Baggs J, Mu Y, See I et al. Vital signs: epidemiology and recent trends in methicillin-resistant and in methicillin-susceptible Staphylococcus aureus bloodstream infections - United States. MMWR Morb Mortal Wkly Rep 2019; 68:214–219 [View Article] [PubMed]
    [Google Scholar]
  25. Coll F, Harrison EM, Toleman MS, Reuter S, Raven KE et al. Longitudinal genomic surveillance of MRSA in the UK reveals transmission patterns in hospitals and the community. Sci Transl Med 2017; 9:eaak9745 [View Article] [PubMed]
    [Google Scholar]
  26. Giulieri SG, Baines SL, Guerillot R, Seemann T, Gonçalves da Silva A et al. Genomic exploration of sequential clinical isolates reveals a distinctive molecular signature of persistent Staphylococcus aureus bacteraemia. Genome Med 2018; 10:65 [View Article] [PubMed]
    [Google Scholar]
  27. Sabat AJ, Hermelijn SM, Akkerboom V, Juliana A, Degener JE et al. Complete-genome sequencing elucidates outbreak dynamics of CA-MRSA USA300 (ST8-spa t008) in an academic hospital of Paramaribo, Republic of Suriname. Sci Rep 2017; 7:41050 [View Article] [PubMed]
    [Google Scholar]
  28. Xu Z, Yuan C. Molecular epidemiology of Staphylococcus aureus in China reveals the key gene features involved in epidemic transmission and adaptive evolution. Microbiol Spectr 2022; 10:e0156422 [View Article] [PubMed]
    [Google Scholar]
  29. Yebra G, Harling-Lee JD, Lycett S, Aarestrup FM, Larsen G et al. Multiclonal human origin and global expansion of an endemic bacterial pathogen of livestock. Proc Natl Acad Sci U S A 2022; 119:e2211217119 [View Article] [PubMed]
    [Google Scholar]
  30. Petit RA, Read TD. Bactopia: a flexible pipeline for complete analysis of bacterial genomes. mSystems 2020; 5:e00190-20 [View Article] [PubMed]
    [Google Scholar]
  31. SourceForge BBMap; 2022 https://sourceforge.net/projects/bbmap/ accessed 3 April 2023
  32. Souvorov A, Agarwala R, Lipman DJ. SKESA: strategic k-mer extension for scrupulous assemblies. Genome Biol 2018; 19:153 [View Article] [PubMed]
    [Google Scholar]
  33. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 2015; 25:1043–1055 [View Article] [PubMed]
    [Google Scholar]
  34. Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 2013; 29:1072–1075 [View Article] [PubMed]
    [Google Scholar]
  35. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 2014; 30:2068–2069 [View Article]
    [Google Scholar]
  36. Feldgarden M, Brover V, Gonzalez-Escalona N, Frye JG, Haendiges J et al. AMRFinderPlus and the reference gene catalog facilitate examination of the genomic links among antimicrobial resistance, stress response, and virulence. Sci Rep 2021; 11:12728 [View Article] [PubMed]
    [Google Scholar]
  37. Seemann T. Snippy; 2023 https://github.com/tseemann/snippy accessed 3 April 2023
  38. Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol 2016; 17:132 [View Article] [PubMed]
    [Google Scholar]
  39. Seemann T. mlst; 2023 https://github.com/tseemann/mlst accessed 3 April 2023
  40. Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 2014; 15:R46 [View Article] [PubMed]
    [Google Scholar]
  41. Seemann T. Source code for snp-dists software. Zenodo; 2018 https://zenodo.org/record/1411986 accessed 16 March 2023
  42. Wick RR, Holt KE. rrwick/assembly-dereplicator: assembly dereplicator v0.1.0; 2019 https://zenodo.org/record/3365572 accessed 3 April 2023
  43. Treangen TJ, Ondov BD, Koren S, Phillippy AM. The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes. Genome Biol 2014; 15:524 [View Article] [PubMed]
    [Google Scholar]
  44. Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 2015; 32:268–274 [View Article] [PubMed]
    [Google Scholar]
  45. Yu G, Smith DK, Zhu H, Guan Y, Lam TTY. Ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol 2017; 8:28–36 [View Article]
    [Google Scholar]
  46. Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 2011; 27:2987–2993 [View Article]
    [Google Scholar]
  47. Raghuram V, Read T. Help, I have too many genome sequences!; 2022 https://zenodo.org/record/7278310 accessed 24 April 2023
  48. Croucher NJ, Page AJ, Connor TR, Delaney AJ, Keane JA et al. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Res 2015; 43:e15 [View Article] [PubMed]
    [Google Scholar]
  49. Olm MR, Crits-Christoph A, Bouma-Gregson K, Firek BA, Morowitz MJ et al. inStrain profiles population microdiversity from metagenomic data and sensitively detects shared microbial strains. Nat Biotechnol 2021; 39:727–736 [View Article] [PubMed]
    [Google Scholar]
  50. Marius Mutation-Simulator; 2023 https://github.com/mkpython3/Mutation-Simulator accessed 28 August 2023
  51. Gourlé H, Karlsson-Lindsjö O, Hayer J, Bongcam-Rudloff E, Hancock J. Simulating Illumina metagenomic data with InSilicoSeq. Bioinformatics 2019; 35:521–522 [View Article]
    [Google Scholar]
  52. R:The R Project for Statistical Computing. n.d https://www.r-project.org/ accessed 4 April 2023
  53. InformationValue: performance analysis and companion functions for binary classification models version 1.2.3 from CRAN. n.d https://rdrr.io/cran/InformationValue/ accessed 26 April 2023
  54. Tahk A. Political science computational laboratory; 2023 https://github.com/atahk/pscl accessed 4 April 2023
  55. Fox J, Weisberg S. An R companion to applied regression. Third. Thousand Oaks, CA: Sage; 2019 https://socialsciences.mcmaster.ca/jfox/Books/Companion/
  56. rstatix A. rstatix: pipe-friendly framework for basic statistical tests; 2023 https://CRAN.R-project.org/package=rstatix accessed 4 April 2023
  57. Wickham H. ggplot2: elegant graphics for data analysis. Springer-Verlag New York; 2016 https://ggplot2.tidyverse.org
  58. Duerr S. Bioicons; 2023 https://github.com/duerrsimon/bioicons accessed 30 March 2023
  59. jgraph/drawio JGraph; 2023 https://github.com/jgraph/drawio accessed 30 March 2023
  60. Bosshard L, Peischl S, Ackermann M, Excoffier L. Dissection of the mutation accumulation process during bacterial range expansions. BMC Genomics 2020; 21:253 [View Article] [PubMed]
    [Google Scholar]
  61. Biek R, Pybus OG, Lloyd-Smith JO, Didelot X. Measurably evolving pathogens in the genomic era. Trends Ecol Evol 2015; 30:306–313 [View Article] [PubMed]
    [Google Scholar]
  62. Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods 2017; 14:417–419 [View Article] [PubMed]
    [Google Scholar]
  63. Liao H, Ji Y, Sun Y. High-resolution strain-level microbiome composition analysis from short reads. Microbiome 2023; 11:183 [View Article] [PubMed]
    [Google Scholar]
/content/journal/mgen/10.1099/mgen.0.001111
Loading
/content/journal/mgen/10.1099/mgen.0.001111
Loading

Data & Media loading...

Supplements

Supplementary material 1

PDF

Supplementary material 2

Supplementary material 3

EXCEL
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error