
Abstract

Streptococcus pneumoniae (pneumococcus) is a leading cause of morbidity and mortality worldwide. Although multi-valent pneumococcal vaccines have curbed the incidence of disease, their introduction has shifted serotype distributions, which must therefore be monitored. Whole genome sequence (WGS) data provide a powerful surveillance tool for tracking isolate serotypes, which can be determined from the nucleotide sequence of the capsular polysaccharide biosynthetic operon (cps). Although software exists to predict serotypes from WGS data, most tools are constrained by requiring high-coverage next-generation sequencing reads, which can limit accessibility and data sharing. Here we present PfaSTer, a machine learning-based method to identify 65 prevalent serotypes from assembled genome sequences. PfaSTer combines dimensionality reduction from k-mer analysis with a Random Forest classifier for rapid serotype prediction. By leveraging the model’s built-in statistical framework, PfaSTer determines confidence in its predictions without the need for coverage-based assessments. We demonstrate the robustness of this method, which returns >97 % concordance when compared to biochemical results and other serotyping tools. PfaSTer is open source and available at: https://github.com/pfizer-opensource/pfaster.
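
As a rough illustration of the two ideas in the abstract (a fixed-length k-mer representation of an assembled sequence, and a prediction confidence taken from the classifier's own votes rather than from read coverage), a minimal sketch follows. It is not PfaSTer's implementation: the function names `kmer_profile` and `call_serotype`, the hashing of k-mers into bins, and the 0.6 vote threshold are all assumptions made for this example, and the per-tree votes are supplied by hand instead of coming from a trained Random Forest.

```python
import zlib
from collections import Counter


def kmer_profile(sequence, k=4, n_bins=16):
    """Hash every k-mer of a sequence into a fixed-length count vector.

    Hashing into n_bins is a stand-in for dimensionality reduction
    (an assumption for this sketch, not the published method).
    """
    seq = sequence.upper()
    profile = [0] * n_bins
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        # Skip windows that contain ambiguous bases such as N.
        if set(kmer) <= set("ACGT"):
            # crc32 is deterministic across runs, unlike built-in hash().
            profile[zlib.crc32(kmer.encode()) % n_bins] += 1
    return profile


def call_serotype(tree_votes, min_fraction=0.6):
    """Majority vote over per-tree predictions.

    Returns (label, vote_fraction); the label is "untypable" when the
    winning fraction falls below the hypothetical min_fraction threshold.
    """
    if not tree_votes:
        raise ValueError("at least one vote is required")
    label, count = Counter(tree_votes).most_common(1)[0]
    fraction = count / len(tree_votes)
    if fraction < min_fraction:
        return "untypable", fraction
    return label, fraction


if __name__ == "__main__":
    print(kmer_profile("ACGTACGTNACGT", k=4, n_bins=16))
    print(call_serotype(["19A"] * 8 + ["19F"] * 2))
    print(call_serotype(["15B"] * 5 + ["15C"] * 5))
```

In this sketch the vote fraction plays the role of the confidence estimate: a clear majority is reported with its fraction, while an evenly split vote is flagged as untypable instead of being forced into a call.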

Funding
This study was supported by the:
  • Pfizer
    • Principal Award Recipient: Jonathan T Lee

This is an open-access article distributed under the terms of the Creative Commons Attribution License.
DOI: 10.1099/mgen.0.001033
2023-06-06
2024-03-28

Supplements
  • Supplementary material 1 (PDF)
  • Supplementary material 2 (Excel)
  • Supplementary material 3 (Excel)