
Abstract

Determination of serotypes of Streptococcus pneumoniae is essential for monitoring current vaccine programmes. Since October 2017, pneumococcal serotypes in England have been derived from whole genome sequencing (WGS) data using our bioinformatic tool PneumoCaT. That tool was designed for serotype determination from pure cultures in a reference laboratory. To help determine multiple serotypes in pneumococcal carriage samples, we developed a new software tool named PneumoKITy (Pneumococcal K-mer Integrated Typing) that uses the powerful Mash k-mer screening method for pneumococcal serotyping. Mash k-mer screening is more sequence specific and much faster than the mapping method used in PneumoCaT and can determine 54 (58.1 %) of the 93 serotypes in the SSI Diagnostica phenotypical serotyping scheme to type level, with the remainder called to serogroup or subgroup level (e.g., 11A/D). PneumoKITy can be run on both FastQ and assembly input, requiring up to 11× less memory and running up to 29× faster than the current version of PneumoCaT (1.2.1) on FastQ files. PneumoKITy can be used as a rapid, flexible serotype screening method which adds sensitive detection of mixed serotypes, e.g., for nasopharyngeal carriage studies where the presence of multiple serotypes is common. For pure-culture serotype detection, PneumoKITy can also run from an assembly file, which further increases its speed. This speed potentially enables the software to be run with low infrastructure overhead via web-based platforms. PneumoKITy could be used as a fast initial screening method, with other tools applied where necessary to those serotypes that could not be fully determined to type level. PneumoKITy was found to be highly accurate and sensitive when run on a panel of FastQ files derived from mixed cultures, with all serotypes accurately detected in 47/51 (92.2 %) of samples. PneumoKITy was also able to accurately estimate the relative abundance of serotypes in the same sample, with estimates within a mean of 1.5 % relative abundance of the expected value in mixtures with known concentrations. PneumoKITy was able to detect minor serotypes with an expected abundance of 1 % in the mixtures of known serotypes. PneumoKITy is a rapid, flexible tool with wide-ranging applications outside of the pure-culture, reference laboratory serotyping remit of PneumoCaT.
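
The idea behind k-mer screening of a mixed sample can be illustrated with a toy sketch. This is not PneumoKITy's code and not the Mash algorithm itself: Mash works on MinHash sketches of real capsular locus references, whereas the example below uses exact k-mer sets on short invented sequences. The function names (`kmers`, `screen`, `relative_abundance`), the reference names, the k-mer size and the 0.9 containment cut-off are all assumptions chosen for illustration, as is the use of median k-mer multiplicity as a proxy for relative abundance.

```python
from collections import Counter
from statistics import median


def kmers(seq, k):
    """Return every overlapping k-mer in seq, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def screen(reads, references, k=4, min_containment=0.9):
    """Score each reference by how much of it is contained in the reads.

    Containment is the fraction of a reference's distinct k-mers that occur
    at least once in the reads. For references that pass the cut-off, the
    median read count of the matched k-mers is kept as a depth proxy.
    Returns {name: (containment, median_multiplicity)}.
    """
    read_counts = Counter(km for read in reads for km in kmers(read, k))
    hits = {}
    for name, ref_seq in references.items():
        ref_kmers = set(kmers(ref_seq, k))
        found = [read_counts[km] for km in ref_kmers if read_counts[km] > 0]
        containment = len(found) / len(ref_kmers)
        if containment >= min_containment:
            hits[name] = (containment, median(found))
    return hits


def relative_abundance(hits):
    """Convert median multiplicities into percentages of the total (assumed proxy)."""
    total = sum(mult for _, mult in hits.values())
    return {name: 100 * mult / total for name, (_, mult) in hits.items()}


# Invented reference sequences; they share no 4-mers with each other.
references = {
    "typeA": "ACGTACGGTCA",
    "typeB": "TTGACCATGCA",
    "typeC": "GGGCCCAAATT",
}

# Toy "mixed sample": typeA sequenced three times as deeply as typeB, typeC absent.
reads = [references["typeA"]] * 3 + [references["typeB"]]

hits = screen(reads, references)
abundance = relative_abundance(hits)
print(hits)
print(abundance)
```

In this toy mixture, typeA and typeB are fully contained in the reads and typeC is not reported; the 3:1 read depth translates into abundances of 75 % and 25 %. Real sequencing reads are short, error-prone fragments, so a practical tool needs tolerance for missing k-mers and a probabilistic sketch rather than exact sets.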

  • This information is licensed under the Open Government Licence 3.0. This is an open-access article distributed under the terms of the Creative Commons Attribution License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.
/content/journal/mgen/10.1099/mgen.0.000904
2022-12-14
2024-04-30
Supplements

Supplementary material 1 (PDF)
Supplementary material 2 (Excel)
Supplementary material 3 (Excel)
Supplementary material 4 (Excel)
Supplementary material 5 (Excel)
Supplementary material 6 (Excel)
Supplementary material 7 (PDF)