- Volume 3, Issue 8, 2017
Volume 3, Issue 8, 2017
- Research Article
-
- Systems Microbiology
- Pangenome Analysis, Big-data Approaches
-
-
Phylogenomics and comparative genomics of Lactobacillus salivarius, a mammalian gut commensal
More LessThe genus Lactobacillus is a diverse group with a combined species count of over 200. They are the largest group within the lactic acid bacteria and one of the most important bacterial groups involved in food microbiology and human nutrition because of their fermentative and probiotic properties. Lactobacillus salivarius, a species commonly isolated from the gastrointestinal tract of humans and animals, has been described as having potential probiotic properties and results of previous studies have revealed considerable functional diversity existing on both the chromosomes and plasmids. Our study consists of comparative genomic analyses of the functional and phylogenomic diversity of 42 genomes of strains of L . salivarius using bioinformatic techniques. The main aim of the study was to describe intra-species diversity and to determine how this diversity is spread across the replicons. We found that multiple phylogenomic and non-phylogenomic methods used for reconstructing trees all converge on similar tree topologies, showing that different metrics largely agree on the evolutionary history of the species. The greatest genomic variation lies on the small plasmids, followed by the repA-type circular megaplasmid, with the chromosome varying least of all. Additionally, the presence of extra linear and circular megaplasmids is noted in several strains, while small plasmids are not always present. Glycosyl hydrolases, bacteriocins and proteases vary considerably on all replicons while two exopolysaccharide clusters and several clustered regularly interspaced short palindromic repeats-associated systems show a lot of variation on the chromosome. Overall, despite its reputation as a mammalian gastrointestinal tract specialist, the intra-specific variation of L. salivarius reveals potential strain-dependant effects on human health.
-
- Microbial Evolution and Epidemiology
- Population Genomics
-
-
Whole-genome sequencing to investigate a non-clonal melioidosis cluster on a remote Australian island
Melioidosis is a tropical disease caused by the bacterium Burkholderia pseudomallei. Outbreaks are uncommon and can generally be attributed to a single point source and strain. We used whole-genome sequencing to analyse B. pseudomallei isolates collected from an historical 2-year long case cluster that occurred in a remote northern Australian indigenous island community, where infections were previously linked to a contaminated communal water supply. We analysed the genome-wide relatedness of the two most common multilocus sequence types (STs) involved in the outbreak, STs 125 and 126. This analysis showed that although these STs were closely related on a whole-genome level, they demonstrated evidence of multiple recombination events that were unlikely to have occurred over the timeframe of the outbreak. Based on epidemiological and genetic data, we also identified two additional patients not previously associated with this outbreak. Our results confirm the previous hypothesis that a single unchlorinated water source harbouring multiple B. pseudomallei strains was linked to the outbreak, and that increased melioidosis risk in this community was associated with Piper methysticum root (kava) consumption.
-
- Communicable Disease Genomics
-
-
Emergence and genomic diversification of a virulent serogroup W:ST-2881(CC175) Neisseria meningitidis clone in the African meningitis belt
Countries of the African ‘meningitis belt’ are susceptible to meningococcal meningitis outbreaks. While in the past major epidemics have been primarily caused by serogroup A meningococci, W strains are currently responsible for most of the cases. After an epidemic in Mecca in 2000, W:ST-11 strains have caused many outbreaks worldwide. An unrelated W:ST-2881 clone was described for the first time in 2002, with the first meningitis cases caused by these bacteria reported in 2003. Here we describe results of a comparative whole-genome analysis of 74 W:ST-2881 strains isolated within the framework of two longitudinal colonization and disease studies conducted in Ghana and Burkina Faso. Genomic data indicate that the W:ST-2881 clone has emerged from Y:ST-175(CC175) bacteria by capsule switching. The circulating W:ST-2881 populations were composed of a variety of closely related but distinct genomic variants with no systematic differences between colonization and disease isolates. Two distinct and geographically clustered phylogenetic clonal variants were identified in Burkina Faso and a third in Ghana. On the basis of the presence or absence of 17 recombination fragments, the Ghanaian variant could be differentiated into five clusters. All 25 Ghanaian disease isolates clustered together with 23 out of 40 Ghanaian isolates associated with carriage within one cluster, indicating that W:ST-2881 clusters differ in virulence. More than half of the genes affected by horizontal gene transfer encoded proteins of the ‘cell envelope’ and the ‘transport/binding protein’ categories, which indicates that exchange of non-capsular antigens plays an important role in immune evasion.
-
- Short Paper
-
- Systems Microbiology
- Large-scale Comparative Genomics
-
-
Resolving plasmid structures in Enterobacteriaceae using the MinION nanopore sequencer: assessment of MinION and MinION/Illumina hybrid data assembly approaches
This study aimed to assess the feasibility of using the Oxford Nanopore Technologies (ONT) MinION long-read sequencer in reconstructing fully closed plasmid sequences from eight Enterobacteriaceae isolates of six different species with plasmid populations of varying complexity. Species represented were Escherichia coli, Klebsiella pneumoniae, Citrobacter freundii, Enterobacter cloacae, Serratia marcescens and Klebsiella oxytoca, with plasmid populations ranging from 1–11 plasmids with sizes of 2–330 kb. Isolates were sequenced using Illumina (short-read) and ONT’s MinION (long-read) platforms, and compared with fully resolved PacBio (long-read) sequence assemblies for the same isolates. We compared the performance of different assembly approaches including SPAdes, plasmidSPAdes, hybridSPAdes, Canu, Canu+Pilon (canuPilon) and npScarf in recovering the plasmid structures of these isolates by comparing with the gold-standard PacBio reference sequences. Overall, canuPilon provided consistently good quality assemblies both in terms of assembly statistics (N50, number of contigs) and assembly accuracy [presence of single nucleotide polymorphisms (SNPs)/indels with respect to the reference sequence]. For plasmid reconstruction, Canu recovered 70 % of the plasmids in complete contigs, and combining three assembly approaches (Canu or canuPilon, hybridSPAdes and plasmidSPAdes) resulted in a total 78 % recovery rate for all the plasmids. The analysis demonstrated the potential of using MinION sequencing technology to resolve important plasmid structures in Enterobacteriaceae species independent of and in conjunction with Illumina sequencing data. A consensus assembly derived from several assembly approaches could present significant benefit in accurately resolving the greatest number of plasmid structures.
-
- Review
-
- Microbial evolution and epidemiology
- Population Genomics
-
-
Comparison of classical multi-locus sequence typing software for next-generation sequencing data
Multi-locus sequence typing (MLST) is a widely used method for categorizing bacteria. Increasingly, MLST is being performed using next-generation sequencing (NGS) data by reference laboratories and for clinical diagnostics. Many software applications have been developed to calculate sequence types from NGS data; however, there has been no comprehensive review to date on these methods. We have compared eight of these applications against real and simulated data, and present results on: (1) the accuracy of each method against traditional typing methods, (2) the performance on real outbreak datasets, (3) the impact of contamination and varying depth of coverage, and (4) the computational resource requirements.
-