Volume 2, Issue 7, 2016
- Research Paper
- Systems Microbiology
- Large-scale comparative genomics
Comparative genomics and evolution of transcriptional regulons in Proteobacteria
Comparative genomics approaches are broadly used for analysis of transcriptional regulation in bacterial genomes. In this work, we identified binding sites and reconstructed regulons for 33 orthologous groups of transcription factors (TFs) in 196 reference genomes from 21 taxonomic groups of Proteobacteria. Overall, we predicted over 10 600 TF binding sites and identified more than 15 600 target genes for 1896 TFs constituting the studied orthologous groups of regulators. These include a set of orthologues for 21 metabolism-associated TFs from Escherichia coli and/or Shewanella that are conserved in five or more taxonomic groups and several additional TFs that represent non-orthologous substitutions of the metabolic regulators in some lineages of Proteobacteria. By comparing gene contents of the reconstructed regulons, we identified the core, taxonomy-specific and genome-specific TF regulon members and classified them by their metabolic functions. Detailed analysis of ArgR, TyrR, TrpR, HutC, HypR and other amino-acid-specific regulons demonstrated remarkable differences in regulatory strategies used by various lineages of Proteobacteria. The obtained genomic collection of in silico reconstructed TF regulons contains a large number of new regulatory interactions that await future experimental validation. The collection provides a framework for future evolutionary studies of transcriptional regulatory networks in Bacteria. It can also be used for functional annotation of putative metabolic transporters and enzymes that are abundant in the reconstructed regulons.
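The split of regulon members into core, taxonomy-specific and genome-specific groups can be illustrated with a minimal sketch. It is not the authors' procedure: the function name, the `core_fraction` cut-off, the extra "mixed" label and the input layout (one set of target gene families per genome) are all assumptions made for illustration.

```python
from collections import defaultdict


def classify_regulon_members(regulons, taxon_of, core_fraction=0.8):
    """Label each target gene family of one TF by how widely it is regulated.

    regulons: dict mapping genome name -> set of target gene families.
    taxon_of: dict mapping genome name -> taxonomic group.
    core_fraction: hypothetical cut-off; a family regulated in at least this
        fraction of genomes is called "core".
    """
    # Collect, for every gene family, the genomes in which it is a target.
    genomes_with = defaultdict(set)
    for genome, targets in regulons.items():
        for family in targets:
            genomes_with[family].add(genome)

    n_genomes = len(regulons)
    labels = {}
    for family, genomes in genomes_with.items():
        taxa = {taxon_of[g] for g in genomes}
        if len(genomes) == 1:
            labels[family] = "genome-specific"
        elif len(genomes) / n_genomes >= core_fraction:
            labels[family] = "core"
        elif len(taxa) == 1:
            labels[family] = "taxonomy-specific"
        else:
            # Seen in several taxa but below the core cut-off (assumed label).
            labels[family] = "mixed"
    return labels
```

Called with a dictionary of per-genome target sets and a genome-to-taxon map, the function returns one label per gene family; the cut-off would need tuning against real regulon data.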
Transcriptomic analysis of staphylococcal sRNAs: insights into species-specific adaption and the evolution of pathogenesis
Next-generation sequencing technologies have dramatically increased the rate at which new genomes are sequenced. Accordingly, automated annotation programs have become adept at identifying and annotating protein coding regions, as well as common and conserved RNAs. Additionally, RNAseq techniques have advanced our ability to identify and annotate regulatory RNAs (sRNAs), which remain significantly understudied. Recently, our group catalogued and annotated all previously known and newly identified sRNAs in several Staphylococcus aureus strains. These complete annotation files now serve as tools to compare the sRNA content of S. aureus with other bacterial strains to investigate the conservation of their sRNomes. Accordingly, in this study we performed RNAseq on two staphylococcal species, Staphylococcus epidermidis and Staphylococcus carnosus, identifying 118 and 89 sRNAs in these organisms, respectively. The sRNA contents of all three species were then compared to elucidate their common and species-specific sRNA content, identifying a core set of between 53 and 36 sRNAs encoded in each organism. In addition, we determined that S. aureus has the largest set of unique sRNAs (137) while S. epidermidis has the fewest (25). Finally, we identify a highly conserved sequence and structural motif differentially represented within, yet common to, both S. aureus and S. epidermidis. Collectively, in this study, we uncover the sRNome common to three staphylococcal species, shedding light on sRNAs that are likely to be involved in basic physiological processes common to the genus. More significantly, we have identified species-specific sRNAs that are likely to influence the individual lifestyle and behaviour of these diverse staphylococcal strains.
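The comparison of common and species-specific sRNA content amounts to set operations once every sRNA has been assigned to a shared family. The sketch below assumes that assignment (for example by homology search) has already been done; the function name and the family identifiers are hypothetical and do not come from the study.

```python
def compare_srna_sets(families_by_species):
    """Return (core, unique) for a dict of species -> set of sRNA family IDs.

    core: families present in every species.
    unique: dict of species -> families found in that species only.
    """
    if not families_by_species:
        raise ValueError("at least one species is required")

    sets = {name: set(fams) for name, fams in families_by_species.items()}

    # Families shared by all species form the core sRNome.
    core = set.intersection(*sets.values())

    # Families absent from every other species are species-specific.
    unique = {}
    for name, fams in sets.items():
        others = set().union(*(s for other, s in sets.items() if other != name))
        unique[name] = fams - others
    return core, unique
```

Because a single family can have several copies in one genome, the number of core sRNAs counted per organism can differ even though the core family set is the same, which may be one reason a per-organism range is reported.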
- Microbial evolution and epidemiology
- Mechanisms of evolution
Quantitative assessment of insertion sequence impact on bacterial genome architecture
Insertion sequence (IS) elements are important mediators of genome plasticity and can lead to phenotypic changes with evolutionary significance. In multidrug-resistant Acinetobacter baumannii and Klebsiella pneumoniae, IS elements have contributed significantly to the mobilization of genes that encode resistance to antimicrobial drugs. A systematic analysis of IS elements is needed for a more comprehensive understanding of their evolutionary impact. We developed a computational approach (ISseeker) to annotate IS elements in draft genome assemblies and applied the method to analysis of IS elements in all publicly available A. baumannii (>1000) and K. pneumoniae (>800) genome sequences, in a phylogenetic context. Most IS elements in A. baumannii genomes are species-specific ISAba elements, whereas K. pneumoniae genomes contain significant numbers of both ISKpn elements and elements that are found throughout the Enterobacteriaceae. A. baumannii genomes have a higher density of IS elements than K. pneumoniae, averaging ~33 vs ~27 copies per genome. In K. pneumoniae, several insertion sites are shared by most genomes in the ST258 clade, whereas in A. baumannii, different IS elements are abundant in different phylogenetic groups, even among closely related Global Clone 2 strains. IS elements differ in the distribution of insertion locations relative to genes, with some more likely to disrupt genes and others predominantly in intergenic regions. Several genes and intergenic regions had multiple independent insertion events, suggesting that those events may confer a selective advantage. Genome- and taxon-wide characterization of insertion locations revealed that IS elements have been active contributors to genome diversity in both species.
- Genomic Methodologies
- Genome-phenotype association
In silico serotyping of E. coli from short read data identifies limited novel O-loci but extensive diversity of O:H serotype combinations within and between pathogenic lineages
The lipopolysaccharide (O) and flagellar (H) surface antigens of Escherichia coli are targets for serotyping that have traditionally been used to identify pathogenic lineages. These surface antigens are important for the survival of E. coli within mammalian hosts. However, traditional serotyping has several limitations, and public health reference laboratories are increasingly moving towards whole genome sequencing (WGS) to characterize bacterial isolates. Here we present a method to rapidly and accurately serotype E. coli isolates from raw, short read WGS data. Our approach bypasses the need for de novo genome assembly by directly screening WGS reads against a curated database of alleles linked to known and novel E. coli O-groups and H-types (the EcOH database) using the software package srst2. We validated the approach by comparing in silico results for 197 enteropathogenic E. coli isolates with those obtained by serological phenotyping in an independent laboratory. We then demonstrated the utility of our method to characterize isolates in public health and clinical settings, and to explore the genetic diversity of >1500 E. coli genomes from multiple sources. Importantly, we showed that transfer of O- and H-antigen loci between E. coli chromosomal backbones is common, with little evidence of constraints by host or pathotype, suggesting that E. coli ‘strain space’ may be virtually unlimited, even within specific pathotypes. Our findings show that serotyping is most useful when used in combination with strain genotyping to characterize microevolution events within an inferred population structure.
- Microbial communities
First insight into the faecal microbiota of the high Arctic muskoxen (Ovibos moschatus)
The faecal microbiota of muskoxen (n=3) pasturing on Ryøya (69° 33′ N 18° 43′ E), Norway, in late September was characterized using high-throughput sequencing of partial 16S rRNA gene regions. A total of 16 209 high-quality sequence reads from bacterial domains and 19 462 from archaea were generated. Preliminary taxonomic classifications of 806 bacterial operational taxonomic units (OTUs) resulted in 53.7–59.3 % of the total sequences being without designations beyond the family level. Firmicutes (70.7–81.1 % of the total sequences) and Bacteroidetes (16.8–25.3 %) constituted the two major bacterial phyla, with uncharacterized members within the family Ruminococcaceae (28.9–40.9 %) as the major phylotype. Multiple-library comparisons between muskoxen and other ruminants indicated a higher similarity for muskoxen faeces and reindeer caecum (P>0.05) and some samples from cattle faeces. The archaeal sequences clustered into 37 OTUs, with dominating phylotypes affiliated to the methane-producing genus Methanobrevibacter (80–92 % of the total sequences). UniFrac analysis demonstrated heterogeneity between muskoxen archaeal libraries and those from reindeer and roe deer (P=1.0e-02, Bonferroni corrected), but not with foregut fermenters. The high proportion of cellulose-degrading Ruminococcus-affiliated bacteria agrees with the ingestion of a highly fibrous diet. Further experiments are required to elucidate the role played by these novel bacteria in the digestion of this fibrous Arctic diet eaten by muskoxen.
- Microbial evolution and epidemiology
- Population Genomics
Whole-genome sequencing of a quarter-century melioidosis outbreak in temperate Australia uncovers a region of low-prevalence endemicity
Melioidosis, caused by the highly recombinogenic bacterium Burkholderia pseudomallei, is a disease with high mortality. Tracing the origin of melioidosis outbreaks and understanding how the bacterium spreads and persists in the environment are essential to protecting public and veterinary health and reducing mortality associated with outbreaks. We used whole-genome sequencing to compare isolates from a historical quarter-century outbreak that occurred between 1966 and 1991 in the Avon Valley, Western Australia, a region far outside the known range of B. pseudomallei endemicity. All Avon Valley outbreak isolates shared the same multilocus sequence type (ST-284), which has not been identified outside this region. We found substantial genetic diversity among isolates based on a comparison of genome-wide variants, with no clear correlation between genotypes and temporal, geographical or source data. We observed little evidence of recombination in the outbreak strains, indicating that genetic diversity among these isolates has primarily accrued by mutation. Phylogenomic analysis demonstrated that the isolates confidently grouped within the Australian B. pseudomallei clade, thereby ruling out introduction from a melioidosis-endemic region outside Australia. Collectively, our results point to B. pseudomallei ST-284 being present in the Avon Valley for longer than previously recognized, with its persistence and genomic diversity suggesting long-term, low-prevalence endemicity in this temperate region. Our findings provide a concerning demonstration of the potential for environmental persistence of B. pseudomallei far outside the conventional endemic regions. An expected increase in extreme weather events may reactivate latent B. pseudomallei populations in this region.