- Volume 6, Issue 10, 2020
Volume 6, Issue 10, 2020
- Research Article
-
- Microbial Evolution and Epidemiology
- Population Genomics
-
-
Forensic genomics of a novel Klebsiella quasipneumoniae type from a neonatal intensive care unit in China reveals patterns of colonization, evolution and epidemiology
During March 2017, a neonatal patient with severe diarrhoea subsequently developed septicaemia and died, with Klebsiella isolated as the causative microorganism. In keeping with infection control protocols, the coincident illness of an attending staff member and three other neonates with Klebsiella infection triggered an outbreak response, leading to microbiological assessment of isolates collected from the staff member and all 21 co-housed neonates. Multilocus sequence typing and genomic sequencing identified that the isolates from the 21 neonates were of a new Klebsiella sequence type, ST2727, and taxonomically belonged to K. quasipneumoniae subsp. similipneumoniae (formerly referred to as KpIIB). Genomic characterization showed that the isolated ST2727 strains had diverged from other K. quasipneumoniae subsp. similipneumoniae strains at least 90 years ago, whereas the neonatal samples were highly similar with a genomic divergence of 3.6 months. There was no relationship to the Klebsiella isolate from the staff member. This demonstrates that no transmission occurred from staff to patient or between patients. Rather, the data suggest that ST2727 colonized each neonate from a common hospital source. Sequence-based analysis of the genomes revealed several genes for antimicrobial resistance and some virulence features, but suggest that ST2727 is neither extremely-drug resistant nor hypervirulent. Our results highlight the clinical significance and genomic properties of ST2727 and urge genome-based measures be implemented for diagnostics and surveillance within hospital environments. Additionally, the present study demonstrates the need to scale the power of genomic analysis in retrospective studies where relatively few samples are available.
-
-
-
Effect of arsenite and growth in biofilm conditions on the evolution of Thiomonas sp. CB2
Kelle C. Freel, Stephanie Fouteau, David Roche, Julien Farasin, Aline Huber, Sandrine Koechler, Martina Peres, Olfa Chiboub, Hugo Varet, Caroline Proux, Julien Deschamps, Romain Briandet, Rachel Torchet, Stephane Cruveiller, Didier Lièvremont, Jean-Yves Coppée, Valérie Barbe and Florence Arsène-PloetzeThiomonas bacteria are ubiquitous at acid mine drainage sites and play key roles in the remediation of water at these locations by oxidizing arsenite to arsenate, favouring the sorption of arsenic by iron oxides and their coprecipitation. Understanding the adaptive capacities of these bacteria is crucial to revealing how they persist and remain active in such extreme conditions. Interestingly, it was previously observed that after exposure to arsenite, when grown in a biofilm, some strains of Thiomonas bacteria develop variants that are more resistant to arsenic. Here, we identified the mechanisms involved in the emergence of such variants in biofilms. We found that the percentage of variants generated increased in the presence of high concentrations of arsenite (5.33 mM), especially in the detached cells after growth under biofilm-forming conditions. Analysis of gene expression in the parent strain CB2 revealed that genes involved in DNA repair were upregulated in the conditions where variants were observed. Finally, we assessed the phenotypes and genomes of the subsequent variants generated to evaluate the number of mutations compared to the parent strain. We determined that multiple point mutations accumulated after exposure to arsenite when cells were grown under biofilm conditions. Some of these mutations were found in what is referred to as ICE19, a genomic island (GI) carrying arsenic-resistance genes, also harbouring characteristics of an integrative and conjugative element (ICE). The mutations likely favoured the excision and duplication of this GI. This research aids in understanding how Thiomonas bacteria adapt to highly toxic environments, and, more generally, provides a window to bacterial genome evolution in extreme environments.
-
- Mechanisms of Evolution
-
-
Genomic instability in an interspecific hybrid of the genus Saccharomyces: a matter of adaptability
More LessAncient events of polyploidy have been linked to huge evolutionary leaps in the tree of life, while increasing evidence shows that newly established polyploids have adaptive advantages in certain stress conditions compared to their relatives with a lower ploidy. The genus Saccharomyces is a good model for studying such events, as it contains an ancient whole-genome duplication event and many sequenced Saccharomyces cerevisiae are, evolutionary speaking, newly formed polyploids. Many polyploids have unstable genomes and go through large genome erosions; however, it is still unknown what mechanisms govern this reduction. Here, we sequenced and studied the natural S. cerevisiae × Saccharomyces kudriavzevii hybrid strain, VIN7, which was selected for its commercial use in the wine industry. The most singular observation is that its nuclear genome is highly unstable and drastic genomic alterations were observed in only a few generations, leading to a widening of its phenotypic landscape. To better understand what leads to the loss of certain chromosomes in the VIN7 cell population, we looked for genetic features of the genes, such as physical interactions, complex formation, epistatic interactions and stress responding genes, which could have beneficial or detrimental effects on the cell if their dosage is altered by a chromosomal copy number variation. The three chromosomes lost in our VIN7 population showed different patterns, indicating that multiple factors could explain the mechanisms behind the chromosomal loss. However, one common feature for two out of the three chromosomes is that they are among the smallest ones. We hypothesize that small chromosomes alter their copy numbers more frequently as a low number of genes is affected, meaning that it is a by-product of genome instability, which might be the chief driving force of the adaptability and genome architecture of this hybrid.
-
-
-
Comparative genomics of Clostridioides difficile toxinotypes identifies module-based toxin gene evolution
Clostridioides difficile is a common cause of nosocomial diarrhoea. Toxins TcdA and TcdB are considered to be the main virulence factors and are encoded by the PaLoc region, while the binary toxin encoded in the CdtLoc region also contributes to pathogenicity. Variant toxinotypes reflect the genetic diversity of a key toxin-encoding 19 kb genetic element (the PaLoc). Here, we present analysis of a comprehensive collection of all known major C. difficile toxinotypes to address the evolutionary relationships of the toxin gene variants, the mechanisms underlying the origin and development of variability in toxin genes and the PaLoc, and the relationship between structure and function in TcdB variants. The structure of both toxin genes is modular, composed of interspersed blocks of sequences corresponding to functional domains and having different evolutionary histories, as shown by the distribution of mutations along the toxin genes and by incongruences of domain phylogenies compared to overall C. difficile cluster organization. In TcdB protein, four mutation patterns could be differentiated, which correlated very well with the type of TcdB cytopathic effect (CPE) on cultured cells. Mapping these mutations to the three-dimensional structure of the TcdB showed that the majority of the variation occurs in surface residues and that point mutation at residue 449 in alpha helix 16 differentiated strains with different types of CPE. In contrast to the PaLoc, phylogenetic trees of the CdtLoc were more consistent with the core genome phylogenies, but there were clues that CdtLoc can also be exchanged between strains.
-
- Genomic Methodologies
- Data Clustering Methods
-
-
Analysis of the diversity of the glycoside hydrolase family 130 in mammal gut microbiomes reveals a novel mannoside-phosphorylase function
Mannoside phosphorylases are involved in the intracellular metabolization of mannooligosaccharides, and are also useful enzymes for the in vitro synthesis of oligosaccharides. They are found in glycoside hydrolase family GH130. Here we report on an analysis of 6308 GH130 sequences, including 4714 from the human, bovine, porcine and murine microbiomes. Using sequence similarity networks, we divided the diversity of sequences into 15 mostly isofunctional meta-nodes; of these, 9 contained no experimentally characterized member. By examining the multiple sequence alignments in each meta-node, we predicted the determinants of the phosphorolytic mechanism and linkage specificity. We thus hypothesized that eight uncharacterized meta-nodes would be phosphorylases. These sequences are characterized by the absence of signal peptides and of the catalytic base. Those sequences with the conserved E/K, E/R and Y/R pairs of residues involved in substrate binding would target β-1,2-, β-1,3- and β-1,4-linked mannosyl residues, respectively. These predictions were tested by characterizing members of three of the uncharacterized meta-nodes from gut bacteria. We discovered the first known β-1,4-mannosyl-glucuronic acid phosphorylase, which targets a motif of the Shigella lipopolysaccharide O-antigen. This work uncovers a reliable strategy for the discovery of novel mannoside-phosphorylases, reveals possible interactions between gut bacteria, and identifies a biotechnological tool for the synthesis of antigenic oligosaccharides.
-
-
-
Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance
More LessBacterial plasmids play a large role in allowing bacteria to adapt to changing environments and can pose a significant risk to human health if they confer virulence and antimicrobial resistance (AMR). Plasmids differ significantly in the taxonomic breadth of host bacteria in which they can successfully replicate, this is commonly referred to as ‘host range’ and is usually described in qualitative terms of ‘narrow’ or ‘broad’. Understanding the host range potential of plasmids is of great interest due to their ability to disseminate traits such as AMR through bacterial populations and into human pathogens. We developed the MOB-suite to facilitate characterization of plasmids and introduced a whole-sequence-based classification system based on clustering complete plasmid sequences using Mash distances (https://github.com/phac-nml/mob-suite). We updated the MOB-suite database from 12 091 to 23 671 complete sequences, representing 17 779 unique plasmids. With advances in new algorithms for rapidly calculating average nucleotide identity (ANI), we compared clustering characteristics using two different distance measures – Mash and ANI – and three clustering algorithms on the unique set of plasmids. The plasmid nomenclature is designed to group highly similar plasmids together that are unlikely to have multiple representatives within a single cell. Based on our results, we determined that clusters generated using Mash and complete-linkage clustering at a Mash distance of 0.06 resulted in highly homogeneous clusters while maintaining cluster size. The taxonomic distribution of plasmid biomarker sequences for replication and relaxase typing, in combination with MOB-suite whole-sequence-based clusters have been examined in detail for all high-quality publicly available plasmid sequences. We have incorporated prediction of plasmid replication host range into the MOB-suite based on observed distributions of these sequence features in combination with known plasmid hosts from the literature. Host range is reported as the highest taxonomic rank that covers all of the plasmids which share replicon or relaxase biomarkers or belong to the same MOB-suite cluster code. Reporting host range based on these criteria allows for comparisons of host range between studies and provides information for plasmid surveillance.
-
-
-
Metagenome-assembled genome binning methods with short reads disproportionately fail for plasmids and genomic Islands
Metagenomic methods enable the simultaneous characterization of microbial communities without time-consuming and bias-inducing culturing. Metagenome-assembled genome (MAG) binning methods aim to reassemble individual genomes from this data. However, the recovery of mobile genetic elements (MGEs), such as plasmids and genomic islands (GIs), by binning has not been well characterized. Given the association of antimicrobial resistance (AMR) genes and virulence factor (VF) genes with MGEs, studying their transmission is a public-health priority. The variable copy number and sequence composition of MGEs makes them potentially problematic for MAG binning methods. To systematically investigate this issue, we simulated a low-complexity metagenome comprising 30 GI-rich and plasmid-containing bacterial genomes. MAGs were then recovered using 12 current prediction pipelines and evaluated. While 82–94 % of chromosomes could be correctly recovered and binned, only 38–44 % of GIs and 1–29 % of plasmid sequences were found. Strikingly, no plasmid-borne VF nor AMR genes were recovered, and only 0–45 % of AMR or VF genes within GIs. We conclude that short-read MAG approaches, without further optimization, are largely ineffective for the analysis of mobile genes, including those of public-health importance, such as AMR and VF genes. We propose that researchers should explore developing methods that optimize for this issue and consider also using unassembled short reads and/or long-read approaches to more fully characterize metagenomic data.
-
- Systems Microbiology
- Large-scale Comparative Genomics
-
-
Comprehensive genome data analysis establishes a triple whammy of carbapenemases, ICEs and multiple clinically relevant bacteria
More LessCarbapenemases inactivate most β-lactam antibiotics, including carbapenems, and have frequently been reported among Enterobacteriaceae , Acinetobacter spp. and Pseudomonas spp. Traditionally, the horizontal gene transfer of carbapenemase-encoding genes (CEGs) has been linked to plasmids. However, given that integrative and conjugative elements (ICEs) are possibly the most abundant conjugative elements among prokaryotes, we conducted an in silico analysis to ascertain the likely role of ICEs in the spread of CEGs among all bacterial genomes (n=182 663). We detected 17 520 CEGs, of which 66 were located within putative ICEs among several bacterial species (including clinically relevant bacteria, such as Pseudomonas aeruginosa , Klebsiella pneumoniae and Escherichia coli ). Most CEGs detected within ICEs belong to the IMP, NDM and SPM metallo-beta-lactamase families, and the serine beta-lactamase KPC and GES families. Different mechanisms were likely responsible for acquisition of these genes. The majority of CEG-bearing ICEs belong to the MPFG, MPFT and MPFF classes and often encode resistance to other antibiotics (e.g. aminoglycosides and fluoroquinolones). This study provides a snapshot of the different CEGs associated with ICEs among available bacterial genomes and sheds light on the underappreciated contribution of ICEs to the spread of carbapenem resistance globally.
-
-
-
Phylogenetic and genomic analysis reveals high genomic openness and genetic diversity of Clostridium perfringens
Clostridium perfringens is associated with a variety of diseases in both humans and animals. Recent advances in genomic sequencing make it timely to re-visit this important pathogen. Although the genome sequence of C. perfringens was first determined in 2002, large-scale comparative genomics with isolates of different origins is still lacking. In this study, we used whole-genome sequencing of 45 C . perfringens isolates with isolation time spanning an 80‐year period and performed comparative analysis of 173 genomes from worldwide strains. We also conducted phylogenetic lineage analysis and introduced an openness index (OI) to evaluate the openness of bacterial genomes. We classified all these genomes into five lineages and hypothesized that the origin of C. perfringens dates back to ~80 000 years ago. We showed that the pangenome of the 173 C . perfringens strains contained a total of 26 954 genes, while the core genome comprised 1020 genes, accounting for about a third of the genome of each isolate. We demonstrated that C. perfringens had the highest OI compared with 51 other bacterial species. Intact prophage sequences were found in nearly 70.0 % of C. perfringens genomes, while CRISPR sequences were found only in ~40.0 %. Plasmids were prevalent in C. perfringens isolates, and half of the virulence genes and antibiotic resistance genes (ARGs) identified in all the isolates could be found in plasmids. ARG-sharing network analysis showed that C. perfringens shared its 11 ARGs with 55 different bacterial species, and a high frequency of ARG transfer may have occurred between C. perfringens and species in the genera Streptococcus and Staphylococcus . Correlation analysis showed that the ARG number in C. perfringens strains increased with time, while the virulence gene number was relative stable. Our results, taken together with previous studies, revealed the high genome openness and genetic diversity of C. perfringens and provide a comprehensive view of the phylogeny, genomic features, virulence gene and ARG profiles of worldwide strains.
-
- Transcriptomics, Proteomics, Networks
-
-
Transcriptomic analysis of cyanobacterial alkane overproduction reveals stress-related genes and inhibitors of lipid droplet formation
More LessThe cyanobacterium Nostoc punctiforme can form lipid droplets (LDs), internal inclusions containing triacylglycerols, carotenoids and alkanes. LDs are enriched for a 17 carbon-long alkane in N. punctiforme , and it has been shown that the overexpression of the aar and ado genes results in increased LD and alkane production. To identify transcriptional adaptations associated with increased alkane production, we performed comparative transcriptomic analysis of an alkane overproduction strain. RNA-seq data identified a large number of highly upregulated genes in the overproduction strain, including genes potentially involved in rRNA processing, mycosporine-glycine production and synthesis of non-ribosomal peptides, including nostopeptolide A. Other genes encoding helical carotenoid proteins, stress-induced proteins and those for microviridin synthesis were also upregulated. Construction of N. punctiforme strains with several upregulated genes or operons on multi-copy plasmids resulted in reduced alkane accumulation, indicating possible negative regulators of alkane production. A strain containing four genes for microviridin biosynthesis completely lost the ability to synthesize LDs. This strain exhibited wild-type growth and lag phase recovery under standard conditions, and slightly faster growth under high light. The transcriptional changes associated with increased alkane production identified in this work will provide the basis for future experiments designed to use cyanobacteria as a production platform for biofuel or high-value hydrophobic products.
-
-
-
Expanded roles of pyruvate-sensing PdhR in transcription regulation of the Escherichia coli K-12 genome: fatty acid catabolism and cell motility
More LessThe transcription factor PdhR has been recognized as the master regulator of the pyruvate catabolism pathway in Escherichia coli , including both NAD-linked oxidative decarboxylation of pyruvate to acetyl-CoA by PDHc (pyruvate dehydrogenase complex) and respiratory electron transport of NADH to oxygen by Ndh-CyoABCD enzymes. To identify the whole set of regulatory targets under the control of pyruvate-sensing PdhR, we performed genomic SELEX (gSELEX) screening in vitro. A total of 35 PdhR-binding sites were identified along the E. coli K-12 genome, including previously identified targets. Possible involvement of PdhR in regulation of the newly identified target genes was analysed in detail by gel shift assay, RT-qPCR and Northern blot analysis. The results indicated the participation of PdhR in positive regulation of fatty acid degradation genes and negative regulation of cell mobility genes. In fact, GC analysis indicated an increase in free fatty acids in the mutant lacking PdhR. We propose that PdhR is a bifunctional global regulator for control of a total of 16–23 targets, including not only the genes involved in central carbon metabolism but also some genes for the surrounding pyruvate-sensing cellular pathways such as fatty acid degradation and flagella formation. The activity of PdhR is controlled by pyruvate, the key node between a wide variety of metabolic pathways, including generation of metabolic energy and cell building blocks.
-
-
-
Transcriptome-wide expression profiling of Sporothrix schenckii yeast and mycelial forms and the establishment of the Sporothrix Genome DataBase
Sporothrix schenckii is a dimorphic fungus existing as mould in the environment and as yeast in the host. The morphological shift between mycelial/yeast phases is crucial for its virulence, but the transcriptional networks implicated in dimorphic transition are still not fully understood. Here, we report the global transcriptomic differences occurring between mould and yeast phases of S. schenckii, including changes in gene expression profiles associated with these distinct cellular phenotypes. Moreover, we also propose a new genome annotation, which reveals a more complex transcriptional architecture than previously assumed. Using RNA-seq, we identified a total of 17 307 genes, of which 11 217 were classified as protein-encoding genes, whereas 6090 were designated as non-coding RNAs (ncRNAs). Approximately ~71 % of all annotated genes were found to overlap and the different-strand overlapping type was the most common. Gene expression analysis revealed that 8795 genes were differentially regulated among yeast and mould forms. Differential gene expression was also observed for antisense ncRNAs overlapping neighbouring protein-encoding genes. The release of transcriptome-wide data and the establishment of the Sporothrix Genome DataBase (http://sporothrixgenomedatabase.unime.it) represent an important milestone for Sporothrix research, because they provide a strong basis for future studies on the molecular pathways involved in numerous biological processes.
-
- Genome Annotation, Metabolic Reconstructions
-
-
Platon: identification and characterization of bacterial plasmid contigs in short-read draft assemblies exploiting protein sequence-based replicon distribution scores
Plasmids are extrachromosomal genetic elements that replicate independently of the chromosome and play a vital role in the environmental adaptation of bacteria. Due to potential mobilization or conjugation capabilities, plasmids are important genetic vehicles for antimicrobial resistance genes and virulence factors with huge and increasing clinical implications. They are therefore subject to large genomic studies within the scientific community worldwide. As a result of rapidly improving next-generation sequencing methods, the quantity of sequenced bacterial genomes is constantly increasing, in turn raising the need for specialized tools to (i) extract plasmid sequences from draft assemblies, (ii) derive their origin and distribution, and (iii) further investigate their genetic repertoire. Recently, several bioinformatic methods and tools have emerged to tackle this issue; however, a combination of high sensitivity and specificity in plasmid sequence identification is rarely achieved in a taxon-independent manner. In addition, many software tools are not appropriate for large high-throughput analyses or cannot be included in existing software pipelines due to their technical design or software implementation. In this study, we investigated differences in the replicon distributions of protein-coding genes on a large scale as a new approach to distinguish plasmid-borne from chromosome-borne contigs. We defined and computed statistical discrimination thresholds for a new metric: the replicon distribution score (RDS), which achieved an accuracy of 96.6 %. The final performance was further improved by the combination of the RDS metric with heuristics exploiting several plasmid-specific higher-level contig characterizations. We implemented this workflow in a new high-throughput taxon-independent bioinformatics software tool called Platon for the recruitment and characterization of plasmid-borne contigs from short-read draft assemblies. Compared to PlasFlow, Platon achieved a higher accuracy (97.5 %) and more balanced predictions (F1=82.6 %) tested on a broad range of bacterial taxa and better or equal performance against the targeted tools PlasmidFinder and PlaScope on sequenced Escherichia coli isolates. Platon is available at: http://platon.computational.bio/.
-
- Short Communication
-
- Microbial Evolution and Epidemiology
- Population Genomics
-
-
Nuclear and mitochondrial genome sequencing of North-African Leishmania infantum isolates from cured and relapsed visceral leishmaniasis patients reveals variations correlating with geography and phenotype
Although several studies have investigated genetic diversity of Leishmania infantum in North Africa, genome-wide analyses are lacking. Here, we conducted comparative analyses of nuclear and mitochondrial genomes of seven L. infantum isolates from Tunisia with the aim to gain insight into factors that drive genomic and phenotypic adaptation. Isolates were from cured (n=4) and recurrent (n=3) visceral leishmaniasis (VL) cases, originating from northern (n=2) and central (n=5) Tunisia, where respectively stable and emerging VL foci are observed. All isolates from relapsed patients were from Kairouan governorate (Centre); one showing resistance to the anti-leishmanial drug Meglumine antimoniate. Nuclear genome diversity of the isolates was analysed by comparison to the L. infantum JPCM5 reference genome. Kinetoplast maxi and minicircle sequences (1 and 59, respectively) were extracted from unmapped reads and identified by blast analysis against public data sets. The genome variation analysis grouped together isolates from the same geographical origins. Strains from the North were very different from the reference showing more than 34 587 specific single nucleotide variants, with one isolate representing a full genetic hybrid as judged by variant frequency. Composition of minicircle classes within isolates corroborated this geographical population structure. Read depth analysis revealed several significant gene copy number variations correlating with either geographical origin (amastin and Hsp33 genes) or relapse (CLN3 gene). However, no specific gene copy number variation was found in the drug-resistant isolate. In contrast, resistance was associated with a specific minicircle pattern suggesting Leishmania mitochondrial DNA as a potential novel source for biomarker discovery.
-
- Method
-
- Microbial Communities
- Other - Animals, Insects, Plants
-
-
Accurate quantification of bacterial abundance in metagenomic DNAs accounting for variable DNA integrity levels
The quantification of the total microbial content in metagenomic samples is critical for investigating the interplay between the microbiome and its host, as well as for assessing the accuracy and precision of the relative microbial composition which can be strongly biased in low microbial biomass samples. In the present study, we demonstrate that digital droplet PCR (ddPCR) can provide accurate quantification of the total copy number of the 16S rRNA gene, the gene usually exploited for assessing total bacterial abundance in metagenomic DNA samples. Notably, using DNA templates with different integrity levels, as measured by the DNA integrity number (DIN), we demonstrated that 16S rRNA copy number quantification is strongly affected by DNA quality and determined a precise correlation between quantification underestimation and DNA degradation levels. Therefore, we propose an input DNA mass correction, according to the observed DIN value, which could prevent inaccurate quantification of 16S copy number in degraded metagenomic DNAs. Our results highlight that a preliminary evaluation of the metagenomic DNA integrity should be considered before performing metagenomic analyses of different samples, both for the assessment of the reliability of observed differential abundances in different conditions and to obtain significant functional insights.
-