-
Volume 6,
Issue 8,
2020
Volume 6, Issue 8, 2020
- Review
-
- Genomic Methodologies
- Data Clustering Methods
-
-
Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses
More LessMetagenomics and marker gene approaches, coupled with high-throughput sequencing technologies, have revolutionized the field of microbial ecology. Metagenomics is a culture-independent method that allows the identification and characterization of organisms from all kinds of samples. Whole-genome shotgun sequencing analyses the total DNA of a chosen sample to determine the presence of micro-organisms from all domains of life and their genomic content. Importantly, the whole-genome shotgun sequencing approach reveals the genomic diversity present, but can also give insights into the functional potential of the micro-organisms identified. The marker gene approach is based on the sequencing of a specific gene region. It allows one to describe the microbial composition based on the taxonomic groups present in the sample. It is frequently used to analyse the biodiversity of microbial ecosystems. Despite its importance, the analysis of metagenomic sequencing and marker gene data is quite a challenge. Here we review the primary workflows and software used for both approaches and discuss the current challenges in the field.
-
- Research Article
-
- Microbial Evolution and Epidemiology
- Population Genomics
-
-
A publicly accessible database for Clostridioides difficile genome sequences supports tracing of transmission chains and epidemics
Martinique Frentrup, Zhemin Zhou, Matthias Steglich, Jan P. Meier-Kolthoff, Markus Göker, Thomas Riedel, Boyke Bunk, Cathrin Spröer, Jörg Overmann, Marion Blaschitz, Alexander Indra, Lutz von Müller, Thomas A. Kohl, Stefan Niemann, Christian Seyboldt, Frank Klawonn, Nitin Kumar, Trevor D. Lawley, Sergio García-Fernández, Rafael Cantón, Rosa del Campo, Ortrud Zimmermann, Uwe Groß, Mark Achtman and Ulrich NübelClostridioides difficile is the primary infectious cause of antibiotic-associated diarrhea. Local transmissions and international outbreaks of this pathogen have been previously elucidated by bacterial whole-genome sequencing, but comparative genomic analyses at the global scale were hampered by the lack of specific bioinformatic tools. Here we introduce a publicly accessible database within EnteroBase (http://enterobase.warwick.ac.uk) that automatically retrieves and assembles C. difficile short-reads from the public domain, and calls alleles for core-genome multilocus sequence typing (cgMLST). We demonstrate that comparable levels of resolution and precision are attained by EnteroBase cgMLST and single-nucleotide polymorphism analysis. EnteroBase currently contains 18 254 quality-controlled C. difficile genomes, which have been assigned to hierarchical sets of single-linkage clusters by cgMLST distances. This hierarchical clustering is used to identify and name populations of C. difficile at all epidemiological levels, from recent transmission chains through to epidemic and endemic strains. Moreover, it puts newly collected isolates into phylogenetic and epidemiological context by identifying related strains among all previously published genome data. For example, HC2 clusters (i.e. chains of genomes with pairwise distances of up to two cgMLST alleles) were statistically associated with specific hospitals (P<10−4) or single wards (P=0.01) within hospitals, indicating they represented local transmission clusters. We also detected several HC2 clusters spanning more than one hospital that by retrospective epidemiological analysis were confirmed to be associated with inter-hospital patient transfers. In contrast, clustering at level HC150 correlated with k-mer-based classification and was largely compatible with PCR ribotyping, thus enabling comparisons to earlier surveillance data. EnteroBase enables contextual interpretation of a growing collection of assembled, quality-controlled C. difficile genome sequences and their associated metadata. Hierarchical clustering rapidly identifies database entries that are related at multiple levels of genetic distance, facilitating communication among researchers, clinicians and public-health officials who are combatting disease caused by C. difficile .
-
- Microbe-Niche Interactions
- Pathogenesis
-
-
Whole-genome analyses reveal gene content differences between nontypeable Haemophilus influenzae isolates from chronic obstructive pulmonary disease compared to other clinical phenotypes
Nontypeable Haemophilus influenzae (NTHi) colonizes human upper respiratory airways and plays a key role in the course and pathogenesis of acute exacerbations of chronic obstructive pulmonary disease (COPD). Currently, it is not possible to distinguish COPD isolates of NTHi from other clinical isolates of NTHi using conventional genotyping methods. Here, we analysed the core and accessory genome of 568 NTHi isolates, including 40 newly sequenced isolates, to look for genetic distinctions between NTHi isolates from COPD with respect to other illnesses, including otitis media, meningitis and pneumonia. Phylogenies based on polymorphic sites in the core-genome did not show discrimination between NTHi strains collected from different clinical phenotypes. However, pan-genome-wide association studies identified 79 unique NTHi accessory genes that were significantly associated with COPD. Furthermore, many of the COPD-related NTHi genes have known or predicted roles in virulence, transmembrane transport of metal ions and nutrients, cellular respiration and maintenance of redox homeostasis. This indicates that specific genes may be required by NTHi for its survival or virulence in the COPD lung. These results advance our understanding of the pathogenesis of NTHi infection in COPD lungs.
-
- Host Adaptation
-
-
The compact genome of Giardia muris reveals important steps in the evolution of intestinal protozoan parasites
Diplomonad parasites of the genus Giardia have adapted to colonizing different hosts, most notably the intestinal tract of mammals. The human-pathogenic Giardia species, Giardia intestinalis, has been extensively studied at the genome and gene expression level, but no such information is available for other Giardia species. Comparative data would be particularly valuable for Giardia muris, which colonizes mice and is commonly used as a prototypic in vivo model for investigating host responses to intestinal parasitic infection. Here we report the draft-genome of G. muris. We discovered a highly streamlined genome, amongst the most densely encoded ever described for a nuclear eukaryotic genome. G. muris and G. intestinalis share many known or predicted virulence factors, including cysteine proteases and a large repertoire of cysteine-rich surface proteins involved in antigenic variation. Different to G. intestinalis, G. muris maintains tandem arrays of pseudogenized surface antigens at the telomeres, whereas intact surface antigens are present centrally in the chromosomes. The two classes of surface antigens engage in genetic exchange. Reconstruction of metabolic pathways from the G. muris genome suggest significant metabolic differences to G. intestinalis. Additionally, G. muris encodes proteins that might be used to modulate the prokaryotic microbiota. The responsible genes have been introduced in the Giardia genus via lateral gene transfer from prokaryotic sources. Our findings point to important evolutionary steps in the Giardia genus as it adapted to different hosts and it provides a powerful foundation for mechanistic exploration of host–pathogen interaction in the G. muris–mouse pathosystem.
-
-
-
Comparative genome analysis of Erysipelothrix rhusiopathiae isolated from domestic pigs and wild boars suggests host adaptation and selective pressure from the use of antibiotics
The disease erysipelas caused by Erysipelothrix rhusiopathiae (ER) is a major concern in pig production. In the present study the genomes of ER from pigs (n=87), wild boars (n=71) and other sources (n=85) were compared in terms of whole-genome SNP variation, accessory genome content and the presence of genetic antibiotic resistance determinants. The aim was to investigate if genetic features among ER were associated with isolate origin in order to better estimate the risk of transmission of porcine-adapted strains from wild boars to free-range pigs and to increase our understanding of the evolution of ER. Pigs and wild boars carried isolates representing all ER clades, but clade one only occurred in healthy wild boars and healthy pigs. Several accessory genes or gene variants were found to be significantly associated with the pig and wild boar hosts, with genes predicted to encode cell wall-associated or extracellular proteins overrepresented. Gene variants associated with serovar determination and capsule production in serovars known to be pathogenic for pigs were found to be significantly associated with pigs as hosts. In total, 30 % of investigated pig isolates but only 6 % of wild boar isolates carried resistance genes, most commonly tetM (tetracycline) and lsa(E) together with lnu(B) (lincosamides, pleuromutilin and streptogramin A). The incidence of variably present genes including resistance determinants was weakly linked to phylogeny, indicating that host adaptation in ER has evolved multiple times in diverse lineages mediated by recombination and the acquisition of mobile genetic elements. The presented results support the occurrence of host-adapted ER strains, but they do not indicate frequent transmission between wild boars and domestic pigs. This article contains data hosted by Microreact.
-
- Genomic Methodologies
- Genome Variation Detection
-
-
Genomic variant-identification methods may alter Mycobacterium tuberculosis transmission inferences
Pathogen genomic data are increasingly used to characterize global and local transmission patterns of important human pathogens and to inform public health interventions. Yet, there is no current consensus on how to measure genomic variation. To test the effect of the variant-identification approach on transmission inferences for Mycobacterium tuberculosis, we conducted an experiment in which five genomic epidemiology groups applied variant-identification pipelines to the same outbreak sequence data. We compared the variants identified by each group in addition to transmission and phylogenetic inferences made with each variant set. To measure the performance of commonly used variant-identification tools, we simulated an outbreak. We compared the performance of three mapping algorithms, five variant callers and two variant filters in recovering true outbreak variants. Finally, we investigated the effect of applying increasingly stringent filters on transmission inferences and phylogenies. We found that variant-calling approaches used by different groups do not recover consistent sets of variants, which can lead to conflicting transmission inferences. Further, performance in recovering true variation varied widely across approaches. While no single variant-identification approach outperforms others in both recovering true genome-wide and outbreak-level variation, variant-identification algorithms calibrated upon real sequence data or that incorporate local reassembly outperform others in recovering true pairwise differences between isolates. The choice of variant filters contributed to extensive differences across pipelines, and applying increasingly stringent filters rapidly eroded the accuracy of transmission inferences and quality of phylogenies reconstructed from outbreak variation. Commonly used approaches to identify M. tuberculosis genomic variation have variable performance, particularly when predicting potential transmission links from pairwise genetic distances. Phylogenetic reconstruction may be improved by less stringent variant filtering. Approaches that improve variant identification in repetitive, hypervariable regions, such as long-read assemblies, may improve transmission inference.
-
- Genome-phenotype Association
-
-
Genomic analyses of Staphylococcus aureus clonal complex 45 isolates does not distinguish nasal carriage from bacteraemia
Staphylococcus aureus is a colonizing opportunistic pathogen and a leading cause of bloodstream infection with high morbidity and mortality. S. aureus carriage frequency is reportedly between 20 and 40 % among healthy adults, with S. aureus colonization considered to be a risk factor for S. aureus bacteraemia. It is unknown whether a genetic component of the bacterium is associated with S. aureus bacteraemia in comparison to nasal carriage strains. Previous association studies primarily focusing on the clinical outcome of an S. aureus infection have produced conflicting results, often limited by study design challenged by sample collections and the clonal diversity of S. aureus . To date, no study has investigated whether genomic features separate nasal carriage isolates from S. aureus bacteraemia isolates within a single clonal lineage. Here we have investigated whether genomic features, including single-nucleotide polymorphisms (SNPs), genes, or kmers, distinguish S. aureus nasal carriage isolates from bacteraemia isolates that all belong to the same clonal lineage [clonal complex 45 (CC45)] using whole-genome sequencing (WGS) and a genome-wide association (GWA) approach. From CC45, 100 isolates (50 bacteraemia and 50 nasal carriage, geographically and temporally matched) from Denmark were whole-genome sequenced and subjected to GWA analyses involving gene copy number variation, SNPs, gene content, kmers and gene combinations, while correcting for lineage effects. No statistically significant association involving SNPs, specific genes, gene variants, gene copy number variation, or a combination of genes was identified that could distinguish bacteraemia isolates from nasal carriage isolates. The presented results suggest that all S. aureus nasal CC45 isolates carry the potential to cause invasive disease, as no core or accessory genome content or variations were statistically associated with invasiveness.
-
- Systems Microbiology
- Transcriptomics, Proteomics, Networks
-
-
Defining the transcriptional responses of Aspergillus nidulans to cation/alkaline pH stress and the role of the transcription factor SltA
More LessFungi have developed the ability to overcome extreme growth conditions and thrive in hostile environments. The model fungus Aspergillus nidulans tolerates, for example, ambient alkalinity up to pH 10 or molar concentrations of multiple cations. The ability to grow under alkaline pH or saline stress depends on the effective function of at least three regulatory pathways mediated by the zinc-finger transcription factor PacC, which mediates the ambient pH regulatory pathway, the calcineurin-dependent CrzA and the cation homeostasis responsive factor SltA. Using RNA sequencing, we determined the effect of external pH alkalinization or sodium stress on gene expression. The data show that each condition triggers transcriptional responses with a low degree of overlap. By sequencing the transcriptomes of the null mutant, the role of SltA in the above-mentioned homeostasis mechanisms was also studied. The results show that the transcriptional role of SltA is wider than initially expected and implies, for example, the positive control of the PacC-dependent ambient pH regulatory pathway. Overall, our data strongly suggest that the stress response pathways in fungi include some common but mostly exclusive constituents, and that there is a hierarchical relationship among the main regulators of stress response, with SltA controlling pacC expression, at least in A. nidulans.
-
- Short Communication
-
- Microbial Evolution and Epidemiology
- Population Genomics
-
-
Community-acquired infection caused by the uncommon hypervirulent Klebsiella pneumoniae ST66-K2 lineage
Klebsiella pneumoniae (Kp) reference strain Kp52.145 is widely used in experimental Klebsiella pathophysiology. Since 1935, only one other strain of the same sublineage (sequence type ST66, capsular serotype K2) was isolated (AJ210, Australia). Here, we describe a community-acquired invasive infection caused by a ST66-K2 Kp strain in France. Four hypermucoviscous Kp isolates responsible for acute otitis media, meningitis, bacteraemia and bacteriuria, respectively, were obtained from a patient with a history of chronic alcoholism and diabetes mellitus, and infected with HIV. The isolates were characterized by phenotypic and genomic methods. The four genetically identical ST66-K2 isolates presented a full antimicrobial susceptibility profile, including to ampicillin, corresponding to a single strain (SB5881), which was more closely related to AJ210 (135 SNPs) than to Kp52.145 (388 SNPs). Colibactin and yersiniabactin gene clusters were present on the integrative and conjugative element ICEKp10 in the chromosome. The two plasmids from Kp52.145 were detected in SB5881. In addition to carrying genes for virulence factors RmpA, aerobactin and salmochelin, plasmid II has acquired in SB5881, the conjugation machinery gene cluster from plasmid I. We report the first case of community-acquired infection caused by a hypervirulent ST66-K2 Kp strain in Europe. This demonstrates the long-term persistence of the high-virulence and laboratory model ST66-K2 sublineage. The combination of a conjugative apparatus and major virulence genes on a single plasmid may contribute to the co-occurrence of hypervirulence and multidrug resistance in single Kp strains.
-
- Communicable Disease Genomics
-
-
Mycobacterium bovis genomics reveals transmission of infection between cattle and deer in Ireland
Control of bovine tuberculosis (bTB), caused by Mycobacterium bovis , in the Republic of Ireland costs €84 million each year. Badgers are recognized as being a wildlife source for M. bovis infection of cattle. Deer are thought to act as spillover hosts for infection; however, population density is recognized as an important driver in shifting their epidemiological role, and deer populations across the country have been increasing in density and range. County Wicklow represents one specific area in the Republic of Ireland with a high density of deer that has had consistently high bTB prevalence for over a decade, despite control operations in both cattle and badgers. Our research used whole-genome sequencing of M. bovis sourced from infected cattle, deer and badgers in County Wicklow to evaluate whether the epidemiological role of deer could have shifted from spillover host to source. Our analyses reveal that cattle and deer share highly similar M. bovis strains, suggesting that transmission between these species is occurring in the area. In addition, the high level of diversity observed in the sampled deer population suggests deer may be acting as a source of infection for local cattle populations. These findings have important implications for the control and ultimate eradication of bTB in Ireland.
-
Most Read This Month
![Loading](/images/jp/spinner.gif)