- Volume 7, Issue 12, 2021
Volume 7, Issue 12, 2021
- Outbreak Reports
-
- Pathogens and Epidemiology
-
-
Using whole-genome sequencing to assess the diversity of Paenibacillus larvae within an outbreak and a beekeeping operation
More LessThe spore-forming bacterium Paenibacillus larvae is the causative agent of American foulbrood (AFB), a devastating disease of honeybees (Apis mellifera). In the present study, we used whole-genome sequencing (WGS) to investigate an extensive outbreak of AFB in northwestern Slovenia in 2019. A total of 59 P . larvae isolates underwent WGS, of which 40 originated from a single beekeeping operation, to assess the diversity of P. larvae within the beekeeping operation, apiary and colony. By applying a case-specific single-linkage threshold of 34 allele differences (AD), whole-genome multilocus sequence typing (wgMLST) identified two outbreak clusters represented by ERIC II-ST11 clones. All isolates from a single beekeeping operation fell within cluster 1 and the median pairwise AD between them was 10 (range=1–22). The median pairwise AD for apiaries of the same beekeeping operation ranged from 8 to 11 (min.=1, max.=22). For colonies of the same apiary and honey samples from these colonies, the median pairwise AD ranged from 8 to 14 (min.=1, max.=20). The maximum within-cluster distance was 33 pairwise AD for cluster 1 and 44 for cluster 2 isolates. The minimum distance between the outbreak-related and non-related isolates was 37 AD, confirming the importance of associated epidemiological data for delineating outbreak clusters. The observed transmission events could be explained by the activities of honeybees and beekeepers. The present study provides insight into the genetic diversity of P. larvae at different levels and thus provides information for future AFB surveillance.
-
- Research Articles
-
- Genomic Methodologies
-
-
Cluster-specific gene markers enhance Shigella and enteroinvasive Escherichia coli in silico serotyping
More LessShigella and enteroinvasive Escherichia coli (EIEC) cause human bacillary dysentery with similar invasion mechanisms and share similar physiological, biochemical and genetic characteristics. Differentiation of Shigella from EIEC is important for clinical diagnostic and epidemiological investigations. However, phylogenetically, Shigella and EIEC strains are composed of multiple clusters and are different forms of E. coli , making it difficult to find genetic markers to discriminate between Shigella and EIEC. In this study, we identified 10 Shigella clusters, seven EIEC clusters and 53 sporadic types of EIEC by examining over 17000 publicly available Shigella and EIEC genomes. We compared Shigella and EIEC accessory genomes to identify cluster-specific gene markers for the 17 clusters and 53 sporadic types. The cluster-specific gene markers showed 99.64% accuracy and more than 97.02% specificity. In addition, we developed a freely available in silico serotyping pipeline named Shigella EIEC Cluster Enhanced Serotype Finder (ShigEiFinder) by incorporating the cluster-specific gene markers and established Shigella and EIEC serotype-specific O antigen genes and modification genes into typing. ShigEiFinder can process either paired-end Illumina sequencing reads or assembled genomes and almost perfectly differentiated Shigella from EIEC with 99.70 and 99.74% cluster assignment accuracy for the assembled genomes and read mapping respectively. ShigEiFinder was able to serotype over 59 Shigella serotypes and 22 EIEC serotypes and provided a high specificity of 99.40% for assembled genomes and 99.38% for read mapping for serotyping. The cluster-specific gene markers and our new serotyping tool, ShigEiFinder (installable package: https://github.com/LanLab/ShigEiFinder, online tool: https://mgtdb.unsw.edu.au/ShigEiFinder/), will be useful for epidemiological and diagnostic investigations.
-
-
-
Chemical biology-whole genome engineering datasets predict new antibacterial combinations
Trimethoprim and sulfamethoxazole are used commonly together as cotrimoxazole for the treatment of urinary tract and other infections. The evolution of resistance to these and other antibacterials threatens therapeutic options for clinicians. We generated and analysed a chemical-biology-whole-genome data set to predict new targets for antibacterial combinations with trimethoprim and sulfamethoxazole. For this we used a large transposon mutant library in Escherichia coli BW25113 where an outward-transcribing inducible promoter was engineered into one end of the transposon. This approach allows regulated expression of adjacent genes in addition to gene inactivation at transposon insertion sites, a methodology that has been called TraDIS-Xpress. These chemical genomic data sets identified mechanisms for both reduced and increased susceptibility to trimethoprim and sulfamethoxazole. The data identified that over-expression of FolA reduced trimethoprim susceptibility, a known mechanism for reduced susceptibility. In addition, transposon insertions into the genes tdk, deoR, ybbC, hha, ldcA, wbbK and waaS increased susceptibility to trimethoprim and likewise for rsmH, fadR, ddlB, nlpI and prc with sulfamethoxazole, while insertions in ispD, uspC, minC, minD, yebK, truD and umpG increased susceptibility to both these antibiotics. Two of these genes’ products, Tdk and IspD, are inhibited by AZT and fosmidomycin respectively, antibiotics that are known to synergise with trimethoprim. Thus, the data identified two known targets and several new target candidates for the development of co-drugs that synergise with trimethoprim, sulfamethoxazole or cotrimoxazole. We demonstrate that the TraDIS-Xpress technology can be used to generate information-rich chemical-genomic data sets that can be used for antibacterial development.
-
-
-
Accuracy of an amplicon-sequencing nanopore approach to identify variants in tuberculosis drug-resistance-associated genes
A rapid and accurate diagnostic assay represents an important means to detect Mycobacterium tuberculosis , identify drug-resistant strains and ensure treatment success. Currently employed techniques to diagnose drug-resistant tuberculosis include slow phenotypic tests or more rapid molecular assays that evaluate a limited range of drugs. Whole-genome-sequencing-based approaches can detect known drug-resistance-conferring mutations and novel variations; however, the dependence on growing samples in culture, and the associated delays in achieving results, represents a significant limitation. As an alternative, targeted sequencing strategies can be directly performed on clinical samples at high throughput. This study proposes a targeted sequencing assay to rapidly detect drug-resistant strains of M. tuberculosis using the Nanopore MinION sequencing platform. We designed a single-tube assay that targets nine genes associated with drug resistance to seven drugs and two phylogenetic-determining regions to determine strain lineage and tested it in nine clinical isolates and six sputa. The study’s main aim is to calibrate MinNION variant calling to detect drug-resistance-associated mutations with different frequencies to match the accuracy of Illumina (the current gold-standard sequencing technology) from both culture and sputum samples. After calibrating Nanopore MinION variant calling, we demonstrated 100% agreement between Illumina WGS and our MinION set up to detect known drug resistance and phylogenetic variants in our dataset. Importantly, other variants in the amplicons are also detected, decreasing the recall. We identify minority variants and insertions/deletions as crucial bioinformatics challenges to fully reproduce Illumina WGS results.
-
-
-
High quality genome assembly of the amitochondriate eukaryote Monocercomonoides exilis
More LessMonocercomonoides exilis is considered the first known eukaryote to completely lack mitochondria. This conclusion is based primarily on a genomic and transcriptomic study which failed to identify any mitochondrial hallmark proteins. However, the available genome assembly has limited contiguity and around 1.5 % of the genome sequence is represented by unknown bases. To improve the contiguity, we re-sequenced the genome and transcriptome of M. exilis using Oxford Nanopore Technology (ONT). The resulting draft genome is assembled in 101 contigs with an N50 value of 1.38 Mbp, almost 20 times higher than the previously published assembly. Using a newly generated ONT transcriptome, we further improve the gene prediction and add high quality untranslated region (UTR) annotations, in which we identify two putative polyadenylation signals present in the 3′UTR regions and characterise the Kozak sequence in the 5′UTR regions. All these improvements are reflected by higher BUSCO genome completeness values. Regardless of an overall more complete genome assembly without missing bases and a better gene prediction, we still failed to identify any mitochondrial hallmark genes, thus further supporting the hypothesis on the absence of mitochondrion.
-
- Functional Genomics and Microbe–Niche Interactions
-
-
Acidovorax pan-genome reveals specific functional traits for plant beneficial and pathogenic plant-associations
More LessBeta-proteobacteria belonging to the genus Acidovorax have been described from various environments. Many strains can interact with a range of hosts, including humans and plants, forming neutral, beneficial or detrimental associations. In the frame of this study, we investigated the genomic properties of 52 bacterial strains of the genus Acidovorax , isolated from healthy roots of Lotus japonicus, with the intent of identifying traits important for effective plant-growth promotion. Based on single-strain inoculation bioassays with L. japonicus, performed in a gnotobiotic system, we distinguished seven robust plant-growth promoting strains from strains with no significant effects on plant-growth. We showed that the genomes of the two groups differed prominently in protein families linked to sensing and transport of organic acids, production of phytohormones, as well as resistance and production of compounds with antimicrobial properties. In a second step, we compared the genomes of the tested isolates with those of plant pathogens and free-living strains of the genus Acidovorax sourced from public repositories. Our pan-genomics comparison revealed features correlated with commensal and pathogenic lifestyle. We showed that commensals and pathogens differ mostly in their ability to use plant-derived lipids and in the type of secretion-systems being present. Most free-living Acidovorax strains did not harbour any secretion-systems. Overall, our data indicate that Acidovorax strains undergo extensive adaptations to their particular lifestyle by horizontal uptake of novel genetic information and loss of unnecessary genes.
-
-
-
Machine learning prediction of novel pectinolytic enzymes in Aspergillus niger through integrating heterogeneous (post-) genomics data
More LessPectinolytic enzymes are a variety of enzymes involved in breaking down pectin, a complex and abundant plant cell-wall polysaccharide. In nature, pectinolytic enzymes play an essential role in allowing bacteria and fungi to depolymerize and utilize pectin. In addition, pectinases have been widely applied in various industries, such as the food, wine, textile, paper and pulp industries. Due to their important biological function and increasing industrial potential, discovery of novel pectinolytic enzymes has received global interest. However, traditional enzyme characterization relies heavily on biochemical experiments, which are time consuming, laborious and expensive. To accelerate identification of novel pectinolytic enzymes, an automatic approach is needed. We developed a machine learning (ML) approach for predicting pectinases in the industrial workhorse fungus, Aspergillus niger. The prediction integrated a diverse range of features, including evolutionary profile, gene expression, transcriptional regulation and biochemical characteristics. Results on both the training and the independent testing dataset showed that our method achieved over 90 % accuracy, and recalled over 60 % of pectinolytic genes. Application of the ML model on the A. niger genome led to the identification of 83 pectinases, covering both previously described pectinases and novel pectinases that do not belong to any known pectinolytic enzyme family. Our study demonstrated the tremendous potential of ML in discovery of new industrial enzymes through integrating heterogeneous (post-) genomimcs data.
-
-
-
Single-cell genomics-based analysis reveals a vital ecological role of Thiocapsa sp. LSW in the meromictic Lake Shunet, Siberia
Meromictic lakes usually harbour certain prevailing anoxygenic phototrophic bacteria in their anoxic zone, such as the purple sulfur bacterium (PSB) Thiocapsa sp. LSW (hereafter LSW) in Lake Shunet, Siberia. PSBs have been suggested to play a vital role in carbon, nitrogen and sulfur cycling at the oxic–anoxic interface of stratified lakes; however, the ecological significance of PSBs in the lake remains poorly understood. In this study, we explored the potential ecological role of LSW using a deep-sequencing analysis of single-cell genomics associated with flow cytometry. An approximately 2.7 Mb draft genome was obtained based on the co-assembly of five single-cell genomes. LSW might grow photolithoautotrophically and could play putative roles not only as a carbon fixer and diazotroph, but also as a sulfate reducer/oxidizer in the lake. This study provides insights into the potential ecological role of Thiocapsa sp. in meromictic lakes.
-
-
-
Worldwide distribution and environmental origin of the Adelaide imipenemase (AIM-1), a potent carbapenemase in Pseudomonas aeruginosa
Carbapenems are potent broad-spectrum β-lactam antibiotics reserved for the treatment of serious infections caused by multidrug-resistant bacteria such as Pseudomonas aeruginosa . The surge in P. aeruginosa resistant to carbapenems is an urgent threat, as very few treatment options remain. Resistance to carbapenems is predominantly due to the presence of carbapenemase enzymes. The assessment of 147 P . aeruginosa isolates revealed that 32 isolates were carbapenem non-wild-type. These isolates were screened for carbapenem resistance genes using PCR. One isolate from wastewater contained the Adelaide imipenemase gene (bla AIM-1) and was compared phenotypically with a highly carbapenem-resistant clinical isolate containing the bla AIM-1 gene. A further investigation of wastewater samples from various local healthcare and non-healthcare sources as well as river water, using probe-based qPCR, revealed the presence of the bla AIM-1 gene in all the samples analysed. The widespread occurrence of bla AIM-1 throughout Adelaide hinted at the possibility of more generally extensive spread of this gene than originally thought. A blast search revealed the presence of the bla AIM-1 gene in Asia, North America and Europe. To elucidate the identity of the organism(s) carrying the bla AIM-1 gene, shotgun metagenomic sequencing was conducted on three wastewater samples from different locations. Comparison of these nucleotide sequences with a whole-genome sequence of a P. aeruginosa isolate revealed that, unlike the genetic environment and arrangement in P. aeruginosa , the bla AIM-1 gene was not carried as part of any mobile genetic elements. A phylogenetic tree constructed with the deduced amino acid sequences of AIM-1 suggested that the potential origin of the bla AIM-1 gene in P. aeruginosa might be the non-pathogenic environmental organism, Pseudoxanthomonas mexicana .
-
-
-
Genome-wide analysis of fitness-factors in uropathogenic Escherichia coli during growth in laboratory media and during urinary tract infections
Uropathogenic Escherichia coli (UPEC) UTI89 is a well-characterized strain, which has mainly been used to study UPEC virulence during urinary tract infection (UTI). However, little is known on UTI89 key fitness-factors during growth in lab media and during UTI. Here, we used a transposon-insertion-sequencing approach (TraDIS) to reveal the UTI89 essential-genes for in vitro growth and fitness-gene-sets for growth in Luria broth (LB) and EZ-MOPS medium without glucose, as well as for human bacteriuria and mouse cystitis. A total of 293 essential genes for growth were identified and the set of fitness-genes was shown to differ depending on the growth media. A modified, previously validated UTI murine model, with administration of glucose prior to infection was applied. Selected fitness-genes for growth in urine and mouse-bladder colonization were validated using deletion-mutants. Novel fitness-genes, such as tusA, corA and rfaG; involved in sulphur-acquisition, magnesium-uptake, and LPS-biosynthesis, were proved to be important during UTI. Moreover, rfaG was confirmed as relevant in both niches, and therefore it may represent a target for novel UTI-treatment/prevention strategies.
-
- Microbial Communities
-
-
Unique roles of vaginal Megasphaera phylotypes in reproductive health
The composition of the human vaginal microbiome has been extensively studied and is known to influence reproductive health. However, the functional roles of individual taxa and their contributions to negative health outcomes have yet to be well characterized. Here, we examine two vaginal bacterial taxa grouped within the genus Megasphaera that have been previously associated with bacterial vaginosis (BV) and pregnancy complications. Phylogenetic analyses support the classification of these taxa as two distinct species. These two phylotypes, Megasphaera phylotype 1 (MP1) and Megasphaera phylotype 2 (MP2), differ in genomic structure and metabolic potential, suggestive of differential roles within the vaginal environment. Further, these vaginal taxa show evidence of genome reduction and changes in DNA base composition, which may be common features of host dependence and/or adaptation to the vaginal environment. In a cohort of 3870 women, we observed that MP1 has a stronger positive association with bacterial vaginosis whereas MP2 was positively associated with trichomoniasis. MP1, in contrast to MP2 and other common BV-associated organisms, was not significantly excluded in pregnancy. In a cohort of 52 pregnant women, MP1 was both present and transcriptionally active in 75.4 % of vaginal samples. Conversely, MP2 was largely absent in the pregnant cohort. This study provides insight into the evolutionary history, genomic potential and predicted functional role of two clinically relevant vaginal microbial taxa.
-
-
-
Mobility of β-lactam resistance under ampicillin treatment in gut microbiota suffering from pre-disturbance
Ingestion of food- or waterborne antibiotic-resistant bacteria may lead to dissemination of antibiotic resistance genes (ARGs) in the gut microbiota. The gut microbiota often suffers from various disturbances. It is not clear whether and how disturbed microbiota may affect ARG mobility under antibiotic treatments. For proof of concept, in the presence or absence of streptomycin pre-treatment, mice were inoculated orally with a β-lactam-susceptible Salmonella enterica serovar Heidelberg clinical isolate (recipient) and a β-lactam resistant Escherichia coli O80:H26 isolate (donor) carrying a blaCMY-2 gene on an IncI2 plasmid. Immediately following inoculation, mice were treated with or without ampicillin in drinking water for 7 days. Faeces were sampled, donor, recipient and transconjugant were enumerated, blaCMY-2 abundance was determined by quantitative PCR, faecal microbial community composition was determined by 16S rRNA amplicon sequencing and cecal samples were observed histologically for evidence of inflammation. In faeces of mice that received streptomycin pre-treatment, the donor abundance remained high, and the abundance of S. Heidelberg transconjugant and the relative abundance of Enterobacteriaceae increased significantly during the ampicillin treatment. Co-blooming of the donor, transconjugant and commensal Enterobacteriaceae in the inflamed intestine promoted significantly (P<0.05) higher and possibly wider dissemination of the blaCMY-2 gene in the gut microbiota of mice that received the combination of streptomycin pre-treatment and ampicillin treatment (Str–Amp) compared to the other mice. Following cessation of the ampicillin treatment, faecal shedding of S. Heidelberg transconjugant persisted much longer from mice in the Str–Amp group compared to the other mice. In addition, only mice in the Str–Amp group shed a commensal E. coli O2:H6 transconjugant, which carries three copies of the blaCMY-2 gene, one on the IncI2 plasmid and two on the chromosome. The findings highlight the significance of pre-existing gut microbiota for ARG dissemination and persistence during and following antibiotic treatments of infectious diseases.
-
-
-
Revealing microbial species diversity using sequence capture by hybridization
More LessTargeting small parts of the 16S rDNA phylogenetic marker by metabarcoding reveals microorganisms of interest but cannot achieve a taxonomic resolution at the species level, precluding further precise characterizations. To identify species behind operational taxonomic units (OTUs) of interest, even in the rare biosphere, we developed an innovative strategy using gene capture by hybridization. From three OTU sequences detected upon polyphenol supplementation and belonging to the rare biosphere of the human gut microbiota, we revealed 59 nearly full-length 16S rRNA genes, highlighting high bacterial diversity hidden behind OTUs while evidencing novel taxa. Inside each OTU, revealed 16S rDNA sequences could be highly distant from each other with similarities down to 85 %. We identified one new family belonging to the order Clostridiales , 39 new genera and 52 novel species. Related bacteria potentially involved in polyphenol degradation have also been identified through genome mining and our results suggest that the human gut microbiota could be much more diverse than previously thought.
-
-
-
The bacterial biome of ticks and their wildlife hosts at the urban–wildland interface
Advances in sequencing technologies have revealed the complex and diverse microbial communities present in ticks (Ixodida). As obligate blood-feeding arthropods, ticks are responsible for a number of infectious diseases that can affect humans, livestock, domestic animals and wildlife. While cases of human tick-borne diseases continue to increase in the northern hemisphere, there has been relatively little recognition of zoonotic tick-borne pathogens in Australia. Over the past 5 years, studies using high-throughput sequencing technologies have shown that Australian ticks harbour unique and diverse bacterial communities. In the present study, free-ranging wildlife (n=203), representing ten mammal species, were sampled from urban and peri-urban areas in New South Wales (NSW), Queensland (QLD) and Western Australia (WA). Bacterial metabarcoding targeting the 16S rRNA locus was used to characterize the microbiomes of three sample types collected from wildlife: blood, ticks and tissue samples. Further sequence information was obtained for selected taxa of interest. Six tick species were identified from wildlife: Amblyomma triguttatum, Ixodes antechini, Ixodes australiensis, Ixodes holocyclus, Ixodes tasmani and Ixodes trichosuri. Bacterial 16S rRNA metabarcoding was performed on 536 samples and 65 controls, generating over 100 million sequences. Alpha diversity was significantly different between the three sample types, with tissue samples displaying the highest alpha diversity (P<0.001). Proteobacteria was the most abundant taxon identified across all sample types (37.3 %). Beta diversity analysis and ordination revealed little overlap between the three sample types (P<0.001). Taxa of interest included Anaplasmataceae , Bartonella , Borrelia , Coxiellaceae , Francisella , Midichloria , Mycoplasma and Rickettsia . Anaplasmataceae bacteria were detected in 17.7% (95/536) of samples and included Anaplasma , Ehrlichia and Neoehrlichia species. In samples from NSW, ‘Ca. Neoehrlichia australis’, ‘Ca. Neoehrlichia arcana’, Neoehrlichia sp. and Ehrlichia sp. were identified. A putative novel Ehrlichia sp. was identified from WA and Anaplasma platys was identified from QLD. Nine rodent tissue samples were positive for a novel Borrelia sp. that formed a phylogenetically distinct clade separate from the Lyme Borrelia and relapsing fever groups. This novel clade included recently identified rodent-associated Borrelia genotypes, which were described from Spain and North America. Bartonella was identified in 12.9% (69/536) of samples. Over half of these positive samples were obtained from black rats (Rattus rattus), and the dominant bacterial species identified were Bartonella coopersplainsensis and Bartonella queenslandensis . The results from the present study show the value of using unbiased high-throughput sequencing applied to samples collected from wildlife. In addition to understanding the sylvatic cycle of known vector-associated pathogens, surveillance work is important to ensure preparedness for potential zoonotic spillover events.
-
-
-
Secondary metabolite biosynthetic diversity in Arctic Ocean metagenomes
Polyketide synthases (PKSs) and non-ribosomal peptide synthetases (NRPSs) are mega enzymes responsible for the biosynthesis of a large fraction of natural products (NPs). Molecular markers for biosynthetic genes, such as the ketosynthase (KS) domain of PKSs, have been used to assess the diversity and distribution of biosynthetic genes in complex microbial communities. More recently, metagenomic studies have complemented and enhanced this approach by allowing the recovery of complete biosynthetic gene clusters (BGCs) from environmental DNA. In this study, the distribution and diversity of biosynthetic genes and clusters from Arctic Ocean samples (NICE-2015 expedition), was assessed using PCR-based strategies coupled with high-throughput sequencing and metagenomic analysis. In total, 149 KS domain OTU sequences were recovered, 36 % of which could not be assigned to any known BGC. In addition, 74 bacterial metagenome-assembled genomes were recovered, from which 179 BGCs were extracted. A network analysis identified potential new NP families, including non-ribosomal peptides and polyketides. Complete or near-complete BGCs were recovered, which will enable future heterologous expression efforts to uncover the respective NPs. Our study represents the first report of biosynthetic diversity assessed for Arctic Ocean metagenomes and highlights the potential of Arctic Ocean planktonic microbiomes for the discovery of novel secondary metabolites. The strategy employed in this study will enable future bioprospection, by identifying promising samples for bacterial isolation efforts, while providing also full-length BGCs for heterologous expression.
-
- Pathogens and Epidemiology
-
-
SNPPar: identifying convergent evolution and other homoplasies from microbial whole-genome alignments
More LessHomoplasic SNPs are considered important signatures of strong (positive) selective pressure, and hence of adaptive evolution for clinically relevant traits such as antibiotic resistance and virulence. Here we present a new tool, SNPPar, for efficient detection and analysis of homoplasic SNPs from large whole genome sequencing datasets (>1000 isolates and/or >100 000 SNPs). SNPPar takes as input an SNP alignment, tree and annotated reference genome, and uses a combination of simple monophyly tests and ancestral state reconstruction (ASR, via TreeTime) to assign mutation events to branches and identify homoplasies. Mutations are annotated at the level of codon and gene, to facilitate analysis of convergent evolution. Testing on simulated data (120 Mycobacterium tuberculosis alignments representing local and global samples) showed SNPPar can detect homoplasic SNPs with very high specificity (zero false-positives in all tests) and high sensitivity (zero false-negatives in 89 % of tests). SNPPar analysis of three empirically sampled datasets ( Elizabethkingia anophelis , Burkholderia dolosa and M. tuberculosis ) produced results that were in concordance with previous studies, in terms of both individual homoplasies and evidence of convergence at the codon and gene levels. SNPPar analysis of a simulated alignment of ~64 000 genome-wide SNPs from 2000 M. tuberculosis genomes took ~23 min and ~2.6 GB of RAM to generate complete annotated results on a laptop. This analysis required ASR be conducted for only 1.25 % of SNPs, and the ASR step took ~23 s and 0.4 GB of RAM. SNPPar automates the detection and annotation of homoplasic SNPs efficiently and accurately from large SNP alignments. As demonstrated by the examples included here, this information can be readily used to explore the role of homoplasy in parallel and/or convergent evolution at the level of nucleotide, codon and/or gene.
-
-
-
Alterations in chromosomal genes nfsA, nfsB, and ribE are associated with nitrofurantoin resistance in Escherichia coli from the United Kingdom
Antimicrobial resistance in enteric or urinary Escherichia coli is a risk factor for invasive E. coli infections. Due to widespread trimethoprim resistance amongst urinary E. coli and increased bacteraemia incidence, a national recommendation to prescribe nitrofurantoin for uncomplicated urinary tract infection was made in 2014. Nitrofurantoin resistance is reported in <6% urinary E. coli isolates in the UK, however, mechanisms underpinning nitrofurantoin resistance in these isolates remain unknown. This study aimed to identify the genetic basis of nitrofurantoin resistance in urinary E. coli isolates collected from north west London and then elucidate resistance-associated genetic alterations in available UK E. coli genomes. As a result, an algorithm was developed to predict nitrofurantoin susceptibility. Deleterious mutations and gene-inactivating insertion sequences in chromosomal nitroreductase genes nfsA and/or nfsB were identified in genomes of nine confirmed nitrofurantoin-resistant urinary E. coli isolates and additional 11 E. coli isolates that were highlighted by the prediction algorithm and subsequently validated to be nitrofurantoin-resistant. Eight categories of allelic changes in nfsA, nfsB, and the associated gene ribE were detected in 12412 E. coli genomes from the UK. Evolutionary analysis of these three genes revealed homoplasic mutations and explained the previously reported order of stepwise mutations. The mobile gene complex oqxAB, which is associated with reduced nitrofurantoin susceptibility, was identified in only one of the 12412 genomes. In conclusion, mutations and insertion sequences in nfsA and nfsB were leading causes of nitrofurantoin resistance in UK E. coli . As nitrofurantoin exposure increases in human populations, the prevalence of nitrofurantoin resistance in carriage E. coli isolates and those from urinary and bloodstream infections should be monitored.
-
-
-
Phylogenomics of two ST1 antibiotic-susceptible non-clinical Acinetobacter baumannii strains reveals multiple lineages and complex evolutionary history in global clone 1
Acinetobacter baumannii is an opportunistic pathogen that is difficult to treat due to its resistance to extreme conditions, including desiccation and antibiotics. Most strains causing outbreaks around the world belong to two main global lineages, namely global clones 1 and 2 (GC1 and GC2). Here, we used a combination of Illumina short read and MinION (Oxford Nanopore) long-read sequence data with a hybrid assembly approach to complete the genome sequence of two antibiotic-sensitive GC1 strains, Ex003 and Ax270, recovered in Lebanon from water and a rectal swab of a cat, respectively. Phylogenetic analysis of Ax270 and Ex003 with 186 publicly available GC1 genomes revealed two major clades, including five main lineages (L1–L5), and four single-isolate lineages outside of the two clades. Ax270 and Ex003, along with AB307-0294 and MRSN7213 (both predicted antibiotic-susceptible isolates) represent these individual lineages. Antibiotic resistance islands and transposons interrupting the comM gene remain important features in L1–L5, with L1 associated with the AbaR-type resistance islands, L2 with AbaR4, L3 strains containing either AbaR4 or its variants as well as Tn6022::ISAba42, and L4 and L5 associated with Tn6022 or its variants. Analysis of the capsule (KL) and outer core (OCL) polysaccharide loci further revealed a complex evolutionary history probably involving many recombination events. As more genomes become available, more GC1 lineages continue to emerge. However, genome sequence data from more diverse geographical regions are needed to draw a more accurate population structure of this globally distributed clone.
-
-
-
Genome reorganization during emergence of host-associated Mycobacterium abscessus
More LessMycobacterium abscessus is a rapid growing, free-living species of bacterium that also causes lung infections in humans. Human infections are usually acquired from the environment; however, dominant circulating clones (DCCs) have emerged recently in both M. abscessus subsp. massiliense and subsp. abscessus that appear to be transmitted among humans and are now globally distributed. These recently emerged clones are potentially informative about the ecological and evolutionary mechanisms of pathogen emergence and host adaptation. The geographical distribution of DCCs has been reported, but the genomic processes underlying their transition from environmental bacterium to human pathogen are not well characterized. To address this knowledge gap, we delineated the structure of M. abscessus subspecies abscessus and massiliense using genomic data from 200 clinical isolates of M. abscessus from seven geographical regions. We identified differences in overall patterns of lateral gene transfer (LGT) and barriers to LGT between subspecies and between environmental and host-adapted bacteria. We further characterized genome reorganization that accompanied bacterial host adaptation, inferring selection pressures acting at both genic and intergenic loci. We found that both subspecies encode an expansive pangenome with many genes at rare frequencies. Recombination appears more frequent in M. abscessus subsp. massiliense than in subsp. abscessus, consistent with prior reports. We found evidence suggesting that phage are exchanged between subspecies, despite genetic barriers evident elsewhere throughout the genome. Patterns of LGT differed according to niche, with less LGT observed among host-adapted DCCs versus environmental bacteria. We also found evidence suggesting that DCCs are under distinct selection pressures at both genic and intergenic sites. Our results indicate that host adaptation of M. abscessus was accompanied by major changes in genome evolution, including shifts in the apparent frequency of LGT and impacts of selection. Differences were evident among the DCCs as well, which varied in the degree of gene content remodelling, suggesting they were placed differently along the evolutionary trajectory toward host adaptation. These results provide insight into the evolutionary forces that reshape bacterial genomes as they emerge into the pathogenic niche.
-
-
-
Comparative genomics of Chinese and international isolates of Escherichia albertii: population structure and evolution of virulence and antimicrobial resistance
Escherichia albertii is a recently recognized species in the genus Escherichia that causes diarrhoea. The population structure, genetic diversity and genomic features have not been fully examined. Here, 169 E. albertii isolates from different sources and regions in China were sequenced and combined with 312 publicly available genomes (from additional 14 countries) for genomic analyses. The E. albertii population was divided into two clades and eight lineages, with lineage 3 (L3), L5 and L8 more common in China. Clinical isolates were observed in all clades/lineages. Virulence genes were found to be distributed differently among lineages: subtypes of the intimin encoding gene eae and the cytolethal distending toxin gene cdtB were lineage associated, and the second type three secretion system (ETT2) island was truncated in L3 and L6. Seven new eae subtypes and one new cdtB subtype (cdtB-VI) were identified. Alarmingly, 85.9 % of the Chinese E. albertii isolates were predicted to be multidrug-resistant (MDR) with 35.9 % harbouring genes capable of conferring resistance to 10 to 14 different drug classes. The majority of the MDR isolates were of poultry source from China and belonged to four sequence types (STs) [ST4638, ST4479, ST4633 and ST4488]. Thirty-four plasmids with some carrying MDR and virulence genes, and 130 prophages were identified from 17 complete E. albertii genomes. The 130 intact prophages were clustered into five groups, with group five prophages harbouring more virulence genes. We further identified three E. albertii specific genes as markers for the identification of this species. Our findings provided fundamental insights into the population structure, virulence variation and drug resistance of E. albertii .
-
-
-
Cryptosporidium felis differs from other Cryptosporidium spp. in codon usage
More LessCryptosporidium spp. are important enteric pathogens in a wide range of vertebrates including humans. Previous comparative analysis revealed conservation in genome composition, gene content, and gene organization among Cryptosporidium spp., with a progressive reductive evolution in metabolic pathways and invasion-related proteins. In this study, we sequenced the genome of zoonotic pathogen Cryptosporidium felis and conducted a comparative genomic analysis. While most intestinal Cryptosporidium species have similar genomic characteristics and almost complete genome synteny, fewer protein-coding genes and some sequence inversions and translocations were found in the C. felis genome. The C. felis genome exhibits much higher GC content (39.6 %) than other Cryptosporidium species (24.3–32.9 %), especially at the third codon position (GC3) of protein-coding genes. Thus, C. felis has a different codon usage, which increases the use of less energy costly amino acids (Gly and Ala) encoded by GC-rich codons. While the tRNA usage is conserved among Cryptosporidium species, consistent with its higher GC content, C. felis uses a unique tRNA for GTG for valine instead of GTA in other Cryptosporidium species. Both mutational pressures and natural selection are associated with the evolution of the codon usage in Cryptosporidium spp., while natural selection seems to drive the codon usage in C. felis. Other unique features of the C. felis genome include the loss of the entire traditional and alternative electron transport systems and several invasion-related proteins. Thus, the preference for the use of some less energy costly amino acids in C. felis may lead to a more harmonious parasite–host interaction, and the strengthened host-adaptation is reflected by the further reductive evolution of metabolism and host invasion-related proteins.
-
-
-
The global population structure and evolutionary history of the acquisition of major virulence factor-encoding genetic elements in Shiga toxin-producing Escherichia coli O121:H19
Ruriko Nishida, Keiji Nakamura, Itsuki Taniguchi, Kazunori Murase, Tadasuke Ooka, Yoshitoshi Ogura, Yasuhiro Gotoh, Takehiko Itoh, Atsushi Toyoda, Jacques Georges Mainil, Denis Piérard, Kazuko Seto, Tetsuya Harada, Junko Isobe, Keiko Kimata, Yoshiki Etoh, Mitsuhiro Hamasaki, Hiroshi Narimatsu, Jun Yatsuyanagi, Mitsuhiro Kameyama, Yuko Matsumoto, Yuhki Nagai, Jun Kawase, Eiji Yokoyama, Kazuhiko Ishikawa, Takayuki Shiomoto, Kenichi Lee, Dongchon Kang, Koichi Akashi, Makoto Ohnishi, Sunao Iyoda and Tetsuya HayashiShiga toxin (Stx)-producing Escherichia coli (STEC) are foodborne pathogens causing serious diseases, such as haemorrhagic colitis and haemolytic uraemic syndrome. Although O157:H7 STEC strains have been the most prevalent, incidences of STEC infections by several other serotypes have recently increased. O121:H19 STEC is one of these major non-O157 STECs, but systematic whole genome sequence (WGS) analyses have not yet been conducted on this STEC. Here, we performed a global WGS analysis of 638 O121:H19 strains, including 143 sequenced in this study, and a detailed comparison of 11 complete genomes, including four obtained in this study. By serotype-wide WGS analysis, we found that O121:H19 strains were divided into four lineages, including major and second major lineages (named L1 and L3, respectively), and that the locus of enterocyte effacement (LEE) encoding a type III secretion system (T3SS) was acquired by the common ancestor of O121:H19. Analyses of 11 complete genomes belonging to L1 or L3 revealed remarkable interlineage differences in the prophage pool and prophage-encoded T3SS effector repertoire, independent acquisition of virulence plasmids by the two lineages, and high conservation in the prophage repertoire, including that for Stx2a phages in lineage L1. Further sequence determination of complete Stx2a phage genomes of 49 strains confirmed that Stx2a phages in lineage L1 are highly conserved short-tailed phages, while those in lineage L3 are long-tailed lambda-like phages with notable genomic diversity, suggesting that an Stx2a phage was acquired by the common ancestor of L1 and has been stably maintained. Consistent with these genomic features of Stx2a phages, most lineage L1 strains produced much higher levels of Stx2a than lineage L3 strains. Altogether, this study provides a global phylogenetic overview of O121:H19 STEC and shows the interlineage genomic differences and the highly conserved genomic features of the major lineage within this serotype of STEC.
-
-
-
Characterization of the mitochondrial genomes of three powdery mildew pathogens reveals remarkable variation in size and nucleotide composition
More LessPowdery mildews comprise a large group of economically important phytopathogenic fungi. However, limited information exists on their mitochondrial genomes. Here, we assembled and compared the mitochondrial genomes of the powdery mildew pathogens Blumeria graminis f. sp. tritici, Erysiphe pisi, and Golovinomyces cichoracearum. Included in the comparative analysis was also the mitochondrial genome of Erysiphe necator that was previously analysed. The mitochondrial genomes of the four Erysiphales exhibit a similar gene content and organization but a large variation in size, with sizes ranging from 109800 bp in B. graminis f. sp. tritici to 332165 bp in G. cichoracearum, which is the largest mitochondrial genome of a fungal pathogen reported to date. Further comparative analysis revealed an unusual bimodal GC distribution in the mitochondrial genomes of B. graminis f. sp. tritici and G. cichoracearum that was not previously observed in fungi. The cytochrome b (cob) genes of E. necator, E. pisi, and G. cichoracearum were also exceptionally rich in introns, which in turn harboured rare open reading frames encoding reverse transcriptases that were likely acquired horizontally. Golovinomyces cichoracearum had also the longest cob gene (45 kb) among 703 fungal cob genes analysed. Collectively, these results provide novel insights into the organization of mitochondrial genomes of powdery mildew pathogens and represent valuable resources for population genetics and evolutionary studies.
-
-
-
Genomic comparisons of Escherichia coli ST131 from Australia
Dmitriy Li, Ethan R. Wyrsch, Paarthiphan Elankumaran, Monika Dolejska, Marc S. Marenda, Glenn F. Browning, Rhys N. Bushell, Jessica McKinnon, Piklu Roy Chowdhury, Nola Hitchick, Natalie Miller, Erica Donner, Barbara Drigo, Dave Baker, Ian G. Charles, Timothy Kudinha, Veronica M. Jarocki and Steven Philip DjordjevicEscherichia coli ST131 is a globally dispersed extraintestinal pathogenic E. coli lineage contributing significantly to hospital and community acquired urinary tract and bloodstream infections. Here we describe a detailed phylogenetic analysis of the whole genome sequences of 284 Australian ST131 E. coli isolates from diverse sources, including clinical, food and companion animals, wildlife and the environment. Our phylogeny and the results of single nucleotide polymorphism (SNP) analysis show the typical ST131 clade distribution with clades A, B and C clearly displayed, but no niche associations were observed. Indeed, interspecies relatedness was a feature of this study. Thirty-five isolates (29 of human and six of wild bird origin) from clade A (32 fimH41, 2 fimH89, 1 fimH141) were observed to differ by an average of 76 SNPs. Forty-five isolates from clade C1 from four sources formed a cluster with an average of 46 SNPs. Within this cluster, human sourced isolates differed by approximately 37 SNPs from isolates sourced from canines, approximately 50 SNPs from isolates from wild birds, and approximately 52 SNPs from isolates from wastewater. Many ST131 carried resistance genes to multiple antibiotic classes and while 41 (14 %) contained the complete class one integron–integrase intI1, 128 (45 %) isolates harboured a truncated intI1 (462–1014 bp), highlighting the ongoing evolution of this element. The module intI1–dfrA17–aadA5–qacEΔ1–sul1–ORF–chrA–padR–IS1600–mphR–mrx–mphA, conferring resistance to trimethoprim, aminoglycosides, quaternary ammonium compounds, sulphonamides, chromate and macrolides, was the most common structure. Most (73 %) Australian ST131 isolates carry at least one extended spectrum β-lactamase gene, typically bla CTX-M-15 and bla CTX-M-27. Notably, dual parC-1aAB and gyrA-1AB fluoroquinolone resistant mutations, a unique feature of clade C ST131 isolates, were identified in some clade A isolates. The results of this study indicate that the the ST131 population in Australia carries diverse antimicrobial resistance genes and plasmid replicons and indicate cross-species movement of ST131 strains across diverse reservoirs.
-
-
-
Epidemiology and population structure of Haemophilus influenzae causing invasive disease
This study provides an update on invasive Haemophilus influenzae disease in Bellvitge University Hospital (2014–2019), reporting its evolution from a previous period (2008–2013) and analysing the non-typeable H. influenzae (NTHi) population structure using a clade-related classification. Clinical data, antimicrobial susceptibility and serotyping were studied and compared with those of the previous period. Population structure was assessed by multilocus sequence typing (MLST), SNP-based phylogenetic analysis and clade-related classification. The incidence of invasive H. influenzae disease remained constant between the two periods (average 2.07 cases per 100 000 population), while the 30 day mortality rate decreased (20.7–14.7 %, respectively). Immunosuppressive therapy (40 %) and malignancy (36 %) were the most frequent comorbidities. Ampicillin and fluoroquinolone resistance rates had increased between the two periods (10–17.6 % and 0–4.4 %, respectively). NTHi was the main cause of invasive disease in both periods (84.3 and 85.3 %), followed by serotype f (12.9 and 8.8 %). NTHi displayed high genetic diversity. However, two clusters of 13 (n=20) and 5 sequence types (STs) (n=10) associated with clade V included NTHi strains of the most prevalent STs (ST3 and ST103), many of which showed increased frequency over time. Moreover, ST103 and ST160 from clade V were associated with β-lactam resistance. Invasive H. influenzae disease is uncommon, but can be severe, especially in the elderly with comorbidities. NTHi remains the main cause of invasive disease, with ST103 and ST160 (clade V) responsible for increasing β-lactam resistance over time.
-
-
-
Genomic diversity of antimicrobial resistance in non-typhoidal Salmonella in Victoria, Australia
Non-typhoidal Salmonella (NTS) is the second most common cause of foodborne bacterial gastroenteritis in Australia with antimicrobial resistance (AMR) increasing in recent years. Whole-genome sequencing (WGS) provides opportunities for in silico detection of AMR determinants. The objectives of this study were two-fold: (1) establish the utility of WGS analyses for inferring phenotypic resistance in NTS, and (2) explore clinically relevant genotypic AMR profiles to third generation cephalosporins (3GC) in NTS lineages. The concordance of 2490 NTS isolates with matched WGS and phenotypic susceptibility data against 13 clinically relevant antimicrobials was explored. In silico serovar prediction and typing was performed on assembled reads and interrogated for known AMR determinants. The surrounding genomic context, plasmid determinants and co-occurring AMR patterns were further investigated for multidrug resistant serovars harbouring bla CMY-2, bla CTX-M-55 or bla CTX-M-65. Our data demonstrated a high correlation between WGS and phenotypic susceptibility testing. Phenotypic-genotypic concordance was observed between 2440/2490 (98.0 %) isolates, with overall sensitivity and specificity rates >98 % and positive and negative predictive values >97 %. The most common AMR determinants were bla TEM-1, sul2, tet(A), strA-strB and floR. Phenotypic resistance to cefotaxime and azithromycin was low and observed in 6.2 % (151/2486) and 0.9 % (16/1834) of the isolates, respectively. Several multi-drug resistant NTS lineages were resistant to 3GC due to different genetic mechanisms including bla CMY-2, bla CTX-M-55 or bla CTX-M-65. This study shows WGS can enhance existing AMR surveillance in NTS datasets routinely produced in public health laboratories to identify emerging AMR in NTS. These approaches will be critical for developing capacity to detect emerging public health threats such as resistance to 3GC.
-
-
-
Prophages encoding human immune evasion cluster genes are enriched in Staphylococcus aureus isolated from chronic rhinosinusitis patients with nasal polyps
Prophages affect bacterial fitness on multiple levels. These include bacterial infectivity, toxin secretion, virulence regulation, surface modification, immune stimulation and evasion and microbiome competition. Lysogenic conversion arms bacteria with novel accessory functions thereby increasing bacterial fitness, host adaptation and persistence, and antibiotic resistance. These properties allow the bacteria to occupy a niche long term and can contribute to chronic infections and inflammation such as chronic rhinosinusitis (CRS). In this study, we aimed to identify and characterize prophages present in Staphylococcus aureus from patients suffering from CRS in relation to CRS disease phenotype and severity. Prophage regions were identified using PHASTER. Various in silico tools like ResFinder and VF Analyzer were used to detect virulence genes and antibiotic resistance genes respectively. Progressive MAUVE and maximum likelihood were used for multiple sequence alignment and phylogenetics of prophages respectively. Disease severity of CRS patients was measured using computed tomography Lund–Mackay scores. Fifty-eight S. aureus clinical isolates (CIs) were obtained from 28 CRS patients without nasal polyp (CRSsNP) and 30 CRS patients with nasal polyp (CRSwNP). All CIs carried at least one prophage (average=3.6) and prophages contributed up to 7.7 % of the bacterial genome. Phage integrase genes were found in 55/58 (~95 %) S. aureus strains and 97/211 (~46 %) prophages. Prophages belonging to Sa3int integrase group (phiNM3, JS01, phiN315) (39/97, 40%) and Sa2int (phi2958PVL) (14/97, 14%) were the most prevalent prophages and harboured multiple virulence genes such as sak, scn, chp, lukE/D, sea. Intact prophages were more frequently identified in CRSwNP than in CRSsNP (P=0.0021). Intact prophages belonging to the Sa3int group were more frequent in CRSwNP than in CRSsNP (P=0.0008) and intact phiNM3 were exclusively found in CRSwNP patients (P=0.007). Our results expand the knowledge of prophages in S. aureus isolated from CRS patients and their possible role in disease development. These findings provide a platform for future investigations into potential tripartite associations between bacteria-prophage-human immune system, S. aureus evolution and CRS disease pathophysiology.
-
-
-
Genetic differentiation of Xylella fastidiosa following the introduction into Taiwan
The economically important plant pathogen Xylella fastidiosa has been reported in multiple regions of the globe during the last two decades, threatening a growing list of plants. Particularly, X. fastidiosa subspecies fastidiosa causes Pierce’s disease (PD) of grapevines, which is a problem in the USA, Spain, and Taiwan. In this work, we studied PD-causing subsp. fastidiosa populations and compared the genome sequences of 33 isolates found in Central Taiwan with 171 isolates from the USA and two from Spain. Phylogenetic relationships, haplotype networks, and genetic diversity analyses confirmed that subsp. fastidiosa was recently introduced into Taiwan from the Southeast USA (i.e. the PD-I lineage). Recent core-genome recombination events were detected among introduced subsp. fastidiosa isolates in Taiwan and contributed to the development of genetic diversity. The genetic diversity observed includes contributions through recombination from unknown donors, suggesting that higher genetic diversity exists in the region. Nevertheless, no recombination event was detected between X. fastidiosa subsp. fastidiosa and the endemic sister species Xylella taiwanensis , which is the causative agent of pear leaf scorch disease. In summary, this study improved our understanding of the genetic diversity of an important plant pathogenic bacterium after its invasion to a new region.
-
-
-
ECTyper: in silico Escherichia coli serotype and species prediction from raw and assembled whole-genome sequence data
Escherichia coli is a priority foodborne pathogen of public health concern and phenotypic serotyping provides critical information for surveillance and outbreak detection activities. Public health and food safety laboratories are increasingly adopting whole-genome sequencing (WGS) for characterizing pathogens, but it is imperative to maintain serotype designations in order to minimize disruptions to existing public health workflows. Multiple in silico tools have been developed for predicting serotypes from WGS data, including SRST2, SerotypeFinder and EToKi EBEis, but these tools were not designed with the specific requirements of diagnostic laboratories, which include: speciation, input data flexibility (fasta/fastq), quality control information and easily interpretable results. To address these specific requirements, we developed ECTyper (https://github.com/phac-nml/ecoli_serotyping) for performing both speciation within Escherichia and Shigella , and in silico serotype prediction. We compared the serotype prediction performance of each tool on a newly sequenced panel of 185 isolates with confirmed phenotypic serotype information. We found that all tools were highly concordant, with 92–97 % for O-antigens and 98–100 % for H-antigens, and ECTyper having the highest rate of concordance. We extended the benchmarking to a large panel of 6954 publicly available E. coli genomes to assess the performance of the tools on a more diverse dataset. On the public data, there was a considerable drop in concordance, with 75–91 % for O-antigens and 62–90 % for H-antigens, and ECTyper and SerotypeFinder being the most concordant. This study highlights that in silico predictions show high concordance with phenotypic serotyping results, but there are notable differences in tool performance. ECTyper provides highly accurate and sensitive in silico serotype predictions, in addition to speciation, and is designed to be easily incorporated into bioinformatic workflows.
-
-
-
SARS-CoV-2 genetic variations associated with COVID-19 pathogenicity
In this study, we performed genome-wide association analyses on SARS-CoV-2 genomes to identify genetic mutations associated with pre-symptomatic/asymptomatic COVID-19 cases. Various potential covariates and confounding factors of COVID-19 severity, including patient age, gender and country, as well as virus phylogenetic relatedness were adjusted for. In total, 3021 full-length genomes of SARS-CoV-2 generated from original clinical samples and whose patient status could be determined conclusively as either ‘pre-symptomatic/asymptomatic’ or ‘symptomatic’ were retrieved from the GISAID database. We found that the mutation 11 083G>T, located in the coding region of non-structural protein 6, is significantly associated with asymptomatic COVID-19. Patient age is positively correlated with symptomatic infection, while gender is not significantly correlated with the development of the disease. We also found that the effects of the mutation, patient age and gender do not vary significantly among countries, although each country appears to have varying baseline chances of COVID-19 symptom development.
-
-
-
Introduction and adaptation of an emerging pathogen to olive trees in Italy
The invasive plant pathogen Xylella fastidiosa currently threatens European flora through the loss of economically and culturally important host plants. This emerging vector-borne bacterium, native to the Americas, causes several important diseases in a wide range of plants including crops, ornamentals, and trees. Previously absent from Europe, and considered a quarantine pathogen, X. fastidiosa was first detected in Apulia, Italy in 2013 associated with a devastating disease of olive trees (Olive Quick Decline Syndrome, OQDS). OQDS has led to significant economic, environmental, cultural, as well as political crises. Although the biology of X. fastidiosa diseases have been studied for over a century, there is still no information on the determinants of specificity between bacterial genotypes and host plant species, which is particularly relevant today as X. fastidiosa is expanding in the naive European landscape. We analysed the genomes of 79 X . fastidiosa samples from diseased olive trees across the affected area in Italy as well as genomes of the most genetically closely related strains from Central America. We provided insights into the ecological and evolutionary emergence of this pathogen in Italy. We first showed that the outbreak in Apulia is due to a single introduction from Central America that we estimated to have occurred in 2008 [95 % HPD: 1930–2016]. By using a combination of population genomic approaches and evolutionary genomics methods, we further identified a short list of genes that could play a major role in the adaptation of X. fastidiosa to this new environment. We finally provided experimental evidence for the adaptation of the strain to this new environment.
-
-
-
The evolutionary history of Shigella flexneri serotype 6 in Asia
Shigella flexneri serotype 6 is an understudied cause of diarrhoeal diseases in developing countries, and has been proposed as one of the major targets for vaccine development against shigellosis. Despite being named as S. flexneri , Shigella flexneri serotype 6 is phylogenetically distinct from other S. flexneri serotypes and more closely related to S. boydii . This unique phylogenetic relationship and its low sampling frequency have hampered genomic research on this pathogen. Herein, by utilizing whole genome sequencing (WGS) and analyses of Shigella flexneri serotype 6 collected from epidemiological studies (1987–2013) in four Asian countries, we revealed its population structure and evolutionary history in the region. Phylogenetic analyses supported the delineation of Asian Shigella flexneri serotype 6 into two phylogenetic groups (PG-1 and −2). Notably, temporal phylogenetic approaches showed that extant Asian S. flexneri serotype 6 could be traced back to an inferred common ancestor arising in the 18th century. The dominant lineage PG-1 likely emerged in the 1970s, which coincided with the times to most recent common ancestors (tMRCAs) inferred from other major Southeast Asian S. flexneri serotypes. Similar to other S. flexneri serotypes in the same period in Asia, genomic analyses showed that resistance to first-generation antimicrobials was widespread, while resistance to more recent first-line antimicrobials was rare. These data also showed a number of gene inactivation and gene loss events, particularly on genes related to metabolism and synthesis of cellular appendages, emphasizing the continuing role of reductive evolution in the adaptation of the pathogen to an intracellular lifestyle. Together, our findings reveal insights into the genomic evolution of the understudied Shigella flexneri serotype 6, providing a new piece in the puzzle of Shigella epidemiology and evolution.
-
-
-
Clostridioides difficile strain-dependent and strain-independent adaptations to a microaerobic environment
More LessClostridioides difficile (formerly Clostridium difficile ) colonizes the gastrointestinal tract following disruption of the microbiota and can initiate a spectrum of clinical manifestations ranging from asymptomatic to life-threatening colitis. Following antibiotic treatment, luminal oxygen concentrations increase, exposing gut microbes to potentially toxic reactive oxygen species. Though typically regarded as a strict anaerobe, C. difficile can grow at low oxygen concentrations. How this bacterium adapts to a microaerobic environment and whether those responses to oxygen are conserved amongst strains is not entirely understood. Here, two C. difficile strains (630 and CD196) were cultured in 1.5% oxygen and the transcriptional response to long-term oxygen exposure was evaluated via RNA-sequencing. During growth in a microaerobic environment, several genes predicted to protect against oxidative stress were upregulated, including those for rubrerythrins and rubredoxins. Transcription of genes involved in metal homeostasis was also positively correlated with increased oxygen levels and these genes were amongst the most differentially transcribed. To directly compare the transcriptional landscape between C. difficile strains, a ‘consensus-genome’ was generated. On the basis of the identified conserved genes, basal transcriptional differences as well as variations in the response to oxygen were evaluated. While several responses were similar between the strains, there were significant differences in the abundance of transcripts involved in amino acid and carbohydrate metabolism. Furthermore, intracellular metal concentrations significantly varied both in an oxygen-dependent and oxygen-independent manner. Overall, these results indicate that C. difficile adapts to grow in a low oxygen environment through transcriptional changes, though the specific strategy employed varies between strains.
-
-
-
Context-aware genomic surveillance reveals hidden transmission of a carbapenemase-producing Klebsiella pneumoniae
Genomic surveillance can inform effective public health responses to pathogen outbreaks. However, integration of non-local data is rarely done. We investigate two large hospital outbreaks of a carbapenemase-carrying Klebsiella pneumoniae strain in Germany and show the value of contextual data. By screening about 10 000 genomes, over 400 000 metagenomes and two culture collections using in silico and in vitro methods, we identify a total of 415 closely related genomes reported in 28 studies. We identify the relationship between the two outbreaks through time-dated phylogeny, including their respective origin. One of the outbreaks presents extensive hidden transmission, with descendant isolates only identified in other studies. We then leverage the genome collection from this meta-analysis to identify genes under positive selection. We thereby identify an inner membrane transporter (ynjC) with a putative role in colistin resistance. Contextual data from other sources can thus enhance local genomic surveillance at multiple levels and should be integrated by default when available.
-
-
-
Genome placement of alpha-haemolysin cluster is associated with alpha-haemolysin sequence variation, adhesin and iron acquisition factor profile of Escherichia coli
Since the discovery of haemolysis, many studies focused on a deeper understanding of this phenotype in Escherichia coli and its association with other virulence genes, diseases and pathogenic attributes/functions in the host. Our virulence-associated factor profiling and genome-wide association analysis of genomes of haemolytic and nonhaemolytic E. coli unveiled high prevalence of adhesins, iron acquisition genes and toxins in haemolytic bacteria. In the case of fimbriae with high prevalence, we analysed sequence variation of FimH, EcpD and CsgA, and showed that different adhesin variants were present in the analysed groups, indicating altered adhesive capabilities of haemolytic and nonhaemolytic E. coli . Analysis of over 1000 haemolytic E. coli genomes revealed that they are pathotypically, genetically and antigenically diverse, but their adhesin and iron acquisition repertoire is associated with genome placement of hlyCABD cluster. Haemolytic E. coli with chromosome-encoded alpha-haemolysin had high frequency of P, S, Auf fimbriae and multiple iron acquisition systems such as aerobactin, yersiniabactin, salmochelin, Fec, Sit, Bfd and hemin uptake systems. Haemolytic E. coli with plasmid-encoded alpha-haemolysin had similar adhesin profile to nonpathogenic E. coli, with high prevalence of Stg, Yra, Ygi, Ycb, Ybg, Ycf, Sfm, F9 fimbriae, Paa, Lda, intimin and type 3 secretion system encoding genes. Analysis of HlyCABD sequence variation revealed presence of variants associated with genome placement and pathotype.
-
- Short Communications
-
- Pathogens and Epidemiology
-
-
Whole genome sequencing reveals large deletions and other loss of function mutations in Mycobacterium tuberculosis drug resistance genes
More LessDrug resistance in Mycobacterium tuberculosis , the causative agent of tuberculosis disease, arises from genetic mutations in genes coding for drug-targets or drug-converting enzymes. SNPs linked to drug resistance have been extensively studied and form the basis of molecular diagnostics and sequencing-based resistance profiling. However, alternative forms of functional variation such as large deletions and other loss of function (LOF) mutations have received much less attention, but if incorporated into diagnostics they are likely to improve their predictive performance. Our work aimed to characterize the contribution of LOF mutations found in 42 established drug resistance genes linked to 19 anti-tuberculous drugs across 32689 sequenced clinical isolates. The analysed LOF mutations included large deletions (n=586), frameshifts (n=4764) and premature stop codons (n=826). We found LOF mutations in genes strongly linked to pyrazinamide (pncA), isoniazid (katG), capreomycin (tlyA), streptomycin (e.g. gid) and ethionamide (ethA, mshA) (P<10−5), but also in some loci linked to drugs where relatively less phenotypic data is available [e.g. cycloserine, delaminid, bedaquiline, para-aminosalicylic acid (PAS), and clofazimine]. This study reports that large deletions (median size 1115 bp) account for a significant portion of resistance variants found for PAS (+7.1% of phenotypic resistance percentage explained), pyrazinamide (+3.5%) and streptomycin (+2.6%) drugs, and can be used to improve the prediction of cryptic resistance. Overall, our work highlights the importance of including LOF mutations (e.g. large deletions) in predicting genotypic drug resistance, thereby informing tuberculosis infection control and clinical decision-making.
-
-
-
Genomic insights into the circulation of pandemic fluoroquinolone-resistant extra-intestinal pathogenic Escherichia coli ST1193 in Vietnam
Extra-intestinal pathogenic Escherichia coli (ExPEC) ST1193, a globally emergent fluoroquinolone-resistant clone, has become an important cause of bloodstream infections (BSIs) associated with significant morbidity and mortality. Previous studies have reported the emergence of fluoroquinolone-resistant ExPEC ST1193 in Vietnam; however, limited data exist regarding the genetic structure, antimicrobial resistance (AMR) determinants and transmission dynamics of this pandemic clone. Here, we performed genomic and phylogenetic analyses of 46 ST1193 isolates obtained from BSIs and healthy individuals in Ho Chi Minh City, Vietnam, to investigate the pathogen population structure, molecular mechanisms of AMR and potential transmission patterns. We further examined the phylogenetic structure of ST1193 isolates in a global context. We found that the endemic E. coli ST1193 population was heterogeneous and highly dynamic, largely driven by multiple strain importations. Several well-supported phylogenetic clusters (C1–C6) were identified and associated with distinct bla CTX-M variants, including bla CTXM-27 (C1–C3, C5), bla CTXM-55 (C4) and bla CTXM-15 (C6). Most ST1193 isolates were multidrug-resistant and carried an extensive array of AMR genes. ST1193 isolates also exhibited the ability to acquire further resistance while circulating in Vietnam. There were phylogenetic links between ST1193 isolates from BSIs and healthy individuals, suggesting these organisms may both establish long-term colonization in the human intestinal tract and induce infections. Our study uncovers factors shaping the population structure and transmission dynamics of multidrug-resistant ST1193 in Vietnam, and highlights the urgent need for local One Health genomic surveillance to capture new emerging ExPEC clones and to better understand the origins and transmission patterns of these pathogens.
-
- Methods
-
- Genomic Methodologies
-
-
GBS-SBG - GBS Serotyping by Genome Sequencing
More LessGroup B Streptococcus (GBS; Streptococcus agalactiae ) is the most common cause of neonatal meningitis and a rising cause of sepsis in adults. Recently, it has also been shown to cause foodborne disease. As with many other bacteria, the polysaccharide capsule of GBS is antigenic, enabling its use for strain serotyping. Recent advances in DNA sequencing have made sequence-based typing attractive (as has been implemented for several other bacteria, including Escherichia coli , Klebsiella pneumoniae species complex, Streptococcus pyogenes , and others). For GBS, existing WGS-based serotyping systems do not provide complete coverage of all known GBS serotypes (specifically including subtypes of serotype III), and none are simultaneously compatible with the two most common data types, raw short reads and assembled sequences. Here, we create a serotyping database (GBS-SBG, GBS Serotyping by Genome Sequencing), with associated scripts and running instructions, that can be used to call all currently described GBS serotypes, including subtypes of serotype III, using both direct short-read- and assembly-based typing. We achieved higher concordance using GBS-SBG on a previously reported data set of 790 strains. We further validated GBS-SBG on a new set of 572 strains, achieving 99.8% concordance with PCR-based molecular serotyping using either short-read- or assembly-based typing. The GBS-SBG package is publicly available and will hopefully accelerate and simplify serotyping by sequencing for GBS.
-