- Volume 9, Issue 5, 2023
Volume 9, Issue 5, 2023
- Reviews
-
- Genomic Methodologies
-
-
Challenges in prokaryote pangenomics
More LessHorizontal gene transfer (HGT) and the resulting patterns of gene gain and loss are a fundamental part of bacterial evolution. Investigating these patterns can help us to understand the role of selection in the evolution of bacterial pangenomes and how bacteria adapt to a new niche. Predicting the presence or absence of genes can be a highly error-prone process that can confound efforts to understand the dynamics of horizontal gene transfer. This review discusses both the challenges in accurately constructing a pangenome and the potential consequences errors can have on downstream analyses. We hope that by summarizing these issues researchers will be able to avoid potential pitfalls, leading to improved bacterial pangenome analyses.
-
- Pathogens and Epidemiology
-
-
The Swiss Pathogen Surveillance Platform – towards a nation-wide One Health data exchange platform for bacterial, viral and fungal genomics and associated metadata
Aitana Neves, Daniel Walther, Trinidad Martin-Campos, Valerie Barbie, Claire Bertelli, Dominique Blanc, Gérard Bouchet, Frédéric Erard, Gilbert Greub, Hans H. Hirsch, Michael Huber, Laurent Kaiser, Stephen L. Leib, Karoline Leuzinger, Vladimir Lazarevic, Mirjam Mäusezahl, Jorge Molina, Richard A. Neher, Vincent Perreten, Alban Ramette, Tim Roloff, Jacques Schrenzel, Helena M. B. Seth-Smith, Roger Stephan, Dillenn Terumalai, Fanny Wegner and Adrian EgliThe Swiss Pathogen Surveillance Platform (SPSP) is a shared secure surveillance platform between human and veterinary medicine, to also include environmental and foodborne isolates. It enables rapid and detailed transmission monitoring and outbreak surveillance of pathogens using whole genome sequencing data and associated metadata. It features controlled data access, complex dynamic queries, dedicated dashboards and automated data sharing with international repositories, providing actionable results for public health and the vision to improve societal well-being and health.
-
- Bioresources
-
- Genomic Methodologies
-
-
NCTC3000: a century of bacterial strain collecting leads to a rich genomic data resource
Jo Dicks, Mohammed-Abbas Fazal, Karen Oliver, Nicholas E. Grayson, Jake D. Turnbull, Evangeline Bane, Edward Burnett, Ana Deheer-Graham, Nancy Holroyd, Dorota Kaushal, Jacqueline Keane, Gemma Langridge, Jane Lomax, Hannah McGregor, Steve Picton, Michael Quail, Deepak Singh, Alan Tracey, Jonas Korlach, Julie E. Russell, Sarah Alexander and Julian ParkhillThe National Collection of Type Cultures (NCTC) was founded on 1 January 1920 in order to fulfil a recognized need for a centralized repository for bacterial and fungal strains within the UK. It is among the longest-established collections of its kind anywhere in the world and today holds approximately 6000 type and reference bacterial strains – many of medical, scientific and veterinary importance – available to academic, health, food and veterinary institutions worldwide. Recently, a collaboration between NCTC, Pacific Biosciences and the Wellcome Sanger Institute established the NCTC3000 project to long-read sequence and assemble the genomes of up to 3000 NCTC strains. Here, at the beginning of the collection’s second century, we introduce the resulting NCTC3000 sequence read datasets, genome assemblies and annotations as a unique, historically and scientifically relevant resource for the benefit of the international bacterial research community.
-
- Research Articles
-
- Genomic Methodologies
-
-
Assessment of ferroptosis-associated gene signatures as potential biomarkers for differentiating latent from active tuberculosis in children
More LessFerroptotic cell death is a regulated process that is governed by iron-dependent membrane lipid peroxide accumulation that plays a pathogenic role in several disease-related settings. The use of ferroptosis-related genes (FRGs) to distinguish active tuberculosis (ATB) from latent tuberculosis infection (LTBI) among children, however, remains to be analysed. Tuberculosis-related gene expression data and FRG lists were obtained, respectively, from Gene Expression Omnibus (GEO) and FerrDb. Differentially expressed FRGs (DE-FRGs) detected when comparing samples from paediatric ATB and LTBI patients were explored using appropriate bioinformatics techniques, after which enrichment analyses were performed for these genes and hub genes were identified, with these genes then being used to explore potential drug interactions and construct competing endogenous RNA (ceRNA) networks. The GSE39939 dataset yielded 124 DE-FRGs that were primarily related to responses to oxidative, chemical and extracellular stimulus-associated stress. In total, the LASSO and SVM-RFE algorithms enabled the identification of nine hub genes (MAPK14, EGLN2, IDO1, USP11, SCD, CBS, PARP8, PARP16, CDC25A) that exhibited good diagnostic utility. Functional enrichment analyses of these genes suggested that they may govern ATB transition from LTBI through the control of many pathways, including the immune response, DNA repair, transcription, RNA degradation, and glycan and energy metabolism pathways. The CIBERSORT algorithm suggested that these genes were positively correlated with inflammatory and myeloid cell activity while being negatively correlated with the activity of lymphocytes. A total of 50 candidate drugs targeting 6 hub DE-FRGs were also identified, and a ceRNA network was used to explore the complex interplay among these hub genes. The nine hub FRGs defined in this study may serve as valuable biomarkers differentiating between ATB and LTBI in young patients.
-
-
-
Development of Kaptive databases for Vibrio parahaemolyticus O- and K-antigen genotyping
More LessVibrio parahaemolyticus is an important food-borne human pathogen and presents immunogenic surface polysaccharides, which can be used to distinguish problematic and disease-causing lineages. V. parahaemolyticus is divided in 16 O-serotypes (O-antigen) and 71 K-serotypes (K-antigen). Agglutination tests are still the gold standard for serotyping, but many V. parahaemolyticus isolates are not typable by agglutination. An alternative for agglutination tests is genotyping using whole-genome sequencing data, by which K- and O- genotypes have been curated and identified previously for other clinically relevant organisms with the software tool Kaptive. In this study, V. parahaemolyticus isolates were serotyped and sequenced, and all known and several novel O- and K-loci were identified. We developed Kaptive databases for all O- and K-loci after manual curation of the loci. In our study, we could genotype the O- and K-loci of 98 and 93 % of the genomes, respectively, with a Kaptive confidence score higher than ‘none’. The newly developed Kaptive databases with the identified V. parahaemolyticus O- and K-loci can be used to identify the O- and K-genotypes of V. parahaemolyticus isolates from genome sequences.
-
- Functional Genomics and Microbe–Niche Interactions
-
-
Comparative genome analysis of Sesbania cannabina-nodulating Rhizobium spp. revealing the symbiotic and transferrable characteristics of symbiosis plasmids
More LessSymbiotic nitrogen fixation between legumes and rhizobia makes a great contribution to the terrestrial ecosystem. The successful symbiosis between the partners mainly depends on the nod and nif genes in rhizobia, while the specific symbiosis is mainly determined by the structure of Nod factors and the corresponding secretion systems (type III secretion system; T3SS), etc. These symbiosis genes are usually located on symbiotic plasmids or a chromosomal symbiotic island, both could be transferred interspecies. In our previous studies, Sesbania cannabina-nodulating rhizobia across the world were classified into 16 species of four genera and all the strains, especially those of Rhizobium spp., harboured extraordinarily highly conserved symbiosis genes, suggesting that horizontal transfer of symbiosis genes might have happened among them. In order to learn the genomic basis of diversification of rhizobia under the selection of host specificity, we performed this study to compare the complete genome sequences of four Rhizobium strains associated with S. cannabina, YTUBH007, YTUZZ027, YTUHZ044 and YTUHZ045. Their complete genomes were sequenced and assembled at the replicon level. Each strain represents a different species according to the average nucleotide identity (ANI) values calculated using the whole-genome sequences; furthermore, except for YTUBH007, which was classified as Rhizobium binae , the remaining three strains were identified as new candidate species. A single symbiotic plasmid sized 345–402 kb containing complete nod, nif, fix, T3SS and conjugal transfer genes was detected in each strain. The high ANI and amino acid identity (AAI) values, as well as the close phylogenetic relationships among the entire symbiotic plasmid sequences, indicate that they have the same origin and the entire plasmid has been transferred among different Rhizobium species. These results indicate that S. cannabina stringently selects a certain symbiosis gene background of the rhizobia for nodulation, which might have forced the symbiosis genes to transfer from some introduced rhizobia to the related native or local-condition-adapted bacteria. The existence of almost complete conjugal transfer related elements, but not the gene virD, indicated that the self-transfer of the symbiotic plasmid in these rhizobial strains may be realized via a virD-independent pathway or through another unidentified gene. This study provides insight for the better understanding of high-frequency symbiotic plasmid transfer, host-specific nodulation and the host shift for rhizobia.
-
-
-
Expanded roles of lactate-sensing LldR in transcription regulation of the Escherichia coli K-12 genome: lactate utilisation and acid resistance
More LessLldR is a lactate-responsive transcription factor (TF) that transcriptionally regulates the lldPRD operon consisting of lactate permease and lactate dehydrogenase. The lldPRD operon facilitates the utilisation of lactic acid in bacteria. However, the role of LldR in whole genomic transcriptional regulation, and the mechanism involved in adaptation to lactate remains unclear. We used genomic SELEX (gSELEX) to comprehensively analyse the genomic regulatory network of LldR to understand the overall regulatory mechanism of lactic acid adaptation of the model intestinal bacterium Escherichia coli . In addition to the involvement of the lldPRD operon in utilising lactate as a carbon source, genes related to glutamate-dependent acid resistance and altering the composition of membrane lipids were identified as novel targets of LldR. A series of in vitro and in vivo regulatory analyses led to the identification of LldR as an activator of these genes. Furthermore, the results of lactic acid tolerance tests and co-culture experiments with lactic acid bacteria suggested that LldR plays a significant role in adapting to the acid stress induced by lactic acid. Therefore, we propose that LldR is an l-/d-lactate sensing TF for utilising lactate as a carbon source and for resistance to lactate-induced acid stress in intestinal bacteria.
-
-
-
Genome-wide fitness analysis identifies genes required for in vitro growth and macrophage infection by African and global epidemic pathovariants of Salmonella enterica Enteritidis
Salmonella enterica Enteritidis is the second most common serovar associated with invasive non-typhoidal Salmonella (iNTS) disease in sub-Saharan Africa. Previously, genomic and phylogenetic characterization of S . enterica Enteritidis isolates from the human bloodstream led to the discovery of the Central/Eastern African clade (CEAC) and West African clade, which were distinct from the gastroenteritis-associated global epidemic clade (GEC). The African S . enterica Enteritidis clades have unique genetic signatures that include genomic degradation, novel prophage repertoires and multi-drug resistance, but the molecular basis for the enhanced propensity of African S . enterica Enteritidis to cause bloodstream infection is poorly understood. We used transposon insertion sequencing (TIS) to identify the genetic determinants of the GEC representative strain P125109 and the CEAC representative strain D7795 for growth in three in vitro conditions (LB or minimal NonSPI2 and InSPI2 growth media), and for survival and replication in RAW 264.7 murine macrophages. We identified 207 in vitro-required genes that were common to both S . enterica Enteritidis strains and also required by S . enterica Typhimurium, S . enterica Typhi and Escherichia coli , and 63 genes that were only required by individual S . enterica Enteritidis strains. Similar types of genes were required by both P125109 and D7795 for optimal growth in particular media. Screening the transposon libraries during macrophage infection identified 177 P125109 and 201 D7795 genes that contribute to bacterial survival and replication in mammalian cells. The majority of these genes have proven roles in Salmonella virulence. Our analysis uncovered candidate strain-specific macrophage fitness genes that could encode novel Salmonella virulence factors.
-
- Pathogens and Epidemiology
-
-
A panel of diverse Klebsiella pneumoniae clinical isolates for research and development
Klebsiella pneumoniae are a leading cause of healthcare-associated infections worldwide. In particular, strains expressing extended-spectrum β-lactamases (ESBLs) and carbapenemases pose serious treatment challenges, leading the World Health Organization (WHO) to designate ESBL and carbapenem-resistant Enterobacteriaceae as ‘critical’ threats to human health. Research efforts to combat these pathogens can be supported by accessibility to diverse and clinically relevant isolates for testing novel therapeutics. Here, we describe a panel of 100 diverse K. pneumoniae isolates that are publicly available to assist the research community in this endeavour. Whole-genome sequencing (WGS) was performed on 3878 K . pneumoniae clinical isolates housed at the Multidrug-Resistant Organism Repository and Surveillance Network. The isolates were cultured from 63 facilities in 19 countries between 2001 and 2020. Core-genome multilocus sequence typing and high-resolution single-nucleotide polymorphism-based phylogenetic analyses captured the genetic diversity of the collection and were used to select the final panel of 100 isolates. In addition to known multidrug-resistant (MDR) pandemic lineages, the final panel includes hypervirulent lineages and isolates with specific and diverse resistance genes and virulence biomarkers. A broad range of antibiotic susceptibilities, ranging from pan-sensitive to extensively drug-resistant isolates, are described. The panel collection, and all associated metadata and genome sequences, are available at no additional cost and will be an important resource for the research community and for the design and development of novel antimicrobial agents and diagnostics against this important pathogen.
-
-
-
High-resolution characterization of short-term temporal variability in the taxonomic and resistome composition of wastewater influent
Wastewater-based epidemiology (WBE) for population-level surveillance of antimicrobial resistance (AMR) is gaining significant traction, but the impact of wastewater sampling methods on results is unclear. In this study, we characterized taxonomic and resistome differences between single-timepoint-grab and 24 h composites of wastewater influent from a large UK-based wastewater treatment work [WWTW (population equivalent: 223 435)]. We autosampled hourly influent grab samples (n=72) over three consecutive weekdays, and prepared additional 24 h composites (n=3) from respective grabs. For taxonomic profiling, metagenomic DNA was extracted from all samples and 16S rRNA gene sequencing was performed. One composite and six grabs from day 1 underwent metagenomic sequencing for metagenomic dissimilarity estimation and resistome profiling. Taxonomic abundances of phyla varied significantly across hourly grab samples but followed a repeating diurnal pattern for all 3 days. Hierarchical clustering grouped grab samples into four time periods dissimilar in both 16S rRNA gene-based profiles and metagenomic distances. 24H-composites resembled mean daily phyla abundances and showed low variability of taxonomic profiles. Of the 122 AMR gene families (AGFs) identified across all day 1 samples, single grab samples identified a median of six (IQR: 5–8) AGFs not seen in the composite. However, 36/36 of these hits were at lateral coverage <0.5 (median: 0.19; interquartile range: 0.16–0.22) and potential false positives. Conversely, the 24H-composite identified three AGFs not seen in any grab with higher lateral coverage (0.82; 0.55–0.84). Additionally, several clinically significant human AGFs (bla VIM, bla IMP, bla KPC) were intermittently or completely missed by grab sampling but captured by the 24 h composite. Wastewater influent undergoes significant taxonomic and resistome changes on short timescales potentially affecting interpretation of results based on sampling strategy. Grab samples are more convenient and potentially capture low-prevalence/transient targets but are less comprehensive and temporally variable. Therefore, we recommend 24H-composite sampling where feasible. Further validation and optimization of WBE methods is vital for its development into a robust AMR surveillance approach.
-
-
-
The Gonococcal Genetic Island defines distinct sub-populations of Neisseria gonorrhoeae
The incidence of gonorrhoea is increasing at an alarming pace, and therapeutic options continue to narrow as a result of worsening drug resistance. Neisseria gonorrhoeae is naturally competent, allowing the organism to adapt rapidly to selection pressures including antibiotics. A sub-population of N. gonorrhoeae carries the Gonococcal Genetic Island (GGI), which encodes a type IV secretion system (T4SS) that secretes chromosomal DNA. Previous research has shown that the GGI increases transformation efficiency in vitro, but the extent to which it contributes to horizontal gene transfer (HGT) during infection is unknown. Here we analysed genomic data from clinical isolates of N. gonorrhoeae to better characterize GGI+ and GGI− sub-populations and to delineate patterns of variation at the locus itself. We found the element segregating at an intermediate frequency (61%), and it appears to act as a mobile genetic element with examples of gain, loss, exchange and intra-locus recombination within our sample. We further found evidence suggesting that GGI+ and GGI− sub-populations preferentially inhabit distinct niches with different opportunities for HGT. Previously, GGI+ isolates were reported to be associated with more severe clinical infections, and our results suggest this could be related to metal-ion trafficking and biofilm formation. The co-segregation of GGI+ and GGI− isolates despite mobility of the element suggests that both niches inhabited by N. gonorrhoeae remain important to its overall persistence as has been demonstrated previously for cervical- and urethral-adapted sub-populations. These data emphasize the complex population structure of N. gonorrhoeae and its capacity to adapt to diverse niches.
-
-
-
The highly diverse plasmid population found in Escherichia coli colonizing travellers to Laos and its role in antimicrobial resistance gene carriage
Increased colonization by antimicrobial-resistant organisms is closely associated with international travel. This study investigated the diversity of mobile genetic elements involved with antimicrobial resistance (AMR) gene carriage in extended-spectrum beta-lactamase (ESBL)-producing Escherichia coli that colonized travellers to Laos. Long-read sequencing was used to reconstruct complete plasmid sequences from 48 isolates obtained from the daily stool samples of 23 travellers over a 3 week period. This method revealed a collection of 105 distinct plasmids, 38.1 % (n=40) of which carried AMR genes. The plasmids in this population were diverse, mostly unreported and included 38 replicon types, with F-type plasmids (n=23) the most prevalent amongst those carrying AMR genes. Fine-scale analysis of all plasmids identified numerous AMR gene contexts and emphasized the importance of IS elements, specifically members of the IS6/IS26 family, in the evolution of complex multidrug resistance regions. We found a concerning convergence of ESBL and colistin resistance determinants, with three plasmids from two different F-type lineages carrying bla CTX-M and mcr genes. The extensive diversity seen here highlights the worrying probability that stable new vehicles for AMR will evolve in E. coli populations that can disseminate internationally through travel networks.
-
-
-
A global survey of Salmonella plasmids and their associations with antimicrobial resistance
More LessPlasmids are the primary vector for horizontal transfer of antimicrobial resistance (AMR) within bacterial populations. We applied the MOB-suite, a toolset for reconstructing and typing plasmids, to 150 767 publicly available Salmonella whole-genome sequencing samples covering 1204 distinct serovars to produce a large-scale population survey of plasmids based on the MOB-suite plasmid nomenclature. Reconstruction yielded 183 017 plasmids representing 1044 primary MOB-clusters and 830 potentially novel MOB-clusters. Replicon and relaxase typing were able to type 83.4 and 58 % of plasmids, respectively, compared to 99.9 % for MOB-clusters. Within this work, we developed an approach to characterize the horizonal transfer of MOB-clusters and AMR genes across different serotypes, as well as the diversity of MOB-cluster associations with AMR genes. Aggregating conjugative mobility predictions provided by the MOB-suite and their corresponding serovar entropy demonstrated that non-mobilizable plasmids were associated with fewer serotypes compared to mobilizable or conjugative MOB-clusters. The host-range predictions for MOB-clusters also showed differences between the mobility classes, with mobilizable MOB-clusters accounting for 88.3 % of the multi-phyla (broad-host-range) predictions compared to 3 and 8.6 % for conjugative and non-mobilizable, respectively. A total of 296 (22 %) of identified MOB-clusters were associated with at least one resistance gene, indicating that the majority of Salmonella plasmids are not involved in AMR dissemination. Shannon entropy analysis of horizontal transfer of AMR genes across serovars and MOB-clusters demonstrated that horizonal transfer of genes is higher between serovars compared to transfer between different MOB-clusters. In addition to the population structure characterization based on primary MOB-clusters, we characterized a multi-plasmid outbreak responsible for the global dissemination of bla CMY-2 across different serotypes using higher resolution MOB-suite secondary cluster codes. The plasmid characterization approach developed here can be applied to different organisms to identify plasmids and genes which pose high risks for horizontal transfer.
-
-
-
Within–patient and global evolutionary dynamics of Klebsiella pneumoniae ST17
Klebsiella pneumoniae sequence type (ST) 17 is a global problem clone that causes multidrug-resistant (MDR) hospital infections worldwide. In 2008–2009, an outbreak of MDR ST17 occurred at a neonatal intensive care unit (NICU) in Stavanger, Norway. Fifty-seven children were colonized. We observed intestinal persistence of ST17 in all of the children for up to two years after hospital discharge. Here, we investigated the within–host evolution of ST17 in 45 of those children during long-term colonization and compared the outbreak with 254 global strains. Ninety-two outbreak-related isolates were whole-genome sequenced. They had capsule locus KL25, O locus O5 and carried yersiniabactin. During within–host colonization ST17 remained stable with few single nucleotide polymorphisms, no acquisition of antimicrobial resistance (AMR) or virulence determinants, and persistent carriage of a bla CTX-M-15-encoding IncFII(K) IncFIB(K) plasmid (pKp2177_1). The global collection included ST17 from 1993 to 2020 from 34 countries, that were from human infection (41.3%), colonization (39.3%) and respiratory specimens (7.3%), from animals (9.3%), and from the environment (2.7%). We estimate that ST17 emerged mid-to-late 19th century (1859, 95 % HPD 1763–1939) and diversified through recombinations of the K and O loci to form several sublineages, with various AMR genes, virulence loci and plasmids. There was limited evidence of persistence of AMR genes in any of these lineages. A globally disseminated sublineage with KL25/O5 accounted for 52.7 % of the genomes. It included a monophyletic subclade that emerged in the mid-1980s, which comprised the Stavanger NICU outbreak and 10 genomes from three other countries, which all carried pKp2177_1. The plasmid was also observed in a KL155/OL101 subclade from the 2000s. Three clonal expansions of ST17 were identified; all were healthcare-associated and carried either yersiniabactin and/or pKp2177_1. To conclude, ST17 is globally disseminated and associated with opportunistic hospital-acquired infections. It contributes to the burden of global MDR infections, but many diverse lineages persist without acquired AMR. We hypothesize that non-human sources and human colonization may play a crucial role for severe infections in vulnerable patients, such as preterm neonates.
-
-
-
Genomic analysis of 1710 surveillance-based Neisseria gonorrhoeae isolates from the USA in 2019 identifies predominant strain types and chromosomal antimicrobial-resistance determinants
This study characterized high-quality whole-genome sequences of a sentinel, surveillance-based collection of 1710 Neisseria gonorrhoeae (GC) isolates from 2019 collected in the USA as part of the Gonococcal Isolate Surveillance Project (GISP). It aims to provide a detailed report of strain diversity, phylogenetic relationships and resistance determinant profiles associated with reduced susceptibilities to antibiotics of concern. The 1710 isolates represented 164 multilocus sequence types and 21 predominant phylogenetic clades. Common genomic determinants defined most strains’ phenotypic, reduced susceptibility to current and historic antibiotics (e.g. bla TEM plasmid for penicillin, tetM plasmid for tetracycline, gyrA for ciprofloxacin, 23S rRNA and/or mosaic mtr operon for azithromycin, and mosaic penA for cefixime and ceftriaxone). The most predominant phylogenetic clade accounted for 21 % of the isolates, included a majority of the isolates with low-level elevated MICs to azithromycin (2.0 µg ml−1), carried a mosaic mtr operon and variants in PorB, and showed expansion with respect to data previously reported from 2018. The second largest clade predominantly carried the GyrA S91F variant, was largely ciprofloxacin resistant (MIC ≥1.0 µg ml−1), and showed significant expansion with respect to 2018. Overall, a low proportion of isolates had medium- to high-level elevated MIC to azithromycin ((≥4.0 µg ml−1), based on C2611T or A2059G 23S rRNA variants). One isolate carried the penA 60.001 allele resulting in elevated MICs to cefixime and ceftriaxone of 1.0 µg ml−1. This high-resolution snapshot of genetic profiles of 1710 GC sequences, through a comparison with 2018 data (1479 GC sequences) within the sentinel system, highlights change in proportions and expansion of select GC strains and the associated genetic mechanisms of resistance. The knowledge gained through molecular surveillance may support rapid identification of outbreaks of concern. Continued monitoring may inform public health responses to limit the development and spread of antibiotic-resistant gonorrhoea.
-
-
-
Genomic variation during culture adaptation of genetically complex Plasmodium falciparum clinical isolates
Experimental studies on the biology of malaria parasites have mostly been based on laboratory-adapted lines, but there is limited understanding of how these may differ from parasites in natural infections. Loss-of-function mutants have previously been shown to emerge during culture of some Plasmodium falciparum clinical isolates in analyses focusing on single-genotype infections. The present study included a broader array of isolates, mostly representing multiple-genotype infections, which are more typical in areas where malaria is highly endemic. Genome sequence data from multiple time points over several months of culture adaptation of 28 West African isolates were analysed, including previously available sequences along with new genome sequences from additional isolates and time points. Some genetically complex isolates eventually became fixed over time to single surviving genotypes in culture, whereas others retained diversity, although proportions of genotypes varied over time. Drug resistance allele frequencies did not show overall directional changes, suggesting that resistance-associated costs are not the main causes of fitness differences among parasites in culture. Loss-of-function mutants emerged during culture in several of the multiple-genotype isolates, affecting genes (including AP2-HS, EPAC and SRPK1) for which loss-of-function mutants were previously seen to emerge in single-genotype isolates. Parasite clones were derived by limiting dilution from six of the isolates, and sequencing identified de novo variants not detected in the bulk isolate sequences. Interestingly, several of these were nonsense mutants and frameshifts disrupting the coding sequence of EPAC, the gene with the largest number of independent nonsense mutants previously identified in laboratory-adapted lines. Analysis of genomic identity by descent to explore relatedness among clones revealed co-occurring non-identical sibling parasites, illustrative of the natural genetic structure within endemic populations.
-
-
-
The evolutionary tale of eight novel plasmids in a colistin-resistant environmental Acinetobacter baumannii isolate
More LessAcinetobacter baumannii is an important opportunistic pathogen known for its high levels of resistance to many antibiotics, particularly those considered last resorts such as colistin and carbapenems. Plasmids of this organism are increasingly associated with the spread of clinically important antibiotic resistance genes. Although A. baumannii is a ubiquitous organism, to date, most of the focus has been on studying strains recovered from clinical samples ignoring those isolated in the environment (soil, water, food, etc.). Here, we analysed the genetic structures of eight novel plasmids carried by an environmental colistin-resistant A. baumannii (strain E-072658) recovered in a recycled fibre pulp in a paper mill in Finland. It was shown that E-072658 carries a new variant of the mcr-4 colistin resistance gene (mcr-4.7) in a novel Tn3-family transposon (called Tn6926) carried by a novel plasmid p8E072658. E-072658 is also resistant to sulphonamide compounds; consistent with this, the sul2 sulphonamide resistance gene was found in a pdif module. E-072658 also carries six additional plasmids with no antibiotic resistance genes, but they contained several pdif modules shared with plasmids carried by clinical strains. Detailed analysis of the genetic structure of all eight plasmids carried by E-072658 showed a complex evolutionary history revealing genetic exchange events within the genus Acinetobacter beyond the clinical or environmental origin of the strains. This work provides evidence that environmental strains might act as a source for some of the clinically significant antibiotic resistance genes.
-
-
-
Lack of methoxy-mycolates characterizes the geographically restricted lineage 7 of Mycobacterium tuberculosis complex
Lineage 7 (L7) emerged in the phylogeny of the Mycobacterium tuberculosis complex (MTBC) subsequent to the branching of ‘ancient’ lineage 1 and prior to the Eurasian dispersal of ‘modern’ lineages 2, 3 and 4. In contrast to the major MTBC lineages, the current epidemiology suggests that prevalence of L7 is highly confined to the Ethiopian population, or when identified outside of Ethiopia, it has mainly been in patients of Ethiopian origin. To search for microbiological factors that may contribute to its restricted distribution, we compared the genome of L7 to the genomes of globally dispersed MTBC lineages. The frequency of predicted functional mutations in L7 was similar to that documented in other lineages. These include mutations characteristic of modern lineages – such as constitutive expression of nitrate reductase – as well as mutations in the VirS locus that are commonly found in ancient lineages. We also identified and characterized multiple lineage-specific mutations in L7 in biosynthesis pathways of cell wall lipids, including confirmed deficiency of methoxy-mycolic acids due to a stop-gain mutation in the mmaA3 gene that encodes a methoxy-mycolic acid synthase. We show that the abolished biosynthesis of methoxy-mycolates of L7 alters the cell structure and colony morphology on selected growth media and impacts biofilm formation. The loss of these mycolic acid moieties may change the host–pathogen dynamic for L7 isolates, explaining the limited geographical distribution of L7 and contributing to further understanding the spread of MTBC lineages across the globe.
-
-
-
Evaluation of core genome and whole genome multilocus sequence typing schemes for Campylobacter jejuni and Campylobacter coli outbreak detection in the USA
Campylobacter is a leading causing of bacterial foodborne and zoonotic illnesses in the USA. Pulsed-field gene electrophoresis (PFGE) and 7-gene multilocus sequence typing (MLST) have been historically used to differentiate sporadic from outbreak Campylobacter isolates. Whole genome sequencing (WGS) has been shown to provide superior resolution and concordance with epidemiological data when compared with PFGE and 7-gene MLST during outbreak investigations. In this study, we evaluated epidemiological concordance for high-quality SNP (hqSNP), core genome (cg)MLST and whole genome (wg)MLST to cluster or differentiate outbreak-associated and sporadic Campylobacter jejuni and Campylobacter coli isolates. Phylogenetic hqSNP, cgMLST and wgMLST analyses were also compared using Baker’s gamma index (BGI) and cophenetic correlation coefficients. Pairwise distances comparing all three analysis methods were compared using linear regression models. Our results showed that 68/73 sporadic C. jejuni and C. coli isolates were differentiated from outbreak-associated isolates using all three methods. There was a high correlation between cgMLST and wgMLST analyses of the isolates; the BGI, cophenetic correlation coefficient, linear regression model R 2 and Pearson correlation coefficients were >0.90. The correlation was sometimes lower comparing hqSNP analysis to the MLST-based methods; the linear regression model R 2 and Pearson correlation coefficients were between 0.60 and 0.86, and the BGI and cophenetic correlation coefficient were between 0.63 and 0.86 for some outbreak isolates. We demonstrated that C. jejuni and C. coli isolates clustered in concordance with epidemiological data using WGS-based analysis methods. Discrepancies between allele and SNP-based approaches may reflect the differences between how genomic variation (SNPs and indels) are captured between the two methods. Since cgMLST examines allele differences in genes that are common in most isolates being compared, it is well suited to surveillance: searching large genomic databases for similar isolates is easily and efficiently done using allelic profiles. On the other hand, use of an hqSNP approach is much more computer intensive and not scalable to large sets of genomes. If further resolution between potential outbreak isolates is needed, wgMLST or hqSNP analysis can be used.
-
-
-
Fast and cost-effective SARS-CoV-2 variant detection using Oxford Nanopore full-length spike gene sequencing
Most biologically relevant and diagnostic mutations in the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) genome have been identified in the S gene through global genomic surveillance efforts. However, large-scale whole-genome sequencing (WGS) is still challenging in developing countries due to higher costs, reagent delays and limited infrastructure. Consequently, only a small fraction of SARS-CoV-2 samples are characterized through WGS in these regions. Here, we present a complete workflow consisting of a fast library preparation protocol based on tiled amplification of the S gene, followed by a PCR barcoding step and sequencing using Nanopore platforms. This protocol facilitates fast and cost-effective identification of main variants of concern and mutational surveillance of the S gene. By applying this protocol, report time and overall costs for SARS-CoV-2 variant detection could be reduced, contributing to improved genomic surveillance programmes, particularly in low-income regions.
-