- Volume 8, Issue 6, 2022
Volume 8, Issue 6, 2022
- Bioresources
-
- Genomic Methodologies
-
-
Curation of the AMRFinderPlus databases: applications, functionality and impact
Antimicrobial resistance (AMR) is a significant public health threat. Low-cost whole-genome sequencing, which is often used in surveillance programmes, provides an opportunity to assess AMR gene content in these genomes using in silico approaches. A variety of bioinformatic tools have been developed to identify these genomic elements. Most of those tools rely on reference databases of nucleotide or protein sequences and collections of models and rules for analysis. While the tools are critical for the identification of AMR genes, the databases themselves also provide significant utility for researchers, for applications ranging from sequence analysis to information about AMR phenotypes. Additionally, these databases can be evaluated by domain experts and others to ensure their accuracy. Here we describe how we curate the genes, point mutations and blast rules, and hidden Markov models used in NCBI’s AMRFinderPlus, along with the quality-control steps we take to ensure database quality. We also describe the web interfaces that display the full structure of the database and their newly developed cross-browser relationships. Then, using the Reference Gene Catalog as an example, we detail how the databases, rules and models are made publicly available, as well as how to access the software. In addition, as part of the Pathogen Detection system, we have analysed over 1 million publicly available genomes using AMRFinderPlus and its databases. We discuss how the computed analyses generated by those tools can be accessed through a web interface. Finally, we conclude with NCBI’s plans to make these databases accessible over the long-term.
-
- Research Articles
-
-
-
GR13-type plasmids in Acinetobacter potentiate the accumulation and horizontal transfer of diverse accessory genes
Carbapenem and other antibiotic resistance genes (ARGs) can be found in plasmids in Acinetobacter , but many plasmid types in this genus have not been well-characterized. Here we describe the distribution, diversity and evolutionary capacity of rep group 13 (GR13) plasmids that are found in Acinetobacter species from diverse environments. Our investigation was prompted by the discovery of two GR13 plasmids in A. baumannii isolated in an intensive care unit (ICU). The plasmids harbour distinct accessory genes: pDETAB5 contains bla NDM-1 and genes that confer resistance to four further antibiotic classes, while pDETAB13 carries putative alcohol tolerance determinants. Both plasmids contain multiple dif modules, which are flanked by pdif sites recognized by XerC/XerD tyrosine recombinases. The ARG-containing dif modules in pDETAB5 are almost identical to those found in pDETAB2, a GR34 plasmid from an unrelated A. baumannii isolated in the same ICU a month prior. Examination of a further 41 complete, publicly available plasmid sequences revealed that the GR13 pangenome consists of just four core but 1186 accessory genes, 123 in the shell and 1063 in the cloud, reflecting substantial capacity for diversification. The GR13 core genome includes genes for replication and partitioning, and for a putative tyrosine recombinase. Accessory segments encode proteins with diverse putative functions, including for metabolism, antibiotic/heavy metal/alcohol tolerance, restriction-modification, an anti-phage system and multiple toxin–antitoxin systems. The movement of dif modules and actions of insertion sequences play an important role in generating diversity in GR13 plasmids. Discrete GR13 plasmid lineages are internationally disseminated and found in multiple Acinetobacter species, which suggests they are important platforms for the accumulation, horizontal transmission and persistence of accessory genes in this genus.
-
-
- Functional Genomics and Microbe–Niche Interactions
-
-
The niche-specialist and age-related oral microbial ecosystem: crosstalk with host immune cells in homeostasis
Although characterization of the baseline oral microbiota has been discussed, the current literature seems insufficient to draw a definitive conclusion on the interactions between the microbes themselves or with the host. This study focuses on the spatial and temporal characteristics of the oral microbial ecosystem in a mouse model and its crosstalk with host immune cells in homeostasis. The V3V4 regions of the 16S rRNA gene of 20 samples from four niches (tongue, buccal mucosa, keratinized gingiva and hard palate) and 10 samples from two life stages (adult and old) were analysed. Flow cytometry (FCM) was used to investigate the resident immune cells. The niche-specialist and age-related communities, characterized based on the microbiota structure, interspecies communications, microbial functions and interactions with immune cells, were addressed. The phylum Firmicutes was the major component in the oral community. The microbial community profiles at the genus level showed that the relative abundances of the genera Bacteroides , Lactobacillus and Porphyromonas were enriched in the gingiva. The abundance of the genera Streptococcus , Faecalibaculum and Veillonella was increased in palatal samples, while the abundance of Neisseria and Bradyrhizobium was enriched in buccal samples. The genera Corynebacterium , Stenotrophomonas, Streptococcus and Fusobacterium were proportionally enriched in old samples, while Prevotella and Lacobacillus were enriched in adult samples. Network analysis showed that the genus Lactobacillus performed as a central node in the buccal module, while in the gingiva module, the central nodes were Nesterenkonia and Hydrogenophilus . FCM showed that the proportion of Th1 cells in the tongue samples (38.18 % [27.03–49.34 %]) (mean [range]) was the highest. The proportion of γδT cells in the buccal mucosa (25.82 % [22.1–29.54 %]) and gingiva (20.42 % [18.31–22.53 %]) samples was higher (P<0.01) than those in the palate (14.18 % [11.69–16.67 %]) and tongue (9.38 % [5.38–13.37 %] samples. The proportion of Th2 (31.3 % [16.16–46.44 %]), Th17 (27.06 % [15.76–38.36 %]) and Treg (29.74 % [15.71–43.77 %]) cells in the old samples was higher than that in the adult samples (P<0.01). Further analysis of the interplays between the microbiomes and immune cells indicated that Th1 cells in the adult group, nd Th2, Th17 and Treg cells in the old group were the main immune factors strongly associated with the oral microbiota. For example, Th2, Th17 and Treg cells showed a significantly positive correlation with age-related microorganisms such as Sphingomonas , Streptococcus and Acinetobacter , while Th1 cells showed a negative correlation. Another positive correlation occurred between Th1 cells and several commensal microbiomes such as Lactobacillus , Jeotgalicoccus and Sporosarcina . Th2, Th17 and Treg cells showed the opposite trend. Together, our findings identify the niche-specialist and age-related characteristics of the oral microbial ecosystem and the potential associations between the microbiomes and the mucosal immune cells, providing critical insights into mucosal microbiology.
-
-
-
Identification and characterization of virus-encoded circular RNAs in host cells
Emerging evidence has identified viral circular RNAs (circRNAs) in human cells infected by viruses, interfering with the immune system and inducing diseases including human cancer. However, the biogenesis and regulatory mechanisms of virus-encoded circRNAs in host cells remain unknown. In this study, we used the circRNA detection tool CIRI2 to systematically determine the virus-encoded circRNAs in virus-infected cancer cell lines and cancer patients, by analysing RNA-Seq datasets derived from RNase R-treated samples. Based on the thousands of viral circRNAs we identified, the biological characteristics and potential roles of viral circRNAs in regulating host cell function were determined. In addition, we developed a Viral-circRNA Database (http://www.hywanglab.cn/vcRNAdb/), which is open to all users to search, browse and download information on circRNAs encoded by viruses upon infection.
-
- Microbial Communities
-
-
Insights into plastic biodegradation: community composition and functional capabilities of the superworm (Zophobas morio) microbiome in styrofoam feeding trials
More LessPlastics are inexpensive and widely used organic polymers, but their high durability hinders biodegradation. Polystyrene, including extruded polystyrene (also known as styrofoam), is among the most commonly produced plastics worldwide and is recalcitrant to microbial degradation. In this study, we assessed changes in the gut microbiome of superworms (Zophobas morio) reared on bran, polystyrene or under starvation conditions over a 3 weeks period. Superworms on all diets were able to complete their life cycle to pupae and imago, although superworms reared on polystyrene had minimal weight gains, resulting in lower pupation rates compared to bran reared worms. The change in microbial gut communities from baseline differed considerably between diet groups, with polystyrene and starvation groups characterized by a loss of microbial diversity and the presence of opportunistic pathogens. Inferred microbial functions enriched in the polystyrene group included transposon movements, membrane restructuring and adaptations to oxidative stress. We detected several encoded enzymes with reported polystyrene and styrene degradation abilities, supporting previous reports of polystyrene-degrading bacteria in the superworm gut. By recovering metagenome-assembled genomes (MAGs) we linked phylogeny and functions and identified genera including Pseudomonas , Rhodococcus and Corynebacterium that possess genes associated with polystyrene degradation. In conclusion, our results provide the first metagenomic insights into the metabolic pathways used by the gut microbiome of superworms to degrade polystyrene. Our results also confirm that superworms can survive on polystyrene feed, but this diet has considerable negative impacts on host gut microbiome diversity and health.
-
- Pathogens and Epidemiology
-
-
Population genomics of pneumococcal carriage in South Africa following the introduction of the 13-valent pneumococcal conjugate vaccine (PCV13) immunization
Streptococcus pneumoniae is a major human pathogen responsible for over 317000 deaths in children <5 years of age with the burden of the disease being highest in low- and middle-income countries including South Africa. Following the introduction of the 7-valent and 13-valent pneumococcal conjugate vaccine (PCV) in South Africa in 2009 and 2011, respectively, a decrease in both invasive pneumococcal infections and asymptomatic carriage of vaccine-type pneumococci were reported. In this study, we described the changing epidemiology of the pneumococcal carriage population in South Africa, by sequencing the genomes of 1825 isolates collected between 2009 and 2013. Using these genomic data, we reported the changes in serotypes, Global Pneumococcal Sequence Clusters (GPSCs), and antibiotic resistance before and after the introduction of PCV13. The pneumococcal carriage population in South Africa has a high level of diversity, comprising of 126 GPSCs and 49 serotypes. Of the ten most prevalent GPSCs detected, six were predominantly found in Africa (GPSC22, GPSC21, GPSC17, GPSC33, GPSC34 and GPSC52). We found a significant decrease in PCV7 serotypes (19F, 6B, 23F and 14) and an increase in non-vaccine serotypes (NVT) (16F, 34, 35B and 11A) among children <2 years of age. The increase in NVTs was driven by pneumococcal lineages GPSC33, GPSC34, GPSC5 and GPSC22. Overall, a decrease in antibiotic resistance for 11 antimicrobials was detected in the PCV13 era. Further, we reported a higher resistance prevalence among vaccine types (VTs), as compared to NVTs; however, an increase in penicillin resistance among NVT was observed between the PCV7 and PCV13 eras. The carriage isolates from South Africa predominantly belonged to pneumococcal lineages, which are endemic to Africa. While the introduction of PCV resulted in an overall reduction of resistance in pneumococcal carriage isolates, an increase in penicillin resistance among NVTs was detected in children aged between 3 and 5 years, driven by the expansion of penicillin-resistant clones associated with NVTs in the PCV13 era.
-
-
-
Local accessory gene sharing among Egyptian Campylobacter potentially promotes the spread of antimicrobial resistance
Campylobacter is the most common cause of bacterial gastroenteritis worldwide, and diarrhoeal disease is a major cause of child morbidity, growth faltering and mortality in low- and middle-income countries. Despite evidence of high incidence and differences in disease epidemiology, there is limited genomic data from studies in developing countries. In this study, we aimed to quantify the extent of gene sharing in local and global populations. We characterized the genetic diversity and accessory-genome content of a collection of Campylobacter isolates from the Cairo metropolitan area, Egypt. In total, 112 Campylobacter isolates were collected from broiler carcasses (n=31), milk and dairy products (n=24), and patients suffering from gastroenteritis (n=57). Among the most common sequence types (STs), we identified the globally disseminated host generalist ST-21 clonal complex (CC21) and the poultry specialists CC206, CC464 and CC48. Notably, CC45 and the cattle-specialist CC42 were under-represented, with a total absence of CC61. Core- and accessory-genome sharing was compared among isolates from Egypt and a comparable collection from the UK (Oxford). Lineage-specific accessory-genome sharing was significantly higher among isolates from the same country, particularly CC21, which demonstrated greater local geographical clustering. In contrast, no geographical clustering was noted in either the core or accessory genome of CC828, suggesting a highly admixed population. A greater proportion of Campylobacter coli isolates were multidrug resistant compared to Campylobacter jejuni . Our results suggest that there is more horizontal transfer of accessory genes between strains in Egypt. This has strong implications for controlling the spread of antimicrobial resistance among this important pathogen.
-
-
-
A critical evaluation of Mycobacterium bovis pangenomics, with reference to its utility in outbreak investigation
More LessThe increased accessibility of next generation sequencing has allowed enough genomes from a given bacterial species to be sequenced to describe the distribution of genes in the pangenome, without limiting analyses to genes present in reference strains. Although some taxa have thousands of whole genome sequences available on public databases, most genomes were sequenced with short read technology, resulting in incomplete assemblies. Studying pangenomes could lead to important insights into adaptation, pathogenicity, or molecular epidemiology, however given the known information loss inherent in analyzing contig-level assemblies, these inferences may be biased or inaccurate. In this study we describe the pangenome of a clonally evolving pathogen, Mycobacterium bovis , and examine the utility of gene content variation in M. bovis outbreak investigation. We constructed the M. bovis pangenome using 1463 de novo assembled genomes. We tested the assumption of strict clonal evolution by studying evidence of recombination in core genes and analyzing the distribution of accessory genes among core monophyletic groups. To determine if gene content variation could be utilized in outbreak investigation, we carefully examined accessory genes detected in a well described M. bovis outbreak in Minnesota. We found significant errors in accessory gene classification. After accounting for these errors, we show that M. bovis has a much smaller accessory genome than previously described and provide evidence supporting ongoing clonal evolution and a closed pangenome, with little gene content variation generated over outbreaks. We also identified frameshift mutations in multiple genes, including a mutation in glpK, which has recently been associated with antibiotic tolerance in Mycobacterium tuberculosis . A pangenomic approach enables a more comprehensive analysis of genome dynamics than is possible with reference-based approaches; however, without critical evaluation of accessory gene content, inferences of transmission patterns employing these loci could be misguided.
-
-
-
Whole-genome sequencing enhances existing pathogen and antimicrobial-resistance surveillance schemes within a neonatal unit
More LessIn some neonatal units, the screening of isolates for antimicrobial-resistant organisms is a matter of routine, with theoretical benefits including the prevention or early detection of outbreaks. This study sought to use whole-genome sequencing (WGS) retrospectively to characterize the genomic epidemiology of Gram-negative organisms obtained from a screening programme in a 32-bed unit providing intensive, high-dependency and special care at City Hospital, Birmingham, UK, identifying occult transmission events and clinically important antimicrobial-resistance (AMR) genes. WGS was performed for 155 isolates collected from rectal and umbilical screening swabs over a 2 month period from 44 individual neonates. Genomic epidemiological analysis showed possible transmission events involving Escherichia coli , Enterobacter cloacae , Klebsiella oxytoca and Klebsiella pneumoniae not detected by routine screening, with eight putative clusters involving different individuals identified. Within phylogenetic clusters, the relatedness of organisms – as determined by the abundance of SNPs – varied widely, indicating that a variety of transmission routes may be implicated. While clinically important AMR genes were not present in the putative transmission clusters, our observation of suspected interspecies horizontal transfer of bla CTX-M-15 within individuals highlights the potential for their spread between organisms as well as individuals in this environment, with implications for surveillance. Our data show that WGS may reveal occult Gram-negative transmission events, demonstrating the potential of sequencing-based surveillance systems for nosocomial pathogens. Challenges remain in understanding how to utilize WGS surveillance to maximum effect in real-world settings.
-
-
-
Genome plasticity driven by aneuploidy and loss of heterozygosity in Trypanosoma cruzi
Trypanosoma cruzi the causative agent of Chagas disease shows a marked genetic diversity and divided into at least six Discrete Typing Units (DTUs). High intra genetic variability has been observed in the TcI DTU, the most widely distributed DTU, where patterns of genomic diversity can provide information on ecological and evolutionary processes driving parasite population structure and genome organization. Chromosomal aneuploidies and rearrangements across multigene families represent an evidence of T. cruzi genome plasticity. We explored genomic diversity among 18 Colombian T. cruzi I clones and 15 T. cruzi I South American strains. Our results confirm high genomic variability, heterozygosity and presence of a clade compatible with the TcIdom genotype, described for strains from humans in Colombia and Venezuela. TcI showed high structural plasticity across the geographical region studied. Differential events of whole and segmental aneuploidy (SA) along chromosomes even between clones from the same strain were found and corroborated by the depth and allelic frequency. We detected loss of heterozygosity (LOH) events in different chromosomes, however, the size and location of segments under LOH varied between clones. Genes adjacent to breakpoints were evaluated, and retrotransposon hot spot genes flanked the beginning of segmental aneuploidies. Our results suggest that T. cruzi genomes, like those of Leishmania, may have a highly unstable structure and there is now an urgent need to design experiments to explore any potential adaptive role for the plasticity observed.
-
-
-
Virulence plasmid pINV as a genetic signature for Shigella flexneri phylogeny
More LessShigella flexneri is a major health burden in low- and middle-income countries, where it is a leading cause of mortality associated with diarrhoea in children, and shows an increasing incidence among travellers and men having sex with men. Like all Shigella spp., S. flexneri has evolved from commensal Escherichia coli following the acquisition of a large plasmid pINV, which contains genes essential for virulence. Current sequence typing schemes of Shigella are based on combinations of chromosomal genetic loci, since pINV-encoded virulence genes are often lost during growth in the laboratory, making these elements inappropriate for sequence typing. By performing comparative analysis of pINVs from S. flexneri strains isolated from different geographical regions and belonging to different serotypes, we found that in contrast to plasmid-encoded virulence genes, plasmid maintenance genes are highly stable pINV-encoded elements. For the first time, to our knowledge, we have developed a S. flexneri plasmid multilocus sequence typing (pMLST) method based on different combinations of alleles of the vapBC and yacAB toxin–antitoxin (TA) systems, and the parAB partitioning system. This enables typing of S. flexneri pINV plasmids into distinct ‘virulence sequence types’ (vSTs). Furthermore, the phylogenies of vST alleles and bacterial host core genomes suggests an intimate co-evolution of pINV with the chromosome of its bacterial host, consistent with previous findings. This work demonstrates the potential of plasmid maintenance loci as genetic characteristics to study as well as to trace the molecular phylogenesis of S. flexneri pINV and the phylogenetic relationship of this plasmid with its bacterial host.
-
- Evolution and Responses to Interventions
-
-
Genetic background of Cambodian pneumococcal carriage isolates following pneumococcal conjugate vaccine 13
Streptococcus pneumoniae (the pneumococcus) is a leading cause of childhood mortality globally and in Cambodia. It is commensal in the human nasopharynx, occasionally resulting in invasive disease. Monitoring population genetic shifts, characterized by lineage and serotype expansions, as well as antimicrobial-resistance (AMR) patterns is crucial for assessing and predicting the impact of vaccination campaigns. We sought to elucidate the genetic background (global pneumococcal sequence clusters; GPSCs) of pneumococci carried by Cambodian children following perturbation by pneumococcal conjugate vaccine (PCV) 13. We sequenced pre-PCV13 (01/2013–12/2015, N=258) and post-PCV13 carriage isolates (01/2016–02/2017, N=428) and used PopPUNK and SeroBA to determine lineage prevalence and serotype composition. Following PCV13 implementation in Cambodia, we saw expansions of non-vaccine type (NVT) serotypes 23A (GPSC626), 34 (GPSC45) and 6D (GPSC16). We predicted antimicrobial susceptibility using the CDC-AMR pipeline and determined concordance with phenotypic data. The CDC-AMR pipeline had >90 % concordance with the phenotypic antimicrobial-susceptibility testing. We detected a high prevalence of AMR in both expanding non-vaccine serotypes and residual vaccine serotype 6B. Persistently high levels of AMR, specifically persisting multidrug-resistant lineages, warrant concern. The implementation of PCV13 in Cambodia has resulted in NVT serotype expansion reflected in the carriage population and driven by specific genetic backgrounds. Continued monitoring of these GPSCs during the ongoing collection of additional carriage isolates in this population is necessary.
-
- Methods
-
- Genomic Methodologies
-
-
Catwalk: identifying closely related sequences in large microbial sequence databases
More LessThere is a need to identify microbial sequences that may form part of transmission chains, or that may represent importations across national boundaries, amidst large numbers of SARS-CoV-2 and other bacterial or viral sequences. Reference-based compression is a sequence analysis technique that allows both a compact storage of sequence data and comparisons between sequences. Published implementations of the approach are being challenged by the large sample collections now being generated. Our aim was to develop a fast software detecting highly similar sequences in large collections of microbial genomes, including millions of SARS-CoV-2 genomes. To do so, we developed Catwalk, a tool that bypasses bottlenecks in the generation, comparison and in-memory storage of microbial genomes generated by reference mapping. It is a compiled solution, coded in Nim to increase performance. It can be accessed via command line, rest api or web server interfaces. We tested Catwalk using both SARS-CoV-2 and Mycobacterium tuberculosis genomes generated by prospective public-health sequencing programmes. Pairwise sequence comparisons, using clinically relevant similarity cut-offs, took about 0.39 and 0.66 μs, respectively; in 1 s, between 1 and 2 million sequences can be searched. Catwalk operates about 1700 times faster than, and uses about 8 % of the RAM of, a Python reference-based compression and comparison tool in current use for outbreak detection. Catwalk can rapidly identify close relatives of a SARS-CoV-2 or M. tuberculosis genome amidst millions of samples.
-