Bacteriophage
Guest-edited by Professor Tetsuya Hayashi (Kyushu University), this collection brings together original Research Articles, Methods, Mini Reviews, and full-length Reviews relating to the diversity of bacteriophages and genomics-based research with a focus on their roles in the evolution of bacteria and ecosystems. Microbial Genomics also welcomes large-scale genomic analyses of bacteriophages which provide the basis for developing novel phage therapies.
This collection is now open for submissions. Submit here and state that your manuscript is part of the Bacteriophage Collection.
Collection Contents
15 results
-
-
Co-transfer of functionally interdependent genes contributes to genome mosaicism in lambdoid phages
More LessLambdoid (or Lambda-like) phages are a group of related temperate phages that can infect Escherichia coli and other gut bacteria. A key characteristic of these phages is their mosaic genome structure, which served as the basis for the ‘modular genome hypothesis’. Accordingly, lambdoid phages evolve by transferring genomic regions, each of which constitutes a functional unit. Nevertheless, it is unknown which genes are preferentially transferred together and what drives such co-transfer events. Here we aim to characterize genome modularity by studying co-transfer of genes among 95 distantly related lambdoid (pro-)phages. Based on gene content, we observed that the genomes cluster into 12 groups, which are characterized by a highly similar gene content within the groups and highly divergent gene content across groups. Highly similar proteins can occur in genomes of different groups, indicating that they have been transferred. About 26 % of homologous protein clusters in the four known operons (i.e. the early left, early right, immunity and late operon) engage in gene transfer, which affects all operons to a similar extent. We identified pairs of genes that are frequently co-transferred and observed that these pairs tend to be near one another on the genome. We find that frequently co-transferred genes are involved in related functions and highlight interesting examples involving structural proteins, the cI repressor and Cro regulator, proteins interacting with DNA, and membrane-interacting proteins. We conclude that epistatic effects, where the functioning of one protein depends on the presence of another, play an important role in the evolution of the modular structure of these genomes.
-
-
-
Cryptic prophages within a Streptococcus pyogenes genotype emm4 lineage
More LessThe major human pathogen Streptococcus pyogenes shares an intimate evolutionary history with mobile genetic elements, which in many cases carry genes encoding bacterial virulence factors. During recent whole-genome sequencing of a longitudinal sample of S. pyogenes isolates in England, we identified a lineage within emm4 that clustered with the reference genome MEW427. Like MEW427, this lineage was characterized by substantial gene loss within all three prophage regions, compared to MGAS10750 and isolates outside of the MEW427-like lineage. Gene loss primarily affected lysogeny, replicative and regulatory modules, and to a lesser and more variable extent, structural genes. Importantly, prophage-encoded superantigen and DNase genes were retained in all isolates. In isolates where the prophage elements were complete, like MGAS10750, they could be induced experimentally, but not in MEW427-like isolates with degraded prophages. We also found gene loss within the chromosomal island SpyCIM4 of MEW427-like isolates, although surprisingly, the SpyCIM4 element could not be experimentally induced in either MGAS10750-like or MEW427-like isolates. This did not, however, appear to abolish expression of the mismatch repair operon, within which this element resides. The inclusion of further emm4 genomes in our analyses ratified our observations and revealed an international emm4 lineage characterized by prophage degradation. Intriguingly, the USA population of emm4 S. pyogenes appeared to constitute predominantly MEW427-like isolates, whereas the UK population comprised both MEW427-like and MGAS10750-like isolates. The degraded and cryptic nature of these elements may have important phenotypic and fitness ramifications for emm4 S. pyogenes , and the geographical distribution of this lineage raises interesting questions on the population dynamics of the genotype.
-
-
-
Whole-genome epidemiology links phage-mediated acquisition of a virulence gene to the clonal expansion of a pandemic Salmonella enterica serovar Typhimurium clone
Epidemic and pandemic clones of bacterial pathogens with distinct characteristics continually emerge, replacing those previously dominant through mechanisms that remain poorly characterized. Here, whole-genome-sequencing-powered epidemiology linked horizontal transfer of a virulence gene, sopE, to the emergence and clonal expansion of a new epidemic Salmonella enterica serovar Typhimurium (S. Typhimurium) clone. The sopE gene is sporadically distributed within the genus Salmonella and rare in S . enterica Typhimurium lineages, but was acquired multiple times during clonal expansion of the currently dominant pandemic monophasic S. Typhimurium sequence type (ST) 34 clone. Ancestral state reconstruction and time-scaled phylogenetic analysis indicated that sopE was not present in the common ancestor of the epidemic clade, but later acquisition resulted in increased clonal expansion of sopE-containing clones that was temporally associated with emergence of the epidemic, consistent with increased fitness. The sopE gene was mainly associated with a temperate bacteriophage mTmV, but recombination with other bacteriophage and apparent horizontal gene transfer of the sopE gene cassette resulted in distribution among at least four mobile genetic elements within the monophasic S . enterica Typhimurium ST34 epidemic clade. The mTmV prophage lysogenic transfer to other S. enterica serovars in vitro was limited, but included the common pig-associated S . enterica Derby (S. Derby). This may explain mTmV in S. Derby co-circulating on farms with monophasic S. Typhimurium ST34, highlighting the potential for further transfer of the sopE virulence gene in nature. We conclude that whole-genome epidemiology pinpoints potential drivers of evolutionary and epidemiological dynamics during pathogen emergence, and identifies targets for subsequent research in epidemiology and bacterial pathogenesis.
-
-
-
Genomic analysis of 40 prophages located in the genomes of 16 carbapenemase-producing clinical strains of Klebsiella pneumoniae
Klebsiella pneumoniae is the clinically most important species within the genus Klebsiella and, as a result of the continuous emergence of multi-drug resistant (MDR) strains, the cause of severe nosocomial infections. The decline in the effectiveness of antibiotic treatments for infections caused by MDR bacteria has generated particular interest in the study of bacteriophages. In this study, we characterized a total of 40 temperate bacteriophages (prophages) with a genome range of 11.454–84.199 kb, predicted from 16 carbapenemase-producing clinical strains of K. pneumoniae belonging to different sequence types, previously identified by multilocus sequence typing. These prophages were grouped into the three families in the order Caudovirales (27 prophages belonging to the family Myoviridae, 10 prophages belonging to the family Siphoviridae and 3 prophages belonging to the family Podoviridae). Genomic comparison of the 40 prophage genomes led to the identification of four prophages isolated from different strains and of genome sizes of around 33.3, 36.1, 39.6 and 42.6 kb. These prophages showed sequence similarities (query cover >90 %, identity >99.9 %) with international Microbe Versus Phage (MVP) (http://mvp.medgenius.info/home) clusters 4762, 4901, 3499 and 4280, respectively. Phylogenetic analysis revealed the evolutionary proximity among the members of the four groups of the most frequently identified prophages in the bacterial genomes studied (33.3, 36.1, 39.6 and 42.6 kb), with bootstrap values of 100 %. This allowed the prophages to be classified into three clusters: A, B and C. Interestingly, these temperate bacteriophages did not infect the highest number of strains as indicated by a host-range assay, these results could be explained by the development of superinfection exclusion mechanisms. In addition, bioinformatic analysis of the 40 identified prophages revealed the presence of 2363 proteins. In total, 59.7 % of the proteins identified had a predicted function, mainly involving viral structure, transcription, replication and regulation (lysogenic/lysis). Interestingly, some proteins had putative functions associated with bacterial virulence (toxin expression and efflux pump regulators), phage defence profiles such as toxin–antitoxin modules, an anti-CRISPR/Cas9 protein, TerB protein (from terZABCDE operon) and methyltransferase proteins.
-
-
-
Comparison of Shiga toxin-encoding bacteriophages in highly pathogenic strains of Shiga toxin-producing Escherichia coli O157:H7 in the UK
More LessOver the last 35 years in the UK, the burden of Shiga toxin-producing Escherichia coli (STEC) O157:H7 infection has, during different periods of time, been associated with five different sub-lineages (1983–1995, Ia, I/IIa and I/IIb; 1996–2014, Ic; and 2015–2018, IIb). The acquisition of a stx2a-encoding bacteriophage by these five sub-lineages appears to have coincided with their respective emergences. The Oxford Nanopore Technologies (ONT) system was used to sequence, characterize and compare the stx-encoding prophages harboured by each sub-lineage to investigate the integration of this key virulence factor. The stx2a-encoding prophages from each of the lineages causing clinical disease in the UK were all different, including the two UK sub-lineages (Ia and I/IIa) circulating concurrently and causing severe disease in the early 1980s. Comparisons between the stx2a-encoding prophage in sub-lineages I/IIb and IIb revealed similarity to the prophage commonly found to encode stx2c, and the same site of bacteriophage integration (sbcB) as stx2c-encoding prophage. These data suggest independent acquisition of previously unobserved stx2a-encoding phage is more likely to have contributed to the emergence of STEC O157:H7 sub-lineages in the UK than intra-UK lineage to lineage phage transmission. In contrast, the stx2c-encoding prophage showed a high level of similarity across lineages and time, consistent with the model of stx2c being present in the common ancestor to extant STEC O157:H7 and maintained by vertical inheritance in the majority of the population. Studying the nature of the stx-encoding bacteriophage contributes to our understanding of the emergence of highly pathogenic strains of STEC O157:H7.
-
-
-
A window into lysogeny: revealing temperate phage biology with transcriptomics
More LessProphages are integrated phage elements that are a pervasive feature of bacterial genomes. The fitness of bacteria is enhanced by prophages that confer beneficial functions such as virulence, stress tolerance or phage resistance, and these functions are encoded by ‘accessory’ or ‘moron’ loci. Whilst the majority of phage-encoded genes are repressed during lysogeny, accessory loci are often highly expressed. However, it is challenging to identify novel prophage accessory loci from DNA sequence data alone. Here, we use bacterial RNA-seq data to examine the transcriptional landscapes of five Salmonella prophages. We show that transcriptomic data can be used to heuristically enrich for prophage features that are highly expressed within bacterial cells and represent functionally important accessory loci. Using this approach, we identify a novel antisense RNA species in prophage BTP1, STnc6030, which mediates superinfection exclusion of phage BTP1. Bacterial transcriptomic datasets are a powerful tool to explore the molecular biology of temperate phages.
-
-
-
Paenibacillus larvae bacteriophages: obscure past, promising future
More LessPaenibacillus larvae is a Gram-positive, spore-forming bacterium that is the causative agent of American foulbrood (AFB), the most devastating bacterial disease of the honeybee. P. larvae is antibiotic resistant, complicating treatment efforts. Bacteriophages that target P. larvae are rapidly emerging as a promising treatment. The first P. larvae phages were isolated in the 1950s, but as P. larvae was not antibiotic resistant at the time, interest in them remained scant. Interest in P. larvae phages has grown rapidly since the first P. larvae phage genome was sequenced in 2013. Since then, the number of sequenced P. larvae phage genomes has reached 48 and is set to grow further. All sequenced P. larvae phages encode a conserved N-acetylmuramoyl-l-alanine amidase that is responsible for cleaving the peptidoglycan cell wall of P. larvae . All P. larvae phages also encode either an integrase, excisionase or Cro/CI, indicating that they are temperate. In the last few years, several studies have been published on using P. larvae phages and the P. larvae phage amidase as treatments for AFB. Studies were conducted on infected larvae in vitro and also on hives in the field. The phages have a prophylactic effect, preventing infection, and also a curative effect, helping resolve infection. P. larvae phages have a narrow range, lysing only P. larvae , and are unable to lyse even related Paenibacillus species. P. larvae phages thus appear to be safe to use and effective as treatment for AFB, and interest in them in the coming years will continue to grow.
-
-
-
Differential dynamics and impacts of prophages and plasmids on the pangenome and virulence factor repertoires of Shiga toxin-producing Escherichia coli O145:H28
Keiji Nakamura, Kazunori Murase, Mitsuhiko P. Sato, Atsushi Toyoda, Takehiko Itoh, Jacques Georges Mainil, Denis Piérard, Shuji Yoshino, Keiko Kimata, Junko Isobe, Kazuko Seto, Yoshiki Etoh, Hiroshi Narimatsu, Shioko Saito, Jun Yatsuyanagi, Kenichi Lee, Sunao Iyoda, Makoto Ohnishi, Tadasuke Ooka, Yasuhiro Gotoh, Yoshitoshi Ogura and Tetsuya HayashiPhages and plasmids play important roles in bacterial evolution and diversification. Although many draft genomes have been generated, phage and plasmid genomes are usually fragmented, limiting our understanding of their dynamics. Here, we performed a systematic analysis of 239 draft genomes and 7 complete genomes of Shiga toxin (Stx)-producing Escherichia coli O145:H28, the major virulence factors of which are encoded by prophages (PPs) or plasmids. The results indicated that PPs are more stably maintained than plasmids. A set of ancestrally acquired PPs was well conserved, while various PPs, including Stx phages, were acquired by multiple sublineages. In contrast, gains and losses of a wide range of plasmids have frequently occurred across the O145:H28 lineage, and only the virulence plasmid was well conserved. The different dynamics of PPs and plasmids have differentially impacted the pangenome of O145:H28, with high proportions of PP- and plasmid-associated genes in the variably present and rare gene fractions, respectively. The dynamics of PPs and plasmids have also strongly impacted virulence gene repertoires, such as the highly variable distribution of stx genes and the high conservation of a set of type III secretion effectors, which probably represents the core effectors of O145:H28 and the genes on the virulence plasmid in the entire O145:H28 population. These results provide detailed insights into the dynamics of PPs and plasmids, and show the application of genomic analyses using a large set of draft genomes and appropriately selected complete genomes.
-
-
-
Analysis of genetic recombination and the pan-genome of a highly recombinogenic bacteriophage species
More LessBacteriophages are the most prevalent biological entities impacting on the ecosystem and are characterized by their extensive diversity. However, there are two aspects of phages that have remained largely unexplored: genetic flux by recombination between phage populations and characterization of specific phages in terms of the pan-genome. Here, we examined the recombination and pan-genome in Helicobacter pylori prophages at both the genome and gene level. In the genome-level analysis, we applied, for the first time, chromosome painting and fineSTRUCTURE algorithms to a phage species, and showed novel trends in inter-population genetic flux. Notably, hpEastAsia is a phage population that imported a higher proportion of DNA fragments from other phages, whereas the hpSWEurope phages showed weaker signatures of inter-population recombination, suggesting genetic isolation. The gene-level analysis showed that, after parameter tuning of the prokaryote pan-genome analysis program, H. pylori phages have a pan-genome consisting of 75 genes and a soft-core genome of 10 genes, which includes genes involved in the lytic and lysogenic life cycles. Quantitative analysis of recombination events of the soft-core genes showed no substantial variation in the intensity of recombination across the genes, but rather equally frequent recombination among housekeeping genes that were previously reported to be less prone to recombination. The signature of frequent recombination appears to reflect the host–phage evolutionary arms race, either by contributing to escape from bacterial immunity or by protecting the host by producing defective phages.
-
-
-
Correlation between bacterial G+C content, genome size and the G+C content of associated plasmids and bacteriophages
More LessBased on complete bacterial genome sequence data, we demonstrate a correlation between bacterial chromosome length and the G+C content of the genome, with longer genomes having higher G+C contents. The correlation value decreases at shorter genome sizes, where there is a wider spread of G+C values. However, although significant (P<0.001), the correlation value (Pearson R=0.58) suggests that other factors also have a significant influence. A similar pattern was seen for plasmids; longer plasmids had higher G+C values, although the large number of shorter plasmids had a wide spread of G+C values. There was also a significant (P<0.0001) correlation between the G+C content of plasmids and the G+C content of their bacterial host. Conversely, the G+C content of bacteriophages tended to reduce with larger genome sizes, and although there was a correlation between host genome G+C content and that of the bacteriophage, it was not as strong as that seen between plasmids and their hosts.
-
-
-
The lytic Myoviridae of Enterobacteriaceae form tight recombining assemblages separated by discontinuities in genome average nucleotide identity and lateral gene flow
More LessIn Bacteria, a working consensus of species circumscription may have been reached and one of the most prominent criteria is high average nucleotide identity (ANI). ANI in effect groups strains that may recombine more or less frequently, depending on their biology, as opposed to rare interspecies gene transfer. For bacteriophages, which show various lifestyles, the nature of the fundamental natural unit, if it exists in a biological sense, is not well understood and defined. The approaches based on dot-plots are useful to group similar bacteriophages, yet are not quantitative and use arbitrarily set cut-offs. Here, we focus on lytic Myoviridae and test the ANI metric for group delineation. We show that ANI-based groups are in agreement with the International Committee on Taxonomy of Viruses (ICTV) classification and already established dot-plot groups, which are occasionally further refined owing to higher resolution of ANI analysis. Furthermore, these groups are separated among themselves by clear ANI discontinuities. Their members readily exchange core genes with each other while they do not with bacteriophages of other ANI groups, not even with the most similar. Thus, ANI-delineated groups may represent the natural units in lytic Myoviridae evolution with features that resemble those encountered in bacterial species.
-
-
-
Evolution of a zoonotic pathogen: investigating prophage diversity in enterohaemorrhagic Escherichia coli O157 by long-read sequencing
Enterohaemorrhagic Escherichia coli (EHEC) O157 is a zoonotic pathogen for which colonization of cattle and virulence in humans is associated with multiple horizontally acquired genes, the majority present in active or cryptic prophages. Our understanding of the evolution and phylogeny of EHEC O157 continues to develop primarily based on core genome analyses; however, such short-read sequences have limited value for the analysis of prophage content and its chromosomal location. In this study, we applied Single Molecule Real Time (SMRT) sequencing, using the Pacific Biosciences long-read sequencing platform, to isolates selected from the main sub-clusters of this clonal group. Prophage regions were extracted from these sequences and from published reference strains. Genome position and prophage diversity were analysed along with genetic content. Prophages could be assigned to clusters, with smaller prophages generally exhibiting less diversity and preferential loss of structural genes. Prophages encoding Shiga toxin (Stx) 2a and Stx1a were the most diverse, and more variable compared to prophages encoding Stx2c, further supporting the hypothesis that Stx2c-prophage integration was ancestral to acquisition of other Stx types. The concept that phage type (PT) 21/28 (Stx2a+, Stx2c+) strains evolved from PT32 (Stx2c+) was supported by analysis of strains with excised Stx-encoding prophages. Insertion sequence elements were over-represented in prophage sequences compared to the rest of the genome, showing integration in key genes such as stx and an excisionase, the latter potentially acting to capture the bacteriophage into the genome. Prophage profiling should allow more accurate prediction of the pathogenic potential of isolates.
-
-
-
Evolutionary interpretations of mycobacteriophage biodiversity and host-range through the analysis of codon usage bias
More LessIn an genomics course sponsored by the Howard Hughes Medical Institute (HHMI), undergraduate students have isolated and sequenced the genomes of more than 1,150 mycobacteriophages, creating the largest database of sequenced bacteriophages able to infect a single host, Mycobacterium smegmatis, a soil bacterium. Genomic analysis indicates that these mycobacteriophages can be grouped into 26 clusters based on genetic similarity. These clusters span a continuum of genetic diversity, with extensive genomic mosaicism among phages in different clusters. However, little is known regarding the primary hosts of these mycobacteriophages in their natural habitats, nor of their broader host ranges. As such, it is possible that the primary host of many newly isolated mycobacteriophages is not M. smegmatis, but instead a range of closely related bacterial species. However, determining mycobacteriophage host range presents difficulties associated with mycobacterial cultivability, pathogenicity and growth. Another way to gain insight into mycobacteriophage host range and ecology is through bioinformatic analysis of their genomic sequences. To this end, we examined the correlations between the codon usage biases of 199 different mycobacteriophages and those of several fully sequenced mycobacterial species in order to gain insight into the natural host range of these mycobacteriophages. We find that UPGMA clustering tends to match, but not consistently, clustering by shared nucleotide sequence identify. In addition, analysis of GC content, tRNA usage and correlations between mycobacteriophage and mycobacterial codon usage bias suggests that the preferred host of many clustered mycobacteriophages is not M. smegmatis but other, as yet unknown, members of the mycobacteria complex or closely allied bacterial species.
-
-
-
PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets
More LessObtaining meaningful viral information from large sequencing datasets presents unique challenges distinct from prokaryotic and eukaryotic sequencing efforts. The difficulties surrounding this issue can be ascribed in part to the genomic plasticity of viruses themselves as well as the scarcity of existing information in genomic databases. The open-source software PhagePhisher (http://www.putonti-lab.com/phagephisher) has been designed as a simple pipeline to extract relevant information from complex and mixed datasets, and will improve the examination of bacteriophages, viruses, and virally related sequences, in a range of environments. Key aspects of the software include speed and ease of use; PhagePhisher can be used with limited operator knowledge of bioinformatics on a standard workstation. As a proof-of-concept, PhagePhisher was successfully implemented with bacteria–virus mixed samples of varying complexity. Furthermore, viral signals within microbial metagenomic datasets were easily and quickly identified by PhagePhisher, including those from prophages as well as lysogenic phages, an important and often neglected aspect of examining phage populations in the environment. PhagePhisher resolves viral-related sequences which may be obscured by or imbedded in bacterial genomes.
-
-
-
Applying phylogenomics to understand the emergence of Shiga-toxin-producing Escherichia coli O157:H7 strains causing severe human disease in the UK
Shiga-toxin-producing Escherichia coli (STEC) O157:H7 is a recently emerged zoonotic pathogen with considerable morbidity. Since the emergence of this serotype in the 1980s, research has focussed on unravelling the evolutionary events from the E. coli O55:H7 ancestor to the contemporaneous globally dispersed strains observed today. In this study, the genomes of over 1000 isolates from both human clinical cases and cattle, spanning the history of STEC O157:H7 in the UK, were sequenced. Phylogenetic analysis revealed the ancestry, key acquisition events and global context of the strains. Dated phylogenies estimated the time to evolution of the most recent common ancestor of the current circulating global clone to be 175 years ago. This event was followed by rapid diversification. We show the acquisition of specific virulence determinates has occurred relatively recently and coincides with its recent detection in the human population. We used clinical outcome data from 493 cases of STEC O157:H7 to assess the relative risk of severe disease including haemolytic uraemic syndrome from each of the defined clades in the population and show the dramatic effect Shiga toxin repertoire has on virulence. We describe two strain replacement events that have occurred in the cattle population in the UK over the last 30 years, one resulting in a highly virulent strain that has accounted for the majority of clinical cases in the UK over the last decade. There is a need to understand the selection pressures maintaining Shiga-toxin-encoding bacteriophages in the ruminant reservoir and the study affirms the requirement for close surveillance of this pathogen in both ruminant and human populations.
-