Implementation of Genomics in Clinical and Public Health Microbiology
Microbial sequencing is rapidly being incorporated into public health and clinical microbiology, informed by over two decades of microbial genomics research and methods development. However, the integration of genomics into public health and clinical services and workflows is complex and requires consideration of a wide range of issues including workflow management, data analysis and reporting, and evaluation of how microbial genomic data can and should be used to support decision-making. This collection puts the spotlight on studies supporting microbial genomics implementation – aiming to understand what, why, and how microbial genomics works in “real world” clinical and health laboratory settings, and testing approaches to improving the integration of genomics into routine microbiological diagnostics, surveillance and outbreak investigation.
Image: Samples in a laboratory. Credit: Adrian Wressell, Heart of England NHS FT. Attribution 4.0 International (CC BY 4.0)
Collection Contents
1 - 20 of 24 results
-
-
Enhancing capacity for national genomics surveillance of antimicrobial resistance in public health laboratories in Kenya
Genomic surveillance is vital for detecting outbreaks and understanding the epidemiology and transmission of bacterial strains, yet it is not integrated into many national antimicrobial resistance (AMR) surveillance programmes. Key factors are that few scientists in the public health sector are trained in bacterial genomics, and the diverse sequencing platforms and bioinformatic tools make it challenging to generate reproducible outputs. In Kenya, these gaps were addressed by training public health scientists to conduct genomic surveillance on isolates from the national AMR surveillance repository and produce harmonized reports. The 2-week training combined theory and laboratory and bioinformatic experiences with Klebsiella pneumoniae isolates from the surveillance repository. Whole-genome sequences generated on Illumina and Nanopore sequencers were analysed using publicly available bioinformatic tools, and a harmonized report was generated using the HAMRonization tool. Pre- and post-training tests and self-assessments were used to assess the effectiveness of the training. Thirteen scientists were trained and generated data on the K. pneumoniae isolates, summarizing the AMR genes present consistently with the reported phenotypes and identifying the plasmid replicons that could transmit antibiotic resistance. Ninety per cent of the participants demonstrated an overall improvement in their post-training test scores, with an average increase of 14 %. Critical challenges were experienced in delayed delivery of equipment and supplies, power fluctuations and internet connections that were inadequate for bioinformatic analysis. Despite this, the training built the knowledge and skills to implement bacterial genomic surveillance. More advanced and immersive training experiences and building supporting infrastructure would solidify these gains to produce tangible public health outcomes.
-
-
-
Public health implementation of pathogen genomics: the role for accreditation and application of ISO standards
More LessPathogen genomics has transitioned rapidly from the research setting into a powerful tool now routinely used in public health microbiology, for surveillance, outbreak investigations and disease control. As these investigations can have significant public health, treatment and legal impacts, we must ensure the accuracy of these results through validation of testing processes. For laboratories working in this space, it is important to approach this work with a quality and accreditation framework in mind, working towards implementation of quality systems and test validation that meet international regulatory standards. Here we outline the key international standards and processes that lead toward accreditation for pathogen genomics.
-
-
-
Results of the 2020 Genomic Proficiency Test for the network of European Union Reference Laboratory for Antimicrobial Resistance assessing whole-genome-sequencing capacities
The global surveillance and outbreak investigation of antimicrobial resistance (AMR) is amidst a paradigm shift from traditional biology to bioinformatics. This is due to developments in whole-genome-sequencing (WGS) technologies, bioinformatics tools, and reduced costs. The increased use of WGS is accompanied by challenges such as standardization, quality control (QC), and data sharing. Thus, there is global need for inter-laboratory WGS proficiency test (PT) schemes to evaluate laboratories' capacity to produce reliable genomic data. Here, we present the results of the first iteration of the Genomic PT (GPT) organized by the Global Capacity Building Group at the Technical University of Denmark in 2020. Participating laboratories sequenced two isolates and corresponding DNA of Salmonella enterica , Escherichia coli and Campylobacter coli , using WGS methodologies routinely employed at their laboratories. The participants' ability to obtain consistently good-quality WGS data was assessed based on several QC WGS metrics. A total of 21 laboratories from 21 European countries submitted WGS and meta-data. Most delivered high-quality sequence data with only two laboratories identified as overall underperforming. The QC metrics, N50 and number of contigs, were identified as good indicators for high-sequencing quality. We propose QC thresholds for N50 greater than 20 000 and 25 000 for Campylobacter coli and Escherichia coli, respectively, and number of contigs >200 bp greater than 225, 265 and 100 for Salmonella enterica , Escherichia coli and Campylobacter coli , respectively. The GPT2020 results confirm the importance of systematic QC procedures, ensuring the submission of reliable WGS data for surveillance and outbreak investigation to meet the requirements of the paradigm shift in methodology.
-
-
-
Accelerating bioinformatics implementation in public health
We have adopted an open bioinformatics ecosystem to address the challenges of bioinformatics implementation in public health laboratories (PHLs). Bioinformatics implementation for public health requires practitioners to undertake standardized bioinformatic analyses and generate reproducible, validated and auditable results. It is essential that data storage and analysis are scalable, portable and secure, and that implementation of bioinformatics fits within the operational constraints of the laboratory. We address these requirements using Terra, a web-based data analysis platform with a graphical user interface connecting users to bioinformatics analyses without the use of code. We have developed bioinformatics workflows for use with Terra that specifically meet the needs of public health practitioners. These Theiagen workflows perform genome assembly, quality control, and characterization, as well as construction of phylogeny for insights into genomic epidemiology. Additonally, these workflows use open-source containerized software and the WDL workflow language to ensure standardization and interoperability with other bioinformatics solutions, whilst being adaptable by the user. They are all open source and publicly available in Dockstore with the version-controlled code available in public GitHub repositories. They have been written to generate outputs in standardized file formats to allow for further downstream analysis and visualization with separate genomic epidemiology software. Testament to this solution meeting the requirements for bioinformatic implementation in public health, Theiagen workflows have collectively been used for over 5 million sample analyses in the last 2 years by over 90 public health laboratories in at least 40 different countries. Continued adoption of technological innovations and development of further workflows will ensure that this ecosystem continues to benefit PHLs.
-
-
-
European Union Reference Laboratories support the National food, feed and veterinary Reference Laboratories with rolling out whole genome sequencing in Europe
The Inter European Union Reference Laboratories (EURLs) Working Group on Next Generation Sequencing (NGS) involves eight EURLs for microbiological food and feed hazards and has been working since 2017 to promote the adoption of NGS by the National Reference Laboratories (NRLs) in the European Union. This work illustrates the results of the first 5 years of activity. By working together, the EURLs involved have released guidance documents for assisting NRLs in all the steps of NGS, helping the transition from classical molecular methods towards whole genome sequencing while ensuring harmonization, with the final aim of improving preparedness in the use of NGS to characterize microbial hazards and trace the sources of infection.
-
-
-
Rapid metagenomic sequencing for diagnosis and antimicrobial sensitivity prediction of canine bacterial infections
Antimicrobial resistance is a major threat to human and animal health. There is an urgent need to ensure that antimicrobials are used appropriately to limit the emergence and impact of resistance. In the human and veterinary healthcare setting, traditional culture and antimicrobial sensitivity testing typically requires 48–72 h to identify appropriate antibiotics for treatment. In the meantime, broad-spectrum antimicrobials are often used, which may be ineffective or impact non-target commensal bacteria. Here, we present a rapid, culture-free, diagnostics pipeline, involving metagenomic nanopore sequencing directly from clinical urine and skin samples of dogs. We have planned this pipeline to be versatile and easily implementable in a clinical setting, with the potential for future adaptation to different sample types and animals. Using our approach, we can identify the bacterial pathogen present within 5 h, in some cases detecting species which are difficult to culture. For urine samples, we can predict antibiotic sensitivity with up to 95 % accuracy. Skin swabs usually have lower bacterial abundance and higher host DNA, confounding antibiotic sensitivity prediction; an additional host depletion step will likely be required during the processing of these, and other types of samples with high levels of host cell contamination. In summary, our pipeline represents an important step towards the design of individually tailored veterinary treatment plans on the same day as presentation, facilitating the effective use of antibiotics and promoting better antimicrobial stewardship.
-
-
-
Pathogen genomics in public health laboratories: successes, challenges, and lessons learned from California’s SARS-CoV-2 Whole-Genome Sequencing Initiative, California COVIDNet
The capacity for pathogen genomics in public health expanded rapidly during the coronavirus disease 2019 (COVID-19) pandemic, but many public health laboratories did not have the infrastructure in place to handle the vast amount of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequence data generated. The California Department of Public Health, in partnership with Theiagen Genomics, was an early adopter of cloud-based resources for bioinformatics and genomic epidemiology, resulting in the creation of a SARS-CoV-2 genomic surveillance system that combined the efforts of more than 40 sequencing laboratories across government, academia and industry to form California COVIDNet, California’s SARS-CoV-2 Whole-Genome Sequencing Initiative. Open-source bioinformatics workflows, ongoing training sessions for the public health workforce, and automated data transfer to visualization tools all contributed to the success of California COVIDNet. While challenges remain for public health genomic surveillance worldwide, California COVIDNet serves as a framework for a scaled and successful bioinformatics infrastructure that has expanded beyond SARS-CoV-2 to other pathogens of public health importance,
-
-
-
A systematic review of economic evaluations of whole-genome sequencing for the surveillance of bacterial pathogens
Whole-genome sequencing (WGS) has unparalleled ability to distinguish between bacteria, with many public health applications. The generation and analysis of WGS data require significant financial investment. We describe a systematic review summarizing economic analyses of genomic surveillance of bacterial pathogens, reviewing the evidence for economic viability. The protocol was registered on PROSPERO (CRD42021289030). Six databases were searched on 8 November 2021 using terms related to ‘WGS’, ‘population surveillance’ and ‘economic analysis’. Quality was assessed with the Drummond–Jefferson checklist. Following data extraction, a narrative synthesis approach was taken. Six hundred and eighty-one articles were identified, of which 49 proceeded to full-text screening, with 9 selected for inclusion. All had been published since 2019. Heterogeneity was high. Five studies assessed WGS for hospital surveillance and four analysed foodborne pathogens. Four were cost–benefit analyses, one was a cost–utility analysis, one was a cost-effectiveness analysis, one was a combined cost-effectiveness and cost–utility analysis, one combined cost-effectiveness and cost–benefit analyses and one was a partial analysis. All studies supported the use of WGS as a surveillance tool on economic grounds. The available evidence supports the use of WGS for pathogen surveillance but is limited by marked heterogeneity. Further work should include analysis relevant to low- and middle-income countries and should use real-world effectiveness data.
-
-
-
How public health authorities can use pathogen genomics in health protection practice: a consensus-building Delphi study conducted in the United Kingdom
Pathogen sequencing guided understanding of SARS-CoV-2 evolution during the COVID-19 pandemic. Many health systems developed pathogen genomics services to monitor SARS-CoV-2. There are no agreed guidelines about how pathogen genomic information should be used in public health practice. We undertook a modified Delphi study in three rounds to develop expert consensus statements about how genomic information should be used. Our aim was to inform health protection policy, planning and practice. Participants were from organisations that produced or used pathogen genomics information in the United Kingdom. The first round posed questions derived from a rapid literature review. Responses informed statements for the subsequent rounds. Consensus was accepted when 70 % or more of the responses were strongly agree/agree, or 70 % were disagree/strongly disagree on the five-point Likert scale. Consensus was achieved in 26 (96 %) of 27 statements. We grouped the statements into six categories: monitoring the emergence of new variants; understanding the epidemiological context of genomic data; using genomic data in outbreak risk assessment and risk management; prioritising the use of limited sequencing capacity; sequencing service performance; and sequencing service capability. The expert consensus statements will help guide public health authorities and policymakers to integrate pathogen genomics in health protection practice.
-
-
-
Multiplex MinION sequencing suggests enteric adenovirus F41 genetic diversity comparable to pre-COVID-19 era
Human adenovirus F41 causes acute gastroenteritis in children, and has recently been associated with an apparent increase in paediatric hepatitis of unknown aetiology in the UK, with further cases reported in multiple countries. Relatively little is known about the genetic diversity of adenovirus F41 in UK children; and it is unclear what, if any, impact the COVID-19 pandemic has had on viral diversity in the UK. Methods that allow F41 to be sequenced from clinical samples without the need for viral culture are required to provide the genomic data to address these questions. Therefore, we evaluated an overlapping-amplicon method of sequencing adenovirus genomes from clinical samples using Oxford Nanopore technology. We applied this method to a small sample of adenovirus-species-F-positive extracts collected as part of standard care in the East of England region in January–May 2022. This method produced genomes with >75 % coverage in 13/22 samples and >50 % coverage in 19/22 samples. We identified two F41 lineages present in paediatric patients in the East of England in 2022. Where F41 genomes from paediatric hepatitis cases were available (n=2), these genomes fell within the diversity of F41 from the UK and continental Europe sequenced before and after the 2020–2021 phase of the COVID-19 pandemic. Our analyses suggest that overlapping amplicon sequencing is an appropriate method for generating F41 genomic data from high-virus-load clinical samples, and currently circulating F41 viral lineages were present in the UK and Europe before the COVID-19 pandemic.
-
-
-
Multi-laboratory evaluation of the Illumina iSeq platform for whole genome sequencing of Salmonella, Escherichia coli and Listeria
Patrick K. Mitchell, Leyi Wang, Bryce J. Stanhope, Brittany D. Cronk, Renee Anderson, Shipra Mohan, Lijuan Zhou, Susan Sanchez, Paula Bartlett, Carol Maddox, Vanessa DeShambo, Rinosh Mani, Lindsy M. Hengesbach, Sarah Gresch, Katie Wright, Sunil Mor, Shuping Zhang, Zhenyu Shen, Lifang Yan, Rebecca Mackey, Rebecca Franklin-Guild, Yan Zhang, Melanie Prarat, Katherine Shiplett, Akhilesh Ramachandran, Sai Narayanan, Justin Sanders, Andree A. Hunkapiller, Kevin Lahmers, Amanda A. Carbonello, Nicole Aulik, Ailam Lim, Jennifer Cooper, Angelica Jones, Jake Guag, Sarah M. Nemser, Gregory H. Tyson, Ruth Timme, Errol Strain, Renate Reimschuessel, Olgica Ceric and Laura B. GoodmanThere is a growing need for public health and veterinary laboratories to perform whole genome sequencing (WGS) for monitoring antimicrobial resistance (AMR) and protecting the safety of people and animals. With the availability of smaller and more affordable sequencing platforms coupled with well-defined bioinformatic protocols, the technological capability to incorporate this technique for real-time surveillance and genomic epidemiology has greatly expanded. There is a need, however, to ensure that data are of high quality. The goal of this study was to assess the utility of a small benchtop sequencing platform using a multi-laboratory verification approach. Thirteen laboratories were provided the same equipment, reagents, protocols and bacterial reference strains. The Illumina DNA Prep and Nextera XT library preparation kits were compared, and 2×150 bp iSeq i100 chemistry was used for sequencing. Analyses comparing the sequences produced from this study with closed genomes from the provided strains were performed using open-source programs. A detailed, step-by-step protocol is publicly available via protocols.io (https://www.protocols.io/view/iseq-bacterial-wgs-protocol-bij8kcrw). The throughput for this method is approximately 4–6 bacterial isolates per sequencing run (20–26 Mb total load). The Illumina DNA Prep library preparation kit produced high-quality assemblies and nearly complete AMR gene annotations. The Prep method produced more consistent coverage compared to XT, and when coverage benchmarks were met, nearly all AMR, virulence and subtyping gene targets were correctly identified. Because it reduces the technical and financial barriers to generating WGS data, the iSeq platform is a viable option for small laboratories interested in genomic surveillance of microbial pathogens.
-
-
-
Evaluation of WGS performance for bacterial pathogen characterization with the Illumina technology optimized for time-critical situations
Whole genome sequencing (WGS) has become the reference standard for bacterial outbreak investigation and pathogen typing, providing a resolution unattainable with conventional molecular methods. Data generated with Illumina sequencers can however only be analysed after the sequencing run has finished, thereby losing valuable time during emergency situations. We evaluated both the effect of decreasing overall run time, and also a protocol to transfer and convert intermediary files generated by Illumina sequencers enabling real-time data analysis for multiple samples part of the same ongoing sequencing run, as soon as the forward reads have been sequenced. To facilitate implementation for laboratories operating under strict quality systems, extensive validation of several bioinformatics assays (16S rRNA species confirmation, gene detection against virulence factor and antimicrobial resistance databases, SNP-based antimicrobial resistance detection, serotype determination, and core genome multilocus sequence typing) for three bacterial pathogens ( Mycobacterium tuberculosis , Neisseria meningitidis , and Shiga-toxin producing Escherichia coli ) was performed by evaluating performance in function of the two most critical sequencing parameters, i.e. read length and coverage. For the majority of evaluated bioinformatics assays, actionable results could be obtained between 14 and 22 h of sequencing, decreasing the overall sequencing-to-results time by more than half. This study aids in reducing the turn-around time of WGS analysis by facilitating a faster response in time-critical scenarios and provides recommendations for time-optimized WGS with respect to required read length and coverage to achieve a minimum level of performance for the considered bioinformatics assay(s), which can also be used to maximize the cost-effectiveness of routine surveillance sequencing when response time is not essential.
-
-
-
Whole-genome-based phylogenomic analysis of the Belgian 2016–2017 influenza A(H3N2) outbreak season allows improved surveillance
Seasonal influenza epidemics are associated with high mortality and morbidity in the human population. Influenza surveillance is critical for providing information to national influenza programmes and for making vaccine composition predictions. Vaccination prevents viral infections, but rapid influenza evolution results in emerging mutants that differ antigenically from vaccine strains. Current influenza surveillance relies on Sanger sequencing of the haemagglutinin (HA) gene. Its classification according to World Health Organization (WHO) and European Centre for Disease Prevention and Control (ECDC) guidelines is based on combining certain genotypic amino acid mutations and phylogenetic analysis. Next-generation sequencing technologies enable a shift to whole-genome sequencing (WGS) for influenza surveillance, but this requires laboratory workflow adaptations and advanced bioinformatics workflows. In this study, 253 influenza A(H3N2) positive clinical specimens from the 2016–2017 Belgian season underwent WGS using the Illumina MiSeq system. HA-based classification according to WHO/ECDC guidelines did not allow classification of all samples. A new approach, considering the whole genome, was investigated based on using powerful phylogenomic tools including beast and Nextstrain, which substantially improved phylogenetic classification. Moreover, Bayesian inference via beast facilitated reassortment detection by both manual inspection and computational methods, detecting intra-subtype reassortants at an estimated rate of 15 %. Real-time analysis (i.e. as an outbreak is ongoing) via Nextstrain allowed positioning of the Belgian isolates into the globally circulating context. Finally, integration of patient data with phylogenetic groups and reassortment status allowed detection of several associations that would have been missed when solely considering HA, such as hospitalized patients being more likely to be infected with A(H3N2) reassortants, and the possibility to link several phylogenetic groups to disease severity indicators could be relevant for epidemiological monitoring. Our study demonstrates that WGS offers multiple advantages for influenza monitoring in (inter)national influenza surveillance, and proposes an improved methodology. This allows leveraging all information contained in influenza genomes, and allows for more accurate genetic characterization and reassortment detection.
-
-
-
Centre-specific bacterial pathogen typing affects infection-control decision making
Whole-genome sequencing is becoming the de facto standard for bacterial outbreak surveillance and infection prevention. This is accompanied by a variety of bioinformatic tools and needs bioinformatics expertise for implementation. However, little is known about the concordance of reported outbreaks when using different bioinformatic workflows. In this multi-centre proficiency testing among 13 major Dutch healthcare-affiliated centres, bacterial whole-genome outbreak analysis was assessed. Centres who participated obtained two randomized bacterial datasets of Illumina sequences, a Klebsiella pneumoniae and a Vancomycin-resistant Enterococcus faecium, and were asked to apply their bioinformatic workflows. Centres reported back on antimicrobial resistance, multi-locus sequence typing (MLST), and outbreak clusters. The reported clusters were analysed using a method to compare landscapes of phylogenetic trees and calculating Kendall–Colijn distances. Furthermore, fasta files were analysed by state-of-the-art single nucleotide polymorphism (SNP) analysis to mitigate the differences introduced by each centre and determine standardized SNP cut-offs. Thirteen centres participated in this study. The reported outbreak clusters revealed discrepancies between centres, even when almost identical bioinformatic workflows were used. Due to stringent filtering, some centres failed to detect extended-spectrum beta-lactamase genes and MLST loci. Applying a standardized method to determine outbreak clusters on the reported de novo assemblies, did not result in uniformity of outbreak-cluster composition among centres.
-
-
-
Phylogenomics and antimicrobial resistance of Salmonella Typhi and Paratyphi A, B and C in England, 2016–2019
The emergence of antimicrobial resistance (AMR) to first- and second-line treatment regimens of enteric fever is a global public-health problem, and routine genomic surveillance to inform clinical and public-health management guidance is essential. Here, we present the prospective analysis of genomic data to monitor trends in incidence, AMR and travel, and assess hierarchical clustering (HierCC) methodology of 1742 isolates of typhoidal salmonellae. Trend analysis of Salmonella Typhi and S. Paratyphi A cases per year increased 48 and 17.3%, respectively, between 2016 and 2019 in England, mainly associated with travel to South Asia. S. Paratyphi B cases have remained stable and are mainly associated with travel to the Middle East and South America. There has been an increase in the number of S. Typhi exhibiting a multidrug-resistant (MDR) profile and the emergence of extensively drug resistant (XDR) profiles. HierCC was a robust method to categorize clonal groups into clades and clusters associated with travel and AMR profiles. The majority of cases that had XDR S. Typhi reported recent travel to Pakistan (94 %) and belonged to a subpopulation of the 4.3.1 (H58) clone (HC5_1452). The phenotypic and genotypic AMR results showed high concordance for S. Typhi and S. Paratyphi A, B and C, with 99.99 % concordance and only three (0.01 %) discordant results out of a possible 23 178 isolate/antibiotic combinations. Genomic surveillance of enteric fever has shown the recent emergence and increase of MDR and XDR S. Typhi strains, resulting in a review of clinical guidelines to improve management of imported infections.
-
-
-
Harmonization of whole-genome sequencing for outbreak surveillance of Enterobacteriaceae and Enterococci
Whole-genome sequencing (WGS) is becoming the de facto standard for bacterial typing and outbreak surveillance of resistant bacterial pathogens. However, interoperability for WGS of bacterial outbreaks is poorly understood. We hypothesized that harmonization of WGS for outbreak surveillance is achievable through the use of identical protocols for both data generation and data analysis. A set of 30 bacterial isolates, comprising of various species belonging to the Enterobacteriaceae family and Enterococcus genera, were selected and sequenced using the same protocol on the Illumina MiSeq platform in each individual centre. All generated sequencing data were analysed by one centre using BioNumerics (6.7.3) for (i) genotyping origin of replications and antimicrobial resistance genes, (ii) core-genome multi-locus sequence typing (cgMLST) for Escherichia coli and Klebsiella pneumoniae and whole-genome multi-locus sequencing typing (wgMLST) for all species. Additionally, a split k-mer analysis was performed to determine the number of SNPs between samples. A precision of 99.0% and an accuracy of 99.2% was achieved for genotyping. Based on cgMLST, a discrepant allele was called only in 2/27 and 3/15 comparisons between two genomes, for E. coli and K. pneumoniae, respectively. Based on wgMLST, the number of discrepant alleles ranged from 0 to 7 (average 1.6). For SNPs, this ranged from 0 to 11 SNPs (average 3.4). Furthermore, we demonstrate that using different de novo assemblers to analyse the same dataset introduces up to 150 SNPs, which surpasses most thresholds for bacterial outbreaks. This shows the importance of harmonization of data-processing surveillance of bacterial outbreaks. In summary, multi-centre WGS for bacterial surveillance is achievable, but only if protocols are harmonized.
-
-
-
Application of a strain-level shotgun metagenomics approach on food samples: resolution of the source of a Salmonella food-borne outbreak
Food-borne outbreak investigation currently relies on the time-consuming and challenging bacterial isolation from food, to be able to link food-derived strains to more easily obtained isolates from infected people. When no food isolate can be obtained, the source of the outbreak cannot be unambiguously determined. Shotgun metagenomics approaches applied to the food samples could circumvent this need for isolation from the suspected source, but require downstream strain-level data analysis to be able to accurately link to the human isolate. Until now, this approach has not yet been applied outside research settings to analyse real food-borne outbreak samples. In September 2019, a Salmonella outbreak occurred in a hotel school in Bruges, Belgium, affecting over 200 students and teachers. Following standard procedures, the Belgian National Reference Center for human salmonellosis and the National Reference Laboratory for Salmonella in food and feed used conventional analysis based on isolation, serotyping and MLVA (multilocus variable number tandem repeat analysis) comparison, followed by whole-genome sequencing, to confirm the source of the contamination over 2 weeks after receipt of the sample, which was freshly prepared tartar sauce in a meal cooked at the school. Our team used this outbreak as a case study to deliver a proof of concept for a short-read strain-level shotgun metagenomics approach for source tracking. We received two suspect food samples: the full meal and some freshly made tartar sauce served with this meal, requiring the use of raw eggs. After analysis, we could prove, without isolation, that Salmonella was present in both samples, and we obtained an inferred genome of a Salmonella enterica subsp. enterica serovar Enteritidis that could be linked back to the human isolates of the outbreak in a phylogenetic tree. These metagenomics-derived outbreak strains were separated from sporadic cases as well as from another outbreak circulating in Europe at the same time period. This is, to our knowledge, the first Salmonella food-borne outbreak investigation uniquely linking the food source using a metagenomics approach and this in a fast time frame.
-
-
-
Rapid nanopore-based DNA sequencing protocol of antibiotic-resistant bacteria for use in surveillance and outbreak investigation
Outbreak investigations are essential to control and prevent the dissemination of pathogens. This study developed and validated a complete analysis protocol for faster and more accurate surveillance and outbreak investigations of antibiotic-resistant microbes based on Oxford Nanopore Technologies (ONT) DNA whole-genome sequencing. The protocol was developed using 42 methicillin-resistant Staphylococcus aureus (MRSA) isolates identified from former well-characterized outbreaks. The validation of the protocol was performed using Illumina technology (MiSeq, Illumina). Additionally, a real-time outbreak investigation of six clinical S. aureus isolates was conducted to test the ONT-based protocol. The suggested protocol includes: (1) a 20 h sequencing run; (2) identification of the sequence type (ST); (3) de novo genome assembly; (4) polishing of the draft genomes; and (5) phylogenetic analysis based on SNPs. After the sequencing run, it was possible to identify the ST in 2 h (20 min per isolate). Assemblies were achieved after 4 h (40 min per isolate) while the polishing was carried out in 7 min per isolate (42 min in total). The phylogenetic analysis took 0.6 h to confirm an outbreak. Overall, the developed protocol was able to at least discard an outbreak in 27 h (mean) after the bacterial identification and less than 33 h to confirm it. All these estimated times were calculated considering the average time for six MRSA isolates per sequencing run. During the real-time S. aureus outbreak investigation, the protocol was able to identify two outbreaks in less than 31 h. The suggested protocol enables identification of outbreaks in early stages using a portable and low-cost device along with a streamlined downstream analysis, therefore having the potential to be incorporated in routine surveillance analysis workflows. In addition, further analysis may include identification of virulence and antibiotic resistance genes for improved pathogen characterization.
-
-
-
Large-scale sequencing of SARS-CoV-2 genomes from one region allows detailed epidemiology and enables local outbreak management
Andrew J. Page, Alison E. Mather, Thanh Le-Viet, Emma J. Meader, Nabil-Fareed Alikhan, Gemma L. Kay, Leonardo de Oliveira Martins, Alp Aydin, David J. Baker, Alexander J. Trotter, Steven Rudder, Ana P. Tedim, Anastasia Kolyva, Rachael Stanley, Muhammad Yasir, Maria Diaz, Will Potter, Claire Stuart, Lizzie Meadows, Andrew Bell, Ana Victoria Gutierrez, Nicholas M. Thomson, Evelien M. Adriaenssens, Tracey Swingler, Rachel A. J. Gilroy, Luke Griffith, Dheeraj K. Sethi, Dinesh Aggarwal, Colin S. Brown, Rose K. Davidson, Robert A. Kingsley, Luke Bedford, Lindsay J. Coupland, Ian G. Charles, Ngozi Elumogo, John Wain, Reenesh Prakash, Mark A. Webber, S. J. Louise Smith, Meera Chand, Samir Dervisevic, Justin O’Grady and The COVID-19 Genomics UK (COG-UK) ConsortiumThe COVID-19 pandemic has spread rapidly throughout the world. In the UK, the initial peak was in April 2020; in the county of Norfolk (UK) and surrounding areas, which has a stable, low-density population, over 3200 cases were reported between March and August 2020. As part of the activities of the national COVID-19 Genomics Consortium (COG-UK) we undertook whole genome sequencing of the SARS-CoV-2 genomes present in positive clinical samples from the Norfolk region. These samples were collected by four major hospitals, multiple minor hospitals, care facilities and community organizations within Norfolk and surrounding areas. We combined clinical metadata with the sequencing data from regional SARS-CoV-2 genomes to understand the origins, genetic variation, transmission and expansion (spread) of the virus within the region and provide context nationally. Data were fed back into the national effort for pandemic management, whilst simultaneously being used to assist local outbreak analyses. Overall, 1565 positive samples (172 per 100 000 population) from 1376 cases were evaluated; for 140 cases between two and six samples were available providing longitudinal data. This represented 42.6 % of all positive samples identified by hospital testing in the region and encompassed those with clinical need, and health and care workers and their families. In total, 1035 cases had genome sequences of sufficient quality to provide phylogenetic lineages. These genomes belonged to 26 distinct global lineages, indicating that there were multiple separate introductions into the region. Furthermore, 100 genetically distinct UK lineages were detected demonstrating local evolution, at a rate of ~2 SNPs per month, and multiple co-occurring lineages as the pandemic progressed. Our analysis: identified a discrete sublineage associated with six care facilities; found no evidence of reinfection in longitudinal samples; ruled out a nosocomial outbreak; identified 16 lineages in key workers which were not in patients, indicating infection control measures were effective; and found the D614G spike protein mutation which is linked to increased transmissibility dominates the samples and rapidly confirmed relatedness of cases in an outbreak at a food processing facility. The large-scale genome sequencing of SARS-CoV-2-positive samples has provided valuable additional data for public health epidemiology in the Norfolk region, and will continue to help identify and untangle hidden transmission chains as the pandemic evolves.
-