Skip to content
1887

Abstract

The COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2 genomes from patient samples to track viral evolution and inform the public health response. Millions of SARS-CoV-2 genome sequences have been deposited in global public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN – VirusSeq), a consortium tasked with coordinating expanded sequencing of SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian VirusSeq Data Portal, with associated data pipelines and procedures, to support these efforts. The goal of VirusSeq was to allow open access to Canadian SARS-CoV-2 genomic sequences and enhanced, standardized contextual data that were unavailable in other repositories and that meet FAIR standards (Findable, Accessible, Interoperable and Reusable). In addition, the portal data submission pipeline contains data quality checking procedures and appropriate acknowledgement of data generators that encourages collaboration. From inception to execution, the portal was developed with a conscientious focus on strong data governance principles and practices. Extensive efforts ensured a commitment to Canadian privacy laws, data security standards, and organizational processes. This portal has been coupled with other resources, such as Viral AI, and was further leveraged by the Coronavirus Variants Rapid Response Network (CoVaRR-Net) to produce a suite of continually updated analytical tools and notebooks. Here we highlight this portal (https://virusseq-dataportal.ca/), including its contextual data not available elsewhere, and the Duotang (https://covarr-net.github.io/duotang/duotang.html), a web platform that presents key genomic epidemiology and modelling analyses on circulating and emerging SARS-CoV-2 variants in Canada. Duotang presents dynamic changes in variant composition of SARS-CoV-2 in Canada and by province, estimates variant growth, and displays complementary interactive visualizations, with a text overview of the current situation. The VirusSeq Data Portal and Duotang resources, alongside additional analyses and resources computed from the portal (COVID-MVP, CoVizu), are all open source and freely available. Together, they provide an updated picture of SARS-CoV-2 evolution to spur scientific discussions, inform public discourse, and support communication with and within public health authorities. They also serve as a framework for other jurisdictions interested in open, collaborative sequence data sharing and analyses.

Funding
This study was supported by the:
  • Public Health Agency of Canada (Award N/A)
    • Principal Award Recipient: CPHLNConsortium
  • CIHR (Award N/A)
    • Principal Award Recipient: MuhammadZohaib Anwar
  • Michael Smith Health Research BC (Award N/A)
    • Principal Award Recipient: MuhammadZohaib Anwar
  • Fonds de Recherche du Québec - Santé (Award N/A)
    • Principal Award Recipient: JulieG Hussin
  • Innovation, Science and Economic Development Canada (ISED) (Award N/A)
    • Principal Award Recipient: NotApplicable
  • Digital Research Alliance of Canada (Award N/A)
    • Principal Award Recipient: VirusSeqData Portal Academic and Health Network
  • Simon Fraser University (Research Computing Group) (Award N/A)
    • Principal Award Recipient: FionaS L Brinkman
  • Ontario Genomics (Award #OGI-201)
    • Principal Award Recipient: JaredT Simpson
  • Genome Canada (Award #OGI-136)
    • Principal Award Recipient: JaredT Simpson
  • Genome Alberta (Award ACRRP-5)
    • Principal Award Recipient: MatthewA Croxen
  • US National Institutes of Health (Award #U24CA253529)
    • Principal Award Recipient: NotApplicable
  • IVADO COVID19 Rapid Response (Award #CVD19-030)
    • Principal Award Recipient: NotApplicable
  • CIHR (Award #174924)
    • Principal Award Recipient: NotApplicable
  • Genome Canada (Award #286GET)
    • Principal Award Recipient: NotApplicable
  • CIHR (Award #ARR-175622)
    • Principal Award Recipient: NotApplicable
  • Genome Canada (Award #E09CMA)
    • Principal Award Recipient: CanCOGeNConsortium
  • This is an open-access article distributed under the terms of the Creative Commons Attribution License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.
Loading

Article metrics loading...

/content/journal/mgen/10.1099/mgen.0.001293
2024-10-14
2026-02-10

Metrics

Loading full text...

Full text loading...

/deliver/fulltext/mgen/10/10/mgen001293.html?itemId=/content/journal/mgen/10.1099/mgen.0.001293&mimeType=html&fmt=ahah

References

  1. Cameron R, Savić-Kallesøe S, Griffiths EJ, Dooley D, Sridhar A et al. SARS-CoV-2 genomic contextual data harmonization: recommendations from a mixed methods analysis of COVID-19 case report forms across Canada; 2022 https://www.researchsquare.com/article/rs-1871614/v1 accessed 9 April 2024
  2. Shu Y, McCauley J. GISAID: global initiative on sharing all influenza data – from vision to reality. Eurosurveillance 2017; 22:30494 [View Article]
    [Google Scholar]
  3. Khare S, Gurry C, Freitas L, Schultz MB, Bach G et al. GISAID’s role in pandemic response. China CDC Wkly 2021; 3:1049–1051 [View Article] [PubMed]
    [Google Scholar]
  4. Elbe S, Buckland‐Merrett G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Global Challenges 2017; 1:33–46 [View Article]
    [Google Scholar]
  5. Kalia K, Saberwal G, Sharma G. The lag in SARS-CoV-2 genome submissions to GISAID. Nat Biotechnol 2021; 39:1058–1060 [View Article] [PubMed]
    [Google Scholar]
  6. Enserink M, Cohen J. Control issues. Science 2023; 380:332–339 [View Article] [PubMed]
    [Google Scholar]
  7. Lenharo M. GISAID in crisis: can the controversial COVID genome database survive?. Nature 2023; 617:455–457 [View Article]
    [Google Scholar]
  8. Gill IS, Griffiths EJ, Dooley D, Cameron R, Savić Kallesøe S et al. The DataHarmonizer: a tool for faster data harmonization, validation, aggregation and analysis of pathogen genomics contextual information. Microb Genom 2023; 9:mgen000908 [View Article]
    [Google Scholar]
  9. Rahman N, O’Cathail C, Zyoud A, Sokolov A, Oude Munnink B et al. Mobilisation and analyses of publicly available SARS-CoV-2 data for pandemic responses. Microb Genom 2024; 10:001188 [View Article]
    [Google Scholar]
  10. Smith EA, Libuit KG, Kapsak CJ, Scribner MR, Wright SM et al. Pathogen genomics in public health laboratories: successes, challenges, and lessons learned from California’s SARS-CoV-2 Whole-Genome Sequencing Initiative, California COVIDNet. Microb Genom 2023; 9:mgen001027 [View Article]
    [Google Scholar]
  11. Griffiths EJ, Timme RE, Mendes CI, Page AJ, Alikhan N-F et al. Future-proofing and maximizing the utility of metadata: the PHA4GE SARS-CoV-2 contextual data specification package. GigaScience 2022; 11:giac003 [View Article]
    [Google Scholar]
  12. Virological. Monitoring the evolution and spread of Delta sublineages AY.25 and AY.27 in Canada - SARS-CoV-2 coronavirus / nCoV-2019 Genomic Epidemiology; 2021 https://virological.org/t/monitoring-the-evolution-and-spread-of-delta-sublineages-ay-25-and-ay-27-in-canada/767 accessed 9 April 2024
  13. Steiner MC, Novembre J. Population genetic models for the spatial spread of adaptive variants: a review in light of SARS-CoV-2 evolution. PLoS Genet 2022; 18:e1010391 [View Article] [PubMed]
    [Google Scholar]
  14. Overture - Home. n.d https://www.overture.bio/ accessed 9 April 2024
  15. Keycloak. n.d https://www.keycloak.org/ accessed 9 April 2024
  16. Viral AI, DNAstack. n.d https://viral.ai/collections/virusseq/overview accessed 9 April 2024
  17. O’Toole Á, Pybus OG, Abram ME, Kelly EJ, Rambaut A. Pango lineage designation and assignment using SARS-CoV-2 spike gene nucleotide sequences. BMC Genom 2022; 23:121 [View Article] [PubMed]
    [Google Scholar]
  18. Turakhia Y, Thornlow B, Hinrichs AS, De Maio N, Gozashti L et al. Ultrafast Sample placement on Existing tRees (UShER) enables real-time phylogenetics for the SARS-CoV-2 pandemic. Nat Genet 2021; 53:809–816 [View Article] [PubMed]
    [Google Scholar]
  19. Anwar MZ, Gill IS, Iseminger M, Sehar A, Igwacho KD et al. COVID-MVP: an interactive visualization for tracking SARS-cov-2 mutations, variants, and prevalence, enabled by curated functional annotations and portable genomics workflow. bioRxiv 2022 [View Article]
    [Google Scholar]
  20. Canada PHA of. Health Infobase - Health data in Canada; 2013 https://health-infobase.canada.ca/ accessed 9 April 2024
  21. Liyanage K, Samarakoon H, Parameswaran S, Gamaarachchi H. Efficient end-to-end long-read sequence mapping using minimap2-fpga integrated with hardware accelerated chaining. Sci Rep 2023; 13:20174 [View Article] [PubMed]
    [Google Scholar]
  22. Ferreira R-C, Wong E, Gugan G, Wade K, Liu M et al. CoVizu: rapid analysis and visualization of the global diversity of SARS-CoV-2 genomes. Virus Evol 2021; 7:veab092 [View Article] [PubMed]
    [Google Scholar]
  23. Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 2015; 32:268–274 [View Article] [PubMed]
    [Google Scholar]
  24. Cibiv/IQ-TREE. CIBIV - Center for Integrative Bioinformatics Vienna; 2024 https://github.com/Cibiv/IQ-TREE accessed 9 April 2024
  25. Sagulenko P, Puller V, Neher RA. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol 2018; 4:vex042 [View Article] [PubMed]
    [Google Scholar]
  26. Poon A. ArtPoon/ggfree; 2024 https://github.com/ArtPoon/ggfree accessed 9 April 2024
  27. rstudio/r2d3. RStudio; 2024 https://github.com/rstudio/r2d3 accessed 9 April 2024
  28. plotly/plotly.R. Plotly; 2024 https://github.com/plotly/plotly.R accessed 9 April 2024
  29. Neher RA. Contributions of adaptation and purifying selection to SARS-CoV-2 evolution. Virus Evol 2022; 8:veac113 [View Article] [PubMed]
    [Google Scholar]
  30. Chen C, Nadeau S, Yared M, Voinov P, Xie N et al. CoV-Spectrum: analysis of globally shared SARS-CoV-2 data to identify and characterize new variants. Bioinformatics 2022; 38:1735–1737 [View Article]
    [Google Scholar]
  31. Bolker [aut B, cre, Team RDC, Giné-Vázquez I. bbmle: Tools for General Maximum Likelihood Estimation; 2023 https://cran.r-project.org/web/packages/bbmle/index.html accessed 9 April 2024
  32. RandomFromHessianOrMCMC function - RDocumentation. n.d https://www.rdocumentation.org/packages/HelpersMG/versions/5.1/topics/RandomFromHessianOrMCMC accessed 9 April 2024
  33. Respiratory virus dashboard. n.d https://www.alberta.ca/covid-19-alberta-data.aspx accessed 9 April 2024
  34. Population and demography statistics. n.d https://www.statcan.gc.ca/en/subjects-start/population_and_demography accessed 9 April 2024
  35. Song L, Liu H, Brinkman FSL, Gill E, Griffiths EJ et al. Addressing privacy concerns in sharing viral sequences and minimum contextual data in a public repository during the COVID-19 pandemic. Front Genet 2021; 12:716541 [View Article] [PubMed]
    [Google Scholar]
  36. Savic Kallesoe SA, Rabbani T, Gill EE, Brinkman F, Griffiths EJ et al. Canadians’ opinions towards COVID-19 data-sharing: a national cross-sectional survey. BMJ Open 2023; 13:e066418 [View Article] [PubMed]
    [Google Scholar]
  37. Rabbani T, Kallesøe SAS, Gill EE, Brinkman FSL, Griffiths EJ et al. Opinions matter: public opinions on data-sharing and how they can shape pathogen genomic surveillance in Canada. Popul Med; n.d https://www.populationmedicine.eu/Opinions-matter-public-opinions-on-data-sharing-and-how-they-can-shape-pathogen-genomic,163811,0,2.html accessed 23 April 2024
  38. Christoffels A, Mboowa G, van Heusden P, Makhubela S, Githinji G et al. A pan-African pathogen genomics data sharing platform to support disease outbreaks. Nat Med 2023; 29:1052–1055 [View Article] [PubMed]
    [Google Scholar]
  39. Sofonea MT, Roquebert B, Foulongne V, Morquin D, Verdurme L et al. Analyzing and modeling the spread of SARS-CoV-2 omicron lineages BA.1 and BA.2, France, September 2021–February 2022 - Volume 28, Number 7—July 2022-- Emerging Infectious Diseases Journal - CDC. n.d https://wwwnc.cdc.gov/eid/article/28/7/22-0033_article accessed 8 August 2024
  40. Otto SP, Day T, Arino J, Colijn C, Dushoff J et al. The origins and potential future of SARS-CoV-2 variants of concern in the evolving COVID-19 pandemic. Curr Biol 2021; 31:R918–R929 [View Article]
    [Google Scholar]
  41. Kotwa JD, Lobb B, Massé A, Gagnier M, Aftanas P et al. Genomic and transcriptomic characterization of delta SARS-CoV-2 infection in free-ranging white-tailed deer (Odocoileus virginianus). iScience 2023; 26:108319 [View Article] [PubMed]
    [Google Scholar]
  42. Bashor L, Gagne RB, Bosco-Lauth AM, Bowen RA, Stenglein M et al. SARS-CoV-2 evolution in animals suggests mechanisms for rapid variant selection. Proc Natl Acad Sci U S A 2021; 118:e2105253118 [View Article] [PubMed]
    [Google Scholar]
  43. Cele S, Karim F, Lustig G, San JE, Hermanus T et al. SARS-CoV-2 prolonged infection during advanced HIV disease evolves extensive immune escape. Cell Host Microbe 2022; 30:154–162 [View Article] [PubMed]
    [Google Scholar]
  44. Universal Declaration of Human Rights - Wikisource, the free online library. n.d https://en.wikisource.org/wiki/Universal_Declaration_of_Human_Rights accessed 9 April 2024
  45. Murall CL, Fournier E, Galvez JH, N’Guessan A, Reiling SJ et al. A small number of early introductions seeded widespread transmission of SARS-CoV-2 in Québec, Canada. Genome Med 2021; 13:169 [View Article] [PubMed]
    [Google Scholar]
  46. Otto SP, MacPherson A, Colijn C. Endemic does not mean constant as SARS-CoV-2 continues to evolve. Evolution 2024; 78:1092–1108 [View Article]
    [Google Scholar]
  47. Baker SJC, Nfonsam LE, Leto D, Rutherford C, Smieja M et al. Chronic COVID-19 infection in an immunosuppressed patient shows changes in lineage over time: a case report. Virol J 2024; 21:8 [View Article] [PubMed]
    [Google Scholar]
  48. Isabel S, Abdulnoor M, Boissinot K, Isabel MR, de Borja R et al. Emergence of a mutation in the nucleocapsid gene of SARS-CoV-2 interferes with PCR detection in Canada. Sci Rep 2022; 12:10867 [View Article] [PubMed]
    [Google Scholar]
  49. The European Respiratory Virus Surveillance Summary (ERVISS). n.d https://www.ecdc.europa.eu/en/publications-data/european-respiratory-virus-surveillance-summary-erviss accessed 8 August 2024
  50. Colijn C, Earn DJ, Dushoff J, Ogden NH, Li M et al. The need for linked genomic surveillance of SARS-CoV-2. Can Commun Dis Rep 2022; 48:131–139 [View Article] [PubMed]
    [Google Scholar]
/content/journal/mgen/10.1099/mgen.0.001293
Loading
/content/journal/mgen/10.1099/mgen.0.001293
Loading

Data & Media loading...

Supplements

Supplementary material 1

PDF
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An error occurred
Approval was partially successful, following selected items could not be processed due to error