A comprehensive profile of genomic variations in the SARS-CoV-2 isolates from the state of Telangana, India Open Access

Abstract

The novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) causing COVID-19 has rapidly turned into a pandemic, infecting millions and causing 1 157 509 (as of 27 October 2020) deaths across the globe. In addition to studying the mode of transmission and evasion of host immune system, analysing the viral mutational landscape constitutes an area under active research. The latter is expected to impart knowledge on the emergence of different clades, subclades, viral protein functions and protein–protein and protein–RNA interactions during replication/transcription cycle of virus and response to host immune checkpoints. In this study, we have attempted to bring forth the viral genomic variants defining the major clade(s) as identified from samples collected from the state of Telangana, India. We further report a comprehensive draft of all genomic variations (including unique mutations) present in SARS-CoV-2 strain in the state of Telangana. Our results reveal the presence of two mutually exclusive subgroups defined by specific variants within the dominant clade present in the population. This work attempts to bridge the critical gap regarding the genomic landscape and associate mutations in SARS-CoV-2 from a highly infected southern region of India, which was lacking to date.

Funding
This study was supported by the:
  • Department of Science and Technology, Ministry of Science and Technology (Award SB/S2/RJN-071/2018)
    • Principle Award Recipient: RadhakrishnanSabarinathan
  • Science and Engineering Research Board (Award NPDF, PDF/2019/002427)
    • Principle Award Recipient: AsmitaGupta
  • Department of Biotechnology, Ministry of Science and Technology (Award BT/INF/22/SP28169/2019,07/03/2019)
    • Principle Award Recipient: AshwinDalal
Loading

Article metrics loading...

/content/journal/jgv/10.1099/jgv.0.001562
2021-02-15
2024-03-28
Loading full text...

Full text loading...

/deliver/fulltext/jgv/102/3/vir001562.html?itemId=/content/journal/jgv/10.1099/jgv.0.001562&mimeType=html&fmt=ahah

References

  1. Wang H, Li X, Li T, Zhang S, Wang L et al. The genetic sequence, origin, and diagnosis of SARS-CoV-2. Eur J Clin Microbiol Infect Dis 2020
    [Google Scholar]
  2. Oostra M, Hagemeijer MC, van Gent M, Bekker CPJ, te Lintelo EG et al. Topology and membrane anchoring of the coronavirus replication complex: not all hydrophobic domains of NSP3 and nsp6 are membrane spanning. J Virol 2008; 82:12392–12405 [View Article]
    [Google Scholar]
  3. Perlman S, Netland J. Coronaviruses post-SARS: update on replication and pathogenesis. Nat Rev Microbiol 2009; 7:439–450 [View Article]
    [Google Scholar]
  4. Kim D, Lee J-Y, Yang J-S, Kim JW, Kim VN et al. The architecture of SARS-CoV-2 transcriptome. Cell 2020; 181:914–921 [View Article]
    [Google Scholar]
  5. Brian DA, Baric RS. Coronavirus genome structure and replication. Curr Top Microbiol Immunol 2005; 287:1–30
    [Google Scholar]
  6. Petrosillo N, Viceconte G, Ergonul O, Ippolito G, Petersen E. COVID-19, SARS and MERS: are they closely related?. Clin Microbiol Infect 2020; 26:729–734 [View Article][PubMed]
    [Google Scholar]
  7. Kim J-M, Chung Y-S, Jo HJ, Lee N-J, Kim MS et al. Identification of coronavirus isolated from a patient in Korea with COVID-19. Osong Public Health Res Perspect 2020; 11:3–7 [View Article]
    [Google Scholar]
  8. Petersen E, Koopmans M, Go U, Hamer DH, Petrosillo N et al. Comparing SARS-CoV-2 with SARS-CoV and influenza pandemics. Lancet Infect Dis 2020; 20:e238–e244 [View Article]
    [Google Scholar]
  9. Li Q, Guan X, Wu P, Wang X, Zhou L et al. Early transmission dynamics in Wuhan, China, of novel Coronavirus–Infected pneumonia. N Engl J Med 2020; 382:1199–1207 [View Article]
    [Google Scholar]
  10. Lu R, Zhao X, Li J, Niu P, Yang B et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. The Lancet 2020; 395:565–574 [View Article]
    [Google Scholar]
  11. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018; 34:i884–i890 [View Article]
    [Google Scholar]
  12. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J 2011; 17:10 [View Article]
    [Google Scholar]
  13. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009; 25:1754–1760 [View Article]
    [Google Scholar]
  14. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J et al. The sequence Alignment/Map format and SAMtools. Bioinformatics 2009; 25:2078–2079 [View Article]
    [Google Scholar]
  15. Grubaugh ND, Gangavarapu K, Quick J, Matteson NL, De Jesus JG et al. An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar. Genome Biol 2019; 20:8 [View Article]
    [Google Scholar]
  16. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 2012; 6:80–92 [View Article][PubMed]
    [Google Scholar]
  17. Katoh K. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 2002; 30:3059–3066 [View Article]
    [Google Scholar]
  18. Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD et al. Corrigendum to: IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol 2020; 37:2461 [View Article][PubMed]
    [Google Scholar]
  19. Lanave C, Preparata G, Sacone C, Serio G. A new method for calculating evolutionary substitution rates. J Mol Evol 1984; 20:86–93 [View Article]
    [Google Scholar]
  20. Sagulenko P, Puller V, Neher RA. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol 2018; 4:vex042 [View Article]
    [Google Scholar]
  21. Letunic I, Bork P. Interactive tree of life (iTOL) V4: recent updates and new developments. Nucleic Acids Res 2019; 47:W256–W259 [View Article]
    [Google Scholar]
  22. Rambaut A, Holmes EC, O’Toole Áine, Hill V, McCrone JT et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol 2020; 5:1403–1407 [View Article]
    [Google Scholar]
  23. Banu S, Jolly B, Mukherjee P, Singh P, Khan S et al. A distinct phylogenetic cluster of Indian SARS-CoV-2 isolates. Open Forum Infect Dis 2020ofaa434
    [Google Scholar]
  24. Korber B, Fischer WM, Gnanakaran S, Yoon H, Theiler J et al. Tracking changes in SARS-CoV-2 spike: evidence that D614G increases infectivity of the COVID-19 virus. Cell 2020; 182:812–827 [View Article]
    [Google Scholar]
  25. Korber B, Fischer W, Gnanakaran SG, Yoon H, Theiler J et al. Spike mutation pipeline reveals the emergence of a more transmissible form of SARS-CoV-2. BioRxiv 2020
    [Google Scholar]
  26. Biswas N, Majumder P. Analysis of RNA sequences of 3636 SARS-CoV-2 collected from 55 countries reveals selective sweep of one virus type. Indian J Med Res 2020
    [Google Scholar]
  27. Zhang L, Jackson CB, Mou H, Ojha A, Rangarajan ES et al. The D614G mutation in the SARS-CoV-2 spike protein reduces S1 shedding and increases infectivity. bioRxiv 2020 12 Jun 2020 [View Article][PubMed]
    [Google Scholar]
  28. Maitra A, Sarkar MC, Raheja H, Biswas NK, Chakraborti S et al. Mutations in SARS-CoV-2 viral RNA identified in eastern India: possible implications for the ongoing outbreak in India and impact on viral structure and host susceptibility. J Biosci 2020; 45:76 [View Article]
    [Google Scholar]
  29. Shen L, Dien Bard J, Biegel JA, Judkins AR, Gai X. Comprehensive genome analysis of 6,000 USA SARS-CoV-2 isolates reveals haplotype signatures and localized transmission patterns by state and by country. Front Microbiol 2020; 11:573430 [View Article]
    [Google Scholar]
  30. Song S. The global landscape of SARS-CoV-2 genomes, variants, and haplotypes in 2019nCoVR. http://biorxiv.org/lookup/doi/10.1101/2020.08.30.273235 ; 2020
  31. Paul D, Jani K, Kumar J, Chauhan R. Phylogenomic analysis of SARS-CoV-2 genomes from Western India reveals unique linked mutations. bioRxiv 2020
    [Google Scholar]
  32. Sanjuán R, Nebot MR, Chirico N, Mansky LM, Belshaw R. Viral mutation rates. J Virol 2010; 84:9733–9748 [View Article]
    [Google Scholar]
  33. Bhattacharyya C. Global Spread of SARS-CoV-2 Subtype with Spike Protein Mutation D614G is Shaped by Human Genomic Variations that Regulate Expression of TMPRSS2 and MX1 Genes. http://biorxiv.org/lookup/doi/10.1101/2020.05.04.075911 ; 2020
  34. Wolff G, Limpens RWAL, Zevenhoven-Dobbe JC, Laugks U, Zheng S et al. A molecular pore spans the double membrane of the coronavirus replication organelle. Science 2020; 369:1395–1398 [View Article]
    [Google Scholar]
  35. Hasanoglu I, Korukluoglu G, Asilturk D, Cosgun Y, Kalem AK et al. Higher viral loads in asymptomatic COVID-19 patients might be the invisible part of the iceberg. Infection 2020; 382: [View Article]
    [Google Scholar]
  36. Benefield A. SARS-CoV-2 viral load peaks prior to symptom onset: a systematic review and individual-pooled analysis of coronavirus viral load from 66 studies. http://medrxiv.org/lookup/doi/10.1101/2020.09.28.20202028 ; 2020
  37. Nikolai LA, Meyer CG, Kremsner PG, Velavan TP. Asymptomatic SARS coronavirus 2 infection: invisible yet invincible. Int J Infect Dis 2020; 100:112–116 [View Article][PubMed]
    [Google Scholar]
  38. Angeletti S, Benvenuto D, Bianchi M, Giovanetti M, Pascarella S et al. COVID‐2019: the role of the NSP2 and NSP3 in its pathogenesis. J Med Virol 2020; 92:584–588 [View Article]
    [Google Scholar]
  39. Lei J, Kusov Y, Hilgenfeld R. Nsp3 of coronaviruses: structures and functions of a large multi-domain protein. Antiviral Res 2018; 149:58–74 [View Article]
    [Google Scholar]
  40. Imbert I, Snijder EJ, Dimitrova M, Guillemot J-C, Lécine P et al. The SARS-coronavirus plnC domain of NSP3 as a replication/transcription scaffolding protein. Virus Res 2008; 133:136–148 [View Article]
    [Google Scholar]
  41. Siu K-L, Yuen K-S, Castaño-Rodriguez C, Ye Z-W, Yeung M-L et al. Severe acute respiratory syndrome coronavirus ORF3a protein activates the NLRP3 inflammasome by promoting TRAF3-dependent ubiquitination of ASC. Faseb J 2019; 33:8865–8877 [View Article][PubMed]
    [Google Scholar]
  42. Issa E, Merhi G, Panossian B, Salloum T, Tokajian S. SARS-CoV-2 and ORF3a: nonsynonymous mutations, functional domains, and viral pathogenesis. mSystems 2020; 5:e00266–20 [View Article]
    [Google Scholar]
  43. Sankar R. Comprehensive analysis of genomic diversity of SARS-CoV-2 in different geographic regions of India: An endeavour to classify Indian SARS-CoV-2 strains on the basis of co-existing mutations. http://biorxiv.org/lookup/doi/10.1101/2020.07.14.203463 .
  44. Simmonds P. Detection of genome-scale ordered RNA structure (GORS) in genomes of positive-stranded RNA viruses: implications for virus evolution and host persistence. RNA 2004; 10:1337–1351 [View Article]
    [Google Scholar]
  45. Garmann RF, Gopal A, Athavale SS, Knobler CM, Gelbart WM et al. Visualizing the global secondary structure of a viral RNA genome with cryo-electron microscopy. RNA 2015; 21:877–886 [View Article]
    [Google Scholar]
  46. Andrews RJ. An in silico map of the SARS-CoV-2 RNA Structurome. http://biorxiv.org/lookup/doi/10.1101/2020.04.17.045161 ; 2020
  47. Huston NC, Wan H, Araujo Tavares R, Wilen C, Pyle AM. Comprehensive in-vivo secondary structure of the SARS-CoV-2 genome reveals novel regulatory motifs and mechanisms.
http://instance.metastore.ingenta.com/content/journal/jgv/10.1099/jgv.0.001562
Loading
/content/journal/jgv/10.1099/jgv.0.001562
Loading

Data & Media loading...

Supplements

Supplementary material 1

PDF

Supplementary material 2

EXCEL

Most cited Most Cited RSS feed