1887

Abstract

Dengue virus (DENV) represents a public health threat and economic burden in affected countries. The availability of genomic data is key to understanding viral evolution and dynamics, supporting improved control strategies. Currently, the use of high-throughput sequencing (HTS) technologies, which can be applied both directly to patient samples (shotgun metagenomics) and to PCR-amplified viral sequences (amplicon sequencing), is potentially the most informative approach to monitor viral dissemination and genetic diversity by providing, in a single methodological step, identification and characterization of the whole viral genome at the nucleotide level. Despite many advantages, these technologies require bioinformatics expertise and appropriate infrastructure for the analysis and interpretation of the resulting data. In addition, the many software solutions available can hamper the reproducibility and comparison of results. Here we present DEN-IM, a one-stop, user-friendly, containerized and reproducible workflow for the analysis of DENV short-read sequencing data from both amplicon and shotgun metagenomics approaches. It is able to infer the DENV coding sequence (CDS), identify the serotype and genotype, and generate a phylogenetic tree. It can easily be run on any UNIX-like system, from local machines to high-performance computing clusters, performing a comprehensive analysis without the requirement for extensive bioinformatics expertise. Using DEN-IM, we successfully analysed two types of DENV datasets. The first comprised 25 shotgun metagenomic sequencing samples from patients with variable serotypes and genotypes, including an spiked sample containing the four known serotypes. The second consisted of 106 paired-end and 76 single-end amplicon sequences of DENV 3 genotype III and DENV 1 genotype I, respectively, where DEN-IM allowed detection of the intra-genotype diversity. The DEN-IM workflow, parameters and execution configuration files, and documentation are freely available at https://github.com/B-UMMI/DEN-IM).

Funding
This study was supported by the:
  • Catarina I Mendes , Fundação para a Ciência e a Tecnologia , (Award SFRH/BD/129483/2017)
Loading

Article metrics loading...

/content/journal/mgen/10.1099/mgen.0.000328
2020-03-05
2020-06-04
Loading full text...

Full text loading...

/deliver/fulltext/mgen/6/3/mgen000328.html?itemId=/content/journal/mgen/10.1099/mgen.0.000328&mimeType=html&fmt=ahah

References

  1. World Health Organization Dengue: guidelines for diagnosis, treatment, prevention, and control. Spec Program Res Train Trop Dis 2009; x:147
    [Google Scholar]
  2. Diamond MS, Pierson TC. Molecular insight into dengue virus pathogenesis and its implications for disease control. Cell 2015; 162:488–492 [CrossRef]
    [Google Scholar]
  3. Bhatt S, Gething PW, Brady OJ, Messina JP, Farlow AW et al. The global distribution and burden of dengue. Nature 2013; 496:504–507 [CrossRef]
    [Google Scholar]
  4. Lourenço J, Tennant W, Faria NR, Walker A, Gupta S et al. Challenges in dengue research: a computational perspective. Evol Appl 2018; 11:516–533 [CrossRef]
    [Google Scholar]
  5. Leitmeyer KC, Vaughn DW, Watts DM, Salas R, Villalobos I et al. Dengue virus structural differences that correlate with pathogenesis. J Virol 1999; 73:4738–4747 [CrossRef]
    [Google Scholar]
  6. Yozwiak NL, Skewes-Cox P, Stenglein MD, Balmaseda A, Harris E et al. Virus identification in unknown tropical febrile illness cases using deep sequencing. PLoS Negl Trop Dis 2012; 6:e1485 [CrossRef]
    [Google Scholar]
  7. Lee CK, Chua CW, Chiu L, Koay ES-C. Clinical use of targeted high-throughput whole-genome sequencing for a dengue virus variant. Clin Chem Lab Med 2017; 55:e209 [CrossRef]
    [Google Scholar]
  8. Fatima Z, Idrees M, Bajwa MA, Tahir Z, Ullah O et al. Serotype and genotype analysis of dengue virus by sequencing followed by phylogenetic analysis using samples from three mini outbreaks-2007-2009 in Pakistan. BMC Microbiol 2011; 11:200 [CrossRef]
    [Google Scholar]
  9. Fonseca V, Libin PJK, Theys K, Faria NR, Nunes MRT et al. A computational method for the identification of dengue, Zika and Chikungunya virus species and genotypes. PLoS Negl Trop Dis 2019; 13:e0007231 [CrossRef]
    [Google Scholar]
  10. Vilsker M, Moosa Y, Nooij S, Fonseca V, Ghysens Y et al. Genome detective: an automated system for virus identification from high-throughput sequencing data. Bioinformatics 2019; 35:871–873 [CrossRef]
    [Google Scholar]
  11. Li Y, Wang H, Nie K, Zhang C, Zhang Y et al. Vip: an integrated pipeline for metagenomics of virus identification and discovery. Sci Rep 2016; 6:1–10 [CrossRef]
    [Google Scholar]
  12. Yamashita A, Sekizuka T, Kuroda M. VirusTAP: viral Genome-Targeted assembly pipeline. Front Microbiol 2016; 7:1–5 [CrossRef]
    [Google Scholar]
  13. Lin H-H, Liao Y-C. drVM: a new tool for efficient genome assembly of known eukaryotic viruses from metagenomes. Gigascience 2017; 6:1–10 [CrossRef]
    [Google Scholar]
  14. Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E et al. Nextflow enables reproducible computational workflows. Nat Biotechnol 2017; 35:316–319 [CrossRef]
    [Google Scholar]
  15. Gerhardt L, Bhimji W, Canon S, Fasel M, Jacobsen D et al. Shifter: containers for HPC. J Phys Conf Ser 2017; 898:082021 [CrossRef]
    [Google Scholar]
  16. Kurtzer GM, Sochat V, Bauer MW. Singularity: scientific containers for mobility of compute. PLoS One 2017; 12:e0177459–20 [CrossRef]
    [Google Scholar]
  17. Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics 2011; 27:863–864 [CrossRef]
    [Google Scholar]
  18. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods 2012; 9:357–359 [CrossRef]
    [Google Scholar]
  19. Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 2011; 27:2987–2993 [CrossRef]
    [Google Scholar]
  20. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 2012; 19:455–477 [CrossRef]
    [Google Scholar]
  21. Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 2015; 31:1674–1676 [CrossRef]
    [Google Scholar]
  22. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 2014; 9:e112963 [CrossRef]
    [Google Scholar]
  23. Machado MP, Ribeiro-Gonçalves B, Silva M, Ramirez M, Carriço JA. Epidemiological surveillance and typing methods to track antibiotic resistant strains using high throughput sequencing. Methods Mol Biol 2017; 1520:331–356 [CrossRef]
    [Google Scholar]
  24. Altschul S et al. Gapped blast and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997; 25:3389–3402 [CrossRef]
    [Google Scholar]
  25. Nakamura T, Yamada KD, Tomii K, Katoh K. Parallelization of MAFFT for large-scale multiple sequence alignments. Bioinformatics 2018; 34:2490–2492 [CrossRef]
    [Google Scholar]
  26. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014; 30:1312–1313 [CrossRef]
    [Google Scholar]
  27. Parameswaran P, Wang C, Trivedi SB, Eswarappa M, Montoya M et al. Intrahost selection pressures drive rapid dengue virus microevolution in acute human infections. Cell Host Microbe 2017; 22:400–410 [CrossRef]
    [Google Scholar]
  28. Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 2014; 15:R46 [CrossRef]
    [Google Scholar]
  29. Marinho PES, Bretas de Oliveira D, Candiani TMS, Crispim APC, Alvarenga PPM et al. Meningitis associated with simultaneous infection by multiple dengue virus serotypes in children, Brazil. Emerg Infect Dis 2017; 23:115–118 [CrossRef]
    [Google Scholar]
  30. Reddy MN, Dungdung R, Valliyott L, Pilankatta R. Occurrence of concurrent infections with multiple serotypes of dengue viruses during 2013–2015 in northern Kerala, India. PeerJ 2017; 5:e2970 [CrossRef]
    [Google Scholar]
  31. Cuypers L, Libin P, Simmonds P, Nowé A, Muñoz-Jordán J et al. Time to harmonize dengue Nomenclature and classification. Viruses 2018; 10:569 [CrossRef]
    [Google Scholar]
  32. Pickett BE, Greer DS, Zhang Y, Stewart L, Zhou L et al. Virus pathogen database and analysis resource (ViPR): a comprehensive bioinformatics database and analysis resource for the coronavirus research community. Viruses 2012; 4:3209–3226 [CrossRef]
    [Google Scholar]
http://instance.metastore.ingenta.com/content/journal/mgen/10.1099/mgen.0.000328
Loading
/content/journal/mgen/10.1099/mgen.0.000328
Loading

Data & Media loading...

Supplements

Loading data from figshare Loading data from figshare

Most cited this month Most Cited RSS feed

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error