Analysis of errors in finished DNA sequences: the surfactin operon of as an example Free

Abstract

SUMMARY: Increased productivity in DNA sequencing would not be valid without a straightforward detection and estimation of errors in finished sequences. The sequence of the surfactin operon from was obtained by two different groups and by chance we were also working on the same chromosome region. Taking advantage of this situation we report in this paper, the number and nature of errors found in the overlapping part of the DNA sequences obtained by the three laboratories. The coincidence of some of the errors with compression in sequence ladders and with secondary DNA structures as well as the detection of frameshift errors using computer programs, are demonstrated. Finally we discuss the definition of a new sequencing strategy that might minimize both the error rate and the cost of sequencing.

Loading

Article metrics loading...

/content/journal/micro/10.1099/13500872-141-2-345
1995-02-01
2024-03-29
Loading full text...

Full text loading...

/deliver/fulltext/micro/141/2/mic-141-2-345.html?itemId=/content/journal/micro/10.1099/13500872-141-2-345&mimeType=html&fmt=ahah

References

  1. Anagnostopoulos C., Piggot P. J., Hoch J. A. 1993; The genetic map of Bacillus subtilis . In Bacillus and Other Gram-Positive Bacteria: Biochemistry, Physiology and Molecular Genetics pp 425–461 Edited by Sonenshein A. L., Hoch J. A., Losick R. Washington, DC: American Society for Microbiology;
    [Google Scholar]
  2. Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J. 1990; Basic local alignment search tool. J Mol Biol 215:403–410
    [Google Scholar]
  3. Barnett R. S., Davidson J. N. 1989; Coating gel plates. Focus 11:75
    [Google Scholar]
  4. Chen W. Q., Hunkapiller T. 1992; Sequence accuracy of large DNA sequencing projects. DNA Seq 2:335–342
    [Google Scholar]
  5. Churchill G. A., Waterman M. S. 1992; The accuracy of DNA sequences: estimating sequence quality. Genomics 14:89–98
    [Google Scholar]
  6. Cosmina P., Rodriguez F., de Ferra F., Grandi G., Perego M., Venema G., van Sinderen D. 1993; Sequence and analysis of the genetic locus responsible for surfactin synthesis in Bacillus subtilis . Mol Microbiol 8:821–831
    [Google Scholar]
  7. De Araujo Novaes M., Denizot F. 1993; An automatic approach for DNA sequencing. Biochimie 75:347–351
    [Google Scholar]
  8. Devereux J., Haeberli P., Smithies O. 1984; A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Res 12:387–395
    [Google Scholar]
  9. Fichant G., Gautier C. 1987; Statistical method for predicting protein coding regions in nucleic acid sequences. CABIOS 3:287–295
    [Google Scholar]
  10. Fuma S., Fujishima Y., Corbell N., D’Souza C., Nakano M. M., Zuber P., Yamane K. 1993; Nucleotide sequence of 5’ portion of srf A that contains the region required for competence es­ tablishment in Bacillus subtilis . Nucleic Acids Res 21:93–97
    [Google Scholar]
  11. Garoff H., Ansorge W. 1981; Improvements of DNA sequencing. Anal Biochem 115:450–457
    [Google Scholar]
  12. Howard J. 1992; Formamide gels: relief from compression artefacts. Comments (USB) 19:62–63
    [Google Scholar]
  13. Khurshid F., Beck S. 1993; Error analysis in manual and automated DNA sequencing. Anal Biochem 208:138–143
    [Google Scholar]
  14. Koop B. F., Rowan L., Chen W. Q., Deshpande P., Lee H., Hood L. 1993; Sequence length and error analysis of sequenase and automated Taq cycle sequencing methods. BioTechniques 14:442–447
    [Google Scholar]
  15. Krawetz S. A. 1989; Sequence errors described in GenBank: a means to determine the accuracy of DNA sequence interpretation. Nucleic Acids Res 17:3951–3956
    [Google Scholar]
  16. Kristensen T., Lopez S., Prydz H. 1992; An estimate of the sequencing error frequency in the DNA sequence databases. DNA Seq 2:343–346
    [Google Scholar]
  17. Lang B. F., Burger G. 1990; A rapid, high resolution DNA sequencing gel system. Anal Biochem 188:176–180
    [Google Scholar]
  18. Mizusawa S., Nishimura S., Sela F. 1986; Improvement of the dideoxy chain termination method of DNA sequencing by use of deoxy-7-deazaguanosine triphosphate in place of dGTP. Nucleic Acids Res 14:1319–1324
    [Google Scholar]
  19. Radola B. J. 1980; Ultrathin-layer isoelectric focusing in 50-100 μm polyacrylamide gels on silanized glass or polyester films. Electrophoresis 1:43–56
    [Google Scholar]
  20. Rychlik W., Rhoads R. E. 1990; A computer program for choosing optimal oligonucleotides for filter hybridization, sequencing, and in vitro amplification of DNA. Nucleic Acids Res 17:8543–8551
    [Google Scholar]
  21. Salles C., Creancier L., Claverys J. P., Mejean V. 1992; The high level streptomycin gene from Streptococcus pneumoniae is a homolog of the ribosomal protein S12 gene from Escherichia coli . Nucleic Acids Res 20: 6103
    [Google Scholar]
  22. Sambrook J., Fritsch E. F., Maniatis T. 1989 Molecular Cloning: a Laboratory Manual Cold Spring Harbor, NY: Cold Spring Harbor Laboratory;
    [Google Scholar]
  23. Sanger F., Nicklen S., Coulson A. R. 1977; DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci USA 74:5463–5467
    [Google Scholar]
  24. Sulston J. and others 1992; The C. elegans genome sequencing project: a beginning. Nature 356:37–41
    [Google Scholar]
  25. Wilson R. 1994 and others 2·2 Mb of contiguous nucleotide sequence from chromosome III of C.elegans . Nature 368:32–38
    [Google Scholar]
  26. Zimmermann J., Voss H., Schwager C., Stegemann J., Erfle H., Stucky K., Kristensen T., Ansorge W. 1990; A simplified protocol for fast plasmid DNA sequencing. Nucleic Acids Res 18:1067
    [Google Scholar]
http://instance.metastore.ingenta.com/content/journal/micro/10.1099/13500872-141-2-345
Loading
/content/journal/micro/10.1099/13500872-141-2-345
Loading

Data & Media loading...

Most cited Most Cited RSS feed