Subtilist: a relational database for the Bacillus subtilis genome

Ivan Moszer; Philippe Glaser; Antoine Danchin

doi:10.1099/13500872-141-2-261

Volume 141, Issue 2

Review Article

Free

Subtilist: a relational database for the Bacillus subtilis genome

Ivan Moszer¹, Philippe Glaser¹ and Antoine Danchin¹
View Affiliations Hide Affiliations

Affiliations: ¹ Unité de Régulation de l'Expression Génétique, Institut Pasteur, 28 rue du Docteur Roux, 75724 Paris Cedex 15, France
Author for correspondence: Antoine Danchin. Tel: +33 1 45 68 84 41. Fax: +33 1 45 68 89 48. e-mail: [email protected]
Published: 01 February 1995 https://doi.org/10.1099/13500872-141-2-261

Abstract

SUMMARY

In the framework of the international collaborative project aiming to sequence the whole Bacillus subtilis chromosome, we have created a relational database for managing and analysing information associated with the molecular genetics of this bacterium: Subtilist. It allows recovery of non-redundant DNA sequences of the B. subtilis genome, as well as related information, i.e. genes, proteins, etc. A logical structure has been designed with appropriate links between the different objects, and a set of procedures has been implemented for data updating and management. The database is organized around a core constituted by all known contigs of B. subtilis, i.e. sets of nonredundant sequences created from original entries in the EMBL data library. A user-friendly interface has been developed to make the database easy to consult. Sequence analysis tools have been integrated into the database, such as a program for rapid similarity searching of protein data banks, and a powerful DNA pattern searching program. Thanks to the consistency of Subtilist, we have performed a codon usage analysis by Factorial Correspondence Analysis, and a study of the distribution of the isoelectric points of known proteins of B. subtilis. The Subti List database is available through anonymous ftp (address ‘ftp.pasteur.fr’ or IP number 157.99.64.12, directory ‘/pub/GenomeDB/SubtiList’

Received: 13/07/1994
Accepted: 04/10/1994
Revised: 16/09/1994
Published Online: 01/02/1995

Keyword(s): Bacillus subtilis , codon usage , database , genome sequencing and isoelectric point

Article metrics loading...

/content/journal/micro/10.1099/13500872-141-2-261

1995-02-01

2024-04-24

Full text loading...

/deliver/fulltext/micro/141/2/mic-141-2-261.html?itemId=/content/journal/micro/10.1099/13500872-141-2-261&mimeType=html&fmt=ahah

References

Anagnostopoulos C., Piggot P. J., Hoch J. A. 1993 The genetic map of Bacillus subtilis. . In Bacillus subtilis and Other Gram-positive Bacteria: Biochemistry, Physiology and Molecular Genetics, pp 425–461 Edited by Sonenshein A. L., Hoch J. A., Losick R. Washington, DC: American Society for Microbiology;
[Google Scholar]
Bairoch A., Boeckmann B. 1993; The SWISS-PROT protein sequence data bank, recent developments. Nucleic Acids Res 21:3093–3096
[Google Scholar]
Bouffard G., Ostell J., Rudd K. E. 1992; GeneScape: a relational database of Escherichia coli genomic map data for Macintosh computers.. Comput Appl Biosci 8:563–567
[Google Scholar]
Delorme M. O., Hénaut A. 1988; Merging of distance matrices and classification by dynamic clustering.. Comput Appl Biosci 4:453–458
[Google Scholar]
Diday E. 1971; Une nouvelle méthode en classification auto- matique et reconnaissance des formes: la méthode des nuées dynamiques.. Rev Stat Appl 19:19–33
[Google Scholar]
Hill M. O. 1974; Correspondence analysis: a neglected multivariate method.. Appl Stat 23:340–353
[Google Scholar]
Itaya M., Tanaka T. 1991; Complete physical map of the Bacillus subtilis 168 chromosome constructed by a gene-directed mutagenesis method.. J Mol biol 220:631–648
[Google Scholar]
Kohara Y., Akiyama K., Isono K. 1987; The physical map of the whole E. coli chromosome: application of a new strategy for rapid analysis and sorting of a large genomic library.. Cell 50:495–508
[Google Scholar]
Kröger M., Wahl R., Rice P. 1993; Compilation of DNA sequences of Escherichia coli (update 1993).. Nucleic Acids Res 21:2973–3000
[Google Scholar]
Kunisawa T., Nakamura M., Watanabe H., Otsuka J., Tsugita A., Yeh L.S., George D. G., Barker W. C. 1990; Escherichia coli K12 genomic database.. Protein Sequences Data Anal 3:157–162
[Google Scholar]
Kunst F., Vassarotti A., Danchin A. 1995; Organization of the European Bacillus subtilis genome sequencing project.. Microbiology 141:249–255
[Google Scholar]
Lipman D. J., Pearson W. R. 1985; Rapid and sensitive protein similarity searches.. Science 227:1435–1441
[Google Scholar]
Médigue C., Rouxel T., Vigier P., Hénaut A., Danchin A. 1991; Evidence for horizontal gene transfer in Escherichia coli speciation.. J Mol biol 222:851–856
[Google Scholar]
Médigue C., Viari A., Hénaut A, Danchin A. 1993; Colibri: a functional data base for the Escherichia coli genome.. Microbiol Rev 57:623–654
[Google Scholar]
Needleman S. B., Wunsch C. D. 1970; A general method applicable to the search for similarities in the amino acid sequence of two proteins.. J Mol biol 48:443–453
[Google Scholar]
Perrière G., Gautier C. 1993; ColiGene: object-centered representation for the study of E. coli gene expressivity by sequence analysis.. Biochimie 75:415–422
[Google Scholar]
Rice C. M., Fuchs R., Higgins D. G., Stoehr P. J., Cameron G. N. 1993; The EMBL data library.. Nucleic Acids Res 21:2967–2971
[Google Scholar]
Rudd K. E. 1993; Maps, genes, sequences, and computers: an Escherichia coli case study.. ASM News 59:335–341
[Google Scholar]
Sellers P. H. 1974; On the theory and computation of evolutionary distances.. SIAM J Appl Math 26:787–793
[Google Scholar]
Sharp P. M., Higgins D. G., Shields D. C., Devine K. M. 1990a; Protein-coding genes: DNA sequence database and codon usage.. In Molecular Biological Methods for Bacillus, pp 557–569 Edited by Harwood C. R., Cutting S. M. Chichester: John Wiley and Sons;
[Google Scholar]
Sharp P. M., Higgins D. G., Shields D. C., Devine K. M., Hoch J. A. 1990b In Bacillus suhtilis gene sequences. Genetics and Biotechnology of Bacilli pp 89–98 Edited by Zukowski M. M., Ganesan A. T., Hoch J. A. San Diego: Academic Press;
[Google Scholar]
Shields D. C., Sharp P. M. 1987; Synonymous codon usage in Bacillus subtilis reflects both translational selection and mutational biases.. Nucleic AcidsRes 15:8023–8040
[Google Scholar]
Shin D. G., Lee C., Zhang J., Rudd K. E., Berg C. M. 1992; Redesigning, implementing and integrating Escherichia coli genome software tools with an object-oriented database system.. Comput Appl Biosci 8:227–238
[Google Scholar]
Slonimski P. P., Brouillet S. 1993; A data-base of chromosome III of Saccharomyces cerevisiae. . Yeast 9:941–1029
[Google Scholar]
Wilbur W. J., Lipman D. J. 1983; Rapid similarity searches of nucleic acid and protein data banks.. Proc Natl Acad Sci USA 80:726–730
[Google Scholar]

http://instance.metastore.ingenta.com/content/journal/micro/10.1099/13500872-141-2-261

Subtilist: a relational database for the Bacillus subtilis genome

Microbiology 141, 261 (1995); https://doi.org/10.1099/13500872-141-2-261

/content/journal/micro/10.1099/13500872-141-2-261

Data & Media loading...

Volume 141, Issue 2

Review Article

Free

Subtilist: a relational database for the Bacillus subtilis genome

Abstract

Most read this month

Most cited Most Cited RSS feed

Generic Assignments, Strain Histories and Properties of Pure Cultures of Cyanobacteria

Metals, minerals and microbes: geomicrobiology and bioremediation

Quantification of biofilm structures by the novel computer program comstat

Autotrophic growth of anaerobic ammonium-oxidizing micro-organisms in a fluidized bed reactor

Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin

Plant-beneficial effects of Trichoderma and of its genes

The ecology, epidemiology and virulence of Enterococcus

Quorum sensing and Chromobacterium violaceum: exploitation of violacein production and inhibition for the detection of N-acylhomoserine lactones

Microbe Profile: Pseudomonas aeruginosa: opportunistic pathogen and lab rat