Lipoprotein computational prediction in spirochaetal genomes

João C. Setubal; Marcelo Reis; James Matsunaga; David A. Haake

doi:10.1099/mic.0.28317-0

Volume 152, Issue 1

João C. Setubal¹, Marcelo Reis², James Matsunaga³,⁴ and David A. Haake³,⁴

Affiliations:
¹ Virginia Bioinformatics Institute, Virginia Tech, Bioinformatics 1, Box 0477, Blacksburg, VA 24060-0477, USA
² Laboratório de Bioinformática, Instituto de Computação, Universidade Estadual de Campinas, Caixa Postal 6076, Campinas, SP 13084-071, Brazil
³ Division of Infectious Diseases, 111F, Veterans Affairs Greater Los Angeles Healthcare System, Los Angeles, CA 90073, USA
⁴ Department of Medicine, David Geffen School of Medicine at UCLA, Los Angeles, CA 90095, USA

Correspondence: David A. Haake [email protected]
Published: 01 January 2006 https://doi.org/10.1099/mic.0.28317-0

Abstract

Lipoproteins are of great interest in understanding the molecular pathogenesis of spirochaetes. Because spirochaete lipobox sequences exhibit more plasticity than those of other bacteria, application of existing prediction algorithms to emerging sequence data has been problematic. In this paper a novel lipoprotein prediction algorithm is described, designated SpLip, constructed as a hybrid of a lipobox weight matrix approach supplemented by a set of lipoprotein signal peptide rules allowing for conservative amino acid substitutions. Both the weight matrix and the rules are based on a training set of 28 experimentally verified spirochaetal lipoproteins. The performance of the SpLip algorithm was compared to that of the hidden Markov model-based LipoP program and the rules-based algorithm Psort for all predicted protein-coding genes of Leptospira interrogans sv. Copenhageni, L. interrogans sv. Lai, Borrelia burgdorferi, Borrelia garinii, Treponema pallidum and Treponema denticola. Psort sensitivity (13–35 %) was considerably less than that of SpLip (93–100 %) or LipoP (50–84 %) due in part to the requirement of Psort for Ala or Gly at the −1 position, a rule based on E. coli lipoproteins. The percentage of false-positive lipoprotein predictions by the LipoP algorithm (8–30 %) was greater than that of SpLip (0–1 %) or Psort (4–27 %), due in part to the lack of rules in LipoP excluding unprecedented amino acids such as Lys and Arg in the −1 position. This analysis revealed a higher number of predicted spirochaetal lipoproteins than was previously known. The improved performance of the SpLip algorithm provides a more accurate prediction of the complete lipoprotein repertoire of spirochaetes. The hybrid approach of supplementing weight matrix scoring with rules based on knowledge of protein secretion biochemistry may be a general strategy for development of improved prediction algorithms.
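The hybrid idea described in the abstract, a lipobox weight matrix whose hits are then filtered by signal peptide rules, can be illustrated with a minimal Python sketch. This is not the SpLip implementation: the toy lipobox windows, the uniform background, the score threshold, the Cys search range and the positive-charge rule are all assumptions made for illustration. Only the exclusion of Lys and Arg at the −1 position is taken from the abstract.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Hypothetical lipobox windows (positions -3, -2, -1, +1).
# These are NOT the 28 training-set lipoproteins used in the paper.
TOY_LIPOBOXES = ["LSAC", "LAGC", "ISSC", "LTAC", "FSNC", "LVSC"]


def build_weight_matrix(windows, pseudocount=1.0):
    """Per-position log-odds weights against a uniform background (assumed)."""
    background = 1.0 / len(AMINO_ACIDS)
    total = len(windows) + pseudocount * len(AMINO_ACIDS)
    matrix = []
    for pos in range(len(windows[0])):
        counts = Counter(w[pos] for w in windows)
        matrix.append({
            aa: math.log(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return matrix


def passes_rules(seq, cys_index):
    """Rule filter applied after weight matrix scoring."""
    # From the abstract: Lys and Arg are unprecedented at the -1 position.
    if seq[cys_index - 1] in "KR":
        return False
    # Assumed rule: at least one Lys/Arg upstream of the lipobox window.
    n_region = seq[:cys_index - 3]
    return any(aa in "KR" for aa in n_region)


def predict_lipobox(seq, matrix, threshold=0.0, search_range=(10, 35)):
    """Return (cys_index, score) for the best accepted lipobox, or None.

    threshold and search_range are illustrative values, not published ones.
    """
    best = None
    start, stop = search_range
    for i in range(max(start, 3), min(stop, len(seq))):
        if seq[i] != "C":
            continue
        window = seq[i - 3:i + 1]
        if any(aa not in AMINO_ACIDS for aa in window):
            continue
        score = sum(matrix[pos][aa] for pos, aa in enumerate(window))
        if score >= threshold and passes_rules(seq, i):
            if best is None or score > best[1]:
                best = (i, score)
    return best


matrix = build_weight_matrix(TOY_LIPOBOXES)
print(predict_lipobox("MKKLLSILAVFSLLLSACSSNDEK", matrix))  # accepted at Cys 17
print(predict_lipobox("MKKLLSILAVFSLLLSKCSSNDEK", matrix))  # Lys at -1: rejected
```

In this toy setup the second sequence still scores above the threshold on the weight matrix alone. It is rejected only by the rule, which is the kind of false positive the abstract attributes to approaches lacking such rules.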

Received: 02/07/2005
Revised: 20/09/2005
Accepted: 18/10/2005
Published Online: 01/01/2006

Abbreviations: PPCG, predicted protein-coding gene; TS, training set; WM, weight matrix

Microbiology 152, 113 (2006); https://doi.org/10.1099/mic.0.28317-0
