The extremely diverse genus Lactobacillus is the largest among the lactic acid bacteria, with over 145 recognized species. In this work, which to our knowledge is the largest comparative phylogenomics study of a single genus to date, 12 genomes of Lactobacillus strains were subjected to an array of whole-genome and single-marker phylogenetic approaches, to investigate the case for extracting subgeneric groups and to determine whether a single congruent phylogeny could be identified. We conclude that GroEL is a more robust single-gene phylogenetic marker for the genus Lactobacillus than the 16S rRNA gene, when no whole-genome information is available. Significant incongruence was found, both within a set of trees based on 141 core proteins and within those phylogenies based on numbers of orthologues, concatenated RNA polymerase subunits and single gene/protein markers. This is possibly due to different evolutionary rates, hidden paralogies or horizontal gene transfer. Such phylogenetic ambiguities are efficiently visualized with cluster-networks. Although the genus contains some highly unstable taxa, four subgeneric groups were distinguished. Qualitative and quantitative gene analysis of these groups resulted in three findings: there is a relatively small number of group-specific proteins, the majority of which are poorly characterized; major groupings are functionally better distinguishable by absent genes rather than gained/retained genes; and, finally, a gene cluster possibly involved in purine metabolism is uniquely present in four lactobacilli associated with meat. In conclusion, because of either significantly different branching patterns or the availability of too few members, three of the four identified groups could not serve as the basis for identifying candidate novel genera within the current genus. We therefore suggest targeted sequencing of key taxonomic species identified here, which are likely to add sufficient depth for a future reclassification, followed by phylogenomic analysis involving the core proteins identified here. This will ideally be combined with phenotypic data using a polyphasic approach.
Altermann, E., Russell, W. M., Azcarate-Peril, M. A., Barrangou, R., Buck, B. L., McAuliffe, O., Souther, N., Dobson, A., Duong, T. & other authors(2005). Complete genome sequence of the probiotic lactic acid bacterium Lactobacillus acidophilus NCFM. Proc Natl Acad Sci U S A102, 3906–3912.[CrossRef][Google Scholar]
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J.(1990). Basic local alignment search tool. J Mol Biol215, 403–410.[CrossRef][Google Scholar]
Berger, B., Pridmore, R. D., Barretto, C., Delmas-Julien, F., Schreiber, K., Arigoni, F. & Brussow, H.(2007). Similarity and differences in the Lactobacillus acidophilus group identified by polyphasic analysis and comparative genomics. J Bacteriol189, 1311–1321.[CrossRef][Google Scholar]
Bininda-Emonds, O. R.(2004). The evolution of supertrees. Trends Ecol Evol19, 315–322.[CrossRef][Google Scholar]
Bolotin, A., Quinquis, B., Renault, P., Sorokin, A., Ehrlich, S. D., Kulakauskas, S., Lapidus, A., Goltsman, E., Mazur, M. & other authors(2004). Complete sequence and comparative genome analysis of the dairy bacterium Streptococcus thermophilus. Nat Biotechnol22, 1554–1558.[CrossRef][Google Scholar]
Bringel, F., Castioni, A., Olukoya, D. K., Felis, G. E., Torriani, S. & Dellaglio, F.(2005).Lactobacillus plantarum subsp. argentoratensis subsp. nov., isolated from vegetable matrices. Int J Syst Evol Microbiol55, 1629–1634.[CrossRef][Google Scholar]
Callanan, M., Kaleta, P., O'Callaghan, J., O'Sullivan, O., Jordan, K., McAuliffe, O., Sangrador-Vegas, A., Slattery, L., Fitzgerald, G. F. & other authors(2008). Genome sequence of Lactobacillus helveticus, an organism distinguished by selective gene loss and insertion sequence element expansion. J Bacteriol190, 727–735.[CrossRef][Google Scholar]
Canchaya, C., Claesson, M. J., Fitzgerald, G. F., van Sinderen, D. & O'Toole, P. W.(2006). Diversity of the genus Lactobacillus revealed by comparative genomics of five species. Microbiology152, 3185–3196.[CrossRef][Google Scholar]
Chaillou, S., Champomier-Verges, M. C., Cornet, M., Crutz-Le Coq, A. M., Dudez, A. M., Martin, V., Beaufils, S., Darbon-Rongere, E., Bossy, R. & other authors(2005). The complete genome sequence of the meat-borne lactic acid bacterium Lactobacillus sakei 23K. Nat Biotechnol23, 1527–1533.[CrossRef][Google Scholar]
Choi, I. G. & Kim, S. H.(2007). Global extent of horizontal gene transfer. Proc Natl Acad Sci U S A104, 4489–4494.[CrossRef][Google Scholar]
Claesson, M. J., Li, Y., Leahy, S., Canchaya, C., van Pijkeren, J. P., Cerdeno-Tarraga, A. M., Parkhill, J., Flynn, S., O'Sullivan, G. C. & other authors(2006). Multireplicon genome architecture of Lactobacillus salivarius. Proc Natl Acad Sci U S A103, 6718–6723.[CrossRef][Google Scholar]
Creevey, C. J. & McInerney, J. O.(2005). Clann: investigating phylogenetic information through supertree analyses. Bioinformatics21, 390–392.[CrossRef][Google Scholar]
Dellaglio, F. & Felis, G. E.(2005). Taxonomy of lactobacilli and bifidobacteria. In Probiotics and Prebiotics: Scientific Aspects, pp. 25–49. Edited by G. W. Tannock. Wymondham, UK: Caister Academic.
Delsuc, F., Brinkmann, H. & Philippe, H.(2005). Phylogenomics and the reconstruction of the tree of life. Nat Rev Genet6, 361–375.
[Google Scholar]
Edgar, R. C.(2004).muscle: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res32, 1792–1797.[CrossRef][Google Scholar]
Eisen, J. A. & Fraser, C. M.(2003). Phylogenomics: intersection of evolution and genomics. Science300, 1706–1707.[CrossRef][Google Scholar]
Eisen, J. A. & Hanawalt, P. C.(1999). A phylogenomic study of DNA repair genes, proteins, and processes. Mutat Res435, 171–213.[CrossRef][Google Scholar]
Enright, A. J., Van Dongen, S. & Ouzounis, C. A.(2002). An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res30, 1575–1584.[CrossRef][Google Scholar]
Felis, G. E. & Dellaglio, F.(2007). Taxonomy of lactobacilli and bifidobacteria. Curr Issues Intest Microbiol8, 44–61.
[Google Scholar]
Gevers, D., Cohan, F. M., Lawrence, J. G., Spratt, B. G., Coenye, T., Feil, E. J., Stackebrandt, E., Van de Peer, Y., Vandamme, P. & other authors(2005). Opinion: re-evaluating prokaryotic species. Nat Rev Microbiol3, 733–739.[CrossRef][Google Scholar]
Goris, J., Konstantinidis, K. T., Klappenbach, J. A., Coenye, T., Vandamme, P. & Tiedje, J. M.(2007). DNA–DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol57, 81–91.[CrossRef][Google Scholar]
Guindon, S. & Gascuel, O.(2003). A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol52, 696–704.[CrossRef][Google Scholar]
Hill, J. E., Penny, S. L., Crowell, K. G., Goh, S. H. & Hemmingsen, S. M.(2004). cpnDB: a chaperonin sequence database. Genome Res14, 1669–1675.[CrossRef][Google Scholar]
Holland, B. R., Huber, K. T., Moulton, V. & Lockhart, P. J.(2004). Using consensus networks to visualize contradictory evidence for species phylogeny. Mol Biol Evol21, 1459–1461.[CrossRef][Google Scholar]
Huson, D. H. & Bryant, D.(2006). Application of phylogenetic networks in evolutionary studies. Mol Biol Evol23, 254–267.
[Google Scholar]
Huson, D. H., Dezulian, T., Klopper, T. & Steel, M. A.(2004). Phylogenetic super-networks from partial trees. IEEE/ACM Trans Comput Biol Bioinform1, 151–158.[CrossRef][Google Scholar]
Jeffroy, O., Brinkmann, H., Delsuc, F. & Philippe, H.(2006). Phylogenomics: the beginning of incongruence? Trends Genet22, 225–231.[CrossRef][Google Scholar]
Jian, W., Zhu, L. & Dong, X.(2001). New approach to phylogenetic analysis of the genus Bifidobacterium based on partial HSP60 gene sequences. Int J Syst Evol Microbiol51, 1633–1638.[CrossRef][Google Scholar]
Keane, T. M., Creevey, C. J., Pentony, M. M., Naughton, T. J. & McInerney, J. O.(2006). Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified. BMC Evol Biol6, 29[CrossRef][Google Scholar]
Kleerebezem, M., Boekhorst, J., van Kranenburg, R., Molenaar, D., Kuipers, O. P., Leer, R., Tarchini, R., Peters, S. A., Sandbrink, H. M. & other authors(2003). Complete genome sequence of Lactobacillus plantarum WCFS1. Proc Natl Acad Sci U S A100, 1990–1995.[CrossRef][Google Scholar]
Konstantinidis, K. T., Ramette, A. & Tiedje, J. M.(2006). The bacterial species definition in the genomic era. Philos Trans R Soc Lond B Biol Sci361, 1929–1940.[CrossRef][Google Scholar]
Korbel, J. O., Snel, B., Huynen, M. A. & Bork, P.(2002).shot: a web server for the construction of genome phylogenies. Trends Genet18, 158–162.[CrossRef][Google Scholar]
Kunst, F., Ogasawara, N., Moszer, I., Albertini, A. M., Alloni, G., Azevedo, V., Bertero, M. G., Bessieres, P., Bolotin, A. & other authors(1997). The complete genome sequence of the gram-positive bacterium Bacillus subtilis. Nature390, 249–256.[CrossRef][Google Scholar]
Liolios, K., Mavromatis, K., Tavernarakis, N. & Kyrpides, N. C.(2008). The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res36, D475–D479.[CrossRef][Google Scholar]
Ludwig, W. & Schleifer, K. H.(1999). Phylogeny of bacteria beyond the 16S rRNA standard. ASM News65, 752–757.
[Google Scholar]
Makarova, K. S. & Koonin, E. V.(2007). Evolutionary genomics of lactic acid bacteria. J Bacteriol189, 1199–1208.[CrossRef][Google Scholar]
Makarova, K., Slesarev, A., Wolf, Y., Sorokin, A., Mirkin, B., Koonin, E., Pavlov, A., Pavlova, N., Karamychev, V. & other authors(2006). Comparative genomics of the lactic acid bacteria. Proc Natl Acad Sci U S A103, 15611–15616.[CrossRef][Google Scholar]
Ochman, H.(2005). Genomes on the shrink. Proc Natl Acad Sci U S A102, 11959–11960.[CrossRef][Google Scholar]
Paulsen, I. T., Banerjei, L., Myers, G. S., Nelson, K. E., Seshadri, R., Read, T. D., Fouts, D. E., Eisen, J. A., Gill, S. R. & other authors(2003). Role of mobile DNA in the evolution of vancomycin-resistant Enterococcus faecalis. Science299, 2071–2074.[CrossRef][Google Scholar]
Pearson, W. R. & Lipman, D. J.(1988). Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A85, 2444–2448.[CrossRef][Google Scholar]
Philippe, H. & Douady, C. J.(2003). Horizontal gene transfer and phylogenetics. Curr Opin Microbiol6, 498–505.[CrossRef][Google Scholar]
Pridmore, R. D., Berger, B., Desiere, F., Vilanova, D., Barretto, C., Pittet, A. C., Zwahlen, M. C., Rouvet, M., Altermann, E. & other authors(2004). The genome sequence of the probiotic intestinal bacterium Lactobacillus johnsonii NCC 533. Proc Natl Acad Sci U S A101, 2512–2517.[CrossRef][Google Scholar]
Quevillon, E., Silventoinen, V., Pillai, S., Harte, N., Mulder, N., Apweiler, R. & Lopez, R.(2005). InterProScan: protein domains identifier. Nucleic Acids Res33, W116–W120[Google Scholar]
Retief, J. D.(2000). Phylogenetic analysis using phylip. Methods Mol Biol132, 243–258.
[Google Scholar]
Robinson, D. R. & Foulds, L. R.(1981). Comparison of phylogenetic trees. Math Biosci53, 131–147.[CrossRef][Google Scholar]
Rosselló-Mora, R. & Amann, R.(2001). The species concept for prokaryotes. FEMS Microbiol Rev25, 39–67.[CrossRef][Google Scholar]
Tamura, K., Dudley, J., Nei, M. & Kumar, S.(2007).mega4: molecular evolutionary genetics analysis (mega) software version 4.0. Mol Biol Evol24, 1596–1599.[CrossRef][Google Scholar]
Tatusov, R. L., Fedorova, N. D., Jackson, J. D., Jacobs, A. R., Kiryutin, B., Koonin, E. V., Krylov, D. M., Mazumder, R., Mekhedov, S. L. & other authors(2003). The COG database: an updated version includes eukaryotes. BMC Bioinformatics4, 41[CrossRef][Google Scholar]
van de Guchte, M., Penaud, S., Grimaldi, C., Barbe, V., Bryson, K., Nicolas, P., Robert, C., Oztas, S., Mangenot, S. & other authors(2006). The complete genome sequence of Lactobacillus bulgaricus reveals extensive and ongoing reductive evolution. Proc Natl Acad Sci U S A103, 9274–9279.[CrossRef][Google Scholar]
Vandamme, P., Pot, B., Gillis, M., de Vos, P., Kersters, K. & Swings, J.(1996). Polyphasic taxonomy, a consensus approach to bacterial systematics. Microbiol Rev60, 407–438.
[Google Scholar]
Ventura, M., Canchaya, C., Zink, R., Fitzgerald, G. F. & van Sinderen, D.(2004). Characterization of the groEL and groES loci in Bifidobacterium breve UCC 2003: genetic, transcriptional, and phylogenetic analyses. Appl Environ Microbiol70, 6197–6209.[CrossRef][Google Scholar]
Supplementary Fig. S1. Supernetwork based on a
combination of 141 separate protein trees.
[PDF](18 KB)
[PDF file of Supplementary
Tables S1-S3](20 KB)
Supplementary Table S4. Breakdown of all protein
collections in Table 1, group-specific present and absent
proteins,
Lactobacillus -specific proteins and meat-specific
proteins into protein locus tags, functional annotation and
low-level COG categories.
[Excel
file](422 KB)