Thirty-two protein-encoding genes that are distributed widely among bacterial genomes were tested for the potential usefulness of their DNA sequences in assigning bacterial strains to species. From publicly available data, it was possible to make 49 pairwise comparisons of whole bacterial genomes that were related at the genus or subgenus level. DNA sequence identity scores for eight of the genes correlated strongly with overall sequence identity scores for the genome pairs. Even single-gene alignments could predict overall genome relatedness with a high degree of precision and accuracy. Predictions could be refined further by including two or three genes in the analysis. The proposal that sequence analysis of a small set of protein-encoding genes could reliably assign novel strains or isolates to bacterial species is strongly supported.
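The comparison described above can be illustrated with a minimal sketch, assuming each gene has already been aligned between the two genomes of a pair. The function names (`percent_identity`, `pearson_r`, `fit_line`) and all numeric values below are hypothetical placeholders introduced for illustration; they are not the study's data, software, or scoring scheme. The sketch computes a per-gene identity score, correlates such scores with whole-genome identity scores across genome pairs, and fits a least-squares line that could be used to predict genome relatedness from a single gene.

```python
from math import sqrt


def percent_identity(a: str, b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue  # skip gapped columns (an assumption of this sketch)
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def _centered_sums(xs, ys):
    """Return mean(x), mean(y) and the centered sums Sxy, Sxx, Syy."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return mx, my, sxy, sxx, syy


def pearson_r(xs, ys) -> float:
    """Pearson correlation between gene identity and genome identity scores."""
    _, _, sxy, sxx, syy = _centered_sums(xs, ys)
    return sxy / sqrt(sxx * syy)


def fit_line(xs, ys):
    """Least-squares slope and intercept for predicting ys from xs."""
    mx, my, sxy, sxx, _ = _centered_sums(xs, ys)
    slope = sxy / sxx
    return slope, my - slope * mx


if __name__ == "__main__":
    # Placeholder values only: one gene's identity score per genome pair,
    # and the corresponding whole-genome identity score for the same pair.
    gene_identity = [80.0, 85.0, 90.0, 95.0]
    genome_identity = [70.0, 78.0, 86.0, 94.0]

    r = pearson_r(gene_identity, genome_identity)
    slope, intercept = fit_line(gene_identity, genome_identity)
    print(f"r = {r:.3f}")

    # Predict genome identity for a new pair from a single-gene alignment.
    new_gene_score = percent_identity("ATGCATGC", "ATGAATGC")
    print(f"predicted genome identity = {slope * new_gene_score + intercept:.1f}")
```

Extending the sketch to two or three genes would amount to averaging their identity scores, or fitting a multiple regression, before making the prediction; which of these the analysis actually used is not stated here.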