@article{mbs:/content/journal/micro/10.1099/13500872-142-11-3005, author = "Capuano, Véronique and Galleron, Nathalie and Pujic, Petar and Sorokin, Alexei and Ehrlich, S. Dusko", title = "Organization of the Bacillus subtilis 168 chromosome between kdg and the attachment site of the SPβ prophage: use of Long Accurate PCR and yeast artificial chromosomes for sequencing", journal= "Microbiology", year = "1996", volume = "142", number = "11", pages = "3005-3015", doi = "https://doi.org/10.1099/13500872-142-11-3005", url = "https://www.microbiologyresearch.org/content/journal/micro/10.1099/13500872-142-11-3005", publisher = "Microbiology Society", issn = "1465-2080", type = "Journal Article", keywords = "Bacillus subtilis", keywords = "Long Accurate PCR", keywords = "yeast artificial chromosome", keywords = "genome sequencing", abstract = "Within the Bacillus subtilis genome sequencing project, the region between lysA and ilvA was assigned to our laboratory. In this report we present the sequence of the last 36 kb of this region, between the kdg operon and the attachment site of the SPβ prophage. A two-step strategy was used for the sequencing. In the first step, total chromosomal DNA was cloned in phage M13-based vectors and the clones carrying inserts from the target region were identified by hybridization with a cognate yeast artificial chromosome (YAC) from our collection. Sequencing of the clones allowed us to establish a number of contigs. In the second step the contigs were mapped by Long Accurate (LA) PCR and the remaining gaps closed by sequencing of the PCR products. The level of sequence inaccuracy due to LA PCR errors appeared to be about 1 in 10000, which does not affect significantly the final sequence quality. This two-step strategy is efficient and we suggest that it can be applied to sequencing of longer chromosomal regions. The 36 kb sequence contains 38 coding sequences (CDSs), 19 of which encode unknown proteins. Seven genetic loci already mapped in this region, xpt, metB, ilvA, ilvD, thyB, dfrA and degR were identified. Eleven CDSs were found to display significant similarities to known proteins from the data banks, suggesting possible functions for some of the novel genes: cspD may encode a cold shock protein; bcsA, the first bacterial homologue of chalcone synthase; exol, a 5′ to 3′ exonuclease, similar to that of DNA polymerase I of Escherichia coli; and bsaA, a stress-response-associated protein. The protein encoded by ypIP has homology with the transcriptional NifA-like regulators. The arrangement of the genes relative to possible promoters and terminators suggests 19 potential transcription units.", }