Construction of a contiguous 874-kb sequence of the Escherichia coli-K12 genome corresponding to 50.0-68.8 min on the linkage map and analysis of its sequence features.
Yamamoto Y., Aiba H., Baba T., Hayashi K., Inada T., Isono K., Itoh T., Kimura S., Kitagawa M., Makino K., Miki T., Mitsuhashi N., Mizobuchi K., Mori H., Nakade S., Nakamura Y., Nashimoto H., Oshima T., Oyama S., Saito N., Sampei G., Satoh Y., Sivasundaram S., Tagami H., Takahashi H., Takeda J., Takemoto K., Uehara K., Wada C., Yamagata S., Horiuchi T., et al.
The contiguous 874.423 base pair sequence corresponding to the 50.0-68.8 min region on the genetic map of the Escherichia coli K-12 (W3110) was constructed by the determination of DNA sequences in the 50.0-57.9 min region (360 kb) and two large (100 kb in all) and five short gaps in the 57.9-68.8 min region whose sequences had been registered in the DNA databases. We analyzed its sequence features and found that this region contained at least 894 potential open reading frames (ORFs), of which 346 (38.7%) were previously reported, 158 (17.7%) were homologous to other known genes, 232 (26.0%) were identical or similar to hypothetical genes registered in databases, and the remaining 158 (17.7%) showed no significant similarity to any other genes. A homology search of the ORFs also identified several new gene clusters. Those include two clusters of fimbrial genes, a gene cluster of three genes encoding homologues of the human long chain fatty acid degradation enzyme complex in the mitochondrial membrane, a cluster of at least nine genes involved in the utilization of ethanolamine, a cluster of the secondary set of 11 hyc genes participating in the formate hydrogenlyase reaction and a cluster of five genes coding for the homologues of degradation enzymes for aromatic hydrocarbons in Pseudomonas putida. We also noted a variety of novel genes, including two ORFs, which were homologous to the putative genes encoding xanthine dehydrogenase in the fly and a protein responsible for axonal guidance and outgrowth of the rat, mouse and nematode. An isoleucine tRNA gene, designated ileY, was also newly identified at 60.0 min.