Prediction of the coding sequences of unidentified human genes. XV. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro.
In order to obtain information on the coding sequences of unidentified human genes, we newly determined the sequences of 100 cDNA clones of unknown human genes, which we named KIAA1193 to KIAA1292, from two sets of size-fractionated human adult and fetal brain cDNA libraries. The results of our particular strategy to select cDNA clones which have the potentiality of coding for large proteins in vitro revealed that the average sizes of the inserts and the corresponding open reading frames reached 5.2 kb and 2.8 kb (933 amino acid residues), respectively. By the computational analysis of the predicted amino acid sequences against the OWL and Pfam databases, 58 predicted gene products were classified into the following five functional categories: cell signaling/communication, cell structure/motility, nucleic acid management, protein management and metabolism. It was also found that 30 gene products had homologues in the public databases which were similar in sequence throughout almost their entire regions to the newly identified genes. The chromosomal loci of the genes were assigned by using human-rodent hybrid panels unless their mapping data were already available in the public databases. The expression profiles of the genes were studied in 10 human tissues, 8 brain regions, spinal cord, fetal brain and fetal liver by reverse transcription-coupled polymerase chain reaction, products of which were quantified by enzyme-linked immunosorbent assay.