Construction and characterization of a full length-enriched and a 5'-end-enriched cDNA library.
Using 'oligo-capped' mRNA [Maruyama, K., Sugano, S., 1994. Oligo-capping: a simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides. Gene 138, 171-174], whose cap structure was replaced by a synthetic oligonucleotide, we constructed two types of cDNA library. One is a 'full length-enriched cDNA library' which has a high content of full-length cDNA clones and the other is a '5'-end-enriched cDNA library', which has a high content of cDNA clones with their mRNA start sites. The 5'-end-enriched library was constructed especially for isolating the mRNA start sites of long mRNAs. In order to characterize these libraries, we performed one-pass sequencing of randomly selected cDNA clones from both libraries (84 clones for the full length-enriched cDNA library and 159 clones for the 5'-end-enriched cDNA library). The cDNA clones of the polypeptide chain elongation factor 1 alpha were most frequently (nine clones) isolated, and more than 80% of them (eight clones) contained the mRNA start site of the gene. Furthermore, about 80% of the cDNA clones of both libraries whose sequence matched with known genes had the known 5' ends or sequences upstream of the known 5' ends (28 out of 35 for the full length-enriched library and 51 out of 62 for the 5'-end-enriched library). The longest full-length clone of the full length-enriched cDNA library was about 3300 bp (among 28 clones). In contrast, seven clones (out of the 51 clones with the mRNA start sites) from the 5'-end-enriched cDNA library came from mRNAs whose length is more than 3500 bp. These cDNA libraries may be useful for generating 5' ESTs with the information of the mRNA start sites that are now scarce in the EST database.