Sequence evaluation of four pooled-tissue normalized bovine cDNA libraries and construction of a gene index for cattle.
Smith T.P.L., Grosse W.M., Freking B.A., Roberts A.J., Stone R.T., Casas E., Wray J.E., White J., Cho J., Fahrenkrug S.C., Bennett G.L., Heaton M.P., Laegreid W.W., Rohrer G.A., Chitko-McKown C.G., Pertea G., Holt I., Karamycheva S., Liang F., Quackenbush J., Keele J.W.
An essential component of functional genomics studies is the sequence of DNA expressed in tissues of interest. To provide a resource of bovine-specific expressed sequence data and facilitate this powerful approach in cattle research, four normalized cDNA libraries were produced and arrayed for high-throughput sequencing. The libraries were made with RNA pooled from multiple tissues to increase efficiency of normalization and maximize the number of independent genes for which sequence data were obtained. Target tissues included those with highest likelihood to have impact on production parameters of animal health, growth, reproductive efficiency, and carcass merit. Success of normalization and inter- and intralibrary redundancy were assessed by collecting 6000-23,000 sequences from each of the libraries (68,520 total sequences deposited in GenBank). Sequence comparison and assembly of these sequences was performed in combination with 56,500 other bovine EST sequences present in the GenBank dbEST database to construct a cattle Gene Index (available from The Institute for Genomic Research at http://www.tigr.org/tdb/tgi.shtml). The 124,381 bovine ESTs present in GenBank at the time of the analysis form 16,740 assemblies that are listed and annotated on the Web site. Analysis of individual library sequence data indicates that the pooled-tissue approach was highly effective in preparing libraries for efficient deep sequencing.