Nucleotide sequence and structure of the human apolipoprotein E gene.
The gene for human apolipoprotein E (apo-E) was selected from a library of cloned genomic DNA by screening with a specific cDNA hybridization probe, and its structure was characterized. The complete nucleotide sequence of the gene as well as 856 nucleotides of the 5' flanking region and 629 nucleotides of the 3' flanking region were determined. Analysis of the sequence showed that the mRNA-encoding region of the apo-E gene consists of four exons separated by three introns. In comparison to the structure of the mRNA, the introns are located in the 5' noncoding region, in the codon for glycine at position -4 of the signal peptide region, and in the codon for arginine at position +61 of the mature protein. The overall lengths of the apo-E gene and its corresponding mRNA are 3597 and 1163 nucleotides, respectively; a mature plasma protein of 299 amino acids is produced by this gene. Examination of the 5' terminus of the gene by S1 nuclease mapping shows apparent multiple transcription initiation sites. The proximal 5' flanking region contains a "TATA box" element as well as two nearby inverted repeat elements. In addition, there are four Alu family sequences associated with the apo-E gene: an Alu sequence located near each end of the gene and two Alu sequences located in the second intron. This knowledge of the structure permits a molecular approach to characterizing the regulation of the apo-E gene.