Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot Q09472 (EP300_HUMAN)

Last modified November 25, 2008. Version 118. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (6) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Binary interactions · Sequence annotation (Features) · Sequences · References · Web resources · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Histone acetyltransferase p300
    EC=2.3.1.48
Alternative name(s):
    E1A-associated protein p300
Gene names
Name: EP300
Synonyms: P300
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length2414 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Functions as histone acetyltransferase and regulates transcription via chromatin remodeling. Acetylates all four core histones in nucleosomes. Histone acetylation gives an epigenetic tag for transcriptional activation. Binds to and may be involved in the transforming capacity of the adenovirus E1A protein. Mediates cAMP-gene regulation by binding specifically to phosphorylated CREB protein. In case of HIV-1 infection, it is recruited by the viral protein Tat. Regulates Tat's transactivating activity and may help inducing chromatin remodeling of proviral genes.

Catalytic activity

Acetyl-CoA + histone = CoA + acetylhistone.

Subunit structure

Interacts with phosphorylated CREB1 By similarity. Interacts with DTX1, EID1, ELF3, FEN1, LEF1, NCOA1, NCOA6, NR3C1, PCAF, PELP1, SPIB, SRY, TCF7L2, TP53, SRCAP, TTC5, JMY and TRERF1. The TAZ-type 1 domain interacts with HIF1A. Probably part of a complex with HIF1A and CREBBP. Part of a complex containing CARM1 and NCOA2/GRIP1. Interacts with ING4 and this interaction may be indirect. Interacts with the C-terminal region of CITED4. Interacts with HTLV-1 Tax and p30II. Interacts with and acetylates HIV-1 Tat.

Subcellular location

Nucleus.

Post-translational modification

Phosphorylated.

Methylated at Arg-580 and Arg-604 in the KIX domain by CARM1, which blocks association with CREB, inhibits CREB signaling and activates apoptotic response. Also methylated at Arg-2142 by CARM1, which impairs interaction with NCOA2/GRIP1.

Citrullinated at Arg-2142 by PADI4, which impairs methylation by CARM1 and promotes interaction with NCOA2/GRIP1.

Involvement in disease

Defects in EP300 may play a role in epithelial cancer.

Chromosomal aberrations involving EP300 may be a cause of acute myeloid leukemias. Translocation t(8;22)(p11;q13) with MYST3.

Defects in EP300 are a cause of Rubinstein-Taybi syndrome (RSTS) [MIM:180849]. RSTS is an autosomal dominant disorder characterized by craniofacial abnormalities, broad thumbs, broad big toes, mental retardation and a propensity for development of malignancies.

Sequence similarities

Contains 1 bromo domain.

Contains 1 KIX domain.

Contains 2 TAZ-type zinc fingers.

Contains 1 ZZ-type zinc finger.

Ontologies

Keywords

   Biological processCell cycle
Host-virus interaction
Transcription
Transcription regulation
   Cellular componentNucleus
   Coding sequence diversityChromosomal rearrangement
Polymorphism
   DiseaseDisease mutation
   DomainBromodomain
Zinc-finger
   LigandMetal-binding
Zinc
   Molecular functionTransferase
   PTMAcetylation
Citrullination
Methylation
Phosphoprotein
   Technical term3D-structure
Direct protein sequencing

Gene Ontology (GO)

   Biological processN-terminal peptidyl-lysine acetylation

Inferred from direct assay. Source: UniProtKB

apoptosis

Inferred from mutant phenotype. Source: UniProtKB

cell cycle

Inferred from electronic annotation. Source: UniProtKB-KW

histone acetylation

Inferred from electronic annotation. Source: InterPro

homeostatic process

Traceable author statement. Source: UniProtKB

interspecies interaction between organisms

Inferred from electronic annotation. Source: UniProtKB-KW

nervous system development

Traceable author statement. Source: ProtInc

positive regulation of transcription factor activity

Inferred from direct assay. Source: UniProtKB

response to hypoxia

Inferred from direct assay. Source: UniProtKB

signal transduction

Non-traceable author statement. Source: UniProtKB

   Cellular componentcytoplasm

Inferred from direct assay. Source: HPA

histone acetyltransferase complex

Inferred from electronic annotation. Source: InterPro

   Molecular functionhistone acetyltransferase activity

Inferred from direct assay. Source: UniProtKB

protein C-terminus binding

Traceable author statement. Source: ProtInc

transcription coactivator activity

Inferred from direct assay. Source: UniProtKB

transcription factor activity

Traceable author statement. Source: ProtInc

zinc ion binding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 24142414Histone acetyltransferase p300
PRO_0000211193

Regions

Domain566 – 64580KIX
Domain1067 – 113973Bromo
Zinc finger331 – 41787TAZ-type 1
Zinc finger1664 – 170744ZZ-type
Zinc finger1728 – 180982TAZ-type 2
Region1572 – 1818247Binding region for E1A adenovirus
Region2003 – 2212210Interaction with HTLV-1 Tax
Region2041 – 2240200Interaction with NCOA2
Motif11 – 177Nuclear localization signal Potential
Compositional bias797 – 8004Poly-Ser
Compositional bias1519 – 15268Poly-Glu
Compositional bias2066 – 20694Poly-Gln
Compositional bias2190 – 21956Poly-Gln

Sites

Site31 – 322Breakpoint for translocation to form MYST3-EP300 and EP300-MYST3
Site20881Interaction with NCOA2
Site21421Interaction with NCOA2

Amino acid modifications

Modified residue5801Omega-N-methylated arginine
Modified residue6041Omega-N-methylated arginine
Modified residue15541N6-acetyllysine
Modified residue15551N6-acetyllysine
Modified residue15581N6-acetyllysine
Modified residue15601N6-acetyllysine
Modified residue21421Asymmetric dimethylarginine; alternate
Modified residue21421Citrulline; alternate

Natural variations

Natural variant8271L → P in breast cancer.
VAR_014428
Natural variant9971I → V: dbSNP rs20551.
VAR_020425
Natural variant10131E → G in breast cancer.
VAR_014429
Natural variant16501S → Y in pancreatic cancer.
VAR_014430
Natural variant21741T → S: dbSNP rs5758252.
VAR_038376
Natural variant22211P → Q in colorectal cancer. dbSNP rs28937578.
VAR_014431
Natural variant22231P → Q: dbSNP rs1046088.
VAR_038377

Experimental info

Mutagenesis20561R → K: No effect on interaction with NCOA2
Mutagenesis20881R → K: Abolishes interaction with NCOA2
Mutagenesis21421R → K: Strongly reduces interaction with NCOA2

Secondary structure

....................................................... 2414
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
Q09472-1 [UniParc].

Last modified November 1, 1996. Version 1.
Checksum: 6BFF909EE4B9D693

FASTA2,414264,144
        10         20         30         40         50         60 
MAENVVEPGP PSAKRPKLSS PALSASASDG TDFGSLFDLE HDLPDELINS TELGLTNGGD 

        70         80         90        100        110        120 
INQLQTSLGM VQDAASKHKQ LSELLRSGSS PNLNMGVGGP GQVMASQAQQ SSPGLGLINS 

       130        140        150        160        170        180 
MVKSPMTQAG LTSPNMGMGT SGPNQGPTQS TGMMNSPVNQ PAMGMNTGTN AGMNPGMLAA 

       190        200        210        220        230        240 
GNGQGIMPNQ VMNGSIGAGR GRQDMQYPNP GMGSAGNLLT EPLQQGSPQM GGQTGLRGPQ 

       250        260        270        280        290        300 
PLKMGMMNNP NPYGSPYTQN PGQQIGASGL GLQIQTKTVL SNNLSPFAMD KKAVPGGGMP 

       310        320        330        340        350        360 
NMGQQPAPQV QQPGLVTPVA QGMGSGAHTA DPEKRKLIQQ QLVLLLHAHK CQRREQANGE 

       370        380        390        400        410        420 
VRQCNLPHCR TMKNVLNHMT HCQSGKSCQV AHCASSRQII SHWKNCTRHD CPVCLPLKNA 

       430        440        450        460        470        480 
GDKRNQQPIL TGAPVGLGNP SSLGVGQQSA PNLSTVSQID PSSIERAYAA LGLPYQVNQM 

       490        500        510        520        530        540 
PTQPQVQAKN QQNQQPGQSP QGMRPMSNMS ASPMGVNGGV GVQTPSLLSD SMLHSAINSQ 

       550        560        570        580        590        600 
NPMMSENASV PSLGPMPTAA QPSTTGIRKQ WHEDITQDLR NHLVHKLVQA IFPTPDPAAL 

       610        620        630        640        650        660 
KDRRMENLVA YARKVEGDMY ESANNRAEYY HLLAEKIYKI QKELEEKRRT RLQKQNMLPN 

       670        680        690        700        710        720 
AAGMVPVSMN PGPNMGQPQP GMTSNGPLPD PSMIRGSVPN QMMPRITPQS GLNQFGQMSM 

       730        740        750        760        770        780 
AQPPIVPRQT PPLQHHGQLA QPGALNPPMG YGPRMQQPSN QGQFLPQTQF PSQGMNVTNI 

       790        800        810        820        830        840 
PLAPSSGQAP VSQAQMSSSS CPVNSPIMPP GSQGSHIHCP QLPQPALHQN SPSPVPSRTP 

       850        860        870        880        890        900 
TPHHTPPSIG AQQPPATTIP APVPTPPAMP PGPQSQALHP PPRQTPTPPT TQLPQQVQPS 

       910        920        930        940        950        960 
LPAAPSADQP QQQPRSQQST AASVPTPNAP LLPPQPATPL SQPAVSIEGQ VSNPPSTSST 

       970        980        990       1000       1010       1020 
EVNSQAIAEK QPSQEVKMEA KMEVDQPEPA DTQPEDISES KVEDCKMEST ETEERSTELK 

      1030       1040       1050       1060       1070       1080 
TEIKEEEDQP STSATQSSPA PGQSKKKIFK PEELRQALMP TLEALYRQDP ESLPFRQPVD 

      1090       1100       1110       1120       1130       1140 
PQLLGIPDYF DIVKSPMDLS TIKRKLDTGQ YQEPWQYVDD IWLMFNNAWL YNRKTSRVYK 

      1150       1160       1170       1180       1190       1200 
YCSKLSEVFE QEIDPVMQSL GYCCGRKLEF SPQTLCCYGK QLCTIPRDAT YYSYQNRYHF 

      1210       1220       1230       1240       1250       1260 
CEKCFNEIQG ESVSLGDDPS QPQTTINKEQ FSKRKNDTLD PELFVECTEC GRKMHQICVL 

      1270       1280       1290       1300       1310       1320 
HHEIIWPAGF VCDGCLKKSA RTRKENKFSA KRLPSTRLGT FLENRVNDFL RRQNHPESGE 

      1330       1340       1350       1360       1370       1380 
VTVRVVHASD KTVEVKPGMK ARFVDSGEMA ESFPYRTKAL FAFEEIDGVD LCFFGMHVQE 

      1390       1400       1410       1420       1430       1440 
YGSDCPPPNQ RRVYISYLDS VHFFRPKCLR TAVYHEILIG YLEYVKKLGY TTGHIWACPP 

      1450       1460       1470       1480       1490       1500 
SEGDDYIFHC HPPDQKIPKP KRLQEWYKKM LDKAVSERIV HDYKDIFKQA TEDRLTSAKE 

      1510       1520       1530       1540       1550       1560 
LPYFEGDFWP NVLEESIKEL EQEEEERKRE ENTSNESTDV TKGDSKNAKK KNNKKTSKNK 

      1570       1580       1590       1600       1610       1620 
SSLSRGNKKK PGMPNVSNDL SQKLYATMEK HKEVFFVIRL IAGPAANSLP PIVDPDPLIP 

      1630       1640       1650       1660       1670       1680 
CDLMDGRDAF LTLARDKHLE FSSLRRAQWS TMCMLVELHT QSQDRFVYTC NECKHHVETR 

      1690       1700       1710       1720       1730       1740 
WHCTVCEDYD LCITCYNTKN HDHKMEKLGL GLDDESNNQQ AAATQSPGDS RRLSIQRCIQ 

      1750       1760       1770       1780       1790       1800 
SLVHACQCRN ANCSLPSCQK MKRVVQHTKG CKRKTNGGCP ICKQLIALCC YHAKHCQENK 

      1810       1820       1830       1840       1850       1860 
CPVPFCLNIK QKLRQQQLQH RLQQAQMLRR RMASMQRTGV VGQQQGLPSP TPATPTTPTG 

      1870       1880       1890       1900       1910       1920 
QQPTTPQTPQ PTSQPQPTPP NSMPPYLPRT QAAGPVSQGK AAGQVTPPTP PQTAQPPLPG 

      1930       1940       1950       1960       1970       1980 
PPPTAVEMAM QIQRAAETQR QMAHVQIFQR PIQHQMPPMT PMAPMGMNPP PMTRGPSGHL 

      1990       2000       2010       2020       2030       2040 
EPGMGPTGMQ QQPPWSQGGL PQPQQLQSGM PRPAMMSVAQ HGQPLNMAPQ PGLGQVGISP 

      2050       2060       2070       2080       2090       2100 
LKPGTVSQQA LQNLLRTLRS PSSPLQQQQV LSILHANPQL LAAFIKQRAA KYANSNPQPI 

      2110       2120       2130       2140       2150       2160 
PGQPGMPQGQ PGLQPPTMPG QQGVHSNPAM QNMNPMQAGV QRAGLPQQQP QQQLQPPMGG 

      2170       2180       2190       2200       2210       2220 
MSPQAQQMNM NHNTMPSQFR DILRRQQMMQ QQQQQGAGPG IGPGMANHNQ FQQPQGVGYP 

      2230       2240       2250       2260       2270       2280 
PQPQQRMQHH MQQMQQGNMG QIGQLPQALG AEAGASLQAY QQRLLQQQMG SPVQPNPMSP 

      2290       2300       2310       2320       2330       2340 
QQHMLPNQAQ SPHLQGQQIP NSLSNQVRSP QPVPSPRPQS QPPHSSPSPR MQPQPSPHHV 

      2350       2360       2370       2380       2390       2400 
SPQTSSPHPG LVAAQANPME QGHFASPDQN SMLSQLASNP GMANLHGASA TDLGLSTDNS 

      2410 
DLNSNLSQST LDIH 

« Hide

References

« Hide 'large scale' references
[1]"Molecular cloning and functional analysis of the adenovirus E1A-associated 300-kD protein (p30