Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Histone-lysine N-methyltransferase 2A

Gene

KMT2A

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Histone methyltransferase that plays an essential role in early development and hematopoiesis. Catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) complex and acetylation of 'Lys-16' of histone H4 (H4K16ac). In the MLL1/MLL complex, it specifically mediates H3K4me, a specific tag for epigenetic transcriptional activation. Has weak methyltransferase activity by itself, and requires other component of the MLL1/MLL complex to obtain full methyltransferase activity. Has no activity toward histone H3 phosphorylated on 'Thr-3', less activity toward H3 dimethylated on 'Arg-8' or 'Lys-9', while it has higher activity toward H3 acetylated on 'Lys-9'. Required for transcriptional activation of HOXA9. Promotes PPP1R15A-induced apoptosis. Plays a critical role in the control of circadian gene expression and is essential for the transcriptional activation mediated by the CLOCK-ARNTL/BMAL1 heterodimer. Establishes a permissive chromatin state for circadian transcription by mediating a rhythmic methylation of 'Lys-4' of histone H3 (H3K4me) and this histone modification directs the circadian acetylation at H3K9 and H3K14 allowing the recruitment of CLOCK-ARNTL/BMAL1 to chromatin (By similarity).By similarity4 Publications

Catalytic activityi

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone].2 Publications

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sitei3765 – 37651Important for WDR5-recognition and binding1 Publication
Binding sitei3839 – 38391S-adenosyl-L-methioninePROSITE-ProRule annotation
Binding sitei3841 – 38411S-adenosyl-L-methioninePROSITE-ProRule annotation
Binding sitei3883 – 38831S-adenosyl-L-methioninePROSITE-ProRule annotation
Metal bindingi3909 – 39091Zinc2 Publications
Metal bindingi3957 – 39571Zinc2 Publications
Binding sitei3958 – 39581S-adenosyl-L-methioninePROSITE-ProRule annotation
Metal bindingi3959 – 39591Zinc2 Publications
Metal bindingi3964 – 39641Zinc2 Publications

Regions

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
DNA bindingi169 – 18012A.T hook 1Add
BLAST
DNA bindingi217 – 22711A.T hook 2Add
BLAST
DNA bindingi301 – 3099A.T hook 3
Zinc fingeri1147 – 119549CXXC-typePROSITE-ProRule annotationAdd
BLAST
Zinc fingeri1431 – 148252PHD-type 1PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri1479 – 153355PHD-type 2PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri1566 – 162762PHD-type 3PROSITE-ProRule annotationAdd
BLAST

GO - Molecular functioni

  • AT DNA binding Source: UniProtKB
  • chromatin binding Source: Ensembl
  • core promoter sequence-specific DNA binding Source: UniProtKB
  • histone-lysine N-methyltransferase activity Source: Reactome
  • histone methyltransferase activity (H3-K4 specific) Source: UniProtKB
  • identical protein binding Source: IntAct
  • lysine-acetylated histone binding Source: UniProtKB
  • protein homodimerization activity Source: UniProtKB
  • transcription factor activity, sequence-specific DNA binding Source: ProtInc
  • transcription regulatory region DNA binding Source: UniProtKB
  • unmethylated CpG binding Source: UniProtKB
  • zinc ion binding Source: UniProtKB

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Chromatin regulator, Methyltransferase, Transferase

Keywords - Biological processi

Apoptosis, Biological rhythms, Transcription, Transcription regulation

Keywords - Ligandi

DNA-binding, Metal-binding, S-adenosyl-L-methionine, Zinc

Enzyme and pathway databases

ReactomeiR-HSA-3214841. PKMTs methylate histone lysines.
SIGNORiQ03164.

Names & Taxonomyi

Protein namesi
Recommended name:
Histone-lysine N-methyltransferase 2A (EC:2.1.1.43)
Short name:
Lysine N-methyltransferase 2A
Alternative name(s):
ALL-1
CXXC-type zinc finger protein 7
Myeloid/lymphoid or mixed-lineage leukemia
Myeloid/lymphoid or mixed-lineage leukemia protein 1
Trithorax-like protein
Zinc finger protein HRX
Cleaved into the following 2 chains:
Alternative name(s):
N-terminal cleavage product of 320 kDa
Short name:
p320
Alternative name(s):
C-terminal cleavage product of 180 kDa
Short name:
p180
Gene namesi
Name:KMT2A
Synonyms:ALL1, CXXC7, HRX, HTRX, MLL, MLL1, TRX1
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 11

Organism-specific databases

HGNCiHGNC:7132. KMT2A.

Subcellular locationi

MLL cleavage product C180 :
  • Nucleus

  • Note: Localizes to a diffuse nuclear pattern when not associated with MLL cleavage product N320.

GO - Cellular componenti

  • cytoplasm Source: HPA
  • histone methyltransferase complex Source: UniProtKB
  • MLL1 complex Source: UniProtKB
  • nucleoplasm Source: HPA
  • nucleus Source: UniProtKB
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Involvement in diseasei

Wiedemann-Steiner syndrome (WDSTS)1 Publication
The disease is caused by mutations affecting the gene represented in this entry.
Disease descriptionA syndrome characterized by hairy elbows (hypertrichosis cubiti), intellectual disability, a distinctive facial appearance, and short stature. Facial characteristics include long eyelashes, thick or arched eyebrows with a lateral flare, and downslanting and vertically narrow palpebral fissures.
See also OMIM:605130

Chromosomal aberrations involving KMT2A are a cause of acute leukemias. Translocation t(1;11)(q21;q23) with MLLT11/AF1Q; translocation t(3;11)(p21;q23) with NCKIPSD/AF3p21; translocation t(3,11)(q25,q23) with GMPS; translocation t(4;11)(q21;q23) with AFF1/MLLT2/AF4; insertion ins(5;11)(q31;q13q23) with AFF4/AF5Q31; translocation t(5;11)(q12;q23) with AF5-alpha/CENPK; translocation t(6;11)(q27;q23) with MLLT4/AF6; translocation t(9;11)(p22;q23) with MLLT3/AF9; translocation t(10;11)(p11.2;q23) with ABI1; translocation t(10;11)(p12;q23) with MLLT10/AF10; t(11;15)(q23;q14) with CASC5 and ZFYVE19; translocation t(11;17)(q23;q21) with MLLT6/AF17; translocation t(11;19)(q23;p13.3) with ELL; translocation t(11;19)(q23;p13.3) with MLLT1/ENL; translocation t(11;19)(q23;p23) with GAS7; translocation t(X;11)(q13;q23) with FOXO4/AFX1. Translocation t(3;11)(q28;q23) with LPP. Translocation t(10;11)(q22;q23) with TET1. Translocation t(9;11)(q34;q23) with DAB2IP. Translocation t(4;11)(p12;q23) with FRYL. Fusion proteins KMT2A-MLLT1, KMT2A-MLLT3 and KMT2A-ELL interact with PPP1R15A and, on the contrary to unfused KMT2A, inhibit PPP1R15A-induced apoptosis.

A chromosomal aberration involving KMT2A may be a cause of chronic neutrophilic leukemia. Translocation t(4;11)(q21;q23) with SEPT11.

Mutagenesis

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Mutagenesisi1151 – 11511R → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1153 – 11531R → A: No effect on stability or DNA-binding. 1 Publication
Mutagenesisi1154 – 11541R → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1155 – 11551C → A: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1158 – 11581C → A: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1161 – 11611C → A: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1162 – 11621Q → A: No effect on stability or DNA-binding. 1 Publication
Mutagenesisi1166 – 11661D → A: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1167 – 11671C → A: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1170 – 11701C → A: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1172 – 11721N → A: No effect on stability or DNA-binding. 1 Publication
Mutagenesisi1173 – 11731C → A: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1175 – 11751D → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1176 – 11761K → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1178 – 11814KFGG → AAAA: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1178 – 11781K → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1179 – 11791F → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1183 – 11831N → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1185 – 11851K → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1186 – 11861K → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1187 – 11871Q → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1188 – 11881C → A: No effect on stability or DNA-binding. 1 Publication
Mutagenesisi1189 – 11891C → A: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1192 – 11921R → A: Abolishes zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1193 – 11931K → A: Impairs DNA-binding. 1 Publication
Mutagenesisi1194 – 11941C → A: Impair zinc-binding, leading to unfold the CXXC-type zinc finger and abolish DNA-binding. 1 Publication
Mutagenesisi1195 – 11951Q → A: No effect on stability or DNA-binding. 1 Publication
Mutagenesisi1196 – 11961N → A: No effect on stability or DNA-binding. 1 Publication
Mutagenesisi2666 – 26672DG → AA: Reduces cleavage without abolishing it. Abolishes cleavage by TASP1; when associated with 2718-A--A-2720. 1 Publication
Mutagenesisi2718 – 27203DGV → AAA: Abolishes cleavage by TASP1; when associated with 2666-A-A-2667. 2 Publications
Mutagenesisi3858 – 38581Y → A: Impairs methyltransferase activity toward unmodified or monomethylated H3K4me. 1 Publication
Mutagenesisi3858 – 38581Y → F: Slightly affects methyltransferase activity toward unmodified or monomethylated H3K4me. 1 Publication
Mutagenesisi3867 – 38671Q → A: Slightly affects methyltransferase activity of the enzyme alone, while it impairs methyltransferase activity in complex; when associated with A-3871. 1 Publication
Mutagenesisi3869 – 38691D → A: Does not affect methyltransferase activity of the enzyme alone or in complex; when associated with A-3872. 1 Publication
Mutagenesisi3871 – 38711R → A: Slightly affects methyltransferase activity of the enzyme alone, while it impairs methyltransferase activity in complex; when associated with A-3867. 1 Publication
Mutagenesisi3872 – 38721E → A: Does not affect methyltransferase activity of the enzyme alone or in complex; when associated with A-3869. 1 Publication
Mutagenesisi3874 – 38741Y → A: Affects methyltransferase activity of the enzyme alone, while it does not affect methyltransferase activity in complex; when associated with A-3878. 1 Publication
Mutagenesisi3878 – 38781K → A: Affects methyltransferase activity of the enzyme alone, while it does not affect methyltransferase activity in complex; when associated with A-3874. 1 Publication
Mutagenesisi3906 – 39061N → A: Loss of the histone H3 methyltransferase activity. 1 Publication
Mutagenesisi3942 – 39421Y → A or F: Impairs methyltransferase activity toward unmodified or monomethylated H3K4me. 2 Publications
Mutagenesisi3942 – 39421Y → F: Shifts from a specific monomethyltransferase to a di- and trimethyltransferase activity. 2 Publications

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sitei1334 – 13352Breakpoint for translocation to form KMT2A-ZFYVE19 oncogene
Sitei1362 – 13632Breakpoint for translocation to form KMT2A-AF3P21 and KMT2A-CASC5 oncogenes
Sitei1362 – 13632Breakpoint for translocation to form KMT2A-CENPK oncogene
Sitei1362 – 13621Breakpoint for translocation to form KMT2A-FRYL fusion protein
Sitei1406 – 14072Breakpoint for translocation to form KMT2A-AFF4 fusion protein
Sitei1444 – 14452Breakpoint for translocation to form KMT2A-GAS7 oncogene
Sitei1444 – 14452Breakpoint for translocation to form KMT2A-LPP

Keywords - Diseasei

Proto-oncogene

Organism-specific databases

MalaCardsiKMT2A.
MIMi159555. gene+phenotype.
605130. phenotype.
Orphaneti402017. 'Acute myeloid leukemia with t(9;11)(p22;q23)'.
98837. Acute biphenotypic leukemia.
98831. Acute myeloid leukemia with 11q23 abnormalities.
98835. Acute undifferentiated leukemia.
98836. Bilineal acute leukemia.
99860. Precursor B-cell acute lymphoblastic leukemia.
319182. Wiedemann-Steiner syndrome.
PharmGKBiPA241.

Chemistry

ChEMBLiCHEMBL3137282.

Polymorphism and mutation databases

BioMutaiKMT2A.
DMDMi146345435.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 39693969Histone-lysine N-methyltransferase 2APRO_0000124876Add
BLAST
Chaini1 – 27182718MLL cleavage product N320PRO_0000390949Add
BLAST
Chaini2719 – 39691251MLL cleavage product C180PRO_0000390950Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei153 – 1531PhosphoserineCombined sources
Modified residuei197 – 1971PhosphoserineCombined sources
Cross-linki216 – 216Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin)1 Publication
Cross-linki220 – 220Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin)1 Publication
Cross-linki221 – 221Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin)1 Publication
Modified residuei239 – 2391N6-acetyllysineBy similarity
Modified residuei373 – 3731N6-acetyllysineBy similarity
Modified residuei518 – 5181PhosphoserineCombined sources
Modified residuei636 – 6361N6-acetyllysineCombined sources
Modified residuei680 – 6801PhosphoserineCombined sources
Modified residuei840 – 8401PhosphothreonineCombined sources
Modified residuei926 – 9261PhosphoserineCombined sources
Modified residuei1056 – 10561PhosphoserineCombined sources
Modified residuei1130 – 11301N6-acetyllysineCombined sources
Modified residuei1235 – 12351N6-acetyllysineCombined sources
Modified residuei1837 – 18371PhosphoserineCombined sources
Modified residuei1845 – 18451PhosphothreonineCombined sources
Modified residuei1858 – 18581PhosphoserineCombined sources
Modified residuei2098 – 20981PhosphoserineCombined sources
Modified residuei2147 – 21471PhosphothreonineCombined sources
Modified residuei2151 – 21511PhosphoserineCombined sources
Modified residuei2201 – 22011PhosphoserineCombined sources
Modified residuei2525 – 25251PhosphothreonineCombined sources
Modified residuei2611 – 26111PhosphoserineCombined sources
Modified residuei2796 – 27961PhosphoserineCombined sources
Modified residuei2955 – 29551PhosphoserineCombined sources
Modified residuei2958 – 29581N6-acetyllysineBy similarity
Modified residuei3036 – 30361PhosphoserineCombined sources
Modified residuei3372 – 33721PhosphothreonineCombined sources
Modified residuei3462 – 34621N6-acetyllysineBy similarity
Modified residuei3511 – 35111PhosphoserineCombined sources
Modified residuei3515 – 35151PhosphoserineCombined sources
Modified residuei3527 – 35271PhosphoserineCombined sources

Post-translational modificationi

Proteolytic cleavage by TASP1 generates MLL cleavage product N320 and MLL cleavage product C180, which reassemble through a non-covalent association. 2 cleavage sites exist, cleavage site 1 (CS1) and cleavage site 2 (CS2), to generate MLL cleavage products N320 and C180. CS2 is the major site.2 Publications

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sitei2666 – 26672Cleavage; by TASP1, site 11 Publication
Sitei2718 – 27192Cleavage; by TASP1, site 21 Publication

Keywords - PTMi

Acetylation, Isopeptide bond, Phosphoprotein, Ubl conjugation

Proteomic databases

EPDiQ03164.
MaxQBiQ03164.
PaxDbiQ03164.
PeptideAtlasiQ03164.
PRIDEiQ03164.

PTM databases

iPTMnetiQ03164.
PhosphoSiteiQ03164.

Expressioni

Tissue specificityi

Heart, lung, brain and T- and B-lymphocytes.

Gene expression databases

BgeeiENSG00000118058.
CleanExiHS_MLL.
ExpressionAtlasiQ03164. baseline and differential.
GenevisibleiQ03164. HS.

Organism-specific databases

HPAiCAB017794.
CAB024270.
HPA044910.

Interactioni

Subunit structurei

MLL cleavage product N320 heterodimerizes with MLL cleavage product C180 (via SET and FYRC domains). Component of some MLL1/MLL complex, at least composed of the core components KMT2A/MLL1, ASH2L, HCFC1/HCF1, HCFC2, WDR5, DPY30 and RBBP5, as well as the facultative components BAP18, CHD8, E2F6, HSP70, INO80C, KANSL1, LAS1L, MAX, MCRS1, MEN1, MGA, KAT8/MOF, PELP1, PHF20, PRP31, RING2, RUVB1/TIP49A, RUVB2/TIP49B, SENP3, TAF1, TAF4, TAF6, TAF7, TAF9 and TEX10. Interacts with WDR5; the interaction is direct. Interacts with KAT8/MOF; the interaction is direct. Interacts with SBF1 and PPP1R15A. Interacts with ZNF335. Interacts with CLOCK and ARNTL/BMAL1 in a circadian manner (By similarity).By similarity13 Publications

Binary interactionsi

WithEntry#Exp.IntActNotes
itself5EBI-591370,EBI-591370
CDC73Q6P1J94EBI-591370,EBI-930143
CrebbpP454817EBI-591370,EBI-296306From a different organism.
CTR9Q6PD625EBI-591370,EBI-1019583
HIST1H3DP6843111EBI-591370,EBI-79722
KAT6AQ9279410EBI-2638616,EBI-948013
KAT8Q9H7Z63EBI-591370,EBI-896414
PAF1Q8N7H54EBI-591370,EBI-2607770
PAX5Q025482EBI-2610266,EBI-296331
PPIEQ9UNP94EBI-591370,EBI-591818
RBBP5Q152916EBI-591370,EBI-592823
WDR5P6196410EBI-591370,EBI-540834

GO - Molecular functioni

  • identical protein binding Source: IntAct
  • lysine-acetylated histone binding Source: UniProtKB
  • protein homodimerization activity Source: UniProtKB

Protein-protein interaction databases

BioGridi110443. 98 interactions.
DIPiDIP-29221N.
IntActiQ03164. 39 interactions.
MINTiMINT-4532017.
STRINGi9606.ENSP00000436786.

Chemistry

BindingDBiQ03164.

Structurei

Secondary structure

1
3969
Legend: HelixTurnBeta strand
Show more details
Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Helixi114 – 13320Combined sources
Beta strandi135 – 1384Combined sources
Beta strandi150 – 1523Combined sources
Beta strandi1151 – 11544Combined sources
Beta strandi1156 – 11583Combined sources
Helixi1159 – 11624Combined sources
Beta strandi1168 – 11703Combined sources
Helixi1171 – 11755Combined sources
Helixi1177 – 11793Combined sources
Beta strandi1183 – 11853Combined sources
Turni1190 – 11923Combined sources
Beta strandi1197 – 12004Combined sources
Turni1204 – 12063Combined sources
Beta strandi1566 – 15683Combined sources
Turni1570 – 15723Combined sources
Beta strandi1575 – 15773Combined sources
Turni1578 – 15825Combined sources
Beta strandi1585 – 15873Combined sources
Turni1589 – 15913Combined sources
Beta strandi1594 – 15963Combined sources
Helixi1597 – 15993Combined sources
Helixi1604 – 16129Combined sources
Helixi1614 – 16174Combined sources
Turni1622 – 16243Combined sources
Beta strandi1627 – 16293Combined sources
Helixi1631 – 165222Combined sources
Helixi1655 – 16617Combined sources
Helixi1708 – 17169Combined sources
Helixi1723 – 174018Combined sources
Helixi1745 – 176521Combined sources
Helixi1771 – 17733Combined sources
Helixi2847 – 285610Combined sources
Helixi3764 – 37663Combined sources
Helixi3796 – 37994Combined sources
Helixi3809 – 38113Combined sources
Helixi3816 – 38205Combined sources
Helixi3823 – 38308Combined sources
Beta strandi3831 – 38355Combined sources
Beta strandi3837 – 384711Combined sources
Beta strandi3854 – 38574Combined sources
Beta strandi3860 – 38645Combined sources
Helixi3865 – 38673Combined sources
Helixi3868 – 387710Combined sources
Beta strandi3884 – 38863Combined sources
Beta strandi3888 – 38947Combined sources
Turni3896 – 38983Combined sources
Helixi3901 – 39044Combined sources
Beta strandi3912 – 39209Combined sources
Beta strandi3923 – 393210Combined sources
Beta strandi3939 – 39424Combined sources

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
2AGHNMR-C2840-2869[»]
2J2SNMR-A1143-1214[»]
2JYINMR-A1147-1203[»]
2KKFNMR-A1147-1203[»]
2KU7NMR-A1585-1628[»]
2KYUNMR-A1564-1628[»]
2LXSNMR-B2840-2858[»]
2LXTNMR-B2840-2858[»]
2MSRNMR-A140-160[»]
2MTNNMR-A110-160[»]
2W5YX-ray2.00A3785-3969[»]
2W5ZX-ray2.20A3785-3969[»]
3EG6X-ray1.72C3762-3773[»]
3EMHX-ray1.37B3764-3776[»]
3LQHX-ray1.72A1566-1784[»]
3LQIX-ray1.92A/B/C1566-1784[»]
3LQJX-ray1.90A/B1566-1784[»]
3P4FX-ray2.35C3761-3770[»]
3U85X-ray3.00B6-25[»]
3U88X-ray3.00M/N103-153[»]
4ESGX-ray1.70C/D3755-3771[»]
4GQ6X-ray1.55B6-15[»]
4NW3X-ray2.82A1147-1204[»]
5F5EX-ray1.80A3813-3969[»]
5F6LX-ray1.90A3813-3969[»]
ProteinModelPortaliQ03164.
SMRiQ03164. Positions 6-39, 103-135, 1146-1214, 1564-1779, 2840-2869, 3790-3969.
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiQ03164.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini1703 – 174846Bromo; divergentPROSITE-ProRule annotationAdd
BLAST
Domaini2018 – 207457FYR N-terminalPROSITE-ProRule annotationAdd
BLAST
Domaini3666 – 374782FYR C-terminalPROSITE-ProRule annotationAdd
BLAST
Domaini3829 – 3945117SETPROSITE-ProRule annotationAdd
BLAST
Domaini3953 – 396917Post-SETPROSITE-ProRule annotationAdd
BLAST

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni3906 – 39072S-adenosyl-L-methionine binding

Motif

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Motifi2847 – 285599aaTAD1 Publication

Compositional bias

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Compositional biasi17 – 10286Ala/Gly/Ser-richAdd
BLAST
Compositional biasi137 – 1437Poly-Gly
Compositional biasi561 – 5644Poly-Pro
Compositional biasi568 – 5714Poly-Pro

Domaini

The 9aaTAD motif is a transactivation domain present in a large number of yeast and animal transcription factors.1 Publication
The SET domain structure is atypical and is not in an optimal position to have methyltransferase activity. It requires other components of the MLL1/MLL complex, such as ASH2L or RBBP5, to order the active site and obtain optimal histone methyltransferase activity.1 Publication
The CXXC-type zinc finger binds bind to nonmethyl-CpG dinucleotides.1 Publication

Sequence similaritiesi

Belongs to the class V-like SAM-binding methyltransferase superfamily. Histone-lysine methyltransferase family. TRX/MLL subfamily.PROSITE-ProRule annotation
Contains 3 A.T hook DNA-binding domains.Curated
Contains 1 bromo domain.PROSITE-ProRule annotation
Contains 1 CXXC-type zinc finger.PROSITE-ProRule annotation
Contains 1 FYR C-terminal domain.PROSITE-ProRule annotation
Contains 1 FYR N-terminal domain.PROSITE-ProRule annotation
Contains 3 PHD-type zinc fingers.PROSITE-ProRule annotation
Contains 1 post-SET domain.PROSITE-ProRule annotation
Contains 1 SET domain.PROSITE-ProRule annotation

Zinc finger

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Zinc fingeri1147 – 119549CXXC-typePROSITE-ProRule annotationAdd
BLAST
Zinc fingeri1431 – 148252PHD-type 1PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri1479 – 153355PHD-type 2PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri1566 – 162762PHD-type 3PROSITE-ProRule annotationAdd
BLAST

Keywords - Domaini

Bromodomain, Repeat, Zinc-finger

Phylogenomic databases

eggNOGiKOG1084. Eukaryota.
COG2940. LUCA.
GeneTreeiENSGT00760000119228.
HOVERGENiHBG051927.
InParanoidiQ03164.
KOiK09186.
OMAiRIMSPMR.
OrthoDBiEOG091G001P.
PhylomeDBiQ03164.
TreeFamiTF319820.

Family and domain databases

Gene3Di1.20.920.10. 1 hit.
3.30.40.10. 2 hits.
InterProiIPR001487. Bromodomain.
IPR003889. FYrich_C.
IPR003888. FYrich_N.
IPR016569. MeTrfase_trithorax.
IPR003616. Post-SET_dom.
IPR001214. SET_dom.
IPR002857. Znf_CXXC.
IPR011011. Znf_FYVE_PHD.
IPR001965. Znf_PHD.
IPR019787. Znf_PHD-finger.
IPR013083. Znf_RING/FYVE/PHD.
[Graphical view]
PfamiPF05965. FYRC. 1 hit.
PF05964. FYRN. 1 hit.
PF00628. PHD. 2 hits.
PF00856. SET. 1 hit.
PF02008. zf-CXXC. 1 hit.
[Graphical view]
PIRSFiPIRSF010354. Methyltransferase_trithorax. 1 hit.
SMARTiSM00297. BROMO. 1 hit.
SM00542. FYRC. 1 hit.
SM00541. FYRN. 1 hit.
SM00249. PHD. 4 hits.
SM00508. PostSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
SUPFAMiSSF47370. SSF47370. 1 hit.
SSF57903. SSF57903. 2 hits.
PROSITEiPS50014. BROMODOMAIN_2. 1 hit.
PS51543. FYRC. 1 hit.
PS51542. FYRN. 1 hit.
PS50868. POST_SET. 1 hit.
PS50280. SET. 1 hit.
PS51058. ZF_CXXC. 1 hit.
PS01359. ZF_PHD_1. 3 hits.
PS50016. ZF_PHD_2. 3 hits.
[Graphical view]

Sequences (3)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q03164-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MAHSCRWRFP ARPGTTGGGG GGGRRGLGGA PRQRVPALLL PPGPPVGGGG
60 70 80 90 100
PGAPPSPPAV AAAAAAAGSS GAGVPGGAAA ASAASSSSAS SSSSSSSSAS
110 120 130 140 150
SGPALLRVGP GFDAALQVSA AIGTNLRRFR AVFGESGGGG GSGEDEQFLG
160 170 180 190 200
FGSDEEVRVR SPTRSPSVKT SPRKPRGRPR SGSDRNSAIL SDPSVFSPLN
210 220 230 240 250
KSETKSGDKI KKKDSKSIEK KRGRPPTFPG VKIKITHGKD ISELPKGNKE
260 270 280 290 300
DSLKKIKRTP SATFQQATKI KKLRAGKLSP LKSKFKTGKL QIGRKGVQIV
310 320 330 340 350
RRRGRPPSTE RIKTPSGLLI NSELEKPQKV RKDKEGTPPL TKEDKTVVRQ
360 370 380 390 400
SPRRIKPVRI IPSSKRTDAT IAKQLLQRAK KGAQKKIEKE AAQLQGRKVK
410 420 430 440 450
TQVKNIRQFI MPVVSAISSR IIKTPRRFIE DEDYDPPIKI ARLESTPNSR
460 470 480 490 500
FSAPSCGSSE KSSAASQHSS QMSSDSSRSS SPSVDTSTDS QASEEIQVLP
510 520 530 540 550
EERSDTPEVH PPLPISQSPE NESNDRRSRR YSVSERSFGS RTTKKLSTLQ
560 570 580 590 600
SAPQQQTSSS PPPPLLTPPP PLQPASSISD HTPWLMPPTI PLASPFLPAS
610 620 630 640 650
TAPMQGKRKS ILREPTFRWT SLKHSRSEPQ YFSSAKYAKE GLIRKPIFDN
660 670 680 690 700
FRPPPLTPED VGFASGFSAS GTAASARLFS PLHSGTRFDM HKRSPLLRAP
710 720 730 740 750
RFTPSEAHSR IFESVTLPSN RTSAGTSSSG VSNRKRKRKV FSPIRSEPRS
760 770 780 790 800
PSHSMRTRSG RLSSSELSPL TPPSSVSSSL SISVSPLATS ALNPTFTFPS
810 820 830 840 850
HSLTQSGESA EKNQRPRKQT SAPAEPFSSS SPTPLFPWFT PGSQTERGRN
860 870 880 890 900
KDKAPEELSK DRDADKSVEK DKSRERDRER EKENKRESRK EKRKKGSEIQ
910 920 930 940 950
SSSALYPVGR VSKEKVVGED VATSSSAKKA TGRKKSSSHD SGTDITSVTL
960 970 980 990 1000
GDTTAVKTKI LIKKGRGNLE KTNLDLGPTA PSLEKEKTLC LSTPSSSTVK
1010 1020 1030 1040 1050
HSTSSIGSML AQADKLPMTD KRVASLLKKA KAQLCKIEKS KSLKQTDQPK
1060 1070 1080 1090 1100
AQGQESDSSE TSVRGPRIKH VCRRAAVALG RKRAVFPDDM PTLSALPWEE
1110 1120 1130 1140 1150
REKILSSMGN DDKSSIAGSE DAEPLAPPIK PIKPVTRNKA PQEPPVKKGR
1160 1170 1180 1190 1200
RSRRCGQCPG CQVPEDCGVC TNCLDKPKFG GRNIKKQCCK MRKCQNLQWM
1210 1220 1230 1240 1250
PSKAYLQKQA KAVKKKEKKS KTSEKKDSKE SSVVKNVVDS SQKPTPSARE
1260 1270 1280 1290 1300
DPAPKKSSSE PPPRKPVEEK SEEGNVSAPG PESKQATTPA SRKSSKQVSQ
1310 1320 1330 1340 1350
PALVIPPQPP TTGPPRKEVP KTTPSEPKKK QPPPPESGPE QSKQKKVAPR
1360 1370 1380 1390 1400
PSIPVKQKPK EKEKPPPVNK QENAGTLNIL STLSNGNSSK QKIPADGVHR
1410 1420 1430 1440 1450
IRVDFKEDCE AENVWEMGGL GILTSVPITP RVVCFLCASS GHVEFVYCQV
1460 1470 1480 1490 1500
CCEPFHKFCL EENERPLEDQ LENWCCRRCK FCHVCGRQHQ ATKQLLECNK
1510 1520 1530 1540 1550
CRNSYHPECL GPNYPTKPTK KKKVWICTKC VRCKSCGSTT PGKGWDAQWS
1560 1570 1580 1590 1600
HDFSLCHDCA KLFAKGNFCP LCDKCYDDDD YESKMMQCGK CDRWVHSKCE
1610 1620 1630 1640 1650
NLSDEMYEIL SNLPESVAYT CVNCTERHPA EWRLALEKEL QISLKQVLTA
1660 1670 1680 1690 1700
LLNSRTTSHL LRYRQAAKPP DLNPETEESI PSRSSPEGPD PPVLTEVSKQ
1710 1720 1730 1740 1750
DDQQPLDLEG VKRKMDQGNY TSVLEFSDDI VKIIQAAINS DGGQPEIKKA
1760 1770 1780 1790 1800
NSMVKSFFIR QMERVFPWFS VKKSRFWEPN KVSSNSGMLP NAVLPPSLDH
1810 1820 1830 1840 1850
NYAQWQEREE NSHTEQPPLM KKIIPAPKPK GPGEPDSPTP LHPPTPPILS
1860 1870 1880 1890 1900
TDRSREDSPE LNPPPGIEDN RQCALCLTYG DDSANDAGRL LYIGQNEWTH
1910 1920 1930 1940 1950
VNCALWSAEV FEDDDGSLKN VHMAVIRGKQ LRCEFCQKPG ATVGCCLTSC
1960 1970 1980 1990 2000
TSNYHFMCSR AKNCVFLDDK KVYCQRHRDL IKGEVVPENG FEVFRRVFVD
2010 2020 2030 2040 2050
FEGISLRRKF LNGLEPENIH MMIGSMTIDC LGILNDLSDC EDKLFPIGYQ
2060 2070 2080 2090 2100
CSRVYWSTTD ARKRCVYTCK IVECRPPVVE PDINSTVEHD ENRTIAHSPT
2110 2120 2130 2140 2150
SFTESSSKES QNTAEIISPP SPDRPPHSQT SGSCYYHVIS KVPRIRTPSY
2160 2170 2180 2190 2200
SPTQRSPGCR PLPSAGSPTP TTHEIVTVGD PLLSSGLRSI GSRRHSTSSL
2210 2220 2230 2240 2250
SPQRSKLRIM SPMRTGNTYS RNNVSSVSTT GTATDLESSA KVVDHVLGPL
2260 2270 2280 2290 2300
NSSTSLGQNT STSSNLQRTV VTVGNKNSHL DGSSSSEMKQ SSASDLVSKS
2310 2320 2330 2340 2350
SSLKGEKTKV LSSKSSEGSA HNVAYPGIPK LAPQVHNTTS RELNVSKIGS
2360 2370 2380 2390 2400
FAEPSSVSFS SKEALSFPHL HLRGQRNDRD QHTDSTQSAN SSPDEDTEVK
2410 2420 2430 2440 2450
TLKLSGMSNR SSIINEHMGS SSRDRRQKGK KSCKETFKEK HSSKSFLEPG
2460 2470 2480 2490 2500
QVTTGEEGNL KPEFMDEVLT PEYMGQRPCN NVSSDKIGDK GLSMPGVPKA
2510 2520 2530 2540 2550
PPMQVEGSAK ELQAPRKRTV KVTLTPLKME NESQSKNALK ESSPASPLQI
2560 2570 2580 2590 2600
ESTSPTEPIS ASENPGDGPV AQPSPNNTSC QDSQSNNYQN LPVQDRNLML
2610 2620 2630 2640 2650
PDGPKPQEDG SFKRRYPRRS ARARSNMFFG LTPLYGVRSY GEEDIPFYSS
2660 2670 2680 2690 2700
STGKKRGKRS AEGQVDGADD LSTSDEDDLY YYNFTRTVIS SGGEERLASH
2710 2720 2730 2740 2750
NLFREEEQCD LPKISQLDGV DDGTESDTSV TATTRKSSQI PKRNGKENGT
2760 2770 2780 2790 2800
ENLKIDRPED AGEKEHVTKS SVGHKNEPKM DNCHSVSRVK TQGQDSLEAQ
2810 2820 2830 2840 2850
LSSLESSRRV HTSTPSDKNL LDTYNTELLK SDSDNNNSDD CGNILPSDIM
2860 2870 2880 2890 2900
DFVLKNTPSM QALGESPESS SSELLNLGEG LGLDSNREKD MGLFEVFSQQ
2910 2920 2930 2940 2950
LPTTEPVDSS VSSSISAEEQ FELPLELPSD LSVLTTRSPT VPSQNPSRLA
2960 2970 2980 2990 3000
VISDSGEKRV TITEKSVASS ESDPALLSPG VDPTPEGHMT PDHFIQGHMD
3010 3020 3030 3040 3050
ADHISSPPCG SVEQGHGNNQ DLTRNSSTPG LQVPVSPTVP IQNQKYVPNS
3060 3070 3080 3090 3100
TDSPGPSQIS NAAVQTTPPH LKPATEKLIV VNQNMQPLYV LQTLPNGVTQ
3110 3120 3130 3140 3150
KIQLTSSVSS TPSVMETNTS VLGPMGGGLT LTTGLNPSLP TSQSLFPSAS
3160 3170 3180 3190 3200
KGLLPMSHHQ HLHSFPAATQ SSFPPNISNP PSGLLIGVQP PPDPQLLVSE
3210 3220 3230 3240 3250
SSQRTDLSTT VATPSSGLKK RPISRLQTRK NKKLAPSSTP SNIAPSDVVS
3260 3270 3280 3290 3300
NMTLINFTPS QLPNHPSLLD LGSLNTSSHR TVPNIIKRSK SSIMYFEPAP
3310 3320 3330 3340 3350
LLPQSVGGTA ATAAGTSTIS QDTSHLTSGS VSGLASSSSV LNVVSMQTTT
3360 3370 3380 3390 3400
TPTSSASVPG HVTLTNPRLL GTPDIGSISN LLIKASQQSL GIQDQPVALP
3410 3420 3430 3440 3450
PSSGMFPQLG TSQTPSTAAI TAASSICVLP STQTTGITAA SPSGEADEHY
3460 3470 3480 3490 3500
QLQHVNQLLA SKTGIHSSQR DLDSASGPQV SNFTQTVDAP NSMGLEQNKA
3510 3520 3530 3540 3550
LSSAVQASPT SPGGSPSSPS SGQRSASPSV PGPTKPKPKT KRFQLPLDKG
3560 3570 3580 3590 3600
NGKKHKVSHL RTSSSEAHIP DQETTSLTSG TGTPGAEAEQ QDTASVEQSS
3610 3620 3630 3640 3650
QKECGQPAGQ VAVLPEVQVT QNPANEQESA EPKTVEEEES NFSSPLMLWL
3660 3670 3680 3690 3700
QQEQKRKESI TEKKPKKGLV FEISSDDGFQ ICAESIEDAW KSLTDKVQEA
3710 3720 3730 3740 3750
RSNARLKQLS FAGVNGLRML GILHDAVVFL IEQLSGAKHC RNYKFRFHKP
3760 3770 3780 3790 3800
EEANEPPLNP HGSARAEVHL RKSAFDMFNF LASKHRQPPE YNPNDEEEEE
3810 3820 3830 3840 3850
VQLKSARRAT SMDLPMPMRF RHLKKTSKEA VGVYRSPIHG RGLFCKRNID
3860 3870 3880 3890 3900
AGEMVIEYAG NVIRSIQTDK REKYYDSKGI GCYMFRIDDS EVVDATMHGN
3910 3920 3930 3940 3950
AARFINHSCE PNCYSRVINI DGQKHIVIFA MRKIYRGEEL TYDYKFPIED
3960
ASNKLPCNCG AKKCRKFLN
Length:3,969
Mass (Da):431,764
Last modified:May 1, 2007 - v5
Checksum:i1150F37EAB1430D3
GO
Isoform 2 (identifier: Q03164-2) [UniParc]FASTAAdd to basket
Also known as: 14P-18B

The sequence of this isoform differs from the canonical sequence as follows:
     1407-1444: Missing.

Show »
Length:3,931
Mass (Da):427,733
Checksum:iB8E736C88E83D50B
GO
Isoform 3 (identifier: Q03164-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1603-1603: S → SGTE

Show »
Length:3,972
Mass (Da):432,052
Checksum:i18CFDD8B9A763204
GO

Sequence cautioni

The sequence AAA58669 differs from that shown. Reason: Frameshift at positions 317 and 380. Curated
The sequence AAG26332 differs from that shown.Contaminating sequence. Potential poly-A sequence.Curated
The sequence BAD92745 differs from that shown. Reason: Frameshift at position 1098. Curated

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti144 – 1441E → ELTTQIPCSWRTKGHIHDKK TEPFRLLAWSWCLN in CAA93625 (PubMed:8703835).Curated
Sequence conflicti556 – 5561Q → E in CAA93625 (PubMed:8703835).Curated
Sequence conflicti556 – 5561Q → E in L04731 (PubMed:1423625).Curated
Sequence conflicti1347 – 13471V → A in AAG26335 (PubMed:10706619).Curated
Sequence conflicti1487 – 14871R → G in AAA18644 (PubMed:8162575).Curated
Sequence conflicti1490 – 14901Q → R in AAG26335 (PubMed:10706619).Curated
Sequence conflicti1507 – 15071P → L in AAG26335 (PubMed:10706619).Curated
Sequence conflicti1513 – 15131N → T in AAG26335 (PubMed:10706619).Curated
Sequence conflicti1600 – 16001E → G in AAG26335 (PubMed:10706619).Curated
Sequence conflicti1616 – 16161S → C in AAB34770 (PubMed:7598802).Curated
Sequence conflicti1937 – 19371Q → H in AAA92511 (PubMed:1303259).Curated
Sequence conflicti2181 – 21811P → S in AAA92511 (PubMed:1303259).Curated
Sequence conflicti3556 – 35561K → N in L04731 (PubMed:1423625).Curated
Sequence conflicti3718 – 37181R → G in CAA93625 (PubMed:8703835).Curated
Sequence conflicti3759 – 37591N → D in CAA93625 (PubMed:8703835).Curated
Sequence conflicti3813 – 38131D → G in CAA93625 (PubMed:8703835).Curated
Sequence conflicti3901 – 39011A → R in AAA58669 (PubMed:1423624).Curated

Natural variant

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Natural varianti30 – 301A → G.1 Publication
Corresponds to variant rs9332745 [ dbSNP | Ensembl ].
VAR_021317
Natural varianti53 – 531A → V.1 Publication
Corresponds to variant rs9332747 [ dbSNP | Ensembl ].
VAR_021318
Natural varianti502 – 5021E → K.1 Publication
Corresponds to variant rs9332772 [ dbSNP | Ensembl ].
VAR_021319
Natural varianti1975 – 19751Q → P.
Corresponds to variant rs693598 [ dbSNP | Ensembl ].
VAR_052652
Natural varianti2319 – 23191S → T.1 Publication
Corresponds to variant rs9332837 [ dbSNP | Ensembl ].
VAR_021320
Natural varianti2354 – 23541P → R.1 Publication
Corresponds to variant rs9332838 [ dbSNP | Ensembl ].
VAR_021321
Natural varianti2387 – 23871Q → R.1 Publication
Corresponds to variant rs9332839 [ dbSNP | Ensembl ].
VAR_021322
Natural varianti3714 – 37141V → I.1 Publication
Corresponds to variant rs9332859 [ dbSNP | Ensembl ].
VAR_021323
Natural varianti3773 – 37731S → A.1 Publication
Corresponds to variant rs9332861 [ dbSNP | Ensembl ].
VAR_021324

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei1407 – 144438Missing in isoform 2. 1 PublicationVSP_006666Add
BLAST
Alternative sequencei1603 – 16031S → SGTE in isoform 3. 2 PublicationsVSP_046879

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L04284 mRNA. Translation: AAA58669.1. Frameshift.
Z69744
, Z69745, Z69746, Z69747, Z69748, Z69749, Z69750, Z69751, Z69752, Z69753, Z69754, Z69755, Z69756, Z69757, Z69758, Z69759, Z69760, Z69761, Z69762, Z69763, Z69764, Z69765, Z69766, Z69767, Z69768, Z69769, Z69770, Z69772, Z69773, Z69774, Z69775, Z69776, Z69777, Z69778, Z69779, Z69780 Genomic DNA. Translation: CAA93625.1.
AY373585 Genomic DNA. Translation: AAQ63624.1.
AP000941 Genomic DNA. No translation available.
AP001267 Genomic DNA. No translation available.
D14540 mRNA. Translation: BAA03407.1.
AB209508 mRNA. Translation: BAD92745.1. Frameshift.
L04731 mRNA. No translation available.
L01986 mRNA. Translation: AAA92511.1.
X83604 Genomic DNA. Translation: CAA58584.1.
S78570 mRNA. Translation: AAB34770.1.
U04737 Genomic DNA. Translation: AAA18644.1.
S66432 mRNA. Translation: AAB28545.1.
AF232001 mRNA. Translation: AAG26335.2.
AF231998 mRNA. Translation: AAG26332.2. Sequence problems.
CCDSiCCDS31686.1. [Q03164-1]
CCDS55791.1. [Q03164-3]
PIRiA44265.
I52578.
I53035.
RefSeqiNP_001184033.1. NM_001197104.1. [Q03164-3]
NP_005924.2. NM_005933.3. [Q03164-1]
UniGeneiHs.258855.

Genome annotation databases

EnsembliENST00000389506; ENSP00000374157; ENSG00000118058. [Q03164-1]
ENST00000534358; ENSP00000436786; ENSG00000118058. [Q03164-3]
GeneIDi4297.
KEGGihsa:4297.
UCSCiuc001pta.4. human. [Q03164-1]

Keywords - Coding sequence diversityi

Alternative splicing, Chromosomal rearrangement, Polymorphism

Cross-referencesi

Web resourcesi

Atlas of Genetics and Cytogenetics in Oncology and Haematology
NIEHS-SNPs

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L04284 mRNA. Translation: AAA58669.1. Frameshift.
Z69744
, Z69745, Z69746, Z69747, Z69748, Z69749, Z69750, Z69751, Z69752, Z69753, Z69754, Z69755, Z69756, Z69757, Z69758, Z69759, Z69760, Z69761, Z69762, Z69763, Z69764, Z69765, Z69766, Z69767, Z69768, Z69769, Z69770, Z69772, Z69773, Z69774, Z69775, Z69776, Z69777, Z69778, Z69779, Z69780 Genomic DNA. Translation: CAA93625.1.
AY373585 Genomic DNA. Translation: AAQ63624.1.
AP000941 Genomic DNA. No translation available.
AP001267 Genomic DNA. No translation available.
D14540 mRNA. Translation: BAA03407.1.
AB209508 mRNA. Translation: BAD92745.1. Frameshift.
L04731 mRNA. No translation available.
L01986 mRNA. Translation: AAA92511.1.
X83604 Genomic DNA. Translation: CAA58584.1.
S78570 mRNA. Translation: AAB34770.1.
U04737 Genomic DNA. Translation: AAA18644.1.
S66432 mRNA. Translation: AAB28545.1.
AF232001 mRNA. Translation: AAG26335.2.
AF231998 mRNA. Translation: AAG26332.2. Sequence problems.
CCDSiCCDS31686.1. [Q03164-1]
CCDS55791.1. [Q03164-3]
PIRiA44265.
I52578.
I53035.
RefSeqiNP_001184033.1. NM_001197104.1. [Q03164-3]
NP_005924.2. NM_005933.3. [Q03164-1]
UniGeneiHs.258855.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
2AGHNMR-C2840-2869[»]
2J2SNMR-A1143-1214[»]
2JYINMR-A1147-1203[»]
2KKFNMR-A1147-1203[»]
2KU7NMR-A1585-1628[»]
2KYUNMR-A1564-1628[»]
2LXSNMR-B2840-2858[»]
2LXTNMR-B2840-2858[»]
2MSRNMR-A140-160[»]
2MTNNMR-A110-160[»]
2W5YX-ray2.00A3785-3969[»]
2W5ZX-ray2.20A3785-3969[»]
3EG6X-ray1.72C3762-3773[»]
3EMHX-ray1.37B3764-3776[»]
3LQHX-ray1.72A1566-1784[»]
3LQIX-ray1.92A/B/C1566-1784[»]
3LQJX-ray1.90A/B1566-1784[»]
3P4FX-ray2.35C3761-3770[»]
3U85X-ray3.00B6-25[»]
3U88X-ray3.00M/N103-153[»]
4ESGX-ray1.70C/D3755-3771[»]
4GQ6X-ray1.55B6-15[»]
4NW3X-ray2.82A1147-1204[»]
5F5EX-ray1.80A3813-3969[»]
5F6LX-ray1.90A3813-3969[»]
ProteinModelPortaliQ03164.
SMRiQ03164. Positions 6-39, 103-135, 1146-1214, 1564-1779, 2840-2869, 3790-3969.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi110443. 98 interactions.
DIPiDIP-29221N.
IntActiQ03164. 39 interactions.
MINTiMINT-4532017.
STRINGi9606.ENSP00000436786.

Chemistry

BindingDBiQ03164.
ChEMBLiCHEMBL3137282.

PTM databases

iPTMnetiQ03164.
PhosphoSiteiQ03164.

Polymorphism and mutation databases

BioMutaiKMT2A.
DMDMi146345435.

Proteomic databases

EPDiQ03164.
MaxQBiQ03164.
PaxDbiQ03164.
PeptideAtlasiQ03164.
PRIDEiQ03164.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000389506; ENSP00000374157; ENSG00000118058. [Q03164-1]
ENST00000534358; ENSP00000436786; ENSG00000118058. [Q03164-3]
GeneIDi4297.
KEGGihsa:4297.
UCSCiuc001pta.4. human. [Q03164-1]

Organism-specific databases

CTDi4297.
GeneCardsiKMT2A.
HGNCiHGNC:7132. KMT2A.
HPAiCAB017794.
CAB024270.
HPA044910.
MalaCardsiKMT2A.
MIMi159555. gene+phenotype.
605130. phenotype.
neXtProtiNX_Q03164.
Orphaneti402017. 'Acute myeloid leukemia with t(9;11)(p22;q23)'.
98837. Acute biphenotypic leukemia.
98831. Acute myeloid leukemia with 11q23 abnormalities.
98835. Acute undifferentiated leukemia.
98836. Bilineal acute leukemia.
99860. Precursor B-cell acute lymphoblastic leukemia.
319182. Wiedemann-Steiner syndrome.
PharmGKBiPA241.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG1084. Eukaryota.
COG2940. LUCA.
GeneTreeiENSGT00760000119228.
HOVERGENiHBG051927.
InParanoidiQ03164.
KOiK09186.
OMAiRIMSPMR.
OrthoDBiEOG091G001P.
PhylomeDBiQ03164.
TreeFamiTF319820.

Enzyme and pathway databases

ReactomeiR-HSA-3214841. PKMTs methylate histone lysines.
SIGNORiQ03164.

Miscellaneous databases

ChiTaRSiKMT2A. human.
EvolutionaryTraceiQ03164.
GeneWikiiMLL_(gene).
GenomeRNAii4297.
PROiQ03164.
SOURCEiSearch...

Gene expression databases

BgeeiENSG00000118058.
CleanExiHS_MLL.
ExpressionAtlasiQ03164. baseline and differential.
GenevisibleiQ03164. HS.

Family and domain databases

Gene3Di1.20.920.10. 1 hit.
3.30.40.10. 2 hits.
InterProiIPR001487. Bromodomain.
IPR003889. FYrich_C.
IPR003888. FYrich_N.
IPR016569. MeTrfase_trithorax.
IPR003616. Post-SET_dom.
IPR001214. SET_dom.
IPR002857. Znf_CXXC.
IPR011011. Znf_FYVE_PHD.
IPR001965. Znf_PHD.
IPR019787. Znf_PHD-finger.
IPR013083. Znf_RING/FYVE/PHD.
[Graphical view]
PfamiPF05965. FYRC. 1 hit.
PF05964. FYRN. 1 hit.
PF00628. PHD. 2 hits.
PF00856. SET. 1 hit.
PF02008. zf-CXXC. 1 hit.
[Graphical view]
PIRSFiPIRSF010354. Methyltransferase_trithorax. 1 hit.
SMARTiSM00297. BROMO. 1 hit.
SM00542. FYRC. 1 hit.
SM00541. FYRN. 1 hit.
SM00249. PHD. 4 hits.
SM00508. PostSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
SUPFAMiSSF47370. SSF47370. 1 hit.
SSF57903. SSF57903. 2 hits.
PROSITEiPS50014. BROMODOMAIN_2. 1 hit.
PS51543. FYRC. 1 hit.
PS51542. FYRN. 1 hit.
PS50868. POST_SET. 1 hit.
PS50280. SET. 1 hit.
PS51058. ZF_CXXC. 1 hit.
PS01359. ZF_PHD_1. 3 hits.
PS50016. ZF_PHD_2. 3 hits.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiKMT2A_HUMAN
AccessioniPrimary (citable) accession number: Q03164
Secondary accession number(s): E9PQG7
, Q13743, Q13744, Q14845, Q16364, Q59FF2, Q6UBD1, Q9HBJ3, Q9UD94, Q9UMA3
Entry historyi
Integrated into UniProtKB/Swiss-Prot: October 1, 1993
Last sequence update: May 1, 2007
Last modified: September 7, 2016
This is version 206 of the entry and version 5 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Human chromosome 11
    Human chromosome 11: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  6. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.