P55200 (MLL1_MOUSE) Reviewed, UniProtKB/Swiss-Prot
Last modified
April 3, 2013.
Version 123.
History...
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Histone-lysine N-methyltransferase MLL EC=2.1.1.43 Alternative name(s): ALL-1 Zinc finger protein HRX Cleaved into the following 2 chains:
| ||||
| Gene names |
| ||||
| Organism | Mus musculus (Mouse) [Reference proteome] | ||||
| Taxonomic identifier | 10090 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus › Mus![]() |
Protein attributes
| Sequence length | 3966 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Function | Histone methyltransferase that plays an essential role in early development and hematopoiesis. Catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) complex and acetylation of 'Lys-16' of histone H4 (H4K16ac). In the MLL1/MLL complex, it specifically mediates H3K4me, a specific tag for epigenetic transcriptional activation. Has weak methyltransferase activity by itself, and requires other component of the MLL1/MLL complex to obtain full methyltransferase activity. Has no activity toward histone H3 phosphorylated on 'Thr-3', less activity toward H3 dimethylated on 'Arg-8' or 'Lys-9', while it has higher activity toward H3 acetylated on 'Lys-9'. Required for transcriptional activation of HOXA9. Promotes PPP1R15A-induced apoptosis By similarity. |
| Catalytic activity | S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone]. |
| Subunit structure | MLL cleavage product N320 heterodimerizes with MLL cleavage product C180 (via SET and FYRC domains). Interacts with WDR5; the interaction is direct. Interacts with KAT8/MOF; the interaction is direct. Interacts with SBF1 and PPP1R15A By similarity. Component of some MLL1/MLL complex, at least composed of the core components MLL, ASH2L, HCFC1/HCF1, HCFC2, WDR5, DPY30 and RBBP5, as well as the facultative components BAP18, CHD8, E2F6, HSP70, INO80C, KANSL1, LAS1L, MAX, MCRS1, MEN1, MGA, KAT8/MOF, PELP1, PHF20, PRP31, RING2, RUVB1/TIP49A, RUVB2/TIP49B, SENP3, TAF1, TAF4, TAF6, TAF7, TAF9 and TEX10. Ref.4 |
| Subcellular location | Nucleus By similarity. MLL cleavage product N320: Nucleus By similarity. MLL cleavage product C180: Nucleus By similarity. Note: Localizes to a diffuse nuclear pattern when not associated with MLL cleavage product N320. |
| Domain | The SET domain structure is atypical and is not in an optimal position to have methyltransferase activity. It requires other components of the MLL1/MLL complex, such as ASH2L or RBBP5, to order the active site and obtain optimal histone methyltransferase activity By similarity. The CXXC-type zinc finger binds bind to nonmethyl-CpG dinucleotides By similarity. |
| Post-translational modification | Proteolytic cleavage by TASP1 generates MLL cleavage product N320 and MLL cleavage product C180, which reassemble through a non-covalent association. 2 cleavage sites exist, cleavage site 1 (CS1) and cleavage site 2 (CS2), to generate MLL cleavage products N320 and C180. CS2 is the major site By similarity. |
| Sequence similarities | Belongs to the histone-lysine methyltransferase family. TRX/MLL subfamily. Contains 3 A.T hook DNA-binding domains. Contains 1 bromo domain. Contains 1 CXXC-type zinc finger. Contains 1 FYR C-terminal domain. Contains 1 FYR N-terminal domain. Contains 3 PHD-type zinc fingers. Contains 1 post-SET domain. Contains 1 SET domain. |
| Sequence caution | The sequence BAE24386.1 differs from that shown. Reason: Erroneous initiation. |
Ontologies
Alternative products
| This entry describes 2 isoforms produced by alternative splicing. [Align] [Select] | ||||||
| Isoform 1 (identifier: P55200-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Isoform 2 (identifier: P55200-2) The sequence of this isoform differs from the canonical sequence as follows: 1603-1605: Missing. |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||
Molecule processing | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 3966 | 3966 | Histone-lysine N-methyltransferase MLL | PRO_0000124877 | |||||
| Chain | 1 – 2714 | 2714 | MLL cleavage product N320 By similarity | PRO_0000390951 | |||||
| Chain | 2715 – 3966 | 1252 | MLL cleavage product C180 By similarity | PRO_0000390952 | |||||
Regions | |||||||||
| Domain | 1705 – 1750 | 46 | Bromo; divergent | ||||||
| Domain | 2020 – 2076 | 57 | FYR N-terminal | ||||||
| Domain | 3663 – 3744 | 82 | FYR C-terminal | ||||||
| Domain | 3825 – 3946 | 122 | SET | ||||||
| Domain | 3950 – 3966 | 17 | Post-SET | ||||||
| DNA binding | 167 – 178 | 12 | A.T hook 1 | ||||||
| DNA binding | 215 – 225 | 11 | A.T hook 2 | ||||||
| DNA binding | 299 – 307 | 9 | A.T hook 3 | ||||||
| Zinc finger | 1144 – 1192 | 49 | CXXC-type | ||||||
| Zinc finger | 1430 – 1481 | 52 | PHD-type 1 | ||||||
| Zinc finger | 1478 – 1532 | 55 | PHD-type 2 | ||||||
| Zinc finger | 1565 – 1629 | 65 | PHD-type 3 | ||||||
| Region | 3903 – 3904 | 2 | S-adenosyl-L-methionine binding By similarity | ||||||
| Compositional bias | 14 – 150 | 137 | Gly-rich | ||||||
| Compositional bias | 17 – 100 | 84 | Ala/Gly/Ser-rich | ||||||
| Compositional bias | 80 – 99 | 20 | Ser-rich | ||||||
| Compositional bias | 135 – 141 | 7 | Poly-Gly | ||||||
| Compositional bias | 199 – 287 | 89 | Lys-rich | ||||||
| Compositional bias | 443 – 491 | 49 | Ser-rich | ||||||
| Compositional bias | 559 – 601 | 43 | Pro-rich | ||||||
| Compositional bias | 559 – 562 | 4 | Poly-Pro | ||||||
| Compositional bias | 566 – 569 | 4 | Poly-Pro | ||||||
| Compositional bias | 703 – 807 | 105 | Ser-rich | ||||||
| Compositional bias | 1234 – 1366 | 133 | Pro-rich | ||||||
| Compositional bias | 1819 – 1867 | 49 | Pro-rich | ||||||
| Compositional bias | 2185 – 2320 | 136 | Ser-rich | ||||||
Sites | |||||||||
| Metal binding | 3906 | 1 | Zinc By similarity | ||||||
| Metal binding | 3954 | 1 | Zinc By similarity | ||||||
| Metal binding | 3956 | 1 | Zinc By similarity | ||||||
| Metal binding | 3961 | 1 | Zinc By similarity | ||||||
| Binding site | 3836 | 1 | S-adenosyl-L-methionine By similarity | ||||||
| Binding site | 3838 | 1 | S-adenosyl-L-methionine By similarity | ||||||
| Binding site | 3880 | 1 | S-adenosyl-L-methionine By similarity | ||||||
| Binding site | 3955 | 1 | S-adenosyl-L-methionine By similarity | ||||||
| Site | 2662 – 2663 | 2 | Cleavage; by TASP1, site 1 By similarity | ||||||
| Site | 2714 – 2715 | 2 | Cleavage; by TASP1, site 2 By similarity | ||||||
| Site | 3762 | 1 | Important for WDR5-recognition and binding By similarity | ||||||
Amino acid modifications | |||||||||
| Modified residue | 151 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 195 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 516 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 634 | 1 | N6-acetyllysine By similarity | ||||||
| Modified residue | 678 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 837 | 1 | Phosphothreonine By similarity | ||||||
| Modified residue | 923 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 1053 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 1127 | 1 | N6-acetyllysine By similarity | ||||||
| Modified residue | 1232 | 1 | N6-acetyllysine By similarity | ||||||
| Modified residue | 1847 | 1 | Phosphothreonine By similarity | ||||||
| Modified residue | 1860 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 2100 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 2148 | 1 | Phosphothreonine By similarity | ||||||
| Modified residue | 2152 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 2202 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 2951 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 3032 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 3369 | 1 | Phosphothreonine By similarity | ||||||
| Modified residue | 3510 | 1 | Phosphoserine By similarity | ||||||
| Cross-link | 214 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity | |||||||
| Cross-link | 218 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity | |||||||
| Cross-link | 219 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity | |||||||
Natural variations | |||||||||
| Alternative sequence | 1603 – 1605 | 3 | Missing in isoform 2. | VSP_006667 | |||||
| Natural variant | 1597 | 1 | K → T. | ||||||
Experimental info | |||||||||
| Sequence conflict | 372 | 1 | Q → E in BAE28820. Ref.3 | ||||||
| Sequence conflict | 554 | 1 | Q → K in AAA62593. Ref.2 | ||||||
| Sequence conflict | 564 | 1 | L → F in AAA62593. Ref.2 | ||||||
| Sequence conflict | 797 | 1 | P → S in BAE28820. Ref.3 | ||||||
| Sequence conflict | 806 | 1 | E → D in AAA62593. Ref.2 | ||||||
| Sequence conflict | 806 | 1 | E → D in BAE28820. Ref.3 | ||||||
| Sequence conflict | 821 | 1 | L → P in AAA62593. Ref.2 | ||||||
| Sequence conflict | 821 | 1 | L → P in BAE28820. Ref.3 | ||||||
| Sequence conflict | 1069 | 1 | C → Y in AAA62593. Ref.2 | ||||||
| Sequence conflict | 1230 | 1 | A → S in AAA62593. Ref.2 | ||||||
| Sequence conflict | 1349 | 1 | R → L in AAA62593. Ref.2 | ||||||
| Sequence conflict | 1437 | 1 | A → S in AAA62593. Ref.2 | ||||||
| Sequence conflict | 1440 | 1 | G → E in AAA62593. Ref.2 | ||||||
| Sequence conflict | 1632 | 1 | A → P in AAA62593. Ref.2 | ||||||
| Sequence conflict | 2292 | 1 | S → L in AAA62593. Ref.2 | ||||||
| Sequence conflict | 3481 | 1 | N → I in AAA62593. Ref.2 | ||||||
| Sequence conflict | 3493 | 1 | R → S in AAA62593. Ref.2 | ||||||
| Sequence conflict | 3548 | 1 | G → V in AAA62593. Ref.2 | ||||||
| Sequence conflict | 3769 | 1 | Q → K in AAA62593. Ref.2 | ||||||
Sequences
| ||||||||||||||||||||||||
References
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | AC061963 Genomic DNA. No translation available. AC142113 Genomic DNA. No translation available. L17069 mRNA. Translation: AAA62593.1. AK140439 mRNA. Translation: BAE24386.1. Different initiation. AK149341 mRNA. Translation: BAE28820.1. |
| IPI | IPI00315032. IPI01007845. |
| RefSeq | NP_001074518.1. NM_001081049.1. |
| UniGene | Mm.2389. |
3D structure databases | |
| ProteinModelPortal | P55200. |
| SMR | P55200. Positions 6-39, 101-133, 1144-1200, 1563-1781, 2836-2865, 3787-3966. |
| ModBase | Search... |
Protein-protein interaction databases | |
| DIP | DIP-58597N. |
| IntAct | P55200. 1 interaction. |
PTM databases | |
| PhosphoSite | P55200. |
Proteomic databases | |
| PaxDb | P55200. |
| PRIDE | P55200. |
Protocols and materials databases | |
| StructuralBiologyKnowledgebase | Search... |
Genome annotation databases | |
| Ensembl | ENSMUST00000002095; ENSMUSP00000002095; ENSMUSG00000002028. ENSMUST00000114689; ENSMUSP00000110337; ENSMUSG00000002028. |
| GeneID | 214162. |
| KEGG | mmu:214162. |
| UCSC | uc009pep.1. mouse. uc009peq.1. mouse. |
Organism-specific databases | |
| CTD | 214162. |
| MGI | MGI:96995. Mll1. |
Phylogenomic databases | |
| eggNOG | COG2940. |
| GeneTree | ENSGT00690000101661. |
| HOGENOM | HOG000112954. |
| HOVERGEN | HBG051927. |
| InParanoid | P55200. |
| KO | K09186. |
| OMA | QYFSSAK. |
| OrthoDB | EOG47H5P3. |
Gene expression databases | |
| ArrayExpress | P55200. |
| Bgee | P55200. |
| CleanEx | MM_MLL1. |
| Genevestigator | P55200. |
| GermOnline | ENSMUSG00000002028. Mus musculus. |
Family and domain databases | |
| Gene3D | 3.30.40.10. 2 hits. |
| InterPro | IPR017956. AT_hook_DNA-bd_motif. IPR001487. Bromodomain. IPR003889. FYrich_C. IPR003888. FYrich_N. IPR016569. MeTrfase_trithorax. IPR003616. Post-SET_dom. IPR001214. SET_dom. IPR002857. Znf_CXXC. IPR011011. Znf_FYVE_PHD. IPR001965. Znf_PHD. IPR019787. Znf_PHD-finger. IPR013083. Znf_RING/FYVE/PHD. [Graphical view] |
| Pfam | PF05965. FYRC. 1 hit. PF05964. FYRN. 1 hit. PF00628. PHD. 2 hits. PF00856. SET. 1 hit. PF02008. zf-CXXC. 1 hit. [Graphical view] |
| PIRSF | PIRSF010354. Methyltransferase_trithorax. 1 hit. |
| SMART | SM00384. AT_hook. 3 hits. SM00297. BROMO. 1 hit. SM00542. FYRC. 1 hit. SM00541. FYRN. 1 hit. SM00249. PHD. 4 hits. SM00508. PostSET. 1 hit. SM00317. SET. 1 hit. [Graphical view] |
| SUPFAM | SSF47370. Bromodomain. 1 hit. SSF57903. FYVE_PHD_ZnF. 2 hits. |
| PROSITE | PS00633. BROMODOMAIN_1. False negative. PS50014. BROMODOMAIN_2. 1 hit. PS51543. FYRC. 1 hit. PS51542. FYRN. 1 hit. PS50868. POST_SET. 1 hit. PS50280. SET. 1 hit. PS51058. ZF_CXXC. 1 hit. PS01359. ZF_PHD_1. 3 hits. PS50016. ZF_PHD_2. 3 hits. [Graphical view] |
| ProtoNet | Search... |
Other | |
| NextBio | 374214. |
| SOURCE | Search... |
Entry information
| Entry name | MLL1_MOUSE | ||||||||
| Accession | Primary (citable) accession number: P55200 Secondary accession number(s): E9QNE7, Q3UEU1, Q3USE7 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
Relevant documents
| MGD cross-references Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |

Clusters with
