Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot O08550 (MLL4_MOUSE)

Last modified June 16, 2009. Version 71. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Histone-lysine N-methyltransferase MLL4
    EC=2.1.1.43
Alternative name(s):
    Myeloid/lymphoid or mixed-lineage leukemia protein 4 homolog
    WW domain-binding protein 7
      Short name=WBP-7
    Trithorax homolog 2
Gene names
Name: Wbp7
Synonyms: Mll2, Trx2
OrganismMus musculus (Mouse)
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMus

Protein attributes

Sequence length2713 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Histone methyltransferase. Methylates 'Lys-4' of histone H3. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation By similarity.

Catalytic activity

S-adenosyl-L-methionine + histone L-lysine = S-adenosyl-L-homocysteine + histone N(6)-methyl-L-lysine.

Subunit structure

Component of the MLL3/MLL4 complex, at least composed of MLL3, MLL4, ASH2L, RBBP5, DPY30, WDR5, NCOA6, KDM6A (or KDM6B), PAXIP1/PTIP and C16orf53/PA1 By similarity.

Subcellular location

Nucleus By similarity.

Post-translational modification

Phosphorylated upon DNA damage, probably by ATM or ATR. Ref.3 Ref.4

Sequence similarities

Belongs to the histone-lysine methyltransferase family. TRX/MLL subfamily.

Contains 3 A.T hook DNA-binding domains.

Contains 1 CXXC-type zinc finger.

Contains 3 PHD-type zinc fingers.

Contains 1 post-SET domain.

Contains 1 SET domain.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 27132713Histone-lysine N-methyltransferase MLL4
PRO_0000124882

Regions

Domain2572 – 2693122SET
Domain2697 – 271317Post-SET
DNA binding37 – 448A.T hook 1
DNA binding110 – 1178A.T hook 2
DNA binding357 – 3659A.T hook 3
Zinc finger964 – 101148CXXC-type
Zinc finger1207 – 125852PHD-type 1
Zinc finger1255 – 130955PHD-type 2
Zinc finger1341 – 140262PHD-type 3
Compositional bias6 – 106101Gly-rich
Compositional bias272 – 30029Gly-rich
Compositional bias347 – 41064Glu-rich
Compositional bias414 – 776363Pro-rich
Compositional bias1814 – 2296483Pro-rich

Amino acid modifications

Modified residue1591Phosphothreonine By similarity
Modified residue3201Phosphoserine Ref.4
Modified residue8261Phosphoserine By similarity
Modified residue8491Phosphoserine By similarity
Modified residue8661Phosphoserine By similarity
Modified residue10371Phosphoserine By similarity
Modified residue10401Phosphoserine By similarity
Modified residue11701Phosphoserine By similarity
Modified residue19261Phosphoserine By similarity
Modified residue19271Phosphoserine Ref.3
Modified residue19311Phosphothreonine By similarity

Sequences

Sequence LengthMass (Da)Tools
O08550-1 [UniParc].

Last modified January 9, 2007. Version 2.
Checksum: B24BCA2D019EB055

FASTA2,713294,836
        10         20         30         40         50         60 
MAAAAGGGSC PGPGSARVRF PGRPLGCGGG GGRGGRGNGA ERVRVALRRG GGAAGPGGAE 

        70         80         90        100        110        120 
PGEDTALLRL LGLRRGLRRL RRLWAGARVQ RGRGRGRGRG WGPNRGCMPE EESSDGESEE 

       130        140        150        160        170        180 
EEFQGFHSDE DVAPSSLRSA LRSQRGRAPR GRGRKHKTTP LPPRLADVTP VPPKAPTRKR 

       190        200        210        220        230        240 
GEEGTERMVQ ALTELLRRSQ APQPPRSRAR AREPSTPRRS RGRPPGRPAG PCRKKQQAVV 

       250        260        270        280        290        300 
LAEAAVTIPK PEPPPPVVPV KNKAGSWKCK EGPGPGPGTP KRGGQPGRGG RGGRGRGRGG 

       310        320        330        340        350        360 
LPLMIKFVSK AKKVKMGQLS QELESGQGHG QRGESWQDAP QRKDGDEPER GSCRKKQEQK 

       370        380        390        400        410        420 
LEEEEEEEEK EGEEKEEKDD NEDNNKQEEE EETERAVAEE EAMLAKEKEE AKLPSPPLTP 

       430        440        450        460        470        480 
PVPSPPPPLP PPSTSPPPPA SPLPPPVSPP PPLSPPPYPA PEKQEESPPL VPATCSRKRG 

       490        500        510        520        530        540 
RPPLTPSQRA EREAARSGPE GTLSPTPNPS TTTGSPLEDS PTVVPKSTTF LKNIRQFIMP 

       550        560        570        580        590        600 
VVSARSSRVI KTPRRFMDED PPKPPKVEAS IVRPPVATSP PAPQEPVPVS SPPRVPTPPS 

       610        620        630        640        650        660 
TPVPLPEKRR SILREPTFRW TSLTRELPPP PPAPPPAPSP PPAPATPSRR PLLLRAPQFT 

       670        680        690        700        710        720 
PSEAHLKIYE SVLTPPPLGA LETPEPELPP ADDSPAEPEP RAVGRTNHLS LPRFVPVVTS 

       730        740        750        760        770        780 
PVKVEVPPHG APALSEGQQL QLQQPPQALQ TQLLPQALPP QQPQAQPPPS PQHTPPLEKA 

       790        800        810        820        830        840 
RVASLGSLPL SGVEEKMFSL LKRAKVQLFK IDQQQQQKVA ASMPLSPAVQ TEEAVGTVKQ 

       850        860        870        880        890        900 
TPDRGCVRSE DESMEAKRDR ASGPESPLQG PRIKHVCRHA AVALGQARAM VPEDVPRLSA 

       910        920        930        940        950        960 
LPLRDRQDLA TEDTSSASET ESVPSRSQRE KVESAGPGGD SEPTGSTGAL AHTPRRSLPS 

       970        980        990       1000       1010       1020 
HHGKKMRMAR CGHCRGCLRV QDCGSCVNCL DKPKFGGPNT KKQCCVYRKC DKIEARKMER 

      1030       1040       1050       1060       1070       1080 
LAKKGRTIVK TLLPWDSDES PEASPGPPGP RRGAGAGGSR EEVGATPGPE EQDSLLLQRK 

      1090       1100       1110       1120       1130       1140 
SARRCVKQRP SYDVFEDSDD SEPGGPPAPR RRTPREHELP VLEPEEQSRP RKPTLQPVLQ 

      1150       1160       1170       1180       1190       1200 
LKARRRLDKD ALAPGPFASF PNGWTGKQKS PDGVHRVRVD FKEDCDLENV WLMGGLSVLT 

      1210       1220       1230       1240       1250       1260 
SVPGGPPMVC LLCASKGLHE LVFCQVCCDP FHPFCLEEAE RPSPQHRDTW CCRRCKFCHV 

      1270       1280       1290       1300       1310       1320 
CGRKGRGSKH LLECERCRHA YHPACLGPSY PTRATRRRRH WICSACVRCK SCGATPGKNW 

      1330       1340       1350       1360       1370       1380 
DVEWSGDYSL CPRCTELYEK GNYCPICTRC YEDNDYESKM MQCAQCDHWV HAKCEGLSDE 

      1390       1400       1410       1420       1430       1440 
DYEILSGLPD SVLYTCGPCA GATQPRWREA LSGALQGGLR QVLQGLLSSK VAGPLLLCTQ 

      1450       1460       1470       1480       1490       1500 
CGQDGKQLHP GPCDLQAVGK RFEEGLYKSV HSFMEDVVAI LMRHSEEGET PERRAGSQMK 

      1510       1520       1530       1540       1550       1560 
GLLLKLLESA FCWFDAHDPK YWRRSTRLPN GVLPNAVLPP SLDHVYAQWR QQESETPESG 

      1570       1580       1590       1600       1610       1620 
QPPGDPSAAF QSKDPAAFSH LDDPRQCALC LKYGDADSKE AGRLLYIGQN EWTHVNCAIW 

      1630       1640       1650       1660       1670       1680 
SAEVFEENDG SLKNVHAAVA RGRQMRCELC LKPGATVGCC LSSCLSNFHF MCARASYCIF 

      1690       1700       1710       1720       1730       1740 
QDDKKVFCQK HTDLLDGKEI VTPDGFDVLR RVYVDFEGIN FKRKFLTGLE PDVINVLIGS 

      1750       1760       1770       1780       1790       1800 
IRINSLGTLS DLSDCEGRLF PIGYQCSRLY WSTVDARRRC WYRCRILEYR PWGPREEPVH 

      1810       1820       1830       1840       1850       1860 
LEAAEENQTI VHSPTPSSDT DSLIPGDPVH HSPIQNLDPP LRTDSSNGPP PTPRSFSGAR 

      1870       1880       1890       1900       1910       1920 
IKVPNYSPSR RPLGGVSFGP LPSPGSPSSL THHIPTVGDS DFPAPPRRSR RPSPLATRPP 

      1930       1940       1950       1960       1970       1980 
PSRRTSSPLR TSPQLRVPLS TSVTALTPTS GELAPPDLAP SPLPPSEDLG PDFEDMEVVS 

      1990       2000       2010       2020       2030       2040 
GLSAADLDFA ASLLGTEPFQ EEIVAAGAVG SSQGGPGDSS EEEASPTTHY VHFPVTVVSG 

      2050       2060       2070       2080       2090       2100 
PALAPSSLAG APRIEQLDGV DDGTDSEAEA VQQPRGQGTP PSGPGVGRGG VLGAAGDRAQ 

      2110       2120       2130       2140       2150       2160 
PPEDLPSEIV DFVLKNLGGP GEGAAGPRED SLPSAPPLAN GSQPPQSLST SPADPTRTFA 

      2170       2180       2190       2200       2210       2220 
WLPGAPGVRV LSLGPAPEPP KPATSKIILV NKLGQVFVKM AGEGEPVAPP VKQPPLPPII 

      2230       2240       2250       2260       2270       2280 
PPTAPTSWTL PPGPLLSVLP VVGVGVVRPA PPPPPPPLTL VFSSGPPSPP RQAIRVKRVS 

      2290       2300       2310       2320       2330       2340 
TFSGRSPPVP PPNKTPRLDE DGESLEDAHH VPGISGSGFS RVRMKTPTVR GVLDLNNPGE 

      2350       2360       2370       2380       2390       2400 
QPEEESPGRP QDRCPLLPLA EAPSQALDGS SDLLFESQWH HYSAGEASSS EEEPPSPEDK 

      2410       2420       2430       2440       2450       2460 
ENQVPKRVGP HLRFEISSDD GFSVEAESLE VAWRTLIEKV QEARGHARLR HLSFSGMSGA 

      2470       2480       2490       2500       2510       2520 
RLLGIHHDAV IFLAEQLPGA QRCQHYKFRY HQQGEGQEEP PLNPHGAARA EVYLRKCTFD 

      2530       2540       2550       2560       2570       2580 
MFNFLASQHR VLPEGATCDE EEDEVQLRST RRATSLELPM AMRFRHLKKT SKEAVGVYRS 

      2590       2600       2610       2620       2630       2640 
AIHGRGLFCK RNIDAGEMVI EYSGIVIRSV LTDKREKFYD GKGIGCYMFR MDDFDVVDAT 

      2650       2660       2670       2680       2690       2700 
MHGNAARFIN HSCEPNCFSR VIHVEGQKHI VIFALRRILR GEELTYDYKF PIEDASNKLP 

      2710 
CNCGAKRCRR FLN 

« Hide

References

« Hide 'large scale' references
[1]"Murine MLL2 gene and its expression."
Yoshida K.
Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
[2]"FBP WW domains and the Abl SH3 domain bind to a specific class of proline-rich ligands."
Bedford M.T., Chan D.C., Leder P.
EMBO J. 16:2376-2383(1997) [PubMed: 9171351] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 379-657.
[3]"Large-scale phosphorylation analysis of mouse liver."
Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007) [PubMed: 17242355] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1927, MASS SPECTROMETRY.
Tissue: Liver.
[4]"ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage."
Matsuoka S., Ballif B.A., Smogorzewska A., McDonald E.R. III, Hurov K.E., Luo J., Bakalarski C.E., Zhao Z., Solimini N., Lerenthal Y., Shiloh Y., Gygi S.P., Elledge S.J.
Science 316:1160-1166(2007) [PubMed: 17525332] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-320, MASS SPECTROMETRY.
+Additional computationally mapped references.

Cross-references

Sequence databases

AB182318 mRNA. Translation: BAD81031.1.
U92455 mRNA. Translation: AAC53192.1. Sequence problems.
IPIIPI00229651.
RefSeqNP_083550.2.
UniGeneMm.168688

3D structure databases

ModBaseSearch...

Genome annotation databases

EnsemblENSMUSG00000006307. Mus musculus. [Contig view]
GeneID75410.
KEGGmmu:75410.

Organism-specific databases

MGIMGI:109565. Wbp7.

Phylogenomic databases

HOVERGENO08550.

Gene expression databases

ArrayExpressO08550.
BgeeO08550.
CleanExMM_WBP7.
GermOnlineENSMUSG00000006307. Mus musculus.

Family and domain databases

InterProIPR017956. AT_hook_DNA-bd_CS.
IPR003889. FYrich_C.
IPR018516. FYrich_C_sg.
IPR003888. FYrich_N.
IPR018518. FYrich_N_sg.
IPR015722. MLL.
IPR003616. Post-SET_Zn_bd.
IPR001214. SET.
IPR019786. Zinc_finger_PHD-type_CS.
IPR002857. Znf_CXXC.
IPR001965. Znf_PHD.
IPR019787. Znf_PHD-finger.
[Graphical view]
PANTHERPTHR22884:SF10. MLL. 1 hit.
PfamPF05965. FYRC. 1 hit.
PF05964. FYRN. 1 hit.
PF00628. PHD. 3 hits.
PF00856. SET. 1 hit.
PF02008. zf-CXXC. 1 hit.
[Graphical view]
SMARTSM00384. AT_hook. 3 hits.
SM00542. FYRC. 1 hit.
SM00541. FYRN. 1 hit.
SM00249. PHD. 4 hits.
SM00508. PostSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
PROSITEPS50868. POST_SET. 1 hit.
PS50280. SET. 1 hit.
PS51058. ZF_CXXC. 1 hit.
PS01359. ZF_PHD_1. 3 hits.
PS50016. ZF_PHD_2. 3 hits.
[Graphical view]
ProtoNetSearch...

Other Resources

NextBio342938.
SOURCESearch...

Entry information

Entry nameMLL4_MOUSE
AccessionPrimary (citable) accession number: O08550
Secondary accession number(s): Q5NU09
Entry history
Integrated into UniProtKB/Swiss-Prot: December 1, 2000
Last sequence update: January 9, 2007
Last modified: June 16, 2009
This is version 71 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents