ID SET1_USTMA Reviewed; 1468 AA. AC Q4PB36; DT 09-JAN-2007, integrated into UniProtKB/Swiss-Prot. DT 19-JUL-2005, sequence version 1. DT 16-JUN-2009, entry version 35. DE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific; DE EC=2.1.1.43; DE AltName: Full=COMPASS component SET1; DE AltName: Full=SET domain-containing protein 1; GN Name=SET1; ORFNames=UM02677; OS Ustilago maydis (Smut fungus). OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Ustilago. OX NCBI_TaxID=5270; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=521; RX PubMed=17080091; DOI=10.1038/nature05248; RA Kaemper J., Kahmann R., Boelker M., Ma L.-J., Brefort T., RA Saville B.J., Banuett F., Kronstad J.W., Gold S.E., Mueller O., RA Perlin M.H., Woesten H.A.B., de Vries R., Ruiz-Herrera J., RA Reynaga-Pena C.G., Snetselaar K., McCann M., Perez-Martin J., RA Feldbruegge M., Basse C.W., Steinberg G., Ibeas J.I., Holloman W., RA Guzman P., Farman M.L., Stajich J.E., Sentandreu R., RA Gonzalez-Prieto J.M., Kennell J.C., Molina L., Schirawski J., RA Mendoza-Mendoza A., Greilinger D., Muench K., Roessel N., Scherer M., RA Vranes M., Ladendorf O., Vincon V., Fuchs U., Sandrock B., Meng S., RA Ho E.C.H., Cahill M.J., Boyce K.J., Klose J., Klosterman S.J., RA Deelstra H.J., Ortiz-Castellanos L., Li W., Sanchez-Alonso P., RA Schreier P.H., Haeuser-Hahn I., Vaupel M., Koopmann E., Friedrich G., RA Voss H., Schlueter T., Margolis J., Platt D., Swimmer C., Gnirke A., RA Chen F., Vysotskaia V., Mannhaupt G., Gueldener U., RA Muensterkoetter M., Haase D., Oesterheld M., Mewes H.-W., RA Mauceli E.W., DeCaprio D., Wade C.M., Butler J., Young S.K., RA Jaffe D.B., Calvo S.E., Nusbaum C., Galagan J.E., Birren B.W.; RT "Insights from the genome of the biotrophic fungal plant pathogen RT Ustilago maydis."; RL Nature 444:97-101(2006). CC -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that CC specifically mono-, di- and trimethylates histone H3 to form CC H3K4me1/2/3, which subsequently plays a role in telomere length CC maintenance and transcription elongation regulation (By CC similarity). CC -!- CATALYTIC ACTIVITY: S-adenosyl-L-methionine + histone L-lysine = CC S-adenosyl-L-homocysteine + histone N(6)-methyl-L-lysine. CC -!- SUBUNIT: Component of the COMPASS (Set1C) complex (By similarity). CC -!- SUBCELLULAR LOCATION: Nucleus (Probable). CC -!- SIMILARITY: Contains 1 post-SET domain. CC -!- SIMILARITY: Contains 1 SET domain. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AACP01000088; EAK83847.1; -; Genomic_DNA. DR RefSeq; XP_758824.1; -. DR GeneID; 3630740; -. DR KEGG; uma:UM02677.1; -. DR BRENDA; 2.1.1.43; 2320. DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:EC. DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW. DR GO; GO:0016568; P:chromatin modification; IEA:UniProtKB-KW. DR InterPro; IPR015722; MLL. DR InterPro; IPR003616; Post-SET_Zn_bd. DR InterPro; IPR001214; SET. DR PANTHER; PTHR22884:SF10; MLL; 1. DR Pfam; PF00856; SET; 1. DR SMART; SM00508; PostSET; 1. DR SMART; SM00317; SET; 1. DR PROSITE; PS50868; POST_SET; 1. DR PROSITE; PS50280; SET; 1. PE 3: Inferred from homology; KW Chromatin regulator; Chromosomal protein; Complete proteome; KW Methyltransferase; Nucleus; RNA-binding; S-adenosyl-L-methionine; KW Transferase. FT CHAIN 1 1468 Histone-lysine N-methyltransferase, H3 FT lysine-4 specific. FT /FTId=PRO_0000269777. FT DOMAIN 1326 1448 SET. FT DOMAIN 1453 1468 Post-SET. FT COMPBIAS 27 275 Arg-rich. FT COMPBIAS 641 685 Pro-rich. SQ SEQUENCE 1468 AA; 164530 MW; 7CC6632973149DF9 CRC64; MPYSSQQNGY TSASTSRLSE QTSSHSRSSR EDRHLTEKGR RPPSPEARHR SDRDYDRRRS TEYVRDDDYR RSSRSSHDSR YADAYDHWRS ARSAYSPTPR DDRRDEARND LSSTKRHRSP EHSTSRLRHR SPESAHRRQN GTANRLDSKP DRGGDRKTGE ALDSGRSRWS QRAYEYDDWR NERPSARYER YRHDREPHRS RREDEYETKR SRDDSNGNSI YAPTRRSRSR SRSRSRSRDR YRSRDHSRER RRERSRDRSN GTYSSRDDRR PKADRSAHTI KRDEHSTRLN GTSEDSKDLR HESQRRVSAS VQSASEGPAS TPVARAVYIK HAEVDQEAPA PPTTRDYHSC PQRWPDQADS AVRASSAPNG SATAPSRSDR PPANGSSGRH SPRSLPTREK AEEARTSSTR RPSSQTNDNV NNSRDPLTQR KATSERSFGH VLLPHELPVE CRGKNYMATA TYKEGVKSIY KSAADKHLVD VDTRDPRRLG KKSSRYRESL HSASFRWDSN SRGKKPLPPP RNLVLTNLSG LLQPHQILLH ILPHGRIESS KLEIDPKIGQ SLGIFRVTFA HDFDEHGKPL ESMPAGQNPQ HGAKVAKAAC LALNGRMIGQ TRAQAFLDRD GEVIAERIKA KLAENEHKLR PTIVPPAPPA AASSSPATPS TTKQSMPPPQ VPRGPKVFMP AAPSPSYASS PASARANTDR YEYSATSHSR YRSSYEESRK LASSETYHRR RGTEEYDTYN RSKPYADAQV PAGSRSETRK DIKRPDEEIL NELRDKKRPY VHIPRPKNCD IDVTSVEAQL RSTAPIWVRE GQKGFYAAFH TSKEANQCKV VNETLTIGGY TLQVDVRSAP SQHAPSQQIR TPSGKHASVP LSMPAPPKQE RKAIDTGLRP PTADEKLKVD WSAAELQDAV FRMLQKELAD TFVRDVKSRV VGPYLTAYLK PDGEGGKMLA KATMKKPVIP TSINDHGTTL FEATGEARLP SFRKLAGAHP KKKASDADTT TSQAKRDQTD AKKKRGHTHR SKVHRDRDVS SSENESDDME RGMVVAARRN SYTRSKSSTK RRGAAAWLLE ASDAEAGTDD VDSTETDALS RSVSASVEPT GEEQIEVDVG AKAKKIPKVK AATVSKKKGT TAARKKLDVA PPEAVVEADQ GSETATPETD VPIKTAAAKA KVKPAKTSAK AKSALVDPFE AGLVEDSEDC HYLRLALEHL SRTGELASEH TLPDEIELEV EAEEQAMAAG GIPKHSTGSA RTEGYYRIPP EQKAMHLPDR NKATEDVDTS SNAQILQSAR NNRADSRRLV LGIEQHKRET ATDTDIFKFN QLRTRKKQLK FAKSPIHDWG LYAMELIPAG DMVIEYVGEV VRQQVADERE KQYERQGNFS TYLFRVDDDL VVDATHKGNI ARLMNHCCTP NCNAKILTLN GEKRIVLFAK TAIRAGEELT YDYKFQSSAD DEDAIPCLCG SPGCRRFL //