ID   SET1_CHAGB              Reviewed;        1076 AA.
AC   Q2GWF3;
DT   09-JAN-2007, integrated into UniProtKB/Swiss-Prot.
DT   21-MAR-2006, sequence version 1.
DT   16-JUN-2009, entry version 26.
DE   RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific;
DE            EC=2.1.1.43;
DE   AltName: Full=COMPASS component SET1;
DE   AltName: Full=SET domain-containing protein 1;
GN   Name=SET1; ORFNames=CHGG_07701;
OS   Chaetomium globosum (Soil fungus).
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina;
OC   Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae;
OC   Chaetomium.
OX   NCBI_TaxID=38033;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 6205 / CBS 148.51 / DSM 1962 / IFO 6347 / NRRL 1970;
RA   Birren B.W., Lander E.S., Galagan J.E., Devon K., Nusbaum C.,
RA   Ma L.-J., Jaffe D.B., Butler J., Alvarez P., Gnerre S., Grabherr M.,
RA   Kleber M., Mauceli E.W., Brockman W., Rounsley S., Young S.K.,
RA   LaButti K., Pushparaj V., DeCaprio D., Crawford M., Koehrsen M.,
RA   Engels R., Montgomery P., Pearson M., Howarth C., Kodira C.D.,
RA   Yandava C., Zeng Q., Alvarado L., Oleary S., Untereiner W.;
RT   "Annotation of the Chaetomium globosum CBS 148.51 genome.";
RL   Submitted (MAR-2005) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that
CC       specifically mono-, di- and trimethylates histone H3 to form
CC       H3K4me1/2/3, which subsequently plays a role in telomere length
CC       maintenance and transcription elongation regulation (By
CC       similarity).
CC   -!- CATALYTIC ACTIVITY: S-adenosyl-L-methionine + histone L-lysine =
CC       S-adenosyl-L-homocysteine + histone N(6)-methyl-L-lysine.
CC   -!- SUBUNIT: Component of the COMPASS (Set1C) complex (By similarity).
CC   -!- SUBCELLULAR LOCATION: Nucleus (Probable).
CC   -!- SIMILARITY: Contains 1 post-SET domain.
CC   -!- SIMILARITY: Contains 1 SET domain.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; CH408033; EAQ86448.1; -; Genomic_DNA.
DR   RefSeq; XP_001225357.1; -.
DR   GeneID; 4393302; -.
DR   BRENDA; 2.1.1.43; 81575.
DR   GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:EC.
DR   GO; GO:0000166; F:nucleotide binding; IEA:InterPro.
DR   GO; GO:0016568; P:chromatin modification; IEA:UniProtKB-KW.
DR   InterPro; IPR012677; a_b_plait_nuc_bd.
DR   InterPro; IPR015722; MLL.
DR   InterPro; IPR003616; Post-SET_Zn_bd.
DR   InterPro; IPR001214; SET.
DR   Gene3D; G3DSA:3.30.70.330; a_b_plait_nuc_bd; 1.
DR   PANTHER; PTHR22884:SF10; MLL; 1.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00317; SET; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50280; SET; 1.
PE   3: Inferred from homology;
KW   Chromatin regulator; Chromosomal protein; Complete proteome;
KW   Methyltransferase; Nucleus; S-adenosyl-L-methionine; Transferase.
FT   CHAIN         1   1076       Histone-lysine N-methyltransferase, H3
FT                                lysine-4 specific.
FT                                /FTId=PRO_0000269769.
FT   DOMAIN      933   1055       SET.
FT   DOMAIN     1060   1076       Post-SET.
SQ   SEQUENCE   1076 AA;  121756 MW;  B10CF6093311D63D CRC64;
     MGTIWLISAA SDDQDDAPPS DPRLAKGGRL NYINVDFHLP KARLRHAPYN LKPYKYDPKT
     SCGPGPPTQV VVTGFNPLIA FSKVTAVFAS FGDIAESSNK MHPDTGSYLG FATFRYRDSK
     PSRSRPISIT GADAAKRAIR AMHGKRIEAN MVRVEYDAEG KKSSRMLVEV LQKGNETTPA
     LGEPRIPTGP KPKEVAPGPP PTAPKGPAAH RGGLMNVQGV WVPKPRPDSI IEVEPVIGHL
     KHDPYIFVGH EHVPVMPTTV AHMKRRLKTY MFEDIRADRT GYYIVFQDSG YGRAEAERCF
     RSADRTAFFT YTMVMVLHLY GTDGKASHAH ASDTRRRTRT PERKHVDEAR PHREHDRSRR
     DEERARRDEQ DRRRREDEAD LEEEKRQRAK NYDPVLEATD VVLRGMKEQL IKIIRTKIAA
     PALFNFLDPV NHLAKRRRLN LEDPHSARLP PIVLDEFEDR SPVSTPNSRA DPIERRTARL
     DVSALPRIRK VKNAGLNTRK HGFNDPFARN RPTARRTAFR SLHYRLRSDS EGESEDEAEN
     RTSLGRDTEE PESRPRSRMS SDDEGDKDDY ASWGPGDDDS MTEASFALGD GPGLAKKRKL
     DLQVETAIKR QKKTDEELFG VTIDRIGTEF PSREDSLEDV LPPGPGGGEE KDIGSSRLPT
     PLLQEGKAKK KAPAKTKRKS KKQLFEEREA LKRQQQEIFE REALQSEDVD EVIPTPEPES
     EPKKSKVEKE KEKEEKVEKP ALDENLYPSQ KVSVLELPHD FRLDVGSLEE LALGPNDQPD
     LDRLRKRFGR GKIDDPELWV WRRDRIRELN STDGSAKTPV RIEGYYVPNP TGCARAEGVK
     KILNSEKSKY LPHHIKVKKA REERQAQNGK NAKDSVLAAA EAARLAAESL VAKGNSRANR
     ANNRRFVADL NDQRKTLGQD SDVLRFNQLK KRKKPVKFAR SAIHNWGLYA MENIPKDDMI
     IEYVGEEVRQ QIAELRENRY LKSGIGSSYL FRIDDNTVID ATKKGGIARF INHSCMPNCT
     AKIIKVEGSK RIVIYALRDI AQNEELTYDY KFERELGSTD RIPCLCGTAA CKGFLN
//