ID C5YKQ5_SORBI Unreviewed; 1260 AA. AC C5YKQ5; DT 01-SEP-2009, integrated into UniProtKB/TrEMBL. DT 01-SEP-2009, sequence version 1. DT 24-JAN-2024, entry version 88. DE RecName: Full=Histone-lysine N-methyltransferase {ECO:0008006|Google:ProtNLM}; GN ORFNames=SORBI_3007G121700 {ECO:0000313|EMBL:EES13798.1}; OS Sorghum bicolor (Sorghum) (Sorghum vulgare). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; PACMAD clade; OC Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum. OX NCBI_TaxID=4558 {ECO:0000313|EMBL:EES13798.1, ECO:0000313|Proteomes:UP000000768}; RN [1] {ECO:0000313|EMBL:EES13798.1, ECO:0000313|Proteomes:UP000000768} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768}; RX PubMed=19189423; DOI=10.1038/nature07723; RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J., RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., Schmutz J., RA Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., Chapman J., RA Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., Maher C.A., Martis M., RA Narechania A., Otillar R.P., Penning B.W., Salamov A.A., Wang Y., Zhang L., RA Carpita N.C., Freeling M., Gingle A.R., Hash C.T., Keller B., Klein P., RA Kresovich S., McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman, RA Ware D., Westhoff P., Mayer K.F., Messing J., Rokhsar D.S.; RT "The Sorghum bicolor genome and the diversification of grasses."; RL Nature 457:551-556(2009). RN [2] {ECO:0000313|Proteomes:UP000000768} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768}; RX PubMed=29161754; DOI=10.1111/tpj.13781; RA McCormick R.F., Truong S.K., Sreedasyam A., Jenkins J., Shu S., Sims D., RA Kennedy M., Amirebrahimi M., Weers B.D., McKinley B., Mattison A., RA Morishige D.T., Grimwood J., Schmutz J., Mullet J.E.; RT "The Sorghum bicolor reference genome: improved assembly, gene annotations, RT a transcriptome atlas, and signatures of genome organization."; RL Plant J. 93:338-354(2018). CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00358}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; CM000766; EES13798.1; -; Genomic_DNA. DR RefSeq; XP_002444303.1; XM_002444258.1. DR AlphaFoldDB; C5YKQ5; -. DR STRING; 4558.C5YKQ5; -. DR EnsemblPlants; EES13798; EES13798; SORBI_3007G121700. DR GeneID; 8070040; -. DR Gramene; EES13798; EES13798; SORBI_3007G121700. DR KEGG; sbi:8070040; -. DR eggNOG; KOG1082; Eukaryota. DR HOGENOM; CLU_004556_1_0_1; -. DR InParanoid; C5YKQ5; -. DR OMA; HIAKYQN; -. DR OrthoDB; 5481936at2759; -. DR Proteomes; UP000000768; Chromosome 7. DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0003690; F:double-stranded DNA binding; IBA:GO_Central. DR GO; GO:0042054; F:histone methyltransferase activity; IBA:GO_Central. DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro. DR Gene3D; 2.170.270.10; SET domain; 1. DR Gene3D; 2.30.280.10; SRA-YDG; 1. DR InterPro; IPR025794; H3-K9-MeTrfase_plant. DR InterPro; IPR003616; Post-SET_dom. DR InterPro; IPR007728; Pre-SET_dom. DR InterPro; IPR015947; PUA-like_sf. DR InterPro; IPR001214; SET_dom. DR InterPro; IPR046341; SET_dom_sf. DR InterPro; IPR036987; SRA-YDG_sf. DR InterPro; IPR003105; SRA_YDG. DR PANTHER; PTHR45660; HISTONE-LYSINE N-METHYLTRANSFERASE SETMAR; 1. DR PANTHER; PTHR45660:SF11; OS08G0400200 PROTEIN; 1. DR Pfam; PF05033; Pre-SET; 1. DR Pfam; PF02182; SAD_SRA; 1. DR Pfam; PF00856; SET; 1. DR SMART; SM00468; PreSET; 1. DR SMART; SM00317; SET; 1. DR SMART; SM00466; SRA; 1. DR SUPFAM; SSF88697; PUA domain-like; 1. DR SUPFAM; SSF82199; SET domain; 1. DR PROSITE; PS50868; POST_SET; 1. DR PROSITE; PS50867; PRE_SET; 1. DR PROSITE; PS51575; SAM_MT43_SUVAR39_2; 1. DR PROSITE; PS50280; SET; 1. DR PROSITE; PS51015; YDG; 1. PE 4: Predicted; KW Chromosome {ECO:0000256|ARBA:ARBA00022454}; KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE- KW ProRule:PRU00358}; Reference proteome {ECO:0000313|Proteomes:UP000000768}; KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691}; KW Transferase {ECO:0000256|ARBA:ARBA00022679}. FT DOMAIN 802..953 FT /note="YDG" FT /evidence="ECO:0000259|PROSITE:PS51015" FT DOMAIN 1023..1083 FT /note="Pre-SET" FT /evidence="ECO:0000259|PROSITE:PS50867" FT DOMAIN 1086..1230 FT /note="SET" FT /evidence="ECO:0000259|PROSITE:PS50280" FT DOMAIN 1244..1260 FT /note="Post-SET" FT /evidence="ECO:0000259|PROSITE:PS50868" FT REGION 1..55 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 1260 AA; 135107 MW; B3BA5071A06E30E5 CRC64; MDSYAARLDV ARQPRLPVPG PAAGRVGRAC PPQRADHRGR PPASPRGCGG DTRRAGGLRE HGVVAAAVAA TAIAAGSVGD DAAAGVAAAM VAREETGGSE VGSKRCLPLA AAHPPPKRRA VSARRRFPPQ CGRNVAAPLA TANNASRFGD AGAEVCSAVL DLEKTAASSP LAGGKDGALL GAVSPVSAAT SLVKQLCAVD GDAPMADGGH HGPQPGVVKS SEALRRSGGT AVDGLLDGGS QGVAAVSMED RGKGAWSGEL GRKELVPDAH LQANPRMSLD ERSFPLGYGK DAVLSLLLAG GSSKVCLPLK SRPTDKLGVL EEAVVATHGC ISSVQGHYIK IVSNDGTVQD YELEDGEIPP ELVVQESQVS TGVALHESTD CRHGPSVPEV NAKETFAMQP CNEKTGGDTL QCGEKISSYL VANDVEVVNK SIGSSVNVVA ASLAEDLSKQ NMVGKRVSES AMMNIVDVTA GGCGNGTTMR CVTMHDSAAC RHGDSVPEIS AVETSVMQSS NEKTGGNILQ CGDKKSSCLV TKDIEVMNKS IRISCTAVAG PLAEDSFKQN LMGKRVSESA RMNRASDDVA AATSGNSIMM RSKVRFTPRK VIKLSKVIQK STLDTRHRHC PEDREKETEL SRRGIINKIE DTDKLTKDRV LQAPMTQEKE AATTRGFFGP RKRVKVKVPA HLQMKIASTC ALGCKVKLDD KVASSLDDDD ILKALVVNEG NLELFLNSYS SLTSARCQMK HGSQNADARS KFKMLCRRFE FVCRALVQAV EQNSLKIRRI DLQADRVIRK LPGFTKSGPI VGQVPGVQVG DEFLYRVQLA IVGLHLAYQG GIDTTIYRNG ERIAISIVAS GGYPDELSSS GELIYSGSGG KPAGKKDHED QKLERGNLAL KNCIKTKTPV RVIYGFKAQN NRVGSHSRAR EVSTFTYDGL YRVLDFWMDG QPGSRVFKYK LKKIPGQPKL PMHMAEGMRK SKTRPGLCEI DISQGKEGIP ICVINTVDTE RPAPFRYTTR IRYPFELTKK RHQGCDCTNG CSDSVSCACA VKNGGEIPFN LNGAIVNEKP LIFECGPSCK CPPSCQNKVS QHGLKIPLEV FKTTKTGWGV RSLRSISSGS FICEYVGELL YGNEADERRN SNFLFDIGLN HGDENFCNGL LSDVSDMKSS SSSSQILGDV GFTIDSAECG NIGRFINHSC SPNLYAQNVL WDHDDLRIPH IMFFAAETIP PLQELTYDYN YEIDHVEDVN GRIKFKVCQC GSSGCSGRLY //