ID   B4GM96_DROPE            Unreviewed;      1548 AA.
AC   B4GM96;
DT   23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT   23-SEP-2008, sequence version 1.
DT   24-JAN-2024, entry version 94.
DE   SubName: Full=GL12290 {ECO:0000313|EMBL:EDW37970.1};
GN   Name=Dper\GL12290 {ECO:0000313|EMBL:EDW37970.1};
GN   ORFNames=Dper_GL12290 {ECO:0000313|EMBL:EDW37970.1};
OS   Drosophila persimilis (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7234 {ECO:0000313|Proteomes:UP000008744};
RN   [1] {ECO:0000313|EMBL:EDW37970.1, ECO:0000313|Proteomes:UP000008744}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=MSH-3 / Tucson 14011-0111.49
RC   {ECO:0000313|Proteomes:UP000008744};
RX   PubMed=17994087; DOI=10.1038/nature06341;
RG   Drosophila 12 genomes consortium;
RA   Clark A.G., Eisen M.B., Smith D.R., Bergman C.M., Oliver B., Markow T.A.,
RA   Kaufman T.C., Kellis M., Gelbart W., Iyer V.N., Pollard D.A., Sackton T.B.,
RA   Larracuente A.M., Singh N.D., Abad J.P., Abt D.N., Adryan B., Aguade M.,
RA   Akashi H., Anderson W.W., Aquadro C.F., Ardell D.H., Arguello R.,
RA   Artieri C.G., Barbash D.A., Barker D., Barsanti P., Batterham P.,
RA   Batzoglou S., Begun D., Bhutkar A., Blanco E., Bosak S.A., Bradley R.K.,
RA   Brand A.D., Brent M.R., Brooks A.N., Brown R.H., Butlin R.K., Caggese C.,
RA   Calvi B.R., Bernardo de Carvalho A., Caspi A., Castrezana S.,
RA   Celniker S.E., Chang J.L., Chapple C., Chatterji S., Chinwalla A.,
RA   Civetta A., Clifton S.W., Comeron J.M., Costello J.C., Coyne J.A., Daub J.,
RA   David R.G., Delcher A.L., Delehaunty K., Do C.B., Ebling H., Edwards K.,
RA   Eickbush T., Evans J.D., Filipski A., Findeiss S., Freyhult E., Fulton L.,
RA   Fulton R., Garcia A.C., Gardiner A., Garfield D.A., Garvin B.E., Gibson G.,
RA   Gilbert D., Gnerre S., Godfrey J., Good R., Gotea V., Gravely B.,
RA   Greenberg A.J., Griffiths-Jones S., Gross S., Guigo R.,
RA   Gustafson E.A.,
RA   Haerty W., Hahn M.W., Halligan D.L., Halpern A.L., Halter G.M., Han M.V.,
RA   Heger A., Hillier L., Hinrichs A.S., Holmes I., Hoskins R.A., Hubisz M.J.,
RA   Hultmark D., Huntley M.A., Jaffe D.B., Jagadeeshan S., Jeck W.R.,
RA   Johnson J., Jones C.D., Jordan W.C., Karpen G.H., Kataoka E.,
RA   Keightley P.D., Kheradpour P., Kirkness E.F., Koerich L.B., Kristiansen K.,
RA   Kudrna D., Kulathinal R.J., Kumar S., Kwok R., Lander E., Langley C.H.,
RA   Lapoint R., Lazzaro B.P., Lee S.J., Levesque L., Li R., Lin C.F., Lin M.F.,
RA   Lindblad-Toh K., Llopart A., Long M., Low L., Lozovsky E., Lu J., Luo M.,
RA   Machado C.A., Makalowski W., Marzo M., Matsuda M., Matzkin L.,
RA   McAllister B., McBride C.S., McKernan B., McKernan K., Mendez-Lago M.,
RA   Minx P., Mollenhauer M.U., Montooth K., Mount S.M., Mu X., Myers E.,
RA   Negre B., Newfeld S., Nielsen R., Noor M.A., O'Grady P., Pachter L.,
RA   Papaceit M., Parisi M.J., Parisi M., Parts L., Pedersen J.S., Pesole G.,
RA   Phillippy A.M., Ponting C.P., Pop M., Porcelli D., Powell J.R.,
RA   Prohaska S., Pruitt K., Puig M., Quesneville H., Ram K.R., Rand D.,
RA   Rasmussen M.D., Reed L.K., Reenan R., Reily A., Remington K.A.,
RA   Rieger T.T., Ritchie M.G., Robin C., Rogers Y.H., Rohde C., Rozas J.,
RA   Rubenfield M.J., Ruiz A., Russo S., Salzberg S.L., Sanchez-Gracia A.,
RA   Saranga D.J., Sato H., Schaeffer S.W., Schatz M.C., Schlenke T.,
RA   Schwartz R., Segarra C., Singh R.S., Sirot L., Sirota M., Sisneros N.B.,
RA   Smith C.D., Smith T.F., Spieth J., Stage D.E., Stark A., Stephan W.,
RA   Strausberg R.L., Strempel S., Sturgill D., Sutton G., Sutton G.G., Tao W.,
RA   Teichmann S., Tobari Y.N., Tomimura Y., Tsolas J.M., Valente V.L.,
RA   Venter E., Venter J.C., Vicario S., Vieira F.G., Vilella A.J.,
RA   Villasante A., Walenz B., Wang J., Wasserman M., Watts T., Wilson D.,
RA   Wilson R.K., Wing R.A., Wolfner M.F., Wong A., Wong G.K., Wu C.I., Wu G.,
RA   Yamamoto D., Yang H.P., Yang S.P., Yorke J.A., Yoshida K., Zdobnov E.,
RA   Zhang P., Zhang Y., Zimin A.V., Baldwin J., Abdouelleil A., Abdulkadir J.,
RA   Abebe A., Abera B., Abreu J., Acer S.C., Aftuck L., Alexander A., An P.,
RA   Anderson E., Anderson S., Arachi H., Azer M., Bachantsang P., Barry A.,
RA   Bayul T., Berlin A., Bessette D., Bloom T., Blye J., Boguslavskiy L.,
RA   Bonnet C., Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S.,
RA   Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., Costello M.,
RA   D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., Dhargay N., Dooley K.,
RA   Dooley E., Doricent M., Dorje P., Dorjee K., Dupes A., Elong R., Falk J.,
RA   Farina A., Faro S., Ferguson D., Fisher S., Foley C.D., Franke A.,
RA   Friedrich D., Gadbois L., Gearin G., Gearin C.R., Giannoukos G., Goode T.,
RA   Graham J., Grandbois E., Grewal S., Gyaltsen K., Hafez N., Hagos B.,
RA   Hall J., Henson C., Hollinger A., Honan T., Huard M.D., Hughes L.,
RA   Hurhula B., Husby M.E., Kamat A., Kanga B., Kashin S., Khazanovich D.,
RA   Kisner P., Lance K., Lara M., Lee W., Lennon N., Letendre F., LeVine R.,
RA   Lipovsky A., Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y.,
RA   Lubonja R., Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C.,
RA   McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O.,
RA   Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., Mulrain L.,
RA   Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., Nicol R., Norbu C.,
RA   Norbu N., Novod N., O'Neill B., Osman S., Markiewicz E., Oyono O.L.,
RA   Patti C., Phunkhang P., Pierre F., Priest M., Raghuraman S., Rege F.,
RA   Reyes R., Rise C., Rogov P., Ross K., Ryan E., Settipalli S., Shea T.,
RA   Sherpa N., Shi L., Shih D., Sparrow T., Spaulding J., Stalker J.,
RA   Stange-Thomann N., Stavropoulos S., Stone C., Strader C., Tesfaye S.,
RA   Thomson T., Thoulutsang Y., Thoulutsang D., Topham K., Topping I.,
RA   Tsamla T., Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M.,
RA   Wilkinson J., Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D.,
RA   Zimmer A.,
RA   Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J.,
RA   Chin C., Gnerre S., Grabherr M., Kleber M., Mauceli E., MacCallum I.;
RT   "Evolution of genes and genomes on the Drosophila phylogeny.";
RL   Nature 450:203-218(2007).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CH479185; EDW37970.1; -; Genomic_DNA.
DR   RefSeq; XP_002019336.1; XM_002019300.1.
DR   STRING; 7234.B4GM96; -.
DR   EnsemblMetazoa; FBtr0177905; FBpp0176397; FBgn0149897.
DR   GeneID; 6594368; -.
DR   KEGG; dpe:6594368; -.
DR   eggNOG; KOG1080; Eukaryota.
DR   HOGENOM; CLU_001226_3_0_1; -.
DR   OMA; RMIEMTA; -.
DR   OrthoDB; 950362at2759; -.
DR   PhylomeDB; B4GM96; -.
DR   Proteomes; UP000008744; Unassembled WGS sequence.
DR   GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR   GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:InterPro.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR   CDD; cd19169; SET_SETD1; 1.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   InterPro; IPR024657; COMPASS_Set1_N-SET.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR044570; Set1-like.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR037841; SET_SETD1A/B.
DR   PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR   PANTHER; PTHR45814:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR   Pfam; PF11764; N-SET; 1.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM01291; N-SET; 1.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50280; SET; 1.
PE   4: Predicted;
KW   Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008744};
KW   RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT   DOMAIN          1409..1526
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
FT   DOMAIN          1532..1548
FT                   /note="Post-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50868"
FT   REGION          26..270
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          291..375
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          388..510
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          703..883
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          905..963
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          968..987
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1104..1133
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1170..1215
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        64..81
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        82..137
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        146..255
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        256..270
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        291..308
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        340..360
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        426..443
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        453..472
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        474..496
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        716..737
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        770..793
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        795..821
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        831..859
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        925..940
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1189..1208
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1548 AA;  176534 MW;  9A78E442F5406DF3 CRC64;
     MGKILNVFCD PFGAILRKHI ETLTNPVAPK PVLGATTTKA PPHPGTAVGS QPDLADYSHD
     YSEKSAVDNL NSAYSSFPKN SDIGERSRDR NWERDRERER DRDRDRDRYY KDSSQHSAER
     SYDRDRAMRD KYGSSLRHDR HFYRRRSRDK SPDPSRDSRD RLFTGRERDR ARDPDARSRD
     YHRSKERDYF RERNRSRDHS RDRSREKRDL RFSVSKERDY RERDRDRDRD RSPDRDRRDR
     VSIKYKKRFT SHEYAETDIH SSSSSKTHQY YTAASASGLP AGSYAYSSHA YSMTDSTQSW
     SEHKSWNAPQ PDFQPKPKPP PPPPPEEEEN WDDPPVLPQK FEQQTVSLLE SASTKLQSPK
     AGTAVAEPNA ETENVDLDTR IALIFKGKTF GNAPPFLQMD SSDSETDKAK PEGGVEANET
     RSLSDSNNSD NKKKCDKKIS ELQQPHGASD ISSDDDIMVK NKRCKPGFTQ CEKDDDNMSL
     SSLSSHEESP TALANTADRG NKKPLSPYMY PNSSNQHPYY YHPSDYGHYP SGISSNPVLT
     SRYFSNPAYM QPAYLAGVGA FNLDPYVQTY GYQYPPSDQN DEIKENVKKV INYIVEELKQ
     ILKRDVNKRM IEITAFKNFE SWWDEHTLKA RSKPLPDSVE GITNLRQSSA NANLMKENAD
     CEKPPNMNQL TNTQVEMSDF KSFSSIGIRA AMPKLPSFRR ICKHPSPIPN QRISLERDLS
     DQEEMVQRSD SEKEDSNMGG FEAPGSNAKD RVLDREMATI RLPSPEMLSK RKGSASSFFS
     SSSSSSSEAE NEVNDAADGT EKEKSSDDDS FDDDHPPRNL NRKSRLIKKN RRVAFKEPGD
     DNIENIYSDS EDEKRHTDRM HVQKRGRTKK KNIYSDSDAD IESTAGRGFL PALRTKVIST
     IPSDLEDISK DSSFGLDEED PDDKVEAELK TIRKMVDESR RSLTPVPPPD YNEEPVEKYD
     DTDLRKPVYE YDRIYSDSDE EREYQEKRRR NTEYMAQIER EFLEEQQKNN QNSFSEGRSI
     SERKLEIVKK PSGDMADTVD VPEVPLTPGI NILAELVDEI NKPILETNIE QNASKIAEID
     GKLKSNLNSS DVVEATTILN GDVQSFSNTN TSKKGTQSPA SSDGGSSQAS QASQVALEHC
     YSLPPHAQSA SFTDATSGRL GLDAQNKNRF DGQENVAGGS QMQRPGPGRP RKDSTRLQKK
     KKDSVPRQSN AKNKVAPITD ALAELARQTA NFVPYEMYKA RDQNEEMVIL YTFLTKGIDA
     EDIKYIKMSY IEHLQKEPYA MFLNNTHWVD HCTTDRAFWP PPPKKRRKDD ELMRHKTGCA
     RTEGYYKLDV REKAKHKYHH AKANIDDAEN EDRCDEPTAL TNHHHNKLIS KMQGISREAR
     SNQRRLLTAF GSMGESELLK FNQLKFRKKQ LKFAKSAIHD WGLFAMEPIA ADEMVIEYVG
     QMIRPVVADL RETKYEAIGI GSSYLFRIDM ETIIDATKCG NLARFINHSC NPNCYAKVIT
     IESEKKIVIY SKQPIGVNEE ITYDYKFPLE DEKIPCLCGA QGCRGTLN
//