ID   B4GIE1_DROPE            Unreviewed;      1141 AA.
AC   B4GIE1;
DT   23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT   23-SEP-2008, sequence version 1.
DT   27-MAR-2024, entry version 88.
DE   SubName: Full=GL17699 {ECO:0000313|EMBL:EDW36261.1};
GN   Name=Dper\GL17699 {ECO:0000313|EMBL:EDW36261.1};
GN   ORFNames=Dper_GL17699 {ECO:0000313|EMBL:EDW36261.1};
OS   Drosophila persimilis (Fruit fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC   Drosophilidae; Drosophila; Sophophora.
OX   NCBI_TaxID=7234 {ECO:0000313|Proteomes:UP000008744};
RN   [1] {ECO:0000313|EMBL:EDW36261.1, ECO:0000313|Proteomes:UP000008744}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=MSH-3 / Tucson 14011-0111.49
RC   {ECO:0000313|Proteomes:UP000008744};
RX   PubMed=17994087; DOI=10.1038/nature06341;
RG   Drosophila 12 genomes consortium;
RA   Clark A.G., Eisen M.B., Smith D.R., Bergman C.M., Oliver B., Markow T.A.,
RA   Kaufman T.C., Kellis M., Gelbart W., Iyer V.N., Pollard D.A., Sackton T.B.,
RA   Larracuente A.M., Singh N.D., Abad J.P., Abt D.N., Adryan B., Aguade M.,
RA   Akashi H., Anderson W.W., Aquadro C.F., Ardell D.H., Arguello R.,
RA   Artieri C.G., Barbash D.A., Barker D., Barsanti P., Batterham P.,
RA   Batzoglou S., Begun D., Bhutkar A., Blanco E., Bosak S.A., Bradley R.K.,
RA   Brand A.D., Brent M.R., Brooks A.N., Brown R.H., Butlin R.K., Caggese C.,
RA   Calvi B.R., Bernardo de Carvalho A., Caspi A., Castrezana S.,
RA   Celniker S.E., Chang J.L., Chapple C., Chatterji S., Chinwalla A.,
RA   Civetta A., Clifton S.W., Comeron J.M., Costello J.C., Coyne J.A., Daub J.,
RA   David R.G., Delcher A.L., Delehaunty K., Do C.B., Ebling H., Edwards K.,
RA   Eickbush T., Evans J.D., Filipski A., Findeiss S., Freyhult E., Fulton L.,
RA   Fulton R., Garcia A.C., Gardiner A., Garfield D.A., Garvin B.E., Gibson G.,
RA   Gilbert D., Gnerre S., Godfrey J., Good R., Gotea V., Gravely B.,
RA   Greenberg A.J., Griffiths-Jones S., Gross S., Guigo R., Gustafson E.A.,
RA   Haerty W., Hahn M.W., Halligan D.L., Halpern A.L., Halter G.M., Han M.V.,
RA   Heger A., Hillier L., Hinrichs A.S., Holmes I., Hoskins R.A., Hubisz M.J.,
RA   Hultmark D., Huntley M.A., Jaffe D.B., Jagadeeshan S., Jeck W.R.,
RA   Johnson J., Jones C.D., Jordan W.C., Karpen G.H., Kataoka E.,
RA   Keightley P.D., Kheradpour P., Kirkness E.F., Koerich L.B., Kristiansen K.,
RA   Kudrna D., Kulathinal R.J., Kumar S., Kwok R., Lander E., Langley C.H.,
RA   Lapoint R., Lazzaro B.P., Lee S.J., Levesque L., Li R., Lin C.F., Lin M.F.,
RA   Lindblad-Toh K., Llopart A., Long M., Low L., Lozovsky E., Lu J., Luo M.,
RA   Machado C.A., Makalowski W., Marzo M., Matsuda M., Matzkin L.,
RA   McAllister B., McBride C.S., McKernan B., McKernan K., Mendez-Lago M.,
RA   Minx P., Mollenhauer M.U., Montooth K., Mount S.M., Mu X., Myers E.,
RA   Negre B., Newfeld S., Nielsen R., Noor M.A., O'Grady P., Pachter L.,
RA   Papaceit M., Parisi M.J., Parisi M., Parts L., Pedersen J.S., Pesole G.,
RA   Phillippy A.M., Ponting C.P., Pop M., Porcelli D., Powell J.R.,
RA   Prohaska S., Pruitt K., Puig M., Quesneville H., Ram K.R., Rand D.,
RA   Rasmussen M.D., Reed L.K., Reenan R., Reily A., Remington K.A.,
RA   Rieger T.T., Ritchie M.G., Robin C., Rogers Y.H., Rohde C., Rozas J.,
RA   Rubenfield M.J., Ruiz A., Russo S., Salzberg S.L., Sanchez-Gracia A.,
RA   Saranga D.J., Sato H., Schaeffer S.W., Schatz M.C., Schlenke T.,
RA   Schwartz R., Segarra C., Singh R.S., Sirot L., Sirota M., Sisneros N.B.,
RA   Smith C.D., Smith T.F., Spieth J., Stage D.E., Stark A., Stephan W.,
RA   Strausberg R.L., Strempel S., Sturgill D., Sutton G., Sutton G.G., Tao W.,
RA   Teichmann S., Tobari Y.N., Tomimura Y., Tsolas J.M., Valente V.L.,
RA   Venter E., Venter J.C., Vicario S., Vieira F.G., Vilella A.J.,
RA   Villasante A., Walenz B., Wang J., Wasserman M., Watts T., Wilson D.,
RA   Wilson R.K., Wing R.A., Wolfner M.F., Wong A., Wong G.K., Wu C.I., Wu G.,
RA   Yamamoto D., Yang H.P., Yang S.P., Yorke J.A., Yoshida K., Zdobnov E.,
RA   Zhang P., Zhang Y., Zimin A.V., Baldwin J., Abdouelleil A., Abdulkadir J.,
RA   Abebe A., Abera B., Abreu J., Acer S.C., Aftuck L., Alexander A., An P.,
RA   Anderson E., Anderson S., Arachi H., Azer M., Bachantsang P., Barry A.,
RA   Bayul T., Berlin A., Bessette D., Bloom T., Blye J., Boguslavskiy L.,
RA   Bonnet C., Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S.,
RA   Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., Costello M.,
RA   D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., Dhargay N., Dooley K.,
RA   Dooley E., Doricent M., Dorje P., Dorjee K., Dupes A., Elong R., Falk J.,
RA   Farina A., Faro S., Ferguson D., Fisher S., Foley C.D., Franke A.,
RA   Friedrich D., Gadbois L., Gearin G., Gearin C.R., Giannoukos G., Goode T.,
RA   Graham J., Grandbois E., Grewal S., Gyaltsen K., Hafez N., Hagos B.,
RA   Hall J., Henson C., Hollinger A., Honan T., Huard M.D., Hughes L.,
RA   Hurhula B., Husby M.E., Kamat A., Kanga B., Kashin S., Khazanovich D.,
RA   Kisner P., Lance K., Lara M., Lee W., Lennon N., Letendre F., LeVine R.,
RA   Lipovsky A., Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y.,
RA   Lubonja R., Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C.,
RA   McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O.,
RA   Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., Mulrain L.,
RA   Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., Nicol R., Norbu C.,
RA   Norbu N., Novod N., O'Neill B., Osman S., Markiewicz E., Oyono O.L.,
RA   Patti C., Phunkhang P., Pierre F., Priest M., Raghuraman S., Rege F.,
RA   Reyes R., Rise C., Rogov P., Ross K., Ryan E., Settipalli S., Shea T.,
RA   Sherpa N., Shi L., Shih D., Sparrow T., Spaulding J., Stalker J.,
RA   Stange-Thomann N., Stavropoulos S., Stone C., Strader C., Tesfaye S.,
RA   Thomson T., Thoulutsang Y., Thoulutsang D., Topham K., Topping I.,
RA   Tsamla T., Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M.,
RA   Wilkinson J., Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D.,
RA   Zimmer A., Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J.,
RA   Chin C., Gnerre S., Grabherr M., Kleber M., Mauceli E., MacCallum I.;
RT   "Evolution of genes and genomes on the Drosophila phylogeny.";
RL   Nature 450:203-218(2007).
CC   -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC       Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CH479183; EDW36261.1; -; Genomic_DNA.
DR   RefSeq; XP_002018422.1; XM_002018386.1.
DR   AlphaFoldDB; B4GIE1; -.
DR   STRING; 7234.B4GIE1; -.
DR   EnsemblMetazoa; FBtr0183314; FBpp0181806; FBgn0155302.
DR   GeneID; 6592281; -.
DR   KEGG; dpe:6592281; -.
DR   eggNOG; KOG1141; Eukaryota.
DR   HOGENOM; CLU_003279_0_1_1; -.
DR   OMA; TNKYRYL; -.
DR   OrthoDB; 2877903at2759; -.
DR   PhylomeDB; B4GIE1; -.
DR   Proteomes; UP000008744; Unassembled WGS sequence.
DR   GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR   GO; GO:0042054; F:histone methyltransferase activity; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   CDD; cd01395; HMT_MBD; 1.
DR   CDD; cd20382; Tudor_SETDB1_rpt1; 1.
DR   CDD; cd21181; Tudor_SETDB1_rpt2; 1.
DR   Gene3D; 2.30.30.140; -; 2.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   InterPro; IPR016177; DNA-bd_dom_sf.
DR   InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR   InterPro; IPR007728; Pre-SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR047232; SETDB1/2-like_MBD.
DR   InterPro; IPR041292; Tudor_4.
DR   InterPro; IPR041291; TUDOR_5.
DR   PANTHER; PTHR46024; HISTONE-LYSINE N-METHYLTRANSFERASE EGGLESS; 1.
DR   PANTHER; PTHR46024:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE EGGLESS; 1.
DR   Pfam; PF01429; MBD; 1.
DR   Pfam; PF05033; Pre-SET; 1.
DR   Pfam; PF18358; Tudor_4; 1.
DR   Pfam; PF18359; Tudor_5; 1.
DR   SMART; SM00391; MBD; 1.
DR   SMART; SM00468; PreSET; 1.
DR   SUPFAM; SSF54171; DNA-binding domain; 1.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   PROSITE; PS50982; MBD; 1.
DR   PROSITE; PS50867; PRE_SET; 1.
PE   4: Predicted;
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008744};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Repressor {ECO:0000256|ARBA:ARBA00022491}.
FT   DOMAIN          871..937
FT                   /note="MBD"
FT                   /evidence="ECO:0000259|PROSITE:PS50982"
FT   DOMAIN          999..1071
FT                   /note="Pre-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50867"
FT   REGION          1..202
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          221..246
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          794..820
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        30..83
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        104..129
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        132..165
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        166..180
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        181..199
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1141 AA;  128667 MW;  467EB1C834B37003 CRC64;
     MASESAAMDI LDSAEASLNV APTALVKKER KVQPASDETT LEKKAKMEID AEMKDLTNDS
     PNSRKSQEKD PEALEDAKDP EADSSIELLC SPTPAEATDV DAMASKTESD NSVELLESPL
     KSPSSNDVND EELLPLEEKE KPGPAKELEP KESEPDSKES SKSEALADSS IELISSPTSD
     DSLAKEKEVE VKEEHGQQAE AQVLQEIPRK ADDSFKLQDD IAMEEDVPVP RSKAMQESKE
     TQKTTTKLEP LVDSIKDAIE GEPKDKRNNN VDSMEIDESH IESAKKSDNE ILAVDLEIGS
     PPKEAEEKSK SDFAWDGRIS YNKDCINCNC KRLKKQYVLA CVAILNFYKV PRKLKRSQYV
     CLDCYDTAVE MYEEYAGLLL AKQPLLLREF KQEQADFVTL DSSDEEEDEK TPEKPEFSKN
     VLDLIENELE DAIKKTLNKV EFSNQFNWSK TILQAKIERL AKQFEEVDLQ LAQVQGLADK
     MHCSVYNSCQ VVHKQLPPLD LHQNICPSDY KRLQQLPAAG DIVRPPIKIG ETYYAVKNKA
     IASWVSVSVM EICDTTTGGG VTVKAYKIKY QHMPYPMMKT VAAKHLAYFD PPTVRLPIGT
     RVIAFFDGTL VGGKEKGVVQ SAFYPGIIAE PLKQNNRFRY LIFYDDGYTQ YVHHSDVRLV
     CQASEKVWED VHPASRDFIQ KYVERYAVDR PMVQCTKGQS MNTESNGTWL YARVIEVDCS
     LVLMQFEADK NHTEWIYRGS LRLAPVFKET QNSLNADCAI HQMRVPRRTE PFIRYTKEME
     SSNMQVDQQI RAIARKSTSK SGSPASTAAP PTGSSSSSAV RHLNNSTIYV DDDTRPKGQV
     VHFTAKRNMP PKIFKSHKCN PGCPFPMMHR LDSYSPLSKP LLSGWERMFM KQKTKRTVVY
     RGPCGRNLRN MAEVHTYLRL TNNVLNVDNF DFTPDLRCLA EYYIESTIVK EADISKGQEK
     MAIPLVNYYD NTLPPPCEYA KQRIPTEGVN LNLDEEFLVC CDCEDDCSDK ESCACWQLTV
     TGVRYCNPKK PIEEIGYQYK RLHEGVLTGI YECNSRCKCK KNCLNRVVQH SLEMKLQVFK
     TSNRGWGLRC VNDIPKGAFV CIYAGHLLTE AKVTAGGQDA GDEYFADLEH IEVAEQLKGG
     L
//