ID B0X1T0_CULQU Unreviewed; 1357 AA. AC B0X1T0; DT 08-APR-2008, integrated into UniProtKB/TrEMBL. DT 08-APR-2008, sequence version 1. DT 27-MAR-2024, entry version 90. DE RecName: Full=SET domain-containing protein {ECO:0000259|PROSITE:PS50280}; GN Name=6046399 {ECO:0000313|EnsemblMetazoa:CPIJ013516-PA}; GN ORFNames=CpipJ_CPIJ013516 {ECO:0000313|EMBL:EDS38808.1}; OS Culex quinquefasciatus (Southern house mosquito) (Culex pungens). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae; OC Culicinae; Culicini; Culex; Culex. OX NCBI_TaxID=7176; RN [1] {ECO:0000313|EMBL:EDS38808.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=JHB {ECO:0000313|EMBL:EDS38808.1}; RG The Broad Institute Genome Sequencing Platform; RA Atkinson P.W., Hemingway J., Christensen B.M., Higgs S., Kodira C., RA Hannick L., Megy K., O'Leary S., Pearson M., Haas B.J., Mauceli E., RA Wortman J.R., Lee N.H., Guigo R., Stanke M., Alvarado L., Amedeo P., RA Antoine C.H., Arensburger P., Bidwell S.L., Crawford M., Camaro F., RA Devon K., Engels R., Hammond M., Howarth C., Koehrsen M., Lawson D., RA Montgomery P., Nene V., Nusbaum C., Puiu D., Romero-Severson J., RA Severson D.W., Shumway M., Sisk P., Stolte C., Zeng Q., Eisenstadt E., RA Fraser-Liggett C., Strausberg R., Galagan J., Birren B., Collins F.H.; RT "Annotation of Culex pipiens quinquefasciatus."; RL Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EnsemblMetazoa:CPIJ013516-PA} RP IDENTIFICATION. RC STRAIN=JHB {ECO:0000313|EnsemblMetazoa:CPIJ013516-PA}; RG EnsemblMetazoa; RL Submitted (FEB-2021) to UniProtKB. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; DS232269; EDS38808.1; -; Genomic_DNA. DR RefSeq; XP_001863602.1; XM_001863567.1. DR STRING; 7176.B0X1T0; -. DR EnsemblMetazoa; CPIJ013516-RA; CPIJ013516-PA; CPIJ013516. DR KEGG; cqu:CpipJ_CPIJ013516; -. DR VEuPathDB; VectorBase:CPIJ013516; -. DR VEuPathDB; VectorBase:CQUJHB007311; -. DR eggNOG; KOG1080; Eukaryota. DR HOGENOM; CLU_001226_3_0_1; -. DR InParanoid; B0X1T0; -. DR OMA; RMIEMTA; -. DR OrthoDB; 950362at2759; -. DR Proteomes; UP000002320; Unassembled WGS sequence. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:InterPro. DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW. DR Gene3D; 2.170.270.10; SET domain; 1. DR InterPro; IPR024657; COMPASS_Set1_N-SET. DR InterPro; IPR044570; Set1-like. DR InterPro; IPR001214; SET_dom. DR InterPro; IPR046341; SET_dom_sf. DR PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1. DR PANTHER; PTHR45814:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1. DR Pfam; PF11764; N-SET; 1. DR Pfam; PF00856; SET; 1. DR SMART; SM01291; N-SET; 1. DR SMART; SM00317; SET; 1. DR SUPFAM; SSF82199; SET domain; 1. DR PROSITE; PS50280; SET; 1. PE 4: Predicted; KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603}; KW Reference proteome {ECO:0000313|Proteomes:UP000002320}; KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691}; KW Transferase {ECO:0000256|ARBA:ARBA00022679}. FT DOMAIN 1252..1357 FT /note="SET" FT /evidence="ECO:0000259|PROSITE:PS50280" FT REGION 1..84 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 99..266 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 304..340 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 444..723 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 739..870 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 882..1041 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 8..28 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 49..68 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 120..180 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 220..261 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 310..340 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 455..486 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 487..536 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 543..576 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 577..592 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 597..622 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 678..692 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 696..723 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 739..753 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 768..787 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 796..810 FT /note="Pro residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 885..899 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 918..932 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 939..967 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 999..1016 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 1027..1041 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 1357 AA; 148337 MW; 7B19FC449F482518 CRC64; MATAGSSSWA PPPPPEEKPK PPPPPASAKS KQSQAALDDC DMWDDGESSK PPPPKPPSTP PPIPTGPKDD PTVAAGDDDN SGSALDLDTR IALMFKDKTF GAAPFLQLSD GEEEDKREEG EMADDSRDGS IKAELKQEDE SSPLPDVDVK VKKESVKLEP PKEEGASDIS SSEDDILAKE SPPPKPPSKP YMEDIIKRDN DQMSLSSLSS NDGKVDDTAA PPLPPEPAPD APPPPESVAP YVYPPGIGPG MGYPPAPPGT DPSYYYSQSY SQSSYEQYQS GYYQNSYMQA PYVPGLTGAY PYQTYGYSGK RDSYDEDDRY RSSYSSSERN RSEREQRKKH NRYEEAITAV IDRVTAELKQ ILKKDFNKKM IENTAYKKYE AWWDEEQEKS KGKDKVGVLS EITPLATAKV DKAPDINQLL NQTYDNLDSN SSYIGLGLRA TIPKMPSFRR IRKQPSPVPQ DEDSRRSDQE DMVHGSDSEK ETDSVSRPKT PPPSGSSADT GRGAGVSRTL SSSSVPRSEK RKASVSSFFT SSSEEDTSGS DSDDSTDSGS GGLSDVEMSY ASKKQQQQPS QQSGSGSRTE KRDKRIYSDS DSDEEASEQQ QPSTTSKFSL PSSSAGSRNK TKIYSDSDSD SEVSSKPPPR EVLPPVSTEK ARSKSPEPAT PTVLPLEQLC EALSPDVPDE SPPTPQQPPR TPGRESPKKS TYEFDRMYSD SEEEREYQEK RRRKAEYLEQ IEREFQEEQL RLAQEREEAA KAAAEAPAPE PVAKKSPVKS QAKNNRKNAT PSVNLLEKAP SPSDPITPLT SQPPPTPGAG LLEDPMVAAA ASVPQPASKK KKAEKAPKSK AKGGKKAKET NGVQAPVPEL PPIAPVEPLP MVQPVLPPRV VALGGSATPQ QQLSSSDDFF SADEEAAAAR RAAKASPASS DGGSSQASQV ALDHCYSLPP SASPSSSSPH PQSDSSVPVT VANKYAPTSS EALAHDHGYT NNDAAGAIGE MPTCPPAQVP MEVQQQQPTA TEVAQVVPAQ RSAGRPKKDP NAPKAKYKRK QDKAAAAAAA TLQTFASAAA TSSQQQALLP FAAGQQQLST FMPVPKYHER DIRTQMSILY DFLTRGIDAE DVQYIRQSYE LLLGDDTNNY WLNATHWVDH CTTDRSFLPP PPKKRKKDKE TVWDIKQHST GSARTQGFYK IDPREKAKYK YHHLRGTAAE NHLKNIETAK AVTKMQGLSR EARSNQRRLL TAFGASTESE LLKFNQLKFR KKQLKFAKSA IHDWGLFAME PIAADEMVIE YVGQMVRPSV ADLRETKYEA IGIGSSYLFR IDMETIIDAT KCGNLARFIN HSCNVSTFYG SRLFIDQIIV DNIRGYR //