ID Q7Q504_ANOGA Unreviewed; 1259 AA. AC Q7Q504; DT 15-DEC-2003, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 5. DT 27-MAR-2024, entry version 140. DE SubName: Full=AGAP004656-PA {ECO:0000313|EMBL:EAA11974.5}; GN ORFNames=AgaP_AGAP004656 {ECO:0000313|EMBL:EAA11974.5}; OS Anopheles gambiae (African malaria mosquito). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae; OC Anophelinae; Anopheles. OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA11974.5}; RN [1] {ECO:0000313|EMBL:EAA11974.5} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PEST {ECO:0000313|EMBL:EAA11974.5}; RX PubMed=12364791; DOI=10.1126/science.1076181; RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R., RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R., RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z., RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W., RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C., RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K., RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V., RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M., RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R., RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J., RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I., RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A., RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D., RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H., RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J., RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B., RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I., RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J., RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M., RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C., RA Collins F.H., Hoffman S.L.; RT "The genome sequence of the malaria mosquito Anopheles gambiae."; RL Science 298:129-149(2002). RN [2] {ECO:0000313|EMBL:EAA11974.5} RP NUCLEOTIDE SEQUENCE. RC STRAIN=PEST {ECO:0000313|EMBL:EAA11974.5}; RG The Anopheles Genome Sequencing Consortium; RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EMBL:EAA11974.5} RP NUCLEOTIDE SEQUENCE. RC STRAIN=PEST {ECO:0000313|EMBL:EAA11974.5}; RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003; RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.; RT "The Anopheles gambiae genome: an update."; RL Trends Parasitol. 20:49-52(2004). RN [4] {ECO:0000313|EMBL:EAA11974.5} RP NUCLEOTIDE SEQUENCE. RC STRAIN=PEST {ECO:0000313|EMBL:EAA11974.5}; RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5; RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F., RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.; RT "Update of the Anopheles gambiae PEST genome assembly."; RL Genome Biol. 8:R5.1-R5.13(2007). RN [5] {ECO:0000313|EMBL:EAA11974.5} RP NUCLEOTIDE SEQUENCE. RC STRAIN=PEST {ECO:0000313|EMBL:EAA11974.5}; RG VectorBase; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}. CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ CC whole genome shotgun (WGS) entry which is preliminary data. CC {ECO:0000313|EMBL:EAA11974.5}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AAAB01008961; EAA11974.5; -; Genomic_DNA. DR RefSeq; XP_316738.5; XM_316738.5. DR AlphaFoldDB; Q7Q504; -. DR STRING; 7165.Q7Q504; -. DR PaxDb; 7165-AGAP004656-PA; -. DR GeneID; 1277307; -. DR KEGG; aga:AgaP_AGAP004656; -. DR VEuPathDB; VectorBase:AGAP004656; -. DR eggNOG; KOG1081; Eukaryota. DR HOGENOM; CLU_004494_2_1_1; -. DR InParanoid; Q7Q504; -. DR OMA; DAGWPTY; -. DR OrthoDB; 950362at2759; -. DR PhylomeDB; Q7Q504; -. DR GO; GO:0000785; C:chromatin; IBA:GO_Central. DR GO; GO:0005634; C:nucleus; IBA:GO_Central. DR GO; GO:0046975; F:histone H3K36 methyltransferase activity; IBA:GO_Central. DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW. DR GO; GO:0006355; P:regulation of DNA-templated transcription; IBA:GO_Central. DR CDD; cd15565; PHD2_NSD; 1. DR CDD; cd15566; PHD3_NSD; 1. DR CDD; cd15567; PHD4_NSD; 1. DR CDD; cd20144; PWWP_NSD_rpt1; 1. DR CDD; cd05838; PWWP_NSD_rpt2; 1. DR Gene3D; 2.30.30.140; -; 2. DR Gene3D; 2.170.270.10; SET domain; 1. DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 2. DR InterPro; IPR006560; AWS_dom. DR InterPro; IPR003616; Post-SET_dom. DR InterPro; IPR000313; PWWP_dom. DR InterPro; IPR001214; SET_dom. DR InterPro; IPR046341; SET_dom_sf. DR InterPro; IPR011011; Znf_FYVE_PHD. DR InterPro; IPR001965; Znf_PHD. DR InterPro; IPR013083; Znf_RING/FYVE/PHD. DR PANTHER; PTHR22884:SF413; HISTONE-LYSINE N-METHYLTRANSFERASE CG1716-RELATED; 1. DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1. DR Pfam; PF17907; AWS; 1. DR Pfam; PF00855; PWWP; 2. DR Pfam; PF00856; SET; 1. DR SMART; SM00570; AWS; 1. DR SMART; SM00249; PHD; 4. DR SMART; SM00293; PWWP; 1. DR SMART; SM00317; SET; 1. DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1. DR SUPFAM; SSF82199; SET domain; 1. DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2. DR PROSITE; PS51215; AWS; 1. DR PROSITE; PS50868; POST_SET; 1. DR PROSITE; PS50812; PWWP; 2. DR PROSITE; PS50280; SET; 1. PE 4: Predicted; KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}; KW Repeat {ECO:0000256|ARBA:ARBA00022737}; KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691}; KW Transferase {ECO:0000256|ARBA:ARBA00022679}; KW Zinc {ECO:0000256|ARBA:ARBA00022833}; KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}. FT DOMAIN 264..312 FT /note="PWWP" FT /evidence="ECO:0000259|PROSITE:PS50812" FT DOMAIN 831..893 FT /note="PWWP" FT /evidence="ECO:0000259|PROSITE:PS50812" FT DOMAIN 970..1020 FT /note="AWS" FT /evidence="ECO:0000259|PROSITE:PS51215" FT DOMAIN 1022..1139 FT /note="SET" FT /evidence="ECO:0000259|PROSITE:PS50280" FT DOMAIN 1146..1162 FT /note="Post-SET" FT /evidence="ECO:0000259|PROSITE:PS50868" FT REGION 16..72 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 132..165 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 407..426 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 449..483 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 137..162 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 1259 AA; 142173 MW; AAEEBE4A3ABD50A0 CRC64; MFFAATAAAT NFETDLLSPT TTQKRVKSQE STPQSQKIAK QEPMSGSEMN ESGKENAFSD QIRNHPTDEN SNTVVRVPGR YLLKQLSSSL SPKMTEINID TDRRVSRYGR HQKQKDNSDY VPVDLMKYVG STPSKFKTKS ETDDPNSVNK TLETKPHSSD ELRTGSNDIS LSLTVVPLPL IDGSIMKRPE EKQLLSMEEI KIVGMEEPYY TDKGSDNNCF MNNLLERKTS SDADSGKGSS VDLAIIYRAG NLYWAAQTRK SIHWPCIVHV DPETDQITRL YGDKFLEVHV SYFGDRGRRA WIKEHSILPF EGVDEYCTNA KNMEYGKLLK SALKSERWKN ACSIAERYMT LALSERIASF DHEVKLELAR HKVMRHRMAI EGRIKNGASQ KMPALLPSDN SEVLEHWNKR DRSSSPESPE YELLPGVFPT KNNPIKKIKF SPSMQNCILN DNRSPLDRPL HSMDSEENPP GSATKANGPK AKTLGVSDLE NLNSTEYDDI LTFIRYYMFD GHTSYEVEKG LQLYVRGICN LKKHSNRGTE RTVGRQRMHA LRKSYEILGI EPSAEVSTSL KTRNAHPVIK KEPKTLEEKF IFELDKNFLM KGVPKGFVCY ICNRPNNVTK CSKCTLHLHL VCLANDPEEV VKMQELVDQK KLCCEKCSTT SIVEKTCFIC NDEIPEKSNE QIYRCVVGKC TQAYHISCLQ LFPQVRQVSA STIICPYHTC HTCVASEPRS TASMVKTTLA HCLKCPTSYH PSGNCIPAGS VLLTTTQLIC PKHTLEQIPL NVNWCFICGK GGNLICCETC PLACHPVCLQ FTPPDEKYFC EGCESGRLPL YNEIVIAKMG SFRWWPALTL PPSEIPQNML QLRHKPSDIC VKFFNTHDMS WLNRKRMYLY QREDSESLDG SQSGSSMDKR YRSAMTEASK IFKILQTRKM LGPSASLSYS KAVPVYKKIK TNRYIPPLKP PSVNRQLDGV EDSVCRCQPS DDDPCGPTSA CLNRAIMMEC SSKTCPAKES CSNQRFTKRI YPALEVRFFS DKGFGLVALE DLKSGQFVIE YVGEVINSEE FDRRVMMMQA AKETNYYFLT VEPDLTIDAG PKGNVSRFIN HSCEPNCETQ KWTIGETRVI GLFAIKDINA GEELTFNYNL ESLGNNKRVC LCGAGKCSGF IGEKYRPPNK KDIVISMKSE RSLKNGKKRV KIRRTKTTLT QKTTTNLSET KDTNNQLNDN DVVMISAQEA VIVDLVDENA SAYPLASVHI KQEKNDYNI //