ID Q7PH82_ANOGA Unreviewed; 855 AA. AC Q7PH82; DT 15-DEC-2003, integrated into UniProtKB/TrEMBL. DT 27-JUL-2011, sequence version 4. DT 27-MAR-2024, entry version 142. DE SubName: Full=AGAP003597-PA {ECO:0000313|EMBL:EAA44650.4}; GN Name=4577143 {ECO:0000313|EnsemblMetazoa:AGAP003597-PA}; GN ORFNames=AgaP_AGAP003597 {ECO:0000313|EMBL:EAA44650.4}; OS Anopheles gambiae (African malaria mosquito). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae; OC Anophelinae; Anopheles. OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA44650.4}; RN [1] {ECO:0000313|EMBL:EAA44650.4, ECO:0000313|Proteomes:UP000007062} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=PEST {ECO:0000313|EMBL:EAA44650.4, RC ECO:0000313|Proteomes:UP000007062}; RX PubMed=12364791; DOI=10.1126/science.1076181; RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R., RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R., RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z., RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W., RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C., RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K., RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V., RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M., RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R., RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J., RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I., RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A., RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D., RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H., RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J., RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B., RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I., RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J., RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M., RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C., RA Collins F.H., Hoffman S.L.; RT "The genome sequence of the malaria mosquito Anopheles gambiae."; RL Science 298:129-149(2002). RN [2] {ECO:0000313|EMBL:EAA44650.4} RP NUCLEOTIDE SEQUENCE. RC STRAIN=PEST {ECO:0000313|EMBL:EAA44650.4}; RG The Anopheles Genome Sequencing Consortium; RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EMBL:EAA44650.4} RP NUCLEOTIDE SEQUENCE. RC STRAIN=PEST {ECO:0000313|EMBL:EAA44650.4}; RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003; RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.; RT "The Anopheles gambiae genome: an update."; RL Trends Parasitol. 20:49-52(2004). RN [4] {ECO:0000313|EMBL:EAA44650.4} RP NUCLEOTIDE SEQUENCE. RC STRAIN=PEST {ECO:0000313|EMBL:EAA44650.4}; RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5; RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F., RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.; RT "Update of the Anopheles gambiae PEST genome assembly."; RL Genome Biol. 8:R5.1-R5.13(2007). RN [5] {ECO:0000313|EMBL:EAA44650.4} RP NUCLEOTIDE SEQUENCE. RC STRAIN=PEST {ECO:0000313|EMBL:EAA44650.4}; RG VectorBase; RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases. RN [6] {ECO:0000313|EnsemblMetazoa:AGAP003597-PA} RP IDENTIFICATION. RC STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP003597-PA}; RG EnsemblMetazoa; RL Submitted (MAY-2020) to UniProtKB. CC -!- SUBCELLULAR LOCATION: Chromosome, centromere CC {ECO:0000256|ARBA:ARBA00004584}. Nucleus CC {ECO:0000256|ARBA:ARBA00004123}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AAAB01008888; EAA44650.4; -; Genomic_DNA. DR RefSeq; XP_313355.4; XM_313355.4. DR AlphaFoldDB; Q7PH82; -. DR STRING; 7165.Q7PH82; -. DR PaxDb; 7165-AGAP003597-PA; -. DR EnsemblMetazoa; AGAP003597-RA; AGAP003597-PA; AGAP003597. DR GeneID; 4577143; -. DR VEuPathDB; VectorBase:AGAP003597; -. DR eggNOG; KOG1082; Eukaryota. DR HOGENOM; CLU_020840_8_1_1; -. DR InParanoid; Q7PH82; -. DR OrthoDB; 1512042at2759; -. DR Proteomes; UP000007062; Chromosome 2R. DR GO; GO:0000775; C:chromosome, centromeric region; IEA:UniProtKB-SubCell. DR GO; GO:0005634; C:nucleus; IBA:GO_Central. DR GO; GO:0046974; F:histone H3K9 methyltransferase activity; IBA:GO_Central. DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro. DR CDD; cd00024; CD_CSD; 1. DR CDD; cd10542; SET_SUV39H; 1. DR Gene3D; 2.40.50.40; -; 1. DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1. DR Gene3D; 2.170.270.10; SET domain; 1. DR InterPro; IPR016197; Chromo-like_dom_sf. DR InterPro; IPR000953; Chromo/chromo_shadow_dom. DR InterPro; IPR023780; Chromo_domain. DR InterPro; IPR023779; Chromodomain_CS. DR InterPro; IPR027417; P-loop_NTPase. DR InterPro; IPR003616; Post-SET_dom. DR InterPro; IPR007728; Pre-SET_dom. DR InterPro; IPR001214; SET_dom. DR InterPro; IPR046341; SET_dom_sf. DR PANTHER; PTHR46223; HISTONE-LYSINE N-METHYLTRANSFERASE SUV39H; 1. DR PANTHER; PTHR46223:SF4; HISTONE-LYSINE N-METHYLTRANSFERASE-RELATED; 1. DR Pfam; PF00385; Chromo; 1. DR Pfam; PF05033; Pre-SET; 1. DR Pfam; PF00856; SET; 1. DR SMART; SM00298; CHROMO; 1. DR SMART; SM00468; PreSET; 1. DR SMART; SM00317; SET; 1. DR SUPFAM; SSF54160; Chromo domain-like; 1. DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1. DR SUPFAM; SSF82199; SET domain; 1. DR PROSITE; PS00598; CHROMO_1; 1. DR PROSITE; PS50013; CHROMO_2; 1. DR PROSITE; PS50868; POST_SET; 1. DR PROSITE; PS50867; PRE_SET; 1. DR PROSITE; PS50280; SET; 1. PE 4: Predicted; KW Centromere {ECO:0000256|ARBA:ARBA00023328}; KW Chromosome {ECO:0000256|ARBA:ARBA00022454}; KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}; KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603}; KW Reference proteome {ECO:0000313|Proteomes:UP000007062}; KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691}; KW Transferase {ECO:0000256|ARBA:ARBA00022679}. FT DOMAIN 405..451 FT /note="Chromo" FT /evidence="ECO:0000259|PROSITE:PS50013" FT DOMAIN 598..657 FT /note="Pre-SET" FT /evidence="ECO:0000259|PROSITE:PS50867" FT DOMAIN 660..785 FT /note="SET" FT /evidence="ECO:0000259|PROSITE:PS50280" FT DOMAIN 839..855 FT /note="Post-SET" FT /evidence="ECO:0000259|PROSITE:PS50868" FT REGION 1..23 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 114..152 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 168..207 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 261..401 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 793..832 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 134..152 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 261..302 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 316..348 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 365..401 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 855 AA; 94553 MW; 91C19D08E2945091 CRC64; MSSGDQQMST ATTGQPHLQQ QDLSKLDITK LTPLSPEVIS RQATINIGTI GHVAHGKSTV VKAISGVQTV RFKNELERNI TIKLENLSKS FIETLQSDAK AMEAYLHKMH EQYKSPYNHG TSHSNKHTPS RKRKLSPNGL QTSPTNNGET QRMKQAKIMD FFHMSPPERI TQPSLHASRE RARRKSCPAA QGNFSGTHIP KPNGFPKRMS CSIDKVKKEV DESEDGSRFD VPTDTRLATI NQVQADLSST PDKQNIKVEC SDETPNAMSS SSQVENGTET PKTTASVKVE NGTDTPVTPV ARDNSNGLLK VMSAAKPSPK TSTPKTTASI KVENGTVTPA TPVSRDSSKG LLNVKLAAKP SPKTPATGER NSSLSAGKST PKRKSTGGGS NSKQIKKSAT TSKEYTVENI EDIQLVGNSP FFLVKWLGYT SKDNTWEPLN NVNSCAMLDS FLSAQMSLLE EWVEPLQEKI RSSPEYLESL ERQGSKTYQE ILLEHKEYDW DQLRADLIIM AKLWMNRGRN KLIWERITRL MCRELSYAKR CEQLEELRRF EKHINDHEPT LRVVVENEHD LDAPPNNFTY LQGNIPAEGI SIPNDPPVGC ECNPCTGRST CCGKLSEGRF AYSVKKRLLL QPGAPIFECN KKCSCGPDCL NRVVQNGGKC NLTLFKTPNG RGWGVRTNTV IYEGQYISEY CGEVISYDEA EKRGREYDAV GRTYLFDLDF NGTDNPYTLD AARYGNVTRF FNHSCDPNCG IWSVWIDCLD PYLPRLAFFA QRRIEIGEEL TFNYHAQVSP NNVSINGGSG GGGPPMDGDS STGGVVEEKP AENGDKATTA NGSVRNTKGV TECLCGSANC RKFIF //