ID   A2D7F8_TRIV3            Unreviewed;       456 AA.
AC   A2D7F8;
DT   20-FEB-2007, integrated into UniProtKB/TrEMBL.
DT   20-FEB-2007, sequence version 1.
DT   27-MAR-2024, entry version 86.
DE   SubName: Full=Pre-SET motif family protein {ECO:0000313|EMBL:EAY23675.1};
GN   ORFNames=TVAG_120120 {ECO:0000313|EMBL:EAY23675.1};
OS   Trichomonas vaginalis (strain ATCC PRA-98 / G3).
OC   Eukaryota; Metamonada; Parabasalia; Trichomonadida; Trichomonadidae;
OC   Trichomonas.
OX   NCBI_TaxID=412133 {ECO:0000313|EMBL:EAY23675.1, ECO:0000313|Proteomes:UP000001542};
RN   [1] {ECO:0000313|EMBL:EAY23675.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=G3 {ECO:0000313|EMBL:EAY23675.1};
RA   Amadeo P., Zhao Q., Wortman J., Fraser-Liggett C., Carlton J.;
RL   Submitted (OCT-2006) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:EAY23675.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=G3 {ECO:0000313|EMBL:EAY23675.1};
RX   PubMed=17218520; DOI=10.1126/science.1132894;
RA   Carlton J.M., Hirt R.P., Silva J.C., Delcher A.L., Schatz M., Zhao Q.,
RA   Wortman J.R., Bidwell S.L., Alsmark U.C.M., Besteiro S.,
RA   Sicheritz-Ponten T., Noel C.J., Dacks J.B., Foster P.G., Simillion C.,
RA   Van de Peer Y., Miranda-Saavedra D., Barton G.J., Westrop G.D., Mueller S.,
RA   Dessi D., Fiori P.L., Ren Q., Paulsen I., Zhang H., Bastida-Corcuera F.D.,
RA   Simoes-Barbosa A., Brown M.T., Hayes R.D., Mukherjee M., Okumura C.Y.,
RA   Schneider R., Smith A.J., Vanacova S., Villalvazo M., Haas B.J., Pertea M.,
RA   Feldblyum T.V., Utterback T.R., Shu C.L., Osoegawa K., de Jong P.J.,
RA   Hrdy I., Horvathova L., Zubacova Z., Dolezal P., Malik S.B.,
RA   Logsdon J.M. Jr., Henze K., Gupta A., Wang C.C., Dunne R.L., Upcroft J.A.,
RA   Upcroft P., White O., Salzberg S.L., Tang P., Chiu C.-H., Lee Y.-S.,
RA   Embley T.M., Coombs G.H., Mottram J.C., Tachezy J., Fraser-Liggett C.M.,
RA   Johnson P.J.;
RT   "Draft genome sequence of the sexually transmitted pathogen Trichomonas
RT   vaginalis.";
RL   Science 315:207-212(2007).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DS113177; EAY23675.1; -; Genomic_DNA.
DR   RefSeq; XP_001276923.1; XM_001276922.1.
DR   AlphaFoldDB; A2D7F8; -.
DR   STRING; 5722.A2D7F8; -.
DR   GeneID; 4720641; -.
DR   KEGG; tva:TVAG_2v0993030; -.
DR   VEuPathDB; TrichDB:TVAG_120120; -.
DR   VEuPathDB; TrichDB:TVAGG3_0993030; -.
DR   eggNOG; KOG1082; Eukaryota.
DR   InParanoid; A2D7F8; -.
DR   Proteomes; UP000001542; Unassembled WGS sequence.
DR   GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR   GO; GO:0005634; C:nucleus; IEA:InterPro.
DR   GO; GO:0003690; F:double-stranded DNA binding; IBA:GO_Central.
DR   GO; GO:0042054; F:histone methyltransferase activity; IBA:GO_Central.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   CDD; cd10538; SET_SETDB-like; 1.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR007728; Pre-SET_dom.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   PANTHER; PTHR45660; HISTONE-LYSINE N-METHYLTRANSFERASE SETMAR; 1.
DR   PANTHER; PTHR45660:SF13; HISTONE-LYSINE N-METHYLTRANSFERASE SETMAR; 1.
DR   Pfam; PF05033; Pre-SET; 1.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50867; PRE_SET; 1.
DR   PROSITE; PS50280; SET; 1.
PE   4: Predicted;
KW   Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001542};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT   DOMAIN          238..303
FT                   /note="Pre-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50867"
FT   DOMAIN          304..433
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
FT   DOMAIN          440..456
FT                   /note="Post-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50868"
SQ   SEQUENCE   456 AA;  51749 MW;  D78AD76B14519355 CRC64;
     MIPDFPLVFP SVPECFEKCK TLTAFTKCIK PLPPEIQSFL QLKLCSPYFT RLRIKAGESQ
     EIPITAPLQE VFSCFEICHP EYHANLAAGI AIMTGIPIDF IIDEMGLEID QKKLIIMRAL
     CKSTLGKTIA AVFTDDDRIE PIDSREYIPA TYNWTLEGNP FQDIPEESRD FIKECTKKQM
     PPDFNLSFKF TQDLSNGFNK QHGIVSVPCI NEDDDNWPRK MKWIANLEFP DMISSHYVGC
     DCHQHDCLTC HAIFNGQPIM KYTEAGRLDL ESFRSNYKPI IIECNSSCSC DSETCKNRVV
     DRKAKIHLLV CRCISKGGWG VRALEFIPKG TFICEYLGDL ITDPDKAESQ GKIYDKSGES
     YLFDLDGYGI NDKEMLTVDP KVTGNVSKFI NHNCDPNIIT IIIGTVNSEQ YHRIGFFALR
     DIYPFEDLGF HYGYKMHKID QKACNCGSLT CGGRLT
//