UniProtKB - A0A5F4WH18 (A0A5F4WH18_CALJA)
Protein
Submitted name:
Uncharacterized protein
Gene
EHMT1
Organism
Callithrix jacchus (White-tufted-ear marmoset)
Status
Function
GO - Molecular function
- C2H2 zinc finger domain binding Source: Ensembl
- histone methyltransferase activity (H3-K27 specific) Source: Ensembl
- histone methyltransferase activity (H3-K9 specific) Source: Ensembl
- p53 binding Source: Ensembl
- transcription corepressor binding Source: Ensembl
- zinc ion binding Source: InterPro
GO - Biological process
- chromatin organization Source: Ensembl
- DNA methylation Source: Ensembl
- negative regulation of transcription by RNA polymerase II Source: Ensembl
- peptidyl-lysine dimethylation Source: Ensembl
- peptidyl-lysine monomethylation Source: Ensembl
- positive regulation of cold-induced thermogenesis Source: Ensembl
- regulation of embryonic development Source: Ensembl
Names & Taxonomy
Protein names | Submitted name: Uncharacterized protein (Imported) |
Gene names | Name: EHMT1 (Imported) |
Organism | Callithrix jacchus (White-tufted-ear marmoset) (Imported) |
Taxonomic identifier | 9483 [NCBI] |
Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Platyrrhini › Cebidae › Callitrichinae › Callithrix › Callithrix |
Proteomes |
Subcellular location
Nucleus
- nuclear body Source: Ensembl
Keywords - Cellular component
Nucleus (ARBA annotation)
Interaction
GO - Molecular function
- C2H2 zinc finger domain binding Source: Ensembl
- p53 binding Source: Ensembl
- transcription corepressor binding Source: Ensembl
Structure
Family & Domains
Domains and Repeats
Feature key | Position(s) | Description | Evidence | Length |
---|---|---|---|---|
Repeat | 984 – 1016 | ANK | PROSITE-ProRule annotation | 33 |
Repeat | 1017 – 1049 | ANK | PROSITE-ProRule annotation | 33 |
Repeat | 1050 – 1074 | ANK | PROSITE-ProRule annotation | 25 |
Repeat | 1084 – 1116 | ANK | PROSITE-ProRule annotation | 33 |
Repeat | 1150 – 1182 | ANK | PROSITE-ProRule annotation | 33 |
Domain | 1272 – 1335 | Pre-SET | InterPro annotation | 64 |
Domain | 1338 – 1455 | SET | InterPro annotation | 118 |
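The Length column in the feature tables is consistent with 1-based, inclusive positions, so a length can be recomputed as end minus start plus one. The following is a minimal sketch of that check for the rows above; the function name `feature_length` and the tuple layout are illustrative assumptions, not part of any UniProt tooling.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    if start < 1 or end < start:
        raise ValueError(f"invalid feature range: {start}-{end}")
    return end - start + 1


# (feature key, description, start, end, length as listed in the table above)
FEATURES = [
    ("Repeat", "ANK", 984, 1016, 33),
    ("Repeat", "ANK", 1017, 1049, 33),
    ("Repeat", "ANK", 1050, 1074, 25),
    ("Repeat", "ANK", 1084, 1116, 33),
    ("Repeat", "ANK", 1150, 1182, 33),
    ("Domain", "Pre-SET", 1272, 1335, 64),
    ("Domain", "SET", 1338, 1455, 118),
]

# Collect any rows whose listed length disagrees with the recomputed one.
mismatches = [
    row for row in FEATURES if feature_length(row[2], row[3]) != row[4]
]
print(mismatches)
```

An empty list means every listed length agrees with its position range.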
Region
Feature key | Position(s) | Description | Evidence | Length |
---|---|---|---|---|
Region | 239 – 296 | Disordered | Sequence analysis | 58 |
Region | 353 – 445 | Disordered | Sequence analysis | 93 |
Region | 550 – 692 | Disordered | Sequence analysis | 143 |
Region | 856 – 929 | Disordered | Sequence analysis | 74 |
Region | 1486 – 1510 | Disordered | Sequence analysis | 25 |
Compositional bias
Feature key | Position(s) | Description | Evidence | Length |
---|---|---|---|---|
Compositional bias | 268 – 296 | Polar residues | Sequence analysis | 29 |
Compositional bias | 394 – 415 | Basic and acidic residues | Sequence analysis | 22 |
Compositional bias | 427 – 445 | Basic and acidic residues | Sequence analysis | 19 |
Compositional bias | 556 – 572 | Acidic residues | Sequence analysis | 17 |
Compositional bias | 588 – 606 | Basic and acidic residues | Sequence analysis | 19 |
Compositional bias | 607 – 626 | Acidic residues | Sequence analysis | 20 |
Compositional bias | 670 – 689 | Polar residues | Sequence analysis | 20 |
Sequence similarities
Belongs to the arrestin family. (ARBA annotation)
Keywords - Domain
ANK repeat (PROSITE-ProRule annotation)
Phylogenomic databases
GeneTree | ENSGT00940000156002 |
Family and domain databases
Gene3D | 1.25.40.20, 1 hit; 2.60.40.640, 1 hit |
InterPro | IPR002110, Ankyrin_rpt; IPR036770, Ankyrin_rpt-contain_sf; IPR014752, Arrestin-like_C; IPR011021, Arrestin-like_N; IPR038035, EHMT1; IPR043550, EHMT1/EHMT2; IPR014756, Ig_E-set; IPR007728, Pre-SET_dom; IPR001214, SET_dom |
PANTHER | PTHR46307, PTHR46307, 1 hit; PTHR46307:SF2, PTHR46307:SF2, 1 hit |
Pfam | PF12796, Ank_2, 3 hits; PF00339, Arrestin_N, 1 hit; PF05033, Pre-SET, 1 hit; PF00856, SET, 1 hit |
PRINTS | PR01415, ANKYRIN |
SMART | SM00248, ANK, 7 hits; SM00468, PreSET, 1 hit; SM00317, SET, 1 hit |
SUPFAM | SSF48403, SSF48403, 1 hit; SSF81296, SSF81296, 1 hit |
PROSITE | PS50088, ANK_REPEAT, 5 hits; PS50867, PRE_SET, 1 hit; PS50280, SET, 1 hit |
Sequence (1+)
Sequence status: Complete.
This entry has 1 described isoform and 9 potential isoforms that are computationally mapped.
A0A5F4WH18-1 [UniParc]
10 20 30 40 50
MERDIGGRDR TRLGGTGRAW MERDIAGRGR AGLGGAGRAW MGRGIRGAGP
60 70 80 90 100
GLRALSAGRR RGRHCAAGRR GRGMGRVQLF EISLSHGRVV YSPGEPLAGT
110 120 130 140 150
VRVRLGAPLP FRAIRVACTG SCGVSSKAND AAWVVEESYF NSSLSLADKG
160 170 180 190 200
SLPAGEHSFP FQFLLPATAP TSFEGPFGKI VHQVRAAIHT PRFSKDHKCS
210 220 230 240 250
LVFYILSPLN LNSIPDIEAV PARGEPQQDC CVKTELLGEE TPMAADEGST
260 270 280 290 300
EKQAGEAHMA ADSETNGSCE HGDASSHANA AEHTQESIRV SPQDGTSTLT
310 320 330 340 350
RIAENGLSER DSEAGKQNHV TTDDFVQTSI VGSNGYILNK PALQTQPLRT
360 370 380 390 400
TNTLASSLPG HAAKTLPGGA GKGRTPSAFP QTPAAPPATL GEGSADTEDR
410 420 430 440 450
KPPAPGTDVK VHRARKTMPK PAVGLHAASK DPREVREARD HKEPKEEINK
460 470 480 490 500
NISDFGRQQL LPPFPSLHPS LPQNQCYMAT TKSQTACLPF VLAAAVSRKK
510 520 530 540 550
KRRMGTYSLV PKKKTKVLKQ RTVIEMFKSI THSTVGSKGE KDLGASSLHV
560 570 580 590 600
NGESLEMDSD EDDSEELEED DSHGVEQAAA FPTEDSRTSK ESMSEVDRTQ
610 620 630 640 650
KMDGESEEEQ ESADTGEEEE GGDESDLSSE SSIKKKFLKR KGKTDSPWIK
660 670 680 690 700
PARKRRRRSR KKPSGAPLGS EPYKSSSGST EQTAPGDSTG YMEVSLDSLD
710 720 730 740 750
LRVKGILSSQ AEGLANGPDV LETDGLQEVP LCSCRMETPK SREITTLANN
760 770 780 790 800
QCMATESVDH ELGRCTNSVV KYELMRPSNK APLLVLCEDH RGRMVKHQCC
810 820 830 840 850
PGCGYFCTAG NFMECQPESS ISHRFHKDCA SRVNNASYCP HCGEESSKAK
860 870 880 890 900
EVTIAKADTT STVTPVPGQE KGSALEGRAD TTTGSAAGPL LSEDDKLQGP
910 920 930 940 950
ASHAPEGFDP TGPAGLGRPT PGLSQGPGKE TLESALIALD SEKPKKLRFH
960 970 980 990 1000
PKQLYFSARQ GELQKVLLML VDGIDPNFKM EHQNKRSPLH AAAEAGHVDI
1010 1020 1030 1040 1050
CHMLVQAGAN IDTCSEDQRT PLMEAAENNH LEAVKYLIKA GALVGPKDAE
1060 1070 1080 1090 1100
GSTCLHLAAK KGHYEVVQYL LSNGQMDVNC QDDGGWTPMI WATEYKHVDL
1110 1120 1130 1140 1150
VKLLLSKGSD INIRDNEENI CLHWAAFSGC VDIAEILLAA KCDLHAVNIH
1160 1170 1180 1190 1200
GDSPLHIAAR ENRYDCVVLF LSRDSDVTLK NKEGETPLQC ASLNSQVWSA
1210 1220 1230 1240 1250
LQMSKALQDS APDRPVPVER TVSRDIARGY ERIPIPCVNA VDGEPCPSNY
1260 1270 1280 1290 1300
KYVSQNCVTS PMNIDRNITH LQYCVCIDDC SSSNCMCGQL SMRCWYDKDG
1310 1320 1330 1340 1350
RLLPEFNMAE PPLIFECNHA CSCWRNCRNR VVQNGLRARL QLYRTQDMGW
1360 1370 1380 1390 1400
GVRSLQDIPL GTFVCEYVGE LISDSEADVR EEDSYLFDLD NKDGEVYCID
1410 1420 1430 1440 1450
ARFYGNVSRF INHHCEPNLV PVRVFMAHQD LRFPRIAFFS TRLIEAGEQL
1460 1470 1480 1490 1500
GFDYGERFWD IKGKLFSCRC GSPKCRHSSA ALAQRQASAA QEAQEDGLPD
1510
TSSAAAADPL
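The sequence above is displayed in groups of 10 residues, with a ruler line of position numbers above each row. A minimal sketch of turning that display back into one contiguous string follows; the function name `parse_sequence_block` is a hypothetical helper, and the excerpt holds only the first two rows of the block, so the full 1,510-position sequence would need the whole block as input.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join 10-residue groups into one string, skipping numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines contain only position numbers (10 20 30 ...); skip them
        # along with blank lines.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    sequence = "".join(residues)
    # Guard against stray characters left over from page extraction.
    if not re.fullmatch(r"[A-Z]+", sequence):
        raise ValueError("unexpected characters in sequence block")
    return sequence


# First two rows of the block above, copied verbatim.
EXCERPT = """\
10 20 30 40 50
MERDIGGRDR TRLGGTGRAW MERDIAGRGR AGLGGAGRAW MGRGIRGAGP
60 70 80 90 100
GLRALSAGRR RGRHCAAGRR GRGMGRVQLF EISLSHGRVV YSPGEPLAGT
"""

sequence = parse_sequence_block(EXCERPT)
print(len(sequence))
```

Comparing the resulting length against the last ruler value is a quick way to detect rows lost during copying.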
Computationally mapped potential isoform sequences
There are 9 potential isoforms mapped to this entry.
Entry | Entry name | Protein name | Gene name | Length |
---|---|---|---|---|
A0A5F4WE25 | A0A5F4WE25_CALJA | Uncharacterized protein | EHMT1 | 1,484 |
F6QW19 | F6QW19_CALJA | Uncharacterized protein | EHMT1 | 1,298 |
F6R632 | F6R632_CALJA | Uncharacterized protein | EHMT1 | 1,303 |
A0A2R8MNX6 | A0A2R8MNX6_CALJA | Uncharacterized protein | EHMT1 | 1,404 |
A0A5F4VWY3 | A0A5F4VWY3_CALJA | Uncharacterized protein | EHMT1 | 1,339 |
A0A5F4VWY5 | A0A5F4VWY5_CALJA | Uncharacterized protein | EHMT1 | 1,349 |
A0A5F4WL39 | A0A5F4WL39_CALJA | Uncharacterized protein | EHMT1 | 1,353 |
A0A5F4WMN9 | A0A5F4WMN9_CALJA | Uncharacterized protein | EHMT1 | 1,423 |
F6R5W1 | F6R5W1_CALJA | Uncharacterized protein | EHMT1 | 810 |
Genome annotation databases
Ensembl | ENSCJAT00000110258; ENSCJAP00000077002; ENSCJAG00000013446 |
Similar proteins
Cross-references
3D structure databases
ModBase | Search... |
SWISS-MODEL-Workspace | Submit a new modelling project... |
Family and domain databases
MobiDB | Search... |
Entry information
Entry name | A0A5F4WH18_CALJA |
Accession | Primary (citable) accession number: A0A5F4WH18 |
Entry history | Integrated into UniProtKB/TrEMBL: December 11, 2019 |
Last sequence update: | December 11, 2019 |
Last modified: | January 19, 2022 |
This is version 9 of the entry and version 1 of the sequence. |
Entry status | Unreviewed (UniProtKB/TrEMBL) |