Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

ELM2 and SANT domain-containing protein 1

Gene

ELMSAN1

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Experimental evidence at protein leveli

Functioni

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Biological processi

Transcription, Transcription regulation

Keywords - Ligandi

DNA-binding

Names & Taxonomyi

Protein namesi
Recommended name:
ELM2 and SANT domain-containing protein 1
Alternative name(s):
MIDEAS1 Publication
Gene namesi
Name:ELMSAN1
Synonyms:C14orf117, C14orf43
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 14

Organism-specific databases

HGNCiHGNC:19853. ELMSAN1.

Subcellular locationi

  • Nucleus PROSITE-ProRule annotation

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Organism-specific databases

PharmGKBiPA134877876.

Polymorphism and mutation databases

BioMutaiELMSAN1.
DMDMi118572229.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 10451045ELM2 and SANT domain-containing protein 1PRO_0000259764Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei1 – 11N-acetylmethionineCombined sources
Modified residuei461 – 4611PhosphoserineCombined sources
Cross-linki590 – 590Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO1)Combined sources
Modified residuei655 – 6551PhosphothreonineCombined sources
Modified residuei661 – 6611PhosphoserineCombined sources
Modified residuei704 – 7041PhosphothreonineCombined sources
Modified residuei709 – 7091PhosphoserineCombined sources
Modified residuei715 – 7151PhosphothreonineCombined sources
Modified residuei923 – 9231PhosphoserineCombined sources

Keywords - PTMi

Acetylation, Isopeptide bond, Phosphoprotein, Ubl conjugation

Proteomic databases

EPDiQ6PJG2.
MaxQBiQ6PJG2.
PaxDbiQ6PJG2.
PeptideAtlasiQ6PJG2.
PRIDEiQ6PJG2.

PTM databases

iPTMnetiQ6PJG2.
PhosphoSiteiQ6PJG2.

Expressioni

Gene expression databases

BgeeiENSG00000156030.
CleanExiHS_C14orf43.
ExpressionAtlasiQ6PJG2. baseline and differential.
GenevisibleiQ6PJG2. HS.

Organism-specific databases

HPAiHPA003111.

Interactioni

Subunit structurei

Interacts with DNTTIP1 (PubMed:21573134, PubMed:25653165). Identified in a histone deacetylase complex that contains DNTTIP1, HDAC1 and ELMSAN1; this complex assembles into a tetramer that contains four copies of each protein chain (PubMed:25653165).2 Publications

GO - Molecular functioni

Protein-protein interaction databases

BioGridi124872. 37 interactions.
IntActiQ6PJG2. 31 interactions.
STRINGi9606.ENSP00000286523.

Structurei

3D structure databases

ProteinModelPortaliQ6PJG2.
SMRiQ6PJG2. Positions 722-876.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini721 – 81393ELM2PROSITE-ProRule annotationAdd
BLAST
Domaini828 – 87952SANTPROSITE-ProRule annotationAdd
BLAST

Compositional bias

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Compositional biasi111 – 1155Poly-Gly
Compositional biasi210 – 29182Gln-richAdd
BLAST
Compositional biasi234 – 395162Pro-richAdd
BLAST
Compositional biasi455 – 4595Poly-Arg
Compositional biasi1021 – 10244Poly-Lys

Sequence similaritiesi

Contains 1 ELM2 domain.PROSITE-ProRule annotation
Contains 1 SANT domain.PROSITE-ProRule annotation

Phylogenomic databases

eggNOGiKOG4167. Eukaryota.
ENOG4110BKZ. LUCA.
GeneTreeiENSGT00530000063277.
HOGENOMiHOG000111818.
HOVERGENiHBG081263.
InParanoidiQ6PJG2.
OMAiPDMNPEL.
OrthoDBiEOG091G01H7.
PhylomeDBiQ6PJG2.
TreeFamiTF106431.

Family and domain databases

InterProiIPR000949. ELM2_dom.
IPR009057. Homeodomain-like.
IPR001005. SANT/Myb.
IPR017884. SANT_dom.
[Graphical view]
PfamiPF01448. ELM2. 1 hit.
[Graphical view]
SMARTiSM01189. ELM2. 1 hit.
SM00717. SANT. 1 hit.
[Graphical view]
SUPFAMiSSF46689. SSF46689. 1 hit.
PROSITEiPS51156. ELM2. 1 hit.
PS51293. SANT. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Q6PJG2-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MNLQAQPKAQ NKRKRCLFGG QEPAPKEQPP PLQPPQQSIR VKEEQYLGHE
60 70 80 90 100
GPGGAVSTSQ PVELPPPSSL ALLNSVVYGP ERTSAAMLSQ QVASVKWPNS
110 120 130 140 150
VMAPGRGPER GGGGGVSDSS WQQQPGQPPP HSTWNCHSLS LYSATKGSPH
160 170 180 190 200
PGVGVPTYYN HPEALKREKA GGPQLDRYVR PMMPQKVQLE VGRPQAPLNS
210 220 230 240 250
FHAAKKPPNQ SLPLQPFQLA FGHQVNRQVF RQGPPPPNPV AAFPPQKQQQ
260 270 280 290 300
QQQPQQQQQQ QQAALPQMPL FENFYSMPQQ PSQQPQDFGL QPAGPLGQSH
310 320 330 340 350
LAHHSMAPYP FPPNPDMNPE LRKALLQDSA PQPALPQVQI PFPRRSRRLS
360 370 380 390 400
KEGILPPSAL DGAGTQPGQE ATGNLFLHHW PLQQPPPGSL GQPHPEALGF
410 420 430 440 450
PLELRESQLL PDGERLAPNG REREAPAMGS EEGMRAVSTG DCGQVLRGGV
460 470 480 490 500
IQSTRRRRRA SQEANLLTLA QKAVELASLQ NAKDGSGSEE KRKSVLASTT
510 520 530 540 550
KCGVEFSEPS LATKRAREDS GMVPLIIPVS VPVRTVDPTE AAQAGGLDED
560 570 580 590 600
GKGPEQNPAE HKPSVIVTRR RSTRIPGTDA QAQAEDMNVK LEGEPSVRKP
610 620 630 640 650
KQRPRPEPLI IPTKAGTFIA PPVYSNITPY QSHLRSPVRL ADHPSERSFE
660 670 680 690 700
LPPYTPPPIL SPVREGSGLY FNAIISTSTI PAPPPITPKS AHRTLLRTNS
710 720 730 740 750
AEVTPPVLSV MGEATPVSIE PRINVGSRFQ AEIPLMRDRA LAAADPHKAD
760 770 780 790 800
LVWQPWEDLE SSREKQRQVE DLLTAACSSI FPGAGTNQEL ALHCLHESRG
810 820 830 840 850
DILETLNKLL LKKPLRPHNH PLATYHYTGS DQWKMAERKL FNKGIAIYKK
860 870 880 890 900
DFFLVQKLIQ TKTVAQCVEF YYTYKKQVKI GRNGTLTFGD VDTSDEKSAQ
910 920 930 940 950
EEVEVDIKTS QKFPRVPLPR RESPSEERLE PKREVKEPRK EGEEEVPEIQ
960 970 980 990 1000
EKEEQEEGRE RSRRAAAVKA TQTLQANESA SDILILRSHE SNAPGSAGGQ
1010 1020 1030 1040
ASEKPREGTG KSRRALPFSE KKKKTETFSK TQNQENTFPC KKCGR
Length:1,045
Mass (Da):114,989
Last modified:November 28, 2006 - v2
Checksum:i1FFD4CE7557D026F
GO

Sequence cautioni

The sequence AAH06511 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened.Curated

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti626 – 6261N → D in AAH09202 (PubMed:15489334).Curated
Sequence conflicti1020 – 10201E → K in AAH09202 (PubMed:15489334).Curated
Sequence conflicti1020 – 10201E → K in AAH15668 (PubMed:15489334).Curated

Natural variant

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Natural varianti86 – 861A → V.
Corresponds to variant rs35302179 [ dbSNP | Ensembl ].
VAR_050182
Natural varianti554 – 5541P → L.1 Publication
Corresponds to variant rs17782124 [ dbSNP | Ensembl ].
VAR_050183
Natural varianti895 – 8951D → N.
Corresponds to variant rs35905570 [ dbSNP | Ensembl ].
VAR_061361

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AC006146 Genomic DNA. No translation available.
BC006511 mRNA. Translation: AAH06511.1. Different initiation.
BC009202 mRNA. Translation: AAH09202.1.
BC015668 mRNA. Translation: AAH15668.1.
AK127785 mRNA. No translation available.
CCDSiCCDS9819.1.
RefSeqiNP_001036783.1. NM_001043318.2.
NP_919254.2. NM_194278.3.
UniGeneiHs.594157.

Genome annotation databases

EnsembliENST00000286523; ENSP00000286523; ENSG00000156030.
ENST00000394071; ENSP00000377634; ENSG00000156030.
ENST00000423556; ENSP00000407767; ENSG00000156030.
GeneIDi91748.
KEGGihsa:91748.
UCSCiuc001xot.4. human.

Keywords - Coding sequence diversityi

Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AC006146 Genomic DNA. No translation available.
BC006511 mRNA. Translation: AAH06511.1. Different initiation.
BC009202 mRNA. Translation: AAH09202.1.
BC015668 mRNA. Translation: AAH15668.1.
AK127785 mRNA. No translation available.
CCDSiCCDS9819.1.
RefSeqiNP_001036783.1. NM_001043318.2.
NP_919254.2. NM_194278.3.
UniGeneiHs.594157.

3D structure databases

ProteinModelPortaliQ6PJG2.
SMRiQ6PJG2. Positions 722-876.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi124872. 37 interactions.
IntActiQ6PJG2. 31 interactions.
STRINGi9606.ENSP00000286523.

PTM databases

iPTMnetiQ6PJG2.
PhosphoSiteiQ6PJG2.

Polymorphism and mutation databases

BioMutaiELMSAN1.
DMDMi118572229.

Proteomic databases

EPDiQ6PJG2.
MaxQBiQ6PJG2.
PaxDbiQ6PJG2.
PeptideAtlasiQ6PJG2.
PRIDEiQ6PJG2.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000286523; ENSP00000286523; ENSG00000156030.
ENST00000394071; ENSP00000377634; ENSG00000156030.
ENST00000423556; ENSP00000407767; ENSG00000156030.
GeneIDi91748.
KEGGihsa:91748.
UCSCiuc001xot.4. human.

Organism-specific databases

CTDi91748.
GeneCardsiELMSAN1.
H-InvDBHIX0011798.
HGNCiHGNC:19853. ELMSAN1.
HPAiHPA003111.
neXtProtiNX_Q6PJG2.
PharmGKBiPA134877876.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG4167. Eukaryota.
ENOG4110BKZ. LUCA.
GeneTreeiENSGT00530000063277.
HOGENOMiHOG000111818.
HOVERGENiHBG081263.
InParanoidiQ6PJG2.
OMAiPDMNPEL.
OrthoDBiEOG091G01H7.
PhylomeDBiQ6PJG2.
TreeFamiTF106431.

Miscellaneous databases

GenomeRNAii91748.
PROiQ6PJG2.

Gene expression databases

BgeeiENSG00000156030.
CleanExiHS_C14orf43.
ExpressionAtlasiQ6PJG2. baseline and differential.
GenevisibleiQ6PJG2. HS.

Family and domain databases

InterProiIPR000949. ELM2_dom.
IPR009057. Homeodomain-like.
IPR001005. SANT/Myb.
IPR017884. SANT_dom.
[Graphical view]
PfamiPF01448. ELM2. 1 hit.
[Graphical view]
SMARTiSM01189. ELM2. 1 hit.
SM00717. SANT. 1 hit.
[Graphical view]
SUPFAMiSSF46689. SSF46689. 1 hit.
PROSITEiPS51156. ELM2. 1 hit.
PS51293. SANT. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiEMSA1_HUMAN
AccessioniPrimary (citable) accession number: Q6PJG2
Secondary accession number(s): Q6PK13, Q6PK59, Q6ZS23
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 28, 2006
Last sequence update: November 28, 2006
Last modified: September 7, 2016
This is version 109 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 14
    Human chromosome 14: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.