Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

SUMO-interacting motif-containing protein 1

Gene

SIMC1

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at protein leveli

Functioni

GO - Molecular functioni

  • SUMO polymer binding Source: UniProtKB
Complete GO annotation...

Enzyme and pathway databases

BioCyciZFISH:ENSG00000170085-MONOMER.

Names & Taxonomyi

Protein namesi
Recommended name:
SUMO-interacting motif-containing protein 1
Gene namesi
Name:SIMC1
Synonyms:C5orf25
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 5

Organism-specific databases

HGNCiHGNC:24779. SIMC1.

Pathology & Biotechi

Organism-specific databases

OpenTargetsiENSG00000170085.
PharmGKBiPA144596506.

Polymorphism and mutation databases

BioMutaiSIMC1.
DMDMi449081288.

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00002505811 – 872SUMO-interacting motif-containing protein 1Add BLAST872

Proteomic databases

EPDiQ8NDZ2.
MaxQBiQ8NDZ2.
PaxDbiQ8NDZ2.
PeptideAtlasiQ8NDZ2.
PRIDEiQ8NDZ2.

PTM databases

iPTMnetiQ8NDZ2.
PhosphoSitePlusiQ8NDZ2.

Expressioni

Gene expression databases

BgeeiENSG00000170085.
CleanExiHS_C5orf25.
ExpressionAtlasiQ8NDZ2. baseline and differential.
GenevisibleiQ8NDZ2. HS.

Organism-specific databases

HPAiHPA037889.
HPA037890.

Interactioni

Subunit structurei

Interacts (via SIM domains) with SUMO1 and SUMO2.1 Publication

GO - Molecular functioni

  • SUMO polymer binding Source: UniProtKB

Protein-protein interaction databases

BioGridi131981. 17 interactors.
IntActiQ8NDZ2. 8 interactors.
MINTiMINT-1394569.
STRINGi9606.ENSP00000342075.

Structurei

3D structure databases

ProteinModelPortaliQ8NDZ2.
SMRiQ8NDZ2.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Motif

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Motifi26 – 30SUMO interaction motif 1 (SIM); mediates the binding to polysumoylated substrates5
Motifi45 – 49SUMO interaction motif 2 (SIM); mediates the binding to polysumoylated substrates5

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi129 – 180Ser-richAdd BLAST52
Compositional biasi185 – 352Pro-richAdd BLAST168

Phylogenomic databases

eggNOGiENOG410IH7J. Eukaryota.
ENOG410XQXC. LUCA.
GeneTreeiENSGT00390000013414.
HOGENOMiHOG000070068.
HOVERGENiHBG062235.
InParanoidiQ8NDZ2.
PhylomeDBiQ8NDZ2.
TreeFamiTF332523.

Sequences (4)i

Sequence statusi: Complete.

This entry describes 4 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q8NDZ2-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MAPASASGED LRKLPTMAEV NGEQDFIDLT RETRPRTKDR SGLYVIDLTR
60 70 80 90 100
AEGENRPIAT LDLTLEPVTP SQKEPTSLQT CASLSGKAVM EGHVDRSSQP
110 120 130 140 150
TARRIINSDP VDLDLVEENT FVGPPPATSI SGGSVYPTEP NCSSATFTGN
160 170 180 190 200
LSFLASLQLS SDVSSLSPTS NNSRSSSSSS NQKAPLPCPQ QDVSRPPQAL
210 220 230 240 250
PCPLRPLPCP PRASPCPPRA SSCPPRALSC PSQTMQCQLP ALTHPPQEVP
260 270 280 290 300
CPRQNIPGPP QDSLGLPQDV PGLPQSILHP QDVAYLQDMP RSPGDVPQSP
310 320 330 340 350
SDVSPSPDAP QSPGGMPHLP GDVLHSPGDM PHSSGDVTHS PRDIPHLPGD
360 370 380 390 400
RPDFTQNDVQ NRDMPMDISA LSSPSCSPSP QSETPLEKVP WLSVMETPAR
410 420 430 440 450
KEISLSEPAK PGSAHVQSRT PQGGLYNRPC LHRLKYFLRP PVHHLFFQTL
460 470 480 490 500
IPDKDTRENK GQKLEPIPHR RLRMVTNTIE ENFPLGTVQF LMDFVSPQHY
510 520 530 540 550
PPREIVAHII QKILLSGSET VDVLKEAYML LMKIQQLHPA NAKTVEWDWK
560 570 580 590 600
LLTYVMEEEG QTLPGRVLFL RYVVQTLEDD FQQTLRRQRQ HLQQSIANMV
610 620 630 640 650
LSCDKQPHNV RDVIKWLVKA VTEDGLTQPP NGNQTSSGTG ILKASSSHPS
660 670 680 690 700
SQPNLTKNTN QLIVCQLQRM LSIAVEVDRT PTCSSNKIAE MMFGFVLDIP
710 720 730 740 750
ERSQREMFFT TMESHLLRCK VLEIIFLHSC ETPTRLPLSL AQALYFLNNS
760 770 780 790 800
TSLLKCQSDK SQWQTWDELV EHLQFLLSSY QHVLREHLRS SVIDRKDLII
810 820 830 840 850
KRIKPKPQQG DDITVVDVEK QIEAFRSRLI QMLGEPLVPQ LQDKVHLLKL
860 870
LLFYAADLNP DAEPFQKGWS GS
Length:872
Mass (Da):96,838
Last modified:February 6, 2013 - v3
Checksum:iD056ACDB8DD23805
GO
Isoform 2 (identifier: Q8NDZ2-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-89: Missing.
     377-415: SPSPQSETPL...SEPAKPGSAH → TPAWGTEQDS...FNLYRTRVKN
     416-872: Missing.

Note: No experimental confirmation available.
Show »
Length:326
Mass (Da):34,867
Checksum:i74C5629A11A5D116
GO
Isoform 3 (identifier: Q8NDZ2-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-415: Missing.
     416-458: VQSRTPQGGL...TLIPDKDTRE → MEDFIVISDD...TSGALPRRTV

Note: No experimental confirmation available.
Show »
Length:457
Mass (Da):52,294
Checksum:iED4057DCC14DF887
GO
Isoform 4 (identifier: Q8NDZ2-4) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-539: Missing.
     540-559: ANAKTVEWDWKLLTYVMEEE → MPRSFEQVIILKKWFLKPYK

Note: No experimental confirmation available.
Show »
Length:333
Mass (Da):38,363
Checksum:iC3A3AE3BED7095D3
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti336D → G in AK126204 (PubMed:14702039).Curated1
Sequence conflicti379S → R in AAH37298 (PubMed:15489334).Curated1
Sequence conflicti693F → L in AAH66980 (PubMed:15372022).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural variantiVAR_027568221S → F.Corresponds to variant rs2001605dbSNPEnsembl.1
Natural variantiVAR_027569463K → R.1 PublicationCorresponds to variant rs17857141dbSNPEnsembl.1
Natural variantiVAR_059603636S → F.Corresponds to variant rs2001605dbSNPEnsembl.1
Natural variantiVAR_027570772H → R.1 PublicationCorresponds to variant rs17853733dbSNPEnsembl.1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0206711 – 539Missing in isoform 4. 1 PublicationAdd BLAST539
Alternative sequenceiVSP_0206721 – 415Missing in isoform 3. 1 PublicationAdd BLAST415
Alternative sequenceiVSP_0206731 – 89Missing in isoform 2. 1 PublicationAdd BLAST89
Alternative sequenceiVSP_020674377 – 415SPSPQ…PGSAH → TPAWGTEQDSVSKKKKKKKR KEIPPNFLLFNLYRTRVKN in isoform 2. 1 PublicationAdd BLAST39
Alternative sequenceiVSP_020675416 – 872Missing in isoform 2. 1 PublicationAdd BLAST457
Alternative sequenceiVSP_020676416 – 458VQSRT…KDTRE → MEDFIVISDDSGSESSGGAR PGRSRRPRRALSRTSGALPR RTV in isoform 3. 1 PublicationAdd BLAST43
Alternative sequenceiVSP_020677540 – 559ANAKT…VMEEE → MPRSFEQVIILKKWFLKPYK in isoform 4. 1 PublicationAdd BLAST20

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK126204 mRNA. No translation available.
AC138956 Genomic DNA. No translation available.
AC139493 Genomic DNA. No translation available.
BC073880 mRNA. Translation: AAH73880.1.
BC037298 mRNA. Translation: AAH37298.1.
BC032390 mRNA. Translation: AAH32390.1.
BC066980 mRNA. Translation: AAH66980.1.
CCDSiCCDS4398.2. [Q8NDZ2-3]
CCDS78089.1. [Q8NDZ2-1]
CCDS78090.1. [Q8NDZ2-4]
RefSeqiNP_001295124.1. NM_001308195.1.
NP_001295125.1. NM_001308196.1. [Q8NDZ2-1]
NP_001295129.1. NM_001308200.1. [Q8NDZ2-4]
NP_940969.3. NM_198567.5. [Q8NDZ2-3]
XP_016864943.1. XM_017009454.1. [Q8NDZ2-1]
UniGeneiHs.719847.

Genome annotation databases

EnsembliENST00000332772; ENSP00000331311; ENSG00000170085. [Q8NDZ2-4]
ENST00000341199; ENSP00000342075; ENSG00000170085. [Q8NDZ2-3]
ENST00000430704; ENSP00000409287; ENSG00000170085. [Q8NDZ2-3]
ENST00000443967; ENSP00000406571; ENSG00000170085. [Q8NDZ2-1]
GeneIDi375484.
KEGGihsa:375484.
UCSCiuc003mdr.4. human. [Q8NDZ2-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK126204 mRNA. No translation available.
AC138956 Genomic DNA. No translation available.
AC139493 Genomic DNA. No translation available.
BC073880 mRNA. Translation: AAH73880.1.
BC037298 mRNA. Translation: AAH37298.1.
BC032390 mRNA. Translation: AAH32390.1.
BC066980 mRNA. Translation: AAH66980.1.
CCDSiCCDS4398.2. [Q8NDZ2-3]
CCDS78089.1. [Q8NDZ2-1]
CCDS78090.1. [Q8NDZ2-4]
RefSeqiNP_001295124.1. NM_001308195.1.
NP_001295125.1. NM_001308196.1. [Q8NDZ2-1]
NP_001295129.1. NM_001308200.1. [Q8NDZ2-4]
NP_940969.3. NM_198567.5. [Q8NDZ2-3]
XP_016864943.1. XM_017009454.1. [Q8NDZ2-1]
UniGeneiHs.719847.

3D structure databases

ProteinModelPortaliQ8NDZ2.
SMRiQ8NDZ2.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi131981. 17 interactors.
IntActiQ8NDZ2. 8 interactors.
MINTiMINT-1394569.
STRINGi9606.ENSP00000342075.

PTM databases

iPTMnetiQ8NDZ2.
PhosphoSitePlusiQ8NDZ2.

Polymorphism and mutation databases

BioMutaiSIMC1.
DMDMi449081288.

Proteomic databases

EPDiQ8NDZ2.
MaxQBiQ8NDZ2.
PaxDbiQ8NDZ2.
PeptideAtlasiQ8NDZ2.
PRIDEiQ8NDZ2.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000332772; ENSP00000331311; ENSG00000170085. [Q8NDZ2-4]
ENST00000341199; ENSP00000342075; ENSG00000170085. [Q8NDZ2-3]
ENST00000430704; ENSP00000409287; ENSG00000170085. [Q8NDZ2-3]
ENST00000443967; ENSP00000406571; ENSG00000170085. [Q8NDZ2-1]
GeneIDi375484.
KEGGihsa:375484.
UCSCiuc003mdr.4. human. [Q8NDZ2-1]

Organism-specific databases

CTDi375484.
GeneCardsiSIMC1.
H-InvDBHIX0024821.
HGNCiHGNC:24779. SIMC1.
HPAiHPA037889.
HPA037890.
neXtProtiNX_Q8NDZ2.
OpenTargetsiENSG00000170085.
PharmGKBiPA144596506.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiENOG410IH7J. Eukaryota.
ENOG410XQXC. LUCA.
GeneTreeiENSGT00390000013414.
HOGENOMiHOG000070068.
HOVERGENiHBG062235.
InParanoidiQ8NDZ2.
PhylomeDBiQ8NDZ2.
TreeFamiTF332523.

Enzyme and pathway databases

BioCyciZFISH:ENSG00000170085-MONOMER.

Miscellaneous databases

GenomeRNAii375484.
PROiQ8NDZ2.

Gene expression databases

BgeeiENSG00000170085.
CleanExiHS_C5orf25.
ExpressionAtlasiQ8NDZ2. baseline and differential.
GenevisibleiQ8NDZ2. HS.

Family and domain databases

ProtoNetiSearch...

Entry informationi

Entry nameiSIMC1_HUMAN
AccessioniPrimary (citable) accession number: Q8NDZ2
Secondary accession number(s): J3KQQ8
, Q6NXN8, Q6ZTU4, Q8IZ15
Entry historyi
Integrated into UniProtKB/Swiss-Prot: October 3, 2006
Last sequence update: February 6, 2013
Last modified: November 30, 2016
This is version 107 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 5
    Human chromosome 5: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.