Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Polycomb protein SCMH1

Gene

SCMH1

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: -Experimental evidence at protein leveli

Functioni

Associates with Polycomb group (PcG) multiprotein complexes; the complex class is required to maintain the transcriptionally repressive state of some genes.By similarity

GO - Biological processi

Keywordsi

Molecular functionDevelopmental protein, Repressor
Biological processTranscription, Transcription regulation

Enzyme and pathway databases

ReactomeiR-HSA-2559580 Oxidative Stress Induced Senescence
R-HSA-3108214 SUMOylation of DNA damage response and repair proteins
R-HSA-4551638 SUMOylation of chromatin organization proteins
R-HSA-4570464 SUMOylation of RNA binding proteins
R-HSA-8939243 RUNX1 interacts with co-factors whose precise effect on RUNX1 targets is not known
R-HSA-8943724 Regulation of PTEN gene transcription

Names & Taxonomyi

Protein namesi
Recommended name:
Polycomb protein SCMH1
Alternative name(s):
Sex comb on midleg homolog 1
Gene namesi
Name:SCMH1Imported
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 1

Organism-specific databases

EuPathDBiHostDB:ENSG00000010803.16
HGNCiHGNC:19003 SCMH1
MIMi616396 gene
neXtProtiNX_Q96GD3

Subcellular locationi

Extracellular region or secreted Cytosol Plasma membrane Cytoskeleton Lysosome Endosome Peroxisome ER Golgi apparatus Nucleus Mitochondrion Manual annotation Automatic computational assertionGraphics by Christian Stolte; Source: COMPARTMENTS

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Organism-specific databases

OpenTargetsiENSG00000010803
PharmGKBiPA134870272

Chemistry databases

DrugBankiDB03345 Beta-Mercaptoethanol

Polymorphism and mutation databases

BioMutaiSCMH1
DMDMi60390956

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00001143341 – 660Polycomb protein SCMH1Add BLAST660

Proteomic databases

MaxQBiQ96GD3
PaxDbiQ96GD3
PeptideAtlasiQ96GD3
PRIDEiQ96GD3

PTM databases

iPTMnetiQ96GD3
PhosphoSitePlusiQ96GD3

Expressioni

Tissue specificityi

Strongly expressed in heart, muscle and pancreas. Weakly expressed in brain, placenta, lung, liver and kidney.1 Publication

Gene expression databases

BgeeiENSG00000010803
CleanExiHS_SCMH1
ExpressionAtlasiQ96GD3 baseline and differential
GenevisibleiQ96GD3 HS

Organism-specific databases

HPAiHPA048898
HPA053292

Interactioni

Subunit structurei

Interacts with the SAM domain of PHC1 via its SAM domain in vitro (By similarity). Associates with a PRC1-like complex.By similarity

Binary interactionsi

Show more details

Protein-protein interaction databases

BioGridi116609, 30 interactors
CORUMiQ96GD3
IntActiQ96GD3, 21 interactors
MINTiQ96GD3
STRINGi9606.ENSP00000318094

Structurei

Secondary structure

1660
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Helixi30 – 36Combined sources7
Helixi44 – 46Combined sources3
Beta strandi47 – 49Combined sources3
Beta strandi63 – 68Combined sources6
Beta strandi71 – 84Combined sources14
Beta strandi87 – 92Combined sources6
Beta strandi101 – 104Combined sources4
Helixi115 – 118Combined sources4
Helixi133 – 135Combined sources3
Helixi136 – 144Combined sources9
Helixi152 – 154Combined sources3
Beta strandi172 – 176Combined sources5
Beta strandi184 – 193Combined sources10
Beta strandi196 – 201Combined sources6
Turni205 – 208Combined sources4
Beta strandi210 – 213Combined sources4
Helixi224 – 228Combined sources5

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
2P0KX-ray1.75A27-238[»]
ProteinModelPortaliQ96GD3
SMRiQ96GD3
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiQ96GD3

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati28 – 126MBT 1Add BLAST99
Repeati134 – 235MBT 2Add BLAST102
Domaini593 – 658SAMPROSITE-ProRule annotationAdd BLAST66

Sequence similaritiesi

Belongs to the SCM family.Curated

Keywords - Domaini

Repeat

Phylogenomic databases

eggNOGiENOG410IMEZ Eukaryota
ENOG410XPKI LUCA
GeneTreeiENSGT00760000119024
HOGENOMiHOG000236280
HOVERGENiHBG056406
InParanoidiQ96GD3
KOiK11461
OMAiKILNNAM
OrthoDBiEOG091G05BW
PhylomeDBiQ96GD3
TreeFamiTF106488

Family and domain databases

Gene3Di1.20.1380.20, 1 hit
InterProiView protein in InterPro
IPR004092 Mbt
IPR001660 SAM
IPR013761 SAM/pointed_sf
IPR033763 SCML2_RBR
IPR021987 SLED
IPR038348 SLED_sf
PfamiView protein in Pfam
PF02820 MBT, 2 hits
PF17208 RBR, 1 hit
PF00536 SAM_1, 1 hit
PF12140 SLED, 1 hit
SMARTiView protein in SMART
SM00561 MBT, 2 hits
SM00454 SAM, 1 hit
SUPFAMiSSF47769 SSF47769, 1 hit
PROSITEiView protein in PROSITE
PS51079 MBT, 2 hits
PS50105 SAM_DOMAIN, 1 hit

Sequences (6)i

Sequence statusi: Complete.

This entry describes 6 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q96GD3-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MLVCYSVLAC EILWDLPCSI MGSPLGHFTW DKYLKETCSV PAPVHCFKQS
60 70 80 90 100
YTPPSNEFKI SMKLEAQDPR NTTSTCIATV VGLTGARLRL RLDGSDNKND
110 120 130 140 150
FWRLVDSAEI QPIGNCEKNG GMLQPPLGFR LNASSWPMFL LKTLNGAEMA
160 170 180 190 200
PIRIFHKEPP SPSHNFFKMG MKLEAVDRKN PHFICPATIG EVRGSEVLVT
210 220 230 240 250
FDGWRGAFDY WCRFDSRDIF PVGWCSLTGD NLQPPGTKVV IPKNPYPASD
260 270 280 290 300
VNTEKPSIHS STKTVLEHQP GQRGRKPGKK RGRTPKTLIS HPISAPSKTA
310 320 330 340 350
EPLKFPKKRG PKPGSKRKPR TLLNPPPASP TTSTPEPDTS TVPQDAATIP
360 370 380 390 400
SSAMQAPTVC IYLNKNGSTG PHLDKKKVQQ LPDHFGPARA SVVLQQAVQA
410 420 430 440 450
CIDCAYHQKT VFSFLKQGHG GEVISAVFDR EQHTLNLPAV NSITYVLRFL
460 470 480 490 500
EKLCHNLRSD NLFGNQPFTQ THLSLTAIEY SHSHDRYLPG ETFVLGNSLA
510 520 530 540 550
RSLEPHSDSM DSASNPTNLV STSQRHRPLL SSCGLPPSTA SAVRRLCSRG
560 570 580 590 600
VLKGSNERRD MESFWKLNRS PGSDRYLESR DASRLSGRDP SSWTVEDVMQ
610 620 630 640 650
FVREADPQLG PHADLFRKHE IDGKALLLLR SDMMMKYMGL KLGPALKLSY
660
HIDRLKQGKF
Note: No experimental confirmation available.Curated
Length:660
Mass (Da):73,354
Last modified:December 1, 2001 - v1
Checksum:i6544DD484DA8D037
GO
Isoform 2 (identifier: Q96GD3-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-24: MLVCYSVLACEILWDLPCSIMGSP → MQPNVIDWSDVRKHKYGHLSESASQYQEAADILD
     550-571: Missing.

Note: Gene prediction confirmed by EST data.
Show »
Length:648
Mass (Da):72,053
Checksum:i5DBA4803D9E8B3B5
GO
Isoform 3 (identifier: Q96GD3-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-61: Missing.

Show »
Length:599
Mass (Da):66,459
Checksum:iAC9C5AB96BDF79E1
GO
Isoform 41 Publication (identifier: Q96GD3-4) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-48: MLVCYSVLACEILWDLPCSIMGSPLGHFTWDKYLKETCSVPAPVHCFK → M
     550-571: Missing.

Show »
Length:591
Mass (Da):65,481
Checksum:iE5F43403878E53BA
GO
Isoform 51 Publication (identifier: Q96GD3-5) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-61: Missing.
     550-571: Missing.

Note: May be due to intron retention.1 Publication
Show »
Length:577
Mass (Da):63,870
Checksum:i1EC4A9E8452B66F0
GO
Isoform 6 (identifier: Q96GD3-6) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-48: MLVCYSVLACEILWDLPCSIMGSPLGHFTWDKYLKETCSVPAPVHCFK → M
     128-238: Missing.
     550-571: Missing.

Note: No experimental confirmation available.
Show »
Length:480
Mass (Da):52,908
Checksum:iCCD341EBF6748EF5
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti463F → L (Ref. 3) Curated1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0516781 – 61Missing in isoform 3 and isoform 5. 4 PublicationsAdd BLAST61
Alternative sequenceiVSP_0516771 – 48MLVCY…VHCFK → M in isoform 4 and isoform 6. 2 PublicationsAdd BLAST48
Alternative sequenceiVSP_0516761 – 24MLVCY…IMGSP → MQPNVIDWSDVRKHKYGHLS ESASQYQEAADILD in isoform 2. CuratedAdd BLAST24
Alternative sequenceiVSP_043395128 – 238Missing in isoform 6. 1 PublicationAdd BLAST111
Alternative sequenceiVSP_051679550 – 571Missing in isoform 2, isoform 4, isoform 5 and isoform 6. 3 PublicationsAdd BLAST22

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF149045 mRNA Translation: AAF01150.1
AF149046 mRNA Translation: AAF01151.1
AK299383 mRNA Translation: BAG61370.1
CR457161 mRNA Translation: CAG33442.1
BX640721 mRNA Translation: CAE45840.1
AL110502, AL391730 Genomic DNA Translation: CAI22109.1
AL110502, AL391730 Genomic DNA Translation: CAI22110.1
AL110502, AL391730, AL606484 Genomic DNA Translation: CAI22111.1
AL110502, AL391730 Genomic DNA Translation: CAI22112.1
AL110502, AL391730 Genomic DNA Translation: CAI22113.1
AL391730, AL110502 Genomic DNA Translation: CAH72791.1
AL391730, AL110502 Genomic DNA Translation: CAH72793.1
AL391730, AL110502, AL606484 Genomic DNA Translation: CAH72794.1
AL391730, AL110502 Genomic DNA Translation: CAH72795.1
AL391730, AL110502 Genomic DNA Translation: CAH72796.1
AL606484, AL110502, AL391730 Genomic DNA Translation: CAH72242.1
BC009752 mRNA Translation: AAH09752.1
BC021252 mRNA Translation: AAH21252.1
CCDSiCCDS30688.1 [Q96GD3-1]
CCDS461.1 [Q96GD3-4]
CCDS53301.1 [Q96GD3-5]
CCDS53302.1 [Q96GD3-3]
CCDS53303.1 [Q96GD3-6]
CCDS53304.1 [Q96GD3-2]
RefSeqiNP_001026864.1, NM_001031694.2 [Q96GD3-1]
NP_001165689.1, NM_001172218.1 [Q96GD3-5]
NP_001165690.1, NM_001172219.1 [Q96GD3-2]
NP_001165691.1, NM_001172220.1 [Q96GD3-5]
NP_001165692.1, NM_001172221.1 [Q96GD3-3]
NP_001165693.1, NM_001172222.2 [Q96GD3-6]
NP_036368.1, NM_012236.3 [Q96GD3-4]
XP_006710527.1, XM_006710464.1 [Q96GD3-3]
XP_011539335.1, XM_011541033.2 [Q96GD3-1]
XP_011539338.1, XM_011541036.2 [Q96GD3-3]
XP_016856188.1, XM_017000699.1 [Q96GD3-3]
XP_016856196.1, XM_017000707.1 [Q96GD3-6]
UniGeneiHs.571874

Genome annotation databases

EnsembliENST00000326197; ENSP00000318094; ENSG00000010803 [Q96GD3-1]
ENST00000337495; ENSP00000337352; ENSG00000010803 [Q96GD3-2]
ENST00000361191; ENSP00000354656; ENSG00000010803 [Q96GD3-5]
ENST00000361705; ENSP00000354996; ENSG00000010803 [Q96GD3-4]
ENST00000372595; ENSP00000361676; ENSG00000010803 [Q96GD3-3]
ENST00000372596; ENSP00000361677; ENSG00000010803 [Q96GD3-5]
ENST00000372597; ENSP00000361678; ENSG00000010803 [Q96GD3-4]
ENST00000397171; ENSP00000380356; ENSG00000010803 [Q96GD3-5]
ENST00000397174; ENSP00000380359; ENSG00000010803 [Q96GD3-1]
ENST00000402904; ENSP00000386079; ENSG00000010803 [Q96GD3-3]
ENST00000456518; ENSP00000403974; ENSG00000010803 [Q96GD3-6]
GeneIDi22955
KEGGihsa:22955
UCSCiuc001cgp.4 human [Q96GD3-1]

Keywords - Coding sequence diversityi

Alternative splicing

Similar proteinsi

Entry informationi

Entry nameiSCMH1_HUMAN
AccessioniPrimary (citable) accession number: Q96GD3
Secondary accession number(s): B4DRQ8
, Q5VT76, Q6IAJ4, Q8WU48, Q9UKM5, Q9UKM6
Entry historyiIntegrated into UniProtKB/Swiss-Prot: March 1, 2005
Last sequence update: December 1, 2001
Last modified: March 28, 2018
This is version 146 of the entry and version 1 of the sequence. See complete history.
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Reference proteome
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health