Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Transcription factor SOX-6

Gene

Sox6

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: -Experimental evidence at protein leveli

Functioni

Transcriptional activator. Binds specifically to the DNA sequence 5'-AACAAT-3'. Plays a key role in several developmental processes, including neurogenesis and skeleton formation.

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
DNA bindingi620 – 688HMG boxPROSITE-ProRule annotationAdd BLAST69

GO - Molecular functioni

  • DNA binding Source: MGI
  • DNA binding transcription factor activity Source: MGI
  • protein heterodimerization activity Source: MGI
  • RNA polymerase II proximal promoter sequence-specific DNA binding Source: NTNU_SB
  • sequence-specific DNA binding Source: MGI
  • transcriptional repressor activity, RNA polymerase II proximal promoter sequence-specific DNA binding Source: NTNU_SB
  • transcription regulatory region DNA binding Source: MGI

GO - Biological processi

  • astrocyte differentiation Source: Ensembl
  • cardiac muscle cell differentiation Source: MGI
  • cartilage development Source: MGI
  • cell fate commitment Source: MGI
  • cell morphogenesis Source: MGI
  • cellular response to transforming growth factor beta stimulus Source: MGI
  • erythrocyte development Source: MGI
  • erythrocyte differentiation Source: MGI
  • gene silencing Source: MGI
  • hemopoiesis Source: MGI
  • in utero embryonic development Source: MGI
  • muscle cell differentiation Source: MGI
  • negative regulation of cardiac muscle cell differentiation Source: MGI
  • negative regulation of transcription, DNA-templated Source: MGI
  • negative regulation of transcription by RNA polymerase II Source: MGI
  • oligodendrocyte cell fate specification Source: MGI
  • oligodendrocyte differentiation Source: MGI
  • positive regulation of cartilage development Source: MGI
  • positive regulation of chondrocyte differentiation Source: UniProtKB
  • positive regulation of mesenchymal stem cell differentiation Source: MGI
  • positive regulation of transcription, DNA-templated Source: MGI
  • positive regulation of transcription by RNA polymerase II Source: MGI
  • post-embryonic development Source: MGI
  • regulation of gene expression Source: MGI
  • regulation of transcription, DNA-templated Source: MGI
  • transcription, DNA-templated Source: UniProtKB-KW

Keywordsi

Molecular functionActivator, Developmental protein, DNA-binding
Biological processTranscription, Transcription regulation

Enzyme and pathway databases

ReactomeiR-MMU-3769402 Deactivation of the beta-catenin transactivating complex

Names & Taxonomyi

Protein namesi
Recommended name:
Transcription factor SOX-6
Alternative name(s):
SOX-LZ
Gene namesi
Name:Sox6
Synonyms:Sox-6
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaMyomorphaMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 7

Organism-specific databases

MGIiMGI:98368 Sox6

Subcellular locationi

Extracellular region or secreted Cytosol Plasma membrane Cytoskeleton Lysosome Endosome Peroxisome ER Golgi apparatus Nucleus Mitochondrion Manual annotation Automatic computational assertionGraphics by Christian Stolte; Source: COMPARTMENTS

Keywords - Cellular componenti

Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000487301 – 827Transcription factor SOX-6Add BLAST827

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei119PhosphothreonineCombined sources1
Modified residuei399PhosphoserineCombined sources1
Modified residuei401PhosphothreonineCombined sources1
Cross-linki404Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO)By similarity
Cross-linki417Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO)By similarity
Modified residuei439PhosphoserineCombined sources1
Modified residuei442PhosphoserineCombined sources1

Post-translational modificationi

Sumoylation inhibits the transcriptional activity.By similarity

Keywords - PTMi

Isopeptide bond, Phosphoprotein, Ubl conjugation

Proteomic databases

PaxDbiP40645
PRIDEiP40645

PTM databases

iPTMnetiP40645
PhosphoSitePlusiP40645

Expressioni

Tissue specificityi

Highly expressed in testis.

Gene expression databases

BgeeiENSMUSG00000051910
CleanExiMM_SOX6
ExpressionAtlasiP40645 baseline and differential
GenevisibleiP40645 MM

Interactioni

Subunit structurei

Interacts with DAZAP2. May interact with CENPK.1 Publication

GO - Molecular functioni

  • protein heterodimerization activity Source: MGI

Protein-protein interaction databases

BioGridi203410, 5 interactors
IntActiP40645, 1 interactor
STRINGi10090.ENSMUSP00000072583

Structurei

3D structure databases

ProteinModelPortaliP40645
SMRiP40645
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Coiled coil

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Coiled coili184 – 262Sequence analysisAdd BLAST79

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi233 – 261Gln-richAdd BLAST29
Compositional biasi240 – 243Poly-Gln4
Compositional biasi280 – 285Poly-Ala6
Compositional biasi313 – 317Poly-Ala5
Compositional biasi514 – 517Poly-Gln4

Keywords - Domaini

Coiled coil

Phylogenomic databases

eggNOGiKOG0528 Eukaryota
ENOG410YZNG LUCA
GeneTreeiENSGT00760000119274
HOGENOMiHOG000056455
HOVERGENiHBG003915
InParanoidiP40645
KOiK09269
OrthoDBiEOG091G03D9
PhylomeDBiP40645
TreeFamiTF320471

Family and domain databases

Gene3Di1.10.30.10, 1 hit
InterProiView protein in InterPro
IPR009071 HMG_box_dom
IPR036910 HMG_box_dom_sf
IPR027153 SOX-6
PANTHERiPTHR10270:SF89 PTHR10270:SF89, 1 hit
PfamiView protein in Pfam
PF00505 HMG_box, 1 hit
SMARTiView protein in SMART
SM00398 HMG, 1 hit
SUPFAMiSSF47095 SSF47095, 1 hit
PROSITEiView protein in PROSITE
PS50118 HMG_BOX_2, 1 hit

Sequences (2)i

Sequence statusi: Complete.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

Note: Additional isoforms seem to exist.
Isoform 1 (identifier: P40645-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MSSKQATSPF ACTADGEEAM TQDLTSREKE EGSDQHPASH LPLHPIMHNK
60 70 80 90 100
PHSEELPTLV STIQQDADWD SVLSSQQRME SENNKLCSLY SFRNTSTSPH
110 120 130 140 150
KPDEGSRERE IMNSVTFGTP ERRKGSLADV VDTLKQKKLE EMTRTEQEDS
160 170 180 190 200
SCMEKLLSKD WKEKMERLNT SELLGEIKGT PESLAEKERQ LSTMITQLIS
210 220 230 240 250
LREQLLAAHD EQKKLAASQI EKQRQQMDLA RQQQEQIARQ QQQLLQQQHK
260 270 280 290 300
INLLQQQIQV QGHMPPLMIP IFPHDQRTLA AAAAAQQGFL FPPGITYKPG
310 320 330 340 350
DNYPVQFIPS TMAAAAASGL SPLQLQKGHV SHPQINPRLK GISDRFGRNL
360 370 380 390 400
DPSEHGGGHS YNHRQIEQLY AAQLASMQVS PGAKMPSTPQ PPNSAGAVSP
410 420 430 440 450
TGIKNEKRGT SPVTQVKDET TAQPLNLSSR PKTAEPVKSP TSPTQNLFPA
460 470 480 490 500
SKTSPVNLPN KSSIPSPIGG SLGRGSSLDI LSSLNSPALF GDQDTVMKAI
510 520 530 540 550
QEARKMREQI QREQQQQPHG VDGKLSSMNN MGLSNCRTEK ERTRFENLGP
560 570 580 590 600
QLTGKSSEDG KLGPGVIDLT RPEDAEGSKA MNGSAAKLQQ YYCWPTGGAT
610 620 630 640 650
VAEARVYRDA RGRASSEPHI KRPMNAFMVW AKDERRKILQ AFPDMHNSNI
660 670 680 690 700
SKILGSRWKS MSNQEKQPYY EEQARLSKIH LEKYPNYKYK PRPKRTCIVD
710 720 730 740 750
GKKLRIGEYK QLMRSRRQEM RQFFTVGQQP QMPITTGTGV VYPGAITMAT
760 770 780 790 800
TTPSPQMTSD CSSTSASPEP SLPVIQSTYG MKMDGASLAG NDMINGEDEM
810 820
EAYDDYEDDP KSDYSSENEA PEPVSAN
Length:827
Mass (Da):91,803
Last modified:February 1, 1996 - v2
Checksum:iF777BAB7CFB1E93B
GO
Isoform 2 (identifier: P40645-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     327-367: Missing.

Show »
Length:786
Mass (Da):87,192
Checksum:iEFC8E350AEC84653
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti312 – 313MA → SS in BAA09618 (PubMed:7791783).Curated2
Sequence conflicti632K → R in CAA46610 (PubMed:1614875).Curated1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_002198327 – 367Missing in isoform 2. 1 PublicationAdd BLAST41

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U32614 mRNA Translation: AAC52263.1
D61689 mRNA Translation: BAA09618.1
AJ010605 mRNA Translation: CAA09270.1
X65659 mRNA Translation: CAA46610.1
CCDSiCCDS40098.1 [P40645-1]
PIRiS22944
S59121
RefSeqiNP_001264255.1, NM_001277326.1 [P40645-1]
NP_001264257.1, NM_001277328.1 [P40645-2]
NP_035575.1, NM_011445.4 [P40645-1]
XP_011240009.1, XM_011241707.2 [P40645-1]
XP_011240013.1, XM_011241711.2 [P40645-2]
UniGeneiMm.323365
Mm.487065

Genome annotation databases

EnsembliENSMUST00000072804; ENSMUSP00000072583; ENSMUSG00000051910 [P40645-1]
ENSMUST00000166207; ENSMUSP00000129027; ENSMUSG00000051910 [P40645-1]
GeneIDi20679
KEGGimmu:20679
UCSCiuc009jin.2 mouse [P40645-2]
uc009jip.2 mouse [P40645-1]

Keywords - Coding sequence diversityi

Alternative splicing

Similar proteinsi

Entry informationi

Entry nameiSOX6_MOUSE
AccessioniPrimary (citable) accession number: P40645
Secondary accession number(s): Q62250, Q9QWS5
Entry historyiIntegrated into UniProtKB/Swiss-Prot: February 1, 1995
Last sequence update: February 1, 1996
Last modified: March 28, 2018
This is version 152 of the entry and version 2 of the sequence. See complete history.
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Direct protein sequencing, Reference proteome
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health