Protein

Transcription factor SOX-8

Gene

Sox8

Organism
Mus musculus (Mouse)
Status
Reviewed. Annotation score: 5 out of 5. Experimental evidence at transcript level.

Function

May play a role in central nervous system, limb and facial development. May be involved in male sex determination. Binds the consensus motif 5'-[AT][AT]CAA[AT]G-3'.
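The bracketed positions in the consensus motif read as "A or T", so the motif maps directly onto a regular expression. The following is a minimal sketch of scanning a DNA string for it; the function names and the example sequences are assumptions made for illustration and are not part of the entry.

```python
import re

# Consensus motif from the entry: 5'-[AT][AT]CAA[AT]G-3'
SOX8_MOTIF = re.compile(r"[AT][AT]CAA[AT]G")

# Translation table used to complement a DNA string.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def find_motif_sites(dna: str):
    """Return (0-based start, matched 7-mer) for each site on the given strand."""
    return [(m.start(), m.group(0)) for m in SOX8_MOTIF.finditer(dna.upper())]


# Hypothetical example sequence, forward strand only.
print(find_motif_sites("ggAACAATGcc"))
```

Only the strand passed in is scanned; to look for sites on the opposite strand, pass `reverse_complement(dna)` and convert the reported positions back as needed.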

Regions

Feature key | Position(s) | Length | Description
DNA binding | 99 – 167 | 69 | HMG box (PROSITE-ProRule annotation)

GO - Molecular function

  • DNA binding Source: MGI
  • protein heterodimerization activity Source: MGI
  • RNA polymerase II core promoter sequence-specific DNA binding Source: UniProtKB
  • sequence-specific DNA binding Source: MGI
  • sequence-specific DNA binding RNA polymerase II transcription factor activity Source: MGI
  • sequence-specific DNA binding transcription factor activity Source: MGI
  • transcription factor binding Source: UniProtKB

GO - Biological process

  • adipose tissue development Source: MGI
  • astrocyte fate commitment Source: MGI
  • cell fate commitment Source: MGI
  • cell maturation Source: MGI
  • enteric nervous system development Source: UniProtKB
  • fat cell differentiation Source: MGI
  • in utero embryonic development Source: MGI
  • male gonad development Source: MGI
  • metanephric nephron tubule formation Source: MGI
  • morphogenesis of a branching epithelium Source: MGI
  • negative regulation of apoptotic process Source: UniProtKB
  • negative regulation of myoblast differentiation Source: MGI
  • negative regulation of photoreceptor cell differentiation Source: MGI
  • negative regulation of transcription, DNA-templated Source: UniProtKB
  • neural crest cell migration Source: UniProtKB
  • oligodendrocyte differentiation Source: MGI
  • osteoblast differentiation Source: MGI
  • peripheral nervous system development Source: MGI
  • positive regulation of branching involved in ureteric bud morphogenesis Source: UniProtKB
  • positive regulation of gene expression Source: MGI
  • positive regulation of gliogenesis Source: UniProtKB
  • positive regulation of kidney development Source: UniProtKB
  • positive regulation of osteoblast proliferation Source: MGI
  • positive regulation of transcription, DNA-templated Source: UniProtKB
  • positive regulation of transcription from RNA polymerase II promoter Source: MGI
  • regulation of hormone levels Source: MGI
  • renal vesicle induction Source: UniProtKB
  • retina development in camera-type eye Source: MGI
  • retinal rod cell differentiation Source: MGI
  • Sertoli cell development Source: UniProtKB
  • signal transduction Source: UniProtKB
  • skeletal muscle cell differentiation Source: MGI
  • spermatogenesis Source: MGI
  • transcription, DNA-templated Source: UniProtKB-KW
  • ureter morphogenesis Source: MGI

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name: Transcription factor SOX-8
Gene names
Name: Sox8
Synonyms: Sox-8
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes: UP000000589. Component: Chromosome 17

Organism-specific databases

MGI: MGI:98370. Sox8.

Subcellular location

  • Nucleus (PROSITE-ProRule annotation)

GO - Cellular component

  • cytoplasm Source: UniProtKB
  • nuclear transcription factor complex Source: MGI
  • nucleus Source: MGI

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Chain | 1 – 464 | 464 | Transcription factor SOX-8 | PRO_0000048734

Proteomic databases

MaxQB: Q04886.
PaxDb: Q04886.
PRIDE: Q04886.

PTM databases

PhosphoSite: Q04886.

Expression

Tissue specificity

Expressed in brain, gut, limb and testes. Weakly expressed in liver, ovaries, spinal cord, lung and heart.

Gene expression databases

Bgee: Q04886.
CleanEx: MM_SOX8.
ExpressionAtlas: Q04886. baseline and differential.
Genevisible: Q04886. MM.

Interaction

Protein-protein interaction databases

BioGrid: 203412. 15 interactions.
IntAct: Q04886. 1 interaction.
STRING: 10090.ENSMUSP00000025003.

Structure

3D structure databases

ProteinModelPortal: Q04886.
SMR: Q04886. Positions 97-168.

Family & Domains

Sequence similarities

Contains 1 HMG box DNA-binding domain (PROSITE-ProRule annotation).

Phylogenomic databases

eggNOG: NOG324597.
GeneTree: ENSGT00760000118988.
HOGENOM: HOG000108876.
HOVERGEN: HBG002061.
InParanoid: Q04886.
KO: K09270.
OMA: DQSHGSP.
OrthoDB: EOG7Q2N5K.
PhylomeDB: Q04886.
TreeFam: TF351735.

Family and domain databases

Gene3D: 1.10.30.10. 1 hit.
InterPro: IPR009071. HMG_box_dom.
InterPro: IPR022151. Sox_N.
Pfam: PF00505. HMG_box. 1 hit.
Pfam: PF12444. Sox_N. 1 hit.
SMART: SM00398. HMG. 1 hit.
SUPFAM: SSF47095. SSF47095. 1 hit.
PROSITE: PS50118. HMG_BOX_2. 1 hit.

Sequence

Sequence status: Complete.

Q04886-1 [UniParc]

        10         20         30         40         50
MLDMSEARAQ PPCSPSGTAS SMSHVEDSDS DAPPSPAGSE GLGRAGGGGR
        60         70         80         90        100
GDTAEAADER FPACIRDAVS QVLKGYDWSL VPMPVRGGGG GTLKAKPHVK
       110        120        130        140        150
RPMNAFMVWA QAARRKLADQ YPHLHNAELS KTLGKLWRLL SESEKRPFVE
       160        170        180        190        200
EAERLRVQHK KDHPDYKYQP RRRKSVKTGR SDSDSGTELG HHPGGPMYKA
       210        220        230        240        250
DAVLGEAHHH SDHHTGQTHG PPTPPTTPKT DLHQASNGSK QELRLEGRRL
       260        270        280        290        300
VDSGRQNIDF SNVDISELSS EVISNMDTFD VHEFDQYLPL NGHSALPTEP
       310        320        330        340        350
SQATASGSYG GASYSHSGAT GIGASPVWAH KGAPSASASP TEAGPLRPQI
       360        370        380        390        400
KTEQLSPSHY NDQSHGSPGR ADYGSYSAQA SVTTAASATA ASSFASAQCD
       410        420        430        440        450
YTDLQASNYY SPYPGYPPSL YQYPYFHSSR RPYASPLLNG LSMPPAHSPS
       460
SNWDQPVYTT LTRP
Length: 464
Mass (Da): 49,879
Last modified: December 1, 2000 - v2
Checksum: 3C080629F58F5AB5
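
Feature positions in this entry, such as the HMG box at 99 – 167, are 1-based and inclusive, while the sequence is laid out in blocks of ten residues. A minimal sketch of turning the blocked text into a plain string and slicing out a feature follows; the helper names `clean` and `feature` are made up for illustration.

```python
# Sequence copied from this entry (Q04886-1); spaces and newlines are layout only.
SOX8_MOUSE = """
MLDMSEARAQ PPCSPSGTAS SMSHVEDSDS DAPPSPAGSE GLGRAGGGGR
GDTAEAADER FPACIRDAVS QVLKGYDWSL VPMPVRGGGG GTLKAKPHVK
RPMNAFMVWA QAARRKLADQ YPHLHNAELS KTLGKLWRLL SESEKRPFVE
EAERLRVQHK KDHPDYKYQP RRRKSVKTGR SDSDSGTELG HHPGGPMYKA
DAVLGEAHHH SDHHTGQTHG PPTPPTTPKT DLHQASNGSK QELRLEGRRL
VDSGRQNIDF SNVDISELSS EVISNMDTFD VHEFDQYLPL NGHSALPTEP
SQATASGSYG GASYSHSGAT GIGASPVWAH KGAPSASASP TEAGPLRPQI
KTEQLSPSHY NDQSHGSPGR ADYGSYSAQA SVTTAASATA ASSFASAQCD
YTDLQASNYY SPYPGYPPSL YQYPYFHSSR RPYASPLLNG LSMPPAHSPS
SNWDQPVYTT LTRP
"""


def clean(blocked: str) -> str:
    """Drop the spaces and newlines used to lay the sequence out in blocks."""
    return "".join(blocked.split())


def feature(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive positions."""
    return seq[start - 1:end]


seq = clean(SOX8_MOUSE)
hmg_box = feature(seq, 99, 167)  # HMG box coordinates from the Regions table
print(len(seq), len(hmg_box))    # prints: 464 69
```

The two printed numbers match the chain length (464) and the HMG box length (69) listed in the entry, which is a quick consistency check on the copied sequence.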

Sequence databases

EMBL / GenBank / DDBJ:
AF191325 Genomic DNA. Translation: AAF35837.1.
BC085619 mRNA. Translation: AAH85619.1.
Z18957 mRNA. Translation: CAA79482.1.
CCDS: CCDS28521.1.
PIR: S30246.
RefSeq: NP_035577.1. NM_011447.3.
UniGene: Mm.258220.

Genome annotation databases

Ensembl: ENSMUST00000025003; ENSMUSP00000025003; ENSMUSG00000024176.
GeneID: 20681.
KEGG: mmu:20681.
UCSC: uc008bay.1. mouse.

Cross-references

Organism-specific databases

CTD: 30812.
MGI: MGI:98370. Sox8.

Miscellaneous databases

ChiTaRS: Sox8. mouse.
NextBio: 299189.
PRO: Q04886.

Publications

  1. "Cloning and characterisation of the Sry-related transcription factor gene Sox8."
    Schepers G.E., Bullejos M., Hosking B.M., Koopman P.
    Nucleic Acids Res. 28:1473-1480(2000)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
    Strain: 129/Sv.
  2. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Tissue: Olfactory epithelium.
  3. "Seven new members of the Sox gene family expressed during mouse development."
    Wright E.M., Snopek B., Koopman P.
    Nucleic Acids Res. 21:744-744(1993)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 107-162.

Entry information

Entry name: SOX8_MOUSE
Accession: Primary (citable) accession number: Q04886
Entry history
Integrated into UniProtKB/Swiss-Prot: June 1, 1994
Last sequence update: December 1, 2000
Last modified: July 22, 2015
This is version 122 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
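
The seed-based clustering described above can be illustrated with a greedy sketch: sequences are visited longest first, and each one joins the first seed it matches at the identity and overlap thresholds, or becomes a new seed. This is an illustration under stated assumptions, not the procedure UniRef actually runs; `greedy_seed_clusters`, `toy_compare` and the example sequences are hypothetical, and `toy_compare` is a gap-free stand-in for a real alignment.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical callback: returns (fraction identity, fraction of the seed covered).
# A real pipeline would derive both numbers from a sequence alignment.
Compare = Callable[[str, str], Tuple[float, float]]


def toy_compare(seed: str, other: str) -> Tuple[float, float]:
    """Toy stand-in for an alignment: compare residues position by position."""
    n = min(len(seed), len(other))
    matches = sum(a == b for a, b in zip(seed, other))
    return matches / n, n / len(seed)


def greedy_seed_clusters(seqs: Dict[str, str], compare: Compare,
                         min_identity: float = 0.9,
                         min_overlap: float = 0.8) -> Dict[str, List[str]]:
    """Map each seed name to the names of the sequences clustered with it."""
    clusters: Dict[str, List[str]] = {}
    # Longest first (ties broken by name), so each seed is the longest member.
    for name in sorted(seqs, key=lambda n: (-len(seqs[n]), n)):
        for seed in clusters:
            identity, overlap = compare(seqs[seed], seqs[name])
            if identity >= min_identity and overlap >= min_overlap:
                clusters[seed].append(name)
                break
        else:
            # No existing seed matched: this sequence starts a new cluster.
            clusters[name] = [name]
    return clusters


# Made-up sequences for illustration only.
example = {"a": "MKTAYIAKQR", "b": "MKTAYIAKQ", "c": "GGGGGGGGGG"}
print(greedy_seed_clusters(example, toy_compare))
```

In the example, `b` is identical to `a` over 90% of the seed length and joins its cluster, while `c` shares no residues with `a` and becomes its own seed.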