
Q06831 (SOX4_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 118.


Names and origin

Protein names: Recommended name: Transcription factor SOX-4
Gene names: Name: Sox4; Synonyms: Sox-4
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 440 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Transcriptional activator that binds with high affinity to the T-cell enhancer motif 5'-AACAAAG-3'.
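As an illustration of what the motif annotation means in practice, the following minimal sketch scans a DNA string for exact copies of 5'-AACAAAG-3' on either strand. It assumes exact string matching, which is a simplification of real binding-site recognition; the function names and the example sequence are hypothetical and are not part of this entry.

```python
# Sketch: locate exact copies of the annotated motif on both strands.
MOTIF = "AACAAAG"  # motif from the Function annotation, written 5'->3'

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an A/C/G/T string."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def find_motif_sites(dna: str, motif: str = MOTIF):
    """Return sorted (1-based start, strand) pairs for each exact match.

    Starts are given in plus-strand coordinates; a '-' hit means the
    motif reads 5'->3' on the opposite strand.
    """
    dna = dna.upper()
    sites = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = dna.find(pattern)
        while start != -1:
            sites.append((start + 1, strand))
            start = dna.find(pattern, start + 1)  # allow overlapping hits
    return sorted(sites)


# Hypothetical test sequence, not taken from the entry.
example = "GGAACAAAGCCTTCTTTGTTAA"
print(find_motif_sites(example))
```

Reporting minus-strand hits in plus-strand coordinates keeps all positions comparable within one sequence.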

Subunit structure

Interacts with UBE2I (by similarity).

Subcellular location

Nucleus.

Tissue specificity

Expressed in lymphocytes and in molar and incisor tooth germs.

Sequence similarities

Contains 1 HMG box DNA-binding domain.

Ontologies

Keywords
   Biological process: Transcription, Transcription regulation
   Cellular component: Nucleus
   Ligand: DNA-binding
   Molecular function: Activator
   Technical term: 3D-structure, Complete proteome, Reference proteome
Gene Ontology (GO)
   Biological_process

DNA damage response, detection of DNA damage

Inferred from sequence or structural similarity. Source: UniProtKB

DNA damage response, signal transduction by p53 class mediator resulting in cell cycle arrest

Inferred from sequence or structural similarity. Source: UniProtKB

T cell differentiation

Inferred from mutant phenotype PubMed 9174623. Source: BHF-UCL

ascending aorta morphogenesis

Inferred from mutant phenotype PubMed 16109771. Source: BHF-UCL

atrial septum primum morphogenesis

Inferred from mutant phenotype PubMed 16109771. Source: BHF-UCL

canonical Wnt signaling pathway

Inferred from mutant phenotype PubMed 17875931. Source: BHF-UCL

cardiac right ventricle morphogenesis

Inferred from mutant phenotype PubMed 16109771. Source: BHF-UCL

cardiac ventricle formation

Inferred from mutant phenotype PubMed 20596238. Source: UniProtKB

cellular response to glucose stimulus

Inferred from mutant phenotype PubMed 18477811. Source: UniProtKB

endocrine pancreas development

Inferred from mutant phenotype PubMed 16306355. Source: MGI

glial cell development

Inferred from mutant phenotype PubMed 20147379, PubMed 20646169. Source: UniProtKB

glial cell proliferation

Inferred from mutant phenotype PubMed 20147379. Source: UniProtKB

glucose homeostasis

Inferred from mutant phenotype PubMed 18477811. Source: UniProtKB

heart development

Inferred from mutant phenotype PubMed 8614465. Source: BHF-UCL

kidney morphogenesis

Inferred from mutant phenotype PubMed 16109771. Source: BHF-UCL

limb bud formation

Inferred from mutant phenotype PubMed 20596238. Source: UniProtKB

mitral valve morphogenesis

Inferred from mutant phenotype PubMed 16109771. Source: BHF-UCL

negative regulation of cell death

Inferred from mutant phenotype PubMed 20596238, PubMed 20646169. Source: UniProtKB

negative regulation of cell proliferation

Inferred from sequence or structural similarity. Source: UniProtKB

negative regulation of protein export from nucleus

Inferred from sequence or structural similarity. Source: UniProtKB

negative regulation of protein ubiquitination

Inferred from sequence or structural similarity. Source: UniProtKB

neural tube formation

Inferred from mutant phenotype PubMed 20596238. Source: UniProtKB

neuroepithelial cell differentiation

Inferred from mutant phenotype PubMed 20646169. Source: UniProtKB

noradrenergic neuron differentiation

Inferred from mutant phenotype PubMed 20147379. Source: UniProtKB

positive regulation of N-terminal peptidyl-lysine acetylation

Inferred from sequence or structural similarity. Source: UniProtKB

positive regulation of Wnt signaling pathway

Inferred from mutant phenotype PubMed 17875931. Source: BHF-UCL

positive regulation of apoptotic process

Inferred from electronic annotation. Source: Ensembl

positive regulation of canonical Wnt signaling pathway

Inferred from direct assay PubMed 17875931. Source: UniProtKB

positive regulation of cell proliferation

Inferred from mutant phenotype PubMed 20596238. Source: UniProtKB

positive regulation of insulin secretion

Inferred from mutant phenotype PubMed 18477811. Source: UniProtKB

positive regulation of transcription from RNA polymerase II promoter

Inferred from direct assay PubMed 18505825. Source: UniProtKB

positive regulation of transcription, DNA-templated

Inferred from direct assay PubMed 20596238, PubMed 21527504. Source: UniProtKB

positive regulation of translation

Inferred from sequence or structural similarity. Source: UniProtKB

pro-B cell differentiation

Inferred from mutant phenotype PubMed 8614465. Source: BHF-UCL

protein stabilization

Inferred from mutant phenotype PubMed 17875931. Source: BHF-UCL

regulation of protein stability

Inferred from sequence or structural similarity. Source: UniProtKB

regulation of transcription, DNA-templated

Inferred from sequence or structural similarity. Source: UniProtKB

skeletal system development

Inferred from mutant phenotype PubMed 20596238. Source: UniProtKB

somatic stem cell maintenance

Inferred from direct assay PubMed 19379700. Source: MGI

spinal cord development

Inferred from mutant phenotype PubMed 20646169. Source: UniProtKB

spinal cord motor neuron differentiation

Inferred from mutant phenotype PubMed 20646169. Source: UniProtKB

sympathetic nervous system development

Inferred from mutant phenotype PubMed 20147379. Source: UniProtKB

transcription from RNA polymerase II promoter

Inferred from direct assay PubMed 22344693. Source: GOC

ventricular septum morphogenesis

Inferred from mutant phenotype PubMed 16109771. Source: BHF-UCL

   Cellular_component

cytoplasm

Inferred from sequence or structural similarity. Source: UniProtKB

mitochondrion

Inferred from sequence or structural similarity. Source: UniProtKB

nuclear transcription factor complex

Inferred from direct assay PubMed 22344693. Source: MGI

nucleus

Inferred from direct assay PubMed 17875931. Source: UniProtKB

   Molecular_function

RNA polymerase II core promoter proximal region sequence-specific DNA binding transcription factor activity involved in positive regulation of transcription

Inferred from direct assay PubMed 18505825. Source: UniProtKB

RNA polymerase II transcription coactivator activity

Inferred from direct assay PubMed 18505825. Source: UniProtKB

core promoter sequence-specific DNA binding

Inferred from direct assay PubMed 20596238. Source: UniProtKB

nucleic acid binding transcription factor activity

Inferred from sequence or structural similarity. Source: UniProtKB

protein binding

Inferred from physical interaction PubMed 17875931. Source: IntAct

protein heterodimerization activity

Inferred from direct assay PubMed 22344693. Source: MGI

sequence-specific DNA binding RNA polymerase II transcription factor activity

Inferred from direct assay PubMed 22344693. Source: MGI

sequence-specific DNA binding transcription factor activity

Inferred from direct assay PubMed 21527504. Source: UniProtKB

transcription regulatory region sequence-specific DNA binding

Inferred from direct assay PubMed 18505825. Source: UniProtKB


Binary interactions

With   Entry    #Exp.  IntAct                    Notes
TCF4   P15884   2      EBI-6262177, EBI-533224   From a different organism.

Sequence annotation (Features)

Feature key          Position(s)  Length  Description                   Feature identifier

Molecule processing

Chain                1 – 440      440     Transcription factor SOX-4    PRO_0000048725

Regions

DNA binding          59 – 127     69      HMG box
Compositional bias   347 – 363    17      Poly-Ser

Experimental info

Sequence conflict    175          1       S → T in CAA49779. Ref.1
Sequence conflict    179          1       A → T in CAA49779. Ref.1
Sequence conflict    235 – 236    2       SA → QL in CAA49779. Ref.1
Sequence conflict    263          1       R → H in CAA49779. Ref.1
Sequence conflict    283          1       S → C in CAA49779. Ref.1
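The sequence conflicts can be read as substitutions against the canonical sequence. The sketch below shows one way to apply them to a fragment; it assumes 1-based inclusive positions as used in this entry. The helper name `apply_conflicts` and its `offset` parameter are hypothetical. The two fragments are residues 171 to 180 and 231 to 240 of the canonical sequence.

```python
def apply_conflicts(seq, conflicts, offset=1):
    """Apply (start, end, original, replacement) substitutions to seq.

    Positions are 1-based and inclusive, in entry coordinates;
    offset is the entry position of seq[0]. Each substitution is
    checked against the expected original residues before it is applied.
    """
    # Work right to left so that edits never shift pending coordinates.
    for start, end, original, replacement in sorted(conflicts, reverse=True):
        lo = start - offset
        hi = end - offset + 1
        if lo < 0 or hi > len(seq):
            raise ValueError(f"positions {start}-{end} fall outside the fragment")
        if seq[lo:hi] != original:
            raise ValueError(f"expected {original!r} at {start}-{end}, found {seq[lo:hi]!r}")
        seq = seq[:lo] + replacement + seq[hi:]
    return seq


# Residues 171-180 of the canonical sequence, with the conflicts at 175 and 179.
print(apply_conflicts("GAGGSSKPAP", [(175, 175, "S", "T"), (179, 179, "A", "T")], offset=171))

# Residues 231-240, with the two-residue conflict at 235-236.
print(apply_conflicts("VRTPSAATPA", [(235, 236, "SA", "QL")], offset=231))
```

Checking the original residues first guards against off-by-one errors when mapping feature positions onto a fragment.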

Secondary structure

[Graphical view of helix, strand, and turn positions along residues 1 – 440; not reproduced here.]

Sequences

Q06831 [UniParc].

Last modified July 27, 2011. Version 2.
Checksum: 979AADBA7F674B6D

Length: 440 AA. Mass: 45,044 Da.
        10         20         30         40         50         60 
MVQQTNNAEN TEALLAGESS DSGAGLELGI ASSPTPGSTA STGGKADDPS WCKTPSGHIK 

        70         80         90        100        110        120 
RPMNAFMVWS QIERRKIMEQ SPDMHNAEIS KRLGKRWKLL KDSDKIPFIQ EAERLRLKHM 

       130        140        150        160        170        180 
ADYPDYKYRP RKKVKSGNAG AGSAATAKPG EKGDKVAGSS GHAGSSHAGG GAGGSSKPAP 

       190        200        210        220        230        240 
KKSCGPKVAG SSVGKPHAKL VPAGGSKAAA SFSPEQAALL PLGEPTAVYK VRTPSAATPA 

       250        260        270        280        290        300 
ASSSPSSALA TPAKHPADKK VKRVYLFGSL GASASPVGGL GASADPSDPL GLYEDGGPGC 

       310        320        330        340        350        360 
SPDGRSLSGR SSAASSPAAS RSPADHRGYA SLRAASPAPS SAPSHASSSL SSSSSSSSGS 

       370        380        390        400        410        420 
SSSDDEFEDD LLDLNPSSNF ESMSLGSFSS SSALDRDLDF NFEPGSGSHF EFPDYCTPEV 

       430        440 
SEMISGDWLE SSISNLVFTY 
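The numbered block above can be turned into a plain sequence string and checked against the annotated features. The following is a minimal sketch assuming ruler lines contain only position numbers and that feature positions are 1-based and inclusive. The helper names `parse_numbered_sequence` and `feature` are hypothetical.

```python
NUMBERED_BLOCK = """\
        10         20         30         40         50         60
MVQQTNNAEN TEALLAGESS DSGAGLELGI ASSPTPGSTA STGGKADDPS WCKTPSGHIK

        70         80         90        100        110        120
RPMNAFMVWS QIERRKIMEQ SPDMHNAEIS KRLGKRWKLL KDSDKIPFIQ EAERLRLKHM

       130        140        150        160        170        180
ADYPDYKYRP RKKVKSGNAG AGSAATAKPG EKGDKVAGSS GHAGSSHAGG GAGGSSKPAP

       190        200        210        220        230        240
KKSCGPKVAG SSVGKPHAKL VPAGGSKAAA SFSPEQAALL PLGEPTAVYK VRTPSAATPA

       250        260        270        280        290        300
ASSSPSSALA TPAKHPADKK VKRVYLFGSL GASASPVGGL GASADPSDPL GLYEDGGPGC

       310        320        330        340        350        360
SPDGRSLSGR SSAASSPAAS RSPADHRGYA SLRAASPAPS SAPSHASSSL SSSSSSSSGS

       370        380        390        400        410        420
SSSDDEFEDD LLDLNPSSNF ESMSLGSFSS SSALDRDLDF NFEPGSGSHF EFPDYCTPEV

       430        440
SEMISGDWLE SSISNLVFTY
"""


def parse_numbered_sequence(text):
    """Join the residue groups of a numbered sequence block into one string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines, which hold only position numbers.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


def feature(seq, start, end):
    """Return residues start..end (1-based, inclusive)."""
    return seq[start - 1:end]


sequence = parse_numbered_sequence(NUMBERED_BLOCK)
hmg_box = feature(sequence, 59, 127)    # DNA binding, HMG box
poly_ser = feature(sequence, 347, 363)  # Compositional bias, Poly-Ser

print(len(sequence), len(hmg_box), poly_ser)
```

The lengths recovered this way agree with the 440 AA sequence length and the 69-residue and 17-residue features listed in the sequence annotation.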


References

[1] "Sox-4, an Sry-like HMG box protein, is a transcriptional activator in lymphocytes."
van de Wetering M., Oosterwegel M., van Norren K., Clevers H.C.
EMBO J. 12:3847-3854(1993)
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
[2] "The murine Sox-4 protein is encoded on a single exon."
Schilham M.W., van Eijk M., van de Wetering M., Clevers H.C.
Nucleic Acids Res. 21:2009-2009(1993)
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
[3] "Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[4] Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.
Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[5] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Strain: C57BL/6.
Tissue: Brain.
[6] "Numerous members of the Sox family of HMG box-containing genes are expressed in developing mouse teeth."
Stock D.W., Buchanan A.V., Zhao Z., Weiss K.M.
Genomics 37:234-237(1996)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 69-122.
Strain: Swiss Webster.
Tissue: Embryonic tooth.
[7] "Phosphoproteomic analysis of the developing mouse brain."
Ballif B.A., Villen J., Beausoleil S.A., Schwartz D., Gygi S.P.
Mol. Cell. Proteomics 3:1093-1101(2004)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
Tissue: Embryonic brain.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
X70298 mRNA. Translation: CAA49779.1.
AL606511 Genomic DNA. Translation: CAI24776.1.
CH466561 Genomic DNA. Translation: EDL32406.1.
BC052736 mRNA. Translation: AAH52736.1.
U70440 mRNA. Translation: AAC52858.1.
CCDS: CCDS26411.1.
PIR: S37303.
RefSeq: NP_033264.2. NM_009238.2.
UniGene: Mm.240627. Mm.455819.

3D structure databases

PDBe / RCSB-PDB / PDBj:
Entry  Method  Resolution (Å)  Chain  Positions
3U2B   X-ray   2.40            C      57-135
ProteinModelPortal: Q06831.
SMR: Q06831. Positions 57-132.
ModBase: Search...
MobiDB: Search...

Protein-protein interaction databases

IntAct: Q06831. 5 interactions.

PTM databases

PhosphoSite: Q06831.

Proteomic databases

PRIDE: Q06831.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

Ensembl: ENSMUST00000067230; ENSMUSP00000100013; ENSMUSG00000076431.
GeneID: 20677.
KEGG: mmu:20677.
UCSC: uc007pyk.1. mouse.

Organism-specific databases

CTD: 6659.
MGI: MGI:98366. Sox4.

Phylogenomic databases

GeneTree: ENSGT00690000101940.
HOGENOM: HOG000231874.
HOVERGEN: HBG005040.
InParanoid: Q5SW95.
KO: K09268.
OMA: AGCSPDG.
OrthoDB: EOG7TMZVP.

Gene expression databases

ArrayExpress: Q06831.
Bgee: Q06831.
CleanEx: MM_SOX4.
Genevestigator: Q06831.

Family and domain databases

Gene3D: 1.10.30.10. 1 hit.
InterPro: IPR009071. HMG_box_dom.
IPR017386. SOX-12/11/4a.
Pfam: PF00505. HMG_box. 1 hit.
PIRSF: PIRSF038098. SOX-12/11/4a. 1 hit.
SMART: SM00398. HMG. 1 hit.
SUPFAM: SSF47095. SSF47095. 1 hit.
PROSITE: PS50118. HMG_BOX_2. 1 hit.
ProtoNet: Search...

Other

ChiTaRS: SOX4. mouse.
NextBio: 299169.
PRO: Q06831.
SOURCE: Search...

Entry information

Entry name: SOX4_MOUSE
Accession: Primary (citable) accession number: Q06831
Secondary accession number(s): Q5SW95
Entry history:
Integrated into UniProtKB/Swiss-Prot: June 1, 1994
Last sequence update: July 27, 2011
Last modified: July 9, 2014
This is version 118 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot