Protein

Transcription factor SOX-4

Gene

SOX4

Organism
Homo sapiens (Human)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Transcriptional activator that binds with high affinity to the T-cell enhancer motif 5'-AACAAAG-3'.
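
As a rough illustration of what the motif above means in practice, the following minimal sketch scans a DNA string for exact matches to 5'-AACAAAG-3' on both strands. The function names and the 0-based start positions are assumptions made for this example, and exact string matching is a simplification: the entry says nothing about tolerated mismatches or binding affinity.

```python
# Illustrative sketch only: exact-match scan for the SOX-4 motif quoted in this entry.
# Function names and 0-based coordinates are assumptions, not part of the entry.
MOTIF = "AACAAAG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase or lowercase DNA string."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def find_motif_sites(dna: str, motif: str = MOTIF) -> list:
    """Return sorted (start, strand) pairs for exact motif matches.

    start is 0-based on the given (forward) string; strand '-' means the
    reverse complement of the motif was found at that position.
    """
    dna = dna.upper()
    sites = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = dna.find(pattern)
        while start != -1:
            sites.append((start, strand))
            start = dna.find(pattern, start + 1)  # allow overlapping matches
    return sorted(sites)


forward_hits = find_motif_sites("GGAACAAAGTT")
reverse_hits = find_motif_sites("ccCTTTGTTaa")
```

Reporting the strand alongside the position keeps the two kinds of match distinguishable without a second pass over the sequence.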

Regions

Feature key    Position(s)    Length    Description
DNA binding    59 – 127       69        HMG box (PROSITE-ProRule annotation)

GO - Molecular function

  1. core promoter sequence-specific DNA binding Source: UniProtKB
  2. nucleic acid binding transcription factor activity Source: UniProtKB
  3. RNA polymerase II core promoter proximal region sequence-specific DNA binding transcription factor activity involved in positive regulation of transcription Source: UniProtKB
  4. RNA polymerase II transcription coactivator activity Source: UniProtKB
  5. sequence-specific DNA binding transcription factor activity Source: BHF-UCL
  6. transcription regulatory region sequence-specific DNA binding Source: UniProtKB

GO - Biological process

  1. ascending aorta morphogenesis Source: BHF-UCL
  2. atrial septum primum morphogenesis Source: BHF-UCL
  3. canonical Wnt signaling pathway Source: BHF-UCL
  4. cardiac right ventricle morphogenesis Source: BHF-UCL
  5. cardiac ventricle formation Source: UniProtKB
  6. cellular response to glucose stimulus Source: UniProtKB
  7. DNA damage response, detection of DNA damage Source: UniProtKB
  8. DNA damage response, signal transduction by p53 class mediator resulting in cell cycle arrest Source: UniProtKB
  9. endocrine pancreas development Source: Ensembl
  10. glial cell development Source: UniProtKB
  11. glial cell proliferation Source: UniProtKB
  12. glucose homeostasis Source: UniProtKB
  13. heart development Source: BHF-UCL
  14. kidney morphogenesis Source: BHF-UCL
  15. limb bud formation Source: UniProtKB
  16. mitral valve morphogenesis Source: BHF-UCL
  17. negative regulation of cell death Source: UniProtKB
  18. negative regulation of cell proliferation Source: UniProtKB
  19. negative regulation of protein export from nucleus Source: UniProtKB
  20. negative regulation of protein ubiquitination Source: UniProtKB
  21. neural tube formation Source: UniProtKB
  22. neuroepithelial cell differentiation Source: UniProtKB
  23. noradrenergic neuron differentiation Source: UniProtKB
  24. positive regulation of apoptotic process Source: UniProtKB
  25. positive regulation of canonical Wnt signaling pathway Source: UniProtKB
  26. positive regulation of cell proliferation Source: BHF-UCL
  27. positive regulation of insulin secretion Source: UniProtKB
  28. positive regulation of N-terminal peptidyl-lysine acetylation Source: UniProtKB
  29. positive regulation of transcription, DNA-templated Source: UniProtKB
  30. positive regulation of transcription from RNA polymerase II promoter Source: UniProtKB
  31. positive regulation of translation Source: UniProtKB
  32. positive regulation of Wnt signaling pathway Source: BHF-UCL
  33. pro-B cell differentiation Source: BHF-UCL
  34. protein stabilization Source: BHF-UCL
  35. regulation of protein stability Source: UniProtKB
  36. regulation of transcription, DNA-templated Source: BHF-UCL
  37. skeletal system development Source: UniProtKB
  38. somatic stem cell maintenance Source: Ensembl
  39. spinal cord development Source: UniProtKB
  40. spinal cord motor neuron differentiation Source: UniProtKB
  41. sympathetic nervous system development Source: UniProtKB
  42. T cell differentiation Source: BHF-UCL
  43. ventricular septum morphogenesis Source: BHF-UCL

Keywords - Molecular function

Activator

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name: Transcription factor SOX-4
Gene names
Name: SOX4
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes: UP000005640: Chromosome 6

Organism-specific databases

HGNC: HGNC:11200. SOX4.

Subcellular location

Nucleus (PROSITE-ProRule annotation; 1 publication)

GO - Cellular component

  1. cytoplasm Source: BHF-UCL
  2. mitochondrion Source: HPA
  3. nuclear transcription factor complex Source: Ensembl
  4. nucleoplasm Source: HPA
  5. nucleus Source: UniProtKB

Keywords - Cellular component

Nucleus

Pathology & Biotech

Organism-specific databases

PharmGKB: PA36037.

PTM / Processing

Molecule processing

Feature key    Position(s)    Length    Description                   Feature identifier
Chain          1 – 474        474       Transcription factor SOX-4    PRO_0000048724

Proteomic databases

MaxQB: Q06945.
PaxDb: Q06945.
PRIDE: Q06945.

PTM databases

PhosphoSite: Q06945.

Expression

Tissue specificity

Testis, brain, and heart.

Gene expression databases

Bgee: Q06945.
CleanEx: HS_SOX4.
Genevestigator: Q06945.

Organism-specific databases

HPA: HPA029901.

Interaction

Subunit structure

Interacts with UBE2I (1 publication).

Protein-protein interaction databases

BioGrid: 112542. 8 interactions.
DIP: DIP-58650N.
IntAct: Q06945. 1 interaction.
STRING: 9606.ENSP00000244745.

Structure

3D structure databases

ProteinModelPortal: Q06945.
SMR: Q06945. Positions 59-163.

Family & Domains

Compositional bias

Feature key           Position(s)    Length    Description
Compositional bias    386 – 397      12        Poly-Ser

Sequence similarities

Contains 1 HMG box DNA-binding domain (PROSITE-ProRule annotation).

Phylogenomic databases

eggNOG: NOG130659.
GeneTree: ENSGT00760000118988.
HOGENOM: HOG000231874.
HOVERGEN: HBG005040.
InParanoid: Q06945.
KO: K09268.
OMA: HRSYTSL.
OrthoDB: EOG7TMZVP.
PhylomeDB: Q06945.

Family and domain databases

Gene3D: 1.10.30.10. 1 hit.
InterPro: IPR009071. HMG_box_dom.
IPR017386. SOX-12/11/4a.
Pfam: PF00505. HMG_box. 1 hit.
PIRSF: PIRSF038098. SOX-12/11/4a. 1 hit.
SMART: SM00398. HMG. 1 hit.
SUPFAM: SSF47095. SSF47095. 1 hit.
PROSITE: PS50118. HMG_BOX_2. 1 hit.

Sequence

Sequence status: Complete.

Q06945-1

        10         20         30         40         50
MVQQTNNAEN TEALLAGESS DSGAGLELGI ASSPTPGSTA STGGKADDPS
        60         70         80         90        100
WCKTPSGHIK RPMNAFMVWS QIERRKIMEQ SPDMHNAEIS KRLGKRWKLL
       110        120        130        140        150
KDSDKIPFIR EAERLRLKHM ADYPDYKYRP RKKVKSGNAN SSSSAAASSK
       160        170        180        190        200
PGEKGDKVGG SGGGGHGGGG GGGSSNAGGG GGGASGGGAN SKPAQKKSCG
       210        220        230        240        250
SKVAGGAGGG VSKPHAKLIL AGGGGGGKAA AAAAASFAAE QAGAAALLPL
       260        270        280        290        300
GAAADHHSLY KARTPSASAS ASSAASASAA LAAPGKHLAE KKVKRVYLFG
       310        320        330        340        350
GLGTSSSPVG GVGAGADPSD PLGLYEEEGA GCSPDAPSLS GRSSAASSPA
       360        370        380        390        400
AGRSPADHRG YASLRAASPA PSSAPSHASS SASSHSSSSS SSGSSSSDDE
       410        420        430        440        450
FEDDLLDLNP SSNFESMSLG SFSSSSALDR DLDFNFEPGS GSHFEFPDYC
       460        470
TPEVSEMISG DWLESSISNL VFTY
Length: 474
Mass (Da): 47,263
Last modified: June 1, 1994 - v1
Checksum: CB75B76ABF507A07
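
The feature positions quoted in this entry (HMG box 59 – 127, Poly-Ser 386 – 397) are 1-based and inclusive, which is easy to get wrong when slicing in code. The sketch below is one possible way to turn the formatted sequence block into a plain string and extract features by those coordinates. The helper names `parse_sequence_block` and `feature` are hypothetical, introduced only for this example.

```python
# Illustrative sketch: parse the formatted sequence block from this entry and
# slice features using the entry's 1-based, inclusive coordinates.
# Helper names are hypothetical, not part of any UniProt tooling.
SOX4_BLOCK = """
        10         20         30         40         50
MVQQTNNAEN TEALLAGESS DSGAGLELGI ASSPTPGSTA STGGKADDPS
WCKTPSGHIK RPMNAFMVWS QIERRKIMEQ SPDMHNAEIS KRLGKRWKLL
KDSDKIPFIR EAERLRLKHM ADYPDYKYRP RKKVKSGNAN SSSSAAASSK
PGEKGDKVGG SGGGGHGGGG GGGSSNAGGG GGGASGGGAN SKPAQKKSCG
SKVAGGAGGG VSKPHAKLIL AGGGGGGKAA AAAAASFAAE QAGAAALLPL
GAAADHHSLY KARTPSASAS ASSAASASAA LAAPGKHLAE KKVKRVYLFG
GLGTSSSPVG GVGAGADPSD PLGLYEEEGA GCSPDAPSLS GRSSAASSPA
AGRSPADHRG YASLRAASPA PSSAPSHASS SASSHSSSSS SSGSSSSDDE
FEDDLLDLNP SSNFESMSLG SFSSSSALDR DLDFNFEPGS GSHFEFPDYC
TPEVSEMISG DWLESSISNL VFTY
"""


def parse_sequence_block(block: str) -> str:
    """Keep letters only, dropping position rulers, spaces, and line breaks."""
    return "".join(ch for ch in block if ch.isalpha()).upper()


def feature(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive), as used in the entry."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"positions {start}-{end} fall outside 1-{len(sequence)}")
    return sequence[start - 1:end]


SOX4 = parse_sequence_block(SOX4_BLOCK)
hmg_box = feature(SOX4, 59, 127)     # DNA-binding HMG box
poly_ser = feature(SOX4, 386, 397)   # compositional bias region

print(len(SOX4), len(hmg_box), poly_ser)
```

Checking the parsed length against the stated Length of 474 is a cheap guard against a truncated or mis-copied block.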

Experimental Info

Feature key          Position(s)    Length    Description
Sequence conflict    71 – 71        1         Q → P in CAA46612 (PubMed:1614875). Curated.
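
To make the conflict concrete, the sketch below applies the reported Q → P difference at position 71 to a short excerpt of the canonical sequence (residues 61-80, copied from the Sequence section). The function name `apply_substitution` and the choice of excerpt are assumptions for illustration. The reference-residue check is there so that a coordinate mistake fails loudly instead of silently editing the wrong position.

```python
# Illustrative sketch: apply a single-residue substitution given in the
# entry's 1-based coordinates to a fragment of the sequence.
# apply_substitution is a hypothetical helper, not part of any UniProt tooling.
def apply_substitution(fragment: str, fragment_start: int,
                       position: int, ref: str, alt: str) -> str:
    """Replace `ref` with `alt` at 1-based `position` of the full sequence.

    `fragment_start` is the 1-based position of the fragment's first residue.
    """
    index = position - fragment_start
    if not 0 <= index < len(fragment):
        raise ValueError(f"position {position} is outside the fragment")
    if fragment[index] != ref:
        raise ValueError(
            f"expected {ref} at position {position}, found {fragment[index]}"
        )
    return fragment[:index] + alt + fragment[index + 1:]


# Residues 61-80 of the canonical sequence in this entry.
CANONICAL_61_80 = "RPMNAFMVWSQIERRKIMEQ"

# The residue as reported in CAA46612 according to the conflict annotation.
conflict_61_80 = apply_substitution(CANONICAL_61_80, 61, 71, "Q", "P")
print(conflict_61_80)
```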

Sequence databases

EMBL / GenBank / DDBJ:
X70683 mRNA. Translation: CAA50018.1.
AL136179 Genomic DNA. Translation: CAC03595.1.
BC072668 mRNA. Translation: AAH72668.1.
X65661 mRNA. Translation: CAA46612.1.
CCDS: CCDS4547.1.
PIR: I38240.
RefSeq: NP_003098.1. NM_003107.2.
UniGene: Hs.643910.

Genome annotation databases

Ensembl: ENST00000244745; ENSP00000244745; ENSG00000124766.
GeneID: 6659.
KEGG: hsa:6659.
UCSC: uc003ndi.3. human.

Polymorphism databases

DMDM: 548952.

Cross-references

Web resources

Atlas of Genetics and Cytogenetics in Oncology and Haematology

Protocols and materials databases

DNASU: 6659.

Organism-specific databases

CTD: 6659.
GeneCards: GC06P021593.
HGNC: HGNC:11200. SOX4.
HPA: HPA029901.
MIM: 184430. gene.
neXtProt: NX_Q06945.
PharmGKB: PA36037.

Miscellaneous databases

ChiTaRS: SOX4. human.
GeneWiki: SOX4.
GenomeRNAi: 6659.
NextBio: 25959.
PRO: Q06945.

Publications

  1. Cited for: NUCLEOTIDE SEQUENCE [MRNA].
  2. "The DNA sequence and analysis of human chromosome 6."
    Mungall A.J., Palmer S.A., Sims S.K., Edwards C.A., Ashurst J.L., Wilming L., Jones M.C., Horton R., Hunt S.E., Scott C.E., Gilbert J.G.R., Clamp M.E., Bethel G., Milne S., Ainscough R., Almeida J.P., Ambrose K.D., Andrews T.D., Ashwell R.I.S., Babbage A.K., Bagguley C.L., Bailey J., Banerjee R., Barker D.J., Barlow K.F., Bates K., Beare D.M., Beasley H., Beasley O., Bird C.P., Blakey S.E., Bray-Allen S., Brook J., Brown A.J., Brown J.Y., Burford D.C., Burrill W., Burton J., Carder C., Carter N.P., Chapman J.C., Clark S.Y., Clark G., Clee C.M., Clegg S., Cobley V., Collier R.E., Collins J.E., Colman L.K., Corby N.R., Coville G.J., Culley K.M., Dhami P., Davies J., Dunn M., Earthrowl M.E., Ellington A.E., Evans K.A., Faulkner L., Francis M.D., Frankish A., Frankland J., French L., Garner P., Garnett J., Ghori M.J., Gilby L.M., Gillson C.J., Glithero R.J., Grafham D.V., Grant M., Gribble S., Griffiths C., Griffiths M.N.D., Hall R., Halls K.S., Hammond S., Harley J.L., Hart E.A., Heath P.D., Heathcott R., Holmes S.J., Howden P.J., Howe K.L., Howell G.R., Huckle E., Humphray S.J., Humphries M.D., Hunt A.R., Johnson C.M., Joy A.A., Kay M., Keenan S.J., Kimberley A.M., King A., Laird G.K., Langford C., Lawlor S., Leongamornlert D.A., Leversha M., Lloyd C.R., Lloyd D.M., Loveland J.E., Lovell J., Martin S., Mashreghi-Mohammadi M., Maslen G.L., Matthews L., McCann O.T., McLaren S.J., McLay K., McMurray A., Moore M.J.F., Mullikin J.C., Niblett D., Nickerson T., Novik K.L., Oliver K., Overton-Larty E.K., Parker A., Patel R., Pearce A.V., Peck A.I., Phillimore B.J.C.T., Phillips S., Plumb R.W., Porter K.M., Ramsey Y., Ranby S.A., Rice C.M., Ross M.T., Searle S.M., Sehra H.K., Sheridan E., Skuce C.D., Smith S., Smith M., Spraggon L., Squares S.L., Steward C.A., Sycamore N., Tamlyn-Hall G., Tester J., Theaker A.J., Thomas D.W., Thorpe A., Tracey A., Tromans A., Tubby B., Wall M., Wallis J.M., West A.P., White S.S., Whitehead S.L., Whittaker H., Wild A., Willey D.J., Wilmer T.E., Wood J.M., Wray P.W., Wyatt J.C., Young L., Younger R.M., Bentley D.R., Coulson A., Durbin R.M., Hubbard T., Sulston J.E., Dunham I., Rogers J., Beck S.
    Nature 425:805-811(2003)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  3. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Tissue: Ovary.
  4. "A conserved family of genes related to the testis determining gene, SRY."
    Denny P., Swift S., Brand N., Dabhade N., Barton P., Ashworth A.
    Nucleic Acids Res. 20:2887-2887(1992)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 70-123.
  5. Cited for: INTERACTION WITH UBE2I, SUBCELLULAR LOCATION.

Entry information

Entry name: SOX4_HUMAN
Accession: Primary (citable) accession number: Q06945
Entry history
Integrated into UniProtKB/Swiss-Prot: June 1, 1994
Last sequence update: June 1, 1994
Last modified: February 4, 2015
This is version 124 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 6
    Human chromosome 6: entries, gene names and cross-references to MIM
  2. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
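
The identity and overlap thresholds above can be read as a simple membership test against a cluster's seed sequence. The sketch below shows one way to express that test for two sequences that have already been aligned (gaps written as `-`). How identity and overlap are measured here (identity over aligned columns, overlap as the fraction of seed residues covered) is an assumption made for illustration, and the function names are hypothetical; the entry does not describe how UniRef computes these values.

```python
# Illustrative sketch of the threshold logic only. The identity and overlap
# definitions below are assumptions, not UniRef's documented procedure.
def identity_and_overlap(seed_aln: str, member_aln: str) -> tuple:
    """Return (identity, overlap) for two pre-aligned sequences of equal length.

    identity: fraction of ungapped columns where both residues match.
    overlap:  fraction of seed residues aligned to a member residue.
    """
    if len(seed_aln) != len(member_aln):
        raise ValueError("aligned sequences must have the same length")
    aligned = [(a, b) for a, b in zip(seed_aln, member_aln)
               if a != "-" and b != "-"]
    seed_length = sum(1 for a in seed_aln if a != "-")
    matches = sum(1 for a, b in aligned if a == b)
    identity = matches / len(aligned) if aligned else 0.0
    overlap = len(aligned) / seed_length if seed_length else 0.0
    return identity, overlap


def joins_cluster(seed_aln: str, member_aln: str,
                  min_identity: float, min_overlap: float = 0.8) -> bool:
    """Apply thresholds such as 0.9 (UniRef90) or 0.5 (UniRef50) with 80% overlap."""
    identity, overlap = identity_and_overlap(seed_aln, member_aln)
    return identity >= min_identity and overlap >= min_overlap


# Toy example using the first ten residues of the sequence in this entry.
seed = "MVQQTNNAEN"
print(joins_cluster(seed, "MVQQTNNAE-", 0.9))
print(joins_cluster(seed, "MVQQTNN---", 0.5))
```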