UniProtKB/Swiss-Prot P35710 (SOX5_MOUSE)
Reviewed. Last modified October 13, 2009. Version 80.
Names and origin
| Protein names | Recommended name: Transcription factor SOX-5 |
| Gene names | Sox5 |
| Organism | Mus musculus (Mouse) |
| Taxonomic identifier | 10090 [NCBI] |
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus |
Protein attributes
| Sequence length | 763 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is not processed. |
| Protein existence | Evidence at protein level. |
General annotation (Comments)
| Function | Binds specifically to the DNA sequence 5'-AACAAT-3'. Activates transcription of COL2A1 and AGC1 in vitro. Ref.3 |
| Subunit structure | Forms homodimers and heterodimers with SOX6. Ref.3 |
| Subcellular location | |
| Tissue specificity | Isoform 1 is found in the embryo and in adult testis. Isoform 2 is expressed in chondrocytes and, to a lesser extent, in brain. Isoform 3 is testis-specific. Ref.3 Ref.1 Ref.2 |
| Developmental stage | Expressed during spermatogenesis. |
| Sequence similarities | Contains 1 HMG box DNA-binding domain. |
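The Function comment gives the recognition site as 5'-AACAAT-3'. As a minimal sketch of what exact-match site scanning could look like, the Python below searches both strands of a DNA string for that hexamer. The function names and the example sequence are hypothetical and are not part of the entry; real binding-site prediction would normally tolerate mismatches, which this sketch does not.

```python
# Recognition site from the Function annotation: 5'-AACAAT-3'.
MOTIF = "AACAAT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_sites(dna: str, motif: str = MOTIF) -> list[tuple[int, str]]:
    """Return (1-based start, strand) for every exact occurrence of the motif.

    A '-' hit means the motif lies on the reverse strand; the reported start
    is still the leftmost position on the forward strand.
    """
    dna = dna.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = dna.find(pattern)
        while start != -1:
            hits.append((start + 1, strand))
            start = dna.find(pattern, start + 1)
    return sorted(hits)


# Hypothetical test sequence (an assumption, not taken from the entry).
example = "GGAACAATCCATTGTTGG"
print(find_sites(example))  # [(3, '+'), (11, '-')]
```

Scanning the reverse complement as well matters here because a transcription factor site can sit on either strand of a promoter.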
Alternative products
| This entry describes 3 isoforms produced by alternative splicing. |
| Isoform 1 (identifier: P35710-1). This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. |
| Isoform 2 (identifier: P35710-2). Also known as: L-Sox5. The sequence of this isoform differs from the canonical sequence as follows: 56-90: Missing. 340-388: Missing. |
| Isoform 3 (identifier: P35710-3). The sequence of this isoform differs from the canonical sequence as follows: 1-322: Missing. 340-388: Missing. |
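The isoform descriptions above are expressed as residue ranges missing from the canonical sequence. The following Python sketch, under the assumption that ranges are 1-based and inclusive (as the stated lengths in the feature table suggest), derives each isoform's length and shows how an isoform sequence could be built from a canonical one. The helper names are hypothetical, and the isoform lengths are computed here rather than quoted from the entry.

```python
CANONICAL_LENGTH = 763  # isoform 1 (P35710-1), from "Sequence length"

# Ranges (1-based, inclusive) missing relative to the canonical sequence,
# copied from the isoform descriptions in the entry.
DELETIONS = {
    "P35710-1": [],
    "P35710-2": [(56, 90), (340, 388)],
    "P35710-3": [(1, 322), (340, 388)],
}


def isoform_length(isoform_id: str) -> int:
    """Canonical length minus the total number of deleted residues."""
    removed = sum(end - start + 1 for start, end in DELETIONS[isoform_id])
    return CANONICAL_LENGTH - removed


def apply_deletions(sequence: str, deletions: list[tuple[int, int]]) -> str:
    """Remove the given 1-based inclusive ranges from a canonical sequence."""
    kept = []
    cursor = 0  # 0-based index of the next residue to keep
    for start, end in sorted(deletions):
        kept.append(sequence[cursor:start - 1])
        cursor = end
    kept.append(sequence[cursor:])
    return "".join(kept)


for isoform_id in DELETIONS:
    print(isoform_id, isoform_length(isoform_id))
# P35710-1 763
# P35710-2 679
# P35710-3 392
```

`apply_deletions` expects the canonical amino acid sequence as a plain string; the sequence itself is not reproduced in this entry text, so it would have to be supplied separately.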
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Feature identifier |
|---|---|---|---|---|
| Molecule processing | | | | |
| Chain | 1 – 763 | 763 | Transcription factor SOX-5 | PRO_0000048727 |
| Regions | | | | |
| DNA binding | 556 – 624 | 69 | HMG box | |
| Coiled coil | 193 – 274 | 82 | Potential | |
| Coiled coil | 448 – 515 | 68 | Potential | |
| Amino acid modifications | | | | |
| Modified residue | 108 | 1 | Phosphothreonine Ref.5 | |
| Modified residue | 414 | 1 | Phosphoserine Ref.5 | |
| Natural variations | | | | |
| Alternative sequence | 1 – 322 | 322 | Missing in isoform 3. | VSP_007265 |
| Alternative sequence | 56 – 90 | 35 | Missing in isoform 2. | VSP_007266 |
| Alternative sequence | 340 – 388 | 49 | Missing in isoform 2 and isoform 3. | VSP_007267 |
| Experimental info | | | | |
| Sequence conflict | 102 | 1 | S → A in CAA09269. Ref.3 | |
| Sequence conflict | 679 | 1 | S → G in CAA46608. Ref.1 | |
| Secondary structure | | | | |
| Helix | 562 – 575 | 14 | | |
| Helix | 583 – 594 | 12 | | |
| Helix | 600 – 602 | 3 | | |
| Helix | 603 – 617 | 15 | | |
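All positions in the feature table refer to the canonical isoform 1. To locate a feature in isoform 2 or 3, each position has to be shifted by the number of residues deleted upstream of it. The Python sketch below illustrates that conversion; the function name is hypothetical, ranges are assumed to be 1-based and inclusive, and the isoform coordinates it yields are derived values, not annotations from the entry.

```python
# Ranges (1-based, inclusive) missing relative to the canonical sequence,
# taken from the "Alternative sequence" rows of the feature table.
DELETIONS = {
    "P35710-2": [(56, 90), (340, 388)],
    "P35710-3": [(1, 322), (340, 388)],
}


def to_isoform_position(pos: int, deletions: list[tuple[int, int]]):
    """Map a canonical position to isoform numbering.

    Returns None when the residue is absent from the isoform.
    """
    shift = 0
    for start, end in sorted(deletions):
        if start <= pos <= end:
            return None  # residue lies inside a deleted segment
        if pos > end:
            shift += end - start + 1  # whole segment is upstream of pos
    return pos - shift


# HMG box (canonical 556 - 624) expressed in each isoform's own numbering.
for isoform_id, deletions in DELETIONS.items():
    first = to_isoform_position(556, deletions)
    last = to_isoform_position(624, deletions)
    print(isoform_id, first, last)
# P35710-2 472 540
# P35710-3 185 253

# Thr-108 falls inside the 1 - 322 segment missing from isoform 3.
print(to_isoform_position(108, DELETIONS["P35710-3"]))  # None
```

Both deleted segments lie upstream of the HMG box, so the DNA-binding domain is retained intact in all three isoforms and only its numbering changes.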
References
| [1] | "An SRY-related gene expressed during spermatogenesis in the mouse encodes a sequence-specific DNA-binding protein." Denny P., Swift S., Connor F., Ashworth A. EMBO J. 11:3705-3712(1992) [PubMed: 1396566] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 3), TISSUE SPECIFICITY. Tissue: Brain and Testis. |
| [2] | "The mouse Sox5 gene encodes a protein containing the leucine zipper and the Q box." Hiraoka Y., Ogawa M., Sakai Y., Kido S., Aiso S. Biochim. Biophys. Acta 1399:40-46(1998) [PubMed: 9714725] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), TISSUE SPECIFICITY. Tissue: Embryo. |
| [3] | "A new long form of Sox5 (L-Sox5), Sox6 and Sox9 are coexpressed in chondrogenesis and cooperatively activate the type II collagen gene." Lefebvre V., Li P., de Crombrugghe B. EMBO J. 17:5718-5733(1998) [PubMed: 9755172] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 2), FUNCTION, SUBUNIT, TISSUE SPECIFICITY. |
| [4] | "DNA binding and bending properties of the post-meiotically expressed Sry-related protein Sox-5." Connor F., O'Cary P.D., Read C.M., Preston N.S., Driscoll P.C., Denny P., Crane-Robinson C., Ashworth A. Nucleic Acids Res. 22:3339-3346(1994) [PubMed: 8078769] Cited for: CHARACTERIZATION. |
| [5] | "Large-scale phosphorylation analysis of mouse liver." Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P. Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007) [PubMed: 17242355] Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-108 AND SER-414, MASS SPECTROMETRY. Tissue: Liver. |
Cross-references
Sequence databases | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EMBL | X65657 mRNA. Translation: CAA46608.1. X65658 mRNA. Translation: CAA46609.1. AB006330 mRNA. Translation: BAA32567.1. AJ010604 mRNA. Translation: CAA09269.1. | ||||||||||||
| IPI | IPI00128018. IPI00466410. IPI00467353. | ||||||||||||
| PIR | S25195. | ||||||||||||
| RefSeq | NP_001107031.1. NP_035574.2. | ||||||||||||
| UniGene | Mm.1752 | ||||||||||||
Protein-protein interaction databases | |||||||||||||
| STRING | P35710. | ||||||||||||
PTM databases | |||||||||||||
| PhosphoSite | P35710. | ||||||||||||
Proteomic databases | |||||||||||||
| PRIDE | P35710. | ||||||||||||
Genome annotation databases | |||||||||||||
| Ensembl | ENSMUST00000038815; ENSMUSP00000047567; ENSMUSG00000041540; Mus musculus. ENSMUST00000077160; ENSMUSP00000076403; ENSMUSG00000041540; Mus musculus. | ||||||||||||
| GeneID | 20678. | ||||||||||||
| KEGG | mmu:20678. | ||||||||||||
| UCSC | uc009eqk.1. mouse. uc009eqm.1. mouse. | ||||||||||||
Organism-specific databases | |||||||||||||
| MGI | MGI:98367. Sox5. | ||||||||||||
Phylogenomic databases | |||||||||||||
| HOGENOM | P35710. | ||||||||||||
| HOVERGEN | P35710. | ||||||||||||
Gene expression databases | |||||||||||||
| ArrayExpress | P35710. | ||||||||||||
| Bgee | P35710. | ||||||||||||
| CleanEx | MM_SOX5. | ||||||||||||
| Genevestigator | P35710. | ||||||||||||
| GermOnline | ENSMUSG00000041540. Mus musculus. | ||||||||||||
Family and domain databases | |||||||||||||
| InterPro | IPR000910. HMG_HMG1/HMG2. | ||||||||||||
| Gene3D | G3DSA:1.10.30.10. HMG-box. 1 hit. | ||||||||||||
| Pfam | PF00505. HMG_box. 1 hit. | ||||||||||||
| SMART | SM00398. HMG. 1 hit. | ||||||||||||
| PROSITE | PS50118. HMG_BOX_2. 1 hit. | ||||||||||||
Entry information
| Entry name | SOX5_MOUSE |
| Accession | Primary (citable) accession number: P35710. Secondary accession number(s): O88184, O89018 |
| Entry status | Reviewed (UniProtKB/Swiss-Prot) |
| Annotation project | HPI (Human Proteome Initiative) |
Relevant documents
| MGD cross-references Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot |
| PDB cross-references Index of Protein Data Bank (PDB) cross-references |
| SIMILARITY comments Index of protein domains and families |



