Reviewed, UniProtKB/Swiss-Prot P35710 (SOX5_MOUSE)

Last modified October 13, 2009. Version 80.


Names and origin

Protein names
Recommended name: Transcription factor SOX-5
Gene names
Name: Sox5
Synonyms: Sox-5
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus

Protein attributes

Sequence length: 763 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is not processed.
Protein existence: Evidence at protein level.

General annotation (Comments)

Function

Binds specifically to the DNA sequence 5'-AACAAT-3'. Activates transcription of COL2A1 and AGC1 in vitro. Ref.3
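As an illustration of what sequence-specific binding to 5'-AACAAT-3' means in practice, the sketch below scans a DNA string for that motif on both strands. The function names and the example sequence are hypothetical and are not part of the entry; exact string matching is a simplifying assumption and says nothing about binding affinity.

```python
def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an A/C/G/T string."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(dna.upper()))


def find_motif_sites(dna: str, motif: str = "AACAAT"):
    """Return (1-based start, strand) for each exact motif match.

    A '-' strand hit means the reverse complement of the motif was
    found on the given strand at that position.
    """
    dna = dna.upper()
    motif = motif.upper()
    rc_motif = reverse_complement(motif)
    sites = []
    for i in range(len(dna) - len(motif) + 1):
        window = dna[i:i + len(motif)]
        if window == motif:
            sites.append((i + 1, "+"))
        if window == rc_motif:
            sites.append((i + 1, "-"))
    return sites


# Made-up example sequence, for illustration only.
example = "GGAACAATCCATTGTTGG"
print(find_motif_sites(example))  # [(3, '+'), (11, '-')]
```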

Subunit structure

Forms homodimers and heterodimers with SOX6. Ref.3

Subcellular location

Nucleus.

Tissue specificity

Isoform 1 is found in the embryo and in adult testis. Isoform 2 is expressed in chondrocytes and, to a lesser extent, in brain. Isoform 3 is testis-specific. Ref.3 Ref.1 Ref.2

Developmental stage

Expressed during spermatogenesis.

Sequence similarities

Contains 1 HMG box DNA-binding domain.

Alternative products

This entry describes 3 isoforms produced by alternative splicing.
Isoform 1 (identifier: P35710-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: P35710-2)

Also known as: L-Sox5.

The sequence of this isoform differs from the canonical sequence as follows:
     56-90: Missing.
     340-388: Missing.
Isoform 3 (identifier: P35710-3)

The sequence of this isoform differs from the canonical sequence as follows:
     1-322: Missing.
     340-388: Missing.
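The "Missing" ranges above are 1-based and inclusive relative to the canonical sequence (isoform 1, 763 AA). A minimal sketch of how an isoform sequence could be derived from the canonical one is shown below. The helper name `remove_ranges` is hypothetical, and a placeholder string stands in for the real sequence so that only the resulting lengths are checked.

```python
def remove_ranges(seq: str, missing):
    """Delete 1-based inclusive (start, end) ranges from seq.

    Ranges are assumed not to overlap.
    """
    kept = []
    prev_end = 0  # 0-based index just past the last deleted range
    for start, end in sorted(missing):
        kept.append(seq[prev_end:start - 1])
        prev_end = end
    kept.append(seq[prev_end:])
    return "".join(kept)


CANONICAL_LENGTH = 763  # isoform 1 (P35710-1)

# Ranges copied from the isoform descriptions in this entry.
ISOFORM_MISSING = {
    "P35710-2": [(56, 90), (340, 388)],
    "P35710-3": [(1, 322), (340, 388)],
}

# Placeholder residues; substitute the canonical sequence to get real isoforms.
placeholder = "X" * CANONICAL_LENGTH

for isoform, ranges in ISOFORM_MISSING.items():
    print(isoform, len(remove_ranges(placeholder, ranges)))
# P35710-2 679
# P35710-3 392
```

The resulting lengths agree with the isoform lengths listed in the Sequences section (679 and 392 AA).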

Sequence annotation (Features)

Feature key           Position(s)  Length  Description                           Feature identifier

Molecule processing

Chain                 1 – 763      763     Transcription factor SOX-5            PRO_0000048727

Regions

DNA binding           556 – 624    69      HMG box
Coiled coil           193 – 274    82      Potential
Coiled coil           448 – 515    68      Potential

Amino acid modifications

Modified residue      108          1       Phosphothreonine Ref.5
Modified residue      414          1       Phosphoserine Ref.5

Natural variations

Alternative sequence  1 – 322      322     Missing in isoform 3.                 VSP_007265
Alternative sequence  56 – 90      35      Missing in isoform 2.                 VSP_007266
Alternative sequence  340 – 388    49      Missing in isoform 2 and isoform 3.   VSP_007267

Experimental info

Sequence conflict     102          1       S → A in CAA09269. Ref.3
Sequence conflict     679          1       S → G in CAA46608. Ref.1
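Feature positions in this entry are 1-based and inclusive, so a feature's length is end - start + 1. The sketch below checks that rule against the ranged features listed above and shows how a position pair could be converted to a 0-based Python slice. The `Feature` type and the helper names are hypothetical.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    key: str
    start: int   # 1-based, inclusive
    end: int     # 1-based, inclusive
    length: int  # length as listed in the entry


# Ranged features copied from the feature table of this entry.
FEATURES = [
    Feature("Chain", 1, 763, 763),
    Feature("DNA binding", 556, 624, 69),
    Feature("Coiled coil", 193, 274, 82),
    Feature("Coiled coil", 448, 515, 68),
    Feature("Alternative sequence", 1, 322, 322),
    Feature("Alternative sequence", 56, 90, 35),
    Feature("Alternative sequence", 340, 388, 49),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive range."""
    return end - start + 1


def to_slice(start: int, end: int) -> slice:
    """Convert a 1-based inclusive range to a 0-based Python slice."""
    return slice(start - 1, end)


for feature in FEATURES:
    # Every listed length should match the positions.
    assert span_length(feature.start, feature.end) == feature.length

print(to_slice(556, 624))  # slice(555, 624, None)
```

With the canonical sequence held in a string `seq`, `seq[to_slice(556, 624)]` would give the 69 residues of the HMG box.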

Secondary structure

[Graphical view of secondary structure along the 763-residue sequence; legend: Helix, Strand, Turn.]

Sequences

Isoform 1.

Last modified April 23, 2003. Version 2.
Checksum: E38B1470ECCB1DE7
Length: 763 AA. Mass: 84,090 Da.

        10         20         30         40         50         60 
MLTDPDLPQE FERMSSKRPA SPYGETDGEV AMVTSRQKVE EEESERLPAF HLPLHVSFPN 

        70         80         90        100        110        120 
KPHSEEFQPV SLLTQETCGP RTPTVQHNTM EVDGNKVMSS LSPYNSSTSP QKAEEGGRQS 

       130        140        150        160        170        180 
GESVSSAALG TPERRKGSLA DVVDTLKQRK MEELIKNEPE DTPSIEKLLS KDWKDKLLAM 

       190        200        210        220        230        240 
GSGNFGEIKG TPESLAEKER QLMGMINQLT SLREQLLAAH DEQKKLAASQ IEKQRQQMEL 

       250        260        270        280        290        300 
AKQQQEQIAR QQQQLLQQQH KINLLQQQIQ VQGQLPPLMI PVFPPDQRTL AAAAQQGFLL 

       310        320        330        340        350        360 
PPGFSYKAGC SDPYPVQLIP TTMAAAAAAT PGLGPLQLQQ FYAAQLAAMQ VSPGGKLLGL 

       370        380        390        400        410        420 
PQGNLGAAVS PTSIHTDKST NSPPPKSKDE VAQPLNLSAK PKTSDGKSPA SPTSPHMPAL 

       430        440        450        460        470        480 
RINSGAGPLK ASVPAALASP SARVSTIGYL NDHDAVTKAI QEARQMKEQL RREQQALDGK 

       490        500        510        520        530        540 
VAVVNSIGLS NCRTEKEKTT LESLTQQLAV KQNEEGKFSH GMMDFNMSGD SDGSAGVSES 

       550        560        570        580        590        600 
RIYRESRGRG SNEPHIKRPM NAFMVWAKDE RRKILQAFPD MHNSNISKIL GSRWKAMTNL 

       610        620        630        640        650        660 
EKQPYYEEQA RLSKQHLEKY PDYKYKPRPK RTCLVDGKKL RIGEYKAIMR NRRQEMRQYF 

       670        680        690        700        710        720 
NVGQQAQIPI ATAGVVYPSA IAMAGMPSPH LPSEHSSVSS SPEPGMPVIQ STYGAKGEEP 

       730        740        750        760 
HIKEEIQAED INGEIYEEYD EEEEDPDVDY GSDSENHIAG QAN 
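The sequence is displayed in blocks of 10 residues under a position ruler. A minimal sketch of turning that display back into a plain string is shown below. The function name is hypothetical, and only the first two rows (residues 1 to 120) are embedded to keep the example short. The parsed string is then used to look up two positions from the feature table.

```python
def parse_sequence_block(text: str) -> str:
    """Join residue groups from a ruler-formatted sequence display.

    Tokens made only of letters are treated as residues; the numeric
    ruler tokens are skipped.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isalpha():
                residues.append(token.upper())
    return "".join(residues)


# First two rows of the isoform 1 display (residues 1-120), copied from above.
BLOCK = """\
        10         20         30         40         50         60
MLTDPDLPQE FERMSSKRPA SPYGETDGEV AMVTSRQKVE EEESERLPAF HLPLHVSFPN

        70         80         90        100        110        120
KPHSEEFQPV SLLTQETCGP RTPTVQHNTM EVDGNKVMSS LSPYNSSTSP QKAEEGGRQS
"""

seq = parse_sequence_block(BLOCK)
print(len(seq))      # 120
print(seq[108 - 1])  # T (phosphothreonine site at position 108)
print(seq[102 - 1])  # S (sequence conflict S -> A at position 102)
```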


Isoform 2 (L-Sox5).

Checksum: BD550F7328EE9A1B
Length: 679 AA. Mass: 75,225 Da.

Isoform 3.

Checksum: BF6A679800F916C8
Length: 392 AA. Mass: 43,234 Da.

References

[1] "An SRY-related gene expressed during spermatogenesis in the mouse encodes a sequence-specific DNA-binding protein."
Denny P., Swift S., Connor F., Ashworth A.
EMBO J. 11:3705-3712(1992) [PubMed: 1396566]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 3), TISSUE SPECIFICITY.
Tissue: Brain and Testis.

[2] "The mouse Sox5 gene encodes a protein containing the leucine zipper and the Q box."
Hiraoka Y., Ogawa M., Sakai Y., Kido S., Aiso S.
Biochim. Biophys. Acta 1399:40-46(1998) [PubMed: 9714725]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), TISSUE SPECIFICITY.
Tissue: Embryo.

[3] "A new long form of Sox5 (L-Sox5), Sox6 and Sox9 are coexpressed in chondrogenesis and cooperatively activate the type II collagen gene."
Lefebvre V., Li P., de Crombrugghe B.
EMBO J. 17:5718-5733(1998) [PubMed: 9755172]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 2), FUNCTION, SUBUNIT, TISSUE SPECIFICITY.

[4] "DNA binding and bending properties of the post-meiotically expressed Sry-related protein Sox-5."
Connor F., O'Cary P.D., Read C.M., Preston N.S., Driscoll P.C., Denny P., Crane-Robinson C., Ashworth A.
Nucleic Acids Res. 22:3339-3346(1994) [PubMed: 8078769]
Cited for: CHARACTERIZATION.

[5] "Large-scale phosphorylation analysis of mouse liver."
Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007) [PubMed: 17242355]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-108 AND SER-414, MASS SPECTROMETRY.
Tissue: Liver.

Cross-references

Sequence databases

X65657 mRNA. Translation: CAA46608.1.
X65658 mRNA. Translation: CAA46609.1.
AB006330 mRNA. Translation: BAA32567.1.
AJ010604 mRNA. Translation: CAA09269.1.
IPI: IPI00128018.
     IPI00466410.
     IPI00467353.
PIR: S25195.
RefSeq: NP_001107031.1.
        NP_035574.2.
UniGene: Mm.1752

3D structure databases

Entry  Method  Resolution (Å)  Chain  Positions
1I11   NMR     -               A      554-632

Protein-protein interaction databases

STRING: P35710.

PTM databases

PhosphoSite: P35710.

Proteomic databases

PRIDE: P35710.

Genome annotation databases

Ensembl: ENSMUST00000038815; ENSMUSP00000047567; ENSMUSG00000041540; Mus musculus.
         ENSMUST00000077160; ENSMUSP00000076403; ENSMUSG00000041540; Mus musculus.
GeneID: 20678.
KEGG: mmu:20678.
UCSC: uc009eqk.1. mouse.
      uc009eqm.1. mouse.

Organism-specific databases

MGI: MGI:98367. Sox5.

Phylogenomic databases

HOGENOM: P35710.
HOVERGEN: P35710.

Gene expression databases

ArrayExpress: P35710.
Bgee: P35710.
CleanEx: MM_SOX5.
Genevestigator: P35710.
GermOnline: ENSMUSG00000041540. Mus musculus.

Family and domain databases

InterPro: IPR000910. HMG_HMG1/HMG2.
Gene3D: G3DSA:1.10.30.10. HMG-box. 1 hit.
Pfam: PF00505. HMG_box. 1 hit.
SMART: SM00398. HMG. 1 hit.
PROSITE: PS50118. HMG_BOX_2. 1 hit.

Entry information

Entry name: SOX5_MOUSE
Accession
Primary (citable) accession number: P35710
Secondary accession number(s): O88184, O89018
Entry history
Integrated into UniProtKB/Swiss-Prot: June 1, 1994
Last sequence update: April 23, 2003
Last modified: October 13, 2009
This is version 80 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: HPI (Human Proteome Initiative)

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families