Skip Header

Contribute Send feedback
Read comments (?) or add your own

P48431 (SOX2_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 111. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (4) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Web links·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Transcription factor SOX-2
Gene names
Name:SOX2
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length317 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Transcription factor that forms a trimeric complex with OCT4 on DNA and controls the expression of a number of genes involved in embryonic development such as YES1, FGF4, UTF1 and ZFP206 By similarity. Critical for early embryogenesis and for embryonic stem cell pluripotency. May function as a switch in neuronal development. Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation By similarity. Ref.5

Subunit structure

Interacts with ZSCAN10 By similarity. Interacts with SOX3 and FGFR1 By similarity.

Subcellular location

Nucleus.

Post-translational modification

Sumoylation inhibits binding on DNA and negatively regulates the FGF4 transactivation By similarity.

Involvement in disease

Defects in SOX2 are the cause of microphthalmia syndromic type 3 (MCOPS3) [MIM:206900]. Microphthalmia is a clinically heterogeneous disorder of eye formation, ranging from small size of a single eye to complete bilateral absence of ocular tissues (anophthalmia). In many cases, microphthalmia/anophthalmia occurs in association with syndromes that include non-ocular abnormalities. MCOPS3 is characterized by the rare association of malformations including uni- or bilateral anophthalmia or microphthalmia, and esophageal atresia with trachoesophageal fistula. Ref.4

Biotechnological use

POU5F1/OCT4, SOX2, MYC/c-Myc and KLF4 are the four Yamanaka factors. When combined, these factors are sufficient to reprogram differenciated cells to an embryonic-like state designated iPS (induced pluripotent stem) cells. iPS cells exhibit the morphology and growth properties of ES cells and express ES cell marker genes. Ref.5

Sequence similarities

Contains 1 HMG box DNA-binding domain.

Sequence caution

The sequence AAA35997.1 differs from that shown. Reason: Erroneous initiation.

The sequence CAA83435.1 differs from that shown. Reason: Erroneous initiation.

Ontologies

Keywords
   Biological processTranscription
Transcription regulation
   Cellular componentNucleus
   DiseaseMicrophthalmia
   LigandDNA-binding
   Molecular functionActivator
Developmental protein
   PTMIsopeptide bond
Ubl conjugation
   Technical term3D-structure
Complete proteome
Reference proteome
Gene Ontology (GO)
   Biological processcell cycle arrest

Inferred from direct assay. Source: UniProtKB

chromatin organization

Non-traceable author statement Ref.1. Source: UniProtKB

endodermal cell fate specification

Inferred from direct assay. Source: MGI

eye development

Inferred from expression pattern. Source: UniProtKB

glial cell fate commitment

Non-traceable author statement. Source: UniProtKB

inner ear development

Inferred from expression pattern. Source: UniProtKB

negative regulation of canonical Wnt receptor signaling pathway

Inferred from direct assay. Source: UniProtKB

negative regulation of epithelial cell proliferation

Inferred from direct assay. Source: UniProtKB

negative regulation of neuron differentiation

Inferred from sequence or structural similarity. Source: UniProtKB

osteoblast differentiation

Inferred from direct assay. Source: UniProtKB

pituitary gland development

Inferred from expression pattern. Source: UniProtKB

positive regulation of MAPKKK cascade

Inferred from direct assay. Source: UniProtKB

positive regulation of transcription from RNA polymerase II promoter

Inferred from direct assay. Source: UniProtKB

regulation of cysteine-type endopeptidase activity involved in apoptotic process

Inferred from direct assay. Source: UniProtKB

response to growth factor stimulus

Inferred from direct assay. Source: UniProtKB

response to wounding

Inferred from expression pattern. Source: UniProtKB

somatic stem cell maintenance

Inferred from direct assay. Source: UniProtKB

transcription, DNA-dependent

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentcytosol

Inferred from direct assay. Source: UniProtKB

transcription factor complex

Traceable author statement. Source: BHF-UCL

   Molecular functionmiRNA binding

Inferred from direct assay. Source: UniProtKB

protein binding

Inferred from physical interaction. Source: UniProtKB

sequence-specific DNA binding

Inferred from direct assay. Source: MGI

sequence-specific DNA binding transcription factor activity

Inferred from direct assay. Source: UniProtKB

transcription regulatory region DNA binding

Inferred from direct assay. Source: UniProtKB

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 317317Transcription factor SOX-2
PRO_0000048715

Regions

DNA binding41 – 10969HMG box
Compositional bias19 – 235Poly-Gly
Compositional bias27 – 304Poly-Ala

Amino acid modifications

Cross-link245Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO) By similarity

Secondary structure

....... 317
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P48431 [UniParc].

Last modified February 1, 1996. Version 1.
Checksum: EFCCAFE3E3A2B67B

FASTA31734,310
        10         20         30         40         50         60 
MYNMMETELK PPGPQQTSGG GGGNSTAAAA GGNQKNSPDR VKRPMNAFMV WSRGQRRKMA 

        70         80         90        100        110        120 
QENPKMHNSE ISKRLGAEWK LLSETEKRPF IDEAKRLRAL HMKEHPDYKY RPRRKTKTLM 

       130        140        150        160        170        180 
KKDKYTLPGG LLAPGGNSMA SGVGVGAGLG AGVNQRMDSY AHMNGWSNGS YSMMQDQLGY 

       190        200        210        220        230        240 
PQHPGLNAHG AAQMQPMHRY DVSALQYNSM TSSQTYMNGS PTYSMSYSQQ GTPGMALGSM 

       250        260        270        280        290        300 
GSVVKSEASS SPPVVTSSSH SRAPCQAGDL RDMISMYLPG AEVPEPAAPS RLHMSQHYQS 

       310 
GPVPGTAING TLPLSHM 

« Hide

References

« Hide 'large scale' references
[1]"The cDNA sequence and chromosomal location of the human SOX2 gene."
Stevanovic M., Zuffardi O., Collignon J., Lovell-Badge R., Goodfellow P.
Mamm. Genome 5:640-642(1994) [PubMed: 7849401] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
Tissue: Fetal brain.
[2]Sadler L.A., Badzioch M.D., Wagner M., Graves K.A., Swaroop A., Yang-Feng T.L., Zheng K., Daiger S.P.
Submitted (DEC-1992) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
Tissue: Retina.
[3]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Lung.
[4]"Mutations in SOX2 cause anophthalmia."
Fantes J., Ragge N.K., Lynch S.-A., McGill N.I., Collin J.R.O., Howard-Peebles P.N., Hayward C., Vivian A.J., Williamson K., van Heyningen V., FitzPatrick D.R.
Nat. Genet. 33:461-463(2003) [PubMed: 12612584] [Abstract]
Cited for: INVOLVEMENT IN MCOPS3.
[5]"Induction of pluripotent stem cells from adult human fibroblasts by defined factors."
Takahashi K., Tanabe K., Ohnuki M., Narita M., Ichisaka T., Tomoda K., Yamanaka S.
Cell 131:861-872(2007) [PubMed: 18035408] [Abstract]
Cited for: BIOTECHNOLOGY, FUNCTION.
+Additional computationally mapped references.

Web resources

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
Z31560 mRNA. Translation: CAA83435.1. Different initiation.
L07335 mRNA. Translation: AAA35997.1. Different initiation.
BC013923 mRNA. Translation: AAH13923.1.
IPIIPI00009703.
RefSeqNP_003097.1. NM_003106.3.
UniGeneHs.518438.

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
1O4XNMR-B39-121[»]
2LE4NMR-A39-118[»]
ProteinModelPortalP48431.
SMRP48431. Positions 39-117.
ModBaseSearch...

Protein-protein interaction databases

STRINGP48431.

PTM databases

PhosphoSiteP48431.

Polymorphism databases

DMDM1351091.

Proteomic databases

PRIDEP48431.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000325404; ENSP00000323588; ENSG00000181449.
GeneID6657.
KEGGhsa:6657.
UCSCuc003fkx.1. human.

Organism-specific databases

CTD6657.
GeneCardsGC03P181429.
H-InvDBHIX0003891.
HGNCHGNC:11195. SOX2.
HPACAB010648.
MIM184429. gene.
206900. phenotype.
neXtProtNX_P48431.
Orphanet77298. Anophthalmia/microphthalmia - esophageal atresia.
3157. Septo-optic dysplasia.
PharmGKBPA36032.
GenAtlasSearch...

Phylogenomic databases

eggNOGprNOG19854.
GeneTreeENSGT00600000084340.
HOGENOMHBG446398.
HOVERGENHBG105663.
InParanoidP48431.
OMAMSALQYN.
OrthoDBEOG4MPHQV.
PhylomeDBP48431.

Gene expression databases

ArrayExpressP48431.
BgeeP48431.
CleanExHS_SOX2.
GenevestigatorP48431.
GermOnlineENSG00000181449. Homo sapiens.

Family and domain databases

InterProIPR000910. HMG_HMG1/HMG2.
IPR009071. HMG_superfamily.
IPR022097. TF_SOX.
[Graphical view]
Gene3DG3DSA:1.10.30.10. HMG-box. 1 hit.
KOK09267.
PfamPF00505. HMG_box. 1 hit.
PF12336. SOXp. 1 hit.
[Graphical view]
SMARTSM00398. HMG. 1 hit.
[Graphical view]
SUPFAMSSF47095. HMG-box. 1 hit.
PROSITEPS50118. HMG_BOX_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio25951.
SOURCESearch...

Entry information

Entry nameSOX2_HUMAN
AccessionPrimary (citable) accession number: P48431
Secondary accession number(s): Q14537
Entry history
Integrated into UniProtKB/Swiss-Prot: February 1, 1996
Last sequence update: February 1, 1996
Last modified: January 25, 2012
This is version 111 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 3

Human chromosome 3: entries, gene names and cross-references to MIM

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families