O00570 (SOX1_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 108.

Names and origin

Protein names: Recommended name: Transcription factor SOX-1
Gene names: Name: SOX1
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 391 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Transcriptional activator. May function as a switch in neuronal development. Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation (by similarity).

Subcellular location

Nucleus (probable).

Tissue specificity

Mainly expressed in the developing central nervous system.

Sequence similarities

Contains 1 HMG box DNA-binding domain.

Binary interactions

With   Entry   #Exp.  IntAct                    Notes
STAT3  P40763  2      EBI-2935583, EBI-518675

Sequence annotation (Features)

Feature key         Position(s)  Length  Description                 Feature identifier

Molecule processing

Chain               1 – 391      391     Transcription factor SOX-1  PRO_0000048712

Regions

DNA binding         51 – 119     69      HMG box
Compositional bias  27 – 43      17      Poly-Gly
Compositional bias  145 – 150    6       Poly-Gly
Compositional bias  197 – 204    8       Poly-Ala
Compositional bias  280 – 288    9       Poly-Ala
Compositional bias  296 – 306    11      Poly-Ala
Compositional bias  357 – 364    8       Poly-Ala

Experimental info

Sequence conflict   165          1       A → P in CAA73847. Ref.1
Sequence conflict   180          1       G → A in CAA73847. Ref.1
Sequence conflict   226 – 227    2       AH → RT in CAA73847. Ref.1
Sequence conflict   287 – 290    4       Missing in CAA73847. Ref.1
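
The positions in the feature table above are 1-based and inclusive, so each stated length should equal end minus start plus one. The following is a minimal sketch of that consistency check, not part of the entry itself. The rows are transcribed by hand from the table, and the names FEATURES, span_length and inconsistent_rows are hypothetical helpers introduced only for illustration.

```python
# Rows transcribed from the feature table: (feature key, start, end, stated length).
# Positions are 1-based and inclusive, as in the entry.
FEATURES = [
    ("Chain", 1, 391, 391),
    ("DNA binding", 51, 119, 69),
    ("Compositional bias", 27, 43, 17),
    ("Compositional bias", 145, 150, 6),
    ("Compositional bias", 197, 204, 8),
    ("Compositional bias", 280, 288, 9),
    ("Compositional bias", 296, 306, 11),
    ("Compositional bias", 357, 364, 8),
    ("Sequence conflict", 165, 165, 1),
    ("Sequence conflict", 180, 180, 1),
    ("Sequence conflict", 226, 227, 2),
    ("Sequence conflict", 287, 290, 4),
]


def span_length(start, end):
    """Return the number of residues in a 1-based, inclusive range."""
    return end - start + 1


def inconsistent_rows(features):
    """Return the rows whose stated length disagrees with their positions."""
    return [row for row in features if span_length(row[1], row[2]) != row[3]]


if __name__ == "__main__":
    bad = inconsistent_rows(FEATURES)
    print("inconsistent rows:", bad)
```

A check of this kind is mostly useful after a table has been re-typed or extracted from a page, where a fused column such as "51 – 11969" can be split in the wrong place.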

Sequences

Sequence: O00570 [UniParc].
Length: 391
Mass (Da): 39,023

Last modified September 23, 2008. Version 2.
Checksum: DD519BA97CF5E052
        10         20         30         40         50         60 
MYSMMMETDL HSPGGAQAPT NLSGPAGAGG GGGGGGGGGG GGGAKANQDR VKRPMNAFMV 

        70         80         90        100        110        120 
WSRGQRRKMA QENPKMHNSE ISKRLGAEWK VMSEAEKRPF IDEAKRLRAL HMKEHPDYKY 

       130        140        150        160        170        180 
RPRRKTKTLL KKDKYSLAGG LLAAGAGGGG AAVAMGVGVG VGAAAVGQRL ESPGGAAGGG 

       190        200        210        220        230        240 
YAHVNGWANG AYPGSVAAAA AAAAMMQEAQ LAYGQHPGAG GAHPHAHPAH PHPHHPHAHP 

       250        260        270        280        290        300 
HNPQPMHRYD MGALQYSPIS NSQGYMSASP SGYGGLPYGA AAAAAAAAGG AHQNSAVAAA 

       310        320        330        340        350        360 
AAAAAASSGA LGALGSLVKS EPSGSPPAPA HSRAPCPGDL REMISMYLPA GEGGDPAAAA 

       370        380        390 
AAAAQSRLHS LPQHYQGAGA GVNGTVPLTH I 
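
The listing above interleaves position rulers with blocks of ten residues. As a rough sketch (an illustration, not a UniProt tool), the listing can be collapsed into a plain residue string and then sliced with the 1-based, inclusive positions used in the feature table. The names LISTING, parse_listing and feature are hypothetical and introduced only for this example; the listing text is copied from this entry.

```python
import re

# Sequence listing copied from the entry, rulers included.
LISTING = """
        10         20         30         40         50         60
MYSMMMETDL HSPGGAQAPT NLSGPAGAGG GGGGGGGGGG GGGAKANQDR VKRPMNAFMV

        70         80         90        100        110        120
WSRGQRRKMA QENPKMHNSE ISKRLGAEWK VMSEAEKRPF IDEAKRLRAL HMKEHPDYKY

       130        140        150        160        170        180
RPRRKTKTLL KKDKYSLAGG LLAAGAGGGG AAVAMGVGVG VGAAAVGQRL ESPGGAAGGG

       190        200        210        220        230        240
YAHVNGWANG AYPGSVAAAA AAAAMMQEAQ LAYGQHPGAG GAHPHAHPAH PHPHHPHAHP

       250        260        270        280        290        300
HNPQPMHRYD MGALQYSPIS NSQGYMSASP SGYGGLPYGA AAAAAAAAGG AHQNSAVAAA

       310        320        330        340        350        360
AAAAAASSGA LGALGSLVKS EPSGSPPAPA HSRAPCPGDL REMISMYLPA GEGGDPAAAA

       370        380        390
AAAAQSRLHS LPQHYQGAGA GVNGTVPLTH I
"""


def parse_listing(text):
    """Drop blank lines and ruler lines (digits only), then join the residues."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        residues.append(stripped.replace(" ", ""))
    return "".join(residues)


def feature(seq, start, end):
    """Return the residues for a 1-based, inclusive position range."""
    return seq[start - 1:end]


SEQ = parse_listing(LISTING)

if __name__ == "__main__":
    print("length:", len(SEQ))
    print("HMG box (51-119):", feature(SEQ, 51, 119))
    print("conflict site 226-227:", feature(SEQ, 226, 227))
```

Slicing with start - 1 converts the entry's 1-based coordinates to Python's 0-based indexing while keeping the end position inclusive.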

References

[1] "Cloning and mapping of the human SOX1: a highly conserved gene expressed in the developing brain."
Malas S., Duthie S.M., Mohri F., Lovell-Badge R., Episkopou V.
Mamm. Genome 8:866-868(1997)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[2] "The DNA sequence and analysis of human chromosome 13."
Dunham A., Matthews L.H., Burton J., Ashurst J.L., Howe K.L., Ashcroft K.J., Beare D.M., Burford D.C., Hunt S.E., Griffiths-Jones S., Jones M.C., Keenan S.J., Oliver K., Scott C.E., Ainscough R., Almeida J.P., Ambrose K.D., Andrews D.T., Ashwell R.I.S., Babbage A.K., Bagguley C.L., Bailey J., Bannerjee R., Barlow K.F., Bates K., Beasley H., Bird C.P., Bray-Allen S., Brown A.J., Brown J.Y., Burrill W., Carder C., Carter N.P., Chapman J.C., Clamp M.E., Clark S.Y., Clarke G., Clee C.M., Clegg S.C., Cobley V., Collins J.E., Corby N., Coville G.J., Deloukas P., Dhami P., Dunham I., Dunn M., Earthrowl M.E., Ellington A.G., Faulkner L., Frankish A.G., Frankland J., French L., Garner P., Garnett J., Gilbert J.G.R., Gilson C.J., Ghori J., Grafham D.V., Gribble S.M., Griffiths C., Hall R.E., Hammond S., Harley J.L., Hart E.A., Heath P.D., Howden P.J., Huckle E.J., Hunt P.J., Hunt A.R., Johnson C., Johnson D., Kay M., Kimberley A.M., King A., Laird G.K., Langford C.J., Lawlor S., Leongamornlert D.A., Lloyd D.M., Lloyd C., Loveland J.E., Lovell J., Martin S., Mashreghi-Mohammadi M., McLaren S.J., McMurray A., Milne S., Moore M.J.F., Nickerson T., Palmer S.A., Pearce A.V., Peck A.I., Pelan S., Phillimore B., Porter K.M., Rice C.M., Searle S., Sehra H.K., Shownkeen R., Skuce C.D., Smith M., Steward C.A., Sycamore N., Tester J., Thomas D.W., Tracey A., Tromans A., Tubby B., Wall M., Wallis J.M., West A.P., Whitehead S.L., Willey D.L., Wilming L., Wray P.W., Wright M.W., Young L., Coulson A., Durbin R.M., Hubbard T., Sulston J.E., Beck S., Bentley D.R., Rogers J., Ross M.T.
Nature 428:522-528(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[3] Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., Venter J.C.
Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL, GenBank, DDBJ:
    Y13436 Genomic DNA. Translation: CAA73847.1.
    AL138691 Genomic DNA. Translation: CAH72340.1.
    CH471085 Genomic DNA. Translation: EAX09158.1.
CCDS: CCDS9523.1.
RefSeq: NP_005977.2. NM_005986.2.
UniGene: Hs.202526.

3D structure databases

ProteinModelPortal: O00570.
SMR: O00570. Positions 49-127.

Protein-protein interaction databases

IntAct: O00570. 1 interaction.
STRING: 9606.ENSP00000330218.

PTM databases

PhosphoSite: O00570.

Proteomic databases

PaxDb: O00570.
PRIDE: O00570.

Genome annotation databases

Ensembl: ENST00000330949; ENSP00000330218; ENSG00000182968.
GeneID: 6656.
KEGG: hsa:6656.
UCSC: uc001vsb.1. human.

Organism-specific databases

CTD: 6656.
GeneCards: GC13P112721.
HGNC: HGNC:11189. SOX1.
MIM: 602148. gene.
neXtProt: NX_O00570.
PharmGKB: PA36026.

Phylogenomic databases

eggNOG: NOG321816.
HOGENOM: HOG000231647.
HOVERGEN: HBG105663.
InParanoid: O00570.
KO: K09267.
OMA: HSRGPCP.
OrthoDB: EOG7TMZVP.
PhylomeDB: O00570.
TreeFam: TF351735.

Gene expression databases

Bgee: O00570.
CleanEx: HS_SOX1.
Genevestigator: O00570.

Family and domain databases

Gene3D: 1.10.30.10. 1 hit.
InterPro:
    IPR009071. HMG_box_dom.
    IPR022097. TF_SOX.
Pfam:
    PF00505. HMG_box. 1 hit.
    PF12336. SOXp. 1 hit.
SMART: SM00398. HMG. 1 hit.
SUPFAM: SSF47095. SSF47095. 1 hit.
PROSITE: PS50118. HMG_BOX_2. 1 hit.

Other

GenomeRNAi: 6656.
NextBio: 25947.
PRO: O00570.

Entry information

Entry name: SOX1_HUMAN
Accession:
    Primary (citable) accession number: O00570
    Secondary accession number(s): Q5W0Q1
Entry history:
    Integrated into UniProtKB/Swiss-Prot: November 1, 1997
    Last sequence update: September 23, 2008
    Last modified: July 9, 2014
    This is version 108 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments: Index of protein domains and families

MIM cross-references: Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

Human chromosome 13: entries, gene names and cross-references to MIM