Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

O15370 (SOX12_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 123. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Transcription factor SOX-12
Alternative name(s):
Protein SOX-22
Gene names
Name:SOX12
Synonyms:SOX22
OrganismHomo sapiens (Human) [Reference proteome]
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length315 AA.
Sequence statusComplete.
Protein existenceEvidence at transcript level

General annotation (Comments)

Function

Binds to the sequence 5'-AACAAT-3' By similarity.

Subcellular location

Nucleus Ref.1.

Tissue specificity

Expressed most abundantly in the CNS. Also expressed in fetal brain and kidney and adult heart, pancreas, testis and ovary. Other tissues were only weakly positive. Ref.1

Sequence similarities

Contains 1 HMG box DNA-binding domain.

Sequence caution

The sequence AAB69627.1 differs from that shown. Reason: Frameshift at positions 135 and 201.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 315315Transcription factor SOX-12
PRO_0000048754

Regions

DNA binding40 – 10869HMG box
Compositional bias15 – 228Poly-Pro
Compositional bias211 – 2177Poly-Ala
Compositional bias223 – 25028Asp/Glu-rich (acidic)
Compositional bias223 – 23311Poly-Glu
Compositional bias234 – 2374Poly-Ala

Sequences

Sequence LengthMass (Da)Tools
O15370 [UniParc].

Last modified October 3, 2006. Version 2.
Checksum: A5BF539AC505942D

FASTA31534,122
        10         20         30         40         50         60 
MVQQRGARAK RDGGPPPPGP GPAEEGAREP GWCKTPSGHI KRPMNAFMVW SQHERRKIMD 

        70         80         90        100        110        120 
QWPDMHNAEI SKRLGRRWQL LQDSEKIPFV REAERLRLKH MADYPDYKYR PRKKSKGAPA 

       130        140        150        160        170        180 
KARPRPPGGS GGGSRLKPGP QLPGRGGRRA AGGPLGGGAA APEDDDEDDD EELLEVRLVE 

       190        200        210        220        230        240 
TPGRELWRMV PAGRAARGQA ERAQGPSGEG AAAAAAASPT PSEDEEPEEE EEEAAAAEEG 

       250        260        270        280        290        300 
EEETVASGEE SLGFLSRLPP GPAGLDCSAL DRDPDLQPPS GTSHFEFPDY CTPEVTEMIA 

       310 
GDWRPSSIAD LVFTY 

« Hide

References

« Hide 'large scale' references
[1]"SOX22 is a new member of the SOX gene family, mainly expressed in human nervous tissue."
Jay P., Sahly I., Goze C., Taviaux S., Poulat F., Couly G., Abitbol M., Berta P.
Hum. Mol. Genet. 6:1069-1077(1997) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA], SUBCELLULAR LOCATION, TISSUE SPECIFICITY.
Tissue: Fetal brain.
[2]"The DNA sequence and comparative analysis of human chromosome 20."
Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., Beasley O.P., Bird C.P., Blakey S.E. expand/collapse author list , Bridgeman A.M., Brown A.J., Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., Rogers J.
Nature 414:865-871(2001) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[3]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Prostate.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
U35612 mRNA. Translation: AAB69627.1. Frameshift.
AL034548 Genomic DNA. Translation: CAB81632.1.
BC067361 mRNA. Translation: AAH67361.1.
CCDSCCDS12995.1.
RefSeqNP_008874.2. NM_006943.3.
UniGeneHs.43627.
Hs.712815.
Hs.745045.
Hs.745058.

3D structure databases

ProteinModelPortalO15370.
SMRO15370. Positions 38-113.
ModBaseSearch...
MobiDBSearch...

Protein-protein interaction databases

BioGrid112549. 4 interactions.
IntActO15370. 3 interactions.

PTM databases

PhosphoSiteO15370.

Proteomic databases

MaxQBO15370.
PaxDbO15370.
PRIDEO15370.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000342665; ENSP00000347646; ENSG00000177732.
ENST00000544632; ENSP00000441671; ENSG00000177732.
GeneID6666.
KEGGhsa:6666.
UCSCuc002wdh.4. human.

Organism-specific databases

CTD6666.
GeneCardsGC20P000306.
HGNCHGNC:11198. SOX12.
MIM601947. gene.
neXtProtNX_O15370.
PharmGKBPA36035.
GenAtlasSearch...

Phylogenomic databases

eggNOGNOG290369.
HOGENOMHOG000231874.
HOVERGENHBG094895.
InParanoidO15370.
KOK09268.
OMAMVQQRGA.
OrthoDBEOG7TMZVP.
PhylomeDBO15370.
TreeFamTF351735.

Gene expression databases

BgeeO15370.
CleanExHS_SOX12.
GenevestigatorO15370.

Family and domain databases

Gene3D1.10.30.10. 1 hit.
InterProIPR009071. HMG_box_dom.
IPR017386. SOX-12/11/4a.
[Graphical view]
PfamPF00505. HMG_box. 1 hit.
[Graphical view]
PIRSFPIRSF038098. SOX-12/11/4a. 1 hit.
SMARTSM00398. HMG. 1 hit.
[Graphical view]
SUPFAMSSF47095. SSF47095. 1 hit.
PROSITEPS50118. HMG_BOX_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

GeneWikiSOX12.
GenomeRNAi6666.
NextBio25991.
PROO15370.
SOURCESearch...

Entry information

Entry nameSOX12_HUMAN
AccessionPrimary (citable) accession number: O15370
Secondary accession number(s): Q5D038, Q9NUD4
Entry history
Integrated into UniProtKB/Swiss-Prot: July 15, 1998
Last sequence update: October 3, 2006
Last modified: July 9, 2014
This is version 123 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

Human chromosome 20

Human chromosome 20: entries, gene names and cross-references to MIM