ID   SOX4_MOUSE              Reviewed;         440 AA.
AC   Q06831; Q5SW95;
DT   01-JUN-1994, integrated into UniProtKB/Swiss-Prot.
DT   27-JUL-2011, sequence version 2.
DT   14-OCT-2015, entry version 130.
DE   RecName: Full=Transcription factor SOX-4;
GN   Name=Sox4; Synonyms=Sox-4;
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;
OC   Muroidea; Muridae; Murinae; Mus; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [MRNA].
RX   PubMed=8404853;
RA   van de Wetering M., Oosterwegel M., van Norren K., Clevers H.C.;
RT   "Sox-4, an Sry-like HMG box protein, is a transcriptional activator in
RT   lymphocytes.";
RL   EMBO J. 12:3847-3854(1993).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [MRNA].
RX   PubMed=8493110; DOI=10.1093/nar/21.8.2009;
RA   Schilham M.W., van Eijk M., van de Wetering M., Clevers H.C.;
RT   "The murine Sox-4 protein is encoded on a single exon.";
RL   Nucleic Acids Res. 21:2009-2009(1993).
RN   [3]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=C57BL/6J;
RX   PubMed=19468303; DOI=10.1371/journal.pbio.1000112;
RA   Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S.,
RA   She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W.,
RA   Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T.,
RA   Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J.,
RA   Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z.,
RA   Lindblad-Toh K., Eichler E.E., Ponting C.P.;
RT   "Lineage-specific biology revealed by a finished genome assembly of
RT   the mouse.";
RL   PLoS Biol. 7:E1000112-E1000112(2009).
RN   [4]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.;
RL   Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
RN   [5]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC   STRAIN=C57BL/6; TISSUE=Brain;
RX   PubMed=15489334; DOI=10.1101/gr.2596504;
RG   The MGC Project Team;
RT   "The status, quality, and expansion of the NIH full-length cDNA
RT   project: the Mammalian Gene Collection (MGC).";
RL   Genome Res. 14:2121-2127(2004).
RN   [6]
RP   NUCLEOTIDE SEQUENCE [MRNA] OF 69-122.
RC   STRAIN=Swiss Webster; TISSUE=Embryonic tooth;
RX   PubMed=8921394; DOI=10.1006/geno.1996.0548;
RA   Stock D.W., Buchanan A.V., Zhao Z., Weiss K.M.;
RT   "Numerous members of the Sox family of HMG box-containing genes are
RT   expressed in developing mouse teeth.";
RL   Genomics 37:234-237(1996).
RN   [7]
RP   IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC   TISSUE=Embryonic brain;
RX   PubMed=15345747; DOI=10.1074/mcp.M400085-MCP200;
RA   Ballif B.A., Villen J., Beausoleil S.A., Schwartz D., Gygi S.P.;
RT   "Phosphoproteomic analysis of the developing mouse brain.";
RL   Mol. Cell. Proteomics 3:1093-1101(2004).
CC   -!- FUNCTION: Transcriptional activator that binds with high affinity
CC       to the T-cell enhancer motif 5'-AACAAAG-3'.
CC   -!- SUBUNIT: Interacts with UBE2I. {ECO:0000250}.
CC   -!- INTERACTION:
CC       P15884:TCF4 (xeno); NbExp=2; IntAct=EBI-6262177, EBI-533224;
CC   -!- SUBCELLULAR LOCATION: Nucleus.
CC   -!- TISSUE SPECIFICITY: Expressed in lymphocytes and in molar and
CC       incisor tooth germs.
CC   -!- SIMILARITY: Contains 1 HMG box DNA-binding domain.
CC       {ECO:0000255|PROSITE-ProRule:PRU00267}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; X70298; CAA49779.1; -; mRNA.
DR   EMBL; AL606511; CAI24776.1; -; Genomic_DNA.
DR   EMBL; CH466561; EDL32406.1; -; Genomic_DNA.
DR   EMBL; BC052736; AAH52736.1; -; mRNA.
DR   EMBL; U70440; AAC52858.1; -; mRNA.
DR   CCDS; CCDS26411.1; -.
DR   PIR; S37303; S37303.
DR   RefSeq; NP_033264.2; NM_009238.2.
DR   UniGene; Mm.240627; -.
DR   UniGene; Mm.455819; -.
DR   PDB; 3U2B; X-ray; 2.40 A; C=57-135.
DR   PDBsum; 3U2B; -.
DR   ProteinModelPortal; Q06831; -.
DR   SMR; Q06831; 57-132.
DR   IntAct; Q06831; 5.
DR   STRING; 10090.ENSMUSP00000100013; -.
DR   PhosphoSite; Q06831; -.
DR   MaxQB; Q06831; -.
DR   PRIDE; Q06831; -.
DR   Ensembl; ENSMUST00000067230; ENSMUSP00000100013; ENSMUSG00000076431.
DR   GeneID; 20677; -.
DR   KEGG; mmu:20677; -.
DR   UCSC; uc007pyk.1; mouse.
DR   CTD; 6659; -.
DR   MGI; MGI:98366; Sox4.
DR   GeneTree; ENSGT00760000118988; -.
DR   HOGENOM; HOG000231874; -.
DR   HOVERGEN; HBG005040; -.
DR   InParanoid; Q06831; -.
DR   KO; K09268; -.
DR   OMA; SHDDEFE; -.
DR   OrthoDB; EOG7TMZVP; -.
DR   Reactome; R-MMU-3769402; deactivation of the beta-catenin transactivating complex.
DR   ChiTaRS; Sox4; mouse.
DR   NextBio; 299169; -.
DR   PRO; PR:Q06831; -.
DR   Proteomes; UP000000589; Chromosome 13.
DR   Bgee; Q06831; -.
DR   CleanEx; MM_SOX4; -.
DR   Genevisible; Q06831; MM.
DR   GO; GO:0005737; C:cytoplasm; ISS:UniProtKB.
DR   GO; GO:0005739; C:mitochondrion; ISS:UniProtKB.
DR   GO; GO:0044798; C:nuclear transcription factor complex; IDA:MGI.
DR   GO; GO:0005654; C:nucleoplasm; ISO:MGI.
DR   GO; GO:0005634; C:nucleus; IDA:UniProtKB.
DR   GO; GO:0001046; F:core promoter sequence-specific DNA binding; IDA:UniProtKB.
DR   GO; GO:0001071; F:nucleic acid binding transcription factor activity; ISS:UniProtKB.
DR   GO; GO:0046982; F:protein heterodimerization activity; IDA:MGI.
DR   GO; GO:0001105; F:RNA polymerase II transcription coactivator activity; IDA:UniProtKB.
DR   GO; GO:0000981; F:RNA polymerase II transcription factor activity, sequence-specific DNA binding; IDA:MGI.
DR   GO; GO:0003700; F:transcription factor activity, sequence-specific DNA binding; IDA:UniProtKB.
DR   GO; GO:0000976; F:transcription regulatory region sequence-specific DNA binding; IDA:UniProtKB.
DR   GO; GO:0001077; F:transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding; IDA:UniProtKB.
DR   GO; GO:0035910; P:ascending aorta morphogenesis; IMP:BHF-UCL.
DR   GO; GO:0003289; P:atrial septum primum morphogenesis; IMP:BHF-UCL.
DR   GO; GO:0060070; P:canonical Wnt signaling pathway; IMP:BHF-UCL.
DR   GO; GO:0003215; P:cardiac right ventricle morphogenesis; IMP:BHF-UCL.
DR   GO; GO:0003211; P:cardiac ventricle formation; IMP:UniProtKB.
DR   GO; GO:0071333; P:cellular response to glucose stimulus; IMP:UniProtKB.
DR   GO; GO:0042769; P:DNA damage response, detection of DNA damage; ISS:UniProtKB.
DR   GO; GO:0006977; P:DNA damage response, signal transduction by p53 class mediator resulting in cell cycle arrest; ISS:UniProtKB.
DR   GO; GO:0031018; P:endocrine pancreas development; IMP:MGI.
DR   GO; GO:0021782; P:glial cell development; IMP:UniProtKB.
DR   GO; GO:0014009; P:glial cell proliferation; IMP:UniProtKB.
DR   GO; GO:0042593; P:glucose homeostasis; IMP:UniProtKB.
DR   GO; GO:0007507; P:heart development; IMP:BHF-UCL.
DR   GO; GO:0060993; P:kidney morphogenesis; IMP:BHF-UCL.
DR   GO; GO:0060174; P:limb bud formation; IMP:UniProtKB.
DR   GO; GO:0003183; P:mitral valve morphogenesis; IMP:BHF-UCL.
DR   GO; GO:0060548; P:negative regulation of cell death; IMP:UniProtKB.
DR   GO; GO:0008285; P:negative regulation of cell proliferation; ISS:UniProtKB.
DR   GO; GO:0046826; P:negative regulation of protein export from nucleus; ISS:UniProtKB.
DR   GO; GO:0031397; P:negative regulation of protein ubiquitination; ISS:UniProtKB.
DR   GO; GO:0001841; P:neural tube formation; IMP:UniProtKB.
DR   GO; GO:0060563; P:neuroepithelial cell differentiation; IMP:UniProtKB.
DR   GO; GO:0003357; P:noradrenergic neuron differentiation; IMP:UniProtKB.
DR   GO; GO:0043065; P:positive regulation of apoptotic process; ISO:MGI.
DR   GO; GO:0090263; P:positive regulation of canonical Wnt signaling pathway; IDA:UniProtKB.
DR   GO; GO:0008284; P:positive regulation of cell proliferation; IMP:UniProtKB.
DR   GO; GO:0032024; P:positive regulation of insulin secretion; IMP:UniProtKB.
DR   GO; GO:2000761; P:positive regulation of N-terminal peptidyl-lysine acetylation; ISS:UniProtKB.
DR   GO; GO:0045944; P:positive regulation of transcription from RNA polymerase II promoter; IDA:UniProtKB.
DR   GO; GO:0045893; P:positive regulation of transcription, DNA-templated; IDA:UniProtKB.
DR   GO; GO:0045727; P:positive regulation of translation; ISS:UniProtKB.
DR   GO; GO:0030177; P:positive regulation of Wnt signaling pathway; IMP:BHF-UCL.
DR   GO; GO:0002328; P:pro-B cell differentiation; IMP:BHF-UCL.
DR   GO; GO:0050821; P:protein stabilization; IMP:BHF-UCL.
DR   GO; GO:0031647; P:regulation of protein stability; ISS:UniProtKB.
DR   GO; GO:0006355; P:regulation of transcription, DNA-templated; ISS:UniProtKB.
DR   GO; GO:0001501; P:skeletal system development; IMP:UniProtKB.
DR   GO; GO:0035019; P:somatic stem cell maintenance; IDA:MGI.
DR   GO; GO:0021510; P:spinal cord development; IMP:UniProtKB.
DR   GO; GO:0021522; P:spinal cord motor neuron differentiation; IMP:UniProtKB.
DR   GO; GO:0048485; P:sympathetic nervous system development; IMP:UniProtKB.
DR   GO; GO:0030217; P:T cell differentiation; IMP:BHF-UCL.
DR   GO; GO:0006366; P:transcription from RNA polymerase II promoter; IDA:GOC.
DR   GO; GO:0060412; P:ventricular septum morphogenesis; IMP:BHF-UCL.
DR   Gene3D; 1.10.30.10; -; 1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   Pfam; PF00505; HMG_box; 1.
DR   SMART; SM00398; HMG; 1.
DR   SUPFAM; SSF47095; SSF47095; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
PE   1: Evidence at protein level;
KW   3D-structure; Activator; Complete proteome; DNA-binding; Nucleus;
KW   Reference proteome; Transcription; Transcription regulation.
FT   CHAIN         1    440       Transcription factor SOX-4.
FT                                /FTId=PRO_0000048725.
FT   DNA_BIND     59    127       HMG box. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00267}.
FT   COMPBIAS    347    363       Poly-Ser.
FT   CONFLICT    175    175       S -> T (in Ref. 1; CAA49779).
FT                                {ECO:0000305}.
FT   CONFLICT    179    179       A -> T (in Ref. 1; CAA49779).
FT                                {ECO:0000305}.
FT   CONFLICT    235    236       SA -> QL (in Ref. 1; CAA49779).
FT                                {ECO:0000305}.
FT   CONFLICT    263    263       R -> H (in Ref. 1; CAA49779).
FT                                {ECO:0000305}.
FT   CONFLICT    283    283       S -> C (in Ref. 1; CAA49779).
FT                                {ECO:0000305}.
FT   HELIX        65     78       {ECO:0000244|PDB:3U2B}.
FT   HELIX        86     99       {ECO:0000244|PDB:3U2B}.
FT   HELIX       102    122       {ECO:0000244|PDB:3U2B}.
FT   STRAND      123    125       {ECO:0000244|PDB:3U2B}.
SQ   SEQUENCE   440 AA;  45044 MW;  979AADBA7F674B6D CRC64;
     MVQQTNNAEN TEALLAGESS DSGAGLELGI ASSPTPGSTA STGGKADDPS WCKTPSGHIK
     RPMNAFMVWS QIERRKIMEQ SPDMHNAEIS KRLGKRWKLL KDSDKIPFIQ EAERLRLKHM
     ADYPDYKYRP RKKVKSGNAG AGSAATAKPG EKGDKVAGSS GHAGSSHAGG GAGGSSKPAP
     KKSCGPKVAG SSVGKPHAKL VPAGGSKAAA SFSPEQAALL PLGEPTAVYK VRTPSAATPA
     ASSSPSSALA TPAKHPADKK VKRVYLFGSL GASASPVGGL GASADPSDPL GLYEDGGPGC
     SPDGRSLSGR SSAASSPAAS RSPADHRGYA SLRAASPAPS SAPSHASSSL SSSSSSSSGS
     SSSDDEFEDD LLDLNPSSNF ESMSLGSFSS SSALDRDLDF NFEPGSGSHF EFPDYCTPEV
     SEMISGDWLE SSISNLVFTY
//