Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Transcription factor Sox-7

Gene

sox7

Organism
Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at transcript leveli

Functioni

Transcription factor. Binds to the DNA sequence 5'-AACAAT-3'. Acts downstream of vegt and upstream of nodal signaling to promote endodermal and mesodermal differentiation by promoting vegt-induced expression of both endodermal genes (including endodermin) and mesodermal genes (including snai1/snail and snai2/slug). Induces expression of multiple nodal genes (including nodal, nodal2, nodal4, nodal5 and nodal6) and binds directly to sites within the promoter of the nodal5 gene. The endodermal and mesodermal specification pathways then interact to initiate cardiogenesis. Acts partially redundantly with sox18 during cardiogenesis. Also acts as an antagonist of beta-catenin signaling (By similarity). Regulates (possibly indirectly) development of the pronephros, the functional larval kidney.By similarity1 Publication

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
DNA bindingi42 – 110HMG boxPROSITE-ProRule annotationAdd BLAST69

GO - Molecular functioni

GO - Biological processi

Keywordsi

Molecular functionActivator, Developmental protein, DNA-binding
Biological processTranscription, Transcription regulation

Names & Taxonomyi

Protein namesi
Recommended name:
Transcription factor Sox-7
Gene namesi
Name:sox7Imported
ORF Names:TEgg131e23.1
OrganismiXenopus tropicalis (Western clawed frog) (Silurana tropicalis)
Taxonomic identifieri8364 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiAmphibiaBatrachiaAnuraPipoideaPipidaeXenopodinaeXenopusSilurana
Proteomesi
  • UP000008143 Componenti: Unassembled WGS sequence

Organism-specific databases

XenbaseiXB-GENE-488067. sox7.

Subcellular locationi

  • Nucleus PROSITE-ProRule annotationBy similarity

GO - Cellular componenti

Keywords - Cellular componenti

Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00003702461 – 362Transcription factor Sox-7Add BLAST362

Proteomic databases

PaxDbiQ28GD5.

Expressioni

Tissue specificityi

Expressed in the embryonic pronephric sinus as well as posterior cardinal veins.1 Publication

Gene expression databases

BgeeiENSXETG00000000693.

Interactioni

Protein-protein interaction databases

STRINGi8364.ENSXETP00000001530.

Structurei

3D structure databases

ProteinModelPortaliQ28GD5.
SMRiQ28GD5.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini245 – 362Sox C-terminalPROSITE-ProRule annotationAdd BLAST118

Phylogenomic databases

eggNOGiENOG410IPZD. Eukaryota.
ENOG410XP3W. LUCA.
GeneTreeiENSGT00760000118988.
HOGENOMiHOG000069999.
InParanoidiQ28GD5.
KOiK09270.
OMAiPHINGAV.
OrthoDBiEOG091G0CYU.

Family and domain databases

Gene3Di1.10.30.10. 1 hit.
InterProiView protein in InterPro
IPR009071. HMG_box_dom.
IPR033392. Sox7/17/18_central.
IPR021934. Sox_C.
PfamiView protein in Pfam
PF00505. HMG_box. 1 hit.
PF12067. Sox17_18_mid. 1 hit.
SMARTiView protein in SMART
SM00398. HMG. 1 hit.
SUPFAMiSSF47095. SSF47095. 1 hit.
PROSITEiView protein in PROSITE
PS50118. HMG_BOX_2. 1 hit.
PS51516. SOX_C. 1 hit.

Sequencei

Sequence statusi: Complete.

Q28GD5-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MTTLMGSYSW TEGLDCSPID EDLSDGLSPH RSPREKGSET RIRRPMNAFM
60 70 80 90 100
VWAKDERKRL AVQNPDLHNA ELSKMLGKSW KALSPAQKRP YVEEAERLRV
110 120 130 140 150
QHMQDYPNYK YRPRRKKQIK RICKRVDTGF LLSSLSRDQN SVPDTRGCRT
160 170 180 190 200
AVEKEENGGY PGSALPDMRH YRETPSNGSK HDQTYPYGLP TPPEMSPLEA
210 220 230 240 250
IDQDQSFYST PCSEDCHPHI NGAVYEYSSR SPILCSHLSQ VPIPQTGSSM
260 270 280 290 300
IPPVPNCPPA YYSSTYHSIH HNYHAHLGQL SPPPEHPHYD AIDQISQAEL
310 320 330 340 350
LGDMDRNEFD QYLNTSLHDP SEMTIHGHVQ VSQASDIQPS ETSLISVLAD
360
ATATYYNSYS VS
Length:362
Mass (Da):40,948
Last modified:April 4, 2006 - v1
Checksum:iA2B2F7754CC8EAC1
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CR761431 mRNA. Translation: CAJ82258.1.
BC170859 mRNA. Translation: AAI70859.1.
RefSeqiNP_001016326.1. NM_001016326.2.
UniGeneiStr.66419.

Genome annotation databases

EnsembliENSXETT00000001530; ENSXETP00000001530; ENSXETG00000000693.
GeneIDi549080.
KEGGixtr:549080.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CR761431 mRNA. Translation: CAJ82258.1.
BC170859 mRNA. Translation: AAI70859.1.
RefSeqiNP_001016326.1. NM_001016326.2.
UniGeneiStr.66419.

3D structure databases

ProteinModelPortaliQ28GD5.
SMRiQ28GD5.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi8364.ENSXETP00000001530.

Proteomic databases

PaxDbiQ28GD5.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSXETT00000001530; ENSXETP00000001530; ENSXETG00000000693.
GeneIDi549080.
KEGGixtr:549080.

Organism-specific databases

CTDi83595.
XenbaseiXB-GENE-488067. sox7.

Phylogenomic databases

eggNOGiENOG410IPZD. Eukaryota.
ENOG410XP3W. LUCA.
GeneTreeiENSGT00760000118988.
HOGENOMiHOG000069999.
InParanoidiQ28GD5.
KOiK09270.
OMAiPHINGAV.
OrthoDBiEOG091G0CYU.

Gene expression databases

BgeeiENSXETG00000000693.

Family and domain databases

Gene3Di1.10.30.10. 1 hit.
InterProiView protein in InterPro
IPR009071. HMG_box_dom.
IPR033392. Sox7/17/18_central.
IPR021934. Sox_C.
PfamiView protein in Pfam
PF00505. HMG_box. 1 hit.
PF12067. Sox17_18_mid. 1 hit.
SMARTiView protein in SMART
SM00398. HMG. 1 hit.
SUPFAMiSSF47095. SSF47095. 1 hit.
PROSITEiView protein in PROSITE
PS50118. HMG_BOX_2. 1 hit.
PS51516. SOX_C. 1 hit.
ProtoNetiSearch...

Entry informationi

Entry nameiSOX7_XENTR
AccessioniPrimary (citable) accession number: Q28GD5
Entry historyiIntegrated into UniProtKB/Swiss-Prot: April 14, 2009
Last sequence update: April 4, 2006
Last modified: March 15, 2017
This is version 74 of the entry and version 1 of the sequence. See complete history.
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.