Protein

GS homeobox 2

Gene

Gsx2

Organism
Mus musculus (Mouse)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

During telencephalic development, causes ventralization of pallial progenitors and, depending on the developmental stage, specifies different neuronal fates. At early stages, necessary and sufficient to correctly specify the ventral lateral ganglionic eminence (LGE) and its major derivatives, the striatal projection neurons. At later stages, may specify LGE progenitors toward dorsal LGE fates, including olfactory bulb interneurons (PubMed:19709628). Transcription factor that binds the DNA sequence 5'-CNAATTAG-3' (PubMed:7619729).
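The binding motif 5'-CNAATTAG-3' can be read as a pattern in which N stands for any base. The following is a minimal sketch, not part of the entry, of how such a motif could be located on both strands of a DNA string. The function names and the test sequence are hypothetical and chosen only for illustration.

```python
import re

# Motif from the entry; N is the IUPAC code for any base (A, C, G or T).
MOTIF = "CNAATTAG"


def motif_to_regex(motif: str) -> str:
    """Translate a motif containing N into a regular expression."""
    return "".join("[ACGT]" if base == "N" else base for base in motif)


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sites(dna: str, motif: str = MOTIF):
    """Return (1-based forward-strand start, strand, matched text) per hit."""
    # The lookahead lets overlapping occurrences be reported as well.
    pattern = re.compile("(?=(" + motif_to_regex(motif) + "))")
    dna = dna.upper()
    hits = [(m.start() + 1, "+", m.group(1)) for m in pattern.finditer(dna)]
    for m in pattern.finditer(reverse_complement(dna)):
        # Map the reverse-strand coordinate back onto the forward strand.
        start = len(dna) - (m.start() + len(motif)) + 1
        hits.append((start, "-", m.group(1)))
    return sorted(hits)


# Hypothetical test sequence, not taken from the entry.
example = "AACGAATTAGTTTTCTAATTTGAA"
print(find_sites(example))
```

A match only shows that the string fits the consensus; it says nothing about whether the protein binds that site in vivo. Palindromic instances such as CTAATTAG are reported once per strand.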

Regions

Feature key    Position(s)    Description                              Length
DNA binding    203 – 262      Homeobox (PROSITE-ProRule annotation)    60

GO - Molecular function

  • DNA binding Source: MGI
  • sequence-specific DNA binding Source: InterPro

GO - Biological process

  • brain development Source: MGI
  • central nervous system development Source: MGI
  • forebrain dorsal/ventral pattern formation Source: MGI
  • forebrain morphogenesis Source: MGI
  • hindbrain morphogenesis Source: MGI
  • neuron fate commitment Source: MGI
  • neuron fate specification Source: MGI
  • olfactory bulb interneuron differentiation Source: MGI
  • pattern specification process Source: MGI
  • positive regulation of Notch signaling pathway Source: MGI
  • positive regulation of oligodendrocyte differentiation Source: MGI
  • regulation of cell migration Source: MGI
  • regulation of respiratory gaseous exchange by neurological system process Source: MGI
  • regulation of transcription, DNA-templated Source: UniProtKB-KW
  • spinal cord association neuron differentiation Source: MGI
  • subpallium development Source: MGI
  • subpallium neuron fate commitment Source: MGI
  • telencephalon regionalization Source: MGI
  • transcription, DNA-templated Source: UniProtKB-KW

Keywords - Molecular function

Developmental protein

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name: GS homeobox 2
Alternative name(s): Genetic-screened homeobox 2; Homeobox protein GSH-2
Gene names
Name: Gsx2
Synonyms: Gsh-2, Gsh2
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 5

Organism-specific databases

MGI: MGI:95843. Gsx2.

Subcellular location

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key    Identifier        Position(s)    Description      Length
Chain          PRO_0000048897    1 – 305        GS homeobox 2    305

Proteomic databases

PaxDb: P31316.
PRIDE: P31316.

Expression

Developmental stage

At 10.0 dpc, expressed in a band of primitive neuroepithelial cells in the neural tube, mesencephalon and telencephalon. At 11.5-13.5 dpc, expression is symmetrical but tightly limited to areas of the forebrain, midbrain and hindbrain (PubMed:7619729). At 11 dpc, in the developing telencephalon, expressed at high levels in cells throughout the presumptive lateral ganglionic eminence (LGE), with an apparent ventral-to-dorsal gradient in expressing cell numbers. Positive cells are also scattered somewhat uniformly throughout the adjacent medial ganglionic eminence. From 12.5 dpc onward, exhibits a clear graded pattern of expression, with low levels found in cells located ventrally and the highest levels confined to those in the most dorsal portion of the LGE (at protein level) (PubMed:19709628). Expression decreases from 14.5 dpc on and becomes undetectable at 16.5 dpc (PubMed:7619729).

Gene expression databases

Bgee: ENSMUSG00000035946.
CleanEx: MM_GSX2.
ExpressionAtlas: P31316. baseline and differential.
Genevisible: P31316. MM.

Interaction

Protein-protein interaction databases

BioGrid: 200086. 1 interactor.
STRING: 10090.ENSMUSP00000036625.

Structure

3D structure databases

ProteinModelPortal: P31316.
SMR: P31316.

Family & Domains

Compositional bias

Feature key           Position(s)    Description    Length
Compositional bias    124 – 130      Poly-His       7
Compositional bias    134 – 139      Poly-His       6
Compositional bias    147 – 163      Poly-Ala       17
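The compositional-bias features above can be cross-checked against the sequence with a simple scan for single-residue runs. This is a minimal sketch under stated assumptions: the fragment is residues 121 to 170 copied from the sequence given in this entry, while the function name and the minimum run length of 5 are arbitrary choices made for illustration, not the method used by the annotation pipeline.

```python
import re

# Residues 121-170 of the entry's sequence, written in blocks of ten.
FRAGMENT = "".join([
    "RVSHAHHHHH",  # 121-130
    "PPQHHHHHHQ",  # 131-140
    "PQQPGSAAAA",  # 141-150
    "AAAAAAAAAA",  # 151-160
    "AAALGHPQHH",  # 161-170
])
OFFSET = 121  # 1-based position of the first residue in FRAGMENT


def homopolymer_runs(seq, offset=1, min_len=5):
    """Return (residue, start, end) for uninterrupted runs of one residue.

    Positions are 1-based and inclusive, shifted by `offset`.
    """
    pattern = re.compile(r"(.)\1{%d,}" % (min_len - 1))
    return [
        (m.group(1), m.start() + offset, m.end() + offset - 1)
        for m in pattern.finditer(seq)
    ]


for residue, start, end in homopolymer_runs(FRAGMENT, OFFSET):
    print(residue, start, end, end - start + 1)
```

The strict scan reports the first histidine run as 126 to 130, because the annotated Poly-His region 124 to 130 includes an alanine at position 125. The second Poly-His region and the Poly-Ala region come out with the annotated coordinates.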

Sequence similarities

Belongs to the Antp homeobox family. (Curated)
Contains 1 homeobox DNA-binding domain. (PROSITE-ProRule annotation)

Keywords - Domain

Homeobox

Phylogenomic databases

eggNOG: KOG0489. Eukaryota.
eggNOG: ENOG410ZTBY. LUCA.
GeneTree: ENSGT00760000118940.
HOGENOM: HOG000010106.
HOVERGEN: HBG003555.
InParanoid: P31316.
KO: K09310.
OMA: AFCVCPL.
OrthoDB: EOG091G0HAO.
PhylomeDB: P31316.
TreeFam: TF315938.

Family and domain databases

Gene3D: 1.10.10.60. 1 hit.
InterPro: IPR017970. Homeobox_CS.
InterPro: IPR001356. Homeobox_dom.
InterPro: IPR020479. Homeobox_metazoa.
InterPro: IPR009057. Homeodomain-like.
Pfam: PF00046. Homeobox. 1 hit.
PRINTS: PR00024. HOMEOBOX.
SMART: SM00389. HOX. 1 hit.
SUPFAM: SSF46689. SSF46689. 1 hit.
PROSITE: PS00027. HOMEOBOX_1. 1 hit.
PROSITE: PS50071. HOMEOBOX_2. 1 hit.

Sequence

Sequence status: Complete.

P31316-1 [UniParc]

        10         20         30         40         50
MSRSFYVDSL IIKDSSRPAP SLPESHPGPD FFIPLGMPSP LVMSVSGPGC
        60         70         80         90        100
PSRKSGAFCV CPLCVTSHLH SSRPPAGAGG GATGTAGAAV AGGGVAGGTG
       110        120        130        140        150
ALPLLKSQFS PAPGDAQFCP RVSHAHHHHH PPQHHHHHHQ PQQPGSAAAA
       160        170        180        190        200
AAAAAAAAAA AAALGHPQHH APVCAATTYN MSDPRRFHCL SMGGSDTSQV
       210        220        230        240        250
PNGKRMRTAF TSTQLLELER EFSSNMYLSR LRRIEIATYL NLSEKQVKIW
       260        270        280        290        300
FQNRRVKHKK EGKGASRNNH TSCKCVGSQA HYARSEDEDS LSPASANEDK
EISPL

Length: 305
Mass (Da): 32,167
Last modified: October 1, 1996 - v2
Checksum: 51E7F2DB76E32608
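The feature coordinates in this entry are 1-based and inclusive, so they map onto string slices with a shift of one at the start. The sketch below, which is illustrative only, rebuilds the sequence from the blocks listed above, confirms the stated length, and extracts the homeobox region (203 to 262). The helper name `region` is hypothetical.

```python
# Sequence rows as listed in the entry; whitespace is only for readability.
ROWS = """
MSRSFYVDSL IIKDSSRPAP SLPESHPGPD FFIPLGMPSP LVMSVSGPGC
PSRKSGAFCV CPLCVTSHLH SSRPPAGAGG GATGTAGAAV AGGGVAGGTG
ALPLLKSQFS PAPGDAQFCP RVSHAHHHHH PPQHHHHHHQ PQQPGSAAAA
AAAAAAAAAA AAALGHPQHH APVCAATTYN MSDPRRFHCL SMGGSDTSQV
PNGKRMRTAF TSTQLLELER EFSSNMYLSR LRRIEIATYL NLSEKQVKIW
FQNRRVKHKK EGKGASRNNH TSCKCVGSQA HYARSEDEDS LSPASANEDK
EISPL
"""

# Join the ten-residue blocks into one continuous string.
SEQUENCE = "".join(ROWS.split())


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


homeobox = region(SEQUENCE, 203, 262)

print(len(SEQUENCE))   # total number of residues
print(len(homeobox))   # length of the annotated DNA-binding region
print(homeobox)
```

The same helper can be pointed at any other feature range given in the entry, for example the Poly-Ala stretch at 147 to 163.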

Sequence databases

EMBL / GenBank / DDBJ: S79041 mRNA. Translation: AAB34947.1.
CCDS: CCDS19350.1.
PIR: B37290.
RefSeq: NP_573555.1. NM_133256.2.
UniGene: Mm.218752.

Genome annotation databases

Ensembl: ENSMUST00000040477; ENSMUSP00000036625; ENSMUSG00000035946.
GeneID: 14843.
KEGG: mmu:14843.
UCSC: uc008xtx.1. mouse.

Cross-references

Protocols and materials databases

DNASU: 14843.

Organism-specific databases

CTD: 170825.
MGI: MGI:95843. Gsx2.

Miscellaneous databases

PRO: P31316.

Entry information

Entry name: GSX2_MOUSE
Accession: Primary (citable) accession number: P31316
Entry history
Integrated into UniProtKB/Swiss-Prot: July 1, 1993
Last sequence update: October 1, 1996
Last modified: November 2, 2016
This is version 128 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence in the cluster (the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.