Protein

GS homeobox 1

Gene

GSX1

Organism
Homo sapiens (Human)
Status
Reviewed. Annotation score: 4 out of 5. Experimental evidence at transcript level.

Function

Probable transcription factor that binds to the DNA sequence 5'-GC[TA][AC]ATTA[GA]-3'. Activates the transcription of the GHRH gene. Plays an important role in pituitary development.
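
The bracketed positions in the binding site above read naturally as character classes, so the motif can be sketched as a regular expression. The snippet below is only an illustration under that assumption: the names `MOTIF`, `reverse_complement` and `find_motif_sites` are hypothetical and not part of the entry, and an exact pattern match says nothing about whether the protein actually binds a given site.

```python
import re

# Motif copied from the entry: 5'-GC[TA][AC]ATTA[GA]-3'.
# Assumption: each bracket means "any one of these bases".
MOTIF = re.compile(r"GC[TA][AC]ATTA[GA]")

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return dna.translate(_COMPLEMENT)[::-1]


def find_motif_sites(dna: str):
    """Scan both strands; return sorted (start, strand, match) tuples.

    `start` is the 0-based position of the leftmost base of the site
    on the forward strand, whichever strand the motif was read from.
    """
    dna = dna.upper()
    sites = [(m.start(), "+", m.group()) for m in MOTIF.finditer(dna)]
    rc = reverse_complement(dna)
    for m in MOTIF.finditer(rc):
        # Map the reverse-strand match back to forward-strand coordinates.
        sites.append((len(dna) - m.end(), "-", m.group()))
    return sorted(sites)


if __name__ == "__main__":
    print(find_motif_sites("AAGCTCATTAAAA"))
```

Scanning both strands matters because a transcription factor site can sit in either orientation relative to the sequence as written.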

Regions

Feature key | Position(s) | Length | Description
DNA binding | 147 – 206 | 60 | Homeobox (PROSITE-ProRule annotation)

GO - Molecular function

  1. RNA polymerase II core promoter proximal region sequence-specific DNA binding Source: Ensembl
  2. RNA polymerase II core promoter proximal region sequence-specific DNA binding transcription factor activity involved in positive regulation of transcription Source: Ensembl
  3. sequence-specific DNA binding Source: MGI

GO - Biological process

  1. adenohypophysis development Source: Ensembl
  2. hypothalamus development Source: Ensembl
  3. neuron fate commitment Source: Ensembl
  4. positive regulation of transcription from RNA polymerase II promoter Source: MGI
  5. spinal cord association neuron differentiation Source: Ensembl

Keywords - Molecular function

Activator, Developmental protein

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name: GS homeobox 1
Alternative name(s): Homeobox protein GSH-1
Gene names
Name: GSX1
Synonyms: GSH1
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes: UP000005640: Chromosome 13

Organism-specific databases

HGNC: HGNC:20374. GSX1.

Subcellular location

Nucleus (PROSITE-ProRule annotation)

GO - Cellular component

  1. nucleus Source: UniProtKB-SubCell

Keywords - Cellular component

Nucleus

Pathology & Biotech

Organism-specific databases

PharmGKB: PA162390373.

PTM / Processing

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Chain | 1 – 264 | 264 | GS homeobox 1 | PRO_0000048894

Proteomic databases

PaxDb: Q9H4S2.
PRIDE: Q9H4S2.

Expression

Gene expression databases

Bgee: Q9H4S2.
CleanEx: HS_GSX1.
Genevestigator: Q9H4S2.

Organism-specific databases

HPA: HPA047096.

Interaction

Protein-protein interaction databases

STRING: 9606.ENSP00000304331.

Structure

3D structure databases

ProteinModelPortal: Q9H4S2.
SMR: Q9H4S2. Positions 149-205.

Family & Domains

Region

Feature key | Position(s) | Length | Description
Region | 1 – 20 | 20 | SNAG domain (By similarity)

Compositional bias

Feature key | Position(s) | Length | Description
Compositional bias | 111 – 118 | 8 | Poly-Ala
Compositional bias | 213 – 223 | 11 | Poly-Gly

Sequence similarities

Belongs to the Antp homeobox family. (Curated)
Contains 1 homeobox DNA-binding domain. (PROSITE-ProRule annotation)

Keywords - Domain

Homeobox

Phylogenomic databases

eggNOG: NOG264927.
GeneTree: ENSGT00760000118940.
HOGENOM: HOG000010106.
HOVERGEN: HBG003555.
InParanoid: Q9H4S2.
KO: K09310.
OMA: HNCKCSS.
OrthoDB: EOG72C514.
PhylomeDB: Q9H4S2.
TreeFam: TF315938.

Family and domain databases

Gene3D: 1.10.10.60. 1 hit.
InterPro: IPR017970. Homeobox_CS.
    IPR001356. Homeobox_dom.
    IPR020479. Homeobox_metazoa.
    IPR009057. Homeodomain-like.
Pfam: PF00046. Homeobox. 1 hit.
PRINTS: PR00024. HOMEOBOX.
SMART: SM00389. HOX. 1 hit.
SUPFAM: SSF46689. SSF46689. 1 hit.
PROSITE: PS00027. HOMEOBOX_1. 1 hit.
    PS50071. HOMEOBOX_2. 1 hit.

Sequence

Sequence status: Complete.

Q9H4S2-1

        10         20         30         40         50
MPRSFLVDSL VLREAGEKKA PEGSPPPLFP YAVPPPHALH GLSPGACHAR
        60         70         80         90        100
KAGLLCVCPL CVTASQLHGP PGPPALPLLK ASFPPFGSQY CHAPLGRQHS
       110        120        130        140        150
AVSPGVAHGP AAAAAAAALY QTSYPLPDPR QFHCISVDSS SNQLPSSKRM
       160        170        180        190        200
RTAFTSTQLL ELEREFASNM YLSRLRRIEI ATYLNLSEKQ VKIWFQNRRV
       210        220        230        240        250
KHKKEGKGSN HRGGGGGGAG GGGSAPQGCK CASLSSAKCS EDDDELPMSP
       260
SSSGKDDRDL TVTP
Length: 264
Mass (Da): 27,883
Last modified: March 1, 2001 - v1
Checksum: 25F4C4336E270C00
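
The feature positions quoted in this entry (homeobox 147 – 206, Poly-Ala 111 – 118, Poly-Gly 213 – 223) are 1-based and inclusive, so a feature's length is end minus start plus one. As a rough sanity check, the sketch below rebuilds the sequence from the ten-residue blocks shown above and slices out those features. The names `RAW_BLOCKS`, `parse_blocks` and `feature_slice` are hypothetical helpers introduced only for this illustration.

```python
import re

# Sequence blocks as displayed in the entry (position rulers omitted).
RAW_BLOCKS = """
MPRSFLVDSL VLREAGEKKA PEGSPPPLFP YAVPPPHALH GLSPGACHAR
KAGLLCVCPL CVTASQLHGP PGPPALPLLK ASFPPFGSQY CHAPLGRQHS
AVSPGVAHGP AAAAAAAALY QTSYPLPDPR QFHCISVDSS SNQLPSSKRM
RTAFTSTQLL ELEREFASNM YLSRLRRIEI ATYLNLSEKQ VKIWFQNRRV
KHKKEGKGSN HRGGGGGGAG GGGSAPQGCK CASLSSAKCS EDDDELPMSP
SSSGKDDRDL TVTP
"""


def parse_blocks(raw: str) -> str:
    """Drop whitespace, digits and anything else that is not a letter."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def feature_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive positions."""
    return seq[start - 1:end]


SEQ = parse_blocks(RAW_BLOCKS)

if __name__ == "__main__":
    print("length:", len(SEQ))
    print("Poly-Ala 111-118:", feature_slice(SEQ, 111, 118))
    print("Poly-Gly 213-223:", feature_slice(SEQ, 213, 223))
    print("Homeobox 147-206 length:", len(feature_slice(SEQ, 147, 206)))
```

The Poly-Gly stretch contains a single alanine, which is consistent with the entry describing it as a compositional bias and not as a strict repeat.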

Sequence databases

EMBL/GenBank/DDBJ: AB044157 mRNA. Translation: BAB78692.1.
    AB044158 Genomic DNA. Translation: BAB78693.1.
    AL390738 Genomic DNA. Translation: CAC12721.1.
CCDS: CCDS9326.1.
RefSeq: NP_663632.1. NM_145657.1.
UniGene: Hs.351785.

Genome annotation databases

Ensembl: ENST00000302945; ENSP00000304331; ENSG00000169840.
GeneID: 219409.
KEGG: hsa:219409.
UCSC: uc001urr.1. human.

Polymorphism databases

DMDM: 27923786.

Cross-references

Protocols and materials databases

DNASU: 219409.

Organism-specific databases

CTD: 219409.
GeneCards: GC13P028366.
HGNC: HGNC:20374. GSX1.
HPA: HPA047096.
neXtProt: NX_Q9H4S2.
PharmGKB: PA162390373.

Miscellaneous databases

GenomeRNAi: 219409.
NextBio: 90587.
PRO: Q9H4S2.

Publications

  1. Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
  2. "The DNA sequence and analysis of human chromosome 13."
    Dunham A., Matthews L.H., Burton J., Ashurst J.L., Howe K.L., Ashcroft K.J., Beare D.M., Burford D.C., Hunt S.E., Griffiths-Jones S., Jones M.C., Keenan S.J., Oliver K., Scott C.E., Ainscough R., Almeida J.P., Ambrose K.D., Andrews D.T., Ashwell R.I.S., Babbage A.K., Bagguley C.L., Bailey J., Bannerjee R., Barlow K.F., Bates K., Beasley H., Bird C.P., Bray-Allen S., Brown A.J., Brown J.Y., Burrill W., Carder C., Carter N.P., Chapman J.C., Clamp M.E., Clark S.Y., Clarke G., Clee C.M., Clegg S.C., Cobley V., Collins J.E., Corby N., Coville G.J., Deloukas P., Dhami P., Dunham I., Dunn M., Earthrowl M.E., Ellington A.G., Faulkner L., Frankish A.G., Frankland J., French L., Garner P., Garnett J., Gilbert J.G.R., Gilson C.J., Ghori J., Grafham D.V., Gribble S.M., Griffiths C., Hall R.E., Hammond S., Harley J.L., Hart E.A., Heath P.D., Howden P.J., Huckle E.J., Hunt P.J., Hunt A.R., Johnson C., Johnson D., Kay M., Kimberley A.M., King A., Laird G.K., Langford C.J., Lawlor S., Leongamornlert D.A., Lloyd D.M., Lloyd C., Loveland J.E., Lovell J., Martin S., Mashreghi-Mohammadi M., McLaren S.J., McMurray A., Milne S., Moore M.J.F., Nickerson T., Palmer S.A., Pearce A.V., Peck A.I., Pelan S., Phillimore B., Porter K.M., Rice C.M., Searle S., Sehra H.K., Shownkeen R., Skuce C.D., Smith M., Steward C.A., Sycamore N., Tester J., Thomas D.W., Tracey A., Tromans A., Tubby B., Wall M., Wallis J.M., West A.P., Whitehead S.L., Willey D.L., Wilming L., Wray P.W., Wright M.W., Young L., Coulson A., Durbin R.M., Hubbard T., Sulston J.E., Beck S., Bentley D.R., Rogers J., Ross M.T.
    Nature 428:522-528(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  3. "Identification of homeobox genes expressed in human haemopoietic progenitor cells."
    Moretti P., Simmons P., Thomas P., Haylock D., Rathjen P., Vadas M., D'Andrea R.
    Gene 144:213-219(1994)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 155-193.
    Tissue: Bone marrow.

Entry information

Entry name: GSX1_HUMAN
Accession: Primary (citable) accession number: Q9H4S2
Secondary accession number(s): Q9UD62
Entry history
Integrated into UniProtKB/Swiss-Prot: January 27, 2003
Last sequence update: March 1, 2001
Last modified: January 7, 2015
This is version 108 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 13
    Human chromosome 13: entries, gene names and cross-references to MIM
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.