Protein

Homeobox protein goosecoid

Gene

Gsc

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at transcript level

Function

Appears to regulate regional development of specific tissues. Can rescue axis polarity in UV-radiated Xenopus embryos.

Regions

Feature key    Position(s)    Description                              Length
DNA binding    282 – 341      Homeobox (PROSITE-ProRule annotation)    60

GO - Molecular function

  • protein heterodimerization activity Source: FlyBase
  • protein homodimerization activity Source: FlyBase
  • sequence-specific DNA binding Source: InterPro
  • transcriptional repressor activity, RNA polymerase II core promoter proximal region sequence-specific binding Source: FlyBase

Keywords - Molecular function

Developmental protein

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name: Homeobox protein goosecoid
Gene names
Name: Gsc
ORF names: CG2851
Organism: Drosophila melanogaster (Fruit fly)
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora
Proteomes
  • UP000000803 Component: Chromosome 2L

Organism-specific databases

FlyBase: FBgn0010323. Gsc.

Subcellular location

GO - Cellular component

  • nucleus Source: FlyBase

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key               Position(s)    Description                   Length
Chain (PRO_0000048893)    1 – 415        Homeobox protein goosecoid    415

Proteomic databases

PaxDb: P54366.
PRIDE: P54366.

Expression

Tissue specificity

In early embryonic development, expression is confined to two regions: a horseshoe-like domain across the dorsal side, which is destined to form the brain hemispheres, and a second domain, which invaginates into the stomodeum and is fated to form the foregut, ring gland and stomatogastric nervous system (SNS).

Gene expression databases

Bgee: FBgn0010323.
ExpressionAtlas: P54366. baseline.
Genevisible: P54366. DM.

Interaction

GO - Molecular function

  • protein heterodimerization activity Source: FlyBase
  • protein homodimerization activity Source: FlyBase

Protein-protein interaction databases

BioGrid: 59496. 20 interactors.
IntAct: P54366. 1 interactor.
MINT: MINT-314608.
STRING: 7227.FBpp0113060.

Structure

3D structure databases

ProteinModelPortal: P54366.
SMR: P54366.

Family & Domains

Compositional bias

Feature key           Position(s)    Description                              Length
Compositional bias    51 – 77        Gln-rich (PROSITE-ProRule annotation)    27
Compositional bias    191 – 239      Ala-rich (PROSITE-ProRule annotation)    49
Compositional bias    244 – 275      His-rich (PROSITE-ProRule annotation)    32

Sequence similarities

Contains 1 homeobox DNA-binding domain. (PROSITE-ProRule annotation)

Keywords - Domain

Homeobox

Phylogenomic databases

eggNOG: KOG0490. Eukaryota.
        ENOG410YIJ3. LUCA.
HOGENOM: HOG000276707.
InParanoid: P54366.
KO: K09324.
OrthoDB: EOG091G0OQ6.

Family and domain databases

Gene3D: 1.10.10.60. 1 hit.
InterPro: IPR017970. Homeobox_CS.
          IPR001356. Homeobox_dom.
          IPR009057. Homeodomain-like.
Pfam: PF00046. Homeobox. 1 hit.
SMART: SM00389. HOX. 1 hit.
SUPFAM: SSF46689. SSF46689. 1 hit.
PROSITE: PS00027. HOMEOBOX_1. 1 hit.
         PS50071. HOMEOBOX_2. 1 hit.

Sequence

Sequence status: Complete.

P54366-1 [UniParc]

        10         20         30         40         50
MVETNSPPAG YTLKRSPSDL GEQQQPPRQI SRSPGNTAAY HLTTAMLLNS
        60         70         80         90        100
QQCGYLGQRL QSVLQQQHAQ HQQSQSQTPS SDDGSQSGVT ILEEERRGGA
       110        120        130        140        150
AAASLFTIDS ILGSRQQGGG TAPSQGSHIS SNGNQNGLTS NGISLGLKRS
       160        170        180        190        200
GAESPASPNS NSSSSAAASP IRPQRVPAML QHPGLHLGHL AAAAASGFAA
       210        220        230        240        250
SPSDFLVAYP NFYPNYMHAA AVAHVAAAQM QAHVSGAAAG LSGHGHHPHH
       260        270        280        290        300
PHGHPHHPHL GAHHHGQHHL SHLGHGPPPK RKRRHRTIFT EEQLEQLEAT
       310        320        330        340        350
FDKTHYPDVV LREQLALKVD LKEERVEVWF KNRRAKWRKQ KREEQERLRK
       360        370        380        390        400
LQEEQCGSTT NGTTNSSSGT TSSTGNGSLT VKCPGSDHYS AQLVHIKSDA
       410
NGYSDADESS DLEVA
Length: 415
Mass (Da): 44,506
Last modified: March 16, 2016 - v2
Checksum: 9646621754668657
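
The feature positions in this entry are 1-based and inclusive, so they can be checked directly against the sequence. The following is a minimal Python sketch, assuming the sequence block is pasted in as text; the helper names (`clean_sequence`, `feature`, `top_residue`) are illustrative and not part of any UniProt tool. It strips the grouping spaces, slices out the homeobox (282 – 341), and reports the most frequent residue in each compositional-bias region.

```python
import re
from collections import Counter

# Sequence block copied from this entry (P54366-1), grouping preserved.
RAW = """
MVETNSPPAG YTLKRSPSDL GEQQQPPRQI SRSPGNTAAY HLTTAMLLNS
QQCGYLGQRL QSVLQQQHAQ HQQSQSQTPS SDDGSQSGVT ILEEERRGGA
AAASLFTIDS ILGSRQQGGG TAPSQGSHIS SNGNQNGLTS NGISLGLKRS
GAESPASPNS NSSSSAAASP IRPQRVPAML QHPGLHLGHL AAAAASGFAA
SPSDFLVAYP NFYPNYMHAA AVAHVAAAQM QAHVSGAAAG LSGHGHHPHH
PHGHPHHPHL GAHHHGQHHL SHLGHGPPPK RKRRHRTIFT EEQLEQLEAT
FDKTHYPDVV LREQLALKVD LKEERVEVWF KNRRAKWRKQ KREEQERLRK
LQEEQCGSTT NGTTNSSSGT TSSTGNGSLT VKCPGSDHYS AQLVHIKSDA
NGYSDADESS DLEVA
"""


def clean_sequence(raw: str) -> str:
    """Keep only residue letters; drops spaces, newlines and ruler digits."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def feature(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    return seq[start - 1:end]


def top_residue(region: str) -> str:
    """Return the single most frequent residue in a region."""
    return Counter(region).most_common(1)[0][0]


SEQ = clean_sequence(RAW)
HOMEOBOX = feature(SEQ, 282, 341)

# Compositional-bias regions as listed in the entry:
# name -> (start, end, one-letter code of the enriched residue)
BIAS_REGIONS = {
    "Gln-rich": (51, 77, "Q"),
    "Ala-rich": (191, 239, "A"),
    "His-rich": (244, 275, "H"),
}

if __name__ == "__main__":
    print("sequence length:", len(SEQ))
    print("homeobox:", HOMEOBOX)
    for name, (start, end, code) in BIAS_REGIONS.items():
        region = feature(SEQ, start, end)
        share = region.count(code) / len(region)
        print(f"{name}: {len(region)} residues, top residue "
              f"{top_residue(region)}, {code} fraction {share:.2f}")
```

Because `clean_sequence` discards anything that is not a letter, the block can be pasted with or without its position rulers.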

Sequence caution

The sequence AAB17948 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened. (Curated)
The sequence CAA64699 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened. (Curated)

Sequence databases

EMBL/GenBank/DDBJ:
  X95420 mRNA. Translation: CAA64699.1. Different initiation.
  U52968 mRNA. Translation: AAB17948.1. Different initiation.
  AE014134 Genomic DNA. Translation: AAF51473.2.
PIR: S70617.
RefSeq: NP_476949.2. NM_057601.3.

Genome annotation databases

GeneID: 33240.
KEGG: dme:Dmel_CG2851.

Cross-references

Organism-specific databases

CTD: 145258.
FlyBase: FBgn0010323. Gsc.

Miscellaneous databases

GenomeRNAi: 33240.
PRO: P54366.

Entry information

Entry name: GSC_DROME
Accession
Primary (citable) accession number: P54366
Secondary accession number(s): Q9VPR9
Entry history
Integrated into UniProtKB/Swiss-Prot: October 1, 1996
Last sequence update: March 16, 2016
Last modified: November 30, 2016
This is version 136 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence in the cluster (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
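
The identity and overlap cut-offs described above can be written as a simple membership check. This is only a sketch of the threshold test under stated assumptions, not the clustering procedure UniRef actually runs: the `Level` class, the `meets_threshold` function and the example identity and overlap values are hypothetical, and both inputs are assumed to be fractions already computed against the seed sequence.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Level:
    name: str
    min_identity: float  # minimum fraction of identical residues vs. the seed
    min_overlap: float   # minimum fraction of the seed covered by the match


# Thresholds as stated in the text. UniRef100 is omitted because it groups
# identical sequences and sub-fragments rather than applying a cut-off.
UNIREF90 = Level("UniRef90", 0.90, 0.80)
UNIREF50 = Level("UniRef50", 0.50, 0.80)


def meets_threshold(identity: float, overlap: float, level: Level) -> bool:
    """True if a sequence satisfies both cut-offs relative to the seed."""
    return identity >= level.min_identity and overlap >= level.min_overlap


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    for identity, overlap in [(0.93, 0.85), (0.93, 0.70), (0.60, 0.80)]:
        for level in (UNIREF90, UNIREF50):
            verdict = meets_threshold(identity, overlap, level)
            print(f"identity={identity:.2f} overlap={overlap:.2f} "
                  f"{level.name}: {verdict}")
```

Both conditions must hold at once: a high-identity match that covers too little of the seed fails the check.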