Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Homeobox protein prospero

Gene

pros

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed-Annotation score: -Experimental evidence at protein leveli

Functioni

Required for proper neuronal differentiation of most or all neurons and their precursors in central and peripheral nervous systems, axonal outgrowth and pathfinding. Not required for the specification of neuronal identity. May regulate transcription by binding to DNA.4 Publications

GO - Molecular functioni

  • DNA binding Source: FlyBase
  • DNA binding transcription factor activity Source: FlyBase
  • sequence-specific DNA binding Source: UniProtKB
  • transcriptional repressor activity, RNA polymerase II proximal promoter sequence-specific DNA binding Source: FlyBase

GO - Biological processi

  • asymmetric cell division Source: FlyBase
  • asymmetric neuroblast division Source: FlyBase
  • asymmetric neuroblast division resulting in ganglion mother cell formation Source: FlyBase
  • axon development Source: UniProtKB
  • axon guidance Source: FlyBase
  • axonogenesis Source: FlyBase
  • axonogenesis involved in innervation Source: FlyBase
  • brain development Source: FlyBase
  • cell dedifferentiation Source: UniProtKB
  • cell fate commitment Source: FlyBase
  • cell fate specification Source: FlyBase
  • central nervous system development Source: UniProtKB
  • compound eye cone cell fate commitment Source: FlyBase
  • courtship behavior Source: FlyBase
  • dendrite guidance Source: FlyBase
  • dendrite morphogenesis Source: FlyBase
  • eye photoreceptor cell differentiation Source: FlyBase
  • G1 to G0 transition Source: FlyBase
  • ganglion mother cell fate determination Source: FlyBase
  • glial cell differentiation Source: FlyBase
  • male courtship behavior Source: FlyBase
  • negative regulation of axonogenesis Source: FlyBase
  • negative regulation of cell proliferation Source: FlyBase
  • negative regulation of gene expression Source: FlyBase
  • negative regulation of neuroblast proliferation Source: FlyBase
  • negative regulation of transcription by RNA polymerase II Source: FlyBase
  • nervous system development Source: FlyBase
  • neuroblast differentiation Source: UniProtKB
  • neuroblast fate determination Source: FlyBase
  • neuroblast proliferation Source: FlyBase
  • peripheral nervous system development Source: UniProtKB
  • positive regulation of gene expression Source: FlyBase
  • positive regulation of neuroblast proliferation Source: UniProtKB
  • positive regulation of transcription by RNA polymerase II Source: UniProtKB
  • protein localization Source: FlyBase
  • R7 cell fate commitment Source: FlyBase
  • regulation of DNA binding transcription factor activity Source: UniProtKB
  • regulation of gene expression Source: UniProtKB
  • regulation of neuroblast proliferation Source: UniProtKB
  • regulation of neuron differentiation Source: UniProtKB
  • regulation of protein localization to nucleus Source: FlyBase
  • regulation of R7 cell differentiation Source: FlyBase
  • regulation of retinal cone cell fate specification Source: FlyBase
  • regulation of transcription, DNA-templated Source: FlyBase
  • sensory organ development Source: FlyBase
  • sensory perception of taste Source: FlyBase
  • synapse assembly Source: FlyBase
  • transcription, DNA-templated Source: UniProtKB-KW
  • ventral cord development Source: FlyBase

Keywordsi

Molecular functionDevelopmental protein, DNA-binding
Biological processTranscription, Transcription regulation

Names & Taxonomyi

Protein namesi
Recommended name:
Homeobox protein prospero
Gene namesi
Name:pros
ORF Names:CG17228
OrganismiDrosophila melanogaster (Fruit fly)
Taxonomic identifieri7227 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaArthropodaHexapodaInsectaPterygotaNeopteraHolometabolaDipteraBrachyceraMuscomorphaEphydroideaDrosophilidaeDrosophilaSophophora
Proteomesi
  • UP000000803 Componenti: Chromosome 3R

Organism-specific databases

FlyBaseiFBgn0004595 pros

Subcellular locationi

Extracellular region or secreted Cytosol Plasma membrane Cytoskeleton Lysosome Endosome Peroxisome ER Golgi apparatus Nucleus Mitochondrion Manual annotation Automatic computational assertionGraphics by Christian Stolte; Source: COMPARTMENTS

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Disruption phenotypei

Aberrant expression of the homeobox genes fushi tarazu, even-skipped and engrailed in a subset of neuroblast progeny.2 Publications

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00002088841 – 1703Homeobox protein prosperoAdd BLAST1703

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei479Phosphoserine1 Publication1
Modified residuei482Phosphoserine1 Publication1
Modified residuei485Phosphoserine1 Publication1
Modified residuei497Phosphoserine1 Publication1
Modified residuei651Phosphoserine1 Publication1
Modified residuei654Phosphoserine1 Publication1

Keywords - PTMi

Phosphoprotein

Proteomic databases

PaxDbiP29617
PRIDEiP29617

PTM databases

iPTMnetiP29617

Expressioni

Tissue specificityi

Expressed in embryonic neuronal precursors. Expressed in the developing CNS, lens-secreting cone cells of the eye, and midgut.3 Publications

Developmental stagei

Expressed during embryogenesis.1 Publication

Gene expression databases

BgeeiFBgn0004595
ExpressionAtlasiP29617 baseline and differential
GenevisibleiP29617 DM

Interactioni

Protein-protein interaction databases

BioGridi66496, 23 interactors
IntActiP29617, 3 interactors
STRINGi7227.FBpp0293146

Structurei

Secondary structure

11703
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Helixi1550 – 1559Combined sources10
Turni1560 – 1562Combined sources3
Helixi1568 – 1574Combined sources7
Helixi1582 – 1612Combined sources31
Helixi1628 – 1637Combined sources10
Helixi1647 – 1665Combined sources19
Helixi1668 – 1670Combined sources3
Turni1672 – 1675Combined sources4
Helixi1676 – 1683Combined sources8
Helixi1691 – 1694Combined sources4
Helixi1696 – 1699Combined sources4

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
1MIJX-ray2.05A1545-1696[»]
1XPXX-ray2.80A1541-1703[»]
ProteinModelPortaliP29617
SMRiP29617
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiP29617

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini1545 – 1603Prospero-type homeoPROSITE-ProRule annotationAdd BLAST59
Domaini1604 – 1703ProsperoPROSITE-ProRule annotationAdd BLAST100

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni1545 – 1703Homeo-ProsperoPROSITE-ProRule annotationAdd BLAST159

Motif

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Motifi1291 – 1298Nuclear localization signal8

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi516 – 564Gln-richAdd BLAST49
Compositional biasi570 – 586Asn-richAdd BLAST17
Compositional biasi618 – 654Ser-richAdd BLAST37
Compositional biasi1000 – 1348Gln-richAdd BLAST349
Compositional biasi1427 – 1437His-richAdd BLAST11

Domaini

The Prospero-type homeodomain and the adjacent Prospero domain act as a single structural unit, the Homeo-Prospero domain.PROSITE-ProRule annotation2 Publications

Sequence similaritiesi

Belongs to the Prospero homeodomain family.PROSITE-ProRule annotation

Keywords - Domaini

Homeobox

Phylogenomic databases

eggNOGiKOG3779 Eukaryota
ENOG410ZE21 LUCA
GeneTreeiENSGT00530000063507
InParanoidiP29617
KOiK20211
OMAiSMMLDQD
OrthoDBiEOG091G0GHY

Family and domain databases

Gene3Di1.10.10.500, 1 hit
InterProiView protein in InterPro
IPR023082 Homeo_prospero_dom
IPR037131 Homeo_prospero_dom_sf
IPR009057 Homeobox-like_sf
PfamiView protein in Pfam
PF05044 HPD, 1 hit
SUPFAMiSSF46689 SSF46689, 1 hit
PROSITEiView protein in PROSITE
PS51818 HOMEO_PROSPERO, 1 hit

Sequences (3)i

Sequence statusi: Complete.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform F (identifier: P29617-1) [UniParc]FASTAAdd to basket
Also known as: L

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MMSSEEYEAD CFGLYSDENN VLLKANEPET TAATKQQHQP QFQQQQQQQQ
60 70 80 90 100
QQQQHHHQHQ QPPPSSNGHT SPIPSQVNGD GSAANCGESG KTNTNTGSHS
110 120 130 140 150
HSNSGNTNTD ADKEQEREKA KEKANEEESE DSDDDVVVVL EGCEGNASSS
160 170 180 190 200
SSSSNSNSNA SSNNHKAAAT TTTTATTANH CNRSGGSSRS HRSARSSRQI
210 220 230 240 250
SQSTAVGKTT TCAAKKPTAA QATTTTAKNS NSNSNVNVNV NVNSNGSGNG
260 270 280 290 300
NANSKVNRRS RQRSLSKDIN NQPASSNSNS NSSNNSSNSN GGATATAAGF
310 320 330 340 350
MSSAAAAAAG AAGGGALFQP QSVSTANSSS SNNNNSSTPA ALATHSPTSN
360 370 380 390 400
SPVSGASSAS SLLTAAFGNL FGGSSAKMLN ELFGRQMKQA QDATSGLPQS
410 420 430 440 450
LDNAMLAAAM ETATSAELLI GSLNSTSKLL QQQHNNNSIA PANSTPMSNG
460 470 480 490 500
TNASISPGSA HSSSHSHQGV SPKGSRRVSA CSDRSLEAAA ADVAGGSPPR
510 520 530 540 550
AASVSSLNGG ASSGEQHQSQ LQHDLVAHHM LRNILQGKKE LMQLDQELRT
560 570 580 590 600
AMQQQQQQLQ EKEQLHSKLN NNNNNNIAAT ANNNNNTTME SINLIDDSEM
610 620 630 640 650
ADIKIKSEPQ TAPQPQQSPH GSSHSSRSGS GSGSHSSMAS DGSLRRKSSD
660 670 680 690 700
SLDSHGAQDD AQDEEDAAPT GQRSESRAPE EPQLPTKKES VDDMLDEVEL
710 720 730 740 750
LGLHSRGSDM DSLASPSHSD MMLLDKDDVL DEDDDDDCVE QKTSGSGCLK
760 770 780 790 800
KPGMDLKRAR VENIVSGMRC SPSSGLAQAG QLQVNGCKKR KLYQPQQHAM
810 820 830 840 850
ERYVAAAAGL NFGLNLQSMM LDQEDSESNE LESPQIQQKR VEKNALKSQL
860 870 880 890 900
RSMQEQLAEM QQKYVQLCSR MEQESECQEL DQDQDVEQEQ EPDNGSSDHI
910 920 930 940 950
ELSPSPTLTG DGDVSPNHKE ETGQERPGSS SPSPSPLKPK TSLGESSDSG
960 970 980 990 1000
ANMLSQMMSK MMSGKLHNPL VGVGHPALPQ GFPPLLQHMG DMSHAAAMYQ
1010 1020 1030 1040 1050
QFFFEQEARM AKEAAEQQQQ QQQQQQQQQQ QQQQEQQRRF EQEQQEQQRR
1060 1070 1080 1090 1100
KEEQQQQIQR QQQHLQQLQQ QQMEQQHVAT AAPRPQMHHP APARLPTRMG
1110 1120 1130 1140 1150
GAAGHTALKS ELSEKFQMLR ANNNSSMMRM SGTDLEGLAD VLKSEITTSL
1160 1170 1180 1190 1200
SALVDTIVTR FVHQRRLFSK QADSVTAAAE QLNKDLLLAS QILDRKSPRT
1210 1220 1230 1240 1250
KVADRPQNGP TPATQSAAAM FQAPKTPQGM NPVAAAALYN SMTGPFCLPP
1260 1270 1280 1290 1300
DQQQQQQTAQ QQQSAQQQQQ SSQQTQQQLE QNEALSLVVT PKKKRHKVTD
1310 1320 1330 1340 1350
TRITPRTVSR ILAQDGVVPP TGGPPSTPQQ QQQQQQQQQQ QQQQQQQQAS
1360 1370 1380 1390 1400
NGGNSNATPA QSPTRSSGGA AYHPQPPPPP PPMMPVSLPT SVAIPNPSLH
1410 1420 1430 1440 1450
ESKVFSPYSP FFNPHAAAGQ ATAAQLHQHH QQHHPHHQSM QLSSSPPGSL
1460 1470 1480 1490 1500
GALMDSRDSP PLPHPPSMLH PALLAAAHHG GSPDYKTCLR AVMDAQDRQS
1510 1520 1530 1540 1550
ECNSADMQFD GMAPTISFYK QMQLKTEHQE SLMAKHCESL TPLHSSTLTP
1560 1570 1580 1590 1600
MHLRKAKLMF FWVRYPSSAV LKMYFPDIKF NKNNTAQLVK WFSNFREFYY
1610 1620 1630 1640 1650
IQMEKYARQA VTEGIKTPDD LLIAGDSELY RVLNLHYNRN NHIEVPQNFR
1660 1670 1680 1690 1700
FVVESTLREF FRAIQGGKDT EQSWKKSIYK IISRMDDPVP EYFKSPNFLE

QLE
Length:1,703
Mass (Da):184,980
Last modified:March 24, 2009 - v3
Checksum:i35040F15C1CBB1D8
GO
Isoform E (identifier: P29617-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1216-1216: S → SGNNGSLLLA...PPPHIRPSPT
     1516-1544: Missing.

Note: No experimental confirmation available.
Show »
Length:1,835
Mass (Da):198,422
Checksum:i9C9F48EEEF8D2F3A
GO
Isoform G (identifier: P29617-4) [UniParc]FASTAAdd to basket
Also known as: S

The sequence of this isoform differs from the canonical sequence as follows:
     1516-1544: Missing.

Note: No experimental confirmation available.
Show »
Length:1,674
Mass (Da):181,539
Checksum:i7B55B08222C636A6
GO

Sequence cautioni

The sequence AAA28841 differs from that shown. Reason: Frameshift at positions 179 and 231.Curated
The sequence AAL28228 differs from that shown. Reason: Frameshift at position 1422.Curated
The sequence AAL28228 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened.Curated
The sequence AAT94492 differs from that shown. Reason: Erroneous initiation.Curated
The sequence BAA01464 differs from that shown. Reason: Erroneous initiation.Curated
The sequence CAA77802 differs from that shown. Reason: Erroneous initiation.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti49 – 57QQQQQQHHH → HHQ in AAA28841 (PubMed:1720353).Curated9
Sequence conflicti124 – 135ANEEE…DSDDD → RTKRSPRIPTTT in AAA28841 (PubMed:1720353).CuratedAdd BLAST12
Sequence conflicti175A → G in AAA28841 (PubMed:1720353).Curated1
Sequence conflicti230S → L in AAA28841 (PubMed:1720353).Curated1
Sequence conflicti283S → R in AAA28841 (PubMed:1720353).Curated1
Sequence conflicti293A → R in AAA28841 (PubMed:1720353).Curated1
Sequence conflicti376 – 398AKMLN…TSGLP → GQDAERAVWPPDEAGPGRNE WPA in AAA28841 (PubMed:1720353).CuratedAdd BLAST23
Sequence conflicti376 – 398AKMLN…TSGLP → GQDAERAVWPPDEAGPGRNE WPA in AAF05703 (PubMed:11051550).CuratedAdd BLAST23
Sequence conflicti420 – 444IGSLN…APANS → NLALQFHVQVAAAAAITTAL LPPIG in AAA28841 (PubMed:1720353).CuratedAdd BLAST25
Sequence conflicti420 – 444IGSLN…APANS → NLALQFHVQVAAAAAITTAL LPPIG in AAF05703 (PubMed:11051550).CuratedAdd BLAST25
Sequence conflicti718H → Q in BAA01464 (PubMed:1540176).Curated1
Sequence conflicti977A → C in AAA28841 (PubMed:1720353).Curated1
Sequence conflicti1102A → S in AAA28841 (PubMed:1720353).Curated1
Sequence conflicti1258T → S in AAA28841 (PubMed:1720353).Curated1
Sequence conflicti1348Q → QQQQQ in AAA28841 (PubMed:1720353).Curated1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0023071216S → SGNNGSLLLANSQMPSQTTA SGSSAQQQQQQNAQQQHQGS QQQQQQNVVQQQNVAQQQHM QQQQQQQQSHPLLPPNCQQL ISAPRLNGSQLSFASPAAAA AAAMGLQMHHAAAAAAMSAQ QQQQQSGDPGANTNPNSGPA NPTNSSLSTLNIPPPHIRPS PT in isoform E. 1 Publication1
Alternative sequenceiVSP_0023081516 – 1544Missing in isoform E and isoform G. 2 PublicationsAdd BLAST29

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M81389 mRNA Translation: AAA28841.1 Frameshift.
AE014297 Genomic DNA Translation: AAF54628.3
AE014297 Genomic DNA Translation: AAN13500.3
AE014297 Genomic DNA Translation: AAN13501.3
Z11743 mRNA Translation: CAA77802.1 Different initiation.
D10609 mRNA Translation: BAA01464.1 Different initiation.
BT015263 mRNA Translation: AAT94492.1 Different initiation.
AF190403 Genomic DNA Translation: AAF05703.1
AY060680 mRNA Translation: AAL28228.1 Sequence problems.
PIRiS24548
RefSeqiNP_001247046.1, NM_001260117.2 [P29617-1]
NP_524317.4, NM_079593.6
NP_731565.4, NM_169386.3
NP_788636.3, NM_176459.4
UniGeneiDm.7116

Genome annotation databases

EnsemblMetazoaiFBtr0304603; FBpp0293145; FBgn0004595 [P29617-4]
FBtr0304604; FBpp0293146; FBgn0004595 [P29617-2]
FBtr0304605; FBpp0293147; FBgn0004595
FBtr0304608; FBpp0293150; FBgn0004595 [P29617-1]
GeneIDi41363
KEGGidme:Dmel_CG17228

Keywords - Coding sequence diversityi

Alternative splicing

Similar proteinsi

Entry informationi

Entry nameiPROS_DROME
AccessioniPrimary (citable) accession number: P29617
Secondary accession number(s): Q6AWI5
, Q95SP0, Q9U6A2, Q9VGP8
Entry historyiIntegrated into UniProtKB/Swiss-Prot: April 1, 1993
Last sequence update: March 24, 2009
Last modified: May 23, 2018
This is version 183 of the entry and version 3 of the sequence. See complete history.
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programDrosophila annotation project

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Reference proteome

Cookie policy

We would like to use anonymized google analytics cookies to gather statistics on how uniprot.org is used in aggregate. Learn more

UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health