Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Homeotic protein bicoid

Gene

bcd

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Segment polarity protein that provides positional cues for the development of head and thoracic segments. Regulates the expression of zygotic genes, possibly through its homeodomain, and inhibits the activity of other maternal gene products. May also bind RNA. Interacts with Bin1 to repress transcription of bicoid target genes in the anterior tip of the embryo; a process known as retraction.

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
DNA bindingi97 – 156HomeoboxPROSITE-ProRule annotationAdd BLAST60

GO - Molecular functioni

  • morphogen activity Source: FlyBase
  • mRNA 3'-UTR binding Source: FlyBase
  • sequence-specific DNA binding Source: InterPro
  • transcription factor activity, RNA polymerase II distal enhancer sequence-specific binding Source: FlyBase
  • transcription factor activity, sequence-specific DNA binding Source: FlyBase
  • translation regulator activity Source: FlyBase
  • translation repressor activity Source: FlyBase

GO - Biological processi

  • anterior/posterior axis specification Source: FlyBase
  • anterior/posterior axis specification, embryo Source: FlyBase
  • anterior region determination Source: FlyBase
  • maternal determination of anterior/posterior axis, embryo Source: FlyBase
  • negative regulation of translation Source: FlyBase
  • oogenesis Source: FlyBase
  • positive regulation of transcription from RNA polymerase II promoter Source: FlyBase
  • regulation of transcription, DNA-templated Source: FlyBase
  • regulation of transcription from RNA polymerase II promoter Source: FlyBase
  • regulation of translation Source: FlyBase
  • segment polarity determination Source: FlyBase
  • transcription, DNA-templated Source: UniProtKB-KW
Complete GO annotation...

Keywords - Molecular functioni

Developmental protein

Keywords - Biological processi

Transcription, Transcription regulation

Keywords - Ligandi

DNA-binding, RNA-binding

Enzyme and pathway databases

SignaLinkiP09081.

Names & Taxonomyi

Protein namesi
Recommended name:
Homeotic protein bicoid
Alternative name(s):
PRD-4
Gene namesi
Name:bcd
ORF Names:CG1034
OrganismiDrosophila melanogaster (Fruit fly)
Taxonomic identifieri7227 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaArthropodaHexapodaInsectaPterygotaNeopteraEndopterygotaDipteraBrachyceraMuscomorphaEphydroideaDrosophilidaeDrosophilaSophophora
Proteomesi
  • UP000000803 Componenti: Chromosome 3R

Organism-specific databases

FlyBaseiFBgn0000166. bcd.

Subcellular locationi

GO - Cellular componenti

  • nucleus Source: FlyBase
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000490141 – 494Homeotic protein bicoidAdd BLAST494

Proteomic databases

PaxDbiP09081.
PRIDEiP09081.

Expressioni

Tissue specificityi

Maternal expression is an anterior cap concentrated in the cortical cytoplasm.

Developmental stagei

Expressed both maternally and zygotically.

Gene expression databases

BgeeiFBgn0000166.
GenevisibleiP09081. DM.

Interactioni

Subunit structurei

Interacts with Bin1; in vitro and yeast cells. Interacts with bin3.2 Publications

Binary interactionsi

WithEntry#Exp.IntActNotes
Bin1Q9VEX92EBI-196628,EBI-129424
bin3Q7K4804EBI-196628,EBI-180984
smt3O971023EBI-196628,EBI-114439

Protein-protein interaction databases

BioGridi66028. 63 interactors.
IntActiP09081. 30 interactors.
MINTiMINT-302540.
STRINGi7227.FBpp0081168.

Structurei

Secondary structure

1494
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Helixi106 – 116Combined sources11
Helixi124 – 134Combined sources11
Helixi138 – 157Combined sources20

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
1ZQ3NMR-P97-163[»]
ProteinModelPortaliP09081.
SMRiP09081.
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiP09081.

Family & Domainsi

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni433 – 440RNA-bindingBy similarity8

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi12 – 40His/Pro-rich (PRD motif)Add BLAST29
Compositional biasi260 – 294Gln/His-rich (OPA repeat)Add BLAST35

Sequence similaritiesi

Contains 1 homeobox DNA-binding domain.PROSITE-ProRule annotation

Keywords - Domaini

Homeobox

Phylogenomic databases

eggNOGiENOG410KBMY. Eukaryota.
ENOG4110N46. LUCA.
InParanoidiP09081.
KOiK18659.
OMAiQFAYCFN.
OrthoDBiEOG091G0CRZ.
PhylomeDBiP09081.

Family and domain databases

Gene3Di1.10.10.60. 1 hit.
InterProiIPR017970. Homeobox_CS.
IPR001356. Homeobox_dom.
IPR009057. Homeodomain-like.
[Graphical view]
PfamiPF00046. Homeobox. 1 hit.
[Graphical view]
SMARTiSM00389. HOX. 1 hit.
[Graphical view]
SUPFAMiSSF46689. SSF46689. 1 hit.
PROSITEiPS00027. HOMEOBOX_1. 1 hit.
PS50071. HOMEOBOX_2. 1 hit.
[Graphical view]

Sequences (5)i

Sequence statusi: Complete.

This entry describes 5 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform G (identifier: P09081-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MAQPPPDQNF YHHPLPHTHT HPHPHSHPHP HSHPHPHHQH PQLQLPPQFR
60 70 80 90 100
NPFDLLFDER TGAINYNYIR PYLPNQMPKP DVFPSEELPD SLVMRRPRRT
110 120 130 140 150
RTTFTSSQIA ELEQHFLQGR YLTAPRLADL SAKLALGTAQ VKIWFKNRRR
160 170 180 190 200
RHKIQSDQHK DQSYEGMPLS PGMKQSDGDP PSLQTLSLGG GATPNALTPS
210 220 230 240 250
PTPSTPTAHM TEHYSESFNA YYNYNGGHNH AQANRHMHMQ YPSGGGPGPG
260 270 280 290 300
STNVNGGQFF QQQQVHNHQQ QLHHQGNHVP HQMQQQQQQA QQQQYHHFDF
310 320 330 340 350
QQKQASACRV LVKDEPEADY NFNSSYYMRS GMSGATASAS AVARGAASPG
360 370 380 390 400
SEVYEPLTPK NDESPSLCGI GIGGPCAIAV GETEAADDMD DGTSKKTTLQ
410 420 430 440 450
ILEPLKGLDK SCDDGSSDDM STGIRALAGT GNRGAAFAKF GKPSPPQGPQ
460 470 480 490
PPLGMGGVAM GESNQYQCTM DTIMQAYNPH RNAAGNSQFA YCFN
Length:494
Mass (Da):54,511
Last modified:May 10, 2004 - v3
Checksum:i561D8509D5C11FD3
GO
Isoform A (identifier: P09081-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     56-400: Missing.

Show »
Length:149
Mass (Da):16,363
Checksum:i9ECE6842A7DB862A
GO
Isoform D (identifier: P09081-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     81-85: Missing.

Show »
Length:489
Mass (Da):53,966
Checksum:i81829ACF03419B86
GO
Isoform E (identifier: P09081-4) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-76: Missing.

Note: No experimental confirmation available.
Show »
Length:418
Mass (Da):45,438
Checksum:i04D4C70D6750DB39
GO
Isoform F (identifier: P09081-5) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-76: Missing.
     81-85: Missing.

Note: No experimental confirmation available.
Show »
Length:413
Mass (Da):44,893
Checksum:i204670A1A666802F
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti298F → S in CAB37631 (PubMed:2901954).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural varianti284Q → H in strain: Z362. 1
Natural varianti317E → K in strain: Z229. 1
Natural varianti337A → S in strain: Z95, Z197 and Z229. 1
Natural varianti438A → P in strain: Z184, Z210 and Z216. 1
Natural varianti458V → L in strain: Z157. 1
Natural varianti460M → L in strain: Oregon-R, Z145, Z266, Z346 and Z398. 1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0272031 – 76Missing in isoform E and isoform F. CuratedAdd BLAST76
Alternative sequenceiVSP_00223556 – 400Missing in isoform A. 1 PublicationAdd BLAST345
Alternative sequenceiVSP_00223481 – 85Missing in isoform D and isoform F. 2 Publications5

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
X07870 Genomic DNA. Translation: CAA30720.1.
X14458 mRNA. Translation: CAA32627.1.
X14459 mRNA. Translation: CAB37631.1.
X14460 mRNA. Translation: CAA32629.1.
AF466621 Genomic DNA. Translation: AAL77008.1.
AF466622 Genomic DNA. Translation: AAL77009.1.
AF466623 Genomic DNA. Translation: AAL77010.1.
AF466624 Genomic DNA. Translation: AAL77011.1.
AF466625 Genomic DNA. Translation: AAL77012.1.
AF466626 Genomic DNA. Translation: AAL77013.1.
AF466627 Genomic DNA. Translation: AAL77014.1.
AF466628 Genomic DNA. Translation: AAL77015.1.
AF466629 Genomic DNA. Translation: AAL77016.1.
AF466630 Genomic DNA. Translation: AAL77017.1.
AF466631 Genomic DNA. Translation: AAL77018.1.
AF466632 Genomic DNA. Translation: AAL77019.1.
AF466633 Genomic DNA. Translation: AAL77020.1.
AF466634 Genomic DNA. Translation: AAL77021.1.
AF466635 Genomic DNA. Translation: AAL77022.1.
AF466636 Genomic DNA. Translation: AAL77023.1.
AF466637 Genomic DNA. Translation: AAL77024.1.
AF466638 Genomic DNA. Translation: AAL77025.1.
AF466639 Genomic DNA. Translation: AAL77026.1.
AF466640 Genomic DNA. Translation: AAL77027.1.
AF466641 Genomic DNA. Translation: AAL77028.1.
AF466642 Genomic DNA. Translation: AAL77029.1.
AF466643 Genomic DNA. Translation: AAL77030.1.
AF466644 Genomic DNA. Translation: AAL77031.1.
AF466645 Genomic DNA. Translation: AAL77032.1.
AE001572 Genomic DNA. Translation: AAD19798.1.
AE014297 Genomic DNA. Translation: AAF54085.2.
AE014297 Genomic DNA. Translation: AAN13368.1.
AE014297 Genomic DNA. Translation: AAN13369.1.
AE014297 Genomic DNA. Translation: AAN13371.2.
AE014297 Genomic DNA. Translation: AAO41514.1.
AY058658 mRNA. Translation: AAL13887.1.
BT021332 mRNA. Translation: AAX33480.1.
M14549 Genomic DNA. Translation: AAA28385.1.
K03517 mRNA. Translation: AAA28391.1.
PIRiS00835. WJFFBC.
RefSeqiNP_476825.1. NM_057477.5. [P09081-3]
NP_731111.1. NM_169157.3. [P09081-2]
NP_731113.2. NM_169159.4. [P09081-4]
NP_788587.1. NM_176410.3. [P09081-1]
NP_788588.1. NM_176411.3. [P09081-5]
UniGeneiDm.3237.

Genome annotation databases

EnsemblMetazoaiFBtr0081668; FBpp0081168; FBgn0000166. [P09081-1]
GeneIDi40830.
KEGGidme:Dmel_CG1034.

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
X07870 Genomic DNA. Translation: CAA30720.1.
X14458 mRNA. Translation: CAA32627.1.
X14459 mRNA. Translation: CAB37631.1.
X14460 mRNA. Translation: CAA32629.1.
AF466621 Genomic DNA. Translation: AAL77008.1.
AF466622 Genomic DNA. Translation: AAL77009.1.
AF466623 Genomic DNA. Translation: AAL77010.1.
AF466624 Genomic DNA. Translation: AAL77011.1.
AF466625 Genomic DNA. Translation: AAL77012.1.
AF466626 Genomic DNA. Translation: AAL77013.1.
AF466627 Genomic DNA. Translation: AAL77014.1.
AF466628 Genomic DNA. Translation: AAL77015.1.
AF466629 Genomic DNA. Translation: AAL77016.1.
AF466630 Genomic DNA. Translation: AAL77017.1.
AF466631 Genomic DNA. Translation: AAL77018.1.
AF466632 Genomic DNA. Translation: AAL77019.1.
AF466633 Genomic DNA. Translation: AAL77020.1.
AF466634 Genomic DNA. Translation: AAL77021.1.
AF466635 Genomic DNA. Translation: AAL77022.1.
AF466636 Genomic DNA. Translation: AAL77023.1.
AF466637 Genomic DNA. Translation: AAL77024.1.
AF466638 Genomic DNA. Translation: AAL77025.1.
AF466639 Genomic DNA. Translation: AAL77026.1.
AF466640 Genomic DNA. Translation: AAL77027.1.
AF466641 Genomic DNA. Translation: AAL77028.1.
AF466642 Genomic DNA. Translation: AAL77029.1.
AF466643 Genomic DNA. Translation: AAL77030.1.
AF466644 Genomic DNA. Translation: AAL77031.1.
AF466645 Genomic DNA. Translation: AAL77032.1.
AE001572 Genomic DNA. Translation: AAD19798.1.
AE014297 Genomic DNA. Translation: AAF54085.2.
AE014297 Genomic DNA. Translation: AAN13368.1.
AE014297 Genomic DNA. Translation: AAN13369.1.
AE014297 Genomic DNA. Translation: AAN13371.2.
AE014297 Genomic DNA. Translation: AAO41514.1.
AY058658 mRNA. Translation: AAL13887.1.
BT021332 mRNA. Translation: AAX33480.1.
M14549 Genomic DNA. Translation: AAA28385.1.
K03517 mRNA. Translation: AAA28391.1.
PIRiS00835. WJFFBC.
RefSeqiNP_476825.1. NM_057477.5. [P09081-3]
NP_731111.1. NM_169157.3. [P09081-2]
NP_731113.2. NM_169159.4. [P09081-4]
NP_788587.1. NM_176410.3. [P09081-1]
NP_788588.1. NM_176411.3. [P09081-5]
UniGeneiDm.3237.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
1ZQ3NMR-P97-163[»]
ProteinModelPortaliP09081.
SMRiP09081.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi66028. 63 interactors.
IntActiP09081. 30 interactors.
MINTiMINT-302540.
STRINGi7227.FBpp0081168.

Proteomic databases

PaxDbiP09081.
PRIDEiP09081.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaiFBtr0081668; FBpp0081168; FBgn0000166. [P09081-1]
GeneIDi40830.
KEGGidme:Dmel_CG1034.

Organism-specific databases

CTDi40830.
FlyBaseiFBgn0000166. bcd.

Phylogenomic databases

eggNOGiENOG410KBMY. Eukaryota.
ENOG4110N46. LUCA.
InParanoidiP09081.
KOiK18659.
OMAiQFAYCFN.
OrthoDBiEOG091G0CRZ.
PhylomeDBiP09081.

Enzyme and pathway databases

SignaLinkiP09081.

Miscellaneous databases

EvolutionaryTraceiP09081.
GenomeRNAii40830.
PROiP09081.

Gene expression databases

BgeeiFBgn0000166.
GenevisibleiP09081. DM.

Family and domain databases

Gene3Di1.10.10.60. 1 hit.
InterProiIPR017970. Homeobox_CS.
IPR001356. Homeobox_dom.
IPR009057. Homeodomain-like.
[Graphical view]
PfamiPF00046. Homeobox. 1 hit.
[Graphical view]
SMARTiSM00389. HOX. 1 hit.
[Graphical view]
SUPFAMiSSF46689. SSF46689. 1 hit.
PROSITEiPS00027. HOMEOBOX_1. 1 hit.
PS50071. HOMEOBOX_2. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiBCD_DROME
AccessioniPrimary (citable) accession number: P09081
Secondary accession number(s): Q5BI92
, Q86BA9, Q86BP2, Q8INR7, Q8ST46, Q8STB1, Q8T9S9, Q8T9T0, Q8T9T1, Q95TN3, Q9UAM0, Q9VI47
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 1, 1988
Last sequence update: May 10, 2004
Last modified: November 2, 2016
This is version 177 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programDrosophila annotation project

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.