Protein

Homeobox protein caupolican

Gene

caup

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed. Annotation score: 4 out of 5. Experimental evidence at transcript level.

Function

Controls proneural and vein-forming genes. Positive transcriptional controller of ac-sc (achaete-scute). May act as an activator that interacts with the transcriptional complex assembled on the ac and sc promoters and participates in transcription initiation. (1 publication)

Regions

Feature key    Position(s)    Length    Description
DNA binding    226 – 288      63        Homeobox; TALE-type (PROSITE-ProRule annotation)

GO - Molecular function

  • sequence-specific DNA binding Source: InterPro
  • transcription factor activity, RNA polymerase II distal enhancer sequence-specific binding Source: FlyBase
  • transcription factor activity, sequence-specific DNA binding Source: FlyBase

GO - Biological process

  • compound eye morphogenesis Source: FlyBase
  • equator specification Source: FlyBase
  • imaginal disc-derived wing vein specification Source: FlyBase
  • muscle cell fate commitment Source: FlyBase
  • negative regulation of growth Source: FlyBase
  • phagocytosis Source: FlyBase
  • positive regulation of transcription from RNA polymerase II promoter Source: FlyBase
  • regulation of mitotic cell cycle Source: FlyBase
  • regulation of transcription from RNA polymerase II promoter Source: FlyBase
  • transcription, DNA-templated Source: UniProtKB-KW

Keywords - Molecular function

Activator, Developmental protein

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name: Homeobox protein caupolican
Gene names
Name: caup
ORF Names: CG10605
Organism: Drosophila melanogaster (Fruit fly)
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota › Metazoa › Ecdysozoa › Arthropoda › Hexapoda › Insecta › Pterygota › Neoptera › Endopterygota › Diptera › Brachycera › Muscomorpha › Ephydroidea › Drosophilidae › Drosophila › Sophophora
Proteomes
  • UP000000803. Component: Chromosome 3L

Organism-specific databases

FlyBase: FBgn0015919. caup.

Subcellular location

GO - Cellular component

  • nucleus Source: FlyBase

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key    Position(s)    Length    Description                    Feature identifier
Chain          1 – 693        693       Homeobox protein caupolican    PRO_0000048845

Proteomic databases

PaxDb: P54269.
PRIDE: P54269.

Expression

Gene expression databases

Bgee: FBgn0015919.
Genevisible: P54269. DM.

Interaction

Protein-protein interaction databases

BioGrid: 64789. 3 interactions.
DIP: DIP-18539N.
IntAct: P54269. 2 interactions.
MINT: MINT-945509.
STRING: 7227.FBpp0075641.

Structure

3D structure databases

ProteinModelPortal: P54269.
SMR: P54269. Positions 235-284.

Family & Domains

Compositional bias

Feature key           Position(s)    Length    Description
Compositional bias    300 – 303      4         Poly-Asp
Compositional bias    405 – 418      14        Poly-Gln
Compositional bias    501 – 516      16        Poly-Gln
Compositional bias    517 – 528      12        Poly-His
Compositional bias    565 – 572      8         Poly-Ser
Compositional bias    613 – 624      12        Poly-Ser
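
The positions in the compositional-bias table are 1-based and inclusive, and an annotated region need not be an uninterrupted run of one residue. As a minimal sketch, the Poly-Gln region 405 – 418 can be sliced out and its glutamine content counted. The helper name `region` and the choice to embed only residues 401-450 of the sequence are assumptions made for illustration; they are not part of the UniProt entry.

```python
from collections import Counter

# Residues 401-450 of P54269, copied from the sequence listing of this entry.
# Only this window is embedded here (an illustrative assumption).
SEGMENT_START = 401
SEGMENT = "".join("MQHHQQQQQQ QQNQQQLQHH QMDQPYYHPG GYGQEESGEF AAQKNPLSRD".split())


def region(start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from the embedded window."""
    last = SEGMENT_START + len(SEGMENT) - 1
    if start < SEGMENT_START or end > last or start > end:
        raise ValueError("requested positions fall outside the embedded window")
    return SEGMENT[start - SEGMENT_START : end - SEGMENT_START + 1]


poly_gln = region(405, 418)
counts = Counter(poly_gln)
print(poly_gln, len(poly_gln), counts["Q"])
```

The same slicing convention (subtract the window start, add one to the inclusive end) applies to any other feature coordinates in this entry, provided the embedded window covers them.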

Sequence similarities

Belongs to the TALE/IRO homeobox family. (Curated)
Contains 1 homeobox DNA-binding domain. (PROSITE-ProRule annotation)

Keywords - Domain

Homeobox

Phylogenomic databases

eggNOG: KOG0773. Eukaryota.
  ENOG410XPMQ. LUCA.
GeneTree: ENSGT00750000117365.
InParanoid: P54269.
OMA: AFYAPLS.
OrthoDB: EOG091G0N9R.
PhylomeDB: P54269.

Family and domain databases

Gene3D: 1.10.10.60. 1 hit.
InterPro: IPR017970. Homeobox_CS.
  IPR001356. Homeobox_dom.
  IPR008422. Homeobox_KN_domain.
  IPR009057. Homeodomain-like.
  IPR003893. Iroquois_homeo.
Pfam: PF05920. Homeobox_KN. 1 hit.
SMART: SM00389. HOX. 1 hit.
  SM00548. IRO. 1 hit.
SUPFAM: SSF46689. SSF46689. 1 hit.
PROSITE: PS00027. HOMEOBOX_1. 1 hit.
  PS50071. HOMEOBOX_2. 1 hit.

Sequence

Sequence status: Complete.

P54269-1 [UniParc]

        10         20         30         40         50
MAAYAQFGYA GYPTANQLTT ANTDSQSGHG GGSPLSGTNE ASLSPSGGST
        60         70         80         90        100
ATGLTAGPLS PGAVSQSSHH AGHKGLSTSP AEDVVGGDVP VGLSSAAQDL
       110        120        130        140        150
PSRGSCCENG RPIITDPVSG QTVCSCQYDP ARLAIGGYSR MALPSGGVGV
       160        170        180        190        200
GVYGGPYPSN EQNPYPSIGV DNSAFYAPLS NPYGIKDTSP STEMSAWTSA
       210        220        230        240        250
SLQSTTGYYS YDPTLAAYGY GPNYDLAARR KNATRESTAT LKAWLSEHKK
       260        270        280        290        300
NPYPTKGEKI MLAIITKMTL TQVSTWFANA RRRLKKENKM TWEPKNKTED
       310        320        330        340        350
DDDGMMSDDE KEKDAGDGGK LSTEAFDPGN QLIKSELGKA EKEVDSSGDQ
       360        370        380        390        400
KLDLDREPHN LVAMRGLAPY ATPPGAHPMH AAYSSYAQSH NTHTHPHPQQ
       410        420        430        440        450
MQHHQQQQQQ QQNQQQLQHH QMDQPYYHPG GYGQEESGEF AAQKNPLSRD
       460        470        480        490        500
CGIPVPASKP KIWSVADTAA CKTPPPTAAY LGQNFYPPSS ADQQLPHQPL
       510        520        530        540        550
QQHQQQQLQQ LQQQQQHHHH PHHHHPHHSM ELGSPLSMMS SYAGGSPYSR
       560        570        580        590        600
IPTAYTEAMG MHLPSSSSSS SSTGKLPPTH IHPAPQRVGF PEIQPDTPPQ
       610        620        630        640        650
TPPTMKLNSS GGSSSSSGSS HSSSMHSVTP VTVASMVNIL YSNTDSGYGH
       660        670        680        690
GHSHGHGHGH GHGLGHGHGL GHGHGHMGVT SNAYLTEGGR SGS
Length: 693
Mass (Da): 73,668
Last modified: August 14, 2001 - v2
Checksum: FBEB1616493F7EC9

Experimental Info

Feature key          Position(s)    Length    Description
Sequence conflict    106 – 106      1         C → R in CAA64485 (PubMed:8620542). Curated
Sequence conflict    316 – 316      1         G → A in CAA64485 (PubMed:8620542). Curated
Sequence conflict    518 – 518      1         H → N in AAV36871 (Ref. 4). Curated
Sequence conflict    678 – 678      1         G → A in CAA64485 (PubMed:8620542). Curated
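
Each conflict listed here is a single-residue difference between the canonical sequence and another submitted sequence. A minimal sketch of how such a record could be checked against the canonical residue and then applied is given below. The function name `apply_conflict` and the embedded window (residues 101-150 only) are assumptions made for illustration and are not part of the UniProt entry.

```python
# Residues 101-150 of the canonical sequence, copied from the sequence listing.
# Only this window is embedded here (an illustrative assumption).
WINDOW_START = 101
WINDOW = "".join("PSRGSCCENG RPIITDPVSG QTVCSCQYDP ARLAIGGYSR MALPSGGVGV".split())


def apply_conflict(position: int, ref: str, alt: str) -> str:
    """Return the window with `ref` replaced by `alt` at a 1-based position.

    Raises ValueError if the canonical residue at that position is not `ref`,
    which guards against off-by-one mistakes in the coordinates.
    """
    index = position - WINDOW_START
    if not 0 <= index < len(WINDOW):
        raise ValueError("position falls outside the embedded window")
    if WINDOW[index] != ref:
        raise ValueError(f"expected {ref} at {position}, found {WINDOW[index]}")
    return WINDOW[:index] + alt + WINDOW[index + 1 :]


# Conflict at position 106: C -> R in CAA64485.
variant = apply_conflict(106, "C", "R")
print(variant[:10])
```

Checking the reference residue before substituting is a deliberate choice: a mismatch usually means the coordinates or the sequence version are not the ones the annotation was written against.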

Sequence databases

EMBL / GenBank / DDBJ:
X95178 mRNA. Translation: CAA64485.1.
AE014296 Genomic DNA. Translation: AAF49895.1.
BT015986 mRNA. Translation: AAV36871.1.
BT021246 mRNA. Translation: AAX33394.1.
AY122206 mRNA. Translation: AAM52718.1.
RefSeq: NP_524046.2. NM_079322.3.
UniGene: Dm.2873.

Genome annotation databases

EnsemblMetazoa: FBtr0075909; FBpp0075641; FBgn0015919.
GeneID: 39440.
KEGG: dme:Dmel_CG10605.

Cross-references

Organism-specific databases

CTD: 39440.
FlyBase: FBgn0015919. caup.

Miscellaneous databases

ChiTaRS: caup. fly.
GenomeRNAi: 39440.
PRO: P54269.

Entry information

Entry name: CAUP_DROME
Accession: Primary (citable) accession number: P54269
Secondary accession number(s): Q5BIH8, Q5U1A6, Q8MR03, Q9VU00
Entry history
Integrated into UniProtKB/Swiss-Prot: October 1, 1996
Last sequence update: August 14, 2001
Last modified: September 7, 2016
This is version 136 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Miscellaneous

Miscellaneous

'Caupolican' is named after the Araucanian American-Indian tribe, also called mohawks, who shaved all but a medial stripe of hairs on the head.

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
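
The UniRef90 and UniRef50 rules above both combine an identity cut-off with an 80% overlap requirement relative to the seed sequence. The sketch below expresses only that membership test. The function name `meets_threshold` and the identity and overlap values passed to it are hypothetical; how identity and overlap are actually computed, and how the UniRef clustering itself is run, is outside this illustration.

```python
# Identity cut-offs stated in the UniRef descriptions (as fractions).
UNIREF90_IDENTITY = 0.90
UNIREF50_IDENTITY = 0.50
MIN_OVERLAP = 0.80  # overlap with the seed sequence, per the descriptions


def meets_threshold(identity: float, overlap: float, min_identity: float,
                    min_overlap: float = MIN_OVERLAP) -> bool:
    """True if a sequence satisfies both cut-offs relative to the seed sequence.

    `identity` and `overlap` are assumed to be precomputed fractions in [0, 1].
    """
    return identity >= min_identity and overlap >= min_overlap


# Hypothetical values: 72% identity and 85% overlap with a seed sequence.
print(meets_threshold(0.72, 0.85, UNIREF90_IDENTITY))
print(meets_threshold(0.72, 0.85, UNIREF50_IDENTITY))
```

With these hypothetical values the sequence would fall short of the 90% level but satisfy the 50% level, which is why a protein can share a UniRef50 cluster with relatives that sit in different UniRef90 clusters.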