Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Segmentation polarity homeobox protein engrailed

Gene

en

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

This protein specifies the body segmentation pattern. It is required for the development of the central nervous system. Transcriptional regulator that represses activated promoters. Wg signaling operates by inactivating the SGG repression of EN autoactivation.

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
DNA bindingi454 – 513HomeoboxPROSITE-ProRule annotationAdd BLAST60

GO - Molecular functioni

  • enhancer sequence-specific DNA binding Source: FlyBase
  • RNA polymerase II distal enhancer sequence-specific DNA binding Source: FlyBase
  • sequence-specific DNA binding Source: FlyBase
  • transcriptional repressor activity, RNA polymerase II core promoter proximal region sequence-specific binding Source: FlyBase

GO - Biological processi

  • analia development Source: FlyBase
  • anterior/posterior lineage restriction, imaginal disc Source: FlyBase
  • anterior/posterior pattern specification, imaginal disc Source: FlyBase
  • anterior commissure morphogenesis Source: FlyBase
  • anterior head segmentation Source: FlyBase
  • axon guidance Source: FlyBase
  • compartment pattern specification Source: FlyBase
  • genital disc anterior/posterior pattern formation Source: FlyBase
  • genital disc development Source: FlyBase
  • gonad development Source: FlyBase
  • imaginal disc-derived female genitalia development Source: FlyBase
  • imaginal disc-derived male genitalia development Source: FlyBase
  • imaginal disc-derived wing vein specification Source: FlyBase
  • imaginal disc pattern formation Source: FlyBase
  • negative regulation of gene expression Source: FlyBase
  • negative regulation of transcription, DNA-templated Source: FlyBase
  • negative regulation of transcription from RNA polymerase II promoter Source: FlyBase
  • neuroblast fate determination Source: FlyBase
  • positive regulation of transcription from RNA polymerase II promoter Source: FlyBase
  • posterior compartment specification Source: FlyBase
  • posterior head segmentation Source: FlyBase
  • regulation of gene expression Source: FlyBase
  • segment polarity determination Source: FlyBase
  • spiracle morphogenesis, open tracheal system Source: FlyBase
  • transcription, DNA-templated Source: UniProtKB-KW
  • trunk segmentation Source: FlyBase
  • ventral midline development Source: FlyBase
  • wing disc anterior/posterior pattern formation Source: FlyBase
Complete GO annotation...

Keywords - Molecular functioni

Developmental protein, Repressor, Segmentation polarity protein

Keywords - Biological processi

Transcription, Transcription regulation

Keywords - Ligandi

DNA-binding

Enzyme and pathway databases

SignaLinkiP02836.

Names & Taxonomyi

Protein namesi
Recommended name:
Segmentation polarity homeobox protein engrailed
Gene namesi
Name:en
ORF Names:CG9015
OrganismiDrosophila melanogaster (Fruit fly)
Taxonomic identifieri7227 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaArthropodaHexapodaInsectaPterygotaNeopteraEndopterygotaDipteraBrachyceraMuscomorphaEphydroideaDrosophilidaeDrosophilaSophophora
Proteomesi
  • UP000000803 Componenti: Chromosome 2R

Organism-specific databases

FlyBaseiFBgn0000577. en.

Subcellular locationi

  • Nucleus PROSITE-ProRule annotation1 Publication

GO - Cellular componenti

  • nucleus Source: FlyBase
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Mutagenesis

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Mutagenesisi508K → Q: Reduced specificity and affinity for DNA. 1

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00001960771 – 552Segmentation polarity homeobox protein engrailedAdd BLAST552

Post-translational modificationi

Phosphorylated. Phosphorylation may directly or allosterically modify its function.1 Publication

Keywords - PTMi

Phosphoprotein

Proteomic databases

PaxDbiP02836.
PRIDEiP02836.

PTM databases

iPTMnetiP02836.

Expressioni

Developmental stagei

Expression initiates prior to the ninth embryonic nuclear division cycle within 1.5 hours after fertilization. By the cellular blastoderm stage (the 14th nuclear division cycle) is localized into 14 stripes, 1-2 cells wide, spaced along the anterior-posterior axis of the embryo.

Gene expression databases

BgeeiFBgn0000577.
GenevisibleiP02836. DM.

Interactioni

Protein-protein interaction databases

BioGridi62028. 16 interactors.
IntActiP02836. 2 interactors.
MINTiMINT-310169.
STRINGi7227.FBpp0087197.

Structurei

Secondary structure

1552
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Turni453 – 455Combined sources3
Beta strandi458 – 460Combined sources3
Helixi463 – 475Combined sources13
Helixi481 – 491Combined sources11
Helixi495 – 510Combined sources16

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
1DU0X-ray2.00A/B456-512[»]
1ENHX-ray2.10A456-509[»]
1HDDX-ray2.80C/D453-512[»]
1P7IX-ray2.10A/B/C/D454-512[»]
1P7JX-ray2.10A/B/C/D454-512[»]
1ZTRNMR-A453-512[»]
2HDDX-ray1.90A/B454-512[»]
2HOSX-ray1.90A/B453-513[»]
2HOTX-ray2.19A/B453-513[»]
2JWTNMR-A453-512[»]
2P81NMR-A469-512[»]
3HDDX-ray2.20A/B454-513[»]
ProteinModelPortaliP02836.
SMRiP02836.
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiP02836.

Family & Domainsi

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi19 – 53Gln-richAdd BLAST35
Compositional biasi55 – 87Ala-richAdd BLAST33
Compositional biasi232 – 240Ala-rich9
Compositional biasi320 – 411Ser-richAdd BLAST92

Sequence similaritiesi

Belongs to the engrailed homeobox family.Curated
Contains 1 homeobox DNA-binding domain.PROSITE-ProRule annotation

Keywords - Domaini

Homeobox

Phylogenomic databases

eggNOGiKOG0493. Eukaryota.
ENOG4111P06. LUCA.
GeneTreeiENSGT00840000129733.
InParanoidiP02836.
KOiK09319.
OMAiDTRSETG.
OrthoDBiEOG091G0XBB.
PhylomeDBiP02836.

Family and domain databases

Gene3Di1.10.10.60. 1 hit.
InterProiIPR019549. Homeobox-engrailed_C-terminal.
IPR017970. Homeobox_CS.
IPR001356. Homeobox_dom.
IPR020479. Homeobox_metazoa.
IPR009057. Homeodomain-like.
IPR000747. Homeodomain_engrailed.
IPR019737. Homoebox-engrailed_CS.
IPR000047. HTH_motif.
[Graphical view]
PfamiPF10525. Engrail_1_C_sig. 1 hit.
PF00046. Homeobox. 1 hit.
[Graphical view]
PRINTSiPR00026. ENGRAILED.
PR00024. HOMEOBOX.
PR00031. HTHREPRESSR.
SMARTiSM00389. HOX. 1 hit.
[Graphical view]
SUPFAMiSSF46689. SSF46689. 1 hit.
PROSITEiPS00033. ENGRAILED. 1 hit.
PS00027. HOMEOBOX_1. 1 hit.
PS50071. HOMEOBOX_2. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

P02836-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MALEDRCSPQ SAPSPITLQM QHLHHQQQQQ QQQQQQMQHL HQLQQLQQLH
60 70 80 90 100
QQQLAAGVFH HPAMAFDAAA AAAAAAAAAA AHAHAAALQQ RLSGSGSPAS
110 120 130 140 150
CSTPASSTPL TIKEEESDSV IGDMSFHNQT HTTNEEEEAE EDDDIDVDVD
160 170 180 190 200
DTSAGGRLPP PAHQQQSTAK PSLAFSISNI LSDRFGDVQK PGKSMENQAS
210 220 230 240 250
IFRPFEASRS QTATPSAFTR VDLLEFSRQQ QAAAAAATAA MMLERANFLN
260 270 280 290 300
CFNPAAYPRI HEEIVQSRLR RSAANAVIPP PMSSKMSDAN PEKSALGSLC
310 320 330 340 350
KAVSQIGQPA APTMTQPPLS SSASSLASPP PASNASTISS TSSVATSSSS
360 370 380 390 400
SSSGCSSAAS SLNSSPSSRL GASGSGVNAS SPQPQPIPPP SAVSRDSGME
410 420 430 440 450
SSDDTRSETG STTTEGGKNE MWPAWVYCTR YSDRPSSGPR YRRPKQPKDK
460 470 480 490 500
TNDEKRPRTA FSSEQLARLK REFNENRYLT ERRRQQLSSE LGLNEAQIKI
510 520 530 540 550
WFQNKRAKIK KSTGSKNPLA LQLMAQGLYN HTTVPLTKEE EELEMRMNGQ

IP
Length:552
Mass (Da):59,411
Last modified:February 2, 2004 - v2
Checksum:i92A94C14AA85C527
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti195M → I in AAA65478 (PubMed:3917855).Curated1
Sequence conflicti208S → N in AAA65478 (PubMed:3917855).Curated1
Sequence conflicti486Q → E in CAA25906 (PubMed:2481829).Curated1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M10017 mRNA. Translation: AAA65478.1.
K03055 Genomic DNA. No translation available.
K03056 Genomic DNA. No translation available.
AE013599 Genomic DNA. Translation: AAF58639.1.
AY069448 mRNA. Translation: AAL39593.1.
X01765 Genomic DNA. Translation: CAA25906.1.
PIRiA90862. WJFFEN.
RefSeqiNP_523700.2. NM_078976.4.
NP_725059.1. NM_165841.2.
UniGeneiDm.22056.

Genome annotation databases

EnsemblMetazoaiFBtr0088095; FBpp0087197; FBgn0000577.
FBtr0088096; FBpp0087198; FBgn0000577.
GeneIDi36240.
KEGGidme:Dmel_CG9015.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M10017 mRNA. Translation: AAA65478.1.
K03055 Genomic DNA. No translation available.
K03056 Genomic DNA. No translation available.
AE013599 Genomic DNA. Translation: AAF58639.1.
AY069448 mRNA. Translation: AAL39593.1.
X01765 Genomic DNA. Translation: CAA25906.1.
PIRiA90862. WJFFEN.
RefSeqiNP_523700.2. NM_078976.4.
NP_725059.1. NM_165841.2.
UniGeneiDm.22056.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
1DU0X-ray2.00A/B456-512[»]
1ENHX-ray2.10A456-509[»]
1HDDX-ray2.80C/D453-512[»]
1P7IX-ray2.10A/B/C/D454-512[»]
1P7JX-ray2.10A/B/C/D454-512[»]
1ZTRNMR-A453-512[»]
2HDDX-ray1.90A/B454-512[»]
2HOSX-ray1.90A/B453-513[»]
2HOTX-ray2.19A/B453-513[»]
2JWTNMR-A453-512[»]
2P81NMR-A469-512[»]
3HDDX-ray2.20A/B454-513[»]
ProteinModelPortaliP02836.
SMRiP02836.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi62028. 16 interactors.
IntActiP02836. 2 interactors.
MINTiMINT-310169.
STRINGi7227.FBpp0087197.

PTM databases

iPTMnetiP02836.

Proteomic databases

PaxDbiP02836.
PRIDEiP02836.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaiFBtr0088095; FBpp0087197; FBgn0000577.
FBtr0088096; FBpp0087198; FBgn0000577.
GeneIDi36240.
KEGGidme:Dmel_CG9015.

Organism-specific databases

CTDi36240.
FlyBaseiFBgn0000577. en.

Phylogenomic databases

eggNOGiKOG0493. Eukaryota.
ENOG4111P06. LUCA.
GeneTreeiENSGT00840000129733.
InParanoidiP02836.
KOiK09319.
OMAiDTRSETG.
OrthoDBiEOG091G0XBB.
PhylomeDBiP02836.

Enzyme and pathway databases

SignaLinkiP02836.

Miscellaneous databases

EvolutionaryTraceiP02836.
GenomeRNAii36240.
PROiP02836.

Gene expression databases

BgeeiFBgn0000577.
GenevisibleiP02836. DM.

Family and domain databases

Gene3Di1.10.10.60. 1 hit.
InterProiIPR019549. Homeobox-engrailed_C-terminal.
IPR017970. Homeobox_CS.
IPR001356. Homeobox_dom.
IPR020479. Homeobox_metazoa.
IPR009057. Homeodomain-like.
IPR000747. Homeodomain_engrailed.
IPR019737. Homoebox-engrailed_CS.
IPR000047. HTH_motif.
[Graphical view]
PfamiPF10525. Engrail_1_C_sig. 1 hit.
PF00046. Homeobox. 1 hit.
[Graphical view]
PRINTSiPR00026. ENGRAILED.
PR00024. HOMEOBOX.
PR00031. HTHREPRESSR.
SMARTiSM00389. HOX. 1 hit.
[Graphical view]
SUPFAMiSSF46689. SSF46689. 1 hit.
PROSITEiPS00033. ENGRAILED. 1 hit.
PS00027. HOMEOBOX_1. 1 hit.
PS50071. HOMEOBOX_2. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiHMEN_DROME
AccessioniPrimary (citable) accession number: P02836
Secondary accession number(s): P02837
, Q0E9C0, Q24356, Q9V601
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 21, 1986
Last sequence update: February 2, 2004
Last modified: November 30, 2016
This is version 184 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programDrosophila annotation project

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.