Protein

Early growth response protein 1

Gene

egr1

Organism
Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
Status
Reviewed - Annotation score: 3 out of 5 - Experimental evidence at transcript level

Function

Transcriptional regulator. Recognizes and binds to the DNA sequence 5'-CGCCCCCGC-3' (EGR-site). Activates the transcription of target genes whose products are required for mitogenesis and differentiation (by similarity).
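As a rough illustration of what "recognizes the EGR-site" means at the sequence level, the sketch below scans a DNA string for the 9-mer 5'-CGCCCCCGC-3' on both strands. The function names and the example DNA string are hypothetical, and exact string matching is an assumption made for simplicity; real binding-site prediction normally tolerates mismatches.

```python
# Minimal sketch: exact-match scan for the EGR-site on both DNA strands.
# EGR_SITE comes from the Function paragraph; everything else is illustrative.
EGR_SITE = "CGCCCCCGC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def find_egr_sites(dna: str, site: str = EGR_SITE):
    """Return sorted (start, strand) pairs; start is 1-based on the given strand."""
    dna = dna.upper()
    hits = []
    for strand, pattern in (("+", site), ("-", reverse_complement(site))):
        start = dna.find(pattern)
        while start != -1:
            hits.append((start + 1, strand))
            # Step by one so overlapping occurrences are also reported.
            start = dna.find(pattern, start + 1)
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical promoter fragment, not taken from any database.
    print(find_egr_sites("ttCGCCCCCGCaaGCGGGGGCGtt"))
```

A hit on the "-" strand means the site reads 5'-CGCCCCCGC-3' on the complementary strand at that location.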

Regions

Feature key    Position(s)    Length    Description
Zinc finger    307 – 331      25        C2H2-type 1 (PROSITE-ProRule annotation)
Zinc finger    337 – 359      23        C2H2-type 2 (PROSITE-ProRule annotation)
Zinc finger    365 – 387      23        C2H2-type 3 (PROSITE-ProRule annotation)
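Feature positions in this entry are 1-based and inclusive, so a feature at 307 – 331 spans 331 - 307 + 1 = 25 residues. The sketch below slices the three zinc fingers out of residues 301-400 of the sequence given in the Sequence section and checks each against a loose C2H2 regular expression. The regular expression is an assumption made for illustration and is not the PROSITE rule itself; the helper names are hypothetical.

```python
import re

# Residues 301-400 of this entry's sequence (an excerpt, not the full protein).
SEGMENT_START = 301
SEGMENT = (
    "PPHERPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSRSD"
    "HLTTHIRTHTGEKPFACDICGRKFARSDERKRHTKIHLRQKDKKADKATP"
)

# 1-based inclusive coordinates from the Regions table.
ZINC_FINGERS = {
    "C2H2-type 1": (307, 331),
    "C2H2-type 2": (337, 359),
    "C2H2-type 3": (365, 387),
}

# Loose C2H2 shape: Cys, 2-4 residues, Cys, 12 residues, His, 3-5 residues, His.
# Assumption for illustration only.
C2H2_LIKE = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def feature_slice(start: int, end: int) -> str:
    """Convert 1-based inclusive coordinates into a Python slice of SEGMENT."""
    return SEGMENT[start - SEGMENT_START : end - SEGMENT_START + 1]


if __name__ == "__main__":
    for name, (start, end) in ZINC_FINGERS.items():
        residues = feature_slice(start, end)
        looks_c2h2 = C2H2_LIKE.search(residues) is not None
        print(name, len(residues), residues, looks_c2h2)
```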

Keywords - Molecular function

Activator

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding, Metal-binding, Zinc

Names & Taxonomy

Protein names
Recommended name: Early growth response protein 1 (by similarity)
Short name: EGR-1 (by similarity)
Gene names
Name: egr1 (imported)
Synonyms: egr1 Publication
Organism: Xenopus tropicalis (Western clawed frog) (Silurana tropicalis)
Taxonomic identifier: 8364 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Amphibia > Batrachia > Anura > Pipoidea > Pipidae > Xenopodinae > Xenopus > Silurana
Proteomes
  • UP000008143 Component: Unassembled WGS sequence

Organism-specific databases

Xenbase: XB-GENE-853412. egr1.

Subcellular location

  • Nucleus (by similarity)

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key    Position(s)    Length    Description                        Feature identifier
Chain          1 – 498        498       Early growth response protein 1    PRO_0000386429

Proteomic databases

PaxDb: A4II20.

Expression

Gene expression databases

Bgee: ENSXETG00000006697.
ExpressionAtlas: A4II20. baseline.

Interaction

Protein-protein interaction databases

STRING: 8364.ENSXETP00000047681.

Structure

3D structure databases

ProteinModelPortal: A4II20.
SMR: A4II20. Positions 304-390.

Family & Domains

Compositional bias

Feature key           Position(s)    Length    Description
Compositional bias    139 – 183      45        Ser-rich (sequence analysis)
Compositional bias    402 – 491      90        Ser-rich (sequence analysis)
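To make "Ser-rich" concrete, the following sketch counts serine residues in the first compositionally biased region (139 – 183). It uses residues 131-190 copied from the Sequence section; the helper name is hypothetical, and no threshold for calling a region "rich" is implied, since the entry does not state one.

```python
# Residues 131-190 of this entry's sequence (an excerpt, not the full protein).
EXCERPT_START = 131
EXCERPT = (
    "GLVGMANASP" "SSAPSSSPSS" "SSSSSQSPPL"
    "SCSVQSNDSS" "PIYSAAPTFP" "NSSPELFPDQ"
)


def residue_fraction(start: int, end: int, residue: str = "S"):
    """Return (region, count, fraction) for 1-based inclusive coordinates."""
    region = EXCERPT[start - EXCERPT_START : end - EXCERPT_START + 1]
    count = region.count(residue)
    return region, count, count / len(region)


if __name__ == "__main__":
    region, count, fraction = residue_fraction(139, 183)
    print(f"{len(region)} residues, {count} Ser, {fraction:.1%}")
```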

Sequence similarities

Belongs to the EGR C2H2-type zinc-finger protein family (sequence analysis).
Contains 3 C2H2-type zinc fingers (PROSITE-ProRule annotation).

Keywords - Domain

Repeat, Zinc-finger

Phylogenomic databases

eggNOG: KOG1721. Eukaryota.
        COG5048. LUCA.
GeneTree: ENSGT00550000074455.
HOGENOM: HOG000036856.
InParanoid: A4II20.
KO: K09203.
OMA: GEEHEND.
OrthoDB: EOG091G06VX.
TreeFam: TF318980.

Family and domain databases

Gene3D: 3.30.160.60. 3 hits.
InterPro: IPR021839. DUF3432.
          IPR021849. DUF3446.
          IPR007087. Znf_C2H2.
          IPR015880. Znf_C2H2-like.
          IPR013087. Znf_C2H2/integrase_DNA-bd.
Pfam: PF11914. DUF3432. 1 hit.
      PF11928. DUF3446. 1 hit.
SMART: SM00355. ZnF_C2H2. 3 hits.
PROSITE: PS00028. ZINC_FINGER_C2H2_1. 3 hits.
         PS50157. ZINC_FINGER_C2H2_2. 3 hits.

Sequence

Sequence status: Complete.

A4II20-1 [UniParc]

        10         20         30         40         50
MAAAKTDMLV SPLQISDPFS SFPHSPTMDN YPKLEEMMLL NPGAPQFLGA
        60         70         80         90        100
AVPEGSGFNS PVEGSEQFDH LAADAFSDMS LSGEKAVIES SYANQSARLP
       110        120        130        140        150
SLTYTGRFSL EPAPNSSNTL WPEPLFSLVS GLVGMANASP SSAPSSSPSS
       160        170        180        190        200
SSSSSQSPPL SCSVQSNDSS PIYSAAPTFP NSSPELFPDQ SPQPFQNAST
       210        220        230        240        250
ASIPYPPPAY PVSKTTFQVP MIPDYLFPQQ QGDVSLVSAD QKPFQAMESR
       260        270        280        290        300
TQQPSLTPLS TIKAFATQTS QDLKTINSTY QSQIIKPSRM RKYPNRPSKT
       310        320        330        340        350
PPHERPYACP VESCDRRFSR SDELTRHIRI HTGQKPFQCR ICMRNFSRSD
       360        370        380        390        400
HLTTHIRTHT GEKPFACDIC GRKFARSDER KRHTKIHLRQ KDKKADKATP
       410        420        430        440        450
VSVASPVSSY SPSASTSYPS PVPTSYSSPV SSAYPSPVHS SFPSPTTAVT
       460        470        480        490
YPSVTSTFQT HGITSFPSSI VTNSFSSPVS SALSDMSITY SPRTIEIC
Length: 498
Mass (Da): 53,997
Last modified: October 13, 2009 - v2
Checksum: F0A812624C004C74
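The displayed sequence interleaves position rulers with residue lines, so it needs light cleaning before use. The sketch below strips rulers and spaces, confirms the stated length of 498, and estimates an average molecular mass that can be compared with the 53,997 Da reported above. The residue mass table holds standard average residue masses; whether UniProt computes its mass value in exactly this way is an assumption, so small differences are possible. Function names are hypothetical.

```python
# Sequence block as displayed in this entry (rulers plus grouped residues).
DISPLAY = """
        10         20         30         40         50
MAAAKTDMLV SPLQISDPFS SFPHSPTMDN YPKLEEMMLL NPGAPQFLGA
        60         70         80         90        100
AVPEGSGFNS PVEGSEQFDH LAADAFSDMS LSGEKAVIES SYANQSARLP
       110        120        130        140        150
SLTYTGRFSL EPAPNSSNTL WPEPLFSLVS GLVGMANASP SSAPSSSPSS
       160        170        180        190        200
SSSSSQSPPL SCSVQSNDSS PIYSAAPTFP NSSPELFPDQ SPQPFQNAST
       210        220        230        240        250
ASIPYPPPAY PVSKTTFQVP MIPDYLFPQQ QGDVSLVSAD QKPFQAMESR
       260        270        280        290        300
TQQPSLTPLS TIKAFATQTS QDLKTINSTY QSQIIKPSRM RKYPNRPSKT
       310        320        330        340        350
PPHERPYACP VESCDRRFSR SDELTRHIRI HTGQKPFQCR ICMRNFSRSD
       360        370        380        390        400
HLTTHIRTHT GEKPFACDIC GRKFARSDER KRHTKIHLRQ KDKKADKATP
       410        420        430        440        450
VSVASPVSSY SPSASTSYPS PVPTSYSSPV SSAYPSPVHS SFPSPTTAVT
       460        470        480        490
YPSVTSTFQT HGITSFPSSI VTNSFSSPVS SALSDMSITY SPRTIEIC
"""

# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water added back for the free N- and C-termini


def parse_display(block: str) -> str:
    """Keep letters only; ruler digits, spaces, and newlines are dropped."""
    return "".join(ch for ch in block if ch.isalpha()).upper()


def average_mass(sequence: str) -> float:
    """Sum of average residue masses plus one water (raises KeyError on unknown letters)."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER


if __name__ == "__main__":
    seq = parse_display(DISPLAY)
    print(len(seq), round(average_mass(seq)))
```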

Experimental Info

Feature key          Position(s)    Length    Description
Sequence conflict    148 – 148      1         P → PS in AAI35807 (Ref. 2) (curated)

Sequence databases

EMBL / GenBank / DDBJ: AC146894 Genomic DNA. Translation: AAT71995.1.
                       BC135806 mRNA. Translation: AAI35807.1.
RefSeq: NP_001090830.1. NM_001097361.1.
UniGene: Str.39717.

Genome annotation databases

Ensembl: ENSXETT00000047681; ENSXETP00000047681; ENSXETG00000006697.
GeneID: 100038164.
KEGG: xtr:100038164.

Cross-references

Organism-specific databases

CTD: 1958.
Xenbase: XB-GENE-853412. egr1.

Entry information

Entry name: EGR1_XENTR
Accession: Primary (citable) accession number: A4II20
Secondary accession number(s): Q6F2L9
Entry history
Integrated into UniProtKB/Swiss-Prot: October 13, 2009
Last sequence update: October 13, 2009
Last modified: September 7, 2016
This is version 60 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
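The identity and overlap thresholds above can be illustrated with a small check on a precomputed pairwise alignment. This is only a sketch of the threshold test, not the clustering procedure UniRef actually runs. The definitions used here are assumptions: identity is identical columns divided by columns aligned without gaps, and overlap is gap-free aligned columns divided by seed length. The function names and the toy alignments are hypothetical.

```python
def identity_and_overlap(aligned_seed: str, aligned_member: str):
    """Compute (identity, overlap) from two equal-length gapped strings ('-' = gap)."""
    if len(aligned_seed) != len(aligned_member):
        raise ValueError("aligned sequences must have the same length")
    aligned_cols = 0
    matches = 0
    for a, b in zip(aligned_seed, aligned_member):
        if a == "-" or b == "-":
            continue  # gap columns count toward neither identity nor overlap
        aligned_cols += 1
        if a == b:
            matches += 1
    seed_len = sum(ch != "-" for ch in aligned_seed)
    identity = matches / aligned_cols if aligned_cols else 0.0
    overlap = aligned_cols / seed_len if seed_len else 0.0
    return identity, overlap


def meets_threshold(aligned_seed, aligned_member, min_identity, min_overlap=0.8):
    """True if the member satisfies both the identity and the overlap cut-offs."""
    identity, overlap = identity_and_overlap(aligned_seed, aligned_member)
    return identity >= min_identity and overlap >= min_overlap


if __name__ == "__main__":
    seed = "MAAAKTDMLV"  # first ten residues of this entry, used as a toy seed
    print(meets_threshold(seed, "MAAAKTDMLV", 0.9))  # identical member
    print(meets_threshold(seed, "MAAAK-----", 0.9))  # too little overlap
    print(meets_threshold(seed, "MAAAKTDAAA", 0.5))  # 70% identity, full overlap
```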