Protein
Submitted name:

Transcription factor COE1

Gene

Ebf1

Organism
Mus musculus (Mouse)
Status
Unreviewed - Annotation score: 2 out of 5 - Experimental evidence at protein level

Function

Keywords

Molecular function: Developmental protein (UniRule annotation), DNA-binding (UniRule annotation)
Biological process: Transcription, Transcription regulation (UniRule annotation)
Ligand: Metal-binding, Zinc

Names & Taxonomy

Protein names
Submitted name: Transcription factor COE1 (Imported)
Gene names
Name: Ebf1 (Imported)
Organism: Mus musculus (Mouse) (Imported)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Myomorpha > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 11

Organism-specific databases

MGI: MGI:95275. Ebf1.

Subcellular location

  • Nucleus (UniRule annotation)

Keywords - Cellular component

Nucleus (UniRule annotation)

PTM / Processing

Proteomic databases

MaxQB: Q5SWK8.
PaxDb: Q5SWK8.

Expression

Gene expression databases

Bgee: ENSMUSG00000057098.
ExpressionAtlas: Q5SWK8. baseline and differential.

Structure

3D structure databases

ProteinModelPortal: Q5SWK8.
SMR: Q5SWK8.
ModBase: Search...
MobiDB: Search...

Family & Domains

Domains and Repeats

Feature key    Position(s)    Description                      Length
Domain         254 – 338      IPT/TIG (InterPro annotation)    85

Sequence similarities

Belongs to the COE family. (UniRule annotation)

Keywords - Domain

Zinc-finger (UniRule annotation)

Phylogenomic databases

eggNOG: KOG3836. Eukaryota.
ENOG410XQ9Z. LUCA.
GeneTree: ENSGT00390000014051.
HOGENOM: HOG000092311.
HOVERGEN: HBG005108.

Family and domain databases

Gene3D: 2.60.40.10. 1 hit.
InterPro:
IPR032200. COE_DBD.
IPR032201. COE_HLH.
IPR013783. Ig-like_fold.
IPR014756. Ig_E-set.
IPR002909. IPT.
IPR003523. Transcription_factor_COE.
IPR018350. Transcription_factor_COE_CS.
PANTHER: PTHR10747. PTHR10747. 1 hit.
Pfam:
PF16422. COE1_DBD. 1 hit.
PF16423. COE1_HLH. 1 hit.
PF01833. TIG. 1 hit.
SMART:
SM00429. IPT. 1 hit.
SUPFAM: SSF81296. SSF81296. 1 hit.
PROSITE:
PS01345. COE. 1 hit.

Sequence

Sequence status: Complete.

Q5SWK8-1 [UniParc]

        10         20         30         40         50
MFGIQESIQR SGSSMKEEPL GSGMNAVRTW MQGAGVLDAN TAAQSGVGLA
        60         70         80         90        100
RAHFEKQPPS NLRKSNFFHF VLALYDRQGQ PVEIERTAFV GFVEKEKEAN
       110        120        130        140        150
SEKTNNGIHY RLQLLYSNGI RTEQDFYVRL IDSMTKQAIV YEGQDKNPEM
       160        170        180        190        200
CRVLLTHEIM CSRCCDKKSC GNRNETPSDP VIIDRFFLKF FLKCNQNCLK
       210        220        230        240        250
NAGNPRDMRR FQVVVSTTVN VDGHVLAVSD NMFVHNNSKH GRRARRLDPS
       260        270        280        290        300
EAATPCIKAI SPSEGWTTGG ATVIIIGDNF FDGLQVIFGT MLVWSELITP
       310        320        330        340        350
HAIRVQTPPR HIPGVVEVTL SYKSKQFCKG TPGRFIYTAL NEPTIDYGFQ
       360        370        380        390        400
RLQKVIPRHP GDPERLPKEV ILKRAADLVE ALYGMPHNNQ EIILKRAADI
       410        420        430        440        450
AEALYSVPRN HNQLPALANT SVHAGMMGVN SFSGQLAVNV SEASQATNQG
       460        470        480        490        500
FTRNSSSVSP HGYVPSTTPQ QTNYNSVTTS MNGYGSAAMS NLGGSPTFLN
       510        520        530        540        550
GSAANSPYAI VPSSPTMASS TSLPSNCSSS SGIFSFSPAN MVSAVKQKSA
       560        570        580
FAPVVRPQTS PPPTCTSTNG NSLQAISGMI VPPM
Length: 584
Mass (Da): 63,650
Last modified: February 5, 2008 - v1
Checksum: 1E903A9801BBCFEA
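
The sequence listing above interleaves position rulers with residues grouped in tens, so it cannot be used directly as a sequence string. The following is a minimal sketch, not part of the UniProt record, of one way to recover the bare residue string and compare its length with the stated value. The helper name `clean_sequence_block` is hypothetical, and the example uses only the first row of the listing.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Return the bare residue string from a numbered, space-grouped listing.

    Hypothetical helper: it keeps letters only, so ruler digits, spaces and
    line breaks are all discarded.
    """
    return re.sub(r"[^A-Za-z]", "", block).upper()


# First ruler line and first residue row, copied from the listing above.
example = """
        10         20         30         40         50
MFGIQESIQR SGSSMKEEPL GSGMNAVRTW MQGAGVLDAN TAAQSGVGLA
"""

seq = clean_sequence_block(example)
print(len(seq))   # 50
print(seq[:10])   # MFGIQESIQR
```

Applied to the complete listing, the cleaned string should contain 584 residues if the listing is intact, which gives a quick consistency check against the Length field.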

Sequence databases

EMBL / GenBank / DDBJ:
AL603913 Genomic DNA. No translation available.
AL645607 Genomic DNA. No translation available.
AL672013 Genomic DNA. No translation available.
AL732622 Genomic DNA. No translation available.
RefSeq: XP_006532218.1. XM_006532155.3.
UniGene: Mm.439662.
Mm.440346.

Genome annotation databases

Ensembl: ENSMUST00000109268; ENSMUSP00000104891; ENSMUSG00000057098.
GeneID: 13591.

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
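
The UniRef90 and UniRef50 criteria described above reduce to two thresholds, one on identity and one on overlap with the seed sequence. The following is a minimal sketch of that membership test under stated assumptions: the names `ClusterRule`, `RULES` and `joins_cluster` are hypothetical, identity and overlap are assumed to be precomputed fractions between 0 and 1, and this is not the actual UniRef clustering procedure, which also has to select seeds and compute alignments.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClusterRule:
    min_identity: float  # minimum sequence identity to the seed (fraction)
    min_overlap: float   # minimum overlap with the seed (fraction)


# Thresholds taken from the descriptions above; UniRef100 is omitted because
# it groups identical sequences and sub-fragments rather than using cutoffs.
RULES = {
    "UniRef90": ClusterRule(min_identity=0.90, min_overlap=0.80),
    "UniRef50": ClusterRule(min_identity=0.50, min_overlap=0.80),
}


def joins_cluster(level: str, identity: float, overlap: float) -> bool:
    """Return True if a sequence meets both thresholds for the given level."""
    rule = RULES[level]
    return identity >= rule.min_identity and overlap >= rule.min_overlap


print(joins_cluster("UniRef90", identity=0.93, overlap=0.85))  # True
print(joins_cluster("UniRef90", identity=0.93, overlap=0.70))  # False
print(joins_cluster("UniRef50", identity=0.60, overlap=0.80))  # True
```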

Entry information

Entry name: Q5SWK8_MOUSE
Accession: Primary (citable) accession number: Q5SWK8
Entry history: Integrated into UniProtKB/TrEMBL: February 5, 2008
Last sequence update: February 5, 2008
Last modified: July 5, 2017
This is version 83 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Caution

The sequence shown here is derived from an Ensembl automatic analysis pipeline and should be considered as preliminary data. (Imported)

Keywords - Technical term

Complete proteome, Proteomics identification (Combined sources), Reference proteome (Imported)