Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Uncharacterized protein YehI

Gene

yehI

Organism
Escherichia coli (strain K12)
Status
Reviewed-Annotation score: Annotation score: 1 out of 5-Protein predictedi

Functioni

Enzyme and pathway databases

BioCyciEcoCyc:EG11995-MONOMER.
ECOL316407:JW2105-MONOMER.

Names & Taxonomyi

Protein namesi
Recommended name:
Uncharacterized protein YehI
Gene namesi
Name:yehI
Ordered Locus Names:b2118, JW2105
OrganismiEscherichia coli (strain K12)
Taxonomic identifieri83333 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia
Proteomesi
  • UP000000318 Componenti: Chromosome
  • UP000000625 Componenti: Chromosome

Organism-specific databases

EcoGeneiEG11995. yehI.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 12101210Uncharacterized protein YehIPRO_0000169131Add
BLAST

Proteomic databases

PaxDbiP33346.
PRIDEiP33346.

Interactioni

Protein-protein interaction databases

BioGridi4259176. 18 interactions.
IntActiP33346. 8 interactions.
STRINGi511145.b2118.

Structurei

3D structure databases

ProteinModelPortaliP33346.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

To E.coli molybdate metabolism regulator (MolR).Curated

Phylogenomic databases

eggNOGiENOG4108KDT. Bacteria.
ENOG4111FHT. LUCA.
HOGENOMiHOG000122164.
OMAiESSWQRC.

Family and domain databases

InterProiIPR025406. DUF4132.
[Graphical view]
PfamiPF13569. DUF4132. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

P33346-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MDKELPWLAD NAQLELKYKK GKTPLSHRRW PGEPVSVITG SLIQTLGDEL
60 70 80 90 100
LQKAEKKKNI VWRYENFSLE WQSAITQAIN LIGEHKPSIP ARTMAALACI
110 120 130 140 150
AQNDSQQLLD EIVQQEGLEY ATEVVIARQF IARCYESDPL VVTLQYQDED
160 170 180 190 200
YGYGYRSETY NEFDLRLRKH LSLAEESCWQ RCADKLIAAL PGINKVRRPF
210 220 230 240 250
IALILPEKPE IANELVGLEC PRTHFHSKEW LKVVANDPTA VRKLEHYWSQ
260 270 280 290 300
DIFSDREASY MSHENHFGYA ACAALLREQG LAAIPRLAMY AHKEDCGSLL
310 320 330 340 350
VQINHPQVIR TLLLVADKNK PSLQRVAKYH KNFPHATLAA LAELLALTEP
360 370 380 390 400
PARPGYPIIE DKKLPAQQKA RDEYWRTLLQ TLMASQPQLA AEVMPWLSTQ
410 420 430 440 450
PQSVLKSYLS APPKPVIDGT DNSNLPEILV SPPWRSKKKM TAPRLDLAPL
460 470 480 490 500
ELTPQVYWQP GEQERLAATE PARYFSTESL AQRMEQKSGR VVLQELGFGD
510 520 530 540 550
DVWLFLNYIL PGKLDAARNS LFVQWHYYQG RVEEILNGWN SPEAQLAEQA
560 570 580 590 600
LRSGHIEALI NIWENDNYSH YRPEKSVWNL YLLAQLPREM ALTFWLRINE
610 620 630 640 650
KKHLFAGEDY FLSILGLDAL PGLLLAFSHR PKETFPLILN FGATELALPV
660 670 680 690 700
AHVWRRFAAQ RDLARQWILQ WPEHTASALI PLVFTKPSDN SEAALLALRL
710 720 730 740 750
LYEQGHGELL QTVANRWQRT DVWSALEQLL KQGPMDIYPA RIPKAPDFWH
760 770 780 790 800
PQMWSRPRLI TNNQTVTNDA LEIIGEMLRF TQGGRFYSGL EQLKTFCQPQ
810 820 830 840 850
TLAAFAWDLF TAWQQAGAPA KDNWAFLALS LFGDESTARD LTTQILAWPQ
860 870 880 890 900
EGKSARAVSG LNILTLMNND MALIQLHHIS QRAKSRPLRD NAAEFLQVVA
910 920 930 940 950
ENRGLSQEEL ADRLVPTLGL DDPQALSFDF GPRQFTVRFD ENLNPVIFDQ
960 970 980 990 1000
QNVRQKSVPR LRADDDQLKA PEALARLKGL KKDATQVSKN LLPRLETALR
1010 1020 1030 1040 1050
TTRRWSLADF HSLFVNHPFT RLVTQRLIWG VYPANEPRCL LKAFRVAAEG
1060 1070 1080 1090 1100
EFCNAQDEPI DLPADALIGI AHPLEMTAEM RSEFAQLFAD YEIMPPFRQL
1110 1120 1130 1140 1150
SRRTVLLTPD ESTSNSLTRW EGKSATVGQL MGMRYKGWES GYEDAFVYNL
1160 1170 1180 1190 1200
GEYRLVLKFS PGFNHYNVDS KALMSFRSLR VYRDNKSVTF AELDVFDLSE
1210
ALSAPDVIFH
Length:1,210
Mass (Da):138,068
Last modified:November 1, 1997 - v2
Checksum:i0C2D3412D3CD6574
GO

Sequence cautioni

The sequence AAA60478 differs from that shown. Reason: Frameshift at positions 349 and 354. Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U00007 Genomic DNA. Translation: AAA60478.1. Frameshift.
U00096 Genomic DNA. Translation: AAC75179.1.
AP009048 Genomic DNA. Translation: BAE76593.1.
PIRiE64979.
RefSeqiNP_416621.1. NC_000913.3.
WP_000356817.1. NZ_LN832404.1.

Genome annotation databases

EnsemblBacteriaiAAC75179; AAC75179; b2118.
BAE76593; BAE76593; BAE76593.
GeneIDi946649.
KEGGiecj:JW2105.
eco:b2118.
PATRICi32119569. VBIEscCol129921_2195.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U00007 Genomic DNA. Translation: AAA60478.1. Frameshift.
U00096 Genomic DNA. Translation: AAC75179.1.
AP009048 Genomic DNA. Translation: BAE76593.1.
PIRiE64979.
RefSeqiNP_416621.1. NC_000913.3.
WP_000356817.1. NZ_LN832404.1.

3D structure databases

ProteinModelPortaliP33346.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi4259176. 18 interactions.
IntActiP33346. 8 interactions.
STRINGi511145.b2118.

Proteomic databases

PaxDbiP33346.
PRIDEiP33346.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiAAC75179; AAC75179; b2118.
BAE76593; BAE76593; BAE76593.
GeneIDi946649.
KEGGiecj:JW2105.
eco:b2118.
PATRICi32119569. VBIEscCol129921_2195.

Organism-specific databases

EchoBASEiEB1936.
EcoGeneiEG11995. yehI.

Phylogenomic databases

eggNOGiENOG4108KDT. Bacteria.
ENOG4111FHT. LUCA.
HOGENOMiHOG000122164.
OMAiESSWQRC.

Enzyme and pathway databases

BioCyciEcoCyc:EG11995-MONOMER.
ECOL316407:JW2105-MONOMER.

Miscellaneous databases

PROiP33346.

Family and domain databases

InterProiIPR025406. DUF4132.
[Graphical view]
PfamiPF13569. DUF4132. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiYEHI_ECOLI
AccessioniPrimary (citable) accession number: P33346
Secondary accession number(s): P76430, Q2MAW3
Entry historyi
Integrated into UniProtKB/Swiss-Prot: February 1, 1994
Last sequence update: November 1, 1997
Last modified: September 7, 2016
This is version 97 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.