Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Protein YhgF

Gene

yhgF

Organism
Escherichia coli (strain K12)
Status
Reviewed-Annotation score: Annotation score: 2 out of 5-Experimental evidence at protein leveli

Functioni

GO - Molecular functioni

GO - Biological processi

  • nucleobase-containing compound metabolic process Source: InterPro
  • response to ionizing radiation Source: EcoCyc
Complete GO annotation...

Keywords - Ligandi

RNA-binding

Enzyme and pathway databases

BioCyciEcoCyc:G7746-MONOMER.
ECOL316407:JW3370-MONOMER.
RETL1328306-WGS:GSTH-905-MONOMER.

Names & Taxonomyi

Protein namesi
Recommended name:
Protein YhgF
Gene namesi
Name:yhgF
Ordered Locus Names:b3407, JW3370
OrganismiEscherichia coli (strain K12)
Taxonomic identifieri83333 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia
Proteomesi
  • UP000000318 Componenti: Chromosome
  • UP000000625 Componenti: Chromosome

Organism-specific databases

EcoGeneiEG12932. yhgF.

Subcellular locationi

GO - Cellular componenti

  • cytosol Source: EcoCyc
Complete GO annotation...

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 773773Protein YhgFPRO_0000215104Add
BLAST

Proteomic databases

EPDiP46837.
PaxDbiP46837.
PRIDEiP46837.

Interactioni

Protein-protein interaction databases

BioGridi4261725. 12 interactions.
DIPiDIP-12337N.
IntActiP46837. 9 interactions.
MINTiMINT-1288997.
STRINGi511145.b3407.

Structurei

3D structure databases

ProteinModelPortaliP46837.
SMRiP46837. Positions 4-726.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini651 – 72070S1 motifPROSITE-ProRule annotationAdd
BLAST

Sequence similaritiesi

Contains 1 S1 motif domain.PROSITE-ProRule annotation

Phylogenomic databases

eggNOGiENOG4105BZM. Bacteria.
COG2183. LUCA.
HOGENOMiHOG000270497.
InParanoidiP46837.
KOiK06959.
OMAiGFLRIRD.
OrthoDBiEOG6WT8CC.
PhylomeDBiP46837.

Family and domain databases

Gene3Di1.10.10.650. 1 hit.
1.10.150.310. 1 hit.
1.10.3500.10. 2 hits.
2.40.50.140. 1 hit.
3.30.420.140. 1 hit.
InterProiIPR012340. NA-bd_OB-fold.
IPR012337. RNaseH-like_dom.
IPR010994. RuvA_2-like.
IPR022967. S1_dom.
IPR003029. S1_domain.
IPR023323. Tex-like_dom.
IPR023319. Tex-like_HTH_dom.
IPR018974. Tex-like_N.
IPR023097. Tex_RuvX-like_dom.
IPR032639. Tex_YqgF.
IPR006641. YqgF/RNaseH-like_dom.
[Graphical view]
PfamiPF00575. S1. 1 hit.
PF09371. Tex_N. 1 hit.
PF16921. Tex_YqgF. 1 hit.
[Graphical view]
SMARTiSM00316. S1. 1 hit.
SM00732. YqgFc. 1 hit.
[Graphical view]
SUPFAMiSSF47781. SSF47781. 2 hits.
SSF50249. SSF50249. 1 hit.
SSF53098. SSF53098. 1 hit.
PROSITEiPS50126. S1. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

P46837-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD
60 70 80 90 100
TQLRNLETRL SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL
110 120 130 140 150
EDLYLPYKPK RRTRGQIAIE AGLEPLADLL WSDPSHTPEV AAAQYVYADK
160 170 180 190 200
GVADTKAALD GARYILMERF AEDAALLAKV RDYLWKNAHL VSTVVSGKEE
210 220 230 240 250
EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN ADPQFDEPPK
260 270 280 290 300
ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR
310 320 330 340 350
ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK
360 370 380 390 400
LVATDTIYPH TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD
410 420 430 440 450
VQKQFPKVTA QKVIVSEAGA SVYSASELAA QEFPDLDVSL RGAVSIARRL
460 470 480 490 500
QDPLAELVKI DPKSIGVGQY QHDVSQTQLA RKLDAVVEDC VNAVGVDLNT
510 520 530 540 550
ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS RLGPKAFEQC
560 570 580 590 600
AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL
610 620 630 640 650
KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP
660 670 680 690 700
GMILEGAVTN VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI
710 720 730 740 750
VKVKVLEVDL QRKRIALTMR LDEQPGETNA RRGGGNERPQ NNRPAAKPRG
760 770
REAQPAGNSA MMDALAAAMG KKR
Length:773
Mass (Da):85,120
Last modified:July 15, 1999 - v3
Checksum:iEA54D9ED952A8229
GO

Sequence cautioni

The sequence AAA58204.1 differs from that shown.Wrong choice of frame.Curated
The sequence AAA58205.1 differs from that shown. Reason: Frameshift at positions 12 and 66. Curated
The sequence AAA58205.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.Curated

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti754 – 7552QP → HA in AAA58205 (PubMed:9278503).Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U18997 Genomic DNA. Translation: AAA58204.1. Sequence problems.
U18997 Genomic DNA. Translation: AAA58205.1. Sequence problems.
U00096 Genomic DNA. Translation: AAC76432.2.
AP009048 Genomic DNA. Translation: BAE77884.1.
PIRiB65136.
RefSeqiNP_417866.4. NC_000913.3.
WP_000980727.1. NZ_LN832404.1.

Genome annotation databases

EnsemblBacteriaiAAC76432; AAC76432; b3407.
BAE77884; BAE77884; BAE77884.
GeneIDi947911.
KEGGiecj:JW3370.
eco:b3407.
PATRICi32122250. VBIEscCol129921_3502.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U18997 Genomic DNA. Translation: AAA58204.1. Sequence problems.
U18997 Genomic DNA. Translation: AAA58205.1. Sequence problems.
U00096 Genomic DNA. Translation: AAC76432.2.
AP009048 Genomic DNA. Translation: BAE77884.1.
PIRiB65136.
RefSeqiNP_417866.4. NC_000913.3.
WP_000980727.1. NZ_LN832404.1.

3D structure databases

ProteinModelPortaliP46837.
SMRiP46837. Positions 4-726.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi4261725. 12 interactions.
DIPiDIP-12337N.
IntActiP46837. 9 interactions.
MINTiMINT-1288997.
STRINGi511145.b3407.

Proteomic databases

EPDiP46837.
PaxDbiP46837.
PRIDEiP46837.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiAAC76432; AAC76432; b3407.
BAE77884; BAE77884; BAE77884.
GeneIDi947911.
KEGGiecj:JW3370.
eco:b3407.
PATRICi32122250. VBIEscCol129921_3502.

Organism-specific databases

EchoBASEiEB2768.
EcoGeneiEG12932. yhgF.

Phylogenomic databases

eggNOGiENOG4105BZM. Bacteria.
COG2183. LUCA.
HOGENOMiHOG000270497.
InParanoidiP46837.
KOiK06959.
OMAiGFLRIRD.
OrthoDBiEOG6WT8CC.
PhylomeDBiP46837.

Enzyme and pathway databases

BioCyciEcoCyc:G7746-MONOMER.
ECOL316407:JW3370-MONOMER.
RETL1328306-WGS:GSTH-905-MONOMER.

Miscellaneous databases

PROiP46837.

Family and domain databases

Gene3Di1.10.10.650. 1 hit.
1.10.150.310. 1 hit.
1.10.3500.10. 2 hits.
2.40.50.140. 1 hit.
3.30.420.140. 1 hit.
InterProiIPR012340. NA-bd_OB-fold.
IPR012337. RNaseH-like_dom.
IPR010994. RuvA_2-like.
IPR022967. S1_dom.
IPR003029. S1_domain.
IPR023323. Tex-like_dom.
IPR023319. Tex-like_HTH_dom.
IPR018974. Tex-like_N.
IPR023097. Tex_RuvX-like_dom.
IPR032639. Tex_YqgF.
IPR006641. YqgF/RNaseH-like_dom.
[Graphical view]
PfamiPF00575. S1. 1 hit.
PF09371. Tex_N. 1 hit.
PF16921. Tex_YqgF. 1 hit.
[Graphical view]
SMARTiSM00316. S1. 1 hit.
SM00732. YqgFc. 1 hit.
[Graphical view]
SUPFAMiSSF47781. SSF47781. 2 hits.
SSF50249. SSF50249. 1 hit.
SSF53098. SSF53098. 1 hit.
PROSITEiPS50126. S1. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / MG1655 / ATCC 47076.
  2. "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
    Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
    Mol. Syst. Biol. 2:E1-E5(2006) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
  3. "Enrichment of low abundance proteins of Escherichia coli by hydroxyapatite chromatography."
    Fountoulakis M., Takacs M.-F., Berndt P., Langen H., Takacs B.
    Electrophoresis 20:2181-2195(1999) [PubMed] [Europe PMC] [Abstract]
    Cited for: IDENTIFICATION BY MASS SPECTROMETRY.
    Strain: B / BL21.

Entry informationi

Entry nameiYHGF_ECOLI
AccessioniPrimary (citable) accession number: P46837
Secondary accession number(s): P76689, Q2M772
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 1, 1995
Last sequence update: July 15, 1999
Last modified: July 6, 2016
This is version 116 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.