P46837 (YHGF_ECOLI) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 95.

Names and origin

Protein names: Recommended name: Protein YhgF
Gene names: Name: yhgF
  Ordered Locus Names: b3407, JW3370
Organism: Escherichia coli (strain K12) [Reference proteome]
Taxonomic identifier: 83333 (NCBI)
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Protein attributes

Sequence length: 773 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Sequence similarities

Contains 1 S1 motif domain.

Sequence caution

The sequence AAA58204.1 differs from that shown. Reason: Wrong choice of frame.

The sequence AAA58205.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

The sequence AAA58205.1 differs from that shown. Reason: Frameshift at positions 12 and 66.

Binary interactions

With  Entry   #Exp.  IntAct                  Notes
valS  P07118  1      EBI-554743,EBI-559242

Sequence annotation (Features)

Feature key        Position(s)  Length  Description                 Feature identifier

Molecule processing

Chain              1 – 773      773     Protein YhgF                PRO_0000215104

Regions

Domain             651 – 720    70      S1 motif

Experimental info

Sequence conflict  754 – 755    2       QP → HA in AAA58205. Ref.1
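
The positions in the feature table are 1-based and inclusive, so a feature's length is end - start + 1. The following is a minimal Python sketch of that arithmetic, assuming the three features listed above; the `Feature` class and its field names are hypothetical helpers for illustration, not part of any UniProt tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """Hypothetical container for one row of the feature table."""
    key: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count.
        return self.end - self.start + 1

    def slice_of(self, sequence: str) -> str:
        """Return the residues covered by this feature (0-based slicing)."""
        return sequence[self.start - 1:self.end]


# Rows copied from the feature table of this entry.
FEATURES = [
    Feature("Chain", 1, 773),
    Feature("Domain", 651, 720),
    Feature("Sequence conflict", 754, 755),
]

for f in FEATURES:
    print(f.key, f.start, f.end, f.length)
```

The computed lengths (773, 70 and 2) agree with the Length column of the table.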

Sequences

P46837 [UniParc].

Last modified July 15, 1999. Version 3.
Checksum: EA54D9ED952A8229

Length: 773 AA. Mass: 85,120 Da.
        10         20         30         40         50         60 
MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL 

        70         80         90        100        110        120 
SYLRELEERR QAILKSISEQ GKLTDDLAKA INATLSKTEL EDLYLPYKPK RRTRGQIAIE 

       130        140        150        160        170        180 
AGLEPLADLL WSDPSHTPEV AAAQYVYADK GVADTKAALD GARYILMERF AEDAALLAKV 

       190        200        210        220        230        240 
RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN 

       250        260        270        280        290        300 
ADPQFDEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR 

       310        320        330        340        350        360 
ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH 

       370        380        390        400        410        420 
TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA 

       430        440        450        460        470        480 
SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA 

       490        500        510        520        530        540 
RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS 

       550        560        570        580        590        600 
RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL 

       610        620        630        640        650        660 
KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN 

       670        680        690        700        710        720 
VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR 

       730        740        750        760        770 
LDEQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR 
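
The sequence above is laid out in rows of 60 residues, split into groups of 10, with a ruler line of position numbers above each row. Below is a minimal Python sketch of turning such a block back into a plain residue string; the function name `parse_sequence_block` is an assumption made for this example, and only the final row of the block is used as sample input.

```python
def parse_sequence_block(block: str) -> str:
    """Strip ruler lines and spacing from a numbered sequence block."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Ruler lines contain only position numbers (10, 20, ...); skip them.
        if not tokens or all(t.isdigit() for t in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# Final ruler line and row of the block above (positions 721-773).
LAST_ROW = """\
       730        740        750        760        770
LDEQPGETNA RRGGGNERPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR
"""

tail = parse_sequence_block(LAST_ROW)
offset = 721  # sequence position of the first residue in `tail`

print(len(tail))                               # 53 residues in the last row
print(tail[754 - offset:755 - offset + 1])     # residues 754-755: QP
```

Twelve full rows of 60 plus this last row of 53 give the stated length of 773, and residues 754 to 755 come out as QP, the pair named in the sequence conflict feature.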

References

[1] "The complete genome sequence of Escherichia coli K-12."
Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V., Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F., Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B., Shao Y.
Science 277:1453-1474(1997)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / MG1655 / ATCC 47076.

[2] "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
Mol. Syst. Biol. 2:E1-E5(2006)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / W3110 / ATCC 27325 / DSM 5911.

[3] "Enrichment of low abundance proteins of Escherichia coli by hydroxyapatite chromatography."
Fountoulakis M., Takacs M.-F., Berndt P., Langen H., Takacs B.
Electrophoresis 20:2181-2195(1999)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY.
Strain: B / BL21.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  U18997 Genomic DNA. Translation: AAA58204.1. Sequence problems.
  U18997 Genomic DNA. Translation: AAA58205.1. Sequence problems.
  U00096 Genomic DNA. Translation: AAC76432.2.
  AP009048 Genomic DNA. Translation: BAE77884.1.
PIR: B65136.
RefSeq: NP_417866.4. NC_000913.2.
  YP_492025.1. NC_007779.1.

3D structure databases

ProteinModelPortal: P46837.
SMR: P46837. Positions 4-726.

Protein-protein interaction databases

DIP: DIP-12337N.
IntAct: P46837. 9 interactions.
MINT: MINT-1288997.
STRING: 511145.b3407.

Proteomic databases

PaxDb: P46837.
PRIDE: P46837.

Genome annotation databases

EnsemblBacteria: AAC76432; AAC76432; b3407.
  BAE77884; BAE77884; BAE77884.
GeneID: 12932731.
  947911.
KEGG: ecj:Y75_p3769.
  eco:b3407.
PATRIC: 32122250. VBIEscCol129921_3502.

Organism-specific databases

EchoBASE: EB2768.
EcoGene: EG12932. yhgF.

Phylogenomic databases

eggNOG: COG2183.
HOGENOM: HOG000270497.
KO: K06959.
OMA: YKQKRRT.
ProtClustDB: CLSK862831.

Enzyme and pathway databases

BioCyc: EcoCyc:G7746-MONOMER.
  ECOL316407:JW3370-MONOMER.

Gene expression databases

Genevestigator: P46837.

Family and domain databases

Gene3D: 1.10.10.650. 1 hit.
  1.10.150.310. 1 hit.
  1.10.3500.10. 2 hits.
  2.40.50.140. 1 hit.
  3.30.420.140. 1 hit.
InterPro: IPR012340. NA-bd_OB-fold.
  IPR003029. Rbsml_prot_S1_RNA-bd_dom.
  IPR022967. RNA-binding_domain_S1.
  IPR023323. Tex-like_dom.
  IPR023319. Tex-like_HTH_dom.
  IPR018974. Tex-like_N.
  IPR023097. Tex_RuvX-like_dom.
  IPR006641. YqgF/RNaseH-like_dom.
Pfam: PF00575. S1. 1 hit.
  PF09371. Tex_N. 1 hit.
SMART: SM00316. S1. 1 hit.
  SM00732. YqgFc. 1 hit.
SUPFAM: SSF50249. Nucleic_acid_OB. 1 hit.
PROSITE: PS50126. S1. 1 hit.

Entry information

Entry name: YHGF_ECOLI
Accession: Primary (citable) accession number: P46837
  Secondary accession number(s): P76689, Q2M772
Entry history: Integrated into UniProtKB/Swiss-Prot: November 1, 1995
  Last sequence update: July 15, 1999
  Last modified: May 1, 2013
This is version 95 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program
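
The entry name and accession numbers above follow fixed textual patterns, which makes them easy to check in scripts. The sketch below, in Python, validates the three accessions of this entry against the accession pattern given in the UniProt help pages and splits the entry name at its underscore. The helper names `is_accession` and `split_entry_name` are assumptions made for this example.

```python
import re

# Accession pattern as documented in the UniProt help pages.
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(text: str) -> bool:
    """True if the whole string looks like a UniProtKB accession."""
    return ACCESSION_RE.fullmatch(text) is not None


def split_entry_name(entry_name: str) -> tuple[str, str]:
    """Split an entry name such as YHGF_ECOLI into its two parts."""
    mnemonic, _, species = entry_name.partition("_")
    return mnemonic, species


for acc in ("P46837", "P76689", "Q2M772"):
    print(acc, is_accession(acc))

print(split_entry_name("YHGF_ECOLI"))
```

An entry name is not an accession, so `is_accession("YHGF_ECOLI")` returns False; only the primary accession P46837 should be used for citation.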

Relevant documents

Escherichia coli

Escherichia coli (strain K12): entries and cross-references to EcoGene

SIMILARITY comments

Index of protein domains and families