Skip Header

Contribute Send feedback
Read comments (?) or add your own

P25740 (RFAG_ECOLI) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 100. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (4) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Lipopolysaccharide core biosynthesis protein RfaG

EC=2.4.-.-
Alternative name(s):
Glucosyltransferase I
Gene names
Name:rfaG
Synonyms:pcsA, waaG
Ordered Locus Names:b3631, JW3606
OrganismEscherichia coli (strain K12) [Reference proteome] [HAMAP]
Taxonomic identifier83333 [NCBI]
Taxonomic lineageBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia

Protein attributes

Sequence length374 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Involved in the addition of the first glucose residue to the lipopolysaccharide core.

Pathway

Bacterial outer membrane biogenesis; LPS core biosynthesis.

Sequence similarities

Belongs to the glycosyltransferase group 1 family. Glycosyltransferase 4 subfamily.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 374374Lipopolysaccharide core biosynthesis protein RfaG
PRO_0000080305

Secondary structure

..................................................................... 374
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P25740 [UniParc].

Last modified May 1, 1992. Version 1.
Checksum: 7F20AE577CBB80C2

FASTA37442,284
        10         20         30         40         50         60 
MIVAFCLYKY FPFGGLQRDF MRIASTVAAR GHHVRVYTQS WEGDCPKAFE LIQVPVKSHT 

        70         80         90        100        110        120 
NHGRNAEYYA WVQNHLKEHP ADRVVGFNKM PGLDVYFAAD VCYAEKVAQE KGFLYRLTSR 

       130        140        150        160        170        180 
YRHYAAFERA TFEQGKSTKL MMLTDKQIAD FQKHYQTEPE RFQILPPGIY PDRKYSEQIP 

       190        200        210        220        230        240 
NSREIYRQKN GIKEQQNLLL QVGSDFGRKG VDRSIEALAS LPESLRHNTL LFVVGQDKPR 

       250        260        270        280        290        300 
KFEALAEKLG VRSNVHFFSG RNDVSELMAA ADLLLHPAYQ EAAGIVLLEA ITAGLPVLTT 

       310        320        330        340        350        360 
AVCGYAHYIA DANCGTVIAE PFSQEQLNEV LRKALTQSPL RMAWAENARH YADTQDLYSL 

       370 
PEKAADIITG GLDG 

« Hide

References

« Hide 'large scale' references
[1]"Identification and sequences of the lipopolysaccharide core biosynthetic genes rfaQ, rfaP, and rfaG of Escherichia coli K-12."
Parker C.T., Pradel E., Schnaitman C.A.
J. Bacteriol. 174:930-934(1992) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: K12.
[2]"Analysis of the Escherichia coli genome. V. DNA sequence of the region from 76.0 to 81.5 minutes."
Sofia H.J., Burland V., Daniels D.L., Plunkett G. III, Blattner F.R.
Nucleic Acids Res. 22:2576-2586(1994) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / MG1655 / ATCC 47076.
[3]"The complete genome sequence of Escherichia coli K-12."
Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V., Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F., Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B., Shao Y.
Science 277:1453-1474(1997) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / MG1655 / ATCC 47076.
[4]"Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
Mol. Syst. Biol. 2:E1-E5(2006) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
[5]"The gene coding for 3-deoxy-manno-octulosonic acid transferase and the rfaQ gene are transcribed from divergently arranged promoters in Escherichia coli."
Clementz T.
J. Bacteriol. 174:7750-7756(1992) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-217.
[6]"Identification by Tn10 transposon mutagenesis of host factors involved in the biosynthesis of K99 fimbriae of Escherichia coli: effect of LPS core mutations."
Pilipcinec E., Huisman T.T., Willemsen P.T., Appelmelk B.J., Graaf F.K., Oudega B.
FEMS Microbiol. Lett. 123:201-206(1994) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-58.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
M80599 Genomic DNA. Translation: AAA24082.1.
M86305 Genomic DNA. Translation: AAA03743.2.
U00039 Genomic DNA. Translation: AAB18608.1.
U00096 Genomic DNA. Translation: AAC76655.1.
AP009048 Genomic DNA. Translation: BAE77661.1.
S75736 Genomic DNA. Translation: AAD43826.1.
PIRB42595.
RefSeqNP_418088.1. NC_000913.2.
YP_491802.1. NC_007779.1.

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
2IV7X-ray1.60A1-374[»]
2IW1X-ray1.50A1-374[»]
ProteinModelPortalP25740.
SMRP25740. Positions 2-371.
ModBaseSearch...

Protein-protein interaction databases

IntActP25740. 10 interactions.
STRING511145.b3631.

Protein family/group databases

CAZyGT4. Glycosyltransferase Family 4.

Proteomic databases

PaxDbP25740.
PRIDEP25740.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaAAC76655; AAC76655; b3631.
BAE77661; BAE77661; BAE77661.
GeneID12931875.
948149.
KEGGecj:Y75_p3543.
eco:b3631.
PATRIC32122749. VBIEscCol129921_3751.

Organism-specific databases

EchoBASEEB1315.
EcoGeneEG11339. rfaG.

Phylogenomic databases

eggNOGCOG0438.
HOGENOMHOG000126226.
KOK02844.
OMAGLQRDFL.
ProtClustDBCLSK869057.

Enzyme and pathway databases

BioCycEcoCyc:EG11339-MONOMER.
ECOL316407:JW3606-MONOMER.
MetaCyc:EG11339-MONOMER.
UniPathwayUPA00958.

Gene expression databases

GenevestigatorP25740.

Family and domain databases

InterProIPR001296. Glyco_trans_1.
[Graphical view]
PfamPF00534. Glycos_transf_1. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

EvolutionaryTraceP25740.

Entry information

Entry nameRFAG_ECOLI
AccessionPrimary (citable) accession number: P25740
Secondary accession number(s): Q2M7U5
Entry history
Integrated into UniProtKB/Swiss-Prot: May 1, 1992
Last sequence update: May 1, 1992
Last modified: May 1, 2013
This is version 100 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Relevant documents

Escherichia coli

Escherichia coli (strain K12): entries and cross-references to EcoGene

PATHWAY comments

Index of metabolic and biosynthesis pathways

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families