Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Poly-beta-1,6-N-acetyl-D-glucosamine synthase

Gene

pgaC

Organism
Escherichia coli (strain K12)
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Experimental evidence at protein leveli

Functioni

Probable N-acetylglucosaminyltransferase that catalyzes the polymerization of single monomer units of UDP-N-acetylglucosamine to produce the linear homopolymer poly-beta-1,6-N-acetyl-D-glucosamine (PGA), a biofilm adhesin polysaccharide.2 Publications

GO - Molecular functioni

  • acetylglucosaminyltransferase activity Source: EcoCyc

GO - Biological processi

  • cell adhesion involved in biofilm formation Source: EcoCyc
Complete GO annotation...

Keywords - Molecular functioni

Glycosyltransferase, Transferase

Enzyme and pathway databases

BioCyciEcoCyc:G6529-MONOMER.
ECOL316407:JW1007-MONOMER.
MetaCyc:G6529-MONOMER.

Protein family/group databases

CAZyiGT2. Glycosyltransferase Family 2.
TCDBi4.D.1.1.3. the putative vectorial glycosyl polymerization (vgp) family.

Names & Taxonomyi

Protein namesi
Recommended name:
Poly-beta-1,6-N-acetyl-D-glucosamine synthase (EC:2.4.1.-)
Short name:
PGA synthase
Short name:
Poly-beta-1,6-GlcNAc synthase
Alternative name(s):
Biofilm PGA synthesis protein PgaC
N-acetylglucosaminyltransferase PgaC
Gene namesi
Name:pgaC
Synonyms:ycdQ
Ordered Locus Names:b1022, JW1007
OrganismiEscherichia coli (strain K12)
Taxonomic identifieri83333 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaGammaproteobacteriaEnterobacteralesEnterobacteriaceaeEscherichia
Proteomesi
  • UP000000318 Componenti: Chromosome
  • UP000000625 Componenti: Chromosome

Organism-specific databases

EcoGeneiEG13863. pgaC.

Subcellular locationi

Topology

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Topological domaini1 – 4PeriplasmicSequence analysis4
Transmembranei5 – 25HelicalSequence analysisAdd BLAST21
Topological domaini26 – 32CytoplasmicSequence analysis7
Transmembranei33 – 53HelicalSequence analysisAdd BLAST21
Topological domaini54 – 330PeriplasmicSequence analysisAdd BLAST277
Transmembranei331 – 351HelicalSequence analysisAdd BLAST21
Topological domaini352 – 362CytoplasmicSequence analysisAdd BLAST11
Transmembranei363 – 383HelicalSequence analysisAdd BLAST21
Topological domaini384 – 394PeriplasmicSequence analysisAdd BLAST11
Transmembranei395 – 415HelicalSequence analysisAdd BLAST21
Topological domaini416 – 441CytoplasmicSequence analysisAdd BLAST26

GO - Cellular componenti

  • integral component of membrane Source: UniProtKB-KW
  • plasma membrane Source: EcoCyc
Complete GO annotation...

Keywords - Cellular componenti

Cell inner membrane, Cell membrane, Membrane

Pathology & Biotechi

Disruption phenotypei

Cells lacking this gene do not synthesize PGA.2 Publications

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000592121 – 441Poly-beta-1,6-N-acetyl-D-glucosamine synthaseAdd BLAST441

Proteomic databases

PaxDbiP75905.
PRIDEiP75905.

Interactioni

Binary interactionsi

WithEntry#Exp.IntActNotes
pgaDP694322EBI-561450,EBI-562069

Protein-protein interaction databases

BioGridi4260056. 253 interactors.
DIPiDIP-11512N.
IntActiP75905. 6 interactors.
MINTiMINT-1234361.
STRINGi511145.b1022.

Structurei

3D structure databases

ProteinModelPortaliP75905.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

Belongs to the glycosyltransferase 2 family.Curated

Keywords - Domaini

Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOGiENOG4105CND. Bacteria.
COG1215. LUCA.
HOGENOMiHOG000170129.
InParanoidiP75905.
KOiK11936.
OMAiWPPVTIL.
PhylomeDBiP75905.

Family and domain databases

Gene3Di3.90.550.10. 1 hit.
InterProiIPR001173. Glyco_trans_2-like.
IPR029044. Nucleotide-diphossugar_trans.
IPR023853. PGA_PgaC/IcaA.
[Graphical view]
PfamiPF00535. Glycos_transf_2. 1 hit.
[Graphical view]
SUPFAMiSSF53448. SSF53448. 1 hit.
TIGRFAMsiTIGR03937. PgaC_IcaA. 1 hit.

Sequencei

Sequence statusi: Complete.

P75905-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MINRIVSFFI LCLVLCIPLC VAYFHSGELM MRFVFFWPFF MSIMWIVGGV
60 70 80 90 100
YFWVYRERHW PWGENAPAPQ LKDNPSISII IPCFNEEKNV EETIHAALAQ
110 120 130 140 150
RYENIEVIAV NDGSTDKTRA ILDRMAAQIP HLRVIHLAQN QGKAIALKTG
160 170 180 190 200
AAAAKSEYLV CIDGDALLDR DAAAYIVEPM LYNPRVGAVT GNPRIRTRST
210 220 230 240 250
LVGKIQVGEY SSIIGLIKRT QRIYGNVFTV SGVIAAFRRS ALAEVGYWSD
260 270 280 290 300
DMITEDIDIS WKLQLNQWTI FYEPRALCWI LMPETLKGLW KQRLRWAQGG
310 320 330 340 350
AEVFLKNMTR LWRKENFRMW PLFFEYCLTT IWAFTCLVGF IIYAVQLAGV
360 370 380 390 400
PLNIELTHIA ATHTAGILLC TLCLLQFIVS LMIENRYEHN LTSSLFWIIW
410 420 430 440
FPVIFWMLSL ATTLVSFTRV MLMPKKQRAR WVSPDRGILR G
Length:441
Mass (Da):50,766
Last modified:February 1, 1997 - v1
Checksum:i04F5A53D72FEBABB
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U00096 Genomic DNA. Translation: AAC74107.1.
AP009048 Genomic DNA. Translation: BAA35803.1.
PIRiD64844.
RefSeqiNP_415541.1. NC_000913.3.

Genome annotation databases

EnsemblBacteriaiAAC74107; AAC74107; b1022.
BAA35803; BAA35803; BAA35803.
GeneIDi945606.
KEGGiecj:JW1007.
eco:b1022.
PATRICi32117277. VBIEscCol129921_1062.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U00096 Genomic DNA. Translation: AAC74107.1.
AP009048 Genomic DNA. Translation: BAA35803.1.
PIRiD64844.
RefSeqiNP_415541.1. NC_000913.3.

3D structure databases

ProteinModelPortaliP75905.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi4260056. 253 interactors.
DIPiDIP-11512N.
IntActiP75905. 6 interactors.
MINTiMINT-1234361.
STRINGi511145.b1022.

Protein family/group databases

CAZyiGT2. Glycosyltransferase Family 2.
TCDBi4.D.1.1.3. the putative vectorial glycosyl polymerization (vgp) family.

Proteomic databases

PaxDbiP75905.
PRIDEiP75905.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiAAC74107; AAC74107; b1022.
BAA35803; BAA35803; BAA35803.
GeneIDi945606.
KEGGiecj:JW1007.
eco:b1022.
PATRICi32117277. VBIEscCol129921_1062.

Organism-specific databases

EchoBASEiEB3623.
EcoGeneiEG13863. pgaC.

Phylogenomic databases

eggNOGiENOG4105CND. Bacteria.
COG1215. LUCA.
HOGENOMiHOG000170129.
InParanoidiP75905.
KOiK11936.
OMAiWPPVTIL.
PhylomeDBiP75905.

Enzyme and pathway databases

BioCyciEcoCyc:G6529-MONOMER.
ECOL316407:JW1007-MONOMER.
MetaCyc:G6529-MONOMER.

Miscellaneous databases

PROiP75905.

Family and domain databases

Gene3Di3.90.550.10. 1 hit.
InterProiIPR001173. Glyco_trans_2-like.
IPR029044. Nucleotide-diphossugar_trans.
IPR023853. PGA_PgaC/IcaA.
[Graphical view]
PfamiPF00535. Glycos_transf_2. 1 hit.
[Graphical view]
SUPFAMiSSF53448. SSF53448. 1 hit.
TIGRFAMsiTIGR03937. PgaC_IcaA. 1 hit.
ProtoNetiSearch...

Entry informationi

Entry nameiPGAC_ECOLI
AccessioniPrimary (citable) accession number: P75905
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 1, 1997
Last sequence update: February 1, 1997
Last modified: November 2, 2016
This is version 123 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.