P37653 (BCSA_ECOLI) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 119.

Names and origin

Protein names
Recommended name: Cellulose synthase catalytic subunit [UDP-forming]
EC=2.4.1.12

Gene names
Name: bcsA
Synonyms: yhjO, yhjP
Ordered Locus Names: b3533, JW5665

Organism: Escherichia coli (strain K12) [Reference proteome]
Taxonomic identifier: 83333 (NCBI)
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Protein attributes

Sequence length: 872 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Catalytic subunit of cellulose synthase. It polymerizes uridine 5'-diphosphate glucose to cellulose, which is produced as an extracellular component for mechanical and chemical protection at the onset of the stationary phase, when the cells exhibit multicellular behavior (rdar morphotype). Coexpression of cellulose and thin aggregative fimbriae leads to a hydrophobic network with tightly packed cells embedded in a highly inert matrix.

Catalytic activity

UDP-glucose + (1,4-beta-D-glucosyl)(n) = UDP + (1,4-beta-D-glucosyl)(n+1).

Cofactor

Magnesium (by similarity).

Enzyme regulation

Activated by bis-(3'-5') cyclic diguanylic acid (c-di-GMP).

Pathway

Glycan metabolism; bacterial cellulose biosynthesis.

Subcellular location

Cell inner membrane; multi-pass membrane protein (potential).

Domain

There are two conserved domains in the globular part of the protein: the N-terminal domain (domain A) contains the conserved DXD motif and is possibly involved in catalysis and substrate binding. The C-terminal domain (domain B) contains the QXXRW motif and is present only in processive glycosyl transferases. It could be involved in the processivity function of the enzyme, possibly required for holding the growing glycan chain in the active site.
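
The two motifs named above can be located directly in the entry's sequence. The following is a minimal sketch, not part of the UniProt record: it assumes two fragments copied from the sequence shown further down this entry (residues 301-370 and 481-500), and the names `FRAG_A`, `FRAG_B` and `find_motif` are hypothetical. The `X` positions of each motif are treated as "any residue".

```python
import re

# Fragments copied from this entry's sequence. The offsets are the 1-based
# residue numbers of the first residue of each fragment (assumed from the
# sequence ruler, 60 residues per line).
FRAG_A_START = 301
FRAG_A = ("WPKDKLNIWILDDGGREEFRQFAQNVGVKYIARTTHEHAK"
          "AGNINNALKYAKGEFVSIFDCDHVPTRSFL")
FRAG_B_START = 481
FRAG_B = "GLATESLSAHIGQRIRWARG"


def find_motif(pattern, start, fragment):
    """Return (1-based position, matched text) for every match.

    A lookahead is used so that overlapping occurrences are also reported.
    """
    lookahead = re.compile(f"(?=({pattern}))")
    return [(start + m.start(), m.group(1)) for m in lookahead.finditer(fragment)]


# "X" in DXD / QXXRW means any residue, written "." in the regular expression.
dxd_hits = find_motif("D.D", FRAG_A_START, FRAG_A)
qxxrw_hits = find_motif("Q..RW", FRAG_B_START, FRAG_B)

print("DXD:", dxd_hits)
print("QXXRW:", qxxrw_hits)
```

Under these assumptions the DXD match falls on residues 360-362, the two positions annotated as substrate binding sites, and the QXXRW match falls inside catalytic subdomain B (441-501).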

Miscellaneous

The genes bcsA, bcsB, bcsZ and bcsC are constitutively transcribed but cellulose synthesis occurs only when AdrA, a putative transmembrane protein regulated by AgfD, is expressed. Cellulose production is abolished in E. coli K12.

Sequence similarities

Belongs to the glycosyltransferase 2 family.

Contains 1 PilZ domain.

Sequence caution

The sequence AAB18511.1 differs from that shown. Reason: Frameshift at position 128.

Sequence annotation (Features)

Feature key      Position(s)   Length   Description                                          Feature identifier

Molecule processing

Chain            1 – 872       872      Cellulose synthase catalytic subunit [UDP-forming]   PRO_0000059267

Regions

Transmembrane    30 – 50       21       Helical; Potential
Transmembrane    151 – 171     21       Helical; Potential
Transmembrane    173 – 193     21       Helical; Potential
Transmembrane    230 – 250     21       Helical; Potential
Transmembrane    525 – 545     21       Helical; Potential
Transmembrane    547 – 567     21       Helical; Potential
Transmembrane    592 – 612     21       Helical; Potential
Transmembrane    640 – 660     21       Helical; Potential
Transmembrane    668 – 688     21       Helical; Potential
Transmembrane    833 – 853     21       Helical; Potential
Domain           694 – 790     97       PilZ
Region           271 – 364     94       Catalytic subdomain A
Region           441 – 501     61       Catalytic subdomain B

Sites

Active site      313           1        Potential
Active site      457           1        Potential
Binding site     360           1        Substrate; Potential
Binding site     362           1        Substrate; Potential
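
The Length column in the feature listing above follows from the inclusive position range (end - start + 1). A small sketch of that arithmetic, assuming the ranges are transcribed by hand from the listing; `FEATURES` and `span_length` are hypothetical names and not part of any UniProt tool:

```python
# (feature key, start, end, length as listed), transcribed from the
# feature listing of this entry.
FEATURES = [
    ("Chain", 1, 872, 872),
    ("Transmembrane", 30, 50, 21),
    ("Transmembrane", 151, 171, 21),
    ("Transmembrane", 173, 193, 21),
    ("Transmembrane", 230, 250, 21),
    ("Transmembrane", 525, 545, 21),
    ("Transmembrane", 547, 567, 21),
    ("Transmembrane", 592, 612, 21),
    ("Transmembrane", 640, 660, 21),
    ("Transmembrane", 668, 688, 21),
    ("Transmembrane", 833, 853, 21),
    ("Domain", 694, 790, 97),
    ("Region", 271, 364, 94),
    ("Region", 441, 501, 61),
]


def span_length(start, end):
    """Length of an inclusive 1-based residue range."""
    return end - start + 1


# Rows whose listed length disagrees with the computed one.
mismatches = [f for f in FEATURES if span_length(f[1], f[2]) != f[3]]

# Total number of residues predicted to sit in transmembrane helices.
tm_spans = sorted((s, e) for key, s, e, _ in FEATURES if key == "Transmembrane")
tm_total = sum(span_length(s, e) for s, e in tm_spans)

# Helices should not overlap: each one must start after the previous ends.
tm_overlaps = [(a, b) for a, b in zip(tm_spans, tm_spans[1:]) if b[0] <= a[1]]

print("length mismatches:", mismatches)
print("transmembrane residues:", tm_total)
print("overlapping helices:", tm_overlaps)
```

The same check can be reused when features are copied between entries, since an off-by-one in either coordinate shows up as a length mismatch.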

Sequences

Sequence P37653 [UniParc]. Length: 872 AA. Mass: 99,785 Da.

Last modified July 26, 2002. Version 3.
Checksum: 14326B8A2EB228F7

        10         20         30         40         50         60 
MSILTRWLLI PPVNARLIGR YRDYRRHGAS AFSATLGCFW MILAWIFIPL EHPRWQRIRA 

        70         80         90        100        110        120 
EHKNLYPHIN ASRPRPLDPV RYLIQTCWLL IGASRKETPK PRRRAFSGLQ NIRGRYHQWM 

       130        140        150        160        170        180 
NELPERVSHK TQHLDEKKEL GHLSAGARRL ILGIIVTFSL ILALICVTQP FNPLAQFIFL 

       190        200        210        220        230        240 
MLLWGVALIV RRMPGRFSAL MLIVLSLTVS CRYIWWRYTS TLNWDDPVSL VCGLILLFAE 

       250        260        270        280        290        300 
TYAWIVLVLG YFQVVWPLNR QPVPLPKDMS LWPSVDIFVP TYNEDLNVVK NTIYASLGID 

       310        320        330        340        350        360 
WPKDKLNIWI LDDGGREEFR QFAQNVGVKY IARTTHEHAK AGNINNALKY AKGEFVSIFD 

       370        380        390        400        410        420 
CDHVPTRSFL QMTMGWFLKE KQLAMMQTPH HFFSPDPFER NLGRFRKTPN EGTLFYGLVQ 

       430        440        450        460        470        480 
DGNDMWDATF FCGSCAVIRR KPLDEIGGIA VETVTEDAHT SLRLHRRGYT SAYMRIPQAA 

       490        500        510        520        530        540 
GLATESLSAH IGQRIRWARG MVQIFRLDNP LTGKGLKFAQ RLCYVNAMFH FLSGIPRLIF 

       550        560        570        580        590        600 
LTAPLAFLLL HAYIIYAPAL MIALFVLPHM IHASLTNSKI QGKYRHSFWS EIYETVLAWY 

       610        620        630        640        650        660 
IAPPTLVALI NPHKGKFNVT AKGGLVEEEY VDWVISRPYI FLVLLNLVGV AVGIWRYFYG 

       670        680        690        700        710        720 
PPTEMLTVVV SMVWVFYNLI VLGGAVAVSV ESKQVRRSHR VEMTMPAAIA REDGHLFSCT 

       730        740        750        760        770        780 
VQDFSDGGLG IKINGQAQIL EGQKVNLLLK RGQQEYVFPT QVARVMGNEV GLKLMPLTTQ 

       790        800        810        820        830        840 
QHIDFVQCTF ARADTWALWQ DSYPEDKPLE SLLDILKLGF RGYRHLAEFA PSSVKGIFRV 

       850        860        870 
LTSLVSWVVS FIPRRPERSE TAQPSDQALA QQ 
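
The sequence above is laid out in groups of ten residues under a numeric ruler, which is awkward to use programmatically. The sketch below, which is an illustration and not a UniProt utility, flattens the block into one string and looks up residues by their 1-based position. The names `SEQUENCE_BLOCK`, `parse_sequence` and `residue_at` are hypothetical; ruler lines, if pasted in, are discarded by the same filter because only letters are kept.

```python
# Residue lines copied from the sequence display of this entry.
SEQUENCE_BLOCK = """
MSILTRWLLI PPVNARLIGR YRDYRRHGAS AFSATLGCFW MILAWIFIPL EHPRWQRIRA
EHKNLYPHIN ASRPRPLDPV RYLIQTCWLL IGASRKETPK PRRRAFSGLQ NIRGRYHQWM
NELPERVSHK TQHLDEKKEL GHLSAGARRL ILGIIVTFSL ILALICVTQP FNPLAQFIFL
MLLWGVALIV RRMPGRFSAL MLIVLSLTVS CRYIWWRYTS TLNWDDPVSL VCGLILLFAE
TYAWIVLVLG YFQVVWPLNR QPVPLPKDMS LWPSVDIFVP TYNEDLNVVK NTIYASLGID
WPKDKLNIWI LDDGGREEFR QFAQNVGVKY IARTTHEHAK AGNINNALKY AKGEFVSIFD
CDHVPTRSFL QMTMGWFLKE KQLAMMQTPH HFFSPDPFER NLGRFRKTPN EGTLFYGLVQ
DGNDMWDATF FCGSCAVIRR KPLDEIGGIA VETVTEDAHT SLRLHRRGYT SAYMRIPQAA
GLATESLSAH IGQRIRWARG MVQIFRLDNP LTGKGLKFAQ RLCYVNAMFH FLSGIPRLIF
LTAPLAFLLL HAYIIYAPAL MIALFVLPHM IHASLTNSKI QGKYRHSFWS EIYETVLAWY
IAPPTLVALI NPHKGKFNVT AKGGLVEEEY VDWVISRPYI FLVLLNLVGV AVGIWRYFYG
PPTEMLTVVV SMVWVFYNLI VLGGAVAVSV ESKQVRRSHR VEMTMPAAIA REDGHLFSCT
VQDFSDGGLG IKINGQAQIL EGQKVNLLLK RGQQEYVFPT QVARVMGNEV GLKLMPLTTQ
QHIDFVQCTF ARADTWALWQ DSYPEDKPLE SLLDILKLGF RGYRHLAEFA PSSVKGIFRV
LTSLVSWVVS FIPRRPERSE TAQPSDQALA QQ
"""


def parse_sequence(block):
    """Keep only letters, dropping spaces, newlines and ruler digits."""
    return "".join(ch for ch in block if ch.isalpha()).upper()


def residue_at(sequence, position):
    """Return the residue at a 1-based position, as used in the feature table."""
    if not 1 <= position <= len(sequence):
        raise IndexError(f"position {position} outside 1..{len(sequence)}")
    return sequence[position - 1]


sequence = parse_sequence(SEQUENCE_BLOCK)
print("length:", len(sequence))

# Residues at the two positions annotated as potential active sites.
for pos in (313, 457):
    print(pos, residue_at(sequence, pos))
```

Comparing `len(sequence)` with the stated sequence length of 872 AA is a quick way to confirm that nothing was lost when copying the block.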

References

[1] "Analysis of the Escherichia coli genome. V. DNA sequence of the region from 76.0 to 81.5 minutes."
Sofia H.J., Burland V., Daniels D.L., Plunkett G. III, Blattner F.R.
Nucleic Acids Res. 22:2576-2586(1994)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / MG1655 / ATCC 47076.

[2] "The complete genome sequence of Escherichia coli K-12."
Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V., Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F., Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B., Shao Y.
Science 277:1453-1474(1997)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], SEQUENCE REVISION.
Strain: K12 / MG1655 / ATCC 47076.

[3] "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
Mol. Syst. Biol. 2:E1-E5(2006)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / W3110 / ATCC 27325 / DSM 5911.

[4] "The multicellular morphotypes of Salmonella typhimurium and Escherichia coli produce cellulose as the second component of the extracellular matrix."
Zogaj X., Nimtz M., Rohde M., Bokranz W., Roemling U.
Mol. Microbiol. 39:1452-1463(2001)
Cited for: CHARACTERIZATION.
Strain: ECOR 10, ECOR 12 and TOB1.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
    U00039 Genomic DNA. Translation: AAB18510.1. Frameshift.
    U00039 Genomic DNA. Translation: AAB18511.1. Frameshift.
    U00096 Genomic DNA. Translation: AAC76558.2.
    AP009048 Genomic DNA. Translation: BAE77761.1.
PIR:
    H65151.
    S47754.
    S47755.
RefSeq:
    NP_417990.4. NC_000913.2.
    YP_491902.1. NC_007779.1.

3D structure databases

ProteinModelPortal: P37653.
SMR: P37653. Positions 149-803.

Protein-protein interaction databases

DIP: DIP-12387N.
IntAct: P37653. 2 interactions.
STRING: 511145.b3533.

Protein family/group databases

CAZy: GT2. Glycosyltransferase Family 2.

Proteomic databases

PaxDb: P37653.
PRIDE: P37653.

Genome annotation databases

EnsemblBacteria:
    AAC76558; AAC76558; b3533.
    BAE77761; BAE77761; BAE77761.
GeneID:
    12931861.
    948053.
KEGG:
    ecj:Y75_p3644.
    eco:b3533.
PATRIC: 32122532. VBIEscCol129921_3644.

Organism-specific databases

EchoBASE: EB2169.
EcoGene: EG12260. bcsA.

Phylogenomic databases

eggNOG: COG1215.
HOGENOM: HOG000259144.
KO: K00694.
OMA: LYVLPHM.
ProtClustDB: PRK11498.

Enzyme and pathway databases

BioCyc:
    EcoCyc:EG12260-MONOMER.
    ECOL316407:JW5665-MONOMER.
UniPathway: UPA00694.

Gene expression databases

Genevestigator: P37653.

Family and domain databases

InterPro:
    IPR003919. Cell_synth_A.
    IPR001173. Glyco_trans_2.
    IPR009875. PilZ_domain.
Pfam:
    PF00535. Glycos_transf_2. 1 hit.
    PF07238. PilZ. 1 hit.
PRINTS: PR01439. CELLSNTHASEA.
TIGRFAMs: TIGR03030. CelA. 1 hit.

Entry information

Entry name: BCSA_ECOLI
Accession
Primary (citable) accession number: P37653
Secondary accession number(s): P37654, P76712, P76713, Q2M7J5, Q8RSS7
Entry history
Integrated into UniProtKB/Swiss-Prot: October 1, 1994
Last sequence update: July 26, 2002
Last modified: May 1, 2013
This is version 119 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Relevant documents

Escherichia coli

Escherichia coli (strain K12): entries and cross-references to EcoGene

PATHWAY comments

Index of metabolic and biosynthesis pathways

SIMILARITY comments

Index of protein domains and families