Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P37653 (BCSA_ECOLI)

Last modified June 16, 2009. Version 86. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Cellulose synthase catalytic subunit [UDP-forming]
    EC=2.4.1.12
Gene names
Name: bcsA
Synonyms: yhjO, yhjP
Ordered Locus Names: b3533, JW5665
OrganismEscherichia coli (strain K12) [Complete proteome] [HAMAP]
Taxonomic identifier83333 [NCBI]
Taxonomic lineageBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia

Protein attributes

Sequence length872 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Catalytic subunit of cellulose synthase. It polymerizes uridine 5'-diphosphate glucose to cellulose, which is produced as an extracellular component for mechanical and chemical protection at the onset of the stationary phase, when the cells exhibit multicellular behavior (rdar morphotype). Coexpression of cellulose and thin aggregative fimbriae leads to a hydrophobic network with tightly packed cells embedded in a highly inert matrix.

Catalytic activity

UDP-glucose + (1,4-beta-D-glucosyl)(n) = UDP + (1,4-beta-D-glucosyl)(n+1).

Cofactor

Magnesium By similarity.

Enzyme regulation

Activated by bis-(3'-5') cyclic diguanylic acid (c-di-GMP).

Pathway

Glycan metabolism; bacterial cellulose biosynthesis.

Subcellular location

Cell inner membrane; Multi-pass membrane protein Potential.

Domain

There are two conserved domains in the globular part of the protein: the N-terminal domain (domain A) contains the conserved DXD motif and is possibly involved in catalysis and substrate binding. The C-terminal domain (domain B) contains the QXXRW motif and is present only in processive glycosyl transferases. It could be involved in the processivity function of the enzyme, possibly required for holding the growing glycan chain in the active site.

Miscellaneous

The genes bscA, bcsB, bcsZ and bcsC are constitutively transcribed but cellulose synthesis occurs only when adrA, a putative transmembrane protein regulated by agfD, is expressed. Cellulose production is abolished in E.coli K12.

Sequence similarities

Belongs to the glycosyltransferase 2 family.

Sequence caution

The sequence AAB18511.1 differs from that shown. Reason: Frameshift at position 128.

Ontologies

Keywords
   Biological processCellulose biosynthesis
   Cellular componentCell inner membrane
Cell membrane
Membrane
   DomainTransmembrane
   Ligandc-di-GMP
   Molecular functionGlycosyltransferase
Transferase
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processUDP-glucose metabolic process

Inferred from electronic annotation. Source: InterPro

cellulose biosynthetic process

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentintegral to plasma membrane

Inferred from electronic annotation. Source: InterPro

   Molecular functioncellulose synthase (UDP-forming) activity

Inferred from electronic annotation. Source: EC

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 872872Cellulose synthase catalytic subunit [UDP-forming]
PRO_0000059267

Regions

Transmembrane30 – 5021 Potential
Transmembrane151 – 17121 Potential
Transmembrane173 – 19321 Potential
Transmembrane230 – 25021 Potential
Transmembrane525 – 54521 Potential
Transmembrane547 – 56721 Potential
Transmembrane592 – 61221 Potential
Transmembrane640 – 66021 Potential
Transmembrane668 – 68821 Potential
Transmembrane833 – 85321 Potential
Region271 – 36494Catalytic subdomain A
Region441 – 50161Catalytic subdomain B

Sites

Active site3131 Potential
Active site4571 Potential
Binding site3601Substrate Potential
Binding site3621Substrate Potential

Sequences

Sequence LengthMass (Da)Tools
P37653-1 [UniParc].

Last modified July 26, 2002. Version 3.
Checksum: 14326B8A2EB228F7

FASTA87299,785
        10         20         30         40         50         60 
MSILTRWLLI PPVNARLIGR YRDYRRHGAS AFSATLGCFW MILAWIFIPL EHPRWQRIRA 

        70         80         90        100        110        120 
EHKNLYPHIN ASRPRPLDPV RYLIQTCWLL IGASRKETPK PRRRAFSGLQ NIRGRYHQWM 

       130        140        150        160        170        180 
NELPERVSHK TQHLDEKKEL GHLSAGARRL ILGIIVTFSL ILALICVTQP FNPLAQFIFL 

       190        200        210        220        230        240 
MLLWGVALIV RRMPGRFSAL MLIVLSLTVS CRYIWWRYTS TLNWDDPVSL VCGLILLFAE 

       250        260        270        280        290        300 
TYAWIVLVLG YFQVVWPLNR QPVPLPKDMS LWPSVDIFVP TYNEDLNVVK NTIYASLGID 

       310        320        330        340        350        360 
WPKDKLNIWI LDDGGREEFR QFAQNVGVKY IARTTHEHAK AGNINNALKY AKGEFVSIFD 

       370        380        390        400        410        420 
CDHVPTRSFL QMTMGWFLKE KQLAMMQTPH HFFSPDPFER NLGRFRKTPN EGTLFYGLVQ 

       430        440        450        460        470        480 
DGNDMWDATF FCGSCAVIRR KPLDEIGGIA VETVTEDAHT SLRLHRRGYT SAYMRIPQAA 

       490        500        510        520        530        540 
GLATESLSAH IGQRIRWARG MVQIFRLDNP LTGKGLKFAQ RLCYVNAMFH FLSGIPRLIF 

       550        560        570        580        590        600 
LTAPLAFLLL HAYIIYAPAL MIALFVLPHM IHASLTNSKI QGKYRHSFWS EIYETVLAWY 

       610        620        630        640        650        660 
IAPPTLVALI NPHKGKFNVT AKGGLVEEEY VDWVISRPYI FLVLLNLVGV AVGIWRYFYG 

       670        680        690        700        710        720 
PPTEMLTVVV SMVWVFYNLI VLGGAVAVSV ESKQVRRSHR VEMTMPAAIA REDGHLFSCT 

       730        740        750        760        770        780 
VQDFSDGGLG IKINGQAQIL EGQKVNLLLK RGQQEYVFPT QVARVMGNEV GLKLMPLTTQ 

       790        800        810        820        830        840 
QHIDFVQCTF ARADTWALWQ DSYPEDKPLE SLLDILKLGF RGYRHLAEFA PSSVKGIFRV 

       850        860        870 
LTSLVSWVVS FIPRRPERSE TAQPSDQALA QQ 

« Hide

References

« Hide 'large scale' references
[1]"Analysis of the Escherichia coli genome. V. DNA sequence of the region from 76.0 to 81.5 minutes."
Sofia H.J., Burland V., Daniels D.L., Plunkett G. III, Blattner F.R.
Nucleic Acids Res. 22:2576-2586(1994) [PubMed: 8041620] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / MG1655 / ATCC 47076.
[2]"The complete genome sequence of Escherichia coli K-12."
Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V., Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F., Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B., Shao Y.
Science 277:1453-1474(1997) [PubMed: 9278503] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], SEQUENCE REVISION.
Strain: K12 / MG1655 / ATCC 47076.
[3]"Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
Mol. Syst. Biol. 2:E1-E5(2006) [PubMed: 16738553] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
[4]"The multicellular morphotypes of Salmonella typhimurium and Escherichia coli produce cellulose as the second component of the extracellular matrix."
Zogaj X., Nimtz M., Rohde M., Bokranz W., Roemling U.
Mol. Microbiol. 39:1452-1463(2001) [PubMed: 11260463] [Abstract]
Cited for: CHARACTERIZATION.
Strain: ECOR 10, ECOR 12 and TOB1.

Cross-references

Sequence databases

U00039 Genomic DNA. Translation: AAB18510.1. Frameshift.
U00039 Genomic DNA. Translation: AAB18511.1. Frameshift.
U00096 Genomic DNA. Translation: AAC76558.1. Different initiation.
AP009048 Genomic DNA. Translation: BAE77761.1.
PIRH65151.
S47754.
S47755.
RefSeqAP_004260.1.
NP_417990.4.

3D structure databases

ModBaseSearch...

Protein-protein interaction databases

DIPDIP:12387N.

Protein family/group databases

CAZyGT2. Glycosyltransferase Family 2.

Genome annotation databases

GeneID948053.
GenomeReviewsGene locus JW5665 in contig AP009048_GR.
Gene locus b3533 in contig U00096_GR.
KEGGecj:JW5665.
eco:b3533.

Organism-specific databases

EchoBASEEB2169.
EcoGeneEG12260. bcsA.
CMRSearch...

Phylogenomic databases

HOGENOMP37653.
OMAP37653. CYANAML.

Enzyme and pathway databases

BioCycEcoCyc:EG12260-MON.

Family and domain databases

InterProIPR003919. Cell_synth_A.
IPR017480. Cellulose_synth_catalytic.
IPR001173. Glyco_trans_2.
IPR009875. PilZ.
[Graphical view]
PfamPF00535. Glycos_transf_2. 1 hit.
PF07238. PilZ. 1 hit.
[Graphical view]
PRINTSPR01439. CELLSNTHASEA.
TIGRFAMsTIGR03030. CelA. 1 hit.
ProtoNetSearch...

Entry information

Entry nameBCSA_ECOLI
AccessionPrimary (citable) accession number: P37653
Secondary accession number(s): P37654 expand/collapse secondary AC list , P76712, P76713, Q2M7J5, Q8RSS7
Entry history
Integrated into UniProtKB/Swiss-Prot: October 1, 1994
Last sequence update: July 26, 2002
Last modified: June 16, 2009
This is version 86 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHAMAP (High-quality Automated and Manual Annotation of microbial Proteomes)

Relevant documents

Escherichia coli

Escherichia coli (strain K12): entries and cross-references to EcoGene

PATHWAY comments

Index of metabolic and biosynthesis pathways

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents