Protein

Cellulose synthase operon protein C

Gene

bcsC

Organism
Escherichia coli (strain K12)
Status
Reviewed. Annotation score: 2 out of 5. Experimental evidence at protein level.

Function

Required for maximal bacterial cellulose synthesis (by similarity).

Pathway: bacterial cellulose biosynthesis

This protein is involved in the pathway bacterial cellulose biosynthesis, which is part of Glycan metabolism.

Keywords - Biological process

Cellulose biosynthesis

Enzyme and pathway databases

BioCyc: EcoCyc:EG12257-MONOMER.
ECOL316407:JW5942-MONOMER.
UniPathway: UPA00694.

Protein family/group databases

TCDB: 1.B.55.3.1. The poly acetyl glucosamine porin (PgaA) family.

Names & Taxonomy

Protein names
Recommended name: Cellulose synthase operon protein C
Gene names
Name: bcsC
Synonyms: yhjL
Ordered Locus Names: b3530, JW5942
Organism: Escherichia coli (strain K12)
Taxonomic identifier: 83333 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia
Proteomes
  • UP000000318 Component: Chromosome
  • UP000000625 Component: Chromosome

Organism-specific databases

EcoGene: EG12257. bcsC.

PTM / Processing

Molecule processing

Feature key     Position(s)  Length  Description                          Feature identifier
Signal peptide  1 – 23       23      Sequence analysis
Chain           24 – 1157    1134    Cellulose synthase operon protein C  PRO_0000106429
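The signal peptide and chain coordinates above imply a cleavage between residues 23 and 24 of the precursor. The following is a minimal Python sketch of that split, assuming the predicted signal peptide boundary listed in this entry (it is annotated by sequence analysis, not experiment). The helper name `split_precursor` is hypothetical, and the 50-residue string is copied from the first row of the Sequence section of this entry.

```python
# First 50 residues of the precursor, copied from the Sequence section.
PRECURSOR_N_TERMINUS = "MRKFTLNIFTLSLGLAVMPMVEAAPTAQQQLLEQVRLGEATHREDLVQQS"

# Last residue of the predicted signal peptide (1-based), from the table.
SIGNAL_PEPTIDE_END = 23


def split_precursor(sequence: str, signal_end: int):
    """Split a precursor into (signal peptide, remainder) at a 1-based position."""
    if not 0 < signal_end < len(sequence):
        raise ValueError("signal_end must fall inside the sequence")
    # Python slices are 0-based and end-exclusive, so [:23] covers residues 1-23.
    return sequence[:signal_end], sequence[signal_end:]


signal, mature_start = split_precursor(PRECURSOR_N_TERMINUS, SIGNAL_PEPTIDE_END)
print(signal)        # residues 1-23
print(mature_start)  # residues 24-50, the start of the mature chain
```

With the full 1,157-residue precursor in place of the 50-residue fragment, the second element would be the 1,134-residue chain listed in the table.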

Proteomic databases

EPD: P37650.
PaxDb: P37650.

Interaction

Protein-protein interaction databases

BioGrid: 4261634. 107 interactions.
STRING: 511145.b3530.

Structure

3D structure databases

ProteinModelPortal: P37650.
SMR: P37650. Positions 162-204, 303-432, 463-552, 605-630.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Length  Description
Repeat       62 – 95      34      TPR 1
Repeat       112 – 145    34      TPR 2
Repeat       269 – 302    34      TPR 3
Repeat       303 – 336    34      TPR 4
Repeat       351 – 384    34      TPR 5
Repeat       385 – 418    34      TPR 6
Repeat       461 – 494    34      TPR 7
Repeat       603 – 636    34      TPR 8
Repeat       710 – 744    35      TPR 9
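As a quick consistency check on the repeat table, the listed lengths can be recomputed from the inclusive positions, together with the share of the precursor covered by TPR repeats and the repeats that directly abut one another. This is a minimal Python sketch; the helper names `span_length` and `adjacent_pairs` are hypothetical, and the coordinates and precursor length are copied from this entry.

```python
# TPR repeat coordinates (1-based, inclusive) copied from the repeat table.
TPR_REPEATS = [
    (62, 95), (112, 145), (269, 302), (303, 336), (351, 384),
    (385, 418), (461, 494), (603, 636), (710, 744),
]
PRECURSOR_LENGTH = 1157  # from the Length field of this entry


def span_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive interval."""
    return end - start + 1


def adjacent_pairs(repeats):
    """Return repeat numbers (i, i + 1) where one repeat ends right before the next starts."""
    pairs = []
    for i in range(len(repeats) - 1):
        if repeats[i + 1][0] == repeats[i][1] + 1:
            pairs.append((i + 1, i + 2))
    return pairs


lengths = [span_length(start, end) for start, end in TPR_REPEATS]
# The repeats do not overlap, so the covered residues are the sum of lengths.
covered = sum(lengths)

print(lengths)
print(covered, round(100 * covered / PRECURSOR_LENGTH, 1))
print(adjacent_pairs(TPR_REPEATS))
```

The recomputed lengths agree with the Length column (eight repeats of 34 residues and one of 35), and TPR 3/TPR 4 and TPR 5/TPR 6 are back-to-back.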

Sequence similarities

Belongs to the AcsC/BcsC family. (Curated)
Contains 9 TPR repeats. (PROSITE-ProRule annotation)

Keywords - Domain

Repeat, Signal, TPR repeat

Phylogenomic databases

eggNOG: ENOG4105EJG. Bacteria.
COG0457. LUCA.
HOGENOM: HOG000125935.
InParanoid: P37650.
OrthoDB: EOG6PS5QS.

Family and domain databases

Gene3D: 1.25.40.10. 4 hits.
InterPro: IPR008410. BCSC_C.
IPR013026. TPR-contain_dom.
IPR011990. TPR-like_helical_dom.
IPR019734. TPR_repeat.
Pfam: PF05420. BCSC_C. 1 hit.
SMART: SM00028. TPR. 6 hits.
SUPFAM: SSF48452. SSF48452. 6 hits.
PROSITE: PS50005. TPR. 6 hits.
PS50293. TPR_REGION. 2 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P37650-1 [UniParc]

        10         20         30         40         50
MRKFTLNIFT LSLGLAVMPM VEAAPTAQQQ LLEQVRLGEA THREDLVQQS
        60         70         80         90        100
LYRLELIDPN NPDVVAARFR SLLRQGDIDG AQKQLDRLSQ LAPSSNAYKS
       110        120        130        140        150
SRTTMLLSTP DGRQALQQAR LQATTGHAEE AVASYNKLFN GAPPEGDIAV
       160        170        180        190        200
EYWSTVAKIP ARRGEAINQL KRINADAPGN TGLQNNLALL LFSSDRRDEG
       210        220        230        240        250
FAVLEQMAKS NAGREGASKI WYGQIKDMPV SDASVSALKK YLSIFSDGDS
       260        270        280        290        300
VAAAQSQLAE QQKQLADPAF RARAQGLAAV DSGMAGKAIP ELQQAVRANP
       310        320        330        340        350
KDSEALGALG QAYSQKGDRA NAVANLEKAL ALDPHSSNND KWNSLLKVNR
       360        370        380        390        400
YWLAIQQGDA ALKANNPDRA ERLFQQARNV DNTDSYAVLG LGDVAMARKD
       410        420        430        440        450
YPAAERYYQQ TLRMDSGNTN AVRGLANIYR QQSPEKAEAF IASLSASQRR
       460        470        480        490        500
SIDDIERSLQ NDRLAQQAEA LENQGKWAQA AALQRQRLAL DPGSVWITYR
       510        520        530        540        550
LSQDLWQAGQ RSQADTLMRN LAQQKSNDPE QVYAYGLYLS GHDQDRAALA
       560        570        580        590        600
HINSLPRAQW NSNIQELVNR LQSDQVLETA NRLRESGKEA EAEAMLRQQP
       610        620        630        640        650
PSTRIDLTLA DWAQQRRDYT AARAAYQNVL TREPANADAI LGLTEVDIAA
       660        670        680        690        700
GDKAAARSQL AKLPATDNAS LNTQRRVALA QAQLGDTAAA QRTFNKLIPQ
       710        720        730        740        750
AKSQPPSMES AMVLRDGAKF EAQAGDPTQA LETYKDAMVA SGVTTTRPQD
       760        770        780        790        800
NDTFTRLTRN DEKDDWLKRG VRSDAADLYR QQDLNVTLEH DYWGSSGTGG
       810        820        830        840        850
YSDLKAHTTM LQVDAPYSDG RMFFRSDFVN MNVGSFSTNA DGKWDDNWGT
       860        870        880        890        900
CTLQDCSGNR SQSDSGASVA VGWRNDVWSW DIGTTPMGFN VVDVVGGISY
       910        920        930        940        950
SDDIGPLGYT VNAHRRPISS SLLAFGGQKD SPSNTGKKWG GVRADGVGLS
       960        970        980        990       1000
LSYDKGEANG VWASLSGDQL TGKNVEDNWR VRWMTGYYYK VINQNNRRVT
      1010       1020       1030       1040       1050
IGLNNMIWHY DKDLSGYSLG QGGYYSPQEY LSFAIPVMWR ERTENWSWEL
      1060       1070       1080       1090       1100
GASGSWSHSR TKTMPRYPLM NLIPTDWQEE AARQSNDGGS SQGFGYTARA
      1110       1120       1130       1140       1150
LLERRVTSNW FVGTAIDIQQ AKDYAPSHFL LYVRYSAAGW QGDMDLPPQP

LIPYADW
Length: 1,157
Mass (Da): 127,724
Last modified: November 24, 2009 - v3
Checksum: E99ADBAF20C0DC20
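The sequence display interleaves position rulers with rows of residues grouped in blocks of ten. To recover a plain residue string (for example, to check it against the Length field), the ruler rows and spaces have to be dropped. The following is a minimal Python sketch; `parse_sequence_display` is a hypothetical helper, and the sample holds only the first two rows of the display, so it illustrates the parsing rather than reproducing the full entry.

```python
def parse_sequence_display(block: str) -> str:
    """Join residue rows from a ruler-annotated display into one string."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler rows, which contain only position numbers.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the displayed sequence (residues 1-100).
sample = """
        10         20         30         40         50
MRKFTLNIFT LSLGLAVMPM VEAAPTAQQQ LLEQVRLGEA THREDLVQQS
        60         70         80         90        100
LYRLELIDPN NPDVVAARFR SLLRQGDIDG AQKQLDRLSQ LAPSSNAYKS
"""

sequence = parse_sequence_display(sample)
print(len(sequence))
print(sequence[:23])
```

Because ruler rows are recognised by content rather than by spacing, the same function works whether or not the rulers are aligned with the residue columns. Applied to the whole display, the result should have the 1,157 residues stated in the Length field.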

Sequence caution

The sequence AAB18507.1 differs from that shown. Reason: Frameshift at position 11. (Curated)

Experimental Info

Feature key        Position(s)  Length  Description
Sequence conflict  594 – 595    2       AM → V in AAB18507 (PubMed:8041620). (Curated)
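To make the sequence conflict concrete, the sketch below applies the reported difference (AM → V at positions 594-595) to the row of the displayed sequence that contains those positions. It is a minimal Python illustration: `apply_conflict` is a hypothetical helper, the 50-residue segment is copied from the row covering residues 551-600, and the result shows only this local difference, not the separately reported frameshift in AAB18507.1.

```python
# Residues 551-600 of the displayed sequence, copied from this entry.
SEGMENT_START = 551
SEGMENT = "HINSLPRAQWNSNIQELVNRLQSDQVLETANRLRESGKEAEAEAMLRQQP"


def apply_conflict(segment, segment_start, first, last, expected, replacement):
    """Replace residues first..last (1-based, inclusive, full-sequence numbering)."""
    lo = first - segment_start      # 0-based start within the segment
    hi = last - segment_start + 1   # 0-based, end-exclusive
    if lo < 0 or hi > len(segment):
        raise ValueError("conflict lies outside the segment")
    # Guard against off-by-one errors: the displayed residues must match.
    if segment[lo:hi] != expected:
        raise ValueError(f"expected {expected!r}, found {segment[lo:hi]!r}")
    return segment[:lo] + replacement + segment[hi:]


variant = apply_conflict(SEGMENT, SEGMENT_START, 594, 595, "AM", "V")
print(SEGMENT[40:])   # displayed residues 591-600
print(variant[40:])   # the same region with the conflict applied
```

The check on `expected` confirms that residues 594-595 of the displayed sequence are indeed A and M before the substitution is made.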

Sequence databases

EMBL/GenBank/DDBJ:
U00039 Genomic DNA. Translation: AAB18507.1. Frameshift.
U00096 Genomic DNA. Translation: AAT48188.4.
AP009048 Genomic DNA. Translation: BAE77764.1.
RefSeq: WP_001225124.1. NZ_LN832404.1.
YP_026226.4. NC_000913.3.

Genome annotation databases

EnsemblBacteria: AAT48188; AAT48188; b3530.
BAE77764; BAE77764; BAE77764.
GeneID: 948047.
KEGG: ecj:JW5942.
eco:b3530.
PATRIC: 32122526. VBIEscCol129921_3641.

Cross-references

Organism-specific databases

EchoBASE: EB2166.
EcoGene: EG12257. bcsC.

Miscellaneous databases

PRO: P37650.

Publications

  1. "Analysis of the Escherichia coli genome. V. DNA sequence of the region from 76.0 to 81.5 minutes."
    Sofia H.J., Burland V., Daniels D.L., Plunkett G. III, Blattner F.R.
    Nucleic Acids Res. 22:2576-2586(1994)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / MG1655 / ATCC 47076.
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], SEQUENCE REVISION TO 594-595.
    Strain: K12 / MG1655 / ATCC 47076.
  3. "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
    Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
    Mol. Syst. Biol. 2:E1-E5(2006)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
  4. "The multicellular morphotypes of Salmonella typhimurium and Escherichia coli produce cellulose as the second component of the extracellular matrix."
    Zogaj X., Nimtz M., Rohde M., Bokranz W., Roemling U.
    Mol. Microbiol. 39:1452-1463(2001)
    Cited for: CHARACTERIZATION.
    Strain: ECOR 10, ECOR 12 and TOB1.

Entry information

Entry name: BCSC_ECOLI
Accession: Primary (citable) accession number: P37650
Secondary accession number(s): P76710, Q2M7J2
Entry history
Integrated into UniProtKB/Swiss-Prot: October 1, 1994
Last sequence update: November 24, 2009
Last modified: May 11, 2016
This is version 133 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

The genes bcsA, bcsB, bcsZ and bcsC are constitutively transcribed, but cellulose synthesis occurs only when AdrA, a putative transmembrane protein regulated by AgfD, is expressed. Cellulose production is abolished in E. coli K12.

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene
  2. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
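The UniRef90 and UniRef50 descriptions above each combine two thresholds: a minimum sequence identity and a minimum overlap with the seed sequence. The following Python sketch only illustrates how such a two-part criterion could be evaluated for one alignment; it is not UniRef's clustering procedure. The `Alignment` class, the `meets_cluster_criteria` function, the way identity and overlap are computed, and the example numbers are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Hypothetical summary of a pairwise alignment against a seed sequence."""
    identical_residues: int  # aligned positions with the same residue
    aligned_length: int      # number of aligned positions
    seed_length: int         # length of the seed (longest) sequence

    @property
    def identity(self) -> float:
        # Assumed definition: identical positions over aligned positions.
        return self.identical_residues / self.aligned_length

    @property
    def overlap(self) -> float:
        # Assumed definition: aligned positions over the seed length.
        return self.aligned_length / self.seed_length


# Identity thresholds quoted in the text; overlap is 80% at both levels.
IDENTITY_THRESHOLDS = {"UniRef90": 0.90, "UniRef50": 0.50}
MIN_OVERLAP = 0.80


def meets_cluster_criteria(alignment: Alignment, level: str) -> bool:
    """True if both the identity and the overlap threshold are met."""
    return (alignment.identity >= IDENTITY_THRESHOLDS[level]
            and alignment.overlap >= MIN_OVERLAP)


# Made-up alignments against a seed of 1157 residues.
diverged = Alignment(identical_residues=700, aligned_length=1000, seed_length=1157)
fragment = Alignment(identical_residues=490, aligned_length=500, seed_length=1157)

for name, aln in [("diverged", diverged), ("fragment", fragment)]:
    print(name, meets_cluster_criteria(aln, "UniRef90"),
          meets_cluster_criteria(aln, "UniRef50"))
```

The second example shows why both conditions matter: a short fragment can be nearly identical to the seed and still fail the overlap requirement.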