Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Major curlin subunit

Gene

csgA

Organism
Escherichia coli (strain K12)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at protein leveli

Functioni

Curlin is the structural subunit of the curli fimbriae. Curli are coiled surface structures that assemble preferentially at growth temperatures below 37 degrees Celsius. Curli can bind to fibronectin.

GO - Biological processi

  • amyloid fibril formation Source: EcoCyc
  • cell adhesion Source: EcoCyc
  • single-species biofilm formation Source: EcoCyc
Complete GO annotation...

Enzyme and pathway databases

BioCyciEcoCyc:EG11489-MONOMER.
ECOL316407:JW1025-MONOMER.

Names & Taxonomyi

Protein namesi
Recommended name:
Major curlin subunit
Gene namesi
Name:csgA
Ordered Locus Names:b1042, JW1025
OrganismiEscherichia coli (strain K12)
Taxonomic identifieri83333 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia
Proteomesi
  • UP000000318 Componenti: Chromosome
  • UP000000625 Componenti: Chromosome

Organism-specific databases

EcoGeneiEG11489. csgA.

Subcellular locationi

  • Fimbrium

  • Note: Part of the curli surface structure.

GO - Cellular componenti

  • pilus Source: EcoCyc
Complete GO annotation...

Keywords - Cellular componenti

Fimbrium

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Signal peptidei1 – 20202 PublicationsAdd
BLAST
Chaini21 – 151131Major curlin subunitPRO_0000006369Add
BLAST

Proteomic databases

PaxDbiP28307.

Expressioni

Inductioni

Under control of the CsgD transcription factor, part of the csgBAC/ymdA operon.1 Publication

Interactioni

Protein-protein interaction databases

BioGridi4260065. 16 interactions.
DIPiDIP-9325N.
STRINGi511145.b1042.

Structurei

3D structure databases

ProteinModelPortaliP28307.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

Belongs to the CsgA/CsgB family.Curated

Keywords - Domaini

Signal

Phylogenomic databases

eggNOGiENOG4105G76. Bacteria.
ENOG4111QFZ. LUCA.
HOGENOMiHOG000118951.
InParanoidiP28307.
KOiK04334.
OMAiNSEIQIY.
PhylomeDBiP28307.

Family and domain databases

InterProiIPR009742. Curlin_rpt.
[Graphical view]
PfamiPF07012. Curlin_rpt. 3 hits.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P28307-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MKLLKVAAIA AIVFSGSALA GVVPQYGGGG NHGGGGNNSG PNSELNIYQY
60 70 80 90 100
GGGNSALALQ TDARNSDLTI TQHGGGNGAD VGQGSDDSSI DLTQRGFGNS
110 120 130 140 150
ATLDQWNGKN SEMTVKQFGG GNGAAVDQTA SNSSVNVTQV GFGNNATAHQ

Y
Length:151
Mass (Da):15,049
Last modified:October 1, 1996 - v3
Checksum:iC003470D208D395F
GO

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti7 – 71A → E in AAA23616 (PubMed:8459772).Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L04979 Genomic DNA. Translation: AAA23616.1.
X90754 Genomic DNA. Translation: CAA62282.1.
U00096 Genomic DNA. Translation: AAC74126.1.
AP009048 Genomic DNA. Translation: BAA35832.1.
PIRiS70788.
RefSeqiNP_415560.1. NC_000913.3.
WP_000771437.1. NZ_LN832404.1.

Genome annotation databases

EnsemblBacteriaiAAC74126; AAC74126; b1042.
BAA35832; BAA35832; BAA35832.
GeneIDi949055.
KEGGiecj:JW1025.
eco:b1042.
PATRICi32117321. VBIEscCol129921_1083.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L04979 Genomic DNA. Translation: AAA23616.1.
X90754 Genomic DNA. Translation: CAA62282.1.
U00096 Genomic DNA. Translation: AAC74126.1.
AP009048 Genomic DNA. Translation: BAA35832.1.
PIRiS70788.
RefSeqiNP_415560.1. NC_000913.3.
WP_000771437.1. NZ_LN832404.1.

3D structure databases

ProteinModelPortaliP28307.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi4260065. 16 interactions.
DIPiDIP-9325N.
STRINGi511145.b1042.

Proteomic databases

PaxDbiP28307.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiAAC74126; AAC74126; b1042.
BAA35832; BAA35832; BAA35832.
GeneIDi949055.
KEGGiecj:JW1025.
eco:b1042.
PATRICi32117321. VBIEscCol129921_1083.

Organism-specific databases

EchoBASEiEB1452.
EcoGeneiEG11489. csgA.

Phylogenomic databases

eggNOGiENOG4105G76. Bacteria.
ENOG4111QFZ. LUCA.
HOGENOMiHOG000118951.
InParanoidiP28307.
KOiK04334.
OMAiNSEIQIY.
PhylomeDBiP28307.

Enzyme and pathway databases

BioCyciEcoCyc:EG11489-MONOMER.
ECOL316407:JW1025-MONOMER.

Miscellaneous databases

PROiP28307.

Family and domain databases

InterProiIPR009742. Curlin_rpt.
[Graphical view]
PfamiPF07012. Curlin_rpt. 3 hits.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiCSGA_ECOLI
AccessioniPrimary (citable) accession number: P28307
Entry historyi
Integrated into UniProtKB/Swiss-Prot: December 1, 1992
Last sequence update: October 1, 1996
Last modified: September 7, 2016
This is version 110 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.