Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Uncharacterized protein YfaL

Gene

yfaL

Organism
Escherichia coli (strain K12)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Protein inferred from homologyi

Functioni

GO - Biological processi

  • cell adhesion involved in biofilm formation Source: EcoCyc
Complete GO annotation...

Enzyme and pathway databases

BioCyciEcoCyc:EG12850-MONOMER.
ECOL316407:JW2227-MONOMER.

Protein family/group databases

MEROPSiU69.A11.
TCDBi1.B.12.1.5. the autotransporter-1 (at-1) family.

Names & Taxonomyi

Protein namesi
Recommended name:
Uncharacterized protein YfaL
Gene namesi
Name:yfaL
Synonyms:yfaF, yfaJ, yfaK
Ordered Locus Names:b2233, JW2227
OrganismiEscherichia coli (strain K12)
Taxonomic identifieri83333 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaGammaproteobacteriaEnterobacteralesEnterobacteriaceaeEscherichia
Proteomesi
  • UP000000318 Componenti: Chromosome
  • UP000000625 Componenti: Chromosome

Organism-specific databases

EcoGeneiEG12850. yfaL.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 23Sequence analysisAdd BLAST23
ChainiPRO_000000271324 – 1250Uncharacterized protein YfaLAdd BLAST1227

Proteomic databases

PaxDbiP45508.
PRIDEiP45508.

Interactioni

Protein-protein interaction databases

BioGridi4262133. 267 interactors.
IntActiP45508. 6 interactors.
STRINGi511145.b2233.

Structurei

3D structure databases

ProteinModelPortaliP45508.
SMRiP45508.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati919 – 92012
Repeati921 – 9222; approximate2
Repeati923 – 92432
Repeati925 – 9264; approximate2
Repeati927 – 92852
Repeati929 – 93062
Repeati931 – 93272
Repeati933 – 93482
Repeati935 – 93692
Repeati937 – 938102
Repeati939 – 940112
Repeati941 – 942122
Repeati943 – 944132
Repeati945 – 946142
Repeati947 – 948152
Domaini980 – 1250AutotransporterPROSITE-ProRule annotationAdd BLAST271

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni919 – 94815 X 2 AA approximate tandem repeats of [DTPE]-PAdd BLAST30

Sequence similaritiesi

Contains 1 autotransporter (TC 1.B.12) domain. [View classification]PROSITE-ProRule annotation

Keywords - Domaini

Repeat, Signal

Phylogenomic databases

eggNOGiENOG4107GHC. Bacteria.
COG3468. LUCA.
HOGENOMiHOG000122378.
KOiK07279.
OMAiIKASCQA.
PhylomeDBiP45508.

Family and domain databases

Gene3Di2.160.20.20. 1 hit.
2.40.128.130. 1 hit.
InterProiIPR005546. Autotransporte_beta.
IPR013425. Autotrns_rpt.
IPR006315. OM_autotransptr_brl.
IPR012332. P22_tailspike_C-like.
IPR011050. Pectin_lyase_fold/virulence.
IPR003368. POMP_repeat.
[Graphical view]
PfamiPF03797. Autotransporter. 1 hit.
PF02415. Chlam_PMP. 3 hits.
PF12951. PATR. 3 hits.
[Graphical view]
SMARTiSM00869. Autotransporter. 1 hit.
[Graphical view]
SUPFAMiSSF103515. SSF103515. 1 hit.
SSF51126. SSF51126. 2 hits.
TIGRFAMsiTIGR01414. autotrans_barl. 1 hit.
TIGR02601. autotrns_rpt. 1 hit.
TIGR01376. POMP_repeat. 2 hits.
PROSITEiPS51208. AUTOTRANSPORTER. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P45508-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MRIIFLRKEY LSLLPSMIAS LFSANGVAAV TDSCQGYDVK ASCQASRQSL
60 70 80 90 100
SGITQDWSIA DGQWLVFSDM TNNASGGAVF LQQGAEFSLL PENETGMTLF
110 120 130 140 150
ANNTVTGEYN NGGAIFAKEN STLNLTDVIF SGNVAGGYGG AIYSSGTNDT
160 170 180 190 200
GAVDLRVTNA MFRNNIANDG KGGAIYTINN DVYLSDVIFD NNQAYTSTSY
210 220 230 240 250
SDGDGGAIDV TDNNSDSKHP SGYTIVNNTA FTNNTAEGYG GAIYTNSVTA
260 270 280 290 300
PYLIDISVDD SYSQNGGVLV DENNSAAGYG DGPSSAAGGF MYLGLSEVTF
310 320 330 340 350
DIADGKTLVI GNTENDGAVD SIAGTGLITK TGSGDLVLNA DNNDFTGEMQ
360 370 380 390 400
IENGEVTLGR SNSLMNVGDT HCQDDPQDCY GLTIGSIDQY QNQAELNVGS
410 420 430 440 450
TQQTFVHALT GFQNGTLNID AGGNVTVNQG SFAGIIEGAG QLTIAQNGSY
460 470 480 490 500
VLAGAQSMAL TGDIVVDDGA VLSLEGDAAD LTALQDDPQS IVLNGGVLDL
510 520 530 540 550
SDFSTWQSGT SYNDGLEVSG SSGTVIGSQD VVDLAGGDNL HIGGDGKDGV
560 570 580 590 600
YVVVDASDGQ VSLANNNSYL GTTQIASGTL MVSDNSQLGD THYNRQVIFT
610 620 630 640 650
DKQQESVMEI TSDVDTRSDA AGHGRDIEMR ADGEVAVDAG VDTQWGALMA
660 670 680 690 700
DSSGQHQDEG STLTKTGAGT LELTASGTTQ SAVRVEEGTL KGDVADILPY
710 720 730 740 750
ASSLWVGDGA TFVTGADQDI QSIDAISSGT IDISDGTVLR LTGQDTSVAL
760 770 780 790 800
NASLFNGDGT LVNATDGVTL TGELNTNLET DSLTYLSNVT VNGNLTNTSG
810 820 830 840 850
AVSLQNGVAG DTLTVNGDYT GGGTLLLDSE LNGDDSVSDQ LVMNGNTAGN
860 870 880 890 900
TTVVVNSITG IGEPTSTGIK VVDFAADPTQ FQNNAQFSLA GSGYVNMGAY
910 920 930 940 950
DYTLVEDNND WYLRSQEVTP PSPPDPDPTP DPDPTPDPDP TPDPEPTPAY
960 970 980 990 1000
QPVLNAKVGG YLNNLRAANQ AFMMERRDHA GGDGQTLNLR VIGGDYHYTA
1010 1020 1030 1040 1050
AGQLAQHEDT STVQLSGDLF SGRWGTDGEW MLGIVGGYSD NQGDSRSNMT
1060 1070 1080 1090 1100
GTRADNQNHG YAVGLTSSWF QHGNQKQGAW LDSWLQYAWF SNDVSEQEDG
1110 1120 1130 1140 1150
TDHYHSSGII ASLEAGYQWL PGRGVVIEPQ AQVIYQGVQQ DDFTAANRAR
1160 1170 1180 1190 1200
VSQSQGDDIQ TRLGLHSEWR TAVHVIPTLD LNYYHDPHST EIEEDGSTIS
1210 1220 1230 1240 1250
DDAVKQRGEI KVGVTGNISQ RVSLRGSVAW QKGSDDFAQT AGFLSMTVKW
Length:1,250
Mass (Da):131,153
Last modified:November 1, 1997 - v2
Checksum:i17F98C05E299FC95
GO

Sequence cautioni

The sequence K02672 differs from that shown. Reason: Frameshift at several positions.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti28 – 30AAV → RGRS in K02672 (PubMed:6087316).Curated3
Sequence conflicti40K → Q in K02672 (PubMed:6087316).Curated1
Sequence conflicti65 – 66LV → PG in K02672 (PubMed:6087316).Curated2
Sequence conflicti431S → Q in K02672 (PubMed:6087316).Curated1
Sequence conflicti433 – 434AG → SA in K02672 (PubMed:6087316).Curated2
Sequence conflicti478A → R in K02672 (PubMed:6087316).Curated1
Sequence conflicti773E → S in K02672 (PubMed:6087316).Curated1
Sequence conflicti853V → M in K02672 (PubMed:6087316).Curated1
Sequence conflicti923 – 924PP → AT in K02672 (PubMed:6087316).Curated2
Sequence conflicti948 – 994PAYQP…RVIGG → LLTSRC in AAA74094 (Ref. 5) CuratedAdd BLAST47

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U00096 Genomic DNA. Translation: AAC75293.1.
AP009048 Genomic DNA. Translation: BAA16050.2.
K02672 Genomic DNA. No translation available.
U30459 Genomic DNA. Translation: AAA74094.1.
Y00544 Genomic DNA. No translation available.
PIRiG64993.
RefSeqiNP_416736.1. NC_000913.3.
WP_001220077.1. NZ_LN832404.1.

Genome annotation databases

EnsemblBacteriaiAAC75293; AAC75293; b2233.
BAA16050; BAA16050; BAA16050.
GeneIDi946595.
KEGGiecj:JW2227.
eco:b2233.
PATRICi32119825. VBIEscCol129921_2322.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U00096 Genomic DNA. Translation: AAC75293.1.
AP009048 Genomic DNA. Translation: BAA16050.2.
K02672 Genomic DNA. No translation available.
U30459 Genomic DNA. Translation: AAA74094.1.
Y00544 Genomic DNA. No translation available.
PIRiG64993.
RefSeqiNP_416736.1. NC_000913.3.
WP_001220077.1. NZ_LN832404.1.

3D structure databases

ProteinModelPortaliP45508.
SMRiP45508.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi4262133. 267 interactors.
IntActiP45508. 6 interactors.
STRINGi511145.b2233.

Protein family/group databases

MEROPSiU69.A11.
TCDBi1.B.12.1.5. the autotransporter-1 (at-1) family.

Proteomic databases

PaxDbiP45508.
PRIDEiP45508.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiAAC75293; AAC75293; b2233.
BAA16050; BAA16050; BAA16050.
GeneIDi946595.
KEGGiecj:JW2227.
eco:b2233.
PATRICi32119825. VBIEscCol129921_2322.

Organism-specific databases

EchoBASEiEB2695.
EcoGeneiEG12850. yfaL.

Phylogenomic databases

eggNOGiENOG4107GHC. Bacteria.
COG3468. LUCA.
HOGENOMiHOG000122378.
KOiK07279.
OMAiIKASCQA.
PhylomeDBiP45508.

Enzyme and pathway databases

BioCyciEcoCyc:EG12850-MONOMER.
ECOL316407:JW2227-MONOMER.

Miscellaneous databases

PROiP45508.

Family and domain databases

Gene3Di2.160.20.20. 1 hit.
2.40.128.130. 1 hit.
InterProiIPR005546. Autotransporte_beta.
IPR013425. Autotrns_rpt.
IPR006315. OM_autotransptr_brl.
IPR012332. P22_tailspike_C-like.
IPR011050. Pectin_lyase_fold/virulence.
IPR003368. POMP_repeat.
[Graphical view]
PfamiPF03797. Autotransporter. 1 hit.
PF02415. Chlam_PMP. 3 hits.
PF12951. PATR. 3 hits.
[Graphical view]
SMARTiSM00869. Autotransporter. 1 hit.
[Graphical view]
SUPFAMiSSF103515. SSF103515. 1 hit.
SSF51126. SSF51126. 2 hits.
TIGRFAMsiTIGR01414. autotrans_barl. 1 hit.
TIGR02601. autotrns_rpt. 1 hit.
TIGR01376. POMP_repeat. 2 hits.
PROSITEiPS51208. AUTOTRANSPORTER. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiYFAL_ECOLI
AccessioniPrimary (citable) accession number: P45508
Secondary accession number(s): P39441
, P45506, P45507, P76468, P77487
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 1, 1995
Last sequence update: November 1, 1997
Last modified: November 2, 2016
This is version 134 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.