Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Glucosyltransferase-I

Gene

gtfB

Organism
Streptococcus mutans serotype c (strain ATCC 700610 / UA159)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Protein inferred from homologyi

Functioni

Production of extracellular glucans, that are thought to play a key role in the development of the dental plaque because of their ability to adhere to smooth surfaces and mediate the aggregation of bacterial cells and food debris.

Catalytic activityi

Sucrose + ((1->6)-alpha-D-glucosyl)(n) = D-fructose + ((1->6)-alpha-D-glucosyl)(n+1).

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Glycosyltransferase, Transferase

Protein family/group databases

CAZyiGH70. Glycoside Hydrolase Family 70.

Names & Taxonomyi

Protein namesi
Recommended name:
Glucosyltransferase-I (EC:2.4.1.5)
Short name:
GTF-I
Alternative name(s):
Dextransucrase
Sucrose 6-glucosyltransferase
Gene namesi
Name:gtfB
Ordered Locus Names:SMU_1004
OrganismiStreptococcus mutans serotype c (strain ATCC 700610 / UA159)
Taxonomic identifieri210007 [NCBI]
Taxonomic lineageiBacteriaFirmicutesBacilliLactobacillalesStreptococcaceaeStreptococcus
Proteomesi
  • UP000002512 Componenti: Chromosome

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Secreted

Pathology & Biotechi

Keywords - Diseasei

Dental caries

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 34Sequence analysisAdd BLAST34
ChainiPRO_000002138535 – 1476Glucosyltransferase-IAdd BLAST1442

Interactioni

Protein-protein interaction databases

STRINGi210007.SMU_1004.

Structurei

3D structure databases

ProteinModelPortaliP08987.
SMRiP08987.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati159 – 178Cell wall-binding 1Add BLAST20
Repeati179 – 199Cell wall-binding 2Add BLAST21
Repeati1087 – 1106Cell wall-binding 3Add BLAST20
Repeati1107 – 1126Cell wall-binding 4Add BLAST20
Repeati1170 – 1189Cell wall-binding 5Add BLAST20
Repeati1214 – 1234Cell wall-binding 6Add BLAST21
Repeati1235 – 1254Cell wall-binding 7Add BLAST20
Repeati1279 – 1299Cell wall-binding 8Add BLAST21
Repeati1300 – 1319Cell wall-binding 9Add BLAST20
Repeati1344 – 1364Cell wall-binding 10Add BLAST21
Repeati1365 – 1384Cell wall-binding 11Add BLAST20
Repeati1409 – 1429Cell wall-binding 12Add BLAST21
Repeati1430 – 1449Cell wall-binding 13Add BLAST20

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni200 – 1051Catalytic; approximateAdd BLAST852

Sequence similaritiesi

Belongs to the glycosyl hydrolase 70 family.Curated
Contains 13 cell wall-binding repeats.PROSITE-ProRule annotation

Keywords - Domaini

Repeat, Signal

Phylogenomic databases

eggNOGiENOG4108N67. Bacteria.
COG5263. LUCA.
KOiK00689.
OMAiANGVQVK.
PhylomeDBiP08987.

Family and domain databases

Gene3Di3.20.20.80. 3 hits.
InterProiIPR018337. Cell_wall/Cho-bd_repeat.
IPR027636. Glucan-bd_rpt.
IPR003318. Glyco_hydro70cat.
IPR013781. Glyco_hydro_catalytic_dom.
IPR017853. Glycoside_hydrolase_SF.
IPR022263. KxYKxGKxW.
[Graphical view]
PfamiPF01473. CW_binding_1. 4 hits.
PF02324. Glyco_hydro_70. 1 hit.
[Graphical view]
SUPFAMiSSF51445. SSF51445. 4 hits.
TIGRFAMsiTIGR04035. glucan_65_rpt. 6 hits.
TIGR03715. KxYKxGKxW. 1 hit.
PROSITEiPS51170. CW. 13 hits.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P08987-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MDKKVRYKLR KVKKRWVTVS VASAVMTLTT LSGGLVKADS NESKSQISND
60 70 80 90 100
SNTSVVTANE ESNVTTEVTS KQEAASSQTN HTVTTISSST SVVNPKEVVS
110 120 130 140 150
NPYTVGETAS NGEKLQNQTT TVDKTSEAAA NNISKQTTEA DTDVIDDSNA
160 170 180 190 200
ANLQILEKLP NVKEIDGKYY YYDNNGKVRT NFTLIADGKI LHFDETGAYT
210 220 230 240 250
DTSIDTVNKD IVTTRSNLYK KYNQVYDRSA QSFEHVDHYL TAESWYRPKY
260 270 280 290 300
ILKDGKTWTQ STEKDFRPLL MTWWPSQETQ RQYVNYMNAQ LGINKTYDDT
310 320 330 340 350
SNQLQLNIAA ATIQAKIEAK ITTLKNTDWL RQTISAFVKT QSAWNSDSEK
360 370 380 390 400
PFDDHLQNGA VLYDNEGKLT PYANSNYRIL NRTPTNQTGK KDPRYTADNT
410 420 430 440 450
IGGYEFLLAN DVDNSNPVVQ AEQLNWLHFL MNFGNIYAND PDANFDSIRV
460 470 480 490 500
DAVDNVDADL LQIAGDYLKA AKGIHKNDKA ANDHLSILEA WSDNDTPYLH
510 520 530 540 550
DDGDNMINMD NKLRLSLLFS LAKPLNQRSG MNPLITNSLV NRTDDNAETA
560 570 580 590 600
AVPSYSFIRA HDSEVQDLIR DIIKAEINPN VVGYSFTMEE IKKAFEIYNK
610 620 630 640 650
DLLATEKKYT HYNTALSYAL LLTNKSSVPR VYYGDMFTDD GQYMAHKTIN
660 670 680 690 700
YEAIETLLKA RIKYVSGGQA MRNQQVGNSE IITSVRYGKG ALKATDTGDR
710 720 730 740 750
TTRTSGVAVI EGNNPSLRLK ASDRVVVNMG AAHKNQAYRP LLLTTDNGIK
760 770 780 790 800
AYHSDQEAAG LVRYTNDRGE LIFTAADIKG YANPQVSGYL GVWVPVGAAA
810 820 830 840 850
DQDVRVAAST APSTDGKSVH QNAALDSRVM FEGFSNFQAF ATKKEEYTNV
860 870 880 890 900
VIAKNVDKFA EWGVTDFEMA PQYVSSTDGS FLDSVIQNGY AFTDRYDLGI
910 920 930 940 950
SKPNKYGTAD DLVKAIKALH SKGIKVMADW VPDQMYAFPE KEVVTATRVD
960 970 980 990 1000
KFGKPVEGSQ IKSVLYVADS KSSGKDQQAK YGGAFLEELQ AKYPELFARK
1010 1020 1030 1040 1050
QISTGVPMDP SVKIKQWSAK YFNGTNILGR GAGYVLKDQA TNTYFNISDN
1060 1070 1080 1090 1100
KEINFLPKTL LNQDSQVGFS YDGKGYVYYS TSGYQAKNTF ISEGDKWYYF
1110 1120 1130 1140 1150
DNNGYMVTGA QSINGVNYYF LSNGLQLRDA ILKNEDGTYA YYGNDGRRYE
1160 1170 1180 1190 1200
NGYYQFMSGV WRHFNNGEMS VGLTVIDGQV QYFDEMGYQA KGKFVTTADG
1210 1220 1230 1240 1250
KIRYFDKQSG NMYRNRFIEN EEGKWLYLGE DGAAVTGSQT INGQHLYFRA
1260 1270 1280 1290 1300
NGVQVKGEFV TDRYGRISYY DSNSGDQIRN RFVRNAQGQW FYFDNNGYAV
1310 1320 1330 1340 1350
TGARTINGQH LYFRANGVQV KGEFVTDRHG RISYYDGNSG DQIRNRFVRN
1360 1370 1380 1390 1400
AQGQWFYFDN NGYAVTGART INGQHLYFRA NGVQVKGEFV TDRYGRISYY
1410 1420 1430 1440 1450
DSNSGDQIRN RFVRNAQGQW FYFDNNGYAV TGARTINGQH LYFRANGVQV
1460 1470
KGEFVTDRYG RISYYDANSG ERVRIN
Length:1,476
Mass (Da):165,847
Last modified:December 6, 2002 - v3
Checksum:i9C6E09F731B4CBCF
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti570R → A in AAA88588 (PubMed:3040685).Curated1
Sequence conflicti800 – 817ADQDV…STDGK → LIKMFALRLARPHQQMA in AAA88588 (PubMed:3040685).CuratedAdd BLAST18
Sequence conflicti1310H → L in AAA88588 (PubMed:3040685).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural varianti62S → T in strain: MT4239. 1
Natural varianti65T → I in strain: GS-5. 1
Natural varianti68V → A in strain: GS-5, MT4245, MT4251, MT4467 and MT8148. 1
Natural varianti78Q → P in strain: MT4251. 1
Natural varianti86I → S in strain: GS-5, MT4245, MT4251, MT4467 and MT8148. 1
Natural varianti89S → F in strain: MT4251. 1
Natural varianti168K → N in strain: MT4251. 1
Natural varianti276S → D in strain: GS-5, MT4467 and MT8148. 1
Natural varianti399N → R in strain: MT4239. 1
Natural varianti474I → T in strain: MT4239. 1
Natural varianti512K → R in strain: MT8148. 1
Natural varianti519F → Y in strain: MT8148. 1
Natural varianti701T → I in strain: MT8148. 1
Natural varianti708A → V in strain: MT8148. 1
Natural varianti938F → L in strain: MT8148. 1
Natural varianti952 – 957FGKPVE → YGTPVA in strain: GS-5, MT4239 and MT4467. 6
Natural varianti963 – 964SV → NT in strain: GS-5, MT4239 and MT4467. 2
Natural varianti968 – 970ADS → VDG in strain: GS-5, MT4239 and MT4467. 3
Natural varianti1086A → T in strain: MT4239. 1
Natural varianti1158S → N in strain: MT4239. 1
Natural varianti1163H → Y in strain: MT4251. 1
Natural varianti1168E → K in strain: MT8148. 1
Natural varianti1182Y → C in strain: MT8148. 1
Natural varianti1234A → P in strain: MT4239. 1
Natural varianti1263R → H in strain: GS-5 and MT4467. 1
Natural varianti1263R → P in strain: MT8148. 1
Natural varianti1264Y → H in strain: GS-5, MT4239, MT4467 and MT8148. 1
Natural varianti1272S → G in strain: GS-5, MT4239, MT4467 and MT8148. 1
Natural varianti1329H → Y in strain: GS-5 and MT4467. 1
Natural varianti1394Y → H in strain: GS-5, MT4239, MT4467 and MT8148. 1
Natural varianti1402S → G in strain: GS-5, MT4239, MT4467 and MT8148. 1
Natural varianti1459Y → H in strain: MT4467. 1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M17361 Genomic DNA. Translation: AAA88588.1.
D88651 Genomic DNA. Translation: BAA26101.1.
D88654 Genomic DNA. Translation: BAA26105.1.
D88657 Genomic DNA. Translation: BAA26109.1.
D88660 Genomic DNA. Translation: BAA26113.1.
D89977 Genomic DNA. Translation: BAA26119.1.
AE014133 Genomic DNA. Translation: AAN58705.1.
PIRiB33135.
RefSeqiNP_721399.1. NC_004350.2.
WP_002352268.1. NC_004350.2.

Genome annotation databases

EnsemblBacteriaiAAN58705; AAN58705; SMU_1004.
GeneIDi1028336.
KEGGismu:SMU_1004.
PATRICi19664129. VBIStrMut61772_0897.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M17361 Genomic DNA. Translation: AAA88588.1.
D88651 Genomic DNA. Translation: BAA26101.1.
D88654 Genomic DNA. Translation: BAA26105.1.
D88657 Genomic DNA. Translation: BAA26109.1.
D88660 Genomic DNA. Translation: BAA26113.1.
D89977 Genomic DNA. Translation: BAA26119.1.
AE014133 Genomic DNA. Translation: AAN58705.1.
PIRiB33135.
RefSeqiNP_721399.1. NC_004350.2.
WP_002352268.1. NC_004350.2.

3D structure databases

ProteinModelPortaliP08987.
SMRiP08987.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi210007.SMU_1004.

Protein family/group databases

CAZyiGH70. Glycoside Hydrolase Family 70.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiAAN58705; AAN58705; SMU_1004.
GeneIDi1028336.
KEGGismu:SMU_1004.
PATRICi19664129. VBIStrMut61772_0897.

Phylogenomic databases

eggNOGiENOG4108N67. Bacteria.
COG5263. LUCA.
KOiK00689.
OMAiANGVQVK.
PhylomeDBiP08987.

Family and domain databases

Gene3Di3.20.20.80. 3 hits.
InterProiIPR018337. Cell_wall/Cho-bd_repeat.
IPR027636. Glucan-bd_rpt.
IPR003318. Glyco_hydro70cat.
IPR013781. Glyco_hydro_catalytic_dom.
IPR017853. Glycoside_hydrolase_SF.
IPR022263. KxYKxGKxW.
[Graphical view]
PfamiPF01473. CW_binding_1. 4 hits.
PF02324. Glyco_hydro_70. 1 hit.
[Graphical view]
SUPFAMiSSF51445. SSF51445. 4 hits.
TIGRFAMsiTIGR04035. glucan_65_rpt. 6 hits.
TIGR03715. KxYKxGKxW. 1 hit.
PROSITEiPS51170. CW. 13 hits.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiGTFB_STRMU
AccessioniPrimary (citable) accession number: P08987
Secondary accession number(s): O69381
, O69384, O69387, O69390, O69396
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 1, 1988
Last sequence update: December 6, 2002
Last modified: November 2, 2016
This is version 138 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Miscellaneous

GTF-I synthesizes water-insoluble glucans (alpha 1,3-linked glucose and some 1,6 linkages), GTF-S synthesizes water-soluble glucans (alpha 1,6-glucose). GTF-SI synthesizes both forms of glucans.

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.