Protein
Submitted name: Toxin A

Gene
tcdA

Organism
Peptoclostridium difficile (strain 630) (Clostridium difficile)

Status
Unreviewed - Annotation score: 1 out of 5 - Experimental evidence at protein level

Function

Keywords - Molecular function

Glycosyltransferase (Imported), Transferase

Enzyme and pathway databases

BioCyc: PDIF272563:G12WB-772-MONOMER.
BRENDA: 2.4.1.B62. 13625.

Protein family/group databases

CAZy: GT44. Glycosyltransferase Family 44.

Names & Taxonomy

Protein names
Submitted name: Toxin A (EC 2.4.1.-) [Imported]
Gene names
Name: tcdA [Imported]
Ordered Locus Names: CD630_06630 [Imported]
Organism: Peptoclostridium difficile (strain 630) (Clostridium difficile) [Imported]
Taxonomic identifier: 272563 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Clostridia > Clostridiales > Peptostreptococcaceae > Clostridioides
Proteomes
  • UP000001978 Component: Chromosome

Interaction

Protein-protein interaction databases

STRING: 272563.CD0663.

Structure

3D structure databases

PDB entry   Method   Resolution (Å)   Chain   Positions
3SRZ        X-ray    2.58             A       1-542
3SS1        X-ray    2.20             A       1-542

ProteinModelPortal: Q189K5.
SMR: Q189K5.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Description                            Length
Domain        569 – 776     Peptidase C80 (InterPro annotation)    208

Phylogenomic databases

eggNOG: ENOG4105SFK. Bacteria.
        ENOG4111U22. LUCA.
HOGENOM: HOG000122990.
KO: K11063.
OMA: GFEYFAP.

Family and domain databases

InterPro: IPR018337. Cell_wall/Cho-bd_repeat.
          IPR020974. CPD_dom.
          IPR029044. Nucleotide-diphossugar_trans.
          IPR024770. TcdA/TcdB_cat.
          IPR024772. TcdA/TcdB_N.
          IPR024769. TcdA/TcdB_pore_forming.
Pfam: PF01473. CW_binding_1. 13 hits.
      PF11713. Peptidase_C80. 1 hit.
      PF12919. TcdA_TcdB. 1 hit.
      PF12920. TcdA_TcdB_pore. 1 hit.
      PF12918. TcdB_N. 1 hit.
SUPFAM: SSF53448. SSF53448. 1 hit.
PROSITE: PS51771. CGT_MARTX_CPD. 1 hit.
         PS51170. CW. 32 hits.

Sequence

Sequence status: Complete.

Q189K5-1

        10         20         30         40         50
MSLISKEELI KLAYSIRPRE NEYKTILTNL DEYNKLTTNN NENKYLQLKK
60 70 80 90 100
LNESIDVFMN KYKTSSRNRA LSNLKKDILK EVILIKNSNT SPVEKNLHFV
110 120 130 140 150
WIGGEVSDIA LEYIKQWADI NAEYNIKLWY DSEAFLVNTL KKAIVESSTT
160 170 180 190 200
EALQLLEEEI QNPQFDNMKF YKKRMEFIYD RQKRFINYYK SQINKPTVPT
210 220 230 240 250
IDDIIKSHLV SEYNRDETVL ESYRTNSLRK INSNHGIDIR ANSLFTEQEL
260 270 280 290 300
LNIYSQELLN RGNLAAASDI VRLLALKNFG GVYLDVDMLP GIHSDLFKTI
310 320 330 340 350
SRPSSIGLDR WEMIKLEAIM KYKKYINNYT SENFDKLDQQ LKDNFKLIIE
360 370 380 390 400
SKSEKSEIFS KLENLNVSDL EIKIAFALGS VINQALISKQ GSYLTNLVIE
410 420 430 440 450
QVKNRYQFLN QHLNPAIESD NNFTDTTKIF HDSLFNSATA ENSMFLTKIA
460 470 480 490 500
PYLQVGFMPE ARSTISLSGP GAYASAYYDF INLQENTIEK TLKASDLIEF
510 520 530 540 550
KFPENNLSQL TEQEINSLWS FDQASAKYQF EKYVRDYTGG SLSEDNGVDF
560 570 580 590 600
NKNTALDKNY LLNNKIPSNN VEEAGSKNYV HYIIQLQGDD ISYEATCNLF
610 620 630 640 650
SKNPKNSIII QRNMNESAKS YFLSDDGESI LELNKYRIPE RLKNKEKVKV
660 670 680 690 700
TFIGHGKDEF NTSEFARLSV DSLSNEISSF LDTIKLDISP KNVEVNLLGC
710 720 730 740 750
NMFSYDFNVE ETYPGKLLLS IMDKITSTLP DVNKNSITIG ANQYEVRINS
760 770 780 790 800
EGRKELLAHS GKWINKEEAI MSDLSSKEYI FFDSIDNKLK AKSKNIPGLA
810 820 830 840 850
SISEDIKTLL LDASVSPDTK FILNNLKLNI ESSIGDYIYY EKLEPVKNII
860 870 880 890 900
HNSIDDLIDE FNLLENVSDE LYELKKLNNL DEKYLISFED ISKNNSTYSV
910 920 930 940 950
RFINKSNGES VYVETEKEIF SKYSEHITKE ISTIKNSIIT DVNGNLLDNI
960 970 980 990 1000
QLDHTSQVNT LNAAFFIQSL IDYSSNKDVL NDLSTSVKVQ LYAQLFSTGL
1010 1020 1030 1040 1050
NTIYDSIQLV NLISNAVNDT INVLPTITEG IPIVSTILDG INLGAAIKEL
1060 1070 1080 1090 1100
LDEHDPLLKK ELEAKVGVLA INMSLSIAAT VASIVGIGAE VTIFLLPIAG
1110 1120 1130 1140 1150
ISAGIPSLVN NELILHDKAT SVVNYFNHLS ESKKYGPLKT EDDKILVPID
1160 1170 1180 1190 1200
DLVISEIDFN NNSIKLGTCN ILAMEGGSGH TVTGNIDHFF SSPSISSHIP
1210 1220 1230 1240 1250
SLSIYSAIGI ETENLDFSKK IMMLPNAPSR VFWWETGAVP GLRSLENDGT
1260 1270 1280 1290 1300
RLLDSIRDLY PGKFYWRFYA FFDYAITTLK PVYEDTNIKI KLDKDTRNFI
1310 1320 1330 1340 1350
MPTITTNEIR NKLSYSFDGA GGTYSLLLSS YPISTNINLS KDDLWIFNID
1360 1370 1380 1390 1400
NEVREISIEN GTIKKGKLIK DVLSKIDINK NKLIIGNQTI DFSGDIDNKD
1410 1420 1430 1440 1450
RYIFLTCELD DKISLIIEIN LVAKSYSLLL SGDKNYLISN LSNIIEKINT
1460 1470 1480 1490 1500
LGLDSKNIAY NYTDESNNKY FGAISKTSQK SIIHYKKDSK NILEFYNDST
1510 1520 1530 1540 1550
LEFNSKDFIA EDINVFMKDD INTITGKYYV DNNTDKSIDF SISLVSKNQV
1560 1570 1580 1590 1600
KVNGLYLNES VYSSYLDFVK NSDGHHNTSN FMNLFLDNIS FWKLFGFENI
1610 1620 1630 1640 1650
NFVIDKYFTL VGKTNLGYVE FICDNNKNID IYFGEWKTSS SKSTIFSGNG
1660 1670 1680 1690 1700
RNVVVEPIYN PDTGEDISTS LDFSYEPLYG IDRYINKVLI APDLYTSLIN
1710 1720 1730 1740 1750
INTNYYSNEY YPEIIVLNPN TFHKKVNINL DSSSFEYKWS TEGSDFILVR
1760 1770 1780 1790 1800
YLEESNKKIL QKIRIKGILS NTQSFNKMSI DFKDIKKLSL GYIMSNFKSF
1810 1820 1830 1840 1850
NSENELDRDH LGFKIIDNKT YYYDEDSKLV KGLININNSL FYFDPIEFNL
1860 1870 1880 1890 1900
VTGWQTINGK KYYFDINTGA ALISYKIING KHFYFNNDGV MQLGVFKGPD
1910 1920 1930 1940 1950
GFEYFAPANT QNNNIEGQAI VYQSKFLTLN GKKYYFDNDS KAVTGWRIIN
1960 1970 1980 1990 2000
NEKYYFNPNN AIAAVGLQVI DNNKYYFNPD TAIISKGWQT VNGSRYYFDT
2010 2020 2030 2040 2050
DTAIAFNGYK TIDGKHFYFD SDCVVKIGVF STSNGFEYFA PANTYNNNIE
2060 2070 2080 2090 2100
GQAIVYQSKF LTLNGKKYYF DNNSKAVTGW QTIDSKKYYF NTNTAEAATG
2110 2120 2130 2140 2150
WQTIDGKKYY FNTNTAEAAT GWQTIDGKKY YFNTNTAIAS TGYTIINGKH
2160 2170 2180 2190 2200
FYFNTDGIMQ IGVFKGPNGF EYFAPANTDA NNIEGQAILY QNEFLTLNGK
2210 2220 2230 2240 2250
KYYFGSDSKA VTGWRIINNK KYYFNPNNAI AAIHLCTINN DKYYFSYDGI
2260 2270 2280 2290 2300
LQNGYITIER NNFYFDANNE SKMVTGVFKG PNGFEYFAPA NTHNNNIEGQ
2310 2320 2330 2340 2350
AIVYQNKFLT LNGKKYYFDN DSKAVTGWQT IDGKKYYFNL NTAEAATGWQ
2360 2370 2380 2390 2400
TIDGKKYYFN LNTAEAATGW QTIDGKKYYF NTNTFIASTG YTSINGKHFY
2410 2420 2430 2440 2450
FNTDGIMQIG VFKGPNGFEY FAPANTHNNN IEGQAILYQN KFLTLNGKKY
2460 2470 2480 2490 2500
YFGSDSKAVT GLRTIDGKKY YFNTNTAVAV TGWQTINGKK YYFNTNTSIA
2510 2520 2530 2540 2550
STGYTIISGK HFYFNTDGIM QIGVFKGPDG FEYFAPANTD ANNIEGQAIR
2560 2570 2580 2590 2600
YQNRFLYLHD NIYYFGNNSK AATGWVTIDG NRYYFEPNTA MGANGYKTID
2610 2620 2630 2640 2650
NKNFYFRNGL PQIGVFKGSN GFEYFAPANT DANNIEGQAI RYQNRFLHLL
2660 2670 2680 2690 2700
GKIYYFGNNS KAVTGWQTIN GKVYYFMPDT AMAAAGGLFE IDGVIYFFGV
2710
DGVKAPGIYG
Length: 2,710
Mass (Da): 308,219
Last modified: July 25, 2006 - v1
Checksum: 3838438DD59FF458
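
The sequence listing above interleaves residue lines with position rulers, so it cannot be pasted directly into a tool that expects a contiguous sequence. The following is a minimal Python sketch, assuming the block is available as plain text, that drops the ruler lines and the spaces between ten-residue groups. The function name `parse_sequence_block` is hypothetical, and the sample holds only the first two residue lines of this entry, not the full 2,710 residues.

```python
def parse_sequence_block(text: str) -> str:
    """Return the residues of a ruler-annotated sequence block as one string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and position rulers (lines made up only of numbers).
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        # Join the ten-residue groups of a sequence line.
        residues.append("".join(tokens))
    return "".join(residues)


# First 100 residues of Q189K5, copied from the listing above.
sample = """\
        10         20         30         40         50
MSLISKEELI KLAYSIRPRE NEYKTILTNL DEYNKLTTNN NENKYLQLKK
60 70 80 90 100
LNESIDVFMN KYKTSSRNRA LSNLKKDILK EVILIKNSNT SPVEKNLHFV
"""

seq = parse_sequence_block(sample)
print(len(seq))  # 100
```

Applied to the whole listing, the returned string's length can be compared with the stated Length of 2,710 as a quick check that no residue line was lost in copying.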

Sequence databases

EMBL/GenBank/DDBJ: AM180355 Genomic DNA. Translation: CAJ67494.1.
RefSeq: WP_011860904.1. NZ_CP010905.1.
        YP_001087137.1. NC_009089.1.

Genome annotation databases

EnsemblBacteria: CAJ67494; CAJ67494; CD630_06630.
GeneID: 4914076.
KEGG: cdf:CD630_06630.
      pdc:CDIF630_00776.
PATRIC: 19439607. VBICloDif38397_0697.

Entry information

Entry name: Q189K5_PEPD6
Accession: Primary (citable) accession number: Q189K5
Entry history
Integrated into UniProtKB/TrEMBL: July 25, 2006
Last sequence update: July 25, 2006
Last modified: November 2, 2016
This is version 67 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Keywords - Technical term

3D-structure (Combined sources), Complete proteome, Reference proteome (Imported)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (also known as the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
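
The identity and overlap thresholds described above can be illustrated with a small Python sketch. It is not UniRef's actual clustering pipeline: it only checks whether a precomputed pairwise alignment would satisfy the stated criteria. The `Alignment` class, the `joins_cluster` function, the reading of "overlap" as the fraction of the seed sequence covered by the alignment, and the example identity values are all assumptions made for illustration. Only the seed length (2,710) and the 542-residue span come from this entry.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    identity: float      # fraction of identical residues in the aligned region (0..1)
    aligned_length: int  # number of seed residues covered by the alignment
    seed_length: int     # length of the longest (seed) sequence


def joins_cluster(aln: Alignment, min_identity: float, min_overlap: float = 0.80) -> bool:
    """True if the alignment meets both the identity and the overlap threshold."""
    overlap = aln.aligned_length / aln.seed_length
    return aln.identity >= min_identity and overlap >= min_overlap


TOXIN_A_LENGTH = 2710  # length of Q189K5, taken from this entry

# Hypothetical alignments against Q189K5 as the seed sequence.
near_full_length = Alignment(identity=0.93, aligned_length=2600, seed_length=TOXIN_A_LENGTH)
n_terminal_fragment = Alignment(identity=0.99, aligned_length=542, seed_length=TOXIN_A_LENGTH)

print(joins_cluster(near_full_length, min_identity=0.90))     # True
print(joins_cluster(n_terminal_fragment, min_identity=0.90))  # False
print(joins_cluster(near_full_length, min_identity=0.50))     # True
```

Under these assumptions a short fragment fails the overlap test even when its identity is very high, because 542 of 2,710 residues covers only 20% of the seed.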