Protein
Submitted name:

Glycosyl hydrolase family protein

Gene

SZO_05310

Organism
Streptococcus equi subsp. zooepidemicus (strain H70)
Status
Unreviewed - Annotation score: 1 out of 5 - Protein predicted

Function

GO - Molecular function

  1. exo-alpha-sialidase activity (Source: InterPro)
  2. transferase activity, transferring acyl groups other than amino-acyl groups (Source: InterPro)

GO - Biological process

  1. carbohydrate metabolic process (Source: InterPro)

Keywords - Molecular function

Hydrolase (Imported)

Enzyme and pathway databases

BioCyc: SEQU40041:GC8B-608-MONOMER.

Names & Taxonomy

Protein names
Submitted name:
Glycosyl hydrolase family protein (Imported)
Gene names
Ordered Locus Names: SZO_05310 (Imported)
Organism: Streptococcus equi subsp. zooepidemicus (strain H70) (Imported)
Taxonomic identifier: 553483 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Bacilli > Lactobacillales > Streptococcaceae > Streptococcus
Proteomes: UP000001368: Chromosome

Subcellular location

GO - Cellular component

  1. extracellular region (Source: UniProtKB-KW)
  2. membrane (Source: InterPro)

Keywords - Cellular component

Secreted (SAAS annotation)

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description            Feature identifier
Signal peptide   1 – 42        42       Potential (Imported)
Chain            43 – 1546     1504     Potential (Imported)   PRO_5000453409
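
The Length values follow from the positions, which are 1-based and inclusive. As a minimal sketch (the helper name `feature_length` is hypothetical and not part of any UniProt tooling), the check below confirms that the signal peptide and the chain together cover the full 1,546-residue precursor:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based start and end positions."""
    if start < 1 or end < start:
        raise ValueError("positions must satisfy 1 <= start <= end")
    return end - start + 1


# Positions taken from the Molecule processing table in this entry.
signal_peptide_length = feature_length(1, 42)
chain_length = feature_length(43, 1546)

# The two features are adjacent, so their lengths should sum to the
# length of the unprocessed precursor.
precursor_length = signal_peptide_length + chain_length
print(signal_peptide_length, chain_length, precursor_length)
```

Because the chain starts at position 43, directly after the signal peptide ends at 42, there is no gap or overlap between the two features.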

Interaction

Protein-protein interaction databases

STRING: 553483.SZO_05310.

Family & Domains

Keywords - Domain

Signal (Imported)

Phylogenomic databases

eggNOG: COG2273.
HOGENOM: HOG000071801.
KO: K01186.
OMA: TEISHEQ.
OrthoDB: EOG61CM2G.

Family and domain databases

Gene3D:
  2.120.10.10. 2 hits.
  2.60.120.200. 1 hit.
  2.60.120.260. 1 hit.
InterPro:
  IPR013320. ConA-like_dom.
  IPR008979. Galactose-bd-like.
  IPR000757. Glyco_hydro_16.
  IPR004124. Glyco_hydro_33_N.
  IPR011040. Sialidases.
  IPR020610. Thiolase_AS.
  IPR005877. YSIRK_signal_dom.
Pfam:
  PF00722. Glyco_hydro_16. 1 hit.
  PF02973. Sialidase. 1 hit.
  PF04650. YSIRK_signal. 1 hit.
SUPFAM:
  SSF49785. SSF49785. 1 hit.
  SSF49899. SSF49899. 2 hits.
  SSF50939. SSF50939. 2 hits.
TIGRFAMs:
  TIGR01168. YSIRK_signal. 1 hit.
PROSITE:
  PS00099. THIOLASE_3. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

C0MCG2-1 [UniParc]

        10         20         30         40         50
MLFNPIGKEQ IRKYAIKKLS VGVASVCVGI GLAAGLPVVV YADDVSEVSL
        60         70         80         90        100
QQPTKAAVTE ISHEQEPQAT DTDSEALVVS EETTVDEAVV ANAVTSNAAD
       110        120        130        140        150
QTADEEKSNA PVELNKVANG GFDNDYVSNK NQWQYREGGH SVLTTENNNS
       160        170        180        190        200
YAEVTSGTLD EHILQKVSTT VGKTYTLEAD VKVEADTPHN GLYLTAKESN
       210        220        230        240        250
HNLQGPVIKE VSLTDTDGTW SHIKLSFTAT TSETFVGLVK RLEASSPETL
       260        270        280        290        300
AASASIDNVS VVEENDYELI WQDEFSGDQL NQENWGYELG SIRGNEQQHY
       310        320        330        340        350
TDSTENIYLE NGNLVLNVTE RKGEDRYANP RGGTSARQVI YDSGSVRTVG
       360        370        380        390        400
KQEFLYGRIE ARAKLPKGKG AFPAFWTLGA DFTLDGDIAS TQGYGWPSTG
       410        420        430        440        450
ELDIMELIGA PNGEHEGELA EGDQSNKTVY GTPHFYYVKG DADKDGSYSP
       460        470        480        490        500
TALGGNLTLS DDFYDDYHIF GINWYPDKIE WYVDGIVYNT MYLTGDERLE
       510        520        530        540        550
AAAAAFNKPQ YLQFNLATGG NWSKNAGYYL ASDETAFVID YVRYYQDAEQ
       560        570        580        590        600
KAASEAYYAS QPDLKGVKDL TMLEGTSPDL AQAVTTDQEG YVVDFSVENE
       610        620        630        640        650
YLFTNKGGNT NAALQAAGRD DLSALSELAP GIYNIYYSAV PYNADLGSTV
       660        670        680        690        700
TPTAKIAREV AILTVLPKEG LIGKKGEPLS TVALPANWQW VTPEEILGSA
       710        720        730        740        750
EHYQIKYTTE GGRAIYTSIE ASYISDQPIS DQASILVDLG NAILDATAEK
       760        770        780        790        800
AAVTDDEALS KLDDVMSLNQ GTVTIRYRLD TADTTVRSSS PLALLSISNQ
       810        820        830        840        850
ASANEYASFF IEPKNNKIGL EFKGAEVPIV KVGSGFNLLT NSDWQTISYV
       860        870        880        890        900
FTGSRLKIYL NGDLYGEADF AGFMKQLPWK ASADTLTIGG LKRTYDGESV
       910        920        930        940        950
LHWGLKGLVD QVLIDTDVYD LTDIAKAHQS TLRPVTGEKT NVWDKYDEGV
       960        970        980        990       1000
FEYRIPSVVK TPSGTLIAAA DARKKHYNDW GDVATVVRIS HDDGKTWSNN
      1010       1020       1030       1040       1050
ITVLDMPTQP YFTTQYSLAD WNTNMTQSAF SIDSTLLTDA SGKLYLLVDV
      1060       1070       1080       1090       1100
FPESQGAVAS KAGSGYELIN GQYYLNLYDF DNNKYTVREG GIVYDQNGHQ
      1110       1120       1130       1140       1150
TDMYVDEGNF ETAFSTRGNL YQTEAGEDIL LGNIYLRSGR TKQGITREGS
      1160       1170       1180       1190       1200
QTAPLFTYMT SFLWLLTSED EGQTWSTPLD ITPQIKEDWM GFLGTGAAAG
      1210       1220       1230       1240       1250
IEIDAVNAEG EQVKRLVFPI YYTNQQNATS ASLGRQSSAN IFSDDGGQTW
      1260       1270       1280       1290       1300
QRGESPNDGR IYGHNQQTTS KDFDTSVTEL TENQIIQLNN GHLLQFMRNT
      1310       1320       1330       1340       1350
GKTIVIARST DYGATWEDNP TVTDLPEPYV NLSAIHMMVD GKEYVVLSNP
      1360       1370       1380       1390       1400
LGEPSGEQLT IRNQRMKGIL RVGEVLEDDS INWVASTIFE PKRFAYSSLV
      1410       1420       1430       1440       1450
QLDDERVGLL YEYSGQITYS TFNIKQMISD QFREDKAEIE EVTITSAAPK
      1460       1470       1480       1490       1500
QNGKASITIQ MTFNQPMFIL GDRQLAVTVN GVDKLASYVS GDGTDTAIFE
      1510       1520       1530       1540
LSLESMPEGP ITVLPQFDHT IVETKYGIRL TDDKAYTVSY PAAVTA
Length: 1,546
Mass (Da): 169,787
Last modified: May 5, 2009 - v1
Checksum: EC1A64DB75AB7E72
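
The sequence above is displayed in blocks of ten residues with position rulers between the rows. As a rough sketch of how such a listing could be turned back into a plain residue string, the hypothetical helpers `parse_sequence_block` and `slice_feature` below (names chosen for illustration only) skip the ruler lines and then cut out a feature by its 1-based, inclusive positions. Only the first row of the entry is used as sample input here.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join residues from a ruler-annotated listing, skipping ruler lines."""
    residues = []
    for line in block.splitlines():
        stripped = line.strip()
        # Ruler lines hold only digits and whitespace; skip them and blank lines.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        # Remove the spaces that separate the ten-residue groups.
        residues.append(re.sub(r"\s+", "", stripped))
    return "".join(residues)


def slice_feature(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, where both positions are 1-based and inclusive."""
    return sequence[start - 1:end]


# First ruler and first residue row of this entry, used as sample input.
first_row = """\
        10         20         30         40         50
MLFNPIGKEQ IRKYAIKKLS VGVASVCVGI GLAAGLPVVV YADDVSEVSL
"""

sequence = parse_sequence_block(first_row)
signal_peptide = slice_feature(sequence, 1, 42)
print(len(sequence), signal_peptide)
```

Applied to the whole listing, the same parser should yield a string whose length matches the Length field, which is a quick way to detect a truncated or corrupted copy.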

Sequence databases

EMBL/GenBank/DDBJ: FM204884 Genomic DNA. Translation: CAW98510.1.
RefSeq: YP_002744087.1. NC_012470.1.

Genome annotation databases

EnsemblBacteria: CAW98510; CAW98510; SZO_05310.
GeneID: 7695084.
KEGG: seq:SZO_05310.
PATRIC: 19650718. VBIStrEqu35012_0562.

Publications

  1. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: H70 (Imported).

Entry information

Entry name: C0MCG2_STRS7
Accession: Primary (citable) accession number: C0MCG2
Entry history:
Integrated into UniProtKB/TrEMBL: May 5, 2009
Last sequence update: May 5, 2009
Last modified: January 7, 2015
This is version 43 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome (Imported)

Similar proteins

Similar proteins from the UniProt Reference Clusters (UniRef) are grouped at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
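
The identity and overlap thresholds above can be read as a simple two-part test against the seed sequence. The sketch below is illustrative only and is not the UniRef pipeline's actual procedure: the function `joins_cluster` is hypothetical, and it assumes that identity is measured as identical residues over aligned positions and that overlap is measured as aligned positions over the seed length.

```python
def joins_cluster(identical: int, aligned: int, seed_length: int,
                  min_identity: float, min_overlap: float = 0.80) -> bool:
    """Check a candidate alignment against identity and overlap thresholds.

    Assumptions (not taken from UniRef documentation):
    identity = identical / aligned, overlap = aligned / seed_length.
    """
    if aligned <= 0 or seed_length <= 0:
        raise ValueError("aligned and seed_length must be positive")
    identity = identical / aligned
    overlap = aligned / seed_length
    return identity >= min_identity and overlap >= min_overlap


# Made-up alignment counts against a 1,546-residue seed, for illustration.
# 1400 identical of 1500 aligned: identity ~0.93, overlap ~0.97.
fits_uniref90 = joins_cluster(1400, 1500, 1546, min_identity=0.90)
# 700 identical of 1000 aligned: identity 0.70, but overlap ~0.65 is too low.
fits_uniref50 = joins_cluster(700, 1000, 1546, min_identity=0.50)
print(fits_uniref90, fits_uniref50)
```

The second case shows why the overlap condition matters: a short but well-conserved fragment can pass the identity threshold and still be excluded from the cluster.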