Protein

Beta-N-acetylhexosaminidase

Gene

strH

Organism
Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4)
Status
Reviewed - Annotation score: - Experimental evidence at protein level

Function

Catalytic activity

Hydrolysis of terminal non-reducing N-acetyl-D-hexosamine residues in N-acetyl-beta-D-hexosaminides.

GO - Molecular function

GO - Biological process

Keywords

Molecular function: Glycosidase, Hydrolase

Enzyme and pathway databases

BioCyc: SPNE170187:G1FZB-64-MONOMER

Protein family/group databases

CAZy: GH20 Glycoside Hydrolase Family 20

Names & Taxonomy

Protein names
Recommended name:
Beta-N-acetylhexosaminidase (EC:3.2.1.52)
Gene names
Name: strH
Ordered Locus Names: SP_0057
Organism: Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4)
Taxonomic identifier: 170187 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Bacilli > Lactobacillales > Streptococcaceae > Streptococcus
Proteomes
  • UP000000585 Component: Chromosome

Subcellular location

GO - Cellular component

Keywords - Cellular component

Cell wall, Secreted

PTM / Processing

Molecule processing

Feature key                 Position(s)   Description                                       Length
Signal peptide              1 – 33        Sequence analysis                                 33
Chain (PRO_0000012020)      34 – 1284     Beta-N-acetylhexosaminidase                       1251
Propeptide (PRO_0000012021) 1285 – 1312   Removed by sortase (PROSITE-ProRule annotation)   28
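The three processing features above should tile the full 1,312-residue precursor without gaps or overlaps. A minimal sketch of that consistency check follows; the coordinates are copied from the table, while the names `FEATURES`, `feature_length`, and `tiles_sequence` are illustrative and not part of any UniProt tooling.

```python
# Coordinates copied from the Molecule processing table (1-based, inclusive).
FEATURES = [
    ("Signal peptide", 1, 33),
    ("Chain", 34, 1284),
    ("Propeptide", 1285, 1312),
]


def feature_length(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


def tiles_sequence(features, total_length):
    """Return True if the features cover 1..total_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in sorted(features, key=lambda f: f[1]):
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


lengths = {name: feature_length(start, end) for name, start, end in FEATURES}
```

The computed lengths can be compared with the Length column, and `tiles_sequence(FEATURES, 1312)` checks the coverage against the precursor length reported in the Sequence section.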

Amino acid modifications

Feature key        Position(s)   Description                                                                        Length
Modified residue   1284          Pentaglycyl murein peptidoglycan amidated threonine (PROSITE-ProRule annotation)   1

Keywords - PTM

Peptidoglycan-anchor

Proteomic databases

PRIDE: P49610

Structure

3D structure databases

ProteinModelPortal: P49610
SMR: P49610

Family & Domains

Domains and Repeats

Feature key   Position(s)   Description                         Length
Domain        1059 – 1138   G5 1 (PROSITE-ProRule annotation)   80
Domain        1150 – 1230   G5 2 (PROSITE-ProRule annotation)   81

Region

Feature key   Position(s)   Description          Length
Region        176 – 616     Catalytic domain 1   441
Region        621 – 1046    Catalytic domain 2   426

Motif

Feature key   Position(s)   Description                                         Length
Motif         1281 – 1285   LPXTG sorting signal (PROSITE-ProRule annotation)   5
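The LPXTG motif position can be cross-checked against the sequence itself. The sketch below searches the C-terminal residues 1271 to 1312, copied from the sequence listed in this entry, with a plain regular expression. The pattern `LP.TG` is a simplification of the PROSITE rule, and the names `C_TERMINAL_FRAGMENT`, `FRAGMENT_START`, and `find_sorting_signal` are illustrative.

```python
import re

# Residues 1271-1312 of P49610, copied from the sequence in this entry.
C_TERMINAL_FRAGMENT = "DPAPVVTEKKLPETGTHDSAGLVVAGLMSTLAAYGLTKRKED"
FRAGMENT_START = 1271  # 1-based position of the first residue in the fragment

# Simplified LPXTG pattern: any residue is accepted at the X position.
LPXTG = re.compile(r"LP.TG")


def find_sorting_signal(fragment, offset):
    """Return (start, end, motif) in 1-based precursor coordinates, or None."""
    match = LPXTG.search(fragment)
    if match is None:
        return None
    start = offset + match.start()
    return start, start + len(match.group()) - 1, match.group()


signal = find_sorting_signal(C_TERMINAL_FRAGMENT, FRAGMENT_START)
# Residue at position 1284, annotated above as the peptidoglycan-anchored threonine.
anchor_residue = C_TERMINAL_FRAGMENT[1284 - FRAGMENT_START]
```

Under these assumptions the match falls on residues 1281 to 1285 (LPETG), and position 1284 is the threonine listed under Amino acid modifications.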

Sequence similarities

Belongs to the glycosyl hydrolase 20 family. (Curated)

Keywords - Domain

Repeat, Signal

Phylogenomic databases

eggNOG: ENOG41069UX (Bacteria), COG3525 (LUCA)
HOGENOM: HOG000285052
KO: K12373
OMA: TYASDDV

Family and domain databases

InterPro:
IPR011098 G5_dom
IPR015883 Glyco_hydro_20_cat
IPR017853 Glycoside_hydrolase_SF
IPR019948 Gram-positive_anchor
IPR005877 YSIRK_signal_dom
Pfam:
PF07501 G5, 2 hits
PF00728 Glyco_hydro_20, 2 hits
PF00746 Gram_pos_anchor, 1 hit
PF04650 YSIRK_signal, 1 hit
SMART:
SM01208 G5, 2 hits
SUPFAM: SSF51445 SSF51445, 2 hits
TIGRFAMs: TIGR01168 YSIRK_signal, 1 hit
PROSITE:
PS51109 G5, 2 hits
PS50847 GRAM_POS_ANCHORING, 1 hit

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P49610-1 [UniParc]
        10         20         30         40         50
MKHEKQQRFS IRKYAVGAAS VLIGFAFQAQ TVAADGVTPT TTENQPTIHT
60 70 80 90 100
VSDSPQSSEN RTEETPKAVL QPEAPKTVET ETPATDKVAS LPKTEEKPQE
110 120 130 140 150
EVSSTPSDKA EVVTPTSAEK ETANKKAEEA SPKKEEAKEV DSKESNTDKT
160 170 180 190 200
DKDKPAKKDE AKAEADKPAT EAGKERAATV NEKLAKKKIV SIDAGRKYFS
210 220 230 240 250
PEQLKEIIDK AKHYGYTDLH LLVGNDGLRF MLDDMSITAN GKTYASDDVK
260 270 280 290 300
RAIEKGTNDY YNDPNGNHLT ESQMTDLINY AKDKGIGLIP TVNSPGHMDA
310 320 330 340 350
ILNAMKELGI QNPNFSYFGK KSARTVDLDN EQAVAFTKAL IDKYAAYFAK
360 370 380 390 400
KTEIFNIGLD EYANDATDAK GWSVLQADKY YPNEGYPVKG YEKFIAYAND
410 420 430 440 450
LARIVKSHGL KPMAFNDGIY YNSDTSFGSF DKDIIVSMWT GGWGGYDVAS
460 470 480 490 500
SKLLAEKGHQ ILNTNDAWYY VLGRNADGQG WYNLDQGLNG IKNTPITSVP
510 520 530 540 550
KTEGADIPII GGMVAAWADT PSARYSPSRL FKLMRHFANA NAEYFAADYE
560 570 580 590 600
SAEQALNEVP KDLNRYTAES VTAVKEAEKA IRSLDSNLSR AQQDTIDQAI
610 620 630 640 650
AKLQETVNNL TLTPEAQKEE EAKREVEKLA KNKVISIDAG RKYFTLNQLK
660 670 680 690 700
RIVDKASELG YSDVHLLLGN DGLRFLLDDM TITANGKTYA SDDVKKAIIE
710 720 730 740 750
GTKAYYDDPN GTALTQAEVT ELIEYAKSKD IGLIPAINSP GHMDAMLVAM
760 770 780 790 800
EKLGIKNPQA HFDKVSKTTM DLKNEEAMNF VKALIGKYMD FFAGKTKIFN
810 820 830 840 850
FGTDEYANDA TSAQGWYYLK WYQLYGKFAE YANTLAAMAK ERGLQPMAFN
860 870 880 890 900
DGFYYEDKDD VQFDKDVLIS YWSKGWWGYN LASPQYLASK GYKFLNTNGD
910 920 930 940 950
WYYILGQKPE DGGGFLKKAI ENTGKTPFNQ LASTKYPEVD LPTVGSMLSI
960 970 980 990 1000
WADRPSAEYK EEEIFELMTA FADHNKDYFR ANYNALREEL AKIPTNLEGY
1010 1020 1030 1040 1050
SKESLEALDA AKTALNYNLN RNKQAELDTL VANLKAALQG LKPAVTHSGS
1060 1070 1080 1090 1100
LDENEVAANV ETRPELITRT EEIPFEVIKK ENPNLPAGQE NIITAGVKGE
1110 1120 1130 1140 1150
RTHYISVLTE NGKTTETVLD SQVTKEVINQ VVEVGAPVTH KGDESGLAPT
1160 1170 1180 1190 1200
TEVKPRLDIQ EEEIPFTTVT CENPLLLKGK TQVITKGVNG HRSNFYSVST
1210 1220 1230 1240 1250
SADGKEVKTL VNSVVAQEAV TQIVEVGTMV THVGDENGQA AIAEEKPKLE
1260 1270 1280 1290 1300
IPSQPAPSTA PAEESKVLPQ DPAPVVTEKK LPETGTHDSA GLVVAGLMST
1310
LAAYGLTKRK ED
Length: 1,312
Mass (Da): 144,550
Last modified: September 26, 2001 - v2
Checksum: 503375B5257A90B5
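The sequence above is laid out in rows of 50 residues, in groups of 10, with a position ruler above each row. A small sketch of how that layout could be flattened into a plain string follows. It is shown on the first two rows only, and `parse_sequence_block` is a hypothetical helper rather than a UniProt-provided function.

```python
def parse_sequence_block(text):
    """Join residue rows into one string, skipping blank lines and position rulers."""
    rows = []
    for line in text.splitlines():
        compact = line.replace(" ", "")
        # Ruler lines contain only digits once the spaces are removed.
        if not compact or compact.isdigit():
            continue
        rows.append(compact)
    return "".join(rows)


# First two rows of the sequence block, copied from this entry.
BLOCK = """\
        10         20         30         40         50
MKHEKQQRFS IRKYAVGAAS VLIGFAFQAQ TVAADGVTPT TTENQPTIHT
60 70 80 90 100
VSDSPQSSEN RTEETPKAVL QPEAPKTVET ETPATDKVAS LPKTEEKPQE
"""

sequence = parse_sequence_block(BLOCK)
```

Run over the complete block, the same function should return a string whose length matches the reported Length of 1,312.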

Experimental Info

Feature key         Position(s)   Description                                         Length
Sequence conflict   39            Missing in AAC41450 (PubMed:7721787). (Curated)     1
Sequence conflict   69            V → E in AAC41450 (PubMed:7721787). (Curated)       1
Sequence conflict   169           A → E in AAC41450 (PubMed:7721787). (Curated)       1
Sequence conflict   617           Q → L in AAC41450 (PubMed:7721787). (Curated)       1
Sequence conflict   1045          V → A in AAC41450 (PubMed:7721787). (Curated)       1
Sequence conflict   1161          E → K in AAC41450 (PubMed:7721787). (Curated)       1
Sequence conflict   1171          C → R in AAC41450 (PubMed:7721787). (Curated)       1
Sequence conflict   1267          V → A in AAC41450 (PubMed:7721787). (Curated)       1
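Each conflict row describes how the AAC41450 translation differs from the displayed sequence. As a hedged illustration, the sketch below applies the two conflicts that fall within residues 1 to 100 (positions 39 and 69) to that N-terminal stretch, copied from this entry. The helper `apply_conflicts` and the `(position, description)` tuples are assumptions made for this example; the result is a reconstruction under those assumptions, not the deposited AAC41450 record.

```python
# Residues 1-100 of P49610, copied from the sequence in this entry.
N_TERMINAL = (
    "MKHEKQQRFSIRKYAVGAASVLIGFAFQAQTVAADGVTPTTTENQPTIHT"
    "VSDSPQSSENRTEETPKAVLQPEAPKTVETETPATDKVASLPKTEEKPQE"
)

# Conflicts within residues 1-100, as (1-based position, description).
CONFLICTS = [(39, "Missing"), (69, "V → E")]


def apply_conflicts(sequence, conflicts):
    """Apply deletions and substitutions, highest position first so indices stay valid."""
    residues = list(sequence)
    for position, description in sorted(conflicts, reverse=True):
        index = position - 1
        if description == "Missing":
            del residues[index]
            continue
        reference, alternative = (part.strip() for part in description.split("→"))
        if residues[index] != reference:
            raise ValueError(f"expected {reference} at {position}, found {residues[index]}")
        residues[index] = alternative
    return "".join(residues)


variant = apply_conflicts(N_TERMINAL, CONFLICTS)
```

Processing from the highest position downward means the deletion at position 39 cannot shift the index used for the substitution at position 69.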

Sequence databases

EMBL / GenBank / DDBJ:
L36923 Genomic DNA Translation: AAC41450.1
AE005672 Genomic DNA Translation: AAK74246.1
PIR: A56390, E95006
RefSeq: WP_000679952.1, NZ_AKVY01000001.1

Genome annotation databases

EnsemblBacteria: AAK74246; AAK74246; SP_0057
KEGG: spn:SP_0057

Similar proteins

Cross-references

3D structure databases

PDB / PDBe / RCSB PDB / PDBj:

PDB entry   Method   Resolution (Å)   Chain     Positions
2LTJ        NMR      -                A         1050-1140
2YL5        X-ray    2.15             A/B/C/D   627-1064
2YL6        X-ray    1.60             A         181-614
2YL8        X-ray    1.75             A         181-614
2YL9        X-ray    2.65             A/B/C/D   627-1062
2YLA        X-ray    2.70             A/B/C/D   627-1064
2YLL        X-ray    1.85             A         181-614
4AZ5        X-ray    1.73             A         181-614
4AZ6        X-ray    1.36             A         181-613
4AZ7        X-ray    1.70             A         181-613
4AZB        X-ray    2.10             A         181-614
4AZC        X-ray    2.09             A/B/C/D   627-1064
4AZG        X-ray    2.40             A/B       627-1064
4AZH        X-ray    2.22             A/B/C/D   627-1064
4AZI        X-ray    1.98             A/B       627-1064
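The PDB entries above fall into groups that line up with the regions and domains annotated under Family & Domains. The sketch below assigns each structure to the annotated region it overlaps most and picks the best-resolution X-ray entry per region. The data are copied from this entry; the names `REGIONS`, `PDB_ENTRIES`, `overlap`, `best_region`, and `best_resolution_per_region` are illustrative, and largest overlap is only one possible assignment rule.

```python
# Annotated regions and domains from this entry (1-based, inclusive).
REGIONS = {
    "Catalytic domain 1": (176, 616),
    "Catalytic domain 2": (621, 1046),
    "G5 1": (1059, 1138),
    "G5 2": (1150, 1230),
}

# (PDB entry, method, resolution in angstroms or None, start, end) from the table above.
PDB_ENTRIES = [
    ("2LTJ", "NMR", None, 1050, 1140),
    ("2YL5", "X-ray", 2.15, 627, 1064),
    ("2YL6", "X-ray", 1.60, 181, 614),
    ("2YL8", "X-ray", 1.75, 181, 614),
    ("2YL9", "X-ray", 2.65, 627, 1062),
    ("2YLA", "X-ray", 2.70, 627, 1064),
    ("2YLL", "X-ray", 1.85, 181, 614),
    ("4AZ5", "X-ray", 1.73, 181, 614),
    ("4AZ6", "X-ray", 1.36, 181, 613),
    ("4AZ7", "X-ray", 1.70, 181, 613),
    ("4AZB", "X-ray", 2.10, 181, 614),
    ("4AZC", "X-ray", 2.09, 627, 1064),
    ("4AZG", "X-ray", 2.40, 627, 1064),
    ("4AZH", "X-ray", 2.22, 627, 1064),
    ("4AZI", "X-ray", 1.98, 627, 1064),
]


def overlap(a_start, a_end, b_start, b_end):
    """Number of residues shared by two inclusive intervals."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


def best_region(start, end):
    """Name of the annotated region sharing the most residues with start..end."""
    return max(REGIONS, key=lambda name: overlap(start, end, *REGIONS[name]))


def best_resolution_per_region(entries):
    """Map region name to (PDB entry, resolution) for the sharpest X-ray structure."""
    best = {}
    for pdb_id, _method, resolution, start, end in entries:
        if resolution is None:  # NMR entries carry no resolution value
            continue
        region = best_region(start, end)
        if region not in best or resolution < best[region][1]:
            best[region] = (pdb_id, resolution)
    return best


best = best_resolution_per_region(PDB_ENTRIES)
```

With this rule the NMR entry 2LTJ maps to the first G5 domain, while the X-ray entries split between the two catalytic domains.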

Entry information

Entry name: STRH_STRPN
Accession: Primary (citable) accession number: P49610
Entry history: Integrated into UniProtKB/Swiss-Prot: February 1, 1996
Last sequence update: September 26, 2001
Last modified: February 28, 2018
This is version 125 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

3D-structure, Complete proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families
  2. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  3. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
