P49610 (STRH_STRPN) Reviewed, UniProtKB/Swiss-Prot

Last modified November 13, 2013. Version 102.

Names and origin

Protein names
Recommended name: Beta-N-acetylhexosaminidase
EC=3.2.1.52
Gene names
Name: strH
Ordered Locus Names: SP_0057
Organism: Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4)
Taxonomic identifier: 170187 (NCBI)
Taxonomic lineage: Bacteria > Firmicutes > Bacilli > Lactobacillales > Streptococcaceae > Streptococcus

Protein attributes

Sequence length: 1312 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level.

General annotation (Comments)

Catalytic activity

Hydrolysis of terminal non-reducing N-acetyl-D-hexosamine residues in N-acetyl-beta-D-hexosaminides.

Subcellular location

Secreted, cell wall; peptidoglycan-anchor (Potential).

Sequence similarities

Belongs to the glycosyl hydrolase 20 family.

Contains 2 G5 domains.

Ontologies

Keywords
   Cellular component: Cell wall; Secreted
   Domain: Repeat; Signal
   Molecular function: Glycosidase; Hydrolase
   PTM: Peptidoglycan-anchor
   Technical term: 3D-structure; Complete proteome
Gene Ontology (GO)
   Biological process: carbohydrate metabolic process. Inferred from electronic annotation. Source: InterPro
   Cellular component: cell wall. Inferred from electronic annotation. Source: UniProtKB-SubCell
                       extracellular region. Inferred from electronic annotation. Source: UniProtKB-KW
                       membrane. Inferred from electronic annotation. Source: InterPro
   Molecular function: beta-N-acetylhexosaminidase activity. Inferred from electronic annotation. Source: UniProtKB-EC

Sequence annotation (Features)

Feature key        Position(s)    Length  Description                                                      Feature identifier

Molecule processing

Signal peptide     1 – 33         33      Potential
Chain              34 – 1284      1251    Beta-N-acetylhexosaminidase                                      PRO_0000012020
Propeptide         1285 – 1312    28      Removed by sortase (Potential)                                   PRO_0000012021

Regions

Domain             1059 – 1138    80      G5 1
Domain             1150 – 1230    81      G5 2
Region             176 – 616      441     Catalytic domain 1
Region             621 – 1046     426     Catalytic domain 2
Motif              1281 – 1285    5       LPXTG sorting signal (Potential)

Amino acid modifications

Modified residue   1284           1       Pentaglycyl murein peptidoglycan amidated threonine (Potential)

Experimental info

Sequence conflict  39             1       Missing in AAC41450. Ref.1
Sequence conflict  69             1       V → E in AAC41450. Ref.1
Sequence conflict  169            1       A → E in AAC41450. Ref.1
Sequence conflict  617            1       Q → L in AAC41450. Ref.1
Sequence conflict  1045           1       V → A in AAC41450. Ref.1
Sequence conflict  1161           1       E → K in AAC41450. Ref.1
Sequence conflict  1171           1       C → R in AAC41450. Ref.1
Sequence conflict  1267           1       V → A in AAC41450. Ref.1
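
Feature positions in this entry are 1-based and inclusive, so each listed length should equal end minus start plus one. The following is a minimal sketch of that arithmetic, with coordinates copied from the feature table; the names FEATURES, feature_length, to_slice, and mismatches are hypothetical helpers chosen for illustration and are not part of any UniProt tool.

```python
# Hypothetical helpers; (name, start, end, listed_length) copied from the entry.
FEATURES = [
    ("Signal peptide", 1, 33, 33),
    ("Chain", 34, 1284, 1251),
    ("Propeptide", 1285, 1312, 28),
    ("Domain G5 1", 1059, 1138, 80),
    ("Domain G5 2", 1150, 1230, 81),
    ("Catalytic domain 1", 176, 616, 441),
    ("Catalytic domain 2", 621, 1046, 426),
    ("LPXTG sorting signal", 1281, 1285, 5),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive start and end positions."""
    return end - start + 1


def to_slice(start: int, end: int) -> slice:
    """Convert 1-based inclusive positions to a 0-based Python slice."""
    return slice(start - 1, end)


def mismatches(features):
    """Names of features whose listed length disagrees with their positions."""
    return [name for name, start, end, listed in features
            if feature_length(start, end) != listed]
```

With the full sequence held in a string, something like seq[to_slice(34, 1284)] would then extract the mature chain under these assumptions.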

Secondary structure

Graphical view of helix, strand, and turn assignments along the 1312-residue sequence (not reproduced).

Sequences

P49610 [UniParc].

Last modified September 26, 2001. Version 2.
Checksum: 503375B5257A90B5

Length: 1,312 AA
Mass: 144,550 Da
        10         20         30         40         50         60 
MKHEKQQRFS IRKYAVGAAS VLIGFAFQAQ TVAADGVTPT TTENQPTIHT VSDSPQSSEN 

        70         80         90        100        110        120 
RTEETPKAVL QPEAPKTVET ETPATDKVAS LPKTEEKPQE EVSSTPSDKA EVVTPTSAEK 

       130        140        150        160        170        180 
ETANKKAEEA SPKKEEAKEV DSKESNTDKT DKDKPAKKDE AKAEADKPAT EAGKERAATV 

       190        200        210        220        230        240 
NEKLAKKKIV SIDAGRKYFS PEQLKEIIDK AKHYGYTDLH LLVGNDGLRF MLDDMSITAN 

       250        260        270        280        290        300 
GKTYASDDVK RAIEKGTNDY YNDPNGNHLT ESQMTDLINY AKDKGIGLIP TVNSPGHMDA 

       310        320        330        340        350        360 
ILNAMKELGI QNPNFSYFGK KSARTVDLDN EQAVAFTKAL IDKYAAYFAK KTEIFNIGLD 

       370        380        390        400        410        420 
EYANDATDAK GWSVLQADKY YPNEGYPVKG YEKFIAYAND LARIVKSHGL KPMAFNDGIY 

       430        440        450        460        470        480 
YNSDTSFGSF DKDIIVSMWT GGWGGYDVAS SKLLAEKGHQ ILNTNDAWYY VLGRNADGQG 

       490        500        510        520        530        540 
WYNLDQGLNG IKNTPITSVP KTEGADIPII GGMVAAWADT PSARYSPSRL FKLMRHFANA 

       550        560        570        580        590        600 
NAEYFAADYE SAEQALNEVP KDLNRYTAES VTAVKEAEKA IRSLDSNLSR AQQDTIDQAI 

       610        620        630        640        650        660 
AKLQETVNNL TLTPEAQKEE EAKREVEKLA KNKVISIDAG RKYFTLNQLK RIVDKASELG 

       670        680        690        700        710        720 
YSDVHLLLGN DGLRFLLDDM TITANGKTYA SDDVKKAIIE GTKAYYDDPN GTALTQAEVT 

       730        740        750        760        770        780 
ELIEYAKSKD IGLIPAINSP GHMDAMLVAM EKLGIKNPQA HFDKVSKTTM DLKNEEAMNF 

       790        800        810        820        830        840 
VKALIGKYMD FFAGKTKIFN FGTDEYANDA TSAQGWYYLK WYQLYGKFAE YANTLAAMAK 

       850        860        870        880        890        900 
ERGLQPMAFN DGFYYEDKDD VQFDKDVLIS YWSKGWWGYN LASPQYLASK GYKFLNTNGD 

       910        920        930        940        950        960 
WYYILGQKPE DGGGFLKKAI ENTGKTPFNQ LASTKYPEVD LPTVGSMLSI WADRPSAEYK 

       970        980        990       1000       1010       1020 
EEEIFELMTA FADHNKDYFR ANYNALREEL AKIPTNLEGY SKESLEALDA AKTALNYNLN 

      1030       1040       1050       1060       1070       1080 
RNKQAELDTL VANLKAALQG LKPAVTHSGS LDENEVAANV ETRPELITRT EEIPFEVIKK 

      1090       1100       1110       1120       1130       1140 
ENPNLPAGQE NIITAGVKGE RTHYISVLTE NGKTTETVLD SQVTKEVINQ VVEVGAPVTH 

      1150       1160       1170       1180       1190       1200 
KGDESGLAPT TEVKPRLDIQ EEEIPFTTVT CENPLLLKGK TQVITKGVNG HRSNFYSVST 

      1210       1220       1230       1240       1250       1260 
SADGKEVKTL VNSVVAQEAV TQIVEVGTMV THVGDENGQA AIAEEKPKLE IPSQPAPSTA 

      1270       1280       1290       1300       1310 
PAEESKVLPQ DPAPVVTEKK LPETGTHDSA GLVVAGLMST LAAYGLTKRK ED 
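
The sequence is displayed in blocks of ten residues under a position ruler, so it has to be cleaned before any residue lookup. The sketch below strips ruler lines and spaces from the final display row (which begins at residue 1261) and then locates the LPXTG sorting signal with a regular expression. The names LAST_ROW, ROW_START, clean_sequence, and find_motif are assumptions made for this example, and the pattern LP.TG is simply a literal reading of "LPXTG" with X as any residue.

```python
import re

# Final display row of the sequence, copied from the entry; starts at residue 1261.
LAST_ROW = """
      1270       1280       1290       1300       1310
PAEESKVLPQ DPAPVVTEKK LPETGTHDSA GLVVAGLMST LAAYGLTKRK ED
"""
ROW_START = 1261


def clean_sequence(block: str) -> str:
    """Drop ruler lines and whitespace, keeping only residue letters."""
    residues = []
    for line in block.splitlines():
        stripped = line.strip()
        if not stripped or stripped[0].isdigit():
            continue  # blank line or position ruler
        residues.append("".join(stripped.split()))
    return "".join(residues)


def find_motif(seq: str, pattern: str, offset: int = 1):
    """Return the 1-based inclusive (start, end) of the first match, or None."""
    match = re.search(pattern, seq)
    if match is None:
        return None
    return (match.start() + offset, match.end() + offset - 1)


tail = clean_sequence(LAST_ROW)
motif = find_motif(tail, r"LP.TG", offset=ROW_START)
```

Here motif comes out as positions 1281 to 1285, which agrees with the Motif row in the feature table.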


References

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: L36923 Genomic DNA. Translation: AAC41450.1.
                       AE005672 Genomic DNA. Translation: AAK74246.1.
PIR: A56390. E95006.
RefSeq: NP_344606.1. NC_003028.3.

3D structure databases

PDBe / RCSB PDB / PDBj

Entry  Method  Resolution (Å)  Chain    Positions
2LTJ   NMR     -               A        1050-1140
2YL5   X-ray   2.15            A/B/C/D  627-1064
2YL6   X-ray   1.60            A        181-614
2YL8   X-ray   1.75            A        181-614
2YL9   X-ray   2.65            A/B/C/D  627-1062
2YLA   X-ray   2.70            A/B/C/D  627-1064
2YLL   X-ray   1.85            A        181-614
4AZ5   X-ray   1.73            A        181-614
4AZ6   X-ray   1.36            A        181-613
4AZ7   X-ray   1.70            A        181-613
4AZB   X-ray   2.10            A        181-614
4AZC   X-ray   2.09            A/B/C/D  627-1064
4AZG   X-ray   2.40            A/B      627-1064
4AZH   X-ray   2.22            A/B/C/D  627-1064
4AZI   X-ray   1.98            A/B      627-1064
ProteinModelPortal: P49610.
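
The residue ranges of the PDB entries can be compared with the annotated regions to see which part of the protein each structure covers. This is a rough sketch of that comparison using 1-based inclusive ranges copied from this entry; REGIONS, overlap, and covered_regions are hypothetical names introduced here, and the result says nothing about structural completeness beyond the listed positions.

```python
# Annotated regions copied from the feature table (1-based, inclusive).
REGIONS = {
    "Catalytic domain 1": (176, 616),
    "Catalytic domain 2": (621, 1046),
    "G5 1": (1059, 1138),
    "G5 2": (1150, 1230),
}


def overlap(a, b):
    """Number of residues shared by two 1-based inclusive ranges."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def covered_regions(positions):
    """Map each region touched by a structure to the number of shared residues."""
    shared = {}
    for name, span in REGIONS.items():
        count = overlap(positions, span)
        if count > 0:
            shared[name] = count
    return shared
```

For instance, covered_regions((181, 614)) reports only catalytic domain 1, while the NMR range 1050-1140 of 2LTJ spans the whole of the first G5 domain.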

Protein-protein interaction databases

STRING: 170187.SP_0057.

Protein family/group databases

CAZy: GH20. Glycoside Hydrolase Family 20.

Genome annotation databases

EnsemblBacteria: AAK74246; AAK74246; SP_0057.
GeneID: 929812.
KEGG: spn:SP_0057.
PATRIC: 19704441. VBIStrPne105772_0065.

Phylogenomic databases

eggNOG: COG3525.
HOGENOM: HOG000285052.
KO: K12373.
OMA: TYASDDV.
OrthoDB: EOG6GFGKT.
ProtClustDB: CLSK883897.

Enzyme and pathway databases

BioCyc: SPNE170187:GHGN-63-MONOMER.

Family and domain databases

Gene3D: 3.20.20.80. 2 hits.
InterPro: IPR011098. G5_dom.
          IPR015883. Glyco_hydro_20_cat-core.
          IPR013781. Glyco_hydro_catalytic_dom.
          IPR017853. Glycoside_hydrolase_SF.
          IPR005877. Gpos_YSIRK.
          IPR019948. Gram-positive_anchor.
          IPR019931. LPXTG_anchor.
Pfam: PF07501. G5. 2 hits.
      PF00728. Glyco_hydro_20. 2 hits.
      PF00746. Gram_pos_anchor. 1 hit.
      PF04650. YSIRK_signal. 1 hit.
SUPFAM: SSF51445. SSF51445. 2 hits.
TIGRFAMs: TIGR01167. LPXTG_anchor. 1 hit.
          TIGR01168. YSIRK_signal. 1 hit.
PROSITE: PS51109. G5. 2 hits.
         PS50847. GRAM_POS_ANCHORING. 1 hit.

Entry information

Entry name: STRH_STRPN
Accession: Primary (citable) accession number: P49610
Entry history
Integrated into UniProtKB/Swiss-Prot: February 1, 1996
Last sequence update: September 26, 2001
Last modified: November 13, 2013
This is version 102 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

Glycosyl hydrolases

Classification of glycosyl hydrolase families and list of entries