P50429 (ARSB_MOUSE) Reviewed, UniProtKB/Swiss-Prot
Last modified
April 3, 2013.
Version 114.
History...
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Arylsulfatase B Short name=ASB EC=3.1.6.12 Alternative name(s): N-acetylgalactosamine-4-sulfatase Short name=G4S | ||||
| Gene names |
| ||||
| Organism | Mus musculus (Mouse) [Reference proteome] | ||||
| Taxonomic identifier | 10090 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus › Mus![]() |
Protein attributes
| Sequence length | 534 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at transcript level |
General annotation (Comments)
| Catalytic activity | Hydrolysis of the 4-sulfate groups of the N-acetyl-D-galactosamine 4-sulfate units of chondroitin sulfate and dermatan sulfate. |
| Cofactor | Binds 1 calcium ion per subunit By similarity. |
| Subunit structure | Homodimer By similarity. |
| Subcellular location | |
| Post-translational modification | The conversion to 3-oxoalanine (also known as C-formylglycine, FGly), of a serine or cysteine residue in prokaryotes and of a cysteine residue in eukaryotes, is critical for catalytic activity By similarity. |
| Sequence similarities | Belongs to the sulfatase family. |
Ontologies
Alternative products
| This entry describes 2 isoforms produced by alternative splicing. [Align] [Select] | ||||||
| Isoform 1 (identifier: P50429-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Isoform 2 (identifier: P50429-2) The sequence of this isoform differs from the canonical sequence as follows: 383-431: EGHPSPRVEL...EHSAFNTSIH → PVTGDHWHAE...PDCGRARWFL 432-534: Missing. |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 41 | 41 | Potential | ||||||||
| Chain | 42 – 534 | 493 | Arylsulfatase B | PRO_0000033423 | |||||||
Sites | |||||||||||
| Active site | 148 | 1 | By similarity | ||||||||
| Metal binding | 54 | 1 | Calcium By similarity | ||||||||
| Metal binding | 55 | 1 | Calcium By similarity | ||||||||
| Metal binding | 92 | 1 | Calcium; via 3-oxoalanine By similarity | ||||||||
| Metal binding | 301 | 1 | Calcium By similarity | ||||||||
| Metal binding | 302 | 1 | Calcium By similarity | ||||||||
| Binding site | 146 | 1 | Substrate By similarity | ||||||||
| Binding site | 243 | 1 | Substrate By similarity | ||||||||
| Binding site | 319 | 1 | Substrate By similarity | ||||||||
Amino acid modifications | |||||||||||
| Modified residue | 92 | 1 | 3-oxoalanine (Cys) By similarity | ||||||||
| Glycosylation | 189 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 280 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 292 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 367 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 427 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 459 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Disulfide bond | 118 ↔ 522 | By similarity | |||||||||
| Disulfide bond | 122 ↔ 156 | By similarity | |||||||||
| Disulfide bond | 182 ↔ 193 | By similarity | |||||||||
| Disulfide bond | 406 ↔ 448 | By similarity | |||||||||
Natural variations | |||||||||||
| Alternative sequence | 383 – 431 | 49 | EGHPS…NTSIH → PVTGDHWHAEGELGCSFRTA SAAEEEPTYKLREKKRRKSP DCGRARWFL in isoform 2. | VSP_007881 | |||||||
| Alternative sequence | 432 – 534 | 103 | Missing in isoform 2. | VSP_022249 | |||||||
Experimental info | |||||||||||
| Sequence conflict | 60 | 1 | D → A in AAA37261. Ref.3 | ||||||||
| Sequence conflict | 352 | 1 | T → S in BAE34455. Ref.1 | ||||||||
| Sequence conflict | 352 | 1 | T → S in CAA63067. Ref.4 | ||||||||
| Sequence conflict | 467 | 1 | G → D in CAI84992. Ref.2 | ||||||||
Sequences
| ||||||||||||||||||||||||
References
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | AK083309 mRNA. Translation: BAC38859.1. AK154098 mRNA. Translation: BAE32375.1. AK158312 mRNA. Translation: BAE34455.1. AC131739 Genomic DNA. No translation available. AC136976 Genomic DNA. No translation available. M82877 mRNA. Translation: AAA37261.1. X92096 mRNA. Translation: CAA63067.1. BN000746 mRNA. Translation: CAI84992.1. |
| IPI | IPI00406459. IPI00652358. |
| RefSeq | NP_033842.3. NM_009712.3. |
| UniGene | Mm.300178. Mm.472255. |
3D structure databases | |
| ProteinModelPortal | P50429. |
| SMR | P50429. Positions 43-534. |
| ModBase | Search... |
PTM databases | |
| PhosphoSite | P50429. |
Proteomic databases | |
| PaxDb | P50429. |
| PRIDE | P50429. |
Protocols and materials databases | |
| DNASU | 11881. |
| StructuralBiologyKnowledgebase | Search... |
Genome annotation databases | |
| Ensembl | ENSMUST00000091403; ENSMUSP00000088964; ENSMUSG00000042082. |
| GeneID | 11881. |
| KEGG | mmu:11881. |
| UCSC | uc007rlo.1. mouse. uc011zcv.1. mouse. |
Organism-specific databases | |
| CTD | 411. |
| MGI | MGI:88075. Arsb. |
Phylogenomic databases | |
| eggNOG | COG3119. |
| GeneTree | ENSGT00560000077076. |
| HOGENOM | HOG000135354. |
| HOVERGEN | HBG004282. |
| InParanoid | P50429. |
| KO | K01135. |
| OrthoDB | EOG4DV5M0. |
Enzyme and pathway databases | |
| SABIO-RK | P50429. |
Gene expression databases | |
| Bgee | P50429. |
| CleanEx | MM_ARSB. |
| Genevestigator | P50429. |
| GermOnline | ENSMUSG00000042093. Mus musculus. |
Family and domain databases | |
| Gene3D | 3.40.720.10. 1 hit. |
| InterPro | IPR017849. Alkaline_Pase-like_a/b/a. IPR017850. Alkaline_phosphatase_core. IPR000917. Sulfatase. IPR024607. Sulfatase_CS. [Graphical view] |
| Pfam | PF00884. Sulfatase. 1 hit. [Graphical view] |
| SUPFAM | SSF53649. Alkaline_phosphatase_core. 1 hit. |
| PROSITE | PS00523. SULFATASE_1. 1 hit. PS00149. SULFATASE_2. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Other | |
| NextBio | 279911. |
| SOURCE | Search... |
Entry information
| Entry name | ARSB_MOUSE | ||||||||
| Accession | Primary (citable) accession number: P50429 Secondary accession number(s): Q32KJ1 Q8C404 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
Relevant documents
| MGD cross-references Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |

Clusters with
