Skip Header

Contribute Send feedback
Read comments (?) or add your own

P00787 (CATB_RAT) Reviewed, UniProtKB/Swiss-Prot

Last modified October 19, 2011. Version 120. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Cathepsin B

EC=3.4.22.1
Alternative name(s):
Cathepsin B1
RSG-2

Cleaved into the following 2 chains:

  1. Cathepsin B light chain
  2. Cathepsin B heavy chain
Gene names
Name:Ctsb
OrganismRattus norvegicus (Rat)
Taxonomic identifier10116 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeRattus

Protein attributes

Sequence length339 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Thiol protease which is believed to participate in intracellular degradation and turnover of proteins. Has also been implicated in tumor invasion and metastasis.

Catalytic activity

Hydrolysis of proteins with broad specificity for peptide bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule substrates (thus differing from cathepsin L). In addition to being an endopeptidase, shows peptidyl-dipeptidase activity, liberating C-terminal dipeptides.

Subunit structure

Interacts with SRPX2 By similarity. Dimer of a heavy chain and a light chain cross-linked by a disulfide bond.

Subcellular location

Lysosome. Melanosome By similarity.

Sequence similarities

Belongs to the peptidase C1 family.

Ontologies

Keywords
   Cellular componentLysosome
   DomainSignal
   Molecular functionHydrolase
Protease
Thiol protease
   PTMDisulfide bond
Glycoprotein
Zymogen
   Technical term3D-structure
Complete proteome
Direct protein sequencing
Reference proteome
Gene Ontology (GO)
   Biological processautophagy

Inferred from expression pattern. Source: RGD

negative regulation of cell death

Inferred from mutant phenotype. Source: RGD

proteolysis

Inferred from direct assay. Source: RGD

regulation of catalytic activity

Inferred from electronic annotation. Source: InterPro

response to amine stimulus

Inferred from expression pattern. Source: RGD

response to ethanol

Inferred from expression pattern. Source: RGD

response to glucose stimulus

Inferred from expression pattern. Source: RGD

response to interleukin-4

Inferred from expression pattern. Source: RGD

response to mechanical stimulus

Inferred from expression pattern. Source: RGD

response to organic cyclic compound

Inferred from expression pattern. Source: RGD

response to peptide hormone stimulus

Inferred from expression pattern. Source: RGD

response to wounding

Inferred from expression pattern. Source: RGD

skeletal muscle tissue development

Inferred from expression pattern. Source: RGD

   Cellular componentapical plasma membrane

Inferred from direct assay. Source: RGD

external side of plasma membrane

Inferred from direct assay. Source: RGD

extracellular space

Inferred from direct assay. Source: RGD

lysosome

Inferred from direct assay. Source: RGD

melanosome

Inferred from electronic annotation. Source: UniProtKB-SubCell

mitochondrion

Inferred from direct assay. Source: RGD

sarcolemma

Inferred from direct assay. Source: RGD

soluble fraction

Inferred from direct assay. Source: RGD

   Molecular functioncysteine-type endopeptidase activity

Inferred from direct assay. Source: RGD

kininogen binding

Inferred from physical interaction. Source: RGD

peptide binding

Inferred from direct assay. Source: RGD

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1717 Potential
Propeptide18 – 7962Activation peptide
PRO_0000026153
Chain80 – 333254Cathepsin B
PRO_0000026154
Chain80 – 12647Cathepsin B light chain
PRO_0000026155
Chain129 – 333205Cathepsin B heavy chain
PRO_0000026156
Propeptide334 – 3396
PRO_0000026157

Sites

Active site1081
Active site2781
Active site2981

Amino acid modifications

Glycosylation1921N-linked (GlcNAc...) Ref.3
Disulfide bond93 ↔ 122
Disulfide bond105 ↔ 150
Disulfide bond141 ↔ 207
Disulfide bond142 ↔ 146
Disulfide bond179 ↔ 211
Disulfide bond187 ↔ 198

Natural variations

Natural variant3021V → A.

Experimental info

Sequence conflict1591W → G AA sequence Ref.3

Secondary structure

............................................ 339
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P00787 [UniParc].

Last modified October 1, 1996. Version 2.
Checksum: 925E2E58C2B03CDA

FASTA33937,470
        10         20         30         40         50         60 
MWWSLIPLSC LLALTSAHDK PSSHPLSDDM INYINKQNTT WQAGRNFYNV DISYLKKLCG 

        70         80         90        100        110        120 
TVLGGPNLPE RVGFSEDINL PESFDAREQW SNCPTIAQIR DQGSCGSCWA FGAVEAMSDR 

       130        140        150        160        170        180 
ICIHTNGRVN VEVSAEDLLT CCGIQCGDGC NGGYPSGAWN FWTRKGLVSG GVYNSHIGCL 

       190        200        210        220        230        240 
PYTIPPCEHH VNGSRPPCTG EGDTPKCNKM CEAGYSTSYK EDKHYGYTSY SVSDSEKEIM 

       250        260        270        280        290        300 
AEIYKNGPVE GAFTVFSDFL TYKSGVYKHE AGDVMGGHAI RILGWGIENG VPYWLVANSW 

       310        320        330 
NVDWGDNGFF KILRGENHCG IESEIVAGIP RTQQYWGRF 

« Hide

References

[1]"Cathepsin B, a cysteine protease implicated in metastatic progression, is also expressed during regression of the rat prostate and mammary glands."
Guenette R.S., Mooibroek M., Wong K., Wong P., Tenniswood M.
Eur. J. Biochem. 226:311-321(1994) [PubMed: 8001549] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
Strain: Sprague-Dawley.
Tissue: Mammary gland.
[2]"Identification of cDNA clones encoding a precursor of rat liver cathepsin B."
San Segundo B., Chan S.J., Steiner D.F.
Proc. Natl. Acad. Sci. U.S.A. 82:2320-2324(1985) [PubMed: 2986112] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 69-339.
[3]"Homology of amino acid sequences of rat liver cathepsins B and H with that of papain."
Takio K., Towatari T., Katunuma N., Teller D.C., Titani K.
Proc. Natl. Acad. Sci. U.S.A. 80:3666-3670(1983) [PubMed: 6574504] [Abstract]
Cited for: PROTEIN SEQUENCE OF 80-126 AND 129-333.
Tissue: Liver.
[4]Lubec G., Afjehi-Sadat L., Chen W.-Q.
Submitted (JAN-2009) to UniProtKB
Cited for: PROTEIN SEQUENCE OF 246-263, MASS SPECTROMETRY.
Strain: Sprague-Dawley.
Tissue: Hippocampus and Spinal cord.
[5]"Rat procathepsin B. Proteolytic processing to the mature form in vitro."
Rowan A.D., Mason P., Mach L., Mort J.S.
J. Biol. Chem. 267:15993-15999(1992) [PubMed: 1639824] [Abstract]
Cited for: PROTEOLYTIC PROCESSING.
[6]"Crystal structures of recombinant rat cathepsin B and a cathepsin B-inhibitor complex. Implications for structure-based inhibitor design."
Jia Z., Hasnain S., Hirama T., Lee X., Mort J.S., To R., Huber C.P.
J. Biol. Chem. 270:5527-5533(1995) [PubMed: 7890671] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.2 ANGSTROMS).
[7]"Structure of rat procathepsin B: model for inhibition of cysteine protease activity by the proregion."
Cygler M., Sivaraman J., Grochulski P., Coulombe R., Storer A.C., Mort J.S.
Structure 4:405-416(1996) [PubMed: 8740363] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.8 ANGSTROMS) OF 18-339.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
X82396 mRNA. Translation: CAA57792.1.
M11305 mRNA. Translation: AAA40993.1.
IPIIPI00212811.
PIRKHRTB. S51041.
UniGeneRn.100909.

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
1CPJX-ray2.20A/B74-333[»]
1CTEX-ray2.10A/B80-333[»]
1MIRX-ray2.80A/B18-339[»]
1THEX-ray1.90A/B74-333[»]
ProteinModelPortalP00787.
SMRP00787. Positions 27-339.
ModBaseSearch...

Protein-protein interaction databases

STRINGP00787.

Protein family/group databases

MEROPSC01.060.

Proteomic databases

PRIDEP00787.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

UCSCNM_022597. rat.

Organism-specific databases

RGD621509. Ctsb.

Phylogenomic databases

eggNOGroNOG08540.
HOVERGENHBG003480.
InParanoidP00787.
OrthoDBEOG4K6G4C.

Enzyme and pathway databases

BRENDA3.4.22.1. 5301.

Gene expression databases

ArrayExpressP00787.
GenevestigatorP00787.
GermOnlineENSRNOG00000010331. Rattus norvegicus.

Family and domain databases

InterProIPR000169. Pept_cys_AS.
IPR013128. Peptidase_C1A.
IPR000668. Peptidase_C1A_C.
IPR015643. Peptidase_C1A_cathepsin-B.
IPR012599. Propeptide_C1A.
[Graphical view]
PANTHERPTHR12411:SF16. CathepsinB_like. 1 hit.
PTHR12411. Peptidase_C1A. 1 hit.
PfamPF00112. Peptidase_C1. 1 hit.
PF08127. Propeptide_C1. 1 hit.
[Graphical view]
PRINTSPR00705. PAPAIN.
SMARTSM00645. Pept_C1. 1 hit.
[Graphical view]
PROSITEPS00640. THIOL_PROTEASE_ASN. 1 hit.
PS00139. THIOL_PROTEASE_CYS. 1 hit.
PS00639. THIOL_PROTEASE_HIS. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameCATB_RAT
AccessionPrimary (citable) accession number: P00787
Entry history
Integrated into UniProtKB/Swiss-Prot: July 21, 1986
Last sequence update: October 1, 1996
Last modified: October 19, 2011
This is version 120 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

Peptidase families

Classification of peptidase families and list of entries

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families