Skip Header

 
Contribute Send feedback
Read comments (1) or add your own

Reviewed, UniProtKB/Swiss-Prot P09668 (CATH_HUMAN)

Last modified June 16, 2009. Version 108. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (7) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Web resources · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Cathepsin H
    EC=3.4.22.16
Cleaved into the following 3 chains:
    1- Recommended name:
            Cathepsin H mini chain
    2- Recommended name:
            Cathepsin H heavy chain
    3- Recommended name:
            Cathepsin H light chain
Gene names
Name: CTSH
Synonyms: CPSB
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length335 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Important for the overall degradation of proteins in lysosomes.

Catalytic activity

Hydrolysis of proteins, acting as an aminopeptidase (notably, cleaving Arg-|-Xaa bonds) as well as an endopeptidase.

Subunit structure

Composed of a mini chain and a large chain. The large chain may be split into heavy and light chain. All chains are held together by disulfide bonds.

Subcellular location

Lysosome.

Sequence similarities

Belongs to the peptidase C1 family.

Ontologies

Keywords
   Cellular componentLysosome
   Coding sequence diversityPolymorphism
   DomainSignal
   Molecular functionHydrolase
Protease
Thiol protease
   PTMDisulfide bond
Glycoprotein
Zymogen
   Technical term3D-structure
Direct protein sequencing
Gene Ontology (GO)
   Biological processproteolysis Ref.4

Inferred from direct assay. Source: UniProtKB

   Cellular componentlysosome

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functioncysteine-type endopeptidase activity Ref.4

Inferred from direct assay. Source: UniProtKB

protein binding

Inferred from physical interaction. Source: UniProtKB

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 2222
Propeptide23 – 9775Activation peptide Ref.5
PRO_0000026206
Peptide98 – 1058Cathepsin H mini chain
PRO_0000026207
Propeptide106 – 11510
PRO_0000026208
Chain116 – 335220Cathepsin H
PRO_0000026209
Chain116 – 292177Cathepsin H heavy chain
PRO_0000026210
Chain293 – 33543Cathepsin H light chain
PRO_0000026211

Sites

Active site1411 By similarity
Active site2811 By similarity
Active site3011 By similarity

Amino acid modifications

Glycosylation1011N-linked (GlcNAc...) Ref.5
Glycosylation2301N-linked (GlcNAc...) Ref.5
Disulfide bond102 ↔ 327 By similarity
Disulfide bond138 ↔ 181 By similarity
Disulfide bond172 ↔ 214 By similarity
Disulfide bond272 ↔ 322 By similarity

Natural variations

Natural variant111G → R: dbSNP rs2289702.
VAR_057038
Natural variant231A → T: dbSNP rs35001431.
VAR_057039
Natural variant1261G → R in a colorectal cancer sample; somatic mutation. Ref.9
VAR_036478

Experimental info

Sequence conflict1791H → Y Ref.1
Sequence conflict1791H → Y Ref.3
Sequence conflict3061Q → E Ref.5
Sequence conflict3061Q → E Ref.6

Secondary structure

.............................. 335
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P09668-1 [UniParc].

Last modified May 27, 2002. Version 3.
Checksum: 4690615EE155B767

FASTA33537,378
        10         20         30         40         50         60 
MWATLPLLCA GAWLLGVPVC GAAELSVNSL EKFHFKSWMS KHRKTYSTEE YHHRLQTFAS 

        70         80         90        100        110        120 
NWRKINAHNN GNHTFKMALN QFSDMSFAEI KHKYLWSEPQ NCSATKSNYL RGTGPYPPSV 

       130        140        150        160        170        180 
DWRKKGNFVS PVKNQGACGS CWTFSTTGAL ESAIAIATGK MLSLAEQQLV DCAQDFNNHG 

       190        200        210        220        230        240 
CQGGLPSQAF EYILYNKGIM GEDTYPYQGK DGYCKFQPGK AIGFVKDVAN ITIYDEEAMV 

       250        260        270        280        290        300 
EAVALYNPVS FAFEVTQDFM MYRTGIYSST SCHKTPDKVN HAVLAVGYGE KNGIPYWIVK 

       310        320        330 
NSWGPQWGMN GYFLIERGKN MCGLAACASY PIPLV 

« Hide

References

« Hide 'large scale' references
[1]"Nucleotide sequence of human preprocathepsin H, a lysosomal cysteine proteinase."
Fuchs R., Gassen H.G.
Nucleic Acids Res. 17:9471-9471(1989) [PubMed: 2587265] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
Tissue: Liver.
[2]"Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. expand/collapse author list , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004) [PubMed: 14702039] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Lung.
[3]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Eye.
[4]"Molecular cloning and sequencing of a cDNA coding for mature human kidney cathepsin H."
Fuchs R., Machleidt W., Gassen H.G.
Biol. Chem. Hoppe-Seyler 369:469-475(1988) [PubMed: 2849458] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 88-335.
Tissue: Kidney.
[5]"Amino acid sequences of the human kidney cathepsins H and L."
Ritonja A., Popovic T., Kotnik M., Machleidt W., Turk V.
FEBS Lett. 228:341-345(1988) [PubMed: 3342889] [Abstract]
Cited for: PROTEIN SEQUENCE OF 98-105 AND 114-335, GLYCOSYLATION AT ASN-101 AND ASN-230.
Tissue: Kidney.
[6]"Human cathepsins B, H and L: characterization by amino acid sequences and some kinetics of inhibition by the kininogens."
Machleidt W., Ritonja A., Popovic T., Kotnik M., Brzin J., Turk V., Machleidt I., Mueller-Esterl W.
(In) Turk V. (eds.); Cysteine proteinases and their inhibitors, pp.3-18, Walter de Gruyter, Berlin and New York (1986)
Cited for: PROTEIN SEQUENCE OF 99-105; 116-159 AND 294-335.
[7]Colinge J., Superti-Furga G., Bennett K.L.
Submitted (OCT-2008) to UniProtKB
Cited for: IDENTIFICATION [LARGE SCALE ANALYSIS], MASS SPECTROMETRY.
[8]"Glycoproteomics analysis of human liver tissue by combination of multiple enzyme digestion and hydrazide chemistry."
Chen R., Jiang X., Sun D., Han G., Wang F., Ye M., Wang L., Zou H.
J. Proteome Res. 8:651-661(2009) [PubMed: 19159218] [Abstract]
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-230, MASS SPECTROMETRY.
Tissue: Liver.
[9]"The consensus coding sequences of human breast and colorectal cancers."
Sjoeblom T., Jones S., Wood L.D., Parsons D.W., Lin J., Barber T.D., Mandelker D., Leary R.J., Ptak J., Silliman N., Szabo S., Buckhaults P., Farrell C., Meeh P., Markowitz S.D., Willis J., Dawson D., Willson J.K.V. expand/collapse author list , Gazdar A.F., Hartigan J., Wu L., Liu C., Parmigiani G., Park B.H., Bachman K.E., Papadopoulos N., Vogelstein B., Kinzler K.W., Velculescu V.E.
Science 314:268-274(2006) [PubMed: 16959974] [Abstract]
Cited for: VARIANT [LARGE SCALE ANALYSIS] ARG-126.
+Additional computationally mapped references.

Cross-references

Sequence databases

X07549 mRNA. Translation: CAA30428.1.
X07549 mRNA. Translation: CAA30429.1. Sequence problems.
AK314698 mRNA. Translation: BAG37247.1.
BC002479 mRNA. Translation: AAH02479.1.
X16832 mRNA. Translation: CAA34734.1.
IPIIPI00297487.
PIRKHHUH. S12486.
UniGeneHs.148641

3D structure databases

EntryMethodResolution (Å)ChainPositionsPDBsum
1BZNmodel-B116-335[»]
SMRP09668. Positions 116-335.
ModBaseSearch...

Protein family/group databases

MEROPSC01.040.

Proteomic databases

PRIDEP09668.

Genome annotation databases

EnsemblENSG00000103811. Homo sapiens. [Contig view]

Organism-specific databases

GeneCardsGC15M077001.
H-InvDBHIX0012481.
HGNCHGNC:2535. CTSH.
HPACAB000458.
HPA003524.
MIM116820. gene.
PharmGKBPA27033.
GenAtlasSearch...

Phylogenomic databases

HOGENOMP09668.
HOVERGENP09668.

Enzyme and pathway databases

BRENDA3.4.22.16. 247.

Gene expression databases

ArrayExpressP09668.
BgeeP09668.
CleanExHS_CTSH.
GermOnlineENSG00000103811. Homo sapiens.

Family and domain databases

InterProIPR000169. Pept_cys_AS.
IPR013128. Peptidase_C1A.
IPR000668. Peptidase_C1A_C.
IPR013201. Prot_inhib_I29.
[Graphical view]
PANTHERPTHR12411. Peptidase_C1A. 1 hit.
PfamPF08246. Inhibitor_I29. 1 hit.
PF00112. Peptidase_C1. 1 hit.
[Graphical view]
PRINTSPR00705. PAPAIN.
ProDomPD000158. Peptidase_C1. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTSM00645. Pept_C1. 1 hit.
[Graphical view]
PROSITEPS00640. THIOL_PROTEASE_ASN. 1 hit.
PS00139. THIOL_PROTEASE_CYS. 1 hit.
PS00639. THIOL_PROTEASE_HIS. 1 hit.
[Graphical view]
ProtoNetSearch...

Other Resources

SOURCESearch...

Entry information

Entry nameCATH_HUMAN
AccessionPrimary (citable) accession number: P09668
Secondary accession number(s): B2RBK0, Q9BUM7
Entry history
Integrated into UniProtKB/Swiss-Prot: July 1, 1989
Last sequence update: May 27, 2002
Last modified: June 16, 2009
This is version 108 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

Human chromosome 15

Human chromosome 15: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

Peptidase families

Classification of peptidase families and list of entries

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Web resources · Cross-references · Entry information · Relevant documents