P09668 (CATH_HUMAN) Reviewed, UniProtKB/Swiss-Prot
Last modified
January 25, 2012.
Version 134.
History...
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Web links·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Web links·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Pro-cathepsin H Cleaved into the following 4 chains: | ||||
| Gene names |
| ||||
| Organism | Homo sapiens (Human) | ||||
| Taxonomic identifier | 9606 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo |
Protein attributes
| Sequence length | 335 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Function | Important for the overall degradation of proteins in lysosomes. |
| Catalytic activity | Hydrolysis of proteins, acting as an aminopeptidase (notably, cleaving Arg-|-Xaa bonds) as well as an endopeptidase. |
| Subunit structure | Composed of a mini chain and a large chain. The large chain may be split into heavy and light chain. All chains are held together by disulfide bonds. |
| Subcellular location | |
| Sequence similarities | Belongs to the peptidase C1 family. |
Ontologies
| Keywords | |
|---|---|
| Cellular component | Lysosome |
| Coding sequence diversity | Polymorphism |
| Domain | Signal |
| Molecular function | Hydrolase Protease Thiol protease |
| PTM | Disulfide bond Glycoprotein Zymogen |
| Technical term | 3D-structure Complete proteome Direct protein sequencing Reference proteome |
| Gene Ontology (GO) | |
| Biological process | protein destabilization Inferred from mutant phenotype. Source: UniProtKB proteolysisInferred from direct assay. Source: BHF-UCL |
| Cellular component | lysosome Inferred from electronic annotation. Source: UniProtKB-SubCell |
| Molecular function | cysteine-type endopeptidase activity Inferred from direct assay. Source: BHF-UCL |
| Complete GO annotation... | |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | |||||||||||||||||||||||||||||||||
Molecule processing | ||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 22 | 22 | ||||||||||||||||||||||||||||||||||||
| Propeptide | 23 – 97 | 75 | Activation peptide | PRO_0000026206 | ||||||||||||||||||||||||||||||||||
| Peptide | 98 – 105 | 8 | Cathepsin H mini chain Ref.6 | PRO_0000026207 | ||||||||||||||||||||||||||||||||||
| Propeptide | 106 – 115 | 10 | PRO_0000026208 | |||||||||||||||||||||||||||||||||||
| Chain | 116 – 335 | 220 | Cathepsin H | PRO_0000026209 | ||||||||||||||||||||||||||||||||||
| Chain | 116 – 292 | 177 | Cathepsin H heavy chain | PRO_0000026210 | ||||||||||||||||||||||||||||||||||
| Chain | 293 – 335 | 43 | Cathepsin H light chain | PRO_0000026211 | ||||||||||||||||||||||||||||||||||
Sites | ||||||||||||||||||||||||||||||||||||||
| Active site | 141 | 1 | By similarity | |||||||||||||||||||||||||||||||||||
| Active site | 281 | 1 | By similarity | |||||||||||||||||||||||||||||||||||
| Active site | 301 | 1 | By similarity | |||||||||||||||||||||||||||||||||||
Amino acid modifications | ||||||||||||||||||||||||||||||||||||||
| Glycosylation | 101 | 1 | N-linked (GlcNAc...) Ref.6 | |||||||||||||||||||||||||||||||||||
| Glycosylation | 230 | 1 | N-linked (GlcNAc...) Ref.6 Ref.8 | |||||||||||||||||||||||||||||||||||
| Disulfide bond | 102 ↔ 327 | By similarity | ||||||||||||||||||||||||||||||||||||
| Disulfide bond | 138 ↔ 181 | By similarity | ||||||||||||||||||||||||||||||||||||
| Disulfide bond | 172 ↔ 214 | By similarity | ||||||||||||||||||||||||||||||||||||
| Disulfide bond | 272 ↔ 322 | By similarity | ||||||||||||||||||||||||||||||||||||
Natural variations | ||||||||||||||||||||||||||||||||||||||
| Natural variant | 11 | 1 | G → R. Corresponds to variant rs2289702 [ dbSNP | Ensembl ]. | VAR_057038 | ||||||||||||||||||||||||||||||||||
| Natural variant | 23 | 1 | A → T. Corresponds to variant rs35001431 [ dbSNP | Ensembl ]. | VAR_057039 | ||||||||||||||||||||||||||||||||||
| Natural variant | 26 | 1 | C → S. Ref.1 Ref.2 Ref.4 Corresponds to variant rs1036938 [ dbSNP | Ensembl ]. | VAR_060368 | ||||||||||||||||||||||||||||||||||
| Natural variant | 126 | 1 | G → R in a colorectal cancer sample; somatic mutation. Ref.10 | VAR_036478 | ||||||||||||||||||||||||||||||||||
Experimental info | ||||||||||||||||||||||||||||||||||||||
| Sequence conflict | 179 | 1 | H → Y in CAA34734. Ref.1 | |||||||||||||||||||||||||||||||||||
| Sequence conflict | 179 | 1 | H → Y in CAA30428. Ref.5 | |||||||||||||||||||||||||||||||||||
| Sequence conflict | 306 | 1 | Q → E AA sequence Ref.6 | |||||||||||||||||||||||||||||||||||
| Sequence conflict | 306 | 1 | Q → E AA sequence Ref.7 | |||||||||||||||||||||||||||||||||||
Secondary structure | ||||||||||||||||||||||||||||||||||||||
Helix Strand Turn | ||||||||||||||||||||||||||||||||||||||
| Helix | 123 – 126 | 4 | ||||||||||||||||||||||||||||||||||||
| Helix | 141 – 158 | 18 | ||||||||||||||||||||||||||||||||||||
| Helix | 166 – 172 | 7 | ||||||||||||||||||||||||||||||||||||
| Helix | 180 – 182 | 3 | ||||||||||||||||||||||||||||||||||||
| Turn | 186 – 188 | 3 | ||||||||||||||||||||||||||||||||||||
| Helix | 189 – 196 | 8 | ||||||||||||||||||||||||||||||||||||
| Turn | 202 – 204 | 3 | ||||||||||||||||||||||||||||||||||||
| Helix | 237 – 245 | 9 | ||||||||||||||||||||||||||||||||||||
| Beta strand | 249 – 252 | 4 | ||||||||||||||||||||||||||||||||||||
| Helix | 258 – 261 | 4 | ||||||||||||||||||||||||||||||||||||
| Beta strand | 273 – 275 | 3 | ||||||||||||||||||||||||||||||||||||
| Beta strand | 283 – 291 | 9 | ||||||||||||||||||||||||||||||||||||
| Beta strand | 294 – 300 | 7 | ||||||||||||||||||||||||||||||||||||
| Turn | 304 – 310 | 7 | ||||||||||||||||||||||||||||||||||||
| Beta strand | 326 – 329 | 4 | ||||||||||||||||||||||||||||||||||||
Sequences
| ||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "Nucleotide sequence of human preprocathepsin H, a lysosomal cysteine proteinase." Fuchs R., Gassen H.G. Nucleic Acids Res. 17:9471-9471(1989) [PubMed: 2587265] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA], VARIANT SER-26. Tissue: Liver. |
| [2] | "Complete sequencing and characterization of 21,243 full-length human cDNAs." Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. Sugano S.Nat. Genet. 36:40-45(2004) [PubMed: 14702039] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA], VARIANT SER-26. Tissue: Lung. |
| [3] | "Analysis of the DNA sequence and duplication history of human chromosome 15." Zody M.C., Garber M., Sharpe T., Young S.K., Rowen L., O'Neill K., Whittaker C.A., Kamal M., Chang J.L., Cuomo C.A., Dewar K., FitzGerald M.G., Kodira C.D., Madan A., Qin S., Yang X., Abbasi N., Abouelleil A. Nusbaum C.Nature 440:671-675(2006) [PubMed: 16572171] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. |
| [4] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA], VARIANT SER-26. Tissue: Eye. |
| [5] | "Molecular cloning and sequencing of a cDNA coding for mature human kidney cathepsin H." Fuchs R., Machleidt W., Gassen H.G. Biol. Chem. Hoppe-Seyler 369:469-475(1988) [PubMed: 2849458] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 88-335. Tissue: Kidney. |
| [6] | "Amino acid sequences of the human kidney cathepsins H and L." Ritonja A., Popovic T., Kotnik M., Machleidt W., Turk V. FEBS Lett. 228:341-345(1988) [PubMed: 3342889] [Abstract] Cited for: PROTEIN SEQUENCE OF 98-105 AND 114-335, GLYCOSYLATION AT ASN-101 AND ASN-230. Tissue: Kidney. |
| [7] | "Human cathepsins B, H and L: characterization by amino acid sequences and some kinetics of inhibition by the kininogens." Machleidt W., Ritonja A., Popovic T., Kotnik M., Brzin J., Turk V., Machleidt I., Mueller-Esterl W. (In) Turk V. (eds.); Cysteine proteinases and their inhibitors, pp.3-18, Walter de Gruyter, Berlin and New York (1986) Cited for: PROTEIN SEQUENCE OF 99-105; 116-159 AND 294-335. |
| [8] | "Glycoproteomics analysis of human liver tissue by combination of multiple enzyme digestion and hydrazide chemistry." Chen R., Jiang X., Sun D., Han G., Wang F., Ye M., Wang L., Zou H. J. Proteome Res. 8:651-661(2009) [PubMed: 19159218] [Abstract] Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-230, MASS SPECTROMETRY. Tissue: Liver. |
| [9] | "Initial characterization of the human central proteome." Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., Buerckstuemmer T., Bennett K.L., Superti-Furga G., Colinge J. BMC Syst. Biol. 5:17-17(2011) [PubMed: 21269460] [Abstract] Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. |
| [10] | "The consensus coding sequences of human breast and colorectal cancers." Sjoeblom T., Jones S., Wood L.D., Parsons D.W., Lin J., Barber T.D., Mandelker D., Leary R.J., Ptak J., Silliman N., Szabo S., Buckhaults P., Farrell C., Meeh P., Markowitz S.D., Willis J., Dawson D., Willson J.K.V. Velculescu V.E.Science 314:268-274(2006) [PubMed: 16959974] [Abstract] Cited for: VARIANT [LARGE SCALE ANALYSIS] ARG-126. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EMBL GenBank DDBJ | X16832 mRNA. Translation: CAA34734.1. AK314698 mRNA. Translation: BAG37247.1. AC011944 Genomic DNA. No translation available. BC002479 mRNA. Translation: AAH02479.1. X07549 mRNA. Translation: CAA30428.1. X07549 mRNA. Translation: CAA30429.1. | ||||||||||||
| IPI | IPI00297487. | ||||||||||||
| PIR | KHHUH. S12486. | ||||||||||||
| RefSeq | NP_004381.2. NM_004390.3. | ||||||||||||
| UniGene | Hs.148641. | ||||||||||||
3D structure databases | |||||||||||||
| PDBe RCSB PDB PDBj |
| ||||||||||||
| ProteinModelPortal | P09668. | ||||||||||||
| SMR | P09668. Positions 27-335. | ||||||||||||
| ModBase | Search... | ||||||||||||
Protein-protein interaction databases | |||||||||||||
| STRING | P09668. | ||||||||||||
Protein family/group databases | |||||||||||||
| MEROPS | C01.040. | ||||||||||||
Polymorphism databases | |||||||||||||
| DMDM | 288558851. | ||||||||||||
Proteomic databases | |||||||||||||
| PRIDE | P09668. | ||||||||||||
Protocols and materials databases | |||||||||||||
| StructuralBiologyKnowledgebase | Search... | ||||||||||||
Genome annotation databases | |||||||||||||
| Ensembl | ENST00000220166; ENSP00000220166; ENSG00000103811. | ||||||||||||
| GeneID | 1512. | ||||||||||||
| KEGG | hsa:1512. | ||||||||||||
Organism-specific databases | |||||||||||||
| CTD | 1512. | ||||||||||||
| GeneCards | GC15M079213. | ||||||||||||
| H-InvDB | HIX0012481. | ||||||||||||
| HGNC | HGNC:2535. CTSH. | ||||||||||||
| HPA | CAB000458. HPA003524. | ||||||||||||
| MIM | 116820. gene. | ||||||||||||
| neXtProt | NX_P09668. | ||||||||||||
| PharmGKB | PA27033. | ||||||||||||
| GenAtlas | Search... | ||||||||||||
Phylogenomic databases | |||||||||||||
| eggNOG | prNOG04713. | ||||||||||||
| HOGENOM | HBG746690. | ||||||||||||
| HOVERGEN | HBG011513. | ||||||||||||
| InParanoid | P09668. | ||||||||||||
| OMA | EKFHFKS. | ||||||||||||
| OrthoDB | EOG4W9J43. | ||||||||||||
| PhylomeDB | P09668. | ||||||||||||
Enzyme and pathway databases | |||||||||||||
| BRENDA | 3.4.22.16. 2681. | ||||||||||||
Gene expression databases | |||||||||||||
| ArrayExpress | P09668. | ||||||||||||
| Bgee | P09668. | ||||||||||||
| CleanEx | HS_CTSH. | ||||||||||||
| Genevestigator | P09668. | ||||||||||||
| GermOnline | ENSG00000103811. Homo sapiens. | ||||||||||||
Family and domain databases | |||||||||||||
| InterPro | IPR000169. Pept_cys_AS. IPR013128. Peptidase_C1A. IPR000668. Peptidase_C1A_C. IPR013201. Prot_inhib_I29. [Graphical view] | ||||||||||||
| KO | K01366. | ||||||||||||
| PANTHER | PTHR12411. Peptidase_C1A. 1 hit. | ||||||||||||
| Pfam | PF08246. Inhibitor_I29. 1 hit. PF00112. Peptidase_C1. 1 hit. [Graphical view] | ||||||||||||
| PRINTS | PR00705. PAPAIN. | ||||||||||||
| SMART | SM00848. Inhibitor_I29. 1 hit. SM00645. Pept_C1. 1 hit. [Graphical view] | ||||||||||||
| PROSITE | PS00640. THIOL_PROTEASE_ASN. 1 hit. PS00139. THIOL_PROTEASE_CYS. 1 hit. PS00639. THIOL_PROTEASE_HIS. 1 hit. [Graphical view] | ||||||||||||
| ProtoNet | Search... | ||||||||||||
Other | |||||||||||||
| SOURCE | Search... | ||||||||||||
Entry information
| Entry name | CATH_HUMAN | ||||||||
| Accession | Primary (citable) accession number: P09668 Secondary accession number(s): B2RBK0, Q9BUM7 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Peptidase families Classification of peptidase families and list of entries |
| Human chromosome 15 Human chromosome 15: entries, gene names and cross-references to MIM |
| Human entries with polymorphisms or disease mutations List of human entries with polymorphisms or disease mutations |
| Human polymorphisms and disease mutations Index of human polymorphisms and disease mutations |
| MIM cross-references Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot |
| PDB cross-references Index of Protein Data Bank (PDB) cross-references |
| SIMILARITY comments Index of protein domains and families |

Clusters with