ID CATH_BOVIN Reviewed; 335 AA. AC Q3T0I2; DT 30-MAY-2006, integrated into UniProtKB/Swiss-Prot. DT 11-OCT-2005, sequence version 1. DT 16-JUN-2009, entry version 29. DE RecName: Full=Cathepsin H; DE EC=3.4.22.16; DE Contains: DE RecName: Full=Cathepsin H mini chain; DE Contains: DE RecName: Full=Cathepsin H heavy chain; DE Contains: DE RecName: Full=Cathepsin H light chain; DE Flags: Precursor; GN Name=CTSH; OS Bos taurus (Bovine). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; OC Pecora; Bovidae; Bovinae; Bos. OX NCBI_TaxID=9913; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC STRAIN=Crossbred X Angus; TISSUE=Ileum; RG NIH - Mammalian Gene Collection (MGC) project; RL Submitted (AUG-2005) to the EMBL/GenBank/DDBJ databases. CC -!- FUNCTION: Important for the overall degradation of proteins in CC lysosomes (By similarity). CC -!- CATALYTIC ACTIVITY: Hydrolysis of proteins, acting as an CC aminopeptidase (notably, cleaving Arg-|-Xaa bonds) as well as an CC endopeptidase. CC -!- SUBUNIT: Composed of a mini chain and a large chain. The large CC chain may be split into heavy and light chain. All chains are held CC together by disulfide bonds (By similarity). CC -!- SUBCELLULAR LOCATION: Lysosome (By similarity). CC -!- SIMILARITY: Belongs to the peptidase C1 family. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BC102386; AAI02387.1; -; mRNA. DR IPI; IPI00693034; -. DR RefSeq; NP_001029557.1; -. DR UniGene; Bt.52393; -. DR SMR; Q3T0I2; 116-335. DR MEROPS; C01.040; -. DR Ensembl; ENSBTAG00000010992; Bos taurus. DR GeneID; 510524; -. DR KEGG; bta:510524; -. DR HOVERGEN; Q3T0I2; -. DR OMA; Q3T0I2; HHRLQTF. DR BRENDA; 3.4.22.16; 251. DR GO; GO:0005764; C:lysosome; IEA:UniProtKB-SubCell. DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro. DR GO; GO:0005515; F:protein binding; IPI:UniProtKB. DR GO; GO:0006508; P:proteolysis; IEA:InterPro. DR InterPro; IPR000169; Pept_cys_AS. DR InterPro; IPR013128; Peptidase_C1A. DR InterPro; IPR000668; Peptidase_C1A_C. DR InterPro; IPR013201; Prot_inhib_I29. DR PANTHER; PTHR12411; Peptidase_C1A; 1. DR Pfam; PF08246; Inhibitor_I29; 1. DR Pfam; PF00112; Peptidase_C1; 1. DR PRINTS; PR00705; PAPAIN. DR ProDom; PD000158; Peptidase_C1; 1. DR SMART; SM00645; Pept_C1; 1. DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1. DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1. DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1. PE 2: Evidence at transcript level; KW Disulfide bond; Glycoprotein; Hydrolase; Lysosome; Protease; Signal; KW Thiol protease; Zymogen. FT SIGNAL 1 22 By similarity. FT PROPEP 23 97 Activation peptide (By similarity). FT /FTId=PRO_0000238112. FT PEPTIDE 98 105 Cathepsin H mini chain. FT /FTId=PRO_0000238113. FT PROPEP 106 115 By similarity. FT /FTId=PRO_0000238114. FT CHAIN 116 335 Cathepsin H. FT /FTId=PRO_0000238115. FT CHAIN 116 292 Cathepsin H heavy chain. FT /FTId=PRO_0000238116. FT CHAIN 293 335 Cathepsin H light chain. FT /FTId=PRO_0000238117. FT ACT_SITE 141 141 By similarity. FT ACT_SITE 281 281 By similarity. FT ACT_SITE 301 301 By similarity. FT CARBOHYD 72 72 N-linked (GlcNAc...) (Potential). FT CARBOHYD 101 101 N-linked (GlcNAc...) (Potential). FT CARBOHYD 230 230 N-linked (GlcNAc...) (Potential). FT DISULFID 102 327 By similarity. FT DISULFID 138 181 By similarity. FT DISULFID 172 214 By similarity. FT DISULFID 272 322 By similarity. SQ SEQUENCE 335 AA; 37351 MW; 79FDBEBB9984D227 CRC64; MWAVLPLLCA GAWLLGAPAC GAAELAANSL EKFHFQSWMV QHQKKYSSEE YYHRLQAFAS NLREINAHNA RNHTFKMGLN QFSDMSFDEL KRKYLWSEPQ NCSATKSNYL RGTGPYPPSM DWRKKGNFVT PVKNQGSCGS CWTFSTTGAL ESAVAIATGK LPFLAEQQLV DCAQNFNNHG CQGGLPSQAF EYIRYNKGIM GEDTYPYRGQ DGDCKYQPSK AIAFVKDVAN ITLNDEEAMV EAVALHNPVS FAFEVTADFM MYRKGIYSST SCHKTPDKVN HAVLAVGYGE EKGIPYWIVK NSWGPNWGMK GYFLIERGKN MCGLAACASF PIPLV //