Skip Header

Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P07339 (CATD_HUMAN)

Last modified February 9, 2010. Version 127. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (7) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Web resources · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Cathepsin D
    EC=3.4.23.5
Cleaved into the following 2 chains:
    1- Recommended name:
            Cathepsin D light chain
    2- Recommended name:
            Cathepsin D heavy chain
Gene names
Name: CTSD
Synonyms: CPSD
OrganismHomo sapiens (Human) [Complete proteome]
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length412 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Acid protease active in intracellular protein breakdown. Involved in the pathogenesis of several diseases such as breast cancer and possibly Alzheimer disease.

Catalytic activity

Specificity similar to, but narrower than, that of pepsin A. Does not cleave the 4-Gln-|-His-5 bond in B chain of insulin.

Subunit structure

Consists of a light chain and a heavy chain.

Subcellular location

Lysosome. Melanosome. Note: Identified by mass spectrometry in melanosome fractions from stage I to stage IV. Ref.10 Ref.14

Polymorphism

The Val-58 allele is significantly overrepresented in demented patients (11.8%) compared with non-demented controls (4.9%). Carriers of the Val-58 allele have a 3.1-fold increased risk for developing AD than non-carriers.

Involvement in disease

Defects in CTSD are the cause of neuronal ceroid lipofuscinosis type 10 (CLN10) [MIM:610127]; also known as neuronal ceroid lipofuscinosis due to cathepsin D deficiency. A form of neuronal ceroid lipofuscinosis with onset at birth or early childhood. Neuronal ceroid lipofuscinoses are progressive neurodegenerative, lysosomal storage diseases characterized by intracellular accumulation of autofluorescent liposomal material, and clinically by seizures, dementia, visual loss, and/or cerebral atrophy. Ref.13 Ref.21

Sequence similarities

Belongs to the peptidase A1 family.

Ontologies

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1818
Propeptide19 – 6446Activation peptide
PRO_0000025949
Chain65 – 412348Cathepsin D
PRO_0000025950
Chain65 – 16197Cathepsin D light chain Probable
PRO_0000025951
Chain169 – 412244Cathepsin D heavy chain Probable
PRO_0000025952

Sites

Active site971 Ref.19
Active site2951 Ref.19

Amino acid modifications

Glycosylation1341N-linked (GlcNAc...) Ref.17
Glycosylation2631N-linked (GlcNAc...) Ref.17 Ref.11 Ref.12 Ref.15 Ref.19
Disulfide bond91 ↔ 160 Ref.19
Disulfide bond110 ↔ 117 Ref.19
Disulfide bond286 ↔ 290 Ref.19
Disulfide bond329 ↔ 366

Natural variations

Natural variant581A → V Associated with increased risk in AD; possibly influences secretion and intracellular maturation. dbSNP rs17571. Ref.20
VAR_011621
Natural variant2291F → I in CLN10. Ref.21
VAR_029362
Natural variant2821G → R
VAR_058490
Natural variant3831W → C in CLN10. Ref.21
VAR_029363

Secondary structure

....................................................................... 412
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P07339-1 [UniParc].

Last modified April 1, 1988. Version 1.
Checksum: 903FB8412E0CF0B0

FASTA41244,552
        10         20         30         40         50         60 
MQPSSLLPLA LCLLAAPASA LVRIPLHKFT SIRRTMSEVG GSVEDLIAKG PVSKYSQAVP 

        70         80         90        100        110        120 
AVTEGPIPEV LKNYMDAQYY GEIGIGTPPQ CFTVVFDTGS SNLWVPSIHC KLLDIACWIH 

       130        140        150        160        170        180 
HKYNSDKSST YVKNGTSFDI HYGSGSLSGY LSQDTVSVPC QSASSASALG GVKVERQVFG 

       190        200        210        220        230        240 
EATKQPGITF IAAKFDGILG MAYPRISVNN VLPVFDNLMQ QKLVDQNIFS FYLSRDPDAQ 

       250        260        270        280        290        300 
PGGELMLGGT DSKYYKGSLS YLNVTRKAYW QVHLDQVEVA SGLTLCKEGC EAIVDTGTSL 

       310        320        330        340        350        360 
MVGPVDEVRE LQKAIGAVPL IQGEYMIPCE KVSTLPAITL KLGGKGYKLS PEDYTLKVSQ 

       370        380        390        400        410 
AGKTLCLSGF MGMDIPPPSG PLWILGDVFI GRYYTVFDRD NNRVGFAEAA RL 

« Hide

References

« Hide 'large scale' references
[1]"Cloning and sequence analysis of cDNA for human cathepsin D."
Faust P.L., Kornfeld S., Chirgwin J.M.
Proc. Natl. Acad. Sci. U.S.A. 82:4910-4914(1985) [PubMed: 3927292] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
[2]"Oestrogen regulates cathepsin D mRNA levels in oestrogen responsive human breast cancer cells."
Westley B.R., May F.E.B.
Nucleic Acids Res. 15:3773-3786(1987) [PubMed: 3588310] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
[3]"Molecular organization of the human cathepsin D gene."
Redecker B., Heckendorf B., Grosch H.W., Mersmann G., Hasilik A.
DNA Cell Biol. 10:423-431(1991) [PubMed: 2069717] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[4]"Cloning of human full open reading frames in Gateway(TM) system entry vector (pDONR201)."
Ebert L., Schick M., Neubert P., Schatten R., Henze S., Korn B.
Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
[5]"Cloning of human full-length CDSs in BD Creator(TM) system donor vector."
Kalnine N., Chen X., Rolfs A., Halleck A., Hines L., Eisenstein S., Koundinya M., Raphael J., Moreira D., Kelley T., LaBaer J., Lin Y., Phelan M., Farmer A.
Submitted (OCT-2004) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
[6]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Kidney.
[7]"The human cathepsin D-encoding gene is transcribed from an estrogen-regulated and a constitutive start point."
May F.E., Smith D.J., Westley B.R.
Gene 134:277-282(1993) [PubMed: 8262386] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-22.
[8]"Characterization of the proximal estrogen-responsive element of human cathepsin D gene."
Augereau P., Miralles F., Cavailles V., Gaudelet C., Parker M., Rochefort H.
Mol. Endocrinol. 8:693-703(1994) [PubMed: 7935485] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-22.
[9]Hochstrasser D.F., Frutiger S., Paquet N., Bairoch A., Ravier F., Pasquali C., Sanchez J.-C., Tissot J.-D., Bjellqvist B., Vargas R., Appel R.D., Hughes G.J.
Submitted (JUN-1992) to UniProtKB
Cited for: PROTEIN SEQUENCE OF 170-180.
Tissue: Liver.
[10]"Proteomic analysis of early melanosomes: identification of novel melanosomal proteins."
Basrur V., Yang F., Kushimoto T., Higashimoto Y., Yasumoto K., Valencia J., Muller J., Vieira W.D., Watabe H., Shabanowitz J., Hearing V.J., Hunt D.F., Appella E.
J. Proteome Res. 2:69-79(2003) [PubMed: 12643545] [Abstract]
Cited for: SUBCELLULAR LOCATION [LARGE SCALE ANALYSIS], MASS SPECTROMETRY.
[11]"Identification and quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling and mass spectrometry."
Zhang H., Li X.-J., Martin D.B., Aebersold R.
Nat. Biotechnol. 21:660-666(2003) [PubMed: 12754519] [Abstract]
Cited for: GLYCOSYLATION AT ASN-263.
[12]"Human plasma N-glycoproteome analysis by immunoaffinity subtraction, hydrazide chemistry, and mass spectrometry."
Liu T., Qian W.-J., Gritsenko M.A., Camp D.G. II, Monroe M.E., Moore R.J., Smith R.D.
J. Proteome Res. 4:2070-2080(2005) [PubMed: 16335952] [Abstract]
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-263, MASS SPECTROMETRY.
Tissue: Plasma.
[13]"Cathepsin D deficiency underlies congenital human neuronal ceroid-lipofuscinosis."
Siintola E., Partanen S., Stromme P., Haapanen A., Haltia M., Maehlen J., Lehesjoki A.E., Tyynela J.
Brain 129:1438-1445(2006) [PubMed: 16670177] [Abstract]
Cited for: INVOLVEMENT IN CLN10, VARIANT ARG-282.
[14]"Proteomic and bioinformatic characterization of the biogenesis and function of melanosomes."
Chi A., Valencia J.C., Hu Z.-Z., Watabe H., Yamaguchi H., Mangini N.J., Huang H., Canfield V.A., Cheng K.C., Yang F., Abe R., Yamagishi S., Shabanowitz J., Hearing V.J., Wu C., Appella E., Hunt D.F.
J. Proteome Res. 5:3135-3144(2006) [PubMed: 17081065] [Abstract]
Cited for: SUBCELLULAR LOCATION [LARGE SCALE ANALYSIS], MASS SPECTROMETRY.
[15]"Elucidation of N-glycosylation sites on human platelet proteins: a glycoproteomic approach."
Lewandrowski U., Moebius J., Walter U., Sickmann A.
Mol. Cell. Proteomics 5:226-233(2006) [PubMed: 16263699] [Abstract]
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-263, MASS SPECTROMETRY.
Tissue: Platelet.
[16]Colinge J., Superti-Furga G., Bennett K.L.
Submitted (OCT-2008) to UniProtKB
Cited for: IDENTIFICATION [LARGE SCALE ANALYSIS], MASS SPECTROMETRY.
[17]"Glycoproteomics analysis of human liver tissue by combination of multiple enzyme digestion and hydrazide chemistry."
Chen R., Jiang X., Sun D., Han G., Wang F., Ye M., Wang L., Zou H.
J. Proteome Res. 8:651-661(2009) [PubMed: 19159218] [Abstract]
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-134 AND ASN-263, MASS SPECTROMETRY.
Tissue: Liver.
[18]"Two crystal structures for cathepsin D: the lysosomal targeting signal and active site."
Metcalf P., Fusek M.
EMBO J. 12:1293-1302(1993) [PubMed: 8467789] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (3 ANGSTROMS).
Tissue: Spleen.
[19]"Crystal structures of native and inhibited forms of human cathepsin D: implications for lysosomal targeting and drug design."
Baldwin E.T., Bhat T.N., Gulnik S., Hosur M.V., Sowder R.C. II, Cachau R.E., Collins J., Silva A.M., Erickson J.W.
Proc. Natl. Acad. Sci. U.S.A. 90:6796-6800(1993) [PubMed: 8393577] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.5 ANGSTROMS).
Tissue: Liver.
[20]"A genetic variation of cathepsin D is a major risk factor for Alzheimer's disease."
Papassotiropoulos A., Bagli M., Kurz A., Kornhuber J., Forstl H., Maier W., Pauls J., Lautenschlager N., Heun R.
Ann. Neurol. 47:399-403(2000) [PubMed: 10716266] [Abstract]
Cited for: VARIANT VAL-58.
[21]"Cathepsin D deficiency is associated with a human neurodegenerative disorder."
Steinfeld R., Reinhardt K., Schreiber K., Hillebrand M., Kraetzner R., Bruck W., Saftig P., Gartner J.
Am. J. Hum. Genet. 78:988-998(2006) [PubMed: 16685649] [Abstract]
Cited for: VARIANTS CLN10 ILE-229 AND CYS-383.
+Additional computationally mapped references.

Web resources

NCL CTSD

Neural Ceroid Lipofuscinoses mutation db

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
M11233 mRNA. Translation: AAB59529.1.
X05344 mRNA. Translation: CAA28955.1.
M63138 expand/collapse EMBL AC list , M63134, M63135, M63136, M63137 Genomic DNA. Translation: AAA51922.1.
CR456947 mRNA. Translation: CAG33228.1.
BT006910 mRNA. Translation: AAP35556.1.
BT020155 mRNA. Translation: AAV38957.1.
BC016320 mRNA. Translation: AAH16320.1.
L12980 Genomic DNA. Translation: AAA16314.1.
S74689 Genomic DNA. Translation: AAD14156.1.
S52557 Genomic DNA. Translation: AAD13868.1.
IPIIPI00011229.
PIRKHHUD. A25771.
RefSeqNP_001900.1.
UniGeneHs.654447

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
1LYAX-ray2.50A/C65-161[»]
B/D170-410[»]
1LYBX-ray2.50A/C65-161[»]
B/D170-410[»]
1LYWX-ray2.50A/C/E/G65-161[»]
B/D/F/H170-410[»]
SMRP07339. Positions 69-409.
ModBaseSearch...

Protein-protein interaction databases

IntActP07339. 1 interaction.
STRINGP07339.

Protein family/group databases

MEROPSA01.009.

2-D gel databases

SWISS-2DPAGEP07339.
DOSAC-COBS-2DPAGEP07339.
REPRODUCTION-2DPAGEIPI00011229.
Siena-2DPAGEP07339.

Proteomic databases

PeptideAtlasP07339.
PRIDEP07339.

Genome annotation databases

EnsemblENST00000236671; ENSP00000236671; ENSG00000117984; Homo sapiens. [Genome view]
ENST00000407742; ENSP00000384947; ENSG00000117984; Homo sapiens. [Genome view]
GeneID1509.
KEGGhsa:1509.
UCSCuc001luc.1. human.

Organism-specific databases

CTD1509.
GeneCardsGC11M001730.
H-InvDBHIX0009358.
HIX0009359.
HGNCHGNC:2529. CTSD.
HPACAB000109.
HPA003001.
MIM116840. gene.
610127. phenotype.
Orphanet168486. Congenital neuronal ceroid lipofuscinosis.
216. Neuronal ceroid lipofuscinosis.
PharmGKBPA27029.
GenAtlasSearch...

Phylogenomic databases

eggNOGprNOG10308.
HOGENOMHBG590923.
HOVERGENP07339.
InParanoidP07339.
OMASGFMGID.
PhylomeDBP07339.

Enzyme and pathway databases

BioCycMetaCyc:MONOMER-14813.
BRENDA3.4.23.5. 247.
Pathway_Interaction_DBceramidepathway. Ceramide signaling pathway.

Gene expression databases

ArrayExpressP07339.
BgeeP07339.
CleanExHS_CTSD.
GenevestigatorP07339.
GermOnlineENSG00000117984. Homo sapiens.

Family and domain databases

InterProIPR001461. Peptidase_A1.
IPR021109. Peptidase_aspartic.
IPR001969. Peptidase_aspartic_AS.
IPR009007. Peptidase_aspartic_catalytic.
IPR012848. Propep_A1.
[Graphical view]
Gene3DG3DSA:2.40.70.10. Pept_Aspartc_cat. 2 hits.
PANTHERPTHR13683. Peptidase_A1. 1 hit.
PfamPF07966. A1_Propeptide. 1 hit.
PF00026. Asp. 1 hit.
[Graphical view]
PRINTSPR00792. PEPSIN.
PROSITEPS00141. ASP_PROTEASE. 2 hits.
[Graphical view]
ProtoNetSearch...

Other Resources

DrugBankDB00047. Insulin Glargine recombinant.
DB00046. Insulin Lyspro recombinant.
DB00030. Insulin recombinant.
DB00071. Insulin, porcine.
NextBio6247.
PMAP-CutDBP07339.
SOURCESearch...

Entry information

Entry nameCATD_HUMAN
AccessionPrimary (citable) accession number: P07339
Secondary accession number(s): Q6IB57
Entry history
Integrated into UniProtKB/Swiss-Prot: April 1, 1988
Last sequence update: April 1, 1988
Last modified: February 9, 2010
This is version 127 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 11

Human chromosome 11: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

Peptidase families

Classification of peptidase families and list of entries

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Web resources · Cross-references · Entry information · Relevant documents