Protein

Cathepsin L

Gene

Cp1

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at transcript level

Function

Important for the overall degradation of proteins in lysosomes. Essential for adult male and female fertility. May play a role in digestion. (3 publications)

Catalytic activity

Specificity close to that of papain. As compared to cathepsin B, cathepsin L exhibits higher activity toward protein substrates, but has little activity on Z-Arg-Arg-NHMec, and no peptidyl-dipeptidase activity.

Sites

Feature key    Position(s)    Description      Length
Active site    178            By similarity    1
Active site    317            By similarity    1
Active site    338            By similarity    1

GO - Molecular function

  • cysteine-type endopeptidase activity Source: FlyBase
  • peptidase activity Source: FlyBase

GO - Biological process

  • autophagic cell death Source: FlyBase
  • digestion Source: UniProtKB-KW
  • proteolysis Source: FlyBase
  • proteolysis involved in cellular protein catabolic process Source: GO_Central
  • salivary gland cell autophagic cell death Source: FlyBase

Keywords

Molecular function: Developmental protein, Hydrolase, Protease, Thiol protease
Biological process: Digestion

Enzyme and pathway databases

Reactome:
R-DME-1442490. Collagen degradation.
R-DME-1474228. Degradation of the extracellular matrix.
R-DME-1592389. Activation of Matrix Metalloproteinases.
R-DME-1679131. Trafficking and processing of endosomal TLR.
R-DME-2132295. MHC class II antigen presentation.

Protein family/group databases

MEROPS: C01.092.

Names & Taxonomy

Protein names
Recommended name:
Cathepsin L (EC:3.4.22.15)
Alternative name(s):
Cysteine proteinase 1
Cleaved into the following 2 chains:
Cathepsin L heavy chain
Cathepsin L light chain
Gene names
Name: Cp1
Synonyms: fs(2)50Ca
ORF Names: CG6692
Organism: Drosophila melanogaster (Fruit fly)
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Holometabola > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora
Proteomes
  • UP000000803 Component: Chromosome 2R

Organism-specific databases

FlyBase: FBgn0013770. Cp1.

Subcellular location

Keywords - Cellular component

Lysosome

Pathology & Biotech

Disruption phenotype

Flies exhibit wing and pigmentation defects. Females are sterile, males are partially sterile. (1 publication)

PTM / Processing

Molecule processing

Feature key       Identifier        Position(s)    Description                Length
Signal peptide    -                 1 – 48         Sequence analysis          48
Propeptide        PRO_0000026265    49 – 153       Activation peptide         105
Chain             PRO_0000026266    154 – 326      Cathepsin L heavy chain    173
Propeptide        PRO_0000026267    327 – 329      -                          3
Chain             PRO_0000026268    330 – 371      Cathepsin L light chain    42
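
The coordinates in the molecule processing table can be cross-checked with a little arithmetic. The sketch below is illustrative only: the names `FEATURES`, `feature_length` and `tiles_precursor` are hypothetical helpers, not part of any UniProt tool. It assumes the positions are 1-based and inclusive, and it confirms that the five features cover the 371-residue precursor with no gaps or overlaps.

```python
# Feature coordinates copied from the molecule processing table
# (assumed 1-based, inclusive positions on the canonical precursor).
FEATURES = [
    ("Signal peptide", 1, 48),
    ("Propeptide (activation peptide)", 49, 153),
    ("Chain (Cathepsin L heavy chain)", 154, 326),
    ("Propeptide", 327, 329),
    ("Chain (Cathepsin L light chain)", 330, 371),
]
PRECURSOR_LENGTH = 371  # length reported for isoform C


def feature_length(start, end):
    """Length of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def tiles_precursor(features, total_length):
    """True if the features cover positions 1..total_length exactly once."""
    expected_start = 1
    for _name, start, end in sorted(features, key=lambda f: f[1]):
        if start != expected_start:
            return False  # gap or overlap before this feature
        expected_start = end + 1
    return expected_start == total_length + 1


lengths = [feature_length(start, end) for _name, start, end in FEATURES]
covers_everything = tiles_precursor(FEATURES, PRECURSOR_LENGTH)
```

The computed lengths match the Length column of the table, and their sum equals the precursor length.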

Amino acid modifications

Feature key       Position(s)    Description                                    Evidence             Length
Glycosylation     127            N-linked (GlcNAc...) asparagine                Sequence analysis    1
Disulfide bond    175 ↔ 218      -                                              By similarity        -
Disulfide bond    209 ↔ 251      -                                              By similarity        -
Disulfide bond    310 ↔ 360      Interchain (between heavy and light chains)    By similarity        -

Keywords - PTM

Disulfide bond, Glycoprotein, Zymogen

Proteomic databases

PaxDb: Q95029.
PRIDE: Q95029.

Expression

Tissue specificity

In the embryo, predominantly expressed in the midgut. Also expressed in larval alimentary organs such as salivary gland and midgut including gastric caeca. (1 publication)

Developmental stage

Expressed in embryo, larva, pupa and adult. (1 publication)

Gene expression databases

Bgee: FBgn0013770.
ExpressionAtlas: Q95029. differential.
Genevisible: Q95029. DM.

Interaction

Subunit structure

Dimer of a heavy and a light chain linked by disulfide bonds.

Protein-protein interaction databases

BioGrid: 62300. 41 interactors.
IntAct: Q95029. 2 interactors.
MINT: MINT-814156.
STRING: 7227.FBpp0086719.

Structure

3D structure databases

ProteinModelPortal: Q95029.
SMR: Q95029.

Family & Domains

Sequence similarities

Belongs to the peptidase C1 family. (PROSITE-ProRule annotation)

Keywords - Domain

Signal

Phylogenomic databases

eggNOG: KOG1543. Eukaryota.
COG4870. LUCA.
GeneTree: ENSGT00900000140823.
InParanoid: Q95029.
KO: K01365.
OMA: FRYIKDN.
OrthoDB: EOG091G0AKT.
PhylomeDB: Q95029.

Family and domain databases

InterPro:
IPR025661. Pept_asp_AS.
IPR000169. Pept_cys_AS.
IPR025660. Pept_his_AS.
IPR013128. Peptidase_C1A.
IPR000668. Peptidase_C1A_C.
IPR013201. Prot_inhib_I29.
PANTHER: PTHR12411. PTHR12411. 1 hit.
Pfam:
PF08246. Inhibitor_I29. 1 hit.
PF00112. Peptidase_C1. 1 hit.
PRINTS: PR00705. PAPAIN.
SMART:
SM00848. Inhibitor_I29. 1 hit.
SM00645. Pept_C1. 1 hit.
PROSITE:
PS00640. THIOL_PROTEASE_ASN. 1 hit.
PS00139. THIOL_PROTEASE_CYS. 1 hit.
PS00639. THIOL_PROTEASE_HIS. 1 hit.

Sequences (2)

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

This entry describes 2 isoforms produced by alternative splicing.

Isoform C (identifier: Q95029-1) [UniParc]

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MNHLGVFETR FRPRTRHKSQ RAQLIPEQIT MRTAVLLPLL ALLAVAQAVS
        60         70         80         90        100
FADVVMEEWH TFKLEHRKNY QDETEERFRL KIFNENKHKI AKHNQRFAEG
       110        120        130        140        150
KVSFKLAVNK YADLLHHEFR QLMNGFNYTL HKQLRAADES FKGVTFISPA
       160        170        180        190        200
HVTLPKSVDW RTKGAVTAVK DQGHCGSCWA FSSTGALEGQ HFRKSGVLVS
       210        220        230        240        250
LSEQNLVDCS TKYGNNGCNG GLMDNAFRYI KDNGGIDTEK SYPYEAIDDS
       260        270        280        290        300
CHFNKGTVGA TDRGFTDIPQ GDEKKMAEAV ATVGPVSVAI DASHESFQFY
       310        320        330        340        350
SEGVYNEPQC DAQNLDHGVL VVGFGTDESG EDYWLVKNSW GTTWGDKGFI
       360        370
KMLRNKENQC GIASASSYPL V
Note: No experimental confirmation available.
Length: 371
Mass (Da): 41,601
Last modified: November 28, 2006 - v2
Checksum: 01955EB4735316D7
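
Because all positional information in the entry refers to the canonical sequence, the annotated positions can be checked against it directly. The following is a minimal sketch, not a UniProt utility: the helper `residue` and the constant names are hypothetical, and positions are assumed to be 1-based. It looks up the residues at the annotated active sites (178, 317, 338), at the six disulfide-bonded positions, and at the glycosylation site (127). As general background rather than something stated in the entry, papain-family peptidases are expected to use a Cys/His/Asn set of active-site residues, and N-linked glycosylation usually occurs in an N-x-S/T motif.

```python
# Canonical isoform C sequence (Q95029-1), copied from the entry in
# blocks of 10 residues; whitespace is stripped when joining.
CANONICAL = "".join("""
MNHLGVFETR FRPRTRHKSQ RAQLIPEQIT MRTAVLLPLL ALLAVAQAVS
FADVVMEEWH TFKLEHRKNY QDETEERFRL KIFNENKHKI AKHNQRFAEG
KVSFKLAVNK YADLLHHEFR QLMNGFNYTL HKQLRAADES FKGVTFISPA
HVTLPKSVDW RTKGAVTAVK DQGHCGSCWA FSSTGALEGQ HFRKSGVLVS
LSEQNLVDCS TKYGNNGCNG GLMDNAFRYI KDNGGIDTEK SYPYEAIDDS
CHFNKGTVGA TDRGFTDIPQ GDEKKMAEAV ATVGPVSVAI DASHESFQFY
SEGVYNEPQC DAQNLDHGVL VVGFGTDESG EDYWLVKNSW GTTWGDKGFI
KMLRNKENQC GIASASSYPL V
""".split())

# Positions copied from the feature tables of the entry (1-based).
ACTIVE_SITES = (178, 317, 338)
DISULFIDE_BONDS = ((175, 218), (209, 251), (310, 360))
GLYCOSYLATION_SITE = 127


def residue(seq, pos):
    """Return the residue at a 1-based position."""
    return seq[pos - 1]


active_site_residues = [residue(CANONICAL, p) for p in ACTIVE_SITES]
disulfides_are_cys = all(
    residue(CANONICAL, p) == "C" for pair in DISULFIDE_BONDS for p in pair
)
# Three residues starting at the glycosylated asparagine.
glyco_motif = CANONICAL[GLYCOSYLATION_SITE - 1 : GLYCOSYLATION_SITE + 2]
```

In this sequence the three active-site positions hold C, H and N respectively, every disulfide position is a cysteine, and the glycosylation site sits in an N-Y-T motif.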
Isoform A (identifier: Q95029-2) [UniParc]
Also known as: B

The sequence of this isoform differs from the canonical sequence as follows:
     2-31: Missing.

Length: 341
Mass (Da): 37,971
Checksum: D18382396ACE1D59
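
The "2-31: Missing" annotation can be applied mechanically to derive isoform A from the canonical sequence. The sketch below uses a hypothetical helper, `apply_missing`, and assumes 1-based inclusive coordinates. To stay short it works only on the first 50 residues of isoform C. Since the deletion lies entirely within that stretch, this is enough to show the resulting N-terminus and to confirm the reported length of 341.

```python
# First 50 residues of the canonical isoform C sequence, copied from the entry.
CANONICAL_N_TERM = "MNHLGVFETRFRPRTRHKSQRAQLIPEQITMRTAVLLPLLALLAVAQAVS"
CANONICAL_LENGTH = 371


def apply_missing(seq, start, end):
    """Delete residues start..end (1-based, inclusive) from seq."""
    return seq[: start - 1] + seq[end:]


# Isoform A: residues 2-31 of the canonical sequence are missing.
isoform_a_n_term = apply_missing(CANONICAL_N_TERM, 2, 31)
isoform_a_length = CANONICAL_LENGTH - (31 - 2 + 1)
```

Residue 31 of the canonical sequence is a methionine followed by R-T-A, so the shortened isoform still begins with M-R-T-A.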

Sequence caution

Q95029: The sequence BAA06738 differs from that shown. Intron retention. (Curated)

Experimental Info

Feature key          Position(s)    Description                                    Evidence    Length
Sequence conflict    228            R → P in BAA06738 (PubMed:7851441).            Curated     1
Sequence conflict    255 – 257      KGT → RAQ in BAA06738 (PubMed:7851441).        Curated     3
Sequence conflict    277 – 281      AEAVA → PEPVP in BAA06738 (PubMed:7851441).    Curated     5
Sequence conflict    365            A → P in BAA06738 (PubMed:7851441).            Curated     1
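
Each sequence conflict names the canonical residues at a position and the residues reported in BAA06738, so the annotations can be verified and applied programmatically. This is a rough sketch under stated assumptions: `apply_conflicts` is a hypothetical helper, coordinates are 1-based and inclusive, and only residues 221-290 of the canonical sequence are included, which covers the first three conflicts. The conflict at position 365 lies outside this segment and is omitted.

```python
# Residues 221-290 of the canonical sequence, copied from the entry.
SEGMENT_START = 221
SEGMENT = (
    "GLMDNAFRYIKDNGGIDTEKSYPYEAIDDS"
    "CHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI"
)

# (start, end, canonical residues, residues in BAA06738)
CONFLICTS = [
    (228, 228, "R", "P"),
    (255, 257, "KGT", "RAQ"),
    (277, 281, "AEAVA", "PEPVP"),
]


def apply_conflicts(segment, segment_start, conflicts):
    """Check each canonical stretch, then substitute the conflicting residues.

    All substitutions here preserve length, so later coordinates stay valid.
    """
    residues = list(segment)
    for start, end, ref, alt in conflicts:
        lo = start - segment_start
        hi = end - segment_start + 1
        observed = "".join(residues[lo:hi])
        if observed != ref:
            raise ValueError(f"expected {ref} at {start}-{end}, found {observed}")
        residues[lo:hi] = list(alt)
    return "".join(residues)


variant_segment = apply_conflicts(SEGMENT, SEGMENT_START, CONFLICTS)
```

The reference check raises an error if a listed canonical stretch does not match the sequence, which guards against off-by-one mistakes in the coordinates.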

Alternative sequence

Feature key             Identifier    Position(s)    Description                               Length
Alternative sequence    VSP_021771    2 – 31         Missing in isoform A. (2 publications)    30

Sequence databases

EMBL / GenBank / DDBJ:
U75652 mRNA. Translation: AAB18345.1.
AF012089 Genomic DNA. Translation: AAB65749.1.
AE013599 Genomic DNA. Translation: AAF58311.1.
AE013599 Genomic DNA. Translation: AAM68565.1.
BT016071 mRNA. Translation: AAV36956.1.
D31970 Genomic DNA. Translation: BAA06738.1. Sequence problems.
RefSeq:
NP_523735.2. NM_079011.3. [Q95029-1]
NP_725347.1. NM_166026.3. [Q95029-2]
UniGene: Dm.7400.

Genome annotation databases

EnsemblMetazoa:
FBtr0087592; FBpp0086718; FBgn0013770. [Q95029-2]
FBtr0087593; FBpp0086719; FBgn0013770. [Q95029-1]
GeneID: 36546.
KEGG: dme:Dmel_CG6692.

Keywords - Coding sequence diversity

Alternative splicing

Entry information

Entry name: CATL_DROME
Accession: Primary (citable) accession number: Q95029
Secondary accession number(s): O97431, Q5U121
Entry history: Integrated into UniProtKB/Swiss-Prot: February 21, 2001
Last sequence update: November 28, 2006
Last modified: October 25, 2017
This is version 149 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. Peptidase families
    Classification of peptidase families and list of entries
  3. SIMILARITY comments
    Index of protein domains and families