Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Cathepsin B-like CP1

Gene

CP1

Organism
Giardia intestinalis (Giardia lamblia)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at transcript leveli

Functioni

Thiol protease which is required for parasite excystation and invasion of the proximal small intestine of the human host.1 Publication

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Active sitei105By similarity1
Active sitei249By similarity1
Active sitei270By similarity1

GO - Molecular functioni

Complete GO annotation...

Keywords - Molecular functioni

Hydrolase, Protease, Thiol protease

Protein family/group databases

MEROPSiC01.094.

Names & Taxonomyi

Protein namesi
Recommended name:
Cathepsin B-like CP1 (EC:3.4.22.-)
Alternative name(s):
Cathepsin B-like protease B1
Gene namesi
Name:CP1
OrganismiGiardia intestinalis (Giardia lamblia)
Taxonomic identifieri5741 [NCBI]
Taxonomic lineageiEukaryotaDiplomonadidaHexamitidaeGiardiinaeGiardia

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Vacuole

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_0000026165? – 303Cathepsin B-like CP1
Signal peptidei1 – 19Sequence analysisAdd BLAST19
PropeptideiPRO_000002616420 – ?Activation peptide

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Glycosylationi41N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi92 ↔ 119By similarity
Disulfide bondi102 ↔ 145By similarity
Disulfide bondi138 ↔ 181By similarity

Keywords - PTMi

Disulfide bond, Glycoprotein, Zymogen

Expressioni

Developmental stagei

To initiate infection, trophozoites emerge from a cyst in the host. Excystation is blocked by specific cysteine protease inhibitors. Vacuoles release it just prior to excystation.

Inductioni

Not expressed without supplemental iron.1 Publication

Interactioni

Protein-protein interaction databases

STRINGi184922.XP_001706373.1.

Structurei

3D structure databases

ProteinModelPortaliP92131.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

Belongs to the peptidase C1 family.PROSITE-ProRule annotation

Keywords - Domaini

Signal

Phylogenomic databases

eggNOGiKOG1543. Eukaryota.
COG4870. LUCA.

Family and domain databases

InterProiIPR025661. Pept_asp_AS.
IPR000169. Pept_cys_AS.
IPR025660. Pept_his_AS.
IPR013128. Peptidase_C1A.
IPR000668. Peptidase_C1A_C.
[Graphical view]
PANTHERiPTHR12411. PTHR12411. 1 hit.
PfamiPF00112. Peptidase_C1. 1 hit.
[Graphical view]
PRINTSiPR00705. PAPAIN.
SMARTiSM00645. Pept_C1. 1 hit.
[Graphical view]
PROSITEiPS00640. THIOL_PROTEASE_ASN. 1 hit.
PS00139. THIOL_PROTEASE_CYS. 1 hit.
PS00639. THIOL_PROTEASE_HIS. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P92131-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MALSLLLAVV CAKPLVSRAE LRRIQALNPP WKAGMPKRFE NVTEDEFRSM
60 70 80 90 100
LIRPDRLRAR SGSLPPISIT EVQELVDPIP PQFDFRDEYP QCVKPALDQG
110 120 130 140 150
SCGSCWAFSA IGVFGDRRCA MGIDKEAVSY SQQHLISCSL ENFGCDGGDF
160 170 180 190 200
QPTWSFLTFT GATTAECVKY VDYGHTVASP CPAVCDDGSP IQLYKAHGYG
210 220 230 240 250
QVSKSVPAIM GMLVAGGPLQ TMIVVYADLS YYESGVYKHT YGTINLGFHA
260 270 280 290 300
LEIVGYGTTD DGTDYWIIKN SWGPDWGENG YFRIVRGVNE CRIEDEIYAV

YLD
Length:303
Mass (Da):33,508
Last modified:June 16, 2003 - v3
Checksum:i53160EF586B6DA44
GO

Sequence cautioni

The sequence AAB58258 differs from that shown. Reason: Erroneous initiation.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti104S → E in AAB58258 (PubMed:9150143).Curated1
Sequence conflicti104S → E in CAC18646 (PubMed:9150143).Curated1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U83275 Genomic DNA. Translation: AAB58258.1. Different initiation.
AJ302011 Genomic DNA. Translation: CAC18646.1.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U83275 Genomic DNA. Translation: AAB58258.1. Different initiation.
AJ302011 Genomic DNA. Translation: CAC18646.1.

3D structure databases

ProteinModelPortaliP92131.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi184922.XP_001706373.1.

Protein family/group databases

MEROPSiC01.094.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Phylogenomic databases

eggNOGiKOG1543. Eukaryota.
COG4870. LUCA.

Family and domain databases

InterProiIPR025661. Pept_asp_AS.
IPR000169. Pept_cys_AS.
IPR025660. Pept_his_AS.
IPR013128. Peptidase_C1A.
IPR000668. Peptidase_C1A_C.
[Graphical view]
PANTHERiPTHR12411. PTHR12411. 1 hit.
PfamiPF00112. Peptidase_C1. 1 hit.
[Graphical view]
PRINTSiPR00705. PAPAIN.
SMARTiSM00645. Pept_C1. 1 hit.
[Graphical view]
PROSITEiPS00640. THIOL_PROTEASE_ASN. 1 hit.
PS00139. THIOL_PROTEASE_CYS. 1 hit.
PS00639. THIOL_PROTEASE_HIS. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiCATB1_GIAIN
AccessioniPrimary (citable) accession number: P92131
Secondary accession number(s): Q9GP25
Entry historyi
Integrated into UniProtKB/Swiss-Prot: June 16, 2003
Last sequence update: June 16, 2003
Last modified: October 5, 2016
This is version 73 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)

Miscellaneousi

Documents

  1. Peptidase families
    Classification of peptidase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.