P34308 (CAN_CAEEL) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 112.

Names and origin

Protein names
Recommended name: Calpain clp-1
EC=3.4.22.-
Gene names
Name: clp-1
ORF names: C06G4.2/C06G4.3
Organism: Caenorhabditis elegans [Reference proteome]
Taxonomic identifier: 6239 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Nematoda > Chromadorea > Rhabditida > Rhabditoidea > Rhabditidae > Peloderinae > Caenorhabditis

Protein attributes

Sequence length: 780 AA.
Sequence status: Complete.
Protein existence: Inferred from homology

General annotation (Comments)

Function

Calcium-regulated protease involved in necrotic cell death. Ref.3

Sequence similarities

Belongs to the peptidase C2 family.

Contains 1 calpain catalytic domain.

Alternative products

This entry describes 4 isoforms produced by alternative splicing.

Note: Experimental confirmation may be lacking for some isoforms.
Isoform a (identifier: P34308-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: No experimental confirmation available.
Isoform b (identifier: P34308-2)

The sequence of this isoform differs from the canonical sequence as follows:
     76-121: Missing.
     708-708: D → DMNN
Note: No experimental confirmation available.
Isoform c (identifier: P34308-3)

The sequence of this isoform differs from the canonical sequence as follows:
     77-113: SNYDQGGNGNSGDQQKRKRDMAKDLIGGIFDNVVNRK → IFIFKIVRQKFPKNSSSFFCVRKHLDSLKTSPCGLDQ
     114-780: Missing.
Note: No experimental confirmation available.
Isoform d (identifier: P34308-4)

The sequence of this isoform differs from the canonical sequence as follows:
     77-100: SNYDQGGNGNSGDQQKRKRDMAKD → GGGSGGGGGGNNIGSLVGSLIGGG
     101-121: Missing.
Note: No experimental confirmation available.
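
The isoform descriptions above are edits against the canonical sequence (isoform a), so each isoform can be rebuilt mechanically. The following is a minimal sketch, not UniProt's own tooling: the names `ISOFORM_VARIANTS`, `apply_variants` and `isoform_length` are hypothetical, coordinates are taken to be 1-based and inclusive as listed in this entry, and an empty replacement stands for "Missing".

```python
from typing import Dict, List, Tuple

# (start, end, replacement): 1-based inclusive coordinates on the canonical
# sequence; an empty replacement means the range is "Missing".
Variant = Tuple[int, int, str]

# Transcribed from the isoform descriptions in this entry.
ISOFORM_VARIANTS: Dict[str, List[Variant]] = {
    "b": [(76, 121, ""), (708, 708, "DMNN")],
    "c": [(77, 113, "IFIFKIVRQKFPKNSSSFFCVRKHLDSLKTSPCGLDQ"), (114, 780, "")],
    "d": [(77, 100, "GGGSGGGGGGNNIGSLVGSLIGGG"), (101, 121, "")],
}


def apply_variants(canonical: str, variants: List[Variant]) -> str:
    """Return the isoform sequence obtained by applying every variant."""
    seq = canonical
    # Work from the C-terminal end backwards so that coordinates of the
    # remaining (more N-terminal) variants are still valid.
    for start, end, replacement in sorted(variants, reverse=True):
        seq = seq[: start - 1] + replacement + seq[end:]
    return seq


def isoform_length(canonical_length: int, variants: List[Variant]) -> int:
    """Compute the isoform length from the variant list alone."""
    delta = sum(len(rep) - (end - start + 1) for start, end, rep in variants)
    return canonical_length + delta


if __name__ == "__main__":
    for name, variants in ISOFORM_VARIANTS.items():
        print(name, isoform_length(780, variants))
```

Passing the 780-residue canonical sequence to `apply_variants` yields the full isoform sequence; `isoform_length` needs only the canonical length, which is a quick way to cross-check the lengths reported in the Sequences section.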

Sequence annotation (Features)

Feature key | Position(s) | Length | Description | Feature identifier

Molecule processing

Chain | 1 – 780 | 780 | Calpain clp-1 | PRO_0000207734

Regions

Domain | 316 – 611 | 296 | Calpain catalytic
Compositional bias | 21 – 288 | 268 | Gly-rich

Sites

Active site | 371 | 1 | By similarity
Active site | 527 | 1 | By similarity
Active site | 551 | 1 | By similarity
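
The entry gives only the positions of the three active sites. As an illustrative sketch, the residues at those positions can be read off the canonical sequence rows shown in the Sequences section. The helper `residue_at` and the `ROWS` table are hypothetical names; the three rows are copied from this entry, and the layout of 60 residues per row is assumed from the display.

```python
# Rows of the canonical sequence display, keyed by the position of their
# first residue (60 residues per row, shown in groups of 10).
ROWS = {
    361: "FDVIQGELGD CWLLAAAANL TLKDELFYRV VPPDQSFTEN YAGIFHFQFW QYGKWVDVVI",
    481: "IDLKNPPRNL MQMMMRGFEM GSLFGCSIEA DPNVWEAKMS NGLVKGHAYS ITGCRIVDGP",
    541: "NGQTCILRIR NPWGNEQEWN GPWSDNSREW RSVPDSVKQD MGLKFDHDGE FWMSFDDFMR",
}

ROW_WIDTH = 60  # assumption: every full row holds 60 residues


def residue_at(position: int) -> str:
    """Return the residue at a 1-based position, if its row is in ROWS."""
    row_start = ((position - 1) // ROW_WIDTH) * ROW_WIDTH + 1
    row = ROWS[row_start].replace(" ", "")
    return row[position - row_start]


if __name__ == "__main__":
    for pos in (371, 527, 551):
        print(pos, residue_at(pos))
```

Under these assumptions the three positions resolve to Cys-371, His-527 and Asn-551.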

Natural variations

Alternative sequence | 76 – 121 | 46 | Missing in isoform b. | VSP_005247
Alternative sequence | 77 – 113 | 37 | SNYDQ…VVNRK → IFIFKIVRQKFPKNSSSFFCVRKHLDSLKTSPCGLDQ in isoform c. | VSP_005249
Alternative sequence | 77 – 100 | 24 | SNYDQ…DMAKD → GGGSGGGGGGNNIGSLVGSLIGGG in isoform d. | VSP_005248
Alternative sequence | 101 – 121 | 21 | Missing in isoform d. | VSP_005250
Alternative sequence | 114 – 780 | 667 | Missing in isoform c. | VSP_005251
Alternative sequence | 708 | 1 | D → DMNN in isoform b. | VSP_005252

Sequences

Isoform a [UniParc].

Last modified July 5, 2005. Version 4.
Checksum: C1DFA4E5671F7835
Length: 780 AA. Mass: 83,643 Da.
        10         20         30         40         50         60 
MADDEEEIIQ KVEVKPDEFN GLIGSIAGNL IRDKVGGAGG DILGGLASNF FGGGGGGGGG 

        70         80         90        100        110        120 
GGGGGFGGGN GGFGGGSNYD QGGNGNSGDQ QKRKRDMAKD LIGGIFDNVV NRKGKKEQDN 

       130        140        150        160        170        180 
YGGGGNYGGG GGNQGGGGGG GFNFNDIGGL INSMGGGGGG GQRQGGGGGG FGDILGGIGS 

       190        200        210        220        230        240 
LIGGGGGGQY NGGGGNVNPN NLNGGMVNVI GNLIGEAAHR FLGVDPGTGR IIGAVAGNVI 

       250        260        270        280        290        300 
MGLGGKDNSL GNIGKVILDN IISGKFRRDV DPFVRPGPDP DRGGGGSGPS PISPRPTTEP 

       310        320        330        340        350        360 
QDFYELRDQC LESKRLFEDP QFLANDSSLF FSKRPPKRVE WLRPGEITRE PQLITEGHSR 

       370        380        390        400        410        420 
FDVIQGELGD CWLLAAAANL TLKDELFYRV VPPDQSFTEN YAGIFHFQFW QYGKWVDVVI 

       430        440        450        460        470        480 
DDRLPTSNGE LLYMHSASNN EFWSALLEKA YAKLFGSYEA LKGGTTSEAL EDMTGGLTEF 

       490        500        510        520        530        540 
IDLKNPPRNL MQMMMRGFEM GSLFGCSIEA DPNVWEAKMS NGLVKGHAYS ITGCRIVDGP 

       550        560        570        580        590        600 
NGQTCILRIR NPWGNEQEWN GPWSDNSREW RSVPDSVKQD MGLKFDHDGE FWMSFDDFMR 

       610        620        630        640        650        660 
NFEKMEICNL GPDVMDEVYQ MTGVKAAGMV WAANTHDGAW VRNQTAGGCR NYINTFANNP 

       670        680        690        700        710        720 
QFRVQLTDSD PDDDDELCTV IFAVLQKYRR NLKQDGLDNV PIGFAVYDAG NNRGRLSKQF 

       730        740        750        760        770        780 
FAANKSAMRS AAFINLREMT GRFRVPPGNY VVVPSTFEPN EEAEFMLRVY TNGFIESEEL 
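
The sequence above is laid out for reading, with ruler lines of position numbers and residues in groups of ten. A small sketch of how such a block could be collapsed into one contiguous string for downstream use follows; `parse_sequence_block` is a hypothetical helper, and the example input is the first two rows of this entry.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Collapse a ruler-annotated sequence display into a plain string."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip blank lines and ruler lines that hold only position numbers.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        # Remove the spaces that separate the groups of ten residues.
        residues.append(re.sub(r"\s+", "", stripped))
    return "".join(residues)


example = """
        10         20         30         40         50         60
MADDEEEIIQ KVEVKPDEFN GLIGSIAGNL IRDKVGGAGG DILGGLASNF FGGGGGGGGG

        70         80         90        100        110        120
GGGGGFGGGN GGFGGGSNYD QGGNGNSGDQ QKRKRDMAKD LIGGIFDNVV NRKGKKEQDN
"""

sequence = parse_sequence_block(example)

if __name__ == "__main__":
    print(len(sequence))
    print(sequence[76:80])  # residues 77-80, where the isoform c/d changes begin
```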


Isoform b [UniParc].

Checksum: 0B91CF87AD6FA281
Length: 737 AA. Mass: 78,904 Da.

Isoform c [UniParc].

Checksum: 2DDF5D0C0A40C297
Length: 113 AA. Mass: 11,283 Da.

Isoform d [UniParc].

Checksum: 5CC728E8C5372689
Length: 759 AA. Mass: 80,442 Da.

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: FO080399 Genomic DNA. Translation: CCD63432.1.
     FO080399 Genomic DNA. Translation: CCD63433.1.
     FO080399 Genomic DNA. Translation: CCD63434.1.
     FO080399 Genomic DNA. Translation: CCD63435.1.
PIR: S44749.
     S44750.
RefSeq: NP_498740.2. NM_066339.5.
     NP_498741.3. NM_066340.4.
     NP_741237.1. NM_171886.4.
     NP_741238.1. NM_171201.1.
UniGene: Cel.17311.

3D structure databases

ProteinModelPortal: P34308.
SMR: P34308. Positions 26-80, 100-137, 166-258, 301-771.

Protein family/group databases

MEROPS: C02.A03.

Proteomic databases

PaxDb: P34308.
PRIDE: P34308.

Genome annotation databases

EnsemblMetazoa: C06G4.2a.1; C06G4.2a.1; C06G4.2. [P34308-1]
     C06G4.2a.2; C06G4.2a.2; C06G4.2. [P34308-1]
GeneID: 176122.
KEGG: cel:CELE_C06G4.2.
UCSC: C06G4.2a.1. c. elegans. [P34308-1]

Organism-specific databases

CTD: 176122.
WormBase: C06G4.2a; CE37743; WBGene00000542; clp-1.
     C06G4.2b; CE37744; WBGene00000542; clp-1.
     C06G4.2c; CE00517; WBGene00000542; clp-1.
     C06G4.2d; CE30486; WBGene00000542; clp-1.

Phylogenomic databases

eggNOG: NOG327523.
HOGENOM: HOG000020136.
InParanoid: P34308.
KO: K08585.
OMA: HSRFDVI.
OrthoDB: EOG7RV9FM.
PhylomeDB: P34308.

Family and domain databases

InterPro: IPR022684. Calpain_cysteine_protease.
     IPR022682. Calpain_domain_III.
     IPR022683. Calpain_III.
     IPR000169. Pept_cys_AS.
     IPR001300. Peptidase_C2_calpain_cat.
Pfam: PF01067. Calpain_III. 1 hit.
     PF00648. Peptidase_C2. 1 hit.
PRINTS: PR00704. CALPAIN.
SMART: SM00720. calpain_III. 1 hit.
     SM00230. CysPc. 1 hit.
SUPFAM: SSF49758. SSF49758. 1 hit.
PROSITE: PS50203. CALPAIN_CAT. 1 hit.
     PS00139. THIOL_PROTEASE_CYS. 1 hit.

Other

NextBio: 891208.

Entry information

Entry name: CAN_CAEEL
Accession
Primary (citable) accession number: P34308
Secondary accession number(s): P34309, Q5DX51, Q5DX52
Entry history
Integrated into UniProtKB/Swiss-Prot: February 1, 1994
Last sequence update: July 5, 2005
Last modified: April 16, 2014
This is version 112 of the entry and version 4 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Caenorhabditis annotation project

Relevant documents

SIMILARITY comments

Index of protein domains and families

Peptidase families

Classification of peptidase families and list of entries

Caenorhabditis elegans

Caenorhabditis elegans: entries, gene names and cross-references to WormBase