Q9VN93 (CPR1_DROME) Reviewed, UniProtKB/Swiss-Prot

Last modified March 19, 2014. Version 105.


Names and origin

Protein names: Recommended name: Putative cysteine proteinase CG12163
     EC=3.4.22.-
Gene names: ORF Names: CG12163
Organism: Drosophila melanogaster (Fruit fly) [Reference proteome]
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora

Protein attributes

Sequence length: 614 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

May have a role in autophagic cell death. Ref.4

Sequence similarities

Belongs to the peptidase C1 family.

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform A Ref.1 (identifier: Q9VN93-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: No experimental confirmation available.
Isoform B Ref.1 (identifier: Q9VN93-2)

The sequence of this isoform differs from the canonical sequence as follows:
     37-175: Missing.
Note: No experimental confirmation available.
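
The "37-175: Missing" record above can be read as a deletion of residues 37 through 175 (1-based, inclusive) from the canonical sequence. The following is a minimal sketch of that arithmetic, assuming this reading of the coordinates; the helper name `apply_missing` and the toy sequence are hypothetical and only illustrate the indexing.

```python
def apply_missing(sequence: str, start: int, end: int) -> str:
    """Return sequence with residues start..end (1-based, inclusive) removed."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("range lies outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[: start - 1] + sequence[end:]


CANONICAL_LENGTH = 614               # isoform A length stated in this entry
MISSING_START, MISSING_END = 37, 175  # region missing in isoform B

# Number of residues removed and the resulting isoform B length.
removed = MISSING_END - MISSING_START + 1
isoform_b_length = CANONICAL_LENGTH - removed

# Toy illustration of the same indexing on a made-up 10-letter string.
toy_result = apply_missing("ABCDEFGHIJ", 3, 5)  # removes C, D and E

print(removed, isoform_b_length, toy_result)
```

Under these assumptions the deletion removes 139 residues and leaves 475, which agrees with the length listed for isoform B in the Sequences section.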

Sequence annotation (Features)

Feature key           Position(s)  Length  Description                                            Feature identifier

Molecule processing

Signal peptide        1 – 20       20      Potential
Propeptide            21 – 393     373     Activation peptide (By similarity)                     PRO_0000026368
Chain                 394 – 614    221     Putative cysteine proteinase CG12163 UniProtKB Q9R013  PRO_0000026369
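
The three molecule-processing features above should cover the 614-residue precursor end to end with no gaps or overlaps. A small sketch of that consistency check follows; the names `FEATURES`, `feature_length`, and `tiles_sequence` are hypothetical, and the coordinates are assumed to be 1-based and inclusive as displayed.

```python
from typing import List, Tuple

Feature = Tuple[str, int, int]  # (feature key, start, end), 1-based inclusive

# Coordinates copied from the molecule-processing rows of this entry.
FEATURES: List[Feature] = [
    ("Signal peptide", 1, 20),
    ("Propeptide", 21, 393),
    ("Chain", 394, 614),
]


def feature_length(start: int, end: int) -> int:
    """Length of an inclusive 1-based range."""
    return end - start + 1


def tiles_sequence(features: List[Feature], total_length: int) -> bool:
    """True if the features cover 1..total_length with no gap or overlap."""
    expected_start = 1
    for _, start, end in sorted(features, key=lambda f: f[1]):
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


lengths = {name: feature_length(start, end) for name, start, end in FEATURES}
print(lengths)
print(tiles_sequence(FEATURES, 614))
```

The computed lengths (20, 373, 221) match the Length column and sum to 614.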

Sites

Active site           418          1       By similarity UniProtKB Q9R013
Active site           555          1       By similarity UniProtKB Q9R013
Active site           581          1       By similarity UniProtKB Q9R013

Amino acid modifications

Glycosylation         151          1       N-linked (GlcNAc...) Potential
Glycosylation         492          1       N-linked (GlcNAc...) Potential
Glycosylation         510          1       N-linked (GlcNAc...) Potential
Disulfide bond        415 ↔ 456            By similarity UniProtKB Q9R013
Disulfide bond        449 ↔ 489            By similarity UniProtKB Q9R013
Disulfide bond        548 ↔ 602            By similarity UniProtKB Q9R013

Natural variations

Alternative sequence  37 – 175     139     Missing in isoform B. Ref.1                            VSP_050566

Sequences

Isoform A [UniParc].

Last modified July 11, 2003. Version 2.
Checksum: C44D32383375E032
Length: 614 AA. Mass: 68,961 Da.
        10         20         30         40         50         60 
MRLFAAATVA LVLLLGQAAG EELAEERAGQ AQGDAESTES SETTTDQAVS EPPITLVHVL 

        70         80         90        100        110        120 
NPGEREYLSP NLIGVQNIAM TFLPLSMNFV NIIDAFREIT AGVRYEILLN ALDTKAIQPA 

       130        140        150        160        170        180 
EADIVCRLVI LEKPWLRTQW GDKHRELVTS NCTDPAVNSV AGDPAEKARL LNEKYVHRSR 

       190        200        210        220        230        240 
RSANDILGRH KPYDEEAAKA QLQKSLDKLT AGEGPHYKIV KVYSASRQVD SGILTRIDAD 

       250        260        270        280        290        300 
LIDGSEEQHR CIVDIWTKVW VRKDEHEITF KCRNQPVVQA RHTRSVEWAE KKTHKKHSHR 

       310        320        330        340        350        360 
FDKVDHLFYK FQVRFGRRYV STAERQMRLR IFRQNLKTIE ELNANEMGSA KYGITEFADM 

       370        380        390        400        410        420 
TSSEYKERTG LWQRDEAKAT GGSAAVVPAY HGELPKEFDW RQKDAVTQVK NQGSCGSCWA 

       430        440        450        460        470        480 
FSVTGNIEGL YAVKTGELKE FSEQELLDCD TTDSACNGGL MDNAYKAIKD IGGLEYEAEY 

       490        500        510        520        530        540 
PYKAKKNQCH FNRTLSHVQV AGFVDLPKGN ETAMQEWLLA NGPISIGINA NAMQFYRGGV 

       550        560        570        580        590        600 
SHPWKALCSK KNLDHGVLVV GYGVSDYPNF HKTLPYWIVK NSWGPRWGEQ GYYRVYRGDN 

       610 
TCGVSEMATS AVLA 
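
The grouped display above can be collapsed into a plain string and checked against the positional annotations in this entry. The sketch below is illustrative only: the helper names are hypothetical, the ruler lines are omitted from the pasted block, and the N-glycosylation test uses the general N-X-S/T sequon rule (X not proline) rather than anything specific to this entry.

```python
import re

# Residue rows copied from the Isoform A display; ruler lines omitted.
SEQUENCE_BLOCK = """
MRLFAAATVA LVLLLGQAAG EELAEERAGQ AQGDAESTES SETTTDQAVS EPPITLVHVL
NPGEREYLSP NLIGVQNIAM TFLPLSMNFV NIIDAFREIT AGVRYEILLN ALDTKAIQPA
EADIVCRLVI LEKPWLRTQW GDKHRELVTS NCTDPAVNSV AGDPAEKARL LNEKYVHRSR
RSANDILGRH KPYDEEAAKA QLQKSLDKLT AGEGPHYKIV KVYSASRQVD SGILTRIDAD
LIDGSEEQHR CIVDIWTKVW VRKDEHEITF KCRNQPVVQA RHTRSVEWAE KKTHKKHSHR
FDKVDHLFYK FQVRFGRRYV STAERQMRLR IFRQNLKTIE ELNANEMGSA KYGITEFADM
TSSEYKERTG LWQRDEAKAT GGSAAVVPAY HGELPKEFDW RQKDAVTQVK NQGSCGSCWA
FSVTGNIEGL YAVKTGELKE FSEQELLDCD TTDSACNGGL MDNAYKAIKD IGGLEYEAEY
PYKAKKNQCH FNRTLSHVQV AGFVDLPKGN ETAMQEWLLA NGPISIGINA NAMQFYRGGV
SHPWKALCSK KNLDHGVLVV GYGVSDYPNF HKTLPYWIVK NSWGPRWGEQ GYYRVYRGDN
TCGVSEMATS AVLA
"""


def parse_sequence_block(block: str) -> str:
    """Drop spaces, digits and line breaks, keeping only residue letters."""
    return re.sub(r"[^A-Z]", "", block)


def residue_at(seq: str, position: int) -> str:
    """Residue at a 1-based position."""
    return seq[position - 1]


def is_n_glyc_sequon(seq: str, position: int) -> bool:
    """True if position starts an N-X-S/T motif with X not proline."""
    i = position - 1
    return (
        i + 2 < len(seq)
        and seq[i] == "N"
        and seq[i + 1] != "P"
        and seq[i + 2] in "ST"
    )


sequence = parse_sequence_block(SEQUENCE_BLOCK)

# Positions copied from the Sites and Amino acid modifications rows.
ACTIVE_SITES = [418, 555, 581]
DISULFIDES = [(415, 456), (449, 489), (548, 602)]
GLYCOSYLATION = [151, 492, 510]

active_site_residues = {p: residue_at(sequence, p) for p in ACTIVE_SITES}
disulfide_ok = {
    pair: all(residue_at(sequence, p) == "C" for p in pair)
    for pair in DISULFIDES
}
sequon_ok = {p: is_n_glyc_sequon(sequence, p) for p in GLYCOSYLATION}

print(len(sequence))
print(active_site_residues)
print(disulfide_ok)
print(sequon_ok)
```

With the sequence as displayed, the annotated active-site positions fall on Cys, His, and Asn, every disulfide partner is a cysteine, and each glycosylation site starts an N-X-S/T motif.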


Isoform B [UniParc].

Checksum: 130DE83869498EF1
Length: 475 AA. Mass: 53,545 Da.

References

[1] "The genome sequence of Drosophila melanogaster."
Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.
Science 287:2185-2195(2000) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Berkeley.
[2] "Annotation of the Drosophila melanogaster euchromatic genome: a systematic review."
Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.
Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002) [PubMed] [Europe PMC] [Abstract]
Cited for: GENOME REANNOTATION, ALTERNATIVE SPLICING.
Strain: Berkeley.
[3] "A Drosophila full-length cDNA resource."
Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A., Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M., Celniker S.E.
Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM A).
Strain: Berkeley.
Tissue: Head, Larva and Pupae.
[4] "A SAGE approach to discovery of genes involved in autophagic cell death."
Gorski S.M., Chittaranjan S., Pleasance E.D., Freeman J.D., Anderson C.L., Varhol R.J., Coughlin S.M., Zuyderduyn S.D., Jones S.J.M., Marra M.A.
Curr. Biol. 13:358-363(2003) [PubMed] [Europe PMC] [Abstract]
Cited for: IDENTIFICATION.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
     AE014297 Genomic DNA. Translation: AAF52055.2.
     AE014297 Genomic DNA. Translation: AAN13266.1.
     AY121614 mRNA. Translation: AAM51941.1. Sequence problems.
     BT003231 mRNA. Translation: AAO24986.1.
RefSeq:
     NP_649521.1. NM_141264.4.
     NP_730901.1. NM_169033.4.
     NP_730902.2. NM_169034.4.
UniGene: Dm.7315.

3D structure databases

ProteinModelPortal: Q9VN93.
SMR: Q9VN93. Positions 86-154, 318-613.

Protein-protein interaction databases

BioGrid: 65841. 4 interactions.
DIP: DIP-17491N.
IntAct: Q9VN93. 1 interaction.
MINT: MINT-763966.

Protein family/group databases

MEROPS: C01.A27.

Proteomic databases

PaxDb: Q9VN93.
PRIDE: Q9VN93.

Genome annotation databases

EnsemblMetazoa: FBtr0078823; FBpp0078465; FBgn0260462.
GeneID: 40628.
KEGG: dme:Dmel_CG12163.
UCSC: CG12163-RA. d. melanogaster. [Q9VN93-1]

Organism-specific databases

FlyBase: FBgn0260462. CG12163.

Phylogenomic databases

eggNOG: COG4870.
GeneTree: ENSGT00740000114856.
InParanoid: Q9VN93.
KO: K01373.
OMA: GPRWGEQ.
OrthoDB: EOG7DJSKG.
PhylomeDB: Q9VN93.

Gene expression databases

Bgee: Q9VN93.

Family and domain databases

InterPro:
     IPR025661. Pept_asp_AS.
     IPR000169. Pept_cys_AS.
     IPR025660. Pept_his_AS.
     IPR013128. Peptidase_C1A.
     IPR000668. Peptidase_C1A_C.
     IPR000010. Prot_inh_cystat.
     IPR013201. Prot_inhib_I29.
PANTHER: PTHR12411. PTHR12411. 1 hit.
Pfam:
     PF08246. Inhibitor_I29. 1 hit.
     PF00112. Peptidase_C1. 1 hit.
PRINTS: PR00705. PAPAIN.
SMART:
     SM00043. CY. 1 hit.
     SM00848. Inhibitor_I29. 1 hit.
     SM00645. Pept_C1. 1 hit.
PROSITE:
     PS00640. THIOL_PROTEASE_ASN. 1 hit.
     PS00139. THIOL_PROTEASE_CYS. 1 hit.
     PS00639. THIOL_PROTEASE_HIS. 1 hit.

Other

GenomeRNAi: 40628.
NextBio: 819744.
PRO: Q9VN93.

Entry information

Entry name: CPR1_DROME
Accession:
     Primary (citable) accession number: Q9VN93
     Secondary accession number(s): Q867H7, Q9VN92
Entry history:
     Integrated into UniProtKB/Swiss-Prot: July 11, 2003
     Last sequence update: July 11, 2003
     Last modified: March 19, 2014
     This is version 105 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Relevant documents

SIMILARITY comments

Index of protein domains and families

Peptidase families

Classification of peptidase families and list of entries

Drosophila

Drosophila: entries, gene names and cross-references to FlyBase