Q18253 (DPF2_CAEEL) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 95.

Names and origin

Protein names
  Recommended name: Dipeptidyl peptidase family member 2
  EC=3.4.14.-
Gene names
  Name: dpf-2
  ORF Names: C27C12.7
Organism: Caenorhabditis elegans [Reference proteome]
Taxonomic identifier: 6239 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Nematoda > Chromadorea > Rhabditida > Rhabditoidea > Rhabditidae > Peloderinae > Caenorhabditis

Protein attributes

Sequence length: 829 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Removes N-terminal dipeptides sequentially from polypeptides (By similarity). Essential for control of distal tip cell migration (Ref.3).

Subcellular location

Cell membrane; Single-pass type II membrane protein (By similarity).

Sequence similarities

Belongs to the peptidase S9B family. DPPIV subfamily.

Sequence annotation (Features)

Feature key         Position(s)  Length  Description                                                      Feature identifier

Molecule processing

Chain               1 – 829      829     Dipeptidyl peptidase family member 2                             PRO_0000248535

Regions

Topological domain  1 – 27       27      Cytoplasmic (Potential)
Transmembrane       28 – 48      21      Helical; Signal-anchor for type II membrane protein (Potential)
Topological domain  49 – 829     781     Extracellular (Potential)

Sites

Active site         691          1       Charge relay system (By similarity)
Active site         768          1       Charge relay system (By similarity)
Active site         800          1       Charge relay system (By similarity)

Amino acid modifications

Glycosylation       61           1       N-linked (GlcNAc...) (Potential)
Glycosylation       66           1       N-linked (GlcNAc...) (Potential)
Glycosylation       183          1       N-linked (GlcNAc...) (Potential)
Glycosylation       209          1       N-linked (GlcNAc...) (Potential)
Glycosylation       314          1       N-linked (GlcNAc...) (Ref.2, Ref.4)
Glycosylation       359          1       N-linked (GlcNAc...) (Potential)
Glycosylation       754          1       N-linked (GlcNAc...) (Potential)
Disulfide bond      514 ↔ 533                                 (By similarity)
Disulfide bond      711 ↔ 821                                 (By similarity)
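
The position and length columns in the feature table are related by length = end - start + 1, and the three topology regions should cover the 829-residue chain with no gaps or overlaps. The following is a minimal Python sketch of that consistency check, using coordinates copied from the table. The names `REGIONS`, `region_length`, and `tiles_chain` are illustrative and are not part of any UniProt tooling.

```python
# Region coordinates copied from the feature table (1-based, inclusive).
# Each tuple: (label, start, end, stated length). Labels are illustrative.
CHAIN_LENGTH = 829
REGIONS = [
    ("Topological domain (cytoplasmic)", 1, 27, 27),
    ("Transmembrane", 28, 48, 21),
    ("Topological domain (extracellular)", 49, 829, 781),
]


def region_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def tiles_chain(regions, chain_length):
    """Return True if the regions cover 1..chain_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end, _ in sorted(regions, key=lambda r: r[1]):
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == chain_length + 1


if __name__ == "__main__":
    for label, start, end, stated in REGIONS:
        print(label, "length consistent:", region_length(start, end) == stated)
    print("regions tile the chain:", tiles_chain(REGIONS, CHAIN_LENGTH))
```

The same interval arithmetic applies to any feature row that gives a start and end position; single-residue features such as active sites have length 1.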

Sequences

Q18253 [UniParc].
Length: 829 AA. Mass: 94,387 Da.

Last modified November 1, 1996. Version 1.
Checksum: B3F76F6DC12E44A5

        10         20         30         40         50         60 
MENDNYDVEE QGCSVFNGKH GYFARSCCVV FILIICVIFV FSVIFTFMQN PINLNSDNGF 

        70         80         90        100        110        120 
NQTSGNTSSL EATTLKPKFS SLMTTTRRFT FEQLFSGKQF LVDYYDYIWL PDGSFVQMND 

       130        140        150        160        170        180 
DFTIRKQMKK IPLGSSVAEP FFNNGEYVKA LSSNMKYAYG SKKVNELWRH SAEYLYHIVK 

       190        200        210        220        230        240 
INNKTVSTEQ WHVGPEENSL IQAFYWNPNA SSNDFVYVHN YNLYYQKDPE KPDGAIQLTV 

       250        260        270        280        290        300 
GGSTFNRFGL ANWLYEEEIL EASSAVWWSP SGRYVSYLRF DDREVNRIFL PKYTDDDSYV 

       310        320        330        340        350        360 
EYFELPYPKA GVQNNTLVTQ YIWDSENHKI VETAPPNELS AANGDYYVLT NKWITMPRNG 

       370        380        390        400        410        420 
SDLGEERLVT VWANRDQNHV YFSLCNEQDC VMALSFQFSI DNRQLWVSPK DVRGVFPTET 

       430        440        450        460        470        480 
GFLTVLPHKH DDGNIYNHVA HVELDGTGTG KITKWIGENF DVILVLGYSS KIDALTFSAY 

       490        500        510        520        530        540 
GDGVGEFSTY IVREAMYSNK KTTLQKVTDQ FEDCKTLGSQ SADPTGQRIV VQCEKPFDNT 

       550        560        570        580        590        600 
RLYLVDVVDT TKKIMLEGGT KAVIPFDVPN MKFGKLKLPS GIDGHYMMLT PANLLDGAKI 

       610        620        630        640        650        660 
PLLLDIYGGP DSKQVFQKTP TAHAIQIVSQ YDIAYARIDV RGTGGRGWDV KEAVYRKLGD 

       670        680        690        700        710        720 
AEVVDTLDMI RAFINTFGFI DEDRIAVMGW SYGGFLTSKI AIKDQGELVK CAISIAPVTD 

       730        740        750        760        770        780 
FKYYDSAYTE RYLGQPAENL QGYINTNVIP HARNVTNVKY LLAHGERDDN VHYQNSARWS 

       790        800        810        820 
EALQQNGIHF TQLVYANEAH SLSHKLFHLY GEVQRFLMND CFKSNLDLL 
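
The annotated positions in this entry can be cross-checked against the sequence itself. The sketch below strips the ruler digits and spacing from the sequence block, then looks up residues at the annotated active-site, glycosylation, and disulfide positions. It assumes the commonly used N-glycosylation consensus N-X-S/T with X not proline, which this page does not state explicitly; the function names are illustrative.

```python
import re

# Residue lines copied from the sequence block above (rulers omitted).
SEQUENCE_BLOCK = """
MENDNYDVEE QGCSVFNGKH GYFARSCCVV FILIICVIFV FSVIFTFMQN PINLNSDNGF
NQTSGNTSSL EATTLKPKFS SLMTTTRRFT FEQLFSGKQF LVDYYDYIWL PDGSFVQMND
DFTIRKQMKK IPLGSSVAEP FFNNGEYVKA LSSNMKYAYG SKKVNELWRH SAEYLYHIVK
INNKTVSTEQ WHVGPEENSL IQAFYWNPNA SSNDFVYVHN YNLYYQKDPE KPDGAIQLTV
GGSTFNRFGL ANWLYEEEIL EASSAVWWSP SGRYVSYLRF DDREVNRIFL PKYTDDDSYV
EYFELPYPKA GVQNNTLVTQ YIWDSENHKI VETAPPNELS AANGDYYVLT NKWITMPRNG
SDLGEERLVT VWANRDQNHV YFSLCNEQDC VMALSFQFSI DNRQLWVSPK DVRGVFPTET
GFLTVLPHKH DDGNIYNHVA HVELDGTGTG KITKWIGENF DVILVLGYSS KIDALTFSAY
GDGVGEFSTY IVREAMYSNK KTTLQKVTDQ FEDCKTLGSQ SADPTGQRIV VQCEKPFDNT
RLYLVDVVDT TKKIMLEGGT KAVIPFDVPN MKFGKLKLPS GIDGHYMMLT PANLLDGAKI
PLLLDIYGGP DSKQVFQKTP TAHAIQIVSQ YDIAYARIDV RGTGGRGWDV KEAVYRKLGD
AEVVDTLDMI RAFINTFGFI DEDRIAVMGW SYGGFLTSKI AIKDQGELVK CAISIAPVTD
FKYYDSAYTE RYLGQPAENL QGYINTNVIP HARNVTNVKY LLAHGERDDN VHYQNSARWS
EALQQNGIHF TQLVYANEAH SLSHKLFHLY GEVQRFLMND CFKSNLDLL
"""

# Positions copied from the feature table (1-based).
ACTIVE_SITES = (691, 768, 800)
GLYCOSYLATION_SITES = (61, 66, 183, 209, 314, 359, 754)
DISULFIDE_BONDS = ((514, 533), (711, 821))


def parse_sequence_block(block):
    """Drop ruler digits and whitespace, leaving the bare one-letter sequence."""
    return re.sub(r"[\d\s]", "", block)


def residue(seq, pos):
    """Residue at a 1-based position."""
    return seq[pos - 1]


def has_sequon(seq, pos):
    """True if an N-X-S/T motif (X not P) starts at the 1-based position."""
    return re.fullmatch(r"N[^P][ST]", seq[pos - 1:pos + 2]) is not None


if __name__ == "__main__":
    seq = parse_sequence_block(SEQUENCE_BLOCK)
    print("length:", len(seq))
    print("active-site residues:", [residue(seq, p) for p in ACTIVE_SITES])
    print("sequons present:", [has_sequon(seq, p) for p in GLYCOSYLATION_SITES])
    print("disulfide residues:",
          [(residue(seq, a), residue(seq, b)) for a, b in DISULFIDE_BONDS])
```

A lookup like this only confirms that the annotated coordinates point at plausible residue types; it says nothing about whether a site is actually modified or catalytically active.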

References

[1] "Genome sequence of the nematode C. elegans: a platform for investigating biology."
The C. elegans sequencing consortium
Science 282:2012-2018(1998)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Bristol N2.
[2] "Lectin affinity capture, isotope-coded tagging and mass spectrometry to identify N-linked glycoproteins."
Kaji H., Saito H., Yamauchi Y., Shinkawa T., Taoka M., Hirabayashi J., Kasai K., Takahashi N., Isobe T.
Nat. Biotechnol. 21:667-672(2003)
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-314, IDENTIFICATION BY MASS SPECTROMETRY.
Strain: Bristol N2.
[3] "Dipeptidyl peptidase IV-like protease family is essential for control of distal tip cell migration in C. elegans."
Yoshina S., Gengyo-Ando K., Mitani S., Iino Y., Inoue H., Takahashi K.
(In) Proceedings of the 15th international C. elegans meeting, pp.402-402, Los Angeles (2005)
Cited for: FUNCTION.
[4] "Proteomics reveals N-linked glycoprotein diversity in Caenorhabditis elegans and suggests an atypical translocation mechanism for integral membrane proteins."
Kaji H., Kamiie J., Kawakami H., Kido K., Yamauchi Y., Shinkawa T., Taoka M., Takahashi N., Isobe T.
Mol. Cell. Proteomics 6:2100-2109(2007)
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-314, IDENTIFICATION BY MASS SPECTROMETRY.
Strain: Bristol N2.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: Z69883 Genomic DNA. Translation: CAA93743.1.
PIR: T19514.
RefSeq: NP_510461.1. NM_078060.6.
UniGene: Cel.5433.

3D structure databases

ProteinModelPortal: Q18253.
SMR: Q18253. Positions 87-823.

Protein-protein interaction databases

STRING: 6239.C27C12.7.

Protein family/group databases

MEROPS: S09.A74.

Proteomic databases

PaxDb: Q18253.
PRIDE: Q18253.

Genome annotation databases

EnsemblMetazoa: C27C12.7; C27C12.7; C27C12.7.
GeneID: 181579.
KEGG: cel:CELE_C27C12.7.
UCSC: C27C12.7. c. elegans.

Organism-specific databases

CTD: 181579.
WormBase: C27C12.7; CE05324; WBGene00001055; dpf-2.

Phylogenomic databases

eggNOG: COG1506.
HOGENOM: HOG000018328.
InParanoid: Q18253.
OMA: CAISIAP.
OrthoDB: EOG71CFKC.
PhylomeDB: Q18253.

Family and domain databases

Gene3D: 2.140.10.30. 1 hit.
InterPro: IPR001375. Peptidase_S9.
          IPR002469. Peptidase_S9B.
Pfam: PF00930. DPPIV_N. 1 hit.
      PF00326. Peptidase_S9. 1 hit.

Other

NextBio: 914522.

Entry information

Entry name: DPF2_CAEEL
Accession: Primary (citable) accession number: Q18253
Entry history:
  Integrated into UniProtKB/Swiss-Prot: September 5, 2006
  Last sequence update: November 1, 1996
  Last modified: April 16, 2014
  This is version 95 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Caenorhabditis annotation project

Relevant documents

SIMILARITY comments

Index of protein domains and families

Peptidase families

Classification of peptidase families and list of entries

Caenorhabditis elegans

Caenorhabditis elegans: entries, gene names and cross-references to WormBase