Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q20176 (NAS39_CAEEL) Reviewed, UniProtKB/Swiss-Prot

Last modified December 14, 2011. Version 85. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Zinc metalloproteinase nas-39

EC=3.4.24.21
Alternative name(s):
Nematode astacin 39
Gene names
Name:nas-39
ORF Names:F38E9.2
OrganismCaenorhabditis elegans
Taxonomic identifier6239 [NCBI]
Taxonomic lineageEukaryotaMetazoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis

Protein attributes

Sequence length951 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at transcript level

General annotation (Comments)

Function

Probable metalloprotease By similarity.

Catalytic activity

Hydrolysis of peptide bonds in substrates containing five or more amino acids, preferentially with Ala in P1', and Pro in P2'.

Cofactor

Binds 1 zinc ion per subunit By similarity.

Subcellular location

Secreted Potential.

Sequence similarities

Belongs to the peptidase M12A family.

Contains 5 CUB domains.

Contains 2 EGF-like domains.

Ontologies

Keywords
   Cellular componentSecreted
   DomainEGF-like domain
Repeat
Signal
   LigandMetal-binding
Zinc
   Molecular functionHydrolase
Metalloprotease
Protease
   PTMDisulfide bond
Glycoprotein
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Biological processproteolysis

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentextracellular region

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functioncalcium ion binding

Inferred from electronic annotation. Source: InterPro

metalloendopeptidase activity

Inferred from electronic annotation. Source: InterPro

zinc ion binding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 3030 Potential
Chain31 – 951921Zinc metalloproteinase nas-39
PRO_0000028943

Regions

Domain249 – 381133CUB 1
Domain382 – 499118CUB 2
Domain499 – 53941EGF-like 1; calcium-binding Potential
Domain542 – 648107CUB 3
Domain648 – 68841EGF-like 2; calcium-binding Potential
Domain692 – 804113CUB 4
Domain805 – 923119CUB 5

Sites

Active site1421 By similarity
Metal binding1411Zinc; catalytic By similarity
Metal binding1451Zinc; catalytic By similarity
Metal binding1511Zinc; catalytic By similarity

Amino acid modifications

Glycosylation691N-linked (GlcNAc...) Potential
Glycosylation871N-linked (GlcNAc...) Potential
Glycosylation2831N-linked (GlcNAc...) Potential
Glycosylation3171N-linked (GlcNAc...) Potential
Glycosylation5501N-linked (GlcNAc...) Potential
Glycosylation5831N-linked (GlcNAc...) Potential
Glycosylation7171N-linked (GlcNAc...) Potential
Disulfide bond249 ↔ 268 By similarity
Disulfide bond382 ↔ 408 By similarity
Disulfide bond435 ↔ 462 By similarity
Disulfide bond503 ↔ 514 By similarity
Disulfide bond510 ↔ 523 By similarity
Disulfide bond525 ↔ 538 By similarity
Disulfide bond542 ↔ 568 By similarity
Disulfide bond596 ↔ 610 By similarity
Disulfide bond652 ↔ 663 By similarity
Disulfide bond659 ↔ 672 By similarity
Disulfide bond674 ↔ 687 By similarity
Disulfide bond692 ↔ 718 By similarity
Disulfide bond745 ↔ 767 By similarity
Disulfide bond805 ↔ 835 By similarity
Disulfide bond863 ↔ 886 By similarity

Sequences

Sequence LengthMass (Da)Tools
Q20176 [UniParc].

Last modified October 1, 2002. Version 3.
Checksum: B5D2A0B258163613

FASTA951107,534
        10         20         30         40         50         60 
MRFSANIAII VNIIFLFIVV EFVLPTFIRS GDVRFRRYYR NNGRVSRAAT AKKERIWPEG 

        70         80         90        100        110        120 
IIPFVIASNF SGEHQHLFLR AMRHWENFTC VSFVPRQPHH KHYITFTVDK CGCCSYVGRR 

       130        140        150        160        170        180 
GEGPQAISIG KNCDKFGIVV HELGHVVGFW HEHTRPDRDM YVDIFYKSIQ TGQDYNFEKS 

       190        200        210        220        230        240 
KPEEVDSLGE PYDFSSIMHY ARDTFSRGAF YDTILPKPNS GFRLEIGQRV QLSEGDIRQT 

       250        260        270        280        290        300 
KKLYKCAECG GTLMQESGNL AIQHAGVCTW HIISPQGHTI FLNITGGLKL IIMIIFLTDE 

       310        320        330        340        350        360 
KLEDLTEIFK PNILKKNQTY LHWKTFLIYY TNSFFHFEIL DRICGGDSLF RTIASSGNRM 

       370        380        390        400        410        420 
LIQVRSSTPA ASLPFATYYA ICGGPIYANE GVIHSPKYPE SYPPNSDCQW TIHVDENSQV 

       430        440        450        460        470        480 
AIEFVYFHLE QHKECIYDRL ILTEGISKNS KKDGKEMSET FCGLIEKKTI VSKTNQISLR 

       490        500        510        520        530        540 
FFSDNSVQKT GFELRFTKEL NECATDKNIC HHYCVNTVGG FKCACRVGYS LSSNGFSCDS 

       550        560        570        580        590        600 
TCGGYLKASN GSISSPNFPE MYPNSKTCIW EIEAPDGYHI FLNFTKFNVE GMKTECAYDY 

       610        620        630        640        650        660 
VKIGDSEKLC GEYHEALLFT TPRNRVRIEF SSDSSVERDG FFANFIADFD ECQNDNAGCE 

       670        680        690        700        710        720 
HTCQNRLGSY VCTCNPGYIL AEDKHNCKEG SCFFEVNAPA GDINSPNYPN DYPKGQNCSW 

       730        740        750        760        770        780 
HFVTTPGHRL MLTFSSFQVE EHAQCKYDAV SVYDGGDGSA QLAGVFCGLA PPPLLLSSSN 

       790        800        810        820        830        840 
ELYLTFSSDA SVSRRGFQAH YTSLCGGRLT AESTPGHIYS HATFSDSKYG KNQDCSWIVR 

       850        860        870        880        890        900 
AKSPGRGVRI QFSTFNIESE EGCQYDYIEI YDGPEATLER LVGRFCGDTS PEVITSTGPE 

       910        920        930        940        950 
LLLIMHTDNA EEEKGFVAEY REAPRSSSTK RTFVSKTRHS PLEEPIHDRN E 

« Hide

References

« Hide 'large scale' references
[1]"Genome sequence of the nematode C. elegans: a platform for investigating biology."
The C. elegans sequencing consortium
Science 282:2012-2018(1998) [PubMed: 9851916] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Bristol N2.
[2]"The astacin protein family in Caenorhabditis elegans."
Moehrlen F., Hutter H., Zwilling R.
Eur. J. Biochem. 270:4909-4920(2003) [PubMed: 14653817] [Abstract]
Cited for: IDENTIFICATION, NOMENCLATURE.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
FO081327 Genomic DNA. Translation: CCD70820.1.
PIRT30018.
RefSeqNP_510672.2. NM_078271.3.
UniGeneCel.8940.

3D structure databases

ProteinModelPortalQ20176.
SMRQ20176. Positions 48-947.
ModBaseSearch...

Protein family/group databases

MEROPSM12.A24.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaF38E9.2; F38E9.2; F38E9.2.
GeneID3565986.
KEGGcel:F38E9.2.
NMPDRfig|6239.3.peg.25588.
UCSCF38E9.2. c. elegans.

Organism-specific databases

CTD3565986.
WormBaseF38E9.2; CE30977; WBGene00003555; nas-39.

Phylogenomic databases

eggNOGmeNOG04697.
GeneTreeEMGT00050000000216.
HOGENOMHBG355261.
InParanoidQ20176.
OMAIMHYARD.
PhylomeDBQ20176.

Gene expression databases

ArrayExpressQ20176.

Family and domain databases

InterProIPR015446. BMP_1/tolloid-like.
IPR000859. CUB.
IPR001881. EGF-like_Ca-bd.
IPR013032. EGF-like_reg_CS.
IPR000152. EGF-type_Asp/Asn_hydroxyl_site.
IPR000742. EGF_3.
IPR018097. EGF_Ca-bd_CS.
IPR024079. MetalloPept_cat_dom.
IPR001506. Peptidase_M12A.
IPR006026. Peptidase_Metallo.
[Graphical view]
Gene3DG3DSA:2.60.120.290. CUB. 5 hits.
G3DSA:3.40.390.10. G3DSA:3.40.390.10. 1 hit.
KOK08076.
PfamPF01400. Astacin. 1 hit.
PF00431. CUB. 4 hits.
PF07645. EGF_CA. 2 hits.
[Graphical view]
PIRSFPIRSF001199. BMP_1/tolloid-like. 1 hit.
PRINTSPR00480. ASTACIN.
SMARTSM00042. CUB. 5 hits.
SM00179. EGF_CA. 2 hits.
SM00235. ZnMc. 1 hit.
[Graphical view]
SUPFAMSSF49854. CUB. 5 hits.
PROSITEPS00010. ASX_HYDROXYL. 2 hits.
PS01180. CUB. 5 hits.
PS00022. EGF_1. False negative.
PS01186. EGF_2. 2 hits.
PS50026. EGF_3. 2 hits.
PS01187. EGF_CA. 2 hits.
PS00142. ZINC_PROTEASE. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio958633.

Entry information

Entry nameNAS39_CAEEL
AccessionPrimary (citable) accession number: Q20176
Entry history
Integrated into UniProtKB/Swiss-Prot: January 4, 2005
Last sequence update: October 1, 2002
Last modified: December 14, 2011
This is version 85 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programCaenorhabditis annotation project

Relevant documents

Peptidase families

Classification of peptidase families and list of entries

Caenorhabditis elegans

Caenorhabditis elegans: entries, gene names and cross-references to WormPep

SIMILARITY comments

Index of protein domains and families