Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Zinc metalloproteinase nas-26

Gene

toh-1

Organism
Caenorhabditis elegans
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Protein inferred from homologyi

Functioni

Probable metalloprotease.By similarity

Catalytic activityi

Hydrolysis of peptide bonds in substrates containing five or more amino acids, preferentially with Ala in P1', and Pro in P2'.

Cofactori

Zn2+By similarityNote: Binds 1 zinc ion per subunit.By similarity

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Metal bindingi154 – 1541Zinc; catalyticBy similarity
Active sitei155 – 1551By similarity
Metal bindingi158 – 1581Zinc; catalyticBy similarity
Metal bindingi164 – 1641Zinc; catalyticBy similarity

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Hydrolase, Metalloprotease, Protease

Keywords - Ligandi

Metal-binding, Zinc

Protein family/group databases

MEROPSiM12.A17.

Names & Taxonomyi

Protein namesi
Recommended name:
Zinc metalloproteinase nas-26 (EC:3.4.24.21)
Alternative name(s):
Nematode astacin 26
Tollish protein 1
Gene namesi
Name:toh-1
Synonyms:nas-26
ORF Names:T24A11.3
OrganismiCaenorhabditis elegans
Taxonomic identifieri6239 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis
Proteomesi
  • UP000001940 Componenti: Chromosome III

Organism-specific databases

WormBaseiT24A11.3; CE35909; WBGene00006591; toh-1.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Secreted

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Signal peptidei1 – 2020Sequence analysisAdd
BLAST
Chaini21 – 414394Zinc metalloproteinase nas-26PRO_0000028930Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Glycosylationi24 – 241N-linked (GlcNAc...)Sequence analysis
Disulfide bondi263 ↔ 275PROSITE-ProRule annotation
Disulfide bondi267 ↔ 286PROSITE-ProRule annotation
Disulfide bondi289 ↔ 300PROSITE-ProRule annotation
Disulfide bondi308 ↔ 331PROSITE-ProRule annotation
Disulfide bondi358 ↔ 378PROSITE-ProRule annotation

Keywords - PTMi

Disulfide bond, Glycoprotein

Proteomic databases

PaxDbiQ22710.

Interactioni

Protein-protein interaction databases

STRINGi6239.T24A11.3.2.

Structurei

3D structure databases

ProteinModelPortaliQ22710.
SMRiQ22710. Positions 71-237.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini251 – 30757EGF-likeAdd
BLAST
Domaini308 – 414107CUBPROSITE-ProRule annotationAdd
BLAST

Sequence similaritiesi

Belongs to the peptidase M12A family.Curated
Contains 1 CUB domain.PROSITE-ProRule annotation
Contains 1 EGF-like domain.Curated

Keywords - Domaini

EGF-like domain, Signal

Phylogenomic databases

eggNOGiKOG3714. Eukaryota.
ENOG410ZPX7. LUCA.
GeneTreeiENSGT00760000119227.
HOGENOMiHOG000018812.
InParanoidiQ22710.
KOiK08076.
OMAiCYWRISV.
PhylomeDBiQ22710.

Family and domain databases

Gene3Di2.60.120.290. 1 hit.
3.40.390.10. 1 hit.
InterProiIPR000859. CUB_dom.
IPR024079. MetalloPept_cat_dom.
IPR017050. Metallopeptidase_nem.
IPR001506. Peptidase_M12A.
IPR006026. Peptidase_Metallo.
[Graphical view]
PfamiPF01400. Astacin. 1 hit.
PF00431. CUB. 1 hit.
[Graphical view]
PIRSFiPIRSF036365. Astacin_nematoda. 1 hit.
PRINTSiPR00480. ASTACIN.
SMARTiSM00042. CUB. 1 hit.
SM00235. ZnMc. 1 hit.
[Graphical view]
SUPFAMiSSF49854. SSF49854. 1 hit.
PROSITEiPS01180. CUB. 1 hit.
PS00022. EGF_1. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

Q22710-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MTSSLVLILA PLALVAIGEA AFGNSSKIFE IPGLEVMASD KYPHFTTIET
60 70 80 90 100
VSRTKVHRHR REVIAGQIYD WNSYEIPFQI WGGDYNFQSL IRRGIRMWED
110 120 130 140 150
STCLRFKENQ QSRDAIRYVL EKGDSCFTEY IGRNGGHQDI IIGSECAEEY
160 170 180 190 200
VVAHETGHAL GFWHTHQRPD RDRHISINWK NVMEEATASF MPFRSMLQAF
210 220 230 240 250
GIRQVSPRRV PYDYGSLMHY HAVAHAVKVS DFTIVPKELK YVTTMGTEKM
260 270 280 290 300
AFLDAKVIND IYCPNACQGR NHLNCLAGGY PDPNNCNVCR CPEGLGGPDC
310 320 330 340 350
GRLQPSPCGG EIHASDQWQT LSSPSGRDVH CYWRISVPEG SRVRFRLSDG
360 370 380 390 400
EFPCSYGCQS YVEIKHKLDV RLTGFRSCCY RPKEDTVSES NQIFVIYHPN
410
GRTARFSLRF RRQA
Length:414
Mass (Da):47,140
Last modified:July 5, 2004 - v4
Checksum:iB8B381B421239D1B
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
Z49072 Genomic DNA. Translation: CAA88882.4.
PIRiT25213.
RefSeqiNP_497769.3. NM_065368.5.
UniGeneiCel.22628.

Genome annotation databases

EnsemblMetazoaiT24A11.3; T24A11.3; WBGene00006591.
GeneIDi175491.
KEGGicel:CELE_T24A11.3.
UCSCiT24A11.3. c. elegans.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
Z49072 Genomic DNA. Translation: CAA88882.4.
PIRiT25213.
RefSeqiNP_497769.3. NM_065368.5.
UniGeneiCel.22628.

3D structure databases

ProteinModelPortaliQ22710.
SMRiQ22710. Positions 71-237.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi6239.T24A11.3.2.

Protein family/group databases

MEROPSiM12.A17.

Proteomic databases

PaxDbiQ22710.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaiT24A11.3; T24A11.3; WBGene00006591.
GeneIDi175491.
KEGGicel:CELE_T24A11.3.
UCSCiT24A11.3. c. elegans.

Organism-specific databases

CTDi175491.
WormBaseiT24A11.3; CE35909; WBGene00006591; toh-1.

Phylogenomic databases

eggNOGiKOG3714. Eukaryota.
ENOG410ZPX7. LUCA.
GeneTreeiENSGT00760000119227.
HOGENOMiHOG000018812.
InParanoidiQ22710.
KOiK08076.
OMAiCYWRISV.
PhylomeDBiQ22710.

Miscellaneous databases

NextBioi888382.
PROiQ22710.

Family and domain databases

Gene3Di2.60.120.290. 1 hit.
3.40.390.10. 1 hit.
InterProiIPR000859. CUB_dom.
IPR024079. MetalloPept_cat_dom.
IPR017050. Metallopeptidase_nem.
IPR001506. Peptidase_M12A.
IPR006026. Peptidase_Metallo.
[Graphical view]
PfamiPF01400. Astacin. 1 hit.
PF00431. CUB. 1 hit.
[Graphical view]
PIRSFiPIRSF036365. Astacin_nematoda. 1 hit.
PRINTSiPR00480. ASTACIN.
SMARTiSM00042. CUB. 1 hit.
SM00235. ZnMc. 1 hit.
[Graphical view]
SUPFAMiSSF49854. SSF49854. 1 hit.
PROSITEiPS01180. CUB. 1 hit.
PS00022. EGF_1. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. "Genome sequence of the nematode C. elegans: a platform for investigating biology."
    The C. elegans sequencing consortium
    Science 282:2012-2018(1998) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: Bristol N2.
  2. Cited for: IDENTIFICATION.
  3. "The astacin protein family in Caenorhabditis elegans."
    Moehrlen F., Hutter H., Zwilling R.
    Eur. J. Biochem. 270:4909-4920(2003) [PubMed] [Europe PMC] [Abstract]
    Cited for: IDENTIFICATION, NOMENCLATURE.

Entry informationi

Entry nameiNAS26_CAEEL
AccessioniPrimary (citable) accession number: Q22710
Entry historyi
Integrated into UniProtKB/Swiss-Prot: January 4, 2005
Last sequence update: July 5, 2004
Last modified: November 11, 2015
This is version 109 of the entry and version 4 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programCaenorhabditis annotation project

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Caenorhabditis elegans
    Caenorhabditis elegans: entries, gene names and cross-references to WormBase
  2. Peptidase families
    Classification of peptidase families and list of entries
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.