Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Papilin

Gene

mig-6

Organism
Caenorhabditis elegans
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Involved in pharynx morphogenesis probably by remodeling the basement membrane.1 Publication
Isoform a: Plays a role in embryogenesis, the second phase of distal cell tip migration and is required for distribution of the metalloproteinase, mig-17, during organogenesis.1 Publication
Isoform b: Plays a role in post embryonic distal cell tip migration (PubMed:19297413). Essential extracellular matrix (ECM) protein required for hypodermal enclosure in the embryo (PubMed:11076767).2 Publications

GO - Molecular functioni

GO - Biological processi

  • cell morphogenesis Source: WormBase
  • embryo development ending in birth or egg hatching Source: WormBase
  • embryonic morphogenesis Source: WormBase
  • multicellular organism reproduction Source: WormBase
  • nematode larval development Source: WormBase

Keywordsi

Molecular functionDevelopmental protein, Protease inhibitor, Serine protease inhibitor

Protein family/group databases

MEROPSiI02.968.

Names & Taxonomyi

Protein namesi
Recommended name:
Papilin
Alternative name(s):
Abnormal cell migration protein 6
Gene namesi
Name:mig-6
Synonyms:ppn-1
ORF Names:C37C3.6
OrganismiCaenorhabditis elegans
Taxonomic identifieri6239 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis
Proteomesi
  • UP000001940 Componenti: Chromosome V

Organism-specific databases

WormBaseiC37C3.6a; CE17535; WBGene00003242; mig-6.
C37C3.6b; CE17536; WBGene00003242; mig-6.
C37C3.6c; CE30735; WBGene00003242; mig-6.

Subcellular locationi

Extracellular region or secreted Cytosol Plasma membrane Cytoskeleton Lysosome Endosome Peroxisome ER Golgi apparatus Nucleus Mitochondrion Manual annotation Automatic computational assertionGraphics by Christian Stolte; Source: COMPARTMENTS

Keywords - Cellular componenti

Basement membrane, Extracellular matrix, Secreted

Pathology & Biotechi

Mutagenesis

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Mutagenesisi650Y → D in k177; no pharyngeal defects. 1 Publication1
Mutagenesisi848C → Y in ev701; no pharyngeal defects. 1 Publication1
Mutagenesisi878 – 879GC → EY in ev700; no pharyngeal defects. 1 Publication2
Mutagenesisi965G → E in sa580; pharynx is twisted without affecting feeding or pumping rate. Causes some twisting in amphid neurons close to the pharynx. 1 Publication1
Mutagenesisi973C → Y in et4; pharynx is twisted without affecting feeding or pumping rate. Causes some twisting in amphid neurons close to the pharynx. NO defect in mig-17 localization. 1 Publication1

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 19Sequence analysisAdd BLAST19
ChainiPRO_000024854520 – 2167PapilinAdd BLAST2148

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Disulfide bondi88 ↔ 117By similarity
Disulfide bondi92 ↔ 122By similarity
Disulfide bondi103 ↔ 107By similarity
Glycosylationi268N-linked (GlcNAc...) asparagine1 Publication1
Disulfide bondi353 ↔ 396By similarity
Disulfide bondi357 ↔ 401By similarity
Disulfide bondi368 ↔ 382By similarity
Glycosylationi386N-linked (GlcNAc...) asparagine2 Publications1
Glycosylationi445N-linked (GlcNAc...) asparagine3 Publications1
Glycosylationi541N-linked (GlcNAc...) asparagine2 Publications1
Glycosylationi568N-linked (GlcNAc...) asparagine2 Publications1
Glycosylationi638N-linked (GlcNAc...) asparagine2 Publications1
Glycosylationi715N-linked (GlcNAc...) asparagineSequence analysis1
Glycosylationi729N-linked (GlcNAc...) asparagineSequence analysis1
Glycosylationi741N-linked (GlcNAc...) asparagineSequence analysis1
Glycosylationi814N-linked (GlcNAc...) asparagineSequence analysis1
Glycosylationi820N-linked (GlcNAc...) asparagineSequence analysis1
Glycosylationi857N-linked (GlcNAc...) asparagine3 Publications1
Glycosylationi933N-linked (GlcNAc...) asparagine2 Publications1
Disulfide bondi1089 ↔ 1141By similarity
Glycosylationi1090N-linked (GlcNAc...) asparagine3 Publications1
Disulfide bondi1099 ↔ 1124By similarity
Disulfide bondi1116 ↔ 1137By similarity
Disulfide bondi1150 ↔ 1202By similarity
Disulfide bondi1161 ↔ 1185By similarity
Disulfide bondi1177 ↔ 1198By similarity
Disulfide bondi1271 ↔ 1321By similarity
Disulfide bondi1280 ↔ 1304By similarity
Disulfide bondi1296 ↔ 1317By similarity
Disulfide bondi1375 ↔ 1425By similarity
Disulfide bondi1384 ↔ 1408By similarity
Disulfide bondi1400 ↔ 1421By similarity
Disulfide bondi1447 ↔ 1497By similarity
Disulfide bondi1456 ↔ 1480By similarity
Disulfide bondi1472 ↔ 1493By similarity
Disulfide bondi1504 ↔ 1554By similarity
Disulfide bondi1513 ↔ 1537By similarity
Disulfide bondi1529 ↔ 1550By similarity
Disulfide bondi1621 ↔ 1671By similarity
Disulfide bondi1630 ↔ 1654By similarity
Disulfide bondi1646 ↔ 1667By similarity
Disulfide bondi1731 ↔ 1781By similarity
Disulfide bondi1740 ↔ 1764By similarity
Disulfide bondi1756 ↔ 1777By similarity
Disulfide bondi1790 ↔ 1840By similarity
Disulfide bondi1799 ↔ 1823By similarity
Disulfide bondi1815 ↔ 1836By similarity
Glycosylationi1848N-linked (GlcNAc...) asparagineSequence analysis1
Disulfide bondi1853 ↔ 1903By similarity
Disulfide bondi1862 ↔ 1886By similarity
Disulfide bondi1878 ↔ 1899By similarity
Disulfide bondi1914 ↔ 1964By similarity
Disulfide bondi1923 ↔ 1947By similarity
Disulfide bondi1939 ↔ 1960By similarity
Glycosylationi1992N-linked (GlcNAc...) asparagine1 Publication1
Glycosylationi2087N-linked (GlcNAc...) asparagineSequence analysis1
Glycosylationi2133N-linked (GlcNAc...) asparagineSequence analysis1

Keywords - PTMi

Disulfide bond, Glycoprotein, Proteoglycan

Proteomic databases

EPDiO76840.
PaxDbiO76840.
PeptideAtlasiO76840.
PRIDEiO76840.

PTM databases

iPTMnetiO76840.

Expressioni

Tissue specificityi

Localizes to the basement membranes of the gonad primordium, pharynx and intestine (at protein level) (PubMed:19297413). Expressed in head and CAN neurons, coelomocytes, body-wall muscles and anal depressor and sphincter and stomatointestinal muscles (PubMed:19297413, PubMed:20805556). Expressed Isoform a: is expressed in body wall muscles and distal cell tips (PubMed:19297413). Isoform b: expressed in embryonic muscles (PubMed:19297413).2 Publications

Developmental stagei

Expression is first detected is intestinal primordium at the E16 embryonic stage followed by expression in future head neurons and head muscle cells at the comma and 1.5-fold stages (PubMed:20805556). Expressed in larvae and adults (PubMed:19297413, PubMed:20805556).2 Publications

Gene expression databases

BgeeiWBGene00003242.

Interactioni

Protein-protein interaction databases

BioGridi44205. 13 interactors.
DIPiDIP-24457N.
IntActiO76840. 12 interactors.
MINTiMINT-1044821.
STRINGi6239.C37C3.6b.1.

Structurei

3D structure databases

ProteinModelPortaliO76840.
SMRiO76840.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini76 – 123TSP type-1 1PROSITE-ProRule annotationAdd BLAST48
Domaini341 – 402TSP type-1 2PROSITE-ProRule annotationAdd BLAST62
Domaini404 – 459TSP type-1 3PROSITE-ProRule annotationAdd BLAST56
Domaini461 – 525TSP type-1 4PROSITE-ProRule annotationAdd BLAST65
Domaini585 – 643TSP type-1 5PROSITE-ProRule annotationAdd BLAST59
Domaini645 – 702TSP type-1 6PROSITE-ProRule annotationAdd BLAST58
Domaini1089 – 1141BPTI/Kunitz inhibitor 1PROSITE-ProRule annotationAdd BLAST53
Domaini1150 – 1202BPTI/Kunitz inhibitor 2PROSITE-ProRule annotationAdd BLAST53
Domaini1271 – 1321BPTI/Kunitz inhibitor 3PROSITE-ProRule annotationAdd BLAST51
Domaini1375 – 1425BPTI/Kunitz inhibitor 4PROSITE-ProRule annotationAdd BLAST51
Domaini1447 – 1497BPTI/Kunitz inhibitor 5PROSITE-ProRule annotationAdd BLAST51
Domaini1504 – 1554BPTI/Kunitz inhibitor 6PROSITE-ProRule annotationAdd BLAST51
Domaini1621 – 1671BPTI/Kunitz inhibitor 7PROSITE-ProRule annotationAdd BLAST51
Domaini1731 – 1781BPTI/Kunitz inhibitor 8PROSITE-ProRule annotationAdd BLAST51
Domaini1790 – 1840BPTI/Kunitz inhibitor 9PROSITE-ProRule annotationAdd BLAST51
Domaini1853 – 1903BPTI/Kunitz inhibitor 10PROSITE-ProRule annotationAdd BLAST51
Domaini1914 – 1964BPTI/Kunitz inhibitor 11PROSITE-ProRule annotationAdd BLAST51
Domaini2124 – 2163PLACPROSITE-ProRule annotationAdd BLAST40

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi825 – 1052Cys-richAdd BLAST228
Compositional biasi1221 – 1265Gln-richAdd BLAST45

Sequence similaritiesi

Belongs to the papilin family.Curated

Keywords - Domaini

Immunoglobulin domain, Repeat, Signal

Phylogenomic databases

eggNOGiENOG410KCTN. Eukaryota.
ENOG410XQNP. LUCA.
GeneTreeiENSGT00900000140774.
HOGENOMiHOG000021853.
InParanoidiO76840.
OMAiCEQPVES.
OrthoDBiEOG091G14M8.
PhylomeDBiO76840.

Family and domain databases

CDDicd00109. KU. 11 hits.
Gene3Di2.20.100.10. 7 hits.
2.60.40.10. 1 hit.
4.10.410.10. 11 hits.
InterProiView protein in InterPro
IPR010294. ADAM_spacer1.
IPR007110. Ig-like_dom.
IPR036179. Ig-like_dom_sf.
IPR013783. Ig-like_fold.
IPR003599. Ig_sub.
IPR003598. Ig_sub2.
IPR002223. Kunitz_BPTI.
IPR036880. Kunitz_BPTI_sf.
IPR013273. Peptidase_M12B_ADAM-TS.
IPR010909. PLAC.
IPR020901. Prtase_inh_Kunz-CS.
IPR000884. TSP1_rpt.
IPR036383. TSP1_rpt_sf.
PfamiView protein in Pfam
PF05986. ADAM_spacer1. 1 hit.
PF00014. Kunitz_BPTI. 11 hits.
PF08686. PLAC. 1 hit.
PF00090. TSP_1. 6 hits.
PRINTSiPR01857. ADAMTSFAMILY.
PR00759. BASICPTASE.
SMARTiView protein in SMART
SM00409. IG. 1 hit.
SM00408. IGc2. 1 hit.
SM00131. KU. 11 hits.
SM00209. TSP1. 7 hits.
SUPFAMiSSF48726. SSF48726. 1 hit.
SSF57362. SSF57362. 11 hits.
SSF82895. SSF82895. 6 hits.
PROSITEiView protein in PROSITE
PS00280. BPTI_KUNITZ_1. 10 hits.
PS50279. BPTI_KUNITZ_2. 11 hits.
PS50835. IG_LIKE. 1 hit.
PS50900. PLAC. 1 hit.
PS50092. TSP1. 5 hits.

Sequences (3)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform b (identifier: O76840-1) [UniParc]FASTAAdd to basket
Also known as: mig-6L

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MRLLLFSAAL LLCSVPTWAF SLSSFFGSDV AQKPYLHPNS PPERDPASSR
60 70 80 90 100
MKRQAYQVYV DGDVSVTVDK SGQKETGNWG PWVPENECSR SCGGGVQLEK
110 120 130 140 150
RQCSGDCTGA SVRYISCNLN ACESGTDFRA EQCSKFNDEA LDGNYHKWTP
160 170 180 190 200
YKGKNKCELV CKPESGNFYY KWADKVVDGT KCDSKSNDIC VDGECLPVGC
210 220 230 240 250
DGKLGSSLKF DKCGKCDGDG STCKTIEGRF DERNLSPGYH DIIKLPEGAT
260 270 280 290 300
NIKIQEARKS TNNLALKNGS DHFYLNGNGL IQVEKEVEVG GTIFVYDDAE
310 320 330 340 350
PETLSAQGPL SEELTVALLF RKGSRDTAIK YEFSIPLEEE VDYMYKFDNW
360 370 380 390 400
TPCSVSCGKG VQTRNLYCID GKNKGRVEDD LCEENNATKP EFEKSCETVD
410 420 430 440 450
CEAEWFTGDW ESCSSTCGDQ GQQYRVVYCH QVFANGRRVT VEDGNCTVER
460 470 480 490 500
PPVKQTCNRF ACPEWQAGPW SACSEKCGDA FQYRSVTCRS EKEGEEGKLL
510 520 530 540 550
AADACPADEQ EKFDTERTCN LGPCEGLTFV TGEWNLCTRC NDTEETREVT
560 570 580 590 600
CKDSQGRAYP LEKCLVDNST EIPTDTRSCA TQPPCEYEWT VSEWSKCTTE
610 620 630 640 650
CGHGHKTRRV ICAIHQNGGL EVVDEGHCQA EKPEGKTNCT NEEKCTGTWY
660 670 680 690 700
TSSWSECTAE CGGGSQDRVA VCLNYDKKPV PEWCDEAVKP SEKQDCNVDD
710 720 730 740 750
CPTCVDSEFG CCPDNSTFAT GEFNFGCSNC SETEFGCCAD NVTVATGPNS
760 770 780 790 800
KGCEEFVESP LNLEADVANA DAEASGDAPE LCSVTNENGE AVDVECATIA
810 820 830 840 850
PITALLGDGE LIGNDTDASN ETIHCSKTEF GCCPDWYTAA SGKGNEGCPS
860 870 880 890 900
FTLGGCNETQ FGCCHDDVTL ARGANLEGCG EPSCAASLYG CCKDRKTIAF
910 920 930 940 950
GPHYSGCERS SFPCELSDFG CCPDGETAAL GKNGTGCGEN CLTTKFGCCP
960 970 980 990 1000
DGKTTAKGSH NEGCGCEFAQ YGCCPDGKSV AKGAGFYGCP ESCAQSQFGC
1010 1020 1030 1040 1050
CPDGKTRARG ENKEGCPCQY TRYGCCPDGE TTALGPRNDG CDNCRYAKHG
1060 1070 1080 1090 1100
CCPDGETKAL GPDGAGCPPT TTPPFLMGGT VAPHKIAACN QTQESGTVCG
1110 1120 1130 1140 1150
AGYKLAWHYD TTEGRCNQFW YGGCGGNDNN FASQDMCETI CVEPPGKGRC
1160 1170 1180 1190 1200
YLPRVDGPLR CDQLQPRYYY DHSKKHCVAF WWRGCLGNAN NFNSFEECSM
1210 1220 1230 1240 1250
FCKDVGPYDA PTTAAPPPPP QQNAQQYLPT PEVQQIEIQS AEQPQPQQPQ
1260 1270 1280 1290 1300
QQQQQQQQQP QQPRQSMEDI CRSRQDAGPC ETYSDQWFYN AFSQECETFT
1310 1320 1330 1340 1350
YGGCGGNLNR FRSKDECEQR CFFVHGAQPS AARQEQAQPA AQPAQPAQPS
1360 1370 1380 1390 1400
NIVSPPQQSA SPVVVPSNSK QRDACHLNVD QGRCKGAFDS WYYEVATGSC
1410 1420 1430 1440 1450
VTFKYTGCGG NANRFASKDQ CESLCVKPAS EAASAGIDGA AGINSVCDEA
1460 1470 1480 1490 1500
KDTGPCTNFV TKWYYNKADG TCNRFHYGGC QGTNNRFDNE QQCKAACQNH
1510 1520 1530 1540 1550
KDACQLPKVQ GPCSGKHSYY YYNTASHQCE TFTYGGCLGN TNRFATIEEC
1560 1570 1580 1590 1600
QARCPKDDQT TTTSQPEELP SLPLVQEDPQ PRPAFSLKQS FAHSRRRDAP
1610 1620 1630 1640 1650
FARSVSARHH TPDSEEERVD CYAVPDPGSC GDYRLVWHYS ATSNSCRQFY
1660 1670 1680 1690 1700
YGGCAGNTNR FETRDKCETS CVAKIEERVE SVSEASKSLE EVRLTDPRMD
1710 1720 1730 1740 1750
SHFGYHDPEV DQIEEEAEYV IVDTGALPEL CMLPEQRGSC YDNILRWRFD
1760 1770 1780 1790 1800
SEKSQCVTFM YSGCNPNANH FTSQETCERA CGKWRNVAVC ELPAEHGDCQ
1810 1820 1830 1840 1850
LAIPRWYHDP KTSQCQMMMW TGCGGNGNAF SSKADCESLC RVETLWSNNT
1860 1870 1880 1890 1900
DFCTLERSAG PCTDSISMWY FDSTHLDCKP FTYGGCRGNQ NRFVSKEQCQ
1910 1920 1930 1940 1950
QSCRPGDTKS EDICTLRPEP GPCRLGLEKY FYDPVIQSCH MFHYGGCEGN
1960 1970 1980 1990 2000
ANRFDSELDC FRRCSSVKVE ASESERVGQL TSASTPVIYI VNKTAIFVGN
2010 2020 2030 2040 2050
TFRIRCNSYG VLPITWYKNG GLLQFGSRIT EENDDTLEIV DALTADAGVY
2060 2070 2080 2090 2100
TCIAGQDSTM SEGVEVVIKR LPGHRTTSRP MLTPSKNFSL GTPPTPSPST
2110 2120 2130 2140 2150
VSTTPFRIYT PGSAPSDARV SRPTSNSCMD VGNASTCDLI VKNGLCGKKR
2160
YGTFCCHTCT RVHNFKF
Length:2,167
Mass (Da):237,601
Last modified:November 1, 1998 - v1
Checksum:i96274786D52E3639
GO
Isoform a (identifier: O76840-2) [UniParc]FASTAAdd to basket
Also known as: mig-6S

The sequence of this isoform differs from the canonical sequence as follows:
     1556-1558: KDD → SKF
     1559-2167: Missing.

Show »
Length:1,558
Mass (Da):169,604
Checksum:i38FCAA2CA6B80D6B
GO
Isoform c (identifier: O76840-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1367-1438: SNSKQRDACHLNVDQGRCKGAFDSWYYEVATGSCVTFKYTGCGGNANRFASKDQCESLCVKPASEAASAGID → Y
     1556-1558: KDD → SKF
     1559-2167: Missing.

Note: No experimental confirmation available.
Show »
Length:1,487
Mass (Da):162,109
Checksum:i3DB432B96B90FD4E
GO

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0203071367 – 1438SNSKQ…SAGID → Y in isoform c. CuratedAdd BLAST72
Alternative sequenceiVSP_0203081556 – 1558KDD → SKF in isoform a and isoform c. Curated3
Alternative sequenceiVSP_0203091559 – 2167Missing in isoform a and isoform c. CuratedAdd BLAST609

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
FO080808 Genomic DNA. Translation: CCD66947.1.
FO080808 Genomic DNA. Translation: CCD66945.1.
FO080808 Genomic DNA. Translation: CCD66946.1.
PIRiC89114.
T34395.
RefSeqiNP_505017.1. NM_072616.4. [O76840-1]
NP_505018.1. NM_072617.4. [O76840-2]
NP_741569.1. NM_171931.4. [O76840-3]
UniGeneiCel.17525.

Genome annotation databases

EnsemblMetazoaiC37C3.6b.1; C37C3.6b.1; WBGene00003242. [O76840-1]
C37C3.6b.2; C37C3.6b.2; WBGene00003242. [O76840-1]
GeneIDi179162.
KEGGicel:CELE_C37C3.6.
UCSCiC37C3.6a.2. c. elegans. [O76840-1]

Keywords - Coding sequence diversityi

Alternative splicing

Entry informationi

Entry nameiPPN1_CAEEL
AccessioniPrimary (citable) accession number: O76840
Secondary accession number(s): Q22911, Q8I7I0, Q8MPV5
Entry historyiIntegrated into UniProtKB/Swiss-Prot: September 5, 2006
Last sequence update: November 1, 1998
Last modified: October 25, 2017
This is version 136 of the entry and version 1 of the sequence. See complete history.
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programCaenorhabditis annotation project

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Caenorhabditis elegans
    Caenorhabditis elegans: entries, gene names and cross-references to WormBase
  2. SIMILARITY comments
    Index of protein domains and families