Protein

Mesocentin

Gene

dig-1

Organism
Caenorhabditis elegans
Status
Reviewed. Annotation score: 4 out of 5. Experimental evidence at protein level.

Function

Encodes an adhesion molecule involved in sensory map formation. Functions during sensory process development in the nervous system. [1 Publication]

GO - Molecular function

GO - Biological process

  • cell adhesion (Source: UniProtKB-KW)
  • gonad development (Source: WormBase)
  • nervous system development (Source: WormBase)

Keywords - Molecular function

Developmental protein

Keywords - Biological process

Cell adhesion

Names & Taxonomy

Protein names
Recommended name:
Mesocentin
Gene names
Name: dig-1
ORF Names: K07E12.1
Organism: Caenorhabditis elegans
Taxonomic identifier: 6239 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Nematoda > Chromadorea > Rhabditida > Rhabditoidea > Rhabditidae > Peloderinae > Caenorhabditis
Proteomes
  • UP000001940 Component: Chromosome III

Organism-specific databases

WormBase: K07E12.1a; CE32905; WBGene00000998; dig-1.

Subcellular location

  • Secreted > extracellular space > extracellular matrix [1 Publication]

GO - Cellular component

Keywords - Cellular component

Extracellular matrix, Secreted

PTM / Processing

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Signal peptide | 1 – 24 | 24 | [Sequence analysis] |
Chain | 25 – 13100 | 13076 | Mesocentin | PRO_0000233334

Amino acid modifications

Feature key | Position(s) | Length | Description
Glycosylation | 70 – 70 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 178 – 178 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Disulfide bond | 193 ↔ 220 | | [PROSITE-ProRule annotation]
Glycosylation | 324 – 324 | 1 | N-linked (GlcNAc...) [1 Publication]
Disulfide bond | 327 ↔ 374 | | [PROSITE-ProRule annotation]
Glycosylation | 384 – 384 | 1 | N-linked (GlcNAc...) [1 Publication]
Disulfide bond | 517 ↔ 561 | | [PROSITE-ProRule annotation]
Glycosylation | 518 – 518 | 1 | N-linked (GlcNAc...) [2 Publications]
Glycosylation | 575 – 575 | 1 | N-linked (GlcNAc...) [2 Publications]
Disulfide bond | 705 ↔ 762 | | [PROSITE-ProRule annotation]
Glycosylation | 841 – 841 | 1 | N-linked (GlcNAc...) [1 Publication]
Glycosylation | 931 – 931 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 1112 – 1112 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 1211 – 1211 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Disulfide bond | 1226 ↔ 1264 | | [PROSITE-ProRule annotation]
Glycosylation | 1399 – 1399 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Disulfide bond | 1523 ↔ 1562 | | [PROSITE-ProRule annotation]
Glycosylation | 1612 – 1612 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Disulfide bond | 1839 ↔ 1888 | | [PROSITE-ProRule annotation]
Glycosylation | 1840 – 1840 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 1879 – 1879 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 1898 – 1898 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 8810 – 8810 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 10555 – 10555 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 10570 – 10570 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 12294 – 12294 | 1 | N-linked (GlcNAc...) [1 Publication]
Glycosylation | 12478 – 12478 | 1 | N-linked (GlcNAc...) [1 Publication]
Glycosylation | 12840 – 12840 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 12843 – 12843 | 1 | N-linked (GlcNAc...) [1 Publication]
Glycosylation | 12895 – 12895 | 1 | N-linked (GlcNAc...) [Sequence analysis]
Glycosylation | 12913 – 12913 | 1 | N-linked (GlcNAc...) [Sequence analysis]
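
The glycosylation sites marked Sequence analysis in the table above are predicted rather than observed. The entry does not state which rule was applied; the sketch below assumes the conventional N-linked glycosylation sequon (Asn, then any residue except Pro, then Ser or Thr) and shows how candidate positions could be located in a sequence. The function name `find_sequons` and its `offset` parameter are hypothetical, introduced only for illustration.

```python
import re

# Assumed consensus motif: Asn, then any residue except Pro, then Ser or Thr.
# The lookahead lets overlapping motifs (e.g. "NNTS") each be reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(sequence: str, offset: int = 0) -> list[int]:
    """Return 1-based positions of Asn residues that start an N-X-S/T motif.

    `offset` is the number of residues preceding `sequence` in the full
    protein, so fragments can be reported in full-length coordinates.
    """
    return [m.start() + 1 + offset for m in SEQUON.finditer(sequence.upper())]


if __name__ == "__main__":
    # Residues 61-80 of the sequence shown in this entry.
    fragment = "RPEPVVVGRNGTGKCVISAD"
    print(find_sequons(fragment, offset=60))  # [70]
```

Position 70 is the first glycosylation site listed in the table. A motif match only marks a candidate site; it does not show that the residue is actually glycosylated.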

Keywords - PTM

Disulfide bond, Glycoprotein

Proteomic databases

EPD: Q09165.
PaxDb: Q09165.
PRIDE: Q09165.

Expression

Gene expression databases

ExpressionAtlas: Q09165. baseline and differential.

Interaction

Protein-protein interaction databases

STRING: 6239.K07E12.1a.1.

Structure

3D structure databases

ProteinModelPortal: Q09165.
SMR: Q09165. Positions 266-391, 409-493, 592-675, 698-723, 800-876, 991-1090, 1163-1196, 1214-1282, 1296-1381, 1511-1580, 1582-1810, 12167-12564, 12930-13014.

Family & Domains

Domains and Repeats

Feature key | Position(s) | Length | Description
Domain | 162 – 222 | 61 | Sushi
Domain | 225 – 263 | 39 | EGF-like 1
Domain | 305 – 392 | 88 | Ig-like C2-type 1
Domain | 394 – 491 | 98 | Fibronectin type-III 1 [PROSITE-ProRule annotation]
Domain | 491 – 577 | 87 | Ig-like C2-type 2
Domain | 580 – 678 | 99 | Fibronectin type-III 2 [PROSITE-ProRule annotation]
Domain | 681 – 778 | 98 | Ig-like C2-type 3
Domain | 782 – 879 | 98 | Fibronectin type-III 3 [PROSITE-ProRule annotation]
Domain | 892 – 983 | 92 | Ig-like C2-type 4
Domain | 994 – 1092 | 99 | Fibronectin type-III 4 [PROSITE-ProRule annotation]
Domain | 1097 – 1199 | 103 | Fibronectin type-III 5 [PROSITE-ProRule annotation]
Domain | 1203 – 1280 | 78 | Ig-like C2-type 5
Domain | 1287 – 1384 | 98 | Fibronectin type-III 6 [PROSITE-ProRule annotation]
Domain | 1385 – 1493 | 109 | Fibronectin type-III 7 [PROSITE-ProRule annotation]
Domain | 1500 – 1576 | 77 | Ig-like C2-type 6
Domain | 1585 – 1697 | 113 | Fibronectin type-III 8 [PROSITE-ProRule annotation]
Domain | 1702 – 1806 | 105 | Fibronectin type-III 9 [PROSITE-ProRule annotation]
Domain | 1810 – 1904 | 95 | Ig-like C2-type 7
Domain | 2007 – 2100 | 94 | Fibronectin type-III 10 [PROSITE-ProRule annotation]
Domain | 6599 – 6787 | 189 | VWFA 1 [PROSITE-ProRule annotation]
Domain | 12167 – 12341 | 175 | VWFA 2 [PROSITE-ProRule annotation]
Domain | 12379 – 12557 | 179 | VWFA 3 [PROSITE-ProRule annotation]
Domain | 12609 – 12780 | 172 | VWFA 4 [PROSITE-ProRule annotation]
Domain | 12816 – 12910 | 95 | Fibronectin type-III 11 [PROSITE-ProRule annotation]
Domain | 12988 – 13029 | 42 | EGF-like 2; calcium-binding [Sequence analysis]

Compositional bias

Feature key | Position(s) | Length | Description
Compositional bias | 6894 – 6898 | 5 | Poly-Glu
Compositional bias | 7613 – 7617 | 5 | Poly-Glu
Compositional bias | 8387 – 8391 | 5 | Poly-Glu
Compositional bias | 9431 – 9435 | 5 | Poly-Glu
Compositional bias | 10150 – 10154 | 5 | Poly-Glu

Sequence similarities

Contains 2 EGF-like domains. [Curated]
Contains 11 fibronectin type-III domains. [PROSITE-ProRule annotation]
Contains 1 Sushi (CCP/SCR) domain. [Curated]
Contains 4 VWFA domains. [PROSITE-ProRule annotation]

Keywords - Domain

EGF-like domain, Immunoglobulin domain, Repeat, Signal, Sushi

Phylogenomic databases

eggNOG: ENOG410J6QH. Eukaryota.
    ENOG410XY8D. LUCA.
GeneTree: ENSGT00840000131684.
HOGENOM: HOG000020923.
InParanoid: Q09165.
OMA: DGQINIT.
OrthoDB: EOG773XKP.
PhylomeDB: Q09165.

Family and domain databases

Gene3D: 2.120.10.30. 53 hits.
    2.60.40.10. 17 hits.
    3.40.50.410. 4 hits.
InterPro: IPR011042. 6-blade_b-propeller_TolB-like.
    IPR001881. EGF-like_Ca-bd_dom.
    IPR000742. EGF-like_dom.
    IPR018097. EGF_Ca-bd_CS.
    IPR003961. FN3_dom.
    IPR007110. Ig-like_dom.
    IPR013783. Ig-like_fold.
    IPR003599. Ig_sub.
    IPR003598. Ig_sub2.
    IPR013151. Immunoglobulin.
    IPR011041. Quinoprot_gluc/sorb_DH.
    IPR002035. VWF_A.
Pfam: PF00041. fn3. 9 hits.
    PF00047. ig. 2 hits.
    PF00092. VWA. 3 hits.
SMART: SM00181. EGF. 3 hits.
    SM00179. EGF_CA. 1 hit.
    SM00060. FN3. 11 hits.
    SM00409. IG. 7 hits.
    SM00408. IGc2. 6 hits.
    SM00327. VWA. 4 hits.
SUPFAM: SSF48726. SSF48726. 6 hits.
    SSF49265. SSF49265. 7 hits.
    SSF50952. SSF50952. 17 hits.
    SSF53300. SSF53300. 4 hits.
PROSITE: PS00010. ASX_HYDROXYL. 1 hit.
    PS00022. EGF_1. 1 hit.
    PS01186. EGF_2. 1 hit.
    PS01187. EGF_CA. 1 hit.
    PS50853. FN3. 11 hits.
    PS50835. IG_LIKE. 6 hits.
    PS50234. VWFA. 4 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q09165-1 [UniParc]

        10         20         30         40         50
MGGRNWLFRS AVLVSTLLTC ISIAQELLPS IEVESLAQDL QIQEWMRTLR
60 70 80 90 100
RVKRAPTRNN RPEPVVVGRN GTGKCVISAD RASHFCGMEE EVSAPPIPPP
110 120 130 140 150
DEGKCIISKA SGREICYPSY SQLDTSCTDV TGQSSNGLVV PPVVPHATVR
160 170 180 190 200
AMAFVPPDNL RRLIIQYYRQ QGKHQPKNAT FSPKSFLFVK YHCDYGYEMV
210 220 230 240 250
DEVDTMFCQD KKWVMTPPMC RGQGLCAADN GGCSHTCISY NDEKIECKCP
260 270 280 290 300
RGMTLDVDEK TCIKPIPKSL CRSLSGCTCN GITETQFACS CGDNKQKCLL
310 320 330 340 350
IAGPPRIYIE PQGPYEVAPG GNINISCTSV AYPFPDIYWF KNQKVNTDGP
360 370 380 390 400
DQNTLRASQI LIIKEIYRNE EFTCVSDNIH GSANRTVSIV VTGPGSAPHL
410 420 430 440 450
KSASAGRTSL TVRWEPPSII NRPITTYTLY YTNNPQQPVK NWKKLEVKEP
460 470 480 490 500
TREVAIPDLR PDTAYYIRVR ANDPLGPGKL GNQVQIKTLK PAVRPYVNIV
510 520 530 540 550
EGDEIRVPPM TAFEIDCNVT RADPVPVLVW LHKGRPLNKG SKTQHIKMKN
560 570 580 590 600
GGVLESTQFS CVAENEAGKS TKKINVTVTG PSAPERIRYQ IDGDKVTLQW
610 620 630 640 650
EPPQITNGPM AGYDVFYTED PSLPRDQWKV HHIDDPNART TTVLRLNEKT
660 670 680 690 700
PYTFVIVGRN RLGPGLPSAP FTATTWLAAK PPVVQLEPSE EMTKEPSNDE
710 720 730 740 750
MIIECGAQGV PKPKIIWLWS GTLIEDGKEE FRVYDTTPTD AQDRTRSKLI
760 770 780 790 800
AQSTTRSGVA TCQAVNSEGS DEKKVPVKIL GPGSAPLGIT PTPMHTGFDV
810 820 830 840 850
AWKPPKVTNG RITDYVVYYS KDPDAPLSDW ESKTVPADTR NLTVNVDDED
860 870 880 890 900
TPYVVKVQAR TDDGPGIISE AYEVTTGRKQ VPLSVRLEIA DPSVDPSTGE
910 920 930 940 950
TIVEPTQPIH FRCVADGRPM PSVSYSWLPI NASESGDEPV PIPIHSDDSQ
960 970 980 990 1000
PHHYNSIQVY STTATKRILL CQARNPDGTV DDRHVFIVNK PGSAPQNPEV
1010 1020 1030 1040 1050
IVDPDNRVTI TWQPPKYPNG EITSYNVYIT GDPSLPVDQW QVFPVDDVTD
1060 1070 1080 1090 1100
PKLVLQRGAL QPETPYFVKI AAVNPHGEGI HTDPKHFDTV SGAPIDAPTD
1110 1120 1130 1140 1150
VLPSVSIDNT VNITWSPPTQ PLGPIKSYTV YFAPEYDDSD FKTWQRISVD
1160 1170 1180 1190 1200
APDGADHGEV TLPKEQFNPN TPYKIRISAT NDLSEGPASD PVRFETGSGE
1210 1220 1230 1240 1250
IPPTITLDPS NSTYTVEPLG AATITCTATG VPQPKVHWIK ANGETVDSAT
1260 1270 1280 1290 1300
LQLYDLVKDT SATCVAENNA GKTQEAVSIQ VTGPGTAPNE IVLLPMPNQE
1310 1320 1330 1340 1350
INVEWTSPDE VNGQITNYII HYGEISEDGS EPATWDQVTI ARDDVNHKLA
1360 1370 1380 1390 1400
NLEPKKTYAI RVQAVSDRGP GVISAPQVIK TLPLAPQAIT NPIIQVHPNN
1410 1420 1430 1440 1450
SVTIEFTPPD DPENPGKKVK DFVIQYTTDE EPDDESVWKE LKFTDPDDTD
1460 1470 1480 1490 1500
DTTIVSIDGE NFNPDTKYNT RIIARGEIDS QPNEPTLFAT GDGVIAPSQP
1510 1520 1530 1540 1550
SFNVDTEDGV IRVPAGTDYT IKCVSEGYPA PDVRWVDSHG NQLSDGPLLR
1560 1570 1580 1590 1600
IIDIRKTLNA KCLAENRGGL KETDLTIFVA GPGTAPENIQ LTANKPTTIS
1610 1620 1630 1640 1650
VQYEVPSIPN GNISKYIIYY TPLDDQDPDH QLGQVQTKPI SDWQNVHDMN
1660 1670 1680 1690 1700
DGVEGPRKVD IKDFVSTDTA YAVVVQAIND DGPGPYSNQY TIRTMSRARE
1710 1720 1730 1740 1750
GPPVELRVEP DGQRSAVAQW KEPVTSDVPP IGYEIYYVRG DKSVEEDDSA
1760 1770 1780 1790 1800
GLNDWIKISI DDPTKLTHKI QNLLLPDTDY VFKMRAIYPD GPSVFSEPCI
1810 1820 1830 1840 1850
MKTLPDGNAP YIQISTGDNG VEGSTTIQIL PGSQMTIACN ATGIPLPQVK
1860 1870 1880 1890 1900
WIKAGNYEID PSRVDADGNH AQFSLQVANI TEDTTFNCVA QNPLGHANWT
1910 1920 1930 1940 1950
INVNLIEGLE PNWRDDFVTS KSDGGQIVLV FNDELPEYLK PPNEWTIQYT
1960 1970 1980 1990 2000
DDAEQPKDQW ESIPSGGAPL TRVEVPNMNP GTFYYLVVDN PEKGIQTPTL
2010 2020 2030 2040 2050
VVMTPKPPSD IRFGKNNDDE QIVDFKPAVA SEPIKEYTIS VWPSTDPSNV
2060 2070 2080 2090 2100
KKFTTPADVT SGVVVDGLEP DTEYNVQVAA EFYEGEELAS EPVTVKTPPR
2110 2120 2130 2140 2150
DVSCECDHGC AFEMNEDAGT MEPKCYCHGG FHLTSDGKSC ERDEEDDATS
2160 2170 2180 2190 2200
QAVLQVTPPS ITTKVAPEEL LTGSSGEVDS TPETLSPVVG PDGKPLVLDK
2210 2220 2230 2240 2250
KGKPIDSSGK PVKFDENGDP IAPEGTKLEK NDNGEWVYPL VDRNGKPLPV
2260 2270 2280 2290 2300
DENNKPIITV IDKDGRVVTE TDDGTFVTSD GKQVEVDDLG RPLDEDGNPY
2310 2320 2330 2340 2350
KTNENGQFVI SDVDGAVEGD DEEEQPQVIP LYVVDVDDDG KYLDEDGNEI
2360 2370 2380 2390 2400
PVNEDGDPID VNGKPLEKNE DGKFVKPKES TQETPQPTKI TIVSPDGTPL
2410 2420 2430 2440 2450
PTDASGSPIG LDGQPVPTDA SGKPLAKDGS PLPTDNNGNY VILPSSKNSV
2460 2470 2480 2490 2500
DSQPTDDAGR VIYPVVLPDG SPLATDSTGN FVNRHGDIVE RDDEGKPMGP
2510 2520 2530 2540 2550
DGQLLPTDAS GNYIYPVTGP NDEVLPTDAN GNPIYPVVGP DGTPLPTDAS
2560 2570 2580 2590 2600
GAIVGPDGQP IPTDSNGKPL SKEGYPLPVD NQGNYILLPT EIDAAQSLPT
2610 2620 2630 2640 2650
DDAGMPVYPI VKPDGTPLAT DSTGSFINDN GEIIEKDDEG RPFGPDGLIL
2660 2670 2680 2690 2700
PTDASGNYIY PAMGLDGQPL ATDASGNYVL VSTEQTVTKS YPVDDSDITI
2710 2720 2730 2740 2750
HPIVNPDGTP LATDSTGSYV TEDGQIIEKD DEGRPLGPDG QVLPTDDSGN
2760 2770 2780 2790 2800
YIYPVAESGE ETKPTDASGK TVYPVRGPDG TPLPTDASGA VIGPDGEVIP
2810 2820 2830 2840 2850
TDENGIPLSQ DGSPLPTDNQ GNYILVLTSE TPTKTLPIDE SGNVVYPITK
2860 2870 2880 2890 2900
PDGTPLATDS TGSFVTEDGT IIAKDDEGKP LGPDGEVLPT DASGNYVYPV
2910 2920 2930 2940 2950
TVSDEQTLPT DDTGKTVYPI RGPDGTPLPT DASGPVIGPD GEIMPTDENG
2960 2970 2980 2990 3000
IPLSKDGTPL PTDNDGNYVI VPSDEETSKE LPIDDSGNVI YPITKPDGTP
3010 3020 3030 3040 3050
LATDSTGSFV TEDGTIIEKD DEGKPLGPDG QILSTDASGN YVYPDPGLDS
3060 3070 3080 3090 3100
QILTTDVYGK PIYTVIGPDG TALPTDASGA AIGPDGTPIS TDETGEPLDK
3110 3120 3130 3140 3150
DGSILPTDDY GNFVFVVSQE LPTDAEVQTP ITKPDGTLLA TDSSGNYVND
3160 3170 3180 3190 3200
NGDIIEKDDE GKPLGPDGEV LPTDGTGNFI YPATTSDGEV IPTDDSGKPL
3210 3220 3230 3240 3250
YTIRGPDGTP LPTDETGSAL GPDGEPISTD SSGKPLSKDG SPLPTDNNGN
3260 3270 3280 3290 3300
YVLVPTDEST TKALPTDESG NVVIPITNPD GTSLATDSTG SFVTDDGQII
3310 3320 3330 3340 3350
EKDDEGKPLG PDGAILPTDA SGNYIYPVVG PDGQALPTDE TGKTVYPVRG
3360 3370 3380 3390 3400
PDGTPLPTDA SGAVMGPDGE PIPTDANGKP LSKDGSPLPT DASGNYVLVP
3410 3420 3430 3440 3450
SDEVTAKELP TDESGTIVYP VTRADGTPLA TDSTGSFVTD DGQIIGKDDE
3460 3470 3480 3490 3500
GKPLGPDGQV LPTDDSGNYI YPAVGPDGQA FPTDKSGKPL YPVRGPDGTS
3510 3520 3530 3540 3550
LPTDASGAAI GPDGEVIPTD ENGIPLDKDG SPLPTDASGN YIIVPSGELT
3560 3570 3580 3590 3600
MASHPTDDTG NVIYPITKPD GTLLSTDSTG SFVTEDGQII EKDDEGKPLG
3610 3620 3630 3640 3650
PNGEALPTDD LGNYIYPITD SDEQTSPTED VGTSVHLVRG PDGTPLPTDA
3660 3670 3680 3690 3700
SGSAIGPDGE VIPTDENGVP LDKDGSPLPT DNNGNYVLVP TKESVTKILP
3710 3720 3730 3740 3750
TDDSEAVVHP ITRQDGTPLS TDSTGNFVTD NGEIIEKDDE GRPVGPDGQV
3760 3770 3780 3790 3800
LSTDVSGNFV YPVTESPNDG EKPIHPVLGP DGSPLPTDDS GAVIGPDGEV
3810 3820 3830 3840 3850
IPTDASGVPL SKLGLPLPTD SDGNYIILSS DTDVTKELPT DDTGNVIYPI
3860 3870 3880 3890 3900
TKPDGTPLGT DTSGSFVSDD GQIIEKDDDG KPLGPDGQVL PTDATGNFIY
3910 3920 3930 3940 3950
PVLGPDGQAL PTDESGKTVY PVRRPDGNPL PTDASGAVIG PGGEPIPTDS
3960 3970 3980 3990 4000
SGKPLSADGS PLPTDASGNY VLVPSDEVTA KELSTDESGT IVYPVTRADG
4010 4020 4030 4040 4050
TPLATDSTGS FVTDEGQTIE KDDEGKPLGP DGQVLPTYAS GNYIYPVIGP
4060 4070 4080 4090 4100
DGQALPTDES GKTVYPVRGP DGTPLPTDVS GAVIGPDGEV IPTDSNGIPL
4110 4120 4130 4140 4150
SQDGTPLPTD NQGNYILVPT SETATKALPT DESGNVIYPI TKADGTPLAT
4160 4170 4180 4190 4200
DSTGTFVTDD GQIIEKDDEG KPLGPDGQVL PTDDSGNYIY PVVGPDGQTD
4210 4220 4230 4240 4250
ESGKTVYPVR GSHPTDDTGN VIYPITKPDG TLLATDSTGS FVTEDGQIIE
4260 4270 4280 4290 4300
KDDEGKPLGP DGQVLPTDES GNYVYPEVKS DEQLLPTDHT GKTVYPVHGP
4310 4320 4330 4340 4350
DGTPLPTDDS GAIIGPDGEV IPTDENGIPL SKDGSQLPTD NNGNYVLVPS
4360 4370 4380 4390 4400
DEGATKTHPT DETSDAVHPI TKPDGTPLAT DSTGNFVTEN GDVITKDEEG
4410 4420 4430 4440 4450
KPLGPNGQIL PTDASGNYIY PVIGPDGQAL PTDESGKTVY PVRGPDGTPL
4460 4470 4480 4490 4500
PTDASGAVIS PDGEVIPTDA NGIPLDKDGS PLPTDASGNY ILVPSEQDIT
4510 4520 4530 4540 4550
KTLPTDDSGN VIYPITKPDG TSLATDSTGS FVTEGGEIVE RDEDGKPLGP
4560 4570 4580 4590 4600
DGQVLPTDAS GNYIYPVVGP DGQVLPTDDT GKTVYPVYGP DGIPLSTDAS
4610 4620 4630 4640 4650
GAVIGPDGEP IPTDASGRPL DKDGSFLSTD ASGNYILVPS DAPTNEAGPV
4660 4670 4680 4690 4700
VVQHQITRPD GTPLATDSSG HFVTEDGVII ENDKEGRPIG ADGQVLPTDA
4710 4720 4730 4740 4750
SNNYIFTDVP TQGYAVFIPT DVVPIELEAP NCDQVDGRVD TLLFVVESSH
4760 4770 4780 4790 4800
TSAPYLDTLK KLIENLLLTT PRDFLPKIGT LIYSATTEIT IDIGSYGDFK
4810 4820 4830 4840 4850
ELFDSTNEIR EIGGIPDVTN ALRTAKMILE ETSRGDTLVL HLLASPMRTS
4860 4870 4880 4890 4900
SKVYTERIRA LPNTRLIHLN EKQWAEDPNA VELLRSHLCI PSEVPLPSMM
4910 4920 4930 4940 4950
PTDASGNLLS IPTDEVVTDG TPTDESGFVI YPITKPDGTP LATDSTGSFV
4960 4970 4980 4990 5000
TEDGQIIEKN EDGKPLGPDG QVLPTDNSGN YIYPIVGPDG QALPTDASGK
5010 5020 5030 5040 5050
PIYPVRGPDG TPLPTDASGA VIGPDGEPIP TDASGKPLAQ DGSPLPVDNE
5060 5070 5080 5090 5100
GNYIILPTQQ VDTKEYPTDE TGNVIVPITK PDGTLLPTDS TGSFVTENGD
5110 5120 5130 5140 5150
RIEFNEEGKP LGPDGEVLAT DASGNYVYPG SVVEPTAEPQ EVTHGPDGQV
5160 5170 5180 5190 5200
LPTDASGKPI YPVRGPDGIP LPTDASGAAI GPDGETIATD ENGIPLSKDG
5210 5220 5230 5240 5250
SPLPTDNTGN YVLVPSDEGA TEEKPTQGSE SIVHPITKPD GTPLATDSTG
5260 5270 5280 5290 5300
SFVTDDDQVI AKDEDGKPIG PDGQVLPTDS SGNYIYPVIG PDGQALPTDE
5310 5320 5330 5340 5350
SGKTVYPVRG PDGTPLSTDA SGAVIGSDGK PIPTDETGLP LNKDGSPLPT
5360 5370 5380 5390 5400
DNDGNYILIP ADESVVKALP TDEAKEVYPI VQPDGTPLAT DSSGNFVTSS
5410 5420 5430 5440 5450
GDIIDIDDEG KPLGPDGQAL PTDDSGNYIY PVIGPDGQAL PTDESGKTVY
5460 5470 5480 5490 5500
PIRGPDGTPL PTDASGAVIG PDGEPIPTDA SGKPLSQDGS PLPTDASGNY
5510 5520 5530 5540 5550
ILVPSDGEVT KTLPTDDVGN VIYPITKPDG TPLATDSTGS FVTDDGQIIE
5560 5570 5580 5590 5600
KDDEGKPLGP DGQVLSTDDS GNYIYPAVGP NGQTIPTDDT GRTVYPVRGP
5610 5620 5630 5640 5650
DGTPLPTDAS GAVIGPDGEP IPTDASGKPL SADGSPLPTD NNGNYVIVPT
5660 5670 5680 5690 5700
DGSTVKSHPT DDSGNTIYPV VNEDGTPLST DLSGNFLTNS GEIVDRDDEG
5710 5720 5730 5740 5750
KPLGPDGQTL PTDASGNYVY LQKVEETTKP LPTDESGNIV YPITKPDGTP
5760 5770 5780 5790 5800
LATDSTGSFV TEDGTVIEKD DEGKPVGPDG QVLPTDESGN YIYPDVTPDG
5810 5820 5830 5840 5850
QVQPTDVSGK PVYPVRGPDG STLPTDASGA ALGPDGKPIP TDSNGVPLSE
5860 5870 5880 5890 5900
DGSPLPTDNQ GNYVLVPTSE TVTKSMPTDD NRNVIYPITM SDGSLLSTDS
5910 5920 5930 5940 5950
TGSFVTEDGK VIEKDDEGKP LGPDGQVLPT DASGNYIYPV HGQDGTPLPT
5960 5970 5980 5990 6000
DASGAVIGPD GSPLPTDDSG AVIGPDGEVI PTDSNGIPLN KDGLPLSTDA
6010 6020 6030 6040 6050
SGNYIVVSAE QPGEEIKEIP ITKPDGTLLS TDSTGNFITE NGEIIERDDE
6060 6070 6080 6090 6100
GKPIGPDGQI LPTDASGNYV YPVIGPDGQG LPTDESGKTI YPVRGPDGTP
6110 6120 6130 6140 6150
LPTDASGAVI GPDGEPIPTD ASGKPLSQDG SLLPTDNNGN YVLLPSNEET
6160 6170 6180 6190 6200
TQGLTTDESV NVIYPITKPD GTPLATDSTG NFVTDNGETI EKDEEGKPIG
6210 6220 6230 6240 6250
PDGQTLPTDD SGNYIYPVVG PDGQALPTDE SGKTIYPVHG PDGTPLPTDA
6260 6270 6280 6290 6300
SGASIGPDGE PIPTDTSGKP LFKDGSPLPT DSNGNFIIVP SEKRMDEELP
6310 6320 6330 6340 6350
TDDSGKIIYP ITKPDGTPLA SDSTGVFVTE DGTIIEKDDD GKPLGPDGQV
6360 6370 6380 6390 6400
LPTDASGNYI YPIVGPDGKT QPTDESEKTP YPVHGPDGTP LPTDASGAVI
6410 6420 6430 6440 6450
GPDGEPIPTD ASGKPLSADG SPLPTDNNGK YVLVPADEVT TKVLPTDDSG
6460 6470 6480 6490 6500
NVVHPITRPD GTPLGTDASG SFITDDGQAI EKDDEGKPIG PDGQILPIDA
6510 6520 6530 6540 6550
SGNYIYPVIG PDGQALPTDE SGKTVYPVRG PDGTPLPTDA SGAVIGLDGE
6560 6570 6580 6590 6600
PIPTDASGKP LSRNGSPLST DSSGNYIFVP TDDEKKDSKK CDISSSLSDI
6610 6620 6630 6640 6650
IFVLVNDGDG AQNYDQFKKA VVGFSRKVDM SPDIIRLAVL SVGSEIAVPL
6660 6670 6680 6690 6700
PLGGYQEKEH LSSILNSFEI PPIVGTEILS PVQAANQQFT SFPRTGISKM
6710 6720 6730 6740 6750
VVIFADNEEK STFIGGATYI TVKYGTTPKD IINTLIEACE KGLVEIVPDD
6760 6770 6780 6790 6800
TKHVIDETVP TISSTPVIVD QSGKPLPTDA SGNYIDNNGK PIVIEGEEPT
6810 6820 6830 6840 6850
GPEDQKLSKN KKGEWVYPLV DKFGKPVETD DNDKPVITVV DNDGNELSKN
6860 6870 6880 6890 6900
DDGNWIDLSG NEIDTDELGR PLDSEGNPYK FDDNGHVVIA PQIEEEEETT
6910 6920 6930 6940 6950
PAIPFIIIDG EPINEDDGVY TDKDGNVIPT NSEGKPIDEN GQVLPKNEDG
6960 6970 6980 6990 7000
EFVKPKEADT TQSTIVSPDG SPLPTDASGA AIGPDGEPIP TDSNGRPLAK
7010 7020 7030 7040 7050
DGSPLPTDNN GRYVILPSGR YSGDTETTDE SGNVIYPIIN PDGTPLGTDS
7060 7070 7080 7090 7100
TGNYITSIGD IIERDDEGKP IGPDGQVLTT DASGNYIYPV VGPDGLILPT
7110 7120 7130 7140 7150
DATGKPIYPV RGPDGTPLPT DASGAVIGPN GEPIPTDASG KPLSQDGSPL
7160 7170 7180 7190 7200
PTDVNGNYIM LPSDEVTSQS LPTDESGNVI YPITKPDGTP LGTDSSGSFI
7210 7220 7230 7240 7250
TEDGQIIEKD DEGKPIGPDG QILSTDASGN YIYPDVGPDV QTLPTDGDMI
7260 7270 7280 7290 7300
SVPTVEATVE FTSDKTPEVI HSITKPDGTP LSTDSTGEFV TEDGQIIEKD
7310 7320 7330 7340 7350
DEGKPIGPDG QVLPTDASGN YIYPVIGLDG QALPTDKSGK TVYPVRGPNG
7360 7370 7380 7390 7400
TPLPTDASGA VIGLDGEPIP TDASGKPLSA DGSPLPTDAV GNYILVPSDD
7410 7420 7430 7440 7450
GVIRTHPTDE SGNTIYPITK PDGTPLATDS TGAFVTDDGQ VIEKDDEGKP
7460 7470 7480 7490 7500
IGPDGQVLPT DASGNYIYPV TSSDGQVLPT DAEKPVIVDQ SGKPLPTDAS
7510 7520 7530 7540 7550
GNYIDNNGKP IVIEGEEPTG PEDQKLSKNK KGEWVYPLVD KFGKPVETDD
7560 7570 7580 7590 7600
NDKPVITVVD NDGNELSKND DGNWIDLSGN EIDTDELGRP LDSEGNPYKF
7610 7620 7630 7640 7650
DDNGHVVIAP QIEEEEEATP AIPFIIIDGE PINEDDGVYT DKDGNVIPTN
7660 7670 7680 7690 7700
SEGKPIDENG QVLPKNEYGE FVKPKEADTT QSTIVSPDGS PLPTDASGAA
7710 7720 7730 7740 7750
IGPDGEPIPT DSSGRPISKD GSPLPTDASG NYILVPSGEG VTDSLPTDEA
7760 7770 7780 7790 7800
GNIIYPITKP DGTLLATDST GSFVADDGQI IEKDDEGKPI GPDGQVLPTD
7810 7820 7830 7840 7850
ASGNYIYPVI GPDGQALPTD ESGKTVYPVR GPDGTPLPTD ASGAVIGPDG
7860 7870 7880 7890 7900
EPIPTDPSGK PLSADGSPLP TDINGNYVLV PSDESAAKVL PTDESGSVVY
7910 7920 7930 7940 7950
PITKPDGTPL GTDASGSFVT DDGQAIGKDD EGKPIGPDGQ TLPIDDSGNY
7960 7970 7980 7990 8000
IYPVVGPDGQ ALPTDESGKT VYPVLGPDGI PLPTDASGAV IGPDGEIIPT
8010 8020 8030 8040 8050
DASGKPLSAD GSPLPTDNNG NYVLVPADEV TTKVLPTDDS GNVVHPITRP
8060 8070 8080 8090 8100
DGTPLGTDAS GSFVTDDGQA IEKDDEGKPI GPDGQVLPTD ASGNYIYPVI
8110 8120 8130 8140 8150
GPDGQALPTD KSGKTVYPVR GPDGTPLSTD ASGALIGLDG EPIPTDASGK
8160 8170 8180 8190 8200
PLSADGSPLP TDAVGNYILV PSDDGVIRTH PTDESGNTIY PITKPDGTPL
8210 8220 8230 8240 8250
ATDSTGAFVT DDGQVIEKDD EGKPIGPDGQ VLPTDASGNY IYPVTSSDGQ
8260 8270 8280 8290 8300
VLPTDAEKPV IVDQSGKPLP TDASGNYIDN NGKPIVIEGE EPTGPEDQKL
8310 8320 8330 8340 8350
SKNEKGEWVY PLVDKFGKPV ETDDNDKPVI TVVDNDGNEL SKNDDGNWID
8360 8370 8380 8390 8400
LSGNEIDTDE LGRPLDSEGN PYKFDDNGHV VIAPQIEEEE ETTPAIPFII
8410 8420 8430 8440 8450
IDGEPINEDD GVYTDKDGNV IPTNSEGKPI DENGQVLPKN EDGEFVKPKE
8460 8470 8480 8490 8500
ADTTQSTIVS PDGSPLPTDA SGAAIGPDGE PIPTDSSGRP ISKDGSPLPT
8510 8520 8530 8540 8550
DASGNYILVP SGEGVTDSLP TDEAGNIIYP ITKPDGTLLA TDSTGSFVAD
8560 8570 8580 8590 8600
DGQIIEKDDE GKPIGPDGQV LPTDASGNYI YPVIGPDGQA LPTDESGKTV
8610 8620 8630 8640 8650
YPVRGPDGTP LPTDASGAVI GPDGEPIPTD PSGKPLSADG SKLPTDINGN
8660 8670 8680 8690 8700
YVLVPADEVT TKVLPTDDSG NVVHPITRPD GTPLGTDASG SFITEDGQIV
8710 8720 8730 8740 8750
EKNDDGKPIG PDGQVLPTDS SDNYIYPSIG SDEQAMPTDT TGSVIYPLVS
8760 8770 8780 8790 8800
PDGTVIEGPP KVAKPVGPDG KVLPTDASGH FIGPDGPIPT DYGVTYSDTV
8810 8820 8830 8840 8850
TTPDGIPLSN DSTGAFITED GTVIENNEDG KPIGPDGQVL PTDAYGNYIY
8860 8870 8880 8890 8900
PAIGPDGQAL PTDESGNPVY PVRGPDGTPL PTDVSGAVIG PDGEPIPTDA
8910 8920 8930 8940 8950
SGKPLSADGG SPLPTDNNGN YVLVPADEVT TKVLPTDDSG NVVHPITRPD
8960 8970 8980 8990 9000
GTPLGTDASG SFVRDDGQAI EKDDEGKPIG PDGQVLPTDA SGNYIYPVIG
9010 9020 9030 9040 9050
PDGQALPTDE SGKTVYPVRG PDGTPLPTDA SGAVIGLDGE PIPTDASGKP
9060 9070 9080 9090 9100
LSAEGSPLPT DNNGNYVLVP ADEVTTKVLP TDDSGNVVHP ITRPDGTPLG
9110 9120 9130 9140 9150
TDASGSFVRD DGQAIEKDDE GKPIGPDGQV LPTDASGNYI YPVIGPDGQA
9160 9170 9180 9190 9200
LPTDESGKTV YPVRGPDGTP LPTDASGAVI GLDGEPIPTD ASGKPLSAEG
9210 9220 9230 9240 9250
SPLPTDNNGN YVLVPAHEVT TKVLPTDDSG NVVHPITRPD GTPLGTDASG
9260 9270 9280 9290 9300
SFVTDDGQAI EKDDEGKPIG PDGQVLPTDA SGNYIYPVTS SDGQVLPTDA
9310 9320 9330 9340 9350
EKPVIVDQSG KPLPTDASGN YIDNNGKPIV IEGEEPTGPE DQKLSKNEKG
9360 9370 9380 9390 9400
EWVYPLVDKF GKPVETDDND KPVITVVDND GNELSKNDDG NWIDLSGNEI
9410 9420 9430 9440 9450
DTDELGRPLD SEGNPYKFDD NGHVVIAPQI EEEEEATPAI PFIIIDGEPI
9460 9470 9480 9490 9500
NEDDGVYTDK DGNVIPTNSE GKPIDENGQV LPKNEDGEFV KPKEADTTQS
9510 9520 9530 9540 9550
TIVSPDGSPL PTDASGAAIG PDGEPIPTDS NGRPLAKDGS PLPTDNNGRY
9560 9570 9580 9590 9600
VILPSGRYSG DTETTDESGN VIYPIINPDG TPLGTDSTGN YITSIGDIIE
9610 9620 9630 9640 9650
RDDEGKPIGP DGQVLTTDAS GNYIYPVVGP DGLILPTDAT GKPIYPVRGP
9660 9670 9680 9690 9700
DGTPLPTDAS GAVIGPNGEP IPTDASGKPL SQDGSPLPTD VNGNYIMLPS
9710 9720 9730 9740 9750
DEVTSQSLPT DESGNVIYPI TKPDGTPLGT DSSGSFITED GQIIEKDDEG
9760 9770 9780 9790 9800
KPIGPDGQIL STDASGNYIY PDVGPDVQTL PTDGDMISVP TVEATVEFTS
9810 9820 9830 9840 9850
DKTPEVIHSI TKPDGTPLST DSTGEFVTED GQIIEKDDEG KPIGPDGQVL
9860 9870 9880 9890 9900
PTDASGNYIY PVIGLDGQAL PTDKSGKTVY PVRGPNGTPL PTDASGAVIG
9910 9920 9930 9940 9950
LDGEPIPTDA SGKPLSADGS PLPTDAVGNY ILVPSDDGVI RTHPTDESGN
9960 9970 9980 9990 10000
TIYPITKPDG TPLATDSTGA FVTDDGQVIE KDDEGKPIGP DGQVLPTDAS
10010 10020 10030 10040 10050
GNYIYPVTSS DGQVLPTDAE KPVIVDQSGK PLPTDASGNY IDNNGKPIVI
10060 10070 10080 10090 10100
EGEEPTGPED QKLSKNKKGE WVYPLVDKFG KPVETDDNDK PVITVVDNDG
10110 10120 10130 10140 10150
NELSKNDDGN WIDLSGNEID TDELGRPLDS EGNPYKFDDN GHVVIAPQIE
10160 10170 10180 10190 10200
EEEEATPAIP FIIIDGEPIN EDDGVYTDKD GNVIPTNSEG KPIDENGQVL
10210 10220 10230 10240 10250
PKNEDGEFVK PKEADTTQST IVSPDGSPLP TDASGAAIGP DGEPIPTDSS
10260 10270 10280 10290 10300
GRPISKDGSP LPTDASGNYI LVPSGEGVTD SLPTDEAGNI IYPITKPDGT
10310 10320 10330 10340 10350
LLATDSTGSF VADDGQIIEK DDEGKPIGPD GQVLPTDASG NYIYPVIGPD
10360 10370 10380 10390 10400
GQALPTDESG KTVFPVRGPD GTPLPTDASG AVIGPDGEPI PTDPSGKPLS
10410 10420 10430 10440 10450
ADGSPLPTDI NGNYVLVPSD ESAAKVLPTD ESGSVVYPIT KPDGTPLGTD
10460 10470 10480 10490 10500
SSGSYITEDG QLVGKDEEGK PVGPDGQVLP TDSAGHYVYP ITGADRQILT
10510 10520 10530 10540 10550
TDAAGKPIYS VFNEDGIQLP TDSSGYAIGH DGELVPTEST NGVPLNKDGT
10560 10570 10580 10590 10600
PLPTNDSGHF VLVLPGATVN DSKPTDEVIV SITNPDGTLL GTDSTGAFVT
10610 10620 10630 10640 10650
EDGPIIENDD EGKPVGPDGQ VLPTDDSGNY IYPVIGPDGQ ALPTDESGKT
10660 10670 10680 10690 10700
VYPIRGPDGT PLPTDASGAS IGPDGEPIPT DASGKPLSKD GSPLPTDNDG
10710 10720 10730 10740 10750
HYVLVPVDDS TIKAFPTDES GNVAYPITRP DGTPLGTDSS GSFVTDDGTI
10760 10770 10780 10790 10800
IENDDEGKPI GPDGQVLPTD ASGNYIYPVI GPDGQALPTD ESGKTVYPVH
10810 10820 10830 10840 10850
GPDGTPLPTD ASGAAIGPDG EPIPTDASGK PLSQDGSALP TDNNGNFILV
10860 10870 10880 10890 10900
PSDKSTTKTL PTDESGNFIY PITKPDGVLF ATDSTGNYVT DEGELIEKDD
10910 10920 10930 10940 10950
NGYPLGPDKR VLPTDGSGNY IYPAVGSDEK ILPTDNLGKV VYPITRPDGS
10960 10970 10980 10990 11000
PLATDSTGVF VTGDGTIVER NEEGKPIGPD GQVLTTDNSG NYIYPVIGPD
11010 11020 11030 11040 11050
GEPLGTDASG KTVYPVRGPD GTPLATDAFG AVIGPDGEPI PTDASGKPLD
11060 11070 11080 11090 11100
QSGFPLPTDN NGNYILVPSD EALGKILPTD ENGNVVYSVT NPDGTPLATD
11110 11120 11130 11140 11150
STGSFIASNG LIVEKDDEGK PIGPDGQVLP TDASGNYIYP VIGPDGQALP
11160 11170 11180 11190 11200
TDESGKPIYP VFTEDGTQLP TDSTGFAIGP DGELVPTDSA NGVPLSKDGS
11210 11220 11230 11240 11250
PLPTDASGNY ILPDSGVTTA NPTDENGYAI YPITKPDGTL LATDSTGSYI
11260 11270 11280 11290 11300
TQGGQLIEKD NTGKPIGPDG QVLPTDGSGN YVYPVVGPDG QALPTDDTGN
11310 11320 11330 11340 11350
VVYPVINADG SLLATDSSGS FITENGKIVA KDDEGKPISP DGQVLPTDAS
11360 11370 11380 11390 11400
GNYIYPALGP DGSILPTDSN GKSIYPVRGP DGTPLPTDEF GFAIGPDGKP
11410 11420 11430 11440 11450
IPTDTSGKPL SADGSPLPTD NNGNYILVLS EGVTEHAPTD ENGNVIYPVT
11460 11470 11480 11490 11500
NPDGTPLGTD SSGAFITQDG TVVKKDEDGK PIGPDGQVLP TDNSGNYIYP
11510 11520 11530 11540 11550
VIGPDGQVLP TDASGKTVHS VYGPDGTQLP TDASGSAIGP DGELVPTDVS
11560 11570 11580 11590 11600
GRPLSQDGSP LPTDNNGNYA LVVSDEATTK VLPTDEGGNV IYHITKPDGS
11610 11620 11630 11640 11650
LLGTDASGDF ITDHGKAVQK DDEGKPIGPD GSVLPTDTSG NYIYPITGPD
11660 11670 11680 11690 11700
GNVLPTDSNG KPVYPVFNED GTQLPTDSTG SAIDQDGELV STDSTSGVPL
11710 11720 11730 11740 11750
AKDGSPLPTN SAGNYVLVSS GKSQPTDEHG NVIYPITKPD GTLLATDSTG
11760 11770 11780 11790 11800
SYLTEDGQLV EIDDSGKPLG SDGQVLPIDA SGNYIYPALG PDGQALPTDD
11810 11820 11830 11840 11850
AGNLVYPIVY PDGTPLATES TGNYVTENGE VVGKNTDGKP ISPDGQVLPT
11860 11870 11880 11890 11900
DASGNYIYPA VGPDGQVLPT DASGKLIYPV FHPDGTQLPT DASGYAVAPD
11910 11920 11930 11940 11950
GSLIPTEFSG KPLGKDGSVL PTDNSGRYVL VHDDREVTQT IPTDESGNTI
11960 11970 11980 11990 12000
YPITRPDGTL LSTDSTGIYL TDEGNVIDRD NEGKPLGPDG QVLPTDGYGN
12010 12020 12030 12040 12050
FVYPADSDIG GAKLLPTDEY GHTLYPVIRP DGSLLSTESS GSFVTDDGTV
12060 12070 12080 12090 12100
VSKDSDGKPL GPAGQVLPTD ASGNYIYPSI GPDGSPLPTD INGKPAYTVI
12110 12120 12130 12140 12150
GRYGDVLPTD SLGRAVNIDG SVVPTDDEGL PIDQYGVVLP TDTTRKLHTL
12160 12170 12180 12190 12200
VPTRRPSSFC YVTSHIDLLL VIDSSNNIKV LDYRVMKELI KNFLTEHFNL
12210 12220 12230 12240 12250
RKHQVRVGLV KYGDGAEIPV SLGDYDNEDD LVHRISESRR LKGRAQLGAG
12260 12270 12280 12290 12300
LREALDELSI SGVDGVPQIV LIVKNGKASD DYSSAVKSLK AERNVTVFVV
12310 12320 12330 12340 12350
DAGDDESQQQ NSELTEEDKT IVISQWRGAD SEVLGPIADY ICKIVPNVET
12360 12370 12380 12390 12400
SRTWPTPRTK ATTTSGTGRS CSSIDYESDV IIVLDSSENF TPDEFVSMKD
12410 12420 12430 12440 12450
AVASIVDTGF DLAPDVSKIG FVIYSDKVAV PVALGHYEDK IELLEKITDA
12460 12470 12480 12490 12500
EKINDGVAIA LYGLNAARQQ FQLHGRENAT KVVILITNGK NRGNAAAAAE
12510 12520 12530 12540 12550
DLRDMYGVQL FAVAVGSNPE ELATIKRLVG NSNTENVIEV AQSTEIDDDA
12560 12570 12580 12590 12600
AALLKAVCGN TSPKNSEMPA HLTTKRDVLA QKFTTAPMLR TTRAVAGGLC
12610 12620 12630 12640 12650
NDGIRRPYHF NILVDITSRA SADEFRRVLD HLINFFNDRM RDEQHMITIN
12660 12670 12680 12690 12700
IITVNSDKVQ NILSNLRADQ LSEQLNAITQ QSDDTVSPKL GAGIDALAEL
12710 12720 12730 12740 12750
SKENYINGAI KLMLIVGSDG TSSDDALPAA EYANSDFQHN IIAVSVRKPA
12760 12770 12780 12790 12800
TDLLSKIAGL PTRVVHLDQW SAPNELFDSW IAYITCDYAT ASTTRKSTTP
12810 12820 12830 12840 12850
KMTTLRPYDR KASKEDATNI ELIPLSPSSL SVSWTCCTNN KSNYTILYTH
12860 12870 12880 12890 12900
DTSITKEKWI RKEVTCRDSF GTHLNELPSD HTYTVCVMTN ERVDNSTALA
12910 12920 12930 12940 12950
IDKNCDSLHI DQNTTAPEDY VKPSPSSCNC QCSEGKAVLR ATCEMVIDTN
12960 12970 12980 12990 13000
RPIATLPPAT VDECPCKVKA HGGRCPKGYI AKDGQCYDID ECETNNGQCS
13010 13020 13030 13040 13050
EGCVNTPGSY YCACPHGMMR DPLDPFNCVN TANSFDKIAA LLANYLEANT
13060 13070 13080 13090 13100
KNSGSEVTSE KSDGGRVNYK ATIKSADDKT ITFEWSHVPE VVRRAFKWLF
Length: 13,100
Mass (Da): 1,373,302
Last modified: March 1, 2003 - v2
Checksum: E13DCBF28C7EA6FE
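
The sequence above is displayed in blocks of ten residues under position rulers, so it cannot be used as a plain string without cleanup. The following is a minimal sketch, assuming ruler lines contain only numbers and residue lines contain only letters; the function name `clean_sequence_block` is hypothetical.

```python
def clean_sequence_block(text: str) -> str:
    """Strip ruler lines and whitespace from a displayed sequence block."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines contain only position numbers (10, 20, ...); skip them.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues).upper()


if __name__ == "__main__":
    # First two rows of the sequence shown in this entry.
    block = """
            10         20         30         40         50
    MGGRNWLFRS AVLVSTLLTC ISIAQELLPS IEVESLAQDL QIQEWMRTLR
    60 70 80 90 100
    RVKRAPTRNN RPEPVVVGRN GTGKCVISAD RASHFCGMEE EVSAPPIPPP
    """
    seq = clean_sequence_block(block)
    print(len(seq))   # 100
    print(seq[:24])   # residues 1-24, annotated as the signal peptide
```

Applied to the whole block, the length of the result can be compared with the stated Length of 13,100 as a check that nothing was lost in copying.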

Sequence databases

EMBL / GenBank / DDBJ: AY117398 mRNA. Translation: AAM78593.1.
    FO081605 Genomic DNA. Translation: CCD72779.1.
PIR: T16580.
RefSeq: NP_741200.1. NM_171172.3.
UniGene: Cel.20013.

Genome annotation databases

EnsemblMetazoa: K07E12.1a.1; K07E12.1a.1; WBGene00000998.
    K07E12.1a.2; K07E12.1a.2; WBGene00000998.
GeneID: 175951.
KEGG: cel:CELE_K07E12.1.
UCSC: K07E12.1a.1. c. elegans.

Cross-references

Organism-specific databases

CTD: 175951.
WormBase: K07E12.1a; CE32905; WBGene00000998; dig-1.

Miscellaneous databases

NextBio: 890446.
PRO: Q09165.

Publications

  1. "Evolution and developmental functions of mesocentin - a novel Caenorhabditis elegans extracellular matrix protein."
    Proenca R.B., Hedgecock E.M.
    Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [MRNA], SUBCELLULAR LOCATION.
  2. "Genome sequence of the nematode C. elegans: a platform for investigating biology."
    The C. elegans sequencing consortium
    Science 282:2012-2018(1998)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: Bristol N2.
  3. "The C. elegans gene dig-1 encodes a novel adhesion molecule of the immunoglobulin superfamily that functions during sensory process development in the nervous system."
    Burket C., Higgins C.E., Hull L.C., Hubbard S., Berninsone P., Ryder E.F.
    Unpublished observations (APR-2006)
    Cited for: FUNCTION.
  4. "Identification of the hydrophobic glycoproteins of Caenorhabditis elegans."
    Fan X., She Y.-M., Bagshaw R.D., Callahan J.W., Schachter H., Mahuran D.J.
    Glycobiology 15:952-964(2005)
    Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-518 AND ASN-575, IDENTIFICATION BY MASS SPECTROMETRY.
  5. "Proteomics reveals N-linked glycoprotein diversity in Caenorhabditis elegans and suggests an atypical translocation mechanism for integral membrane proteins."
    Kaji H., Kamiie J., Kawakami H., Kido K., Yamauchi Y., Shinkawa T., Taoka M., Takahashi N., Isobe T.
    Mol. Cell. Proteomics 6:2100-2109(2007)
    Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-324; ASN-384; ASN-518; ASN-575; ASN-841; ASN-12294; ASN-12478 AND ASN-12843, IDENTIFICATION BY MASS SPECTROMETRY.
    Strain: Bristol N2.

Entry information

Entry name: DIG1_CAEEL
Accession: Primary (citable) accession number: Q09165
Secondary accession number(s): Q8MTB9
Entry history
Integrated into UniProtKB/Swiss-Prot: May 2, 2006
Last sequence update: March 1, 2003
Last modified: May 11, 2016
This is version 128 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Caenorhabditis annotation project

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Caenorhabditis elegans
    Caenorhabditis elegans: entries, gene names and cross-references to WormBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.