Q09165 (DIG1_CAEEL) Reviewed, UniProtKB/Swiss-Prot
Last modified
May 1, 2013.
Version 104.
History...
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Mesocentin | ||||
| Gene names |
| ||||
| Organism | Caenorhabditis elegans [Reference proteome] | ||||
| Taxonomic identifier | 6239 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Ecdysozoa › Nematoda › Chromadorea › Rhabditida › Rhabditoidea › Rhabditidae › Peloderinae › Caenorhabditis![]() |
Protein attributes
| Sequence length | 13100 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Function | Encodes an adhesion molecule involved in sensory map formation. Functions during sensory process development in the nervous system. Ref.3 |
| Subcellular location | Secreted › extracellular space › extracellular matrix Ref.1. |
| Sequence similarities | Contains 2 EGF-like domains. Contains 11 fibronectin type-III domains. Contains 7 Ig-like C2-type (immunoglobulin-like) domains. Contains 1 Sushi (CCP/SCR) domain. Contains 4 VWFA domains. |
Ontologies
| Keywords | |
|---|---|
| Biological process | Cell adhesion |
| Cellular component | Extracellular matrix Secreted |
| Domain | EGF-like domain Immunoglobulin domain Repeat Signal Sushi |
| Molecular function | Developmental protein |
| PTM | Disulfide bond Glycoprotein |
| Technical term | Complete proteome Reference proteome |
| Gene Ontology (GO) | |
| Biological_process | cell adhesion Inferred from electronic annotation. Source: UniProtKB-KW gonad developmentInferred from mutant phenotype PubMed 2401010PubMed 9486792. Source: WormBase metabolic processInferred from electronic annotation. Source: GOC nervous system developmentInferred from mutant phenotype PubMed 10409513. Source: WormBase |
| Cellular_component | proteinaceous extracellular matrix Inferred from electronic annotation. Source: UniProtKB-SubCell |
| Molecular_function | calcium ion binding Inferred from electronic annotation. Source: InterPro catalytic activityInferred from electronic annotation. Source: InterPro |
| Complete GO annotation... | |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 24 | 24 | Potential | ||||||||
| Chain | 25 – 13100 | 13076 | Mesocentin | PRO_0000233334 | |||||||
Regions | |||||||||||
| Domain | 162 – 222 | 61 | Sushi | ||||||||
| Domain | 225 – 263 | 39 | EGF-like 1 | ||||||||
| Domain | 305 – 392 | 88 | Ig-like C2-type 1 | ||||||||
| Domain | 394 – 488 | 95 | Fibronectin type-III 1 | ||||||||
| Domain | 491 – 577 | 87 | Ig-like C2-type 2 | ||||||||
| Domain | 581 – 674 | 94 | Fibronectin type-III 2 | ||||||||
| Domain | 681 – 778 | 98 | Ig-like C2-type 3 | ||||||||
| Domain | 782 – 875 | 94 | Fibronectin type-III 3 | ||||||||
| Domain | 892 – 983 | 92 | Ig-like C2-type 4 | ||||||||
| Domain | 991 – 1091 | 101 | Fibronectin type-III 4 | ||||||||
| Domain | 1094 – 1194 | 101 | Fibronectin type-III 5 | ||||||||
| Domain | 1203 – 1280 | 78 | Ig-like C2-type 5 | ||||||||
| Domain | 1284 – 1381 | 98 | Fibronectin type-III 6 | ||||||||
| Domain | 1386 – 1490 | 105 | Fibronectin type-III 7 | ||||||||
| Domain | 1500 – 1576 | 77 | Ig-like C2-type 6 | ||||||||
| Domain | 1582 – 1694 | 113 | Fibronectin type-III 8 | ||||||||
| Domain | 1699 – 1803 | 105 | Fibronectin type-III 9 | ||||||||
| Domain | 1810 – 1904 | 95 | Ig-like C2-type 7 | ||||||||
| Domain | 2005 – 2097 | 93 | Fibronectin type-III 10 | ||||||||
| Domain | 6599 – 6787 | 189 | VWFA 1 | ||||||||
| Domain | 12167 – 12341 | 175 | VWFA 2 | ||||||||
| Domain | 12379 – 12557 | 179 | VWFA 3 | ||||||||
| Domain | 12609 – 12780 | 172 | VWFA 4 | ||||||||
| Domain | 12817 – 12905 | 89 | Fibronectin type-III 11 | ||||||||
| Domain | 12988 – 13029 | 42 | EGF-like 2; calcium-binding Potential | ||||||||
| Compositional bias | 6894 – 6898 | 5 | Poly-Glu | ||||||||
| Compositional bias | 7613 – 7617 | 5 | Poly-Glu | ||||||||
| Compositional bias | 8387 – 8391 | 5 | Poly-Glu | ||||||||
| Compositional bias | 9431 – 9435 | 5 | Poly-Glu | ||||||||
| Compositional bias | 10150 – 10154 | 5 | Poly-Glu | ||||||||
Amino acid modifications | |||||||||||
| Glycosylation | 70 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 178 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 324 | 1 | N-linked (GlcNAc...) Ref.5 | ||||||||
| Glycosylation | 384 | 1 | N-linked (GlcNAc...) Ref.5 | ||||||||
| Glycosylation | 518 | 1 | N-linked (GlcNAc...) Ref.4 Ref.5 | ||||||||
| Glycosylation | 575 | 1 | N-linked (GlcNAc...) Ref.4 Ref.5 | ||||||||
| Glycosylation | 841 | 1 | N-linked (GlcNAc...) Ref.5 | ||||||||
| Glycosylation | 931 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 1112 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 1211 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 1399 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 1612 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 1840 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 1879 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 1898 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 8810 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 10555 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 10570 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 12294 | 1 | N-linked (GlcNAc...) Ref.5 | ||||||||
| Glycosylation | 12478 | 1 | N-linked (GlcNAc...) Ref.5 | ||||||||
| Glycosylation | 12840 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 12843 | 1 | N-linked (GlcNAc...) Ref.5 | ||||||||
| Glycosylation | 12895 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 12913 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Disulfide bond | 193 ↔ 220 | By similarity | |||||||||
| Disulfide bond | 327 ↔ 374 | By similarity | |||||||||
| Disulfide bond | 517 ↔ 561 | By similarity | |||||||||
| Disulfide bond | 705 ↔ 762 | By similarity | |||||||||
| Disulfide bond | 1226 ↔ 1264 | By similarity | |||||||||
| Disulfide bond | 1523 ↔ 1562 | By similarity | |||||||||
| Disulfide bond | 1839 ↔ 1888 | By similarity | |||||||||
Sequences
| ||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "Evolution and developmental functions of mesocentin - a novel Caenorhabditis elegans extracellular matrix protein." Proenca R.B., Hedgecock E.M. Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [MRNA], SUBCELLULAR LOCATION. |
| [2] | "Genome sequence of the nematode C. elegans: a platform for investigating biology." The C. elegans sequencing consortium Science 282:2012-2018(1998) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. Strain: Bristol N2. |
| [3] | "The C. elegans gene dig-1 encodes a novel adhesion molecule of the immunoglobulin superfamily that functions during sensory process development in the nervous system." Burket C., Higgins C.E., Hull L.C., Hubbard S., Berninsone P., Ryder E.F. Unpublished observations (APR-2006) Cited for: FUNCTION. |
| [4] | "Identification of the hydrophobic glycoproteins of Caenorhabditis elegans." Fan X., She Y.-M., Bagshaw R.D., Callahan J.W., Schachter H., Mahuran D.J. Glycobiology 15:952-964(2005) [PubMed] [Europe PMC] [Abstract] Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-518 AND ASN-575, MASS SPECTROMETRY. |
| [5] | "Proteomics reveals N-linked glycoprotein diversity in Caenorhabditis elegans and suggests an atypical translocation mechanism for integral membrane proteins." Kaji H., Kamiie J., Kawakami H., Kido K., Yamauchi Y., Shinkawa T., Taoka M., Takahashi N., Isobe T. Mol. Cell. Proteomics 6:2100-2109(2007) [PubMed] [Europe PMC] [Abstract] Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-324; ASN-384; ASN-518; ASN-575; ASN-841; ASN-12294; ASN-12478 AND ASN-12843, MASS SPECTROMETRY. Strain: Bristol N2. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | AY117398 mRNA. Translation: AAM78593.1. FO081605 Genomic DNA. Translation: CCD72779.1. |
| PIR | T16580. |
| RefSeq | NP_741200.1. NM_171172.3. |
| UniGene | Cel.20013. |
3D structure databases | |
| ProteinModelPortal | Q09165. |
| SMR | Q09165. Positions 299-383, 503-580, 641-670, 681-876, 991-1198, 1215-1240, 1293-1428, 1512-1539, 1595-1622, 1664-1692, 1826-1899, 2003-2099, 12930-13014. |
| ModBase | Search... |
Proteomic databases | |
| PaxDb | Q09165. |
| PRIDE | Q09165. |
Protocols and materials databases | |
| StructuralBiologyKnowledgebase | Search... |
Genome annotation databases | |
| EnsemblMetazoa | K07E12.1a.1; K07E12.1a.1; K07E12.1. K07E12.1a.2; K07E12.1a.2; K07E12.1. |
| GeneID | 175951. |
| KEGG | cel:CELE_K07E12.1. |
| UCSC | K07E12.1a.1. c. elegans. |
Organism-specific databases | |
| CTD | 175951. |
| WormBase | K07E12.1a; CE32905; WBGene00000998; dig-1. |
Phylogenomic databases | |
| eggNOG | NOG12793. |
| GeneTree | ENSGT00670000099008. |
| HOGENOM | HOG000020923. |
| InParanoid | Q09165. |
| OMA | DGQINIT. |
Gene expression databases | |
| ArrayExpress | Q09165. |
Family and domain databases | |
| Gene3D | 2.120.10.30. 53 hits. 2.60.40.10. 17 hits. |
| InterPro | IPR011042. 6-blade_b-propeller_TolB-like. IPR000742. EG-like_dom. IPR001881. EGF-like_Ca-bd. IPR018097. EGF_Ca-bd_CS. IPR003961. Fibronectin_type3. IPR007110. Ig-like_dom. IPR013783. Ig-like_fold. IPR013098. Ig_I-set. IPR003599. Ig_sub. IPR003598. Ig_sub2. IPR011047. Quinonprotein_ADH-like_supfam. IPR011041. Quinoprot_gluc/sorb_DH. IPR000436. Sushi_SCR_CCP. IPR002035. VWF_A. [Graphical view] |
| Pfam | PF07645. EGF_CA. 1 hit. PF00041. fn3. 9 hits. PF07679. I-set. 1 hit. PF00092. VWA. 4 hits. [Graphical view] |
| SMART | SM00032. CCP. 1 hit. SM00181. EGF. 1 hit. SM00179. EGF_CA. 1 hit. SM00060. FN3. 11 hits. SM00409. IG. 2 hits. SM00408. IGc2. 1 hit. SM00327. VWA. 4 hits. [Graphical view] |
| SUPFAM | SSF57535. Complement_control_module. 1 hit. SSF49265. FN_III-like. 10 hits. SSF50998. Quin_alc_DH_like. 4 hits. SSF50952. Quino_gluc_DH. 19 hits. |
| PROSITE | PS00010. ASX_HYDROXYL. 1 hit. PS00022. EGF_1. 1 hit. PS01186. EGF_2. 1 hit. PS50026. EGF_3. False negative. PS01187. EGF_CA. 1 hit. PS50853. FN3. 11 hits. PS50835. IG_LIKE. 7 hits. PS50923. SUSHI. False negative. PS50234. VWFA. 4 hits. [Graphical view] |
| ProtoNet | Search... |
Other | |
| NextBio | 890446. |
Entry information
| Entry name | DIG1_CAEEL | ||||||||
| Accession | Primary (citable) accession number: Q09165 Secondary accession number(s): Q8MTB9 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Caenorhabditis annotation project | ||||||||
Relevant documents
| Recent format changes Overview of recent format changes |
| Caenorhabditis elegans Caenorhabditis elegans: entries, gene names and cross-references to WormBase |
| SIMILARITY comments Index of protein domains and families |

Clusters with
