Q60847 (COCA1_MOUSE) Reviewed, UniProtKB/Swiss-Prot
Last modified May 29, 2013. Version 121.
Names and origin
| Protein names | Recommended name: Collagen alpha-1(XII) chain |
| Gene names | |
| Organism | Mus musculus (Mouse) [Reference proteome] |
| Taxonomic identifier | 10090 [NCBI] |
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus › Mus |
Protein attributes
| Sequence length | 3120 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at transcript level |
General annotation (Comments)
| Function | Type XII collagen interacts with type I collagen-containing fibrils. The COL1 domain could be associated with the surface of the fibrils, and the COL2 and NC3 domains may be localized in the perifibrillar matrix (By similarity). |
| Subunit structure | Trimer of identical chains, each containing 190 kDa of non-triple-helical sequences (By similarity). |
| Subcellular location | Secreted › extracellular space › extracellular matrix (By similarity). |
| Tissue specificity | Highest expression in tendons, perichondrium, skin, cornea, sclera, blood vessels, and periosteum. |
| Developmental stage | The long NC3 XIIA isoforms are predominant at early stages (ED7 and ED11); at later stages of development (ED15 and ED17) the short NC3 XIIB forms become the major forms. As the short NC3 forms become the major product, the long splice variant continues to be expressed in several tissues, even after birth. The long NC1 isoforms, XIIA-1 and XIIB-1, peak in 15-day-old embryos and decrease in 17-day-old ones. The expression of the short NC1 form XIIB-2 remains constant throughout late stages of embryonic development (ED15 and ED17). |
| Post-translational modification | The triple-helical tail is stabilized by disulfide bonds at each end (By similarity). Hydroxylation of proline residues within the sequence motif GXPG is most likely 4-hydroxylation, as this fits the requirement for 4-hydroxylation in vertebrates (By similarity). Isoform 2 is O-glycosylated with a glycosaminoglycan of the chondroitin sulfate type (By similarity). |
| Sequence similarities | Belongs to the fibril-associated collagens with interrupted helices (FACIT) family. Contains 4 collagen-like domains. Contains 18 fibronectin type-III domains. Contains 1 laminin G-like domain. Contains 4 VWFA domains. |
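The GXPG hydroxylation motif mentioned above can be located with a simple pattern scan. The following is a minimal sketch, not part of the entry: the function name `gxpg_proline_positions` and the toy sequence are hypothetical, and X is assumed to stand for any residue. The entry's own sequence is not reproduced here, so the example runs on made-up input.

```python
import re


def gxpg_proline_positions(sequence):
    """Return 1-based positions of the proline in every G-X-P-G match."""
    # A lookahead is used so that overlapping motifs (for example
    # G-X-P-G-X-P-G) are all reported, not just every other one.
    return [m.start() + 3 for m in re.finditer(r"(?=G.PG)", sequence)]


# Hypothetical toy sequence, NOT taken from Q60847.
toy = "GAPGPPGEKGLPG"
print(gxpg_proline_positions(toy))  # prints [3, 6, 12]
```

A match only marks a candidate site; whether a given proline is actually hydroxylated is a separate question that the entry itself annotates "By similarity".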
Ontologies
| Keywords | |
|---|---|
| Biological process | Cell adhesion |
| Cellular component | Extracellular matrix; Secreted |
| Coding sequence diversity | Alternative splicing |
| Domain | Collagen; Repeat; Signal |
| PTM | Disulfide bond; Glycoprotein; Hydroxylation |
| Technical term | Complete proteome; Reference proteome |
| Gene Ontology (GO) | |
| Biological_process | cell adhesion (Inferred from electronic annotation. Source: UniProtKB-KW) |
| Cellular_component | collagen (Inferred from electronic annotation. Source: UniProtKB-KW); extracellular space (Inferred from electronic annotation. Source: Compara) |
Alternative products
| This entry describes 5 isoforms produced by alternative splicing. Note: The final tissue form of collagen XII may contain homotrimers or any combination of the various isoforms. |
| Isoform 1 (identifier: Q60847-1). Also known as: XIIA-1. This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. |
| Isoform 2 (identifier: Q60847-2). Also known as: ER#K; XIIA-2. The sequence of this isoform differs from the canonical sequence as follows: 3063-3065: EPY → GSG; 3066-3120: Missing. |
| Isoform 3 (identifier: Q60847-3). Also known as: XIIB-1. The sequence of this isoform differs from the canonical sequence as follows: 25-1186: Missing. |
| Isoform 4 (identifier: Q60847-4). Also known as: XIIB-2. The sequence of this isoform differs from the canonical sequence as follows: 25-1186: Missing; 3063-3065: EPY → GSG; 3066-3120: Missing. |
| Isoform 5 (identifier: Q60847-5). The sequence of this isoform differs from the canonical sequence as follows: 3063-3068: EPYVPE → GMLLPS; 3069-3120: Missing. |
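Because every isoform is defined as a set of edits against the canonical sequence, its length (and, given the canonical sequence, its full sequence) can be derived mechanically. The sketch below illustrates this under stated assumptions: the `Edit` class and the helper names are hypothetical, positions are taken to be 1-based and inclusive as in the list above, and edits within one isoform are assumed not to overlap.

```python
from dataclasses import dataclass

CANONICAL_LENGTH = 3120  # length of isoform 1 as listed in this entry


@dataclass(frozen=True)
class Edit:
    """One difference from the canonical sequence (1-based, inclusive)."""
    start: int
    end: int
    replacement: str  # empty string when the range is "Missing"


# Edits transcribed from the isoform list above.
ISOFORM_EDITS = {
    "Q60847-1": [],
    "Q60847-2": [Edit(3063, 3065, "GSG"), Edit(3066, 3120, "")],
    "Q60847-3": [Edit(25, 1186, "")],
    "Q60847-4": [Edit(25, 1186, ""), Edit(3063, 3065, "GSG"),
                 Edit(3066, 3120, "")],
    "Q60847-5": [Edit(3063, 3068, "GMLLPS"), Edit(3069, 3120, "")],
}


def isoform_length(edits, canonical_length=CANONICAL_LENGTH):
    """Length after replacing each edited range with its replacement."""
    length = canonical_length
    for e in edits:
        length += len(e.replacement) - (e.end - e.start + 1)
    return length


def apply_edits(sequence, edits):
    """Apply non-overlapping edits to a canonical sequence string."""
    # Work from the C-terminal end so earlier coordinates stay valid.
    for e in sorted(edits, key=lambda e: e.start, reverse=True):
        sequence = sequence[: e.start - 1] + e.replacement + sequence[e.end:]
    return sequence


isoform_lengths = {iso: isoform_length(edits)
                   for iso, edits in ISOFORM_EDITS.items()}
for iso, n in isoform_lengths.items():
    print(iso, n)
# Q60847-1 3120
# Q60847-2 3065
# Q60847-3 1958
# Q60847-4 1903
# Q60847-5 3068
```

`apply_edits` is included for completeness; it needs the canonical sequence as input, which is not reproduced in this text.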
Sequence annotation (Features)
Molecule processing
| Feature key | Position(s) | Length | Description | Feature identifier |
|---|---|---|---|---|
| Signal peptide | 1 – 23 | 23 | Potential | |
| Chain | 24 – 3120 | 3097 | Collagen alpha-1(XII) chain | PRO_0000005784 |
Regions
| Feature key | Position(s) | Length | Description |
|---|---|---|---|
| Domain | 24 – 112 | 89 | Fibronectin type-III 1 |
| Domain | 140 – 316 | 177 | VWFA 1 |
| Domain | 333 – 422 | 90 | Fibronectin type-III 2 |
| Domain | 440 – 616 | 177 | VWFA 2 |
| Domain | 631 – 719 | 89 | Fibronectin type-III 3 |
| Domain | 722 – 810 | 89 | Fibronectin type-III 4 |
| Domain | 813 – 901 | 89 | Fibronectin type-III 5 |
| Domain | 904 – 993 | 90 | Fibronectin type-III 6 |
| Domain | 995 – 1083 | 89 | Fibronectin type-III 7 |
| Domain | 1086 – 1175 | 90 | Fibronectin type-III 8 |
| Domain | 1199 – 1371 | 173 | VWFA 3 |
| Domain | 1384 – 1472 | 89 | Fibronectin type-III 9 |
| Domain | 1474 – 1563 | 90 | Fibronectin type-III 10 |
| Domain | 1565 – 1652 | 88 | Fibronectin type-III 11 |
| Domain | 1656 – 1743 | 88 | Fibronectin type-III 12 |
| Domain | 1754 – 1843 | 90 | Fibronectin type-III 13 |
| Domain | 1845 – 1933 | 89 | Fibronectin type-III 14 |
| Domain | 1935 – 2024 | 90 | Fibronectin type-III 15 |
| Domain | 2026 – 2115 | 90 | Fibronectin type-III 16 |
| Domain | 2117 – 2204 | 88 | Fibronectin type-III 17 |
| Domain | 2208 – 2294 | 87 | Fibronectin type-III 18 |
| Domain | 2325 – 2498 | 174 | VWFA 4 |
| Domain | 2522 – 2714 | 193 | Laminin G-like |
| Domain | 2749 – 2800 | 52 | Collagen-like 1 |
| Domain | 2804 – 2854 | 51 | Collagen-like 2 |
| Domain | 2855 – 2899 | 45 | Collagen-like 3 |
| Domain | 2943 – 2992 | 50 | Collagen-like 4 |
| Region | 2453 – 2748 | 296 | Nonhelical region (NC3) |
| Region | 2749 – 2900 | 152 | Triple-helical region (COL2) with 1 imperfection |
| Region | 2901 – 2943 | 43 | Nonhelical region (NC2) |
| Region | 2944 – 3046 | 103 | Triple-helical region (COL1) with 2 imperfections |
| Region | 3047 – 3120 | 74 | Nonhelical region (NC1) |
| Motif | 862 – 864 | 3 | Cell attachment site (Potential) |
| Motif | 2781 – 2783 | 3 | Cell attachment site (Potential) |
| Motif | 2897 – 2899 | 3 | Cell attachment site (Potential) |
| Compositional bias | 865 – 868 | 4 | Poly-Thr |
Amino acid modifications
| Feature key | Position(s) | Length | Description |
|---|---|---|---|
| Modified residue | 2946 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 2949 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 2952 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 2961 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 2967 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 2970 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 2973 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 2985 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 3002 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 3005 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 3016 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 3025 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 3028 | 1 | 4-hydroxyproline (By similarity) |
| Modified residue | 3031 | 1 | 4-hydroxyproline (By similarity) |
| Glycosylation | 700 | 1 | N-linked (GlcNAc...) (Potential) |
| Glycosylation | 798 | 1 | O-linked (Xyl...) (chondroitin sulfate) (Potential) |
| Glycosylation | 889 | 1 | O-linked (Xyl...) (chondroitin sulfate) (Potential) |
| Glycosylation | 981 | 1 | O-linked (Xyl...) (chondroitin sulfate) (Potential) |
| Glycosylation | 1765 | 1 | N-linked (GlcNAc...) (Potential) |
| Glycosylation | 2208 | 1 | N-linked (GlcNAc...) (Potential) |
| Glycosylation | 2530 | 1 | N-linked (GlcNAc...) (Potential) |
| Glycosylation | 2681 | 1 | N-linked (GlcNAc...) (Potential) |
Natural variations
| Feature key | Position(s) | Length | Description | Feature identifier |
|---|---|---|---|---|
| Alternative sequence | 25 – 1186 | 1162 | Missing in isoform 3 and isoform 4. | VSP_001150 |
| Alternative sequence | 3063 – 3068 | 6 | EPYVPE → GMLLPS in isoform 5. | VSP_023404 |
| Alternative sequence | 3063 – 3065 | 3 | EPY → GSG in isoform 2 and isoform 4. | VSP_001151 |
| Alternative sequence | 3066 – 3120 | 55 | Missing in isoform 2 and isoform 4. | VSP_001152 |
| Alternative sequence | 3069 – 3120 | 52 | Missing in isoform 5. | VSP_023405 |
Experimental info
| Feature key | Position(s) | Length | Description |
|---|---|---|---|
| Sequence conflict | 245 | 1 | A → G in AAA99719. Ref.1 |
| Sequence conflict | 421 | 1 | K → KTQPK in AAA99719. Ref.1 |
| Sequence conflict | 453 | 1 | I → T in AAA99719. Ref.1 |
| Sequence conflict | 552 | 1 | K → E in AAA99719. Ref.1 |
| Sequence conflict | 611 | 1 | E → V in AAB07047. Ref.1 |
| Sequence conflict | 611 | 1 | E → V in AAA99719. Ref.1 |
| Sequence conflict | 690 | 1 | N → S in AAA99719. Ref.1 |
| Sequence conflict | 797 | 1 | S → P in AAA99719. Ref.1 |
| Sequence conflict | 954 | 1 | P → N in AAA99719. Ref.1 |
| Sequence conflict | 1079 | 1 | G → R in AAA99719. Ref.1 |
| Sequence conflict | 1271 | 1 | Y → N in AAA99719. Ref.1 |
| Sequence conflict | 1472 | 1 | T → A in AAA99719. Ref.1 |
| Sequence conflict | 1524 | 1 | G → E in AAA99719. Ref.1 |
| Sequence conflict | 1773 | 1 | V → I in AAA99719. Ref.1 |
| Sequence conflict | 1831 | 1 | D → G in AAA99719. Ref.1 |
| Sequence conflict | 1939 | 1 | A → S in AAA99719. Ref.1 |
| Sequence conflict | 2005 | 1 | N → Y in AAA99719. Ref.1 |
| Sequence conflict | 2428 – 2429 | 2 | PK → R in AAA99719. Ref.1 |
| Sequence conflict | 2432 | 1 | V → G in AAA99719. Ref.1 |
| Sequence conflict | 2515 | 1 | L → Q in AAA99719. Ref.1 |
| Sequence conflict | 2551 – 2552 | 2 | SY → DS in AAA99719. Ref.1 |
| Sequence conflict | 2861 – 2864 | 4 | Missing in AAA99719. Ref.1 |
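In the feature table, the Length column follows from the Position(s) column as end minus start plus one, with single positions counting as length 1. The sketch below checks that relationship on a handful of rows transcribed from the table. The helper names `parse_positions` and `span_length` are hypothetical, and the row selection is only a sample, not the full table.

```python
import re

# (feature key, position(s), listed length), transcribed from the table above.
ROWS = [
    ("Signal peptide", "1 – 23", 23),
    ("Chain", "24 – 3120", 3097),
    ("Domain", "2855 – 2899", 45),
    ("Region", "2453 – 2748", 296),
    ("Modified residue", "2946", 1),
    ("Alternative sequence", "25 – 1186", 1162),
    ("Sequence conflict", "2861 – 2864", 4),
]


def parse_positions(text):
    """Parse 'start – end' or a single position into (start, end)."""
    numbers = [int(n) for n in re.findall(r"\d+", text)]
    if len(numbers) == 1:
        return numbers[0], numbers[0]
    start, end = numbers
    return start, end


def span_length(start, end):
    """Number of residues in a 1-based inclusive range."""
    return end - start + 1


# Collect every row whose listed length disagrees with its positions.
mismatches = [
    (key, positions, listed)
    for key, positions, listed in ROWS
    if span_length(*parse_positions(positions)) != listed
]
print(mismatches)  # prints [] for the sample rows
```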
Sequences
References
| [1] | "Primary structure of the long and short splice variants of mouse collagen XII and their tissue-specific expression during embryonic development." Boehme K., Li Y., Oh P.S., Olsen B.R. Dev. Dyn. 204:432-445(1995) Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 5), ALTERNATIVE SPLICING (ISOFORMS 1 AND 3). Strain: C57BL/6J and Swiss Webster. Tissue: Skin. |
| [2] | "Lineage-specific biology revealed by a finished genome assembly of the mouse." Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S., Ponting C.P. PLoS Biol. 7:E1000112-E1000112(2009) Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. Strain: C57BL/6J. |
| [3] | "Structural variation of type XII collagen at its carboxyl-terminal NC1 domain generated by tissue-specific alternative splicing." Kania A.M., Reichenberger E., Baur S.T., Karimbux N.Y., Taylor R.W., Olsen B.R., Nishimura I. J. Biol. Chem. 274:22053-22059(1999) Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 3047-3120, ALTERNATIVE SPLICING (ISOFORMS 2 AND 4). Strain: C57BL/6J. Tissue: Skin fibroblast. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | U25652 mRNA. Translation: AAA99719.1. AC157477 Genomic DNA. No translation available. AC166055 Genomic DNA. No translation available. U57095 mRNA. Translation: AAB07047.1. |
| IPI | IPI00121430. IPI00230527. IPI00319976. IPI00776316. IPI00830892. |
| PIR | C44479. D44479. |
| UniGene | Mm.3819. |
3D structure databases | |
| ProteinModelPortal | Q60847. |
| SMR | Q60847. Positions 25-110, 140-308, 333-415, 440-616, 631-1179, 1197-1371, 1378-2312, 2325-2487, 2510-2726. |
Protein-protein interaction databases | |
| MINT | MINT-4091424. |
PTM databases | |
| PhosphoSite | Q60847. |
Proteomic databases | |
| PaxDb | Q60847. |
| PRIDE | Q60847. |
Genome annotation databases | |
| Ensembl | ENSMUST00000071750; ENSMUSP00000071662; ENSMUSG00000032332. |
Organism-specific databases | |
| MGI | MGI:88448. Col12a1. |
Phylogenomic databases | |
| eggNOG | NOG12793. |
| GeneTree | ENSGT00700000104243. |
| HOGENOM | HOG000111877. |
| HOVERGEN | HBG051060. |
| InParanoid | Q60847. |
Gene expression databases | |
| ArrayExpress | Q60847. |
| Bgee | Q60847. |
| CleanEx | MM_COL12A1. |
| Genevestigator | Q60847. |
| GermOnline | ENSMUSG00000032332. Mus musculus. |
Family and domain databases | |
| Gene3D | 2.60.120.200. 1 hit. 2.60.40.10. 18 hits. 3.40.50.410. 4 hits. |
| InterPro | IPR008160. Collagen. IPR008985. ConA-like_lec_gl_sf. IPR013320. ConA-like_subgrp. IPR003961. Fibronectin_type3. IPR013783. Ig-like_fold. IPR001791. Laminin_G. IPR002035. VWF_A. |
| Pfam | PF01391. Collagen. 4 hits. PF00041. fn3. 18 hits. PF00092. VWA. 4 hits. |
| SMART | SM00060. FN3. 18 hits. SM00210. TSPN. 1 hit. SM00327. VWA. 4 hits. |
| SUPFAM | SSF49899. ConA_like_lec_gl. 1 hit. SSF49265. FN_III-like. 18 hits. |
| PROSITE | PS50853. FN3. 18 hits. PS50025. LAM_G_DOMAIN. False negative. PS50234. VWFA. 4 hits. |
Entry information
| Entry name | COCA1_MOUSE |
| Accession | Primary (citable) accession number: Q60847. Secondary accession number(s): P70322 |
| Entry history | |
| Entry status | Reviewed (UniProtKB/Swiss-Prot) |
| Annotation program | Chordata Protein Annotation Program |
Relevant documents
| MGD cross-references | Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments | Index of protein domains and families |

