Q8BLX7 (COGA1_MOUSE) Reviewed, UniProtKB/Swiss-Prot
Last modified
May 1, 2013.
Version 86.
History...
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Collagen alpha-1(XVI) chain | ||
| Gene names |
| ||
| Organism | Mus musculus (Mouse) [Reference proteome] | ||
| Taxonomic identifier | 10090 [NCBI] | ||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus › Mus![]() |
Protein attributes
| Sequence length | 1580 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at transcript level |
General annotation (Comments)
| Function | Involved in mediating cell attachment and inducing integrin-mediated cellular reactions, such as cell spreading and alterations in cell morphology By similarity. UniProtKB Q07092 |
| Subunit structure | Homotrimer. Interacts with FBN1, fibronectin and integrins ITGA1/ITGB1 and ITGA2/ITGB1. Integrin ITGA1/ITGB1 binds to a unique site within COL16A1 located close to its C-terminal end between collagenous domains COL1-COL3 By similarity. UniProtKB Q07092 |
| Subcellular location | Secreted › extracellular space › extracellular matrix By similarity. |
| Tissue specificity | Expressed in most tissues examined with highest levels of expression observed in heart. Strongly expressed in cortical and medullar regions of kidney and more weakly expressed in lung. Also detected in the ciliary muscle of the eye, on the serosa layer lining the muscularis externa of intestinal tissue, and in the perimysium membrane lining both the cardiac muscle bundle and the smooth muscle tissue of the small intestine. Strongly stained in particulate or granular structures. Not detected in brain or skeletal muscle. Ref.3 |
| Developmental stage | At embryonic day 8 (E8) of gestation no significant expression of mRNA or protein is observed, but strong signals are observed in placental trophoblasts. By E11 weak positive signals are observed in heart. During later stages of development, stronger expression is observed in a variety of tissues, particularly in the atrial and ventricular walls of the developing heart, spinal root neural fibers and skin. Ref.3 |
| Domain | This sequence defines eighteen different domains, nine triple-helical domains (COL9 to COL1) and ten non-triple-helical domains (NC10 to NC1). The numerous interruptions in the triple helix may make this molecule either elastic or flexible. |
| Post-translational modification | Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains. Glycosylated By similarity. UniProtKB Q07092 |
| Sequence similarities | Belongs to the fibril-associated collagens with interrupted helices (FACIT) family. Contains 9 collagen-like domains. Contains 1 laminin G-like domain. |
Ontologies
| Keywords | |
|---|---|
| Biological process | Cell adhesion |
| Cellular component | Extracellular matrix Secreted |
| Coding sequence diversity | Alternative splicing |
| Domain | Collagen Repeat Signal |
| PTM | Glycoprotein Hydroxylation |
| Technical term | Complete proteome Reference proteome |
| Gene Ontology (GO) | |
| Biological_process | cell adhesion Inferred from sequence or structural similarity. Source: UniProtKB cellular response to amino acid stimulusInferred from direct assay PubMed 20548288. Source: MGI |
| Cellular_component | collagen Inferred from electronic annotation. Source: UniProtKB-KW |
| Molecular_function | integrin binding Inferred from sequence or structural similarity. Source: UniProtKB |
| Complete GO annotation... | |
Alternative products
| This entry describes 2 isoforms produced by alternative splicing. [Align] [Select] | ||||||
| Isoform 1 Ref.1 (identifier: Q8BLX7-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Note: No experimental confirmation available. | ||||||
| Isoform 2 Ref.1 (identifier: Q8BLX7-2) The sequence of this isoform differs from the canonical sequence as follows: 1-1430: Missing. | ||||||
| Note: No experimental confirmation available. |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||
Molecule processing | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 21 | 21 | Potential | ||||||
| Chain | 22 – 1580 | 1559 | Collagen alpha-1(XVI) chain | PRO_0000282960 | |||||
Regions | |||||||||
| Domain | 50 – 231 | 182 | Laminin G-like | ||||||
| Domain | 375 – 424 | 50 | Collagen-like 1 | ||||||
| Domain | 590 – 643 | 54 | Collagen-like 2 | ||||||
| Domain | 676 – 725 | 50 | Collagen-like 3 | ||||||
| Domain | 797 – 848 | 52 | Collagen-like 4 | ||||||
| Domain | 1006 – 1063 | 58 | Collagen-like 5 | ||||||
| Domain | 1210 – 1263 | 54 | Collagen-like 6 | ||||||
| Domain | 1350 – 1407 | 58 | Collagen-like 7 | ||||||
| Domain | 1448 – 1500 | 53 | Collagen-like 8 | ||||||
| Domain | 1504 – 1552 | 49 | Collagen-like 9 | ||||||
| Region | 232 – 374 | 143 | Nonhelical region 10 (NC10) | ||||||
| Region | 375 – 509 | 135 | Triple-helical region 9 (COL9) with 3 imperfections | ||||||
| Region | 510 – 524 | 15 | Nonhelical region 9 (NC9) | ||||||
| Region | 525 – 570 | 46 | Triple-helical region 8 (COL8) with 1 imperfection | ||||||
| Region | 571 – 586 | 16 | Nonhelical region 8 (NC8) | ||||||
| Region | 587 – 640 | 54 | Triple-helical region 7 (COL7) with 1 imperfection | ||||||
| Region | 641 – 661 | 21 | Nonhelical region 7 (NC7) | ||||||
| Region | 662 – 732 | 71 | Triple-helical region 6 (COL6) with 1 imperfection | ||||||
| Region | 733 – 747 | 15 | Nonhelical region 6 (NC6) | ||||||
| Region | 748 – 870 | 123 | Triple-helical region 5 (COL5) with 3 imperfections | ||||||
| Region | 871 – 881 | 11 | Nonhelical region 5 (NC5) | ||||||
| Region | 882 – 933 | 52 | Triple-helical region 4 (COL4) with 2 imperfections | ||||||
| Region | 934 – 967 | 34 | Nonhelical region 4 (NC4) | ||||||
| Region | 968 – 982 | 15 | Triple-helical region 3 (COL3) | ||||||
| Region | 983 – 1005 | 23 | Nonhelical region 3 (NC3) | ||||||
| Region | 1006 – 1409 | 404 | Triple-helical region 2 (COL2) with 2 imperfections | ||||||
| Region | 1410 – 1448 | 39 | Nonhelical region 2 (NC2) | ||||||
| Region | 1449 – 1554 | 106 | Triple-helical region 1 (COL1) with 2 imperfections | ||||||
| Region | 1555 – 1580 | 26 | Nonhelical region 1 (NC1) | ||||||
| Motif | 555 – 557 | 3 | Cell attachment site Potential | ||||||
| Motif | 1000 – 1002 | 3 | Cell attachment site Potential | ||||||
| Motif | 1206 – 1208 | 3 | Cell attachment site Potential | ||||||
Amino acid modifications | |||||||||
| Glycosylation | 47 | 1 | N-linked (GlcNAc...) Potential | ||||||
| Glycosylation | 327 | 1 | N-linked (GlcNAc...) Potential | ||||||
Natural variations | |||||||||
| Alternative sequence | 1 – 1430 | 1430 | Missing in isoform 2. Ref.1 | VSP_052375 | |||||
Experimental info | |||||||||
| Sequence conflict | 726 | 1 | K → R in BAC30765. Ref.1 | ||||||
| Sequence conflict | 1119 | 1 | R → Q in BAC30765. Ref.1 | ||||||
Sequences
| ||||||||||||||||||||||||
References
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | AK012212 mRNA. Translation: BAB28100.1. AK040971 mRNA. Translation: BAC30765.1. AL606925 Genomic DNA. Translation: CAM45907.1. |
| IPI | IPI00648306. IPI00649075. |
| RefSeq | NP_082542.3. NM_028266.5. |
| UniGene | Mm.41860. |
3D structure databases | |
| HSSP | HSSP built from PDB template 1Q7D based on UniProtKB Q15201. |
| ProteinModelPortal | Q8BLX7. |
| SMR | Q8BLX7. Positions 44-242. |
| ModBase | Search... |
PTM databases | |
| PhosphoSite | Q8BLX7. |
Proteomic databases | |
| PaxDb | Q8BLX7. |
| PRIDE | Q8BLX7. |
Protocols and materials databases | |
| StructuralBiologyKnowledgebase | Search... |
Genome annotation databases | |
| Ensembl | ENSMUST00000044565; ENSMUSP00000035802; ENSMUSG00000040690. |
| GeneID | 107581. |
| KEGG | mmu:107581. |
| UCSC | uc008uys.2. mouse. uc008uyv.2. mouse. |
Organism-specific databases | |
| CTD | 1307. |
| MGI | MGI:1095396. Col16a1. |
Phylogenomic databases | |
| eggNOG | NOG12793. |
| GeneTree | ENSGT00700000104155. |
| HOGENOM | HOG000085653. |
| HOVERGEN | HBG071631. |
| InParanoid | Q8BLX7. |
| OMA | CEVCPTL. |
| OrthoDB | EOG47WNN3. |
Gene expression databases | |
| ArrayExpress | Q8BLX7. |
| Bgee | Q8BLX7. |
| CleanEx | MM_COL16A1. |
| Genevestigator | Q8BLX7. |
Family and domain databases | |
| InterPro | IPR008160. Collagen. IPR008985. ConA-like_lec_gl_sf. IPR001791. Laminin_G. [Graphical view] |
| Pfam | PF01391. Collagen. 9 hits. [Graphical view] |
| SMART | SM00210. TSPN. 1 hit. [Graphical view] |
| SUPFAM | SSF49899. ConA_like_lec_gl. 1 hit. |
| PROSITE | PS50025. LAM_G_DOMAIN. False negative. [Graphical view] |
| ProtoNet | Search... |
Other | |
| NextBio | 359082. |
| SOURCE | Search... |
Entry information
| Entry name | COGA1_MOUSE | ||||||||
| Accession | Primary (citable) accession number: Q8BLX7 Secondary accession number(s): A3KFV5, Q9CZS2 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
Relevant documents
| MGD cross-references Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |

Clusters with
