P11087 (CO1A1_MOUSE) Reviewed, UniProtKB/Swiss-Prot
Last modified
May 1, 2013.
Version 136.
Names and origin
| Protein names | Recommended name: Collagen alpha-1(I) chain. Alternative name(s): Alpha-1 type I collagen |
| Gene names | Name: Col1a1 |
| Organism | Mus musculus (Mouse) [Reference proteome] |
| Taxonomic identifier | 10090 [NCBI] |
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus › Mus |
Protein attributes
| Sequence length | 1453 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Function | Type I collagen is a member of group I collagen (fibrillar forming collagen). |
| Subunit structure | Trimers of one alpha 2(I) and two alpha 1(I) chains. Interacts with MRC2 (By similarity). Interacts with TRAM2 (By similarity). |
| Subcellular location | Secreted › extracellular space › extracellular matrix (By similarity). |
| Tissue specificity | Forms the fibrils of tendon, ligaments and bones. In bones the fibrils are mineralized with calcium hydroxyapatite. |
| Domain | The C-terminal propeptide, also known as the COLFI domain, has crucial roles in tissue growth and repair: it controls both the intracellular assembly of procollagen molecules and the extracellular assembly of collagen fibrils. It binds a calcium ion that is essential for its function (By similarity). |
| Post-translational modification | Proline residues at the third position of the tripeptide repeating unit (G-X-P) are hydroxylated in some or all of the chains. Proline residues at the second position of the tripeptide repeating unit (G-P-X) are hydroxylated in some of the chains. |
| Sequence similarities | Belongs to the fibrillar collagen family. Contains 1 fibrillar collagen NC1 domain. Contains 1 VWFC domain. |
| Sequence caution | The sequence CAA38657.1 differs from that shown. Reason: Erroneous gene model prediction. |
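The post-translational modification comment describes two positional rules within the G-X-Y tripeptide repeat. A minimal sketch of how candidate prolines could be located is given here, assuming an in-frame segment that starts at the glycine of a triplet. The function name `proline_positions` and the toy segment are hypothetical and are not taken from the P11087 sequence; the positions returned are candidates only, since the entry states that hydroxylation occurs in some or all of the chains.

```python
def proline_positions(segment, offset=1):
    """List 1-based positions of prolines in G-P-X and G-X-P triplets.

    Assumes `segment` starts at the Gly of a tripeptide repeat and stays
    in frame; `offset` is the sequence position of the first residue.
    Returns (second_position_prolines, third_position_prolines).
    """
    second, third = [], []
    for i in range(0, len(segment) - 2, 3):
        triplet = segment[i:i + 3]
        if triplet[0] != "G":
            continue  # not a G-X-Y triplet, so skip it
        if triplet[1] == "P":
            second.append(offset + i + 1)  # G-P-X: proline at position 2
        if triplet[2] == "P":
            third.append(offset + i + 2)   # G-X-P: proline at position 3
    return second, third


# Hypothetical toy segment, NOT the real collagen alpha-1(I) sequence.
toy_segment = "GPPGAPGPAGEK"
second_pos, third_pos = proline_positions(toy_segment)
print(second_pos, third_pos)
```

Passing `offset=168` would report positions relative to the start of the triple-helical region given in the sequence annotation, provided the supplied segment really begins at that residue.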
Ontologies
Alternative products
| This entry describes 2 isoforms produced by alternative splicing. |
| Isoform 1 (identifier: P11087-1). This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. |
| Isoform 2 (identifier: P11087-2). The sequence of this isoform differs from the canonical sequence as follows: 803-1030: Missing. |
| Note: No experimental confirmation available. |
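The isoform 2 description can be read as a simple deletion of residues 803-1030 (1-based, inclusive) from the canonical sequence. The sketch below illustrates that reading; the helper name `apply_missing` is hypothetical, and a placeholder string stands in for the canonical sequence, which is not reproduced in this entry text.

```python
def apply_missing(canonical, start, end):
    """Return `canonical` with residues start..end (1-based, inclusive) removed."""
    if not (1 <= start <= end <= len(canonical)):
        raise ValueError("range lies outside the sequence")
    return canonical[:start - 1] + canonical[end:]


# Placeholder of the canonical length (1453 AA); real residues are not shown here.
canonical_placeholder = "X" * 1453
isoform2_placeholder = apply_missing(canonical_placeholder, 803, 1030)

# 228 residues are removed, so the expected length is 1453 - 228 = 1225.
print(len(isoform2_placeholder))
```

Under this reading isoform 2 would be 1225 residues long, which follows directly from the stated canonical length and the size of the missing segment.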
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Feature identifier |
|---|---|---|---|---|
| Molecule processing | | | | |
| Signal peptide | 1 – 22 | 22 | | |
| Propeptide | 23 – 151 | 129 | N-terminal propeptide | PRO_0000005722 |
| Chain | 152 – 1207 | 1056 | Collagen alpha-1(I) chain | PRO_0000005723 |
| Propeptide | 1208 – 1453 | 246 | C-terminal propeptide | PRO_0000005724 |
| Regions | | | | |
| Domain | 29 – 87 | 59 | VWFC | |
| Domain | 1218 – 1453 | 236 | Fibrillar collagen NC1 | |
| Region | 152 – 167 | 16 | Nonhelical region (N-terminal) | |
| Region | 168 – 1181 | 1014 | Triple-helical region | |
| Region | 1182 – 1207 | 26 | Nonhelical region (C-terminal) | |
| Motif | 734 – 736 | 3 | Cell attachment site (Potential) | |
| Motif | 1082 – 1084 | 3 | Cell attachment site (Potential) | |
| Sites | | | | |
| Metal binding | 1266 | 1 | Calcium (By similarity) | |
| Metal binding | 1268 | 1 | Calcium (By similarity) | |
| Metal binding | 1269 | 1 | Calcium; via carbonyl oxygen (By similarity) | |
| Metal binding | 1271 | 1 | Calcium; via carbonyl oxygen (By similarity) | |
| Metal binding | 1274 | 1 | Calcium (By similarity) | |
| Amino acid modifications | | | | |
| Modified residue | 152 | 1 | Pyrrolidone carboxylic acid (By similarity) | |
| Modified residue | 160 | 1 | Allysine (By similarity) | |
| Modified residue | 254 | 1 | 5-hydroxylysine; alternate (By similarity) | |
| Modified residue | 1153 | 1 | 3-hydroxyproline (By similarity) | |
| Glycosylation | 56 | 1 | N-linked (GlcNAc...) (Potential) | |
| Glycosylation | 254 | 1 | O-linked (Gal...); alternate (By similarity) | |
| Glycosylation | 1354 | 1 | N-linked (GlcNAc...) (Ref.12) | |
| Disulfide bond | 1248 ↔ 1280 | | By similarity | |
| Disulfide bond | 1254 | | Interchain (with C-1271) (By similarity) | |
| Disulfide bond | 1271 | | Interchain (with C-1254) (By similarity) | |
| Disulfide bond | 1288 ↔ 1451 | | By similarity | |
| Disulfide bond | 1359 ↔ 1404 | | By similarity | |
| Natural variations | | | | |
| Alternative sequence | 803 – 1030 | 228 | Missing in isoform 2. | VSP_016548 |
| Experimental info | | | | |
| Sequence conflict | 81 | 1 | E → G in AAA88912 (Ref.1) | |
| Sequence conflict | 106 | 1 | D → G in AAA88912 (Ref.1) | |
| Sequence conflict | 136 | 1 | P → H in AAH59281 (Ref.3) | |
| Sequence conflict | 1202 | 1 | G → D in AAA88912 (Ref.1) | |
| Sequence conflict | 1219 | 1 | E → A in AAA88912 (Ref.1) | |
| Sequence conflict | 1222 | 1 | T → A in AAA88912 (Ref.1) | |
| Sequence conflict | 1335 | 1 | A → T in AAA88912 (Ref.1) | |
| Sequence conflict | 1399 – 1400 | 2 | TL → RV in AAA88912 (Ref.1) | |
| Sequence conflict | 1450 | 1 | A → V in CAA29927 (Ref.10) | |
| Sequence conflict | 1450 | 1 | A → V in CAA33904 (Ref.10) | |
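The Length column follows from the positions as end minus start plus one, and the four Molecule processing features should together cover the full 1453-residue precursor without gaps or overlaps. The following sketch checks both properties using values copied from the Molecule processing rows; the helper names `span_length` and `tiles_sequence` are illustrative and do not belong to any UniProt tool.

```python
# (feature key, start, end, stated length) from the Molecule processing rows
features = [
    ("Signal peptide", 1, 22, 22),
    ("Propeptide", 23, 151, 129),
    ("Chain", 152, 1207, 1056),
    ("Propeptide", 1208, 1453, 246),
]
SEQUENCE_LENGTH = 1453  # from the Protein attributes section


def span_length(start, end):
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def tiles_sequence(feature_rows, total):
    """True if the ranges cover 1..total with no gaps and no overlaps."""
    expected_start = 1
    for _, start, end, _ in sorted(feature_rows, key=lambda row: row[1]):
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == total + 1


lengths_ok = all(span_length(s, e) == n for _, s, e, n in features)
covers_all = tiles_sequence(features, SEQUENCE_LENGTH)
print(lengths_ok, covers_all)
```

The same length check can be applied to any other row with a start and end position, for example the triple-helical region (168 to 1181, 1014 residues).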
Sequences
References
| [1] | "The complete cDNA coding sequence for the mouse pro alpha 1(I) chain of type I procollagen." Li S.W., Khillan J., Prockop D.J. Matrix Biol. 14:593-595(1995). Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1). Strain: FVB/N. |
| [2] | "Lineage-specific biology revealed by a finished genome assembly of the mouse." Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S. ... Ponting C.P. PLoS Biol. 7:E1000112-E1000112(2009). Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. Strain: C57BL/6J. |
| [3] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team. Genome Res. 14:2121-2127(2004). Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2). Strain: FVB/N. Tissue: Colon. |
| [4] | "Insertion of retrovirus into the first intron of alpha1(I) collagen gene leads to embryonic lethal mutation in mice." Harbers K., Kuehn M., Delius H., Jaenisch R. Proc. Natl. Acad. Sci. U.S.A. 81:1504-1508(1984). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-25. |
| [5] | "Genomic sequence of mouse COL1A1 encoding the collagen propeptides." Fenton S.P., Lamande S.R., Hannagan M., Stacey A., Jaenisch R., Bateman J.F. Biochim. Biophys. Acta 1216:469-474(1993). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-185 AND 1030-1453. |
| [6] | "DNA methylation represses the murine alpha 1(I) collagen promoter by an indirect mechanism." Rhodes K., Rippe R.A., Umezawa A., Nehls M., Brenner D.A., Breindl M. Mol. Cell. Biol. 14:5950-5960(1994). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-942. Strain: C57BL/6. Tissue: Liver. |
| [7] | "Nucleotide sequence of a cDNA clone for mouse pro alpha 1(I) collagen protein." French B.T., Lee W.-H., Maul G.G. Gene 39:311-312(1985). Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 518-1128 (ISOFORM 1). |
| [8] | "DNA sequence analysis of a mouse pro alpha 1 (I) procollagen gene: evidence for a mouse B1 element within the gene." Monson J.M., Friedman J., McCarthy B.J. Mol. Cell. Biol. 2:1362-1371(1982). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 735-1130. |
| [9] | "Identification of a Balb/c mouse pro alpha 1(I) procollagen gene: evidence for insertions or deletions in gene coding sequences." Monson J.M., McCarthy B.J. DNA 1:59-69(1981). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 735-878 AND 1005-1058. |
| [10] | "Two mRNAs of mouse pro alpha 1(I) collagen gene differ in the size of the 3'-untranslated region." Mooslehner K., Harbers K. Nucleic Acids Res. 16:773-773(1988). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1442-1453. |
| [11] | "Specific hybridization probes for mouse type I, II, III and IX collagen mRNAs." Metsaeranta M., Toman D., de Crombrugghe B., Vuorio E. Biochim. Biophys. Acta 1089:241-243(1991). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1442-1453. |
| [12] | "Enhanced analysis of the mouse plasma proteome using cysteine-containing tryptic glycopeptides." Bernhard O.K., Kapp E.A., Simpson R.J. J. Proteome Res. 6:987-995(2007). Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-1354, MASS SPECTROMETRY. Strain: C57BL/6. Tissue: Plasma. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | U08020 mRNA. Translation: AAA88912.1. AL662790, AL606480 Genomic DNA. Translation: CAI25880.1. AL606480, AL662790 Genomic DNA. Translation: CAI23970.1. BC050014 mRNA. Translation: AAH50014.1. BC059281 mRNA. Translation: AAH59281.1. K01688 Genomic DNA. Translation: AAA37330.1. S67530 Genomic DNA. Translation: AAB29424.1. S67482 Genomic DNA. No translation available. X54876 Genomic DNA. Translation: CAA38657.1. Sequence problems. M14423 mRNA. Translation: AAA37333.1. M17491 Genomic DNA. Translation: AAA37334.1. K03036, K03035 Genomic DNA. Translation: AAA37332.1. X06753 Genomic DNA. Translation: CAA29927.1. X15896 Genomic DNA. Translation: CAA33904.1. X57981 Genomic DNA. Translation: CAA41046.1. |
| IPI | IPI00329872. IPI00623191. |
| PIR | I49558. S21626. S57243. |
| RefSeq | NP_031768.2. NM_007742.3. |
| UniGene | Mm.277735. Mm.458212. |
3D structure databases | |
| ProteinModelPortal | P11087. |
| SMR | P11087. Positions 30-89, 1236-1453. |
Protein-protein interaction databases | |
| IntAct | P11087. 1 interaction. |
| STRING | 10090.ENSMUSP00000001547. |
PTM databases | |
| PhosphoSite | P11087. |
Proteomic databases | |
| PaxDb | P11087. |
| PRIDE | P11087. |
Genome annotation databases | |
| Ensembl | ENSMUST00000001547; ENSMUSP00000001547; ENSMUSG00000001506. |
| GeneID | 12842. |
| KEGG | mmu:12842. |
| UCSC | uc007kzn.1. mouse. |
Organism-specific databases | |
| CTD | 1277. |
| MGI | MGI:88467. Col1a1. |
Phylogenomic databases | |
| eggNOG | NOG12793. |
| GeneTree | ENSGT00660000095287. |
| HOVERGEN | HBG004933. |
| InParanoid | P11087. |
| KO | K06236. |
| OMA | VAYMDQQ. |
| OrthoDB | EOG4S4PHP. |
Gene expression databases | |
| ArrayExpress | P11087. |
| Bgee | P11087. |
| CleanEx | MM_COL1A1. |
| Genevestigator | P11087. |
| GermOnline | ENSMUSG00000001506. Mus musculus. |
Family and domain databases | |
| InterPro | IPR008160. Collagen. IPR000885. Fib_collagen_C. IPR001007. VWF_C. |
| Pfam | PF01410. COLFI. 1 hit. PF01391. Collagen. 9 hits. PF00093. VWC. 1 hit. |
| ProDom | PD002078. Fib_collagen_C. 1 hit. |
| SMART | SM00038. COLFI. 1 hit. SM00214. VWC. 1 hit. |
| PROSITE | PS51461. NC1_FIB. 1 hit. PS01208. VWFC_1. 1 hit. PS50184. VWFC_2. 1 hit. |
Other | |
| ChiTaRS | COL1A1. mouse. |
| NextBio | 282376. |
| PMAP-CutDB | P11087. |
Entry information
| Entry name | CO1A1_MOUSE |
| Accession | Primary (citable) accession number: P11087. Secondary accession number(s): Q53WT0, Q810J9 |
| Entry history | |
| Entry status | Reviewed (UniProtKB/Swiss-Prot) |
| Annotation program | Chordata Protein Annotation Program |
Relevant documents
| MGD cross-references Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |
