P02454 (CO1A1_RAT) Reviewed, UniProtKB/Swiss-Prot
Last modified
April 3, 2013.
Version 118.
History...
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Collagen alpha-1(I) chain Alternative name(s): Alpha-1 type I collagen | ||
| Gene names |
| ||
| Organism | Rattus norvegicus (Rat) [Reference proteome] | ||
| Taxonomic identifier | 10116 [NCBI] | ||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Rattus![]() |
Protein attributes
| Sequence length | 1453 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Function | Type I collagen is a member of group I collagen (fibrillar forming collagen). |
| Subunit structure | Trimers of one alpha 2(I) and two alpha 1(I) chains. Interacts with MRC2. Interacts with TRAM2 By similarity. Ref.15 |
| Subcellular location | Secreted › extracellular space › extracellular matrix By similarity. |
| Tissue specificity | Forms the fibrils of tendon, ligaments and bones. In bones the fibrils are mineralized with calcium hydroxyapatite. |
| Domain | The C-terminal propeptide, also known as COLFI domain, have crucial roles in tissue growth and repair by controlling both the intracellular assembly of procollagen molecules and the extracellular assembly of collagen fibrils. It binds a calcium ion which is essential for its function By similarity. |
| Post-translational modification | Proline residues at the third position of the tripeptide repeating unit (G-X-P) are hydroxylated in some or all of the chains. Proline residues at the second position of the tripeptide repeating unit (G-P-X) are hydroxylated in some of the chains. O-linked glycan consists of a Glc-Gal disaccharide bound to the oxygen atom of a post-translationally added hydroxyl group. Hydroxylation on proline residues within the sequence motif, GXPG, is most likely to be 4-hydroxy as this fits the requirement for 4-hydroxylation in vertebrates. |
| Sequence similarities | Belongs to the fibrillar collagen family. Contains 1 fibrillar collagen NC1 domain. Contains 1 VWFC domain. |
Ontologies
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 22 | 22 | Potential | ||||||||
| Propeptide | 23 – 151 | 129 | N-terminal propeptide | PRO_0000043358 | |||||||
| Chain | 152 – 1207 | 1056 | Collagen alpha-1(I) chain | PRO_0000043359 | |||||||
| Propeptide | 1208 – 1453 | 246 | C-terminal propeptide By similarity | PRO_0000043360 | |||||||
Regions | |||||||||||
| Domain | 29 – 87 | 59 | VWFC | ||||||||
| Domain | 1218 – 1453 | 236 | Fibrillar collagen NC1 | ||||||||
| Region | 152 – 167 | 16 | Nonhelical region (N-terminal) | ||||||||
| Region | 168 – 1181 | 1014 | Triple-helical region | ||||||||
| Region | 1176 – 1186 | 11 | Major antigenic determinant (of neutral salt-extracted rat skin collagen) | ||||||||
| Region | 1182 – 1207 | 26 | Nonhelical region (C-terminal) | ||||||||
| Motif | 734 – 736 | 3 | Cell attachment site Potential | ||||||||
| Motif | 1082 – 1084 | 3 | Cell attachment site Potential | ||||||||
Sites | |||||||||||
| Metal binding | 1266 | 1 | Calcium By similarity | ||||||||
| Metal binding | 1268 | 1 | Calcium By similarity | ||||||||
| Metal binding | 1269 | 1 | Calcium; via carbonyl oxygen By similarity | ||||||||
| Metal binding | 1271 | 1 | Calcium; via carbonyl oxygen By similarity | ||||||||
| Metal binding | 1274 | 1 | Calcium By similarity | ||||||||
Amino acid modifications | |||||||||||
| Modified residue | 152 | 1 | Pyrrolidone carboxylic acid Probable | ||||||||
| Modified residue | 160 | 1 | Allysine | ||||||||
| Modified residue | 179 | 1 | 4-hydroxyproline Probable | ||||||||
| Modified residue | 182 | 1 | 4-hydroxyproline Probable | ||||||||
| Modified residue | 185 | 1 | 4-hydroxyproline Probable | ||||||||
| Modified residue | 194 | 1 | 4-hydroxyproline Probable | ||||||||
| Modified residue | 197 | 1 | 4-hydroxyproline Probable | ||||||||
| Modified residue | 200 | 1 | 4-hydroxyproline Probable | ||||||||
| Modified residue | 254 | 1 | 5-hydroxylysine | ||||||||
| Modified residue | 575 | 1 | 5-hydroxylysine Probable | ||||||||
| Modified residue | 698 | 1 | 5-hydroxylysine Probable | ||||||||
| Modified residue | 1153 | 1 | 3-hydroxyproline By similarity | ||||||||
| Glycosylation | 56 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 254 | 1 | O-linked (Gal...) | ||||||||
| Glycosylation | 1354 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Disulfide bond | 1248 ↔ 1280 | By similarity | |||||||||
| Disulfide bond | 1254 | Interchain (with C-1271) By similarity | |||||||||
| Disulfide bond | 1271 | Interchain (with C-1254) By similarity | |||||||||
| Disulfide bond | 1288 ↔ 1451 | By similarity | |||||||||
| Disulfide bond | 1359 ↔ 1404 | By similarity | |||||||||
Experimental info | |||||||||||
| Sequence conflict | 143 | 1 | P → L in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 202 | 1 | A → G in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 209 | 1 | R → P in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 232 | 1 | E → Q AA sequence Ref.7 | ||||||||
| Sequence conflict | 268 | 1 | D → N AA sequence Ref.8 | ||||||||
| Sequence conflict | 286 | 1 | A → T in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 307 | 1 | S → T in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 313 | 1 | N → D AA sequence Ref.9 | ||||||||
| Sequence conflict | 421 | 1 | N → T in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 497 | 1 | A → S in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 773 | 1 | T → A in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 794 | 1 | P → A in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 958 | 1 | E → K in CAB01633. Ref.1 | ||||||||
| Sequence conflict | 1111 | 1 | S → P AA sequence Ref.13 | ||||||||
| Sequence conflict | 1163 | 1 | S → A AA sequence Ref.13 | ||||||||
| Sequence conflict | 1166 | 1 | A → S AA sequence Ref.13 | ||||||||
| Sequence conflict | 1187 | 1 | F → L AA sequence Ref.14 | ||||||||
| Sequence conflict | 1190 | 1 | L → F AA sequence Ref.14 | ||||||||
Sequences
| ||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "Expression of collagen alpha1(I) mRNA variants during tooth and bone formation in the rat." Brandsten C., Lundmark C., Christersson C., Hammarstroem L., Wurtz T. J. Dent. Res. 78:11-19(1999) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA]. Strain: Sprague-Dawley. Tissue: Bone and Tooth. |
| [2] | Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C. Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. Strain: Brown Norway. |
| [3] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. Strain: Brown Norway. Tissue: Lung. |
| [4] | "Comparative sequence studies of rat skin and tendon collagen. II. The absence of a short sequence at the amino terminus of the skin alpha-1 chain." Bornstein P. Biochemistry 8:63-71(1969) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 152-170. |
| [5] | "The amino acid sequence of peptides from the cross-linking region of rat skin collagen." Kang A.H., Bornstein P., Piez K.A. Biochemistry 6:788-795(1967) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 156-170. |
| [6] | "The incomplete hydroxylation of individual prolyl residues in collagen." Bornstein P. J. Biol. Chem. 242:2572-2574(1967) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 171-206. |
| [7] | "Chemical studies on the cyanogen bromide peptides of rat skin collagen. Amino acid sequence of alpha 1-CB4." Butler W.T., Ponds S.L. Biochemistry 10:2076-2081(1971) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 207-253. |
| [8] | "Chemical studies on the cyanogen bromide peptides of rat skin collagen. The covalent structure of alpha 1-CB5, the major hexose-containing cyanogen bromide peptide of alpha 1." Butler W.T. Biochemistry 9:44-50(1970) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 254-290. |
| [9] | "Structure of rat skin collagen alpha 1-CB8. Amino acid sequence of the hydroxylamine-produced fragment HA1." Balian G., Click E.M., Bornstein P. Biochemistry 10:4470-4478(1971) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 291-389. |
| [10] | "Structure of rat skin collagen alpha 1-CBB. Amino acid sequence of the hydroxyl amine-produced fragment HA2." Balian G., Click E.M., Hermodson M.A., Bornstein P. Biochemistry 11:3798-3806(1972) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 390-569. |
| [11] | "Chemical studies on the cyanogen bromide peptides of rat skin collagen. Amino acid sequence of alpha 1-CB3." Butler W.T., Underwood S.P., Finch J.E. Jr. Biochemistry 13:2946-2953(1974) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 570-718. |
| [12] | "Construction of DNA sequences complementary to rat alpha 1 and alpha 2 collagen mRNA and their use in studying the regulation of type I collagen synthesis by 1,25-dihydroxyvitamin D." Genovese C., Rowe D., Kream B. Biochemistry 23:6210-6216(1984) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 680-718. |
| [13] | "Structural and immunogenic properties of a major antigenic determinant in neutral salt-extracted rat-skin collagen." Stoltz M., Timpl R., Furthmayr H., Kuehn K. Eur. J. Biochem. 37:287-294(1973) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 1103-1186. |
| [14] | "Non-helical regions in rat collagen alpha 1-chain." Stoltz M., Timpl R., Kuehn K. FEBS Lett. 26:61-65(1972) [PubMed] [Europe PMC] [Abstract] Cited for: PROTEIN SEQUENCE OF 1186-1206. |
| [15] | "Endo180 binds to the C-terminal region of type I collagen." Thomas E.K., Nakamura M., Wienke D., Isacke C.M., Pozzi A., Liang P. J. Biol. Chem. 280:22596-22605(2005) [PubMed] [Europe PMC] [Abstract] Cited for: INTERACTION WITH MRC2. Strain: Sprague-Dawley. |
| [16] | "A new procedure for rapid, high yield purification of Type I collagen for tissue engineering." Xiong X., Ghosh R., Hiller E., Drepper F., Knapp B., Brunner H., Rupp S. Process Biochem. 44:1200-1212(2009) Cited for: IDENTIFICATION BY MASS SPECTROMETRY, PHOSPHORYLATION. |
| [17] | "Microfibrillar structure of type I collagen in situ." Orgel J.P.R.O., Irving T.C., Miller A., Wess T.J. Proc. Natl. Acad. Sci. U.S.A. 103:9001-9005(2006) [PubMed] [Europe PMC] [Abstract] Cited for: X-RAY CRYSTALLOGRAPHY (5.16 ANGSTROMS) OF 152-1207. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EMBL GenBank DDBJ | Z78279 mRNA. Translation: CAB01633.1. CH473948 Genomic DNA. Translation: EDM05727.1. BC133728 mRNA. Translation: AAI33729.1. M11432 mRNA. Translation: AAA40832.1. Sequence problems. | ||||||||||||||||||
| IPI | IPI00188909. | ||||||||||||||||||
| PIR | CGRT1S. A90559. | ||||||||||||||||||
| RefSeq | NP_445756.1. NM_053304.1. | ||||||||||||||||||
| UniGene | Rn.2953. | ||||||||||||||||||
3D structure databases | |||||||||||||||||||
| PDBe RCSB PDB PDBj |
| ||||||||||||||||||
| ProteinModelPortal | P02454. | ||||||||||||||||||
| ModBase | Search... | ||||||||||||||||||
Protein-protein interaction databases | |||||||||||||||||||
| IntAct | P02454. 2 interactions. | ||||||||||||||||||
Proteomic databases | |||||||||||||||||||
| PaxDb | P02454. | ||||||||||||||||||
| PRIDE | P02454. | ||||||||||||||||||
Protocols and materials databases | |||||||||||||||||||
| StructuralBiologyKnowledgebase | Search... | ||||||||||||||||||
Genome annotation databases | |||||||||||||||||||
| Ensembl | ENSRNOT00000005311; ENSRNOP00000005311; ENSRNOG00000003897. | ||||||||||||||||||
| GeneID | 29393. | ||||||||||||||||||
| KEGG | rno:29393. | ||||||||||||||||||
| UCSC | RGD:61817. rat. | ||||||||||||||||||
Organism-specific databases | |||||||||||||||||||
| CTD | 1277. | ||||||||||||||||||
| RGD | 61817. Col1a1. | ||||||||||||||||||
Phylogenomic databases | |||||||||||||||||||
| eggNOG | NOG12793. | ||||||||||||||||||
| GeneTree | ENSGT00660000095287. | ||||||||||||||||||
| HOGENOM | HOG000085654. | ||||||||||||||||||
| HOVERGEN | HBG004933. | ||||||||||||||||||
| InParanoid | A3KNA1. | ||||||||||||||||||
| KO | K06236. | ||||||||||||||||||
| OMA | VAYMDQQ. | ||||||||||||||||||
Enzyme and pathway databases | |||||||||||||||||||
| Reactome | REACT_150387. Gelatin degradation by MMP19. | ||||||||||||||||||
Gene expression databases | |||||||||||||||||||
| ArrayExpress | P02454. | ||||||||||||||||||
| Genevestigator | P02454. | ||||||||||||||||||
| GermOnline | ENSRNOG00000003897. Rattus norvegicus. | ||||||||||||||||||
Family and domain databases | |||||||||||||||||||
| InterPro | IPR008160. Collagen. IPR000885. Fib_collagen_C. IPR001007. VWF_C. [Graphical view] | ||||||||||||||||||
| Pfam | PF01410. COLFI. 1 hit. PF01391. Collagen. 12 hits. PF00093. VWC. 1 hit. [Graphical view] | ||||||||||||||||||
| ProDom | PD002078. Fib_collagen_C. 1 hit. [Graphical view] [Entries sharing at least one domain] | ||||||||||||||||||
| SMART | SM00038. COLFI. 1 hit. SM00214. VWC. 1 hit. [Graphical view] | ||||||||||||||||||
| PROSITE | PS51461. NC1_FIB. 1 hit. PS01208. VWFC_1. 1 hit. PS50184. VWFC_2. 1 hit. [Graphical view] | ||||||||||||||||||
| ProtoNet | Search... | ||||||||||||||||||
Other | |||||||||||||||||||
| EvolutionaryTrace | P02454. | ||||||||||||||||||
| NextBio | 609017. | ||||||||||||||||||
Entry information
| Entry name | CO1A1_RAT | ||||||||
| Accession | Primary (citable) accession number: P02454 Secondary accession number(s): A3KNA1, P02455, Q63079 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
Relevant documents
| PDB cross-references Index of Protein Data Bank (PDB) cross-references |
| SIMILARITY comments Index of protein domains and families |

Clusters with
