Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Collagen alpha-1(II) chain

Gene

Col2a1

Organism
Rattus norvegicus (Rat)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Type II collagen is specific for cartilaginous tissues. It is essential for the normal embryonic development of the skeleton, for linear growth and for the ability of cartilage to resist compressive forces.

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Metal bindingi1233CalciumBy similarity1
Metal bindingi1235CalciumBy similarity1
Metal bindingi1236Calcium; via carbonyl oxygenBy similarity1
Metal bindingi1238Calcium; via carbonyl oxygenBy similarity1
Metal bindingi1241CalciumBy similarity1

GO - Molecular functioni

  • extracellular matrix structural constituent Source: RGD
  • metal ion binding Source: UniProtKB-KW

GO - Biological processi

  • cartilage development Source: RGD
  • cartilage development involved in endochondral bone morphogenesis Source: RGD
  • cellular response to mechanical stimulus Source: RGD
  • cellular response to nicotine Source: RGD
  • cellular response to parathyroid hormone stimulus Source: RGD
  • cellular response to peptide hormone stimulus Source: RGD
  • cellular response to retinoic acid Source: RGD
  • cellular response to tumor necrosis factor Source: RGD
  • cellular response to vitamin E Source: RGD
  • chondrocyte differentiation Source: RGD
  • growth plate cartilage development Source: RGD
  • response to fibroblast growth factor Source: RGD
  • response to mechanical stimulus Source: RGD
  • response to X-ray Source: RGD
Complete GO annotation...

Keywords - Ligandi

Calcium, Metal-binding

Names & Taxonomyi

Protein namesi
Recommended name:
Collagen alpha-1(II) chain
Alternative name(s):
Alpha-1 type II collagen
Cleaved into the following 2 chains:
Gene namesi
Name:Col2a1
OrganismiRattus norvegicus (Rat)
Taxonomic identifieri10116 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeRattus
Proteomesi
  • UP000002494 Componenti: Unplaced

Organism-specific databases

RGDi2375. Col2a1.

Subcellular locationi

GO - Cellular componenti

  • collagen type II trimer Source: RGD
  • cytoplasm Source: RGD
Complete GO annotation...

Keywords - Cellular componenti

Extracellular matrix, Secreted

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 25Sequence analysisAdd BLAST25
PropeptideiPRO_000000573526 – 113N-terminal propeptideBy similarityAdd BLAST88
ChainiPRO_0000005736114 – 1173Collagen alpha-1(II) chainAdd BLAST1060
ChainiPRO_00000434071174 – 1419ChondrocalcinBy similarityAdd BLAST246

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei1225-hydroxylysineBy similarity1
Glycosylationi122O-linked (Gal...)By similarity1
Modified residuei2195-hydroxylysineBy similarity1
Glycosylationi219O-linked (Gal...)By similarity1
Modified residuei2315-hydroxylysineBy similarity1
Glycosylationi231O-linked (Gal...)By similarity1
Modified residuei2405-hydroxylysineBy similarity1
Glycosylationi240O-linked (Gal...)By similarity1
Modified residuei3065-hydroxylysineBy similarity1
Glycosylationi306O-linked (Gal...)By similarity1
Modified residuei5405-hydroxylysineBy similarity1
Glycosylationi540O-linked (Gal...)By similarity1
Modified residuei5525-hydroxylysineBy similarity1
Glycosylationi552O-linked (Gal...)By similarity1
Modified residuei6023-hydroxyproline; partial1 Publication1
Modified residuei8393-hydroxyproline; partial1 Publication1
Modified residuei10763-hydroxyproline; partial1 Publication1
Modified residuei11183-hydroxyproline1 Publication1
Modified residuei11333-hydroxyproline; partial1 Publication1
Modified residuei11393-hydroxyproline; partial1 Publication1
Modified residuei11453-hydroxyproline; partial1 Publication1
Disulfide bondi1215 ↔ 1247PROSITE-ProRule annotation
Disulfide bondi1221Interchain (with C-1238)PROSITE-ProRule annotation
Disulfide bondi1238Interchain (with C-1221)PROSITE-ProRule annotation
Disulfide bondi1255 ↔ 1417PROSITE-ProRule annotation
Glycosylationi1320N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi1325 ↔ 1370PROSITE-ProRule annotation

Post-translational modificationi

Prolines at the third position of the tripeptide repeating unit (G-X-P) are hydroxylated in some or all of the chains. Probably 3-hydroxylated on Pro-602, Pro-839, Pro-1076, Pro-1133, Pro-1139 and Pro-1145 by LEPREL1.1 Publication

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sitei113 – 114Cleavage; by procollagen N-endopeptidaseBy similarity2
Sitei1173 – 1174Cleavage; by procollagen C-endopeptidaseBy similarity2

Keywords - PTMi

Disulfide bond, Glycoprotein, Hydroxylation

Proteomic databases

PaxDbiP05539.
PRIDEiP05539.

Expressioni

Tissue specificityi

Expressed in chondrocytes.1 Publication

Interactioni

Subunit structurei

Homotrimers of alpha 1(II) chains.

Protein-protein interaction databases

IntActiP05539. 1 interactor.
STRINGi10116.ENSRNOP00000016044.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini1185 – 1419Fibrillar collagen NC1PROSITE-ProRule annotationAdd BLAST235

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni133 – 1146Triple-helical regionAdd BLAST1014
Regioni1147 – 1173Nonhelical region (C-terminal)Add BLAST27

Domaini

The C-terminal propeptide, also known as COLFI domain, have crucial roles in tissue growth and repair by controlling both the intracellular assembly of procollagen molecules and the extracellular assembly of collagen fibrils. It binds a calcium ion which is essential for its function (By similarity).By similarity

Sequence similaritiesi

Belongs to the fibrillar collagen family.PROSITE-ProRule annotation
Contains 1 fibrillar collagen NC1 domain.PROSITE-ProRule annotation

Keywords - Domaini

Collagen, Repeat, Signal

Phylogenomic databases

eggNOGiKOG3544. Eukaryota.
ENOG410XNMM. LUCA.
HOGENOMiHOG000085654.
HOVERGENiHBG004933.
InParanoidiP05539.
KOiK19719.
PhylomeDBiP05539.

Family and domain databases

InterProiIPR008160. Collagen.
IPR000885. Fib_collagen_C.
[Graphical view]
PfamiPF01410. COLFI. 1 hit.
PF01391. Collagen. 3 hits.
[Graphical view]
ProDomiPD002078. Fib_collagen_C. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00038. COLFI. 1 hit.
[Graphical view]
PROSITEiPS51461. NC1_FIB. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P05539-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MIRLGAPQSL VLLTLLIATV LQCQGQDARK LGPKGQKGEP GDIKDIIGPK
60 70 80 90 100
GPPGPQGPAG EQGPRGDRGD KGERGAPGPR GRDGEPGTPG NPGPPGPPGP
110 120 130 140 150
PGPPGLGGGN FAAQMAGGFD EKAGGAQMGV MQGPMGPMGP RGPPGPAGAP
160 170 180 190 200
GPQGFQGNPG EPGEPGVSGP IGPRGPPGPA GKPGDDGEAG KPGKAGERGL
210 220 230 240 250
PGPQGARGFP GTPGLPGVKG HRGYPGLDGA KGEAGAPGVK GESGSPGENG
260 270 280 290 300
SPGPMGPRGL PGERGRTGPA GAAGARGNDG QPGPAGPPGP VGPAGGPGFL
310 320 330 340 350
GAPGAKGEAG PTGARGPEGA QGSRGEPGNP GSPGPAGASG NPGTDGIPGA
360 370 380 390 400
KGSAGAPGIA GAPGFPGPRG PPGPQGATGP LGPKGQTGEP GIAGFKGEQG
410 420 430 440 450
PKGETGPAGP QGAPGPAGEE GKRGARGEPG GAGPIGPPGE RGAPGNRGFP
460 470 480 490 500
GQDGLAGPKG APGERGPSGL AGPKGANGDP GRPGEPGLPG ARGLTGRPGD
510 520 530 540 550
AGPQGKVGPS GAPGEDGRPG PPGPQGARGQ PGVMGFPGPK GANGEPGKAG
560 570 580 590 600
EKGLAGAPGL RGLPGKDGET GAAGPPGPSG PAGERGEQGA PGPSGFQGLP
610 620 630 640 650
GPPGPPGEGG KQGDQGIPGE AGAPGLVGPR GERGFPGERG SPGAQGLQGP
660 670 680 690 700
RGLPGTPGTD GPKGAAGPDG PPGAQGPPGL QGMPGERGAA GIAGPKGDRG
710 720 730 740 750
DVGEKGPEGA PGKDGGRGLT GPIGPPGPAG ANGEKGEVGP PGPSGSTGAR
760 770 780 790 800
GAPGERGETG PPGPAGFAGP PGADGQPGAK GDQGEAGQKG DAGAPGPQGP
810 820 830 840 850
SGAPGPQGPT GVTGPKGARG AQGPPGATGF PGAAGRVGPP GSNGNPGPAG
860 870 880 890 900
PPGPAGKDGP KGARGDTGAP GRAGDPGLQG PAGAPGEKGE PGDDGPSGSD
910 920 930 940 950
GPPGPQGLAG QRGIVGLPGQ RGERGFPGLP GPSGEPGKQG APGASGDRGP
960 970 980 990 1000
PGPVGPPGLT GPAGEPGREG SPGADGPPGR DGAAGVKGDR GETGALGAPG
1010 1020 1030 1040 1050
APGPPGSPGP AGPTGKQGDR GEAGAQGPMG PSGPAGARGI AGPQGPRGDK
1060 1070 1080 1090 1100
GEAGEPGERG LKGHRGFTGL QGLPGPPGPS GDQGTSGPAG PSGPRGPPGP
1110 1120 1130 1140 1150
VGPSGKDGSN GIPGPIGPPG PRGRSGETGP AGPPGNPGPP GPPGPPGPGI
1160 1170 1180 1190 1200
DMSAFAGLGQ REKGPDPLQY MRADEADSTL RQHDVEVDAT LKSLNNQIES
1210 1220 1230 1240 1250
IRSPDGSRKN PARTCQDLKL CHPEWKSGDY WIDPNQGCTL DAMKVFCNME
1260 1270 1280 1290 1300
TGESCVYPNP ATVPRKNWWS SKSKEKKHIW FGETMNGGFH FSYGDGNLAP
1310 1320 1330 1340 1350
NTANVQMTFL RLLSTEGSQN ITYHCKNSIA YLDEAAGNLK KALLIQGSND
1360 1370 1380 1390 1400
VEMRAEGNSR FTYTALKDGC TKHTGKWGKT IIEYRSQKTS RLPIVDIAPM
1410
DIGGPDQEFG VDIGPVCFL
Length:1,419
Mass (Da):134,570
Last modified:December 6, 2005 - v2
Checksum:iB7C63B77819CE50B
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti121E → Q in AAA40919 (PubMed:6094525).Curated1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L48440 mRNA. Translation: AAA79780.1.
K02804 mRNA. Translation: AAA40919.1.
M10613 Genomic DNA. Translation: AAA40920.1.
X79816 mRNA. Translation: CAA56213.1.
PIRiA05152.
I60384.
RefSeqiNP_037061.1. NM_012929.1.
UniGeneiRn.10124.

Genome annotation databases

GeneIDi25412.
KEGGirno:25412.
UCSCiRGD:2375. rat.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L48440 mRNA. Translation: AAA79780.1.
K02804 mRNA. Translation: AAA40919.1.
M10613 Genomic DNA. Translation: AAA40920.1.
X79816 mRNA. Translation: CAA56213.1.
PIRiA05152.
I60384.
RefSeqiNP_037061.1. NM_012929.1.
UniGeneiRn.10124.

3D structure databases

ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

IntActiP05539. 1 interactor.
STRINGi10116.ENSRNOP00000016044.

Proteomic databases

PaxDbiP05539.
PRIDEiP05539.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

GeneIDi25412.
KEGGirno:25412.
UCSCiRGD:2375. rat.

Organism-specific databases

CTDi1280.
RGDi2375. Col2a1.

Phylogenomic databases

eggNOGiKOG3544. Eukaryota.
ENOG410XNMM. LUCA.
HOGENOMiHOG000085654.
HOVERGENiHBG004933.
InParanoidiP05539.
KOiK19719.
PhylomeDBiP05539.

Miscellaneous databases

PROiP05539.

Family and domain databases

InterProiIPR008160. Collagen.
IPR000885. Fib_collagen_C.
[Graphical view]
PfamiPF01410. COLFI. 1 hit.
PF01391. Collagen. 3 hits.
[Graphical view]
ProDomiPD002078. Fib_collagen_C. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00038. COLFI. 1 hit.
[Graphical view]
PROSITEiPS51461. NC1_FIB. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiCO2A1_RAT
AccessioniPrimary (citable) accession number: P05539
Secondary accession number(s): Q63123, Q63565, Q78DY3
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 1, 1988
Last sequence update: December 6, 2005
Last modified: June 8, 2016
This is version 118 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.