Reviewed,
UniProtKB/Swiss-Prot P16112 (PGCA_HUMAN)
Last modified
November 24, 2009.
Version 126.
History...
Clusters with 100%,
90%,
50% identity |
Documents (5) |
Third-party data |
Customize display | text xml rdf/xml gff fasta |
Names and origin
| Protein names | Recommended name: Aggrecan core protein Alternative name(s): Cartilage-specific proteoglycan core protein Short name=CSPCP Chondroitin sulfate proteoglycan core protein 1 Cleaved into the following chain: 1- Recommended name: Aggrecan core protein 2 | ||||
| Gene names |
| ||||
| Organism | Homo sapiens (Human) [Complete proteome] | ||||
| Taxonomic identifier | 9606 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo |
Protein attributes
| Sequence length | 2415 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level. |
General annotation (Comments)
| Function | This proteoglycan is a major component of extracellular matrix of cartilagenous tissues. A major function of this protein is to resist compression in cartilage. It binds avidly to hyaluronic acid via an N-terminal globular region. |
| Subunit structure | Interacts with FBLN1 By similarity. Interacts with COMP. |
| Subcellular location | Secreted › extracellular space › extracellular matrix By similarity. |
| Tissue specificity | Restricted to cartilages. Ref.4 |
| Developmental stage | Expression was detected in chondrocytes throughout the developing skeleton. |
| Domain | Two globular domains, G1 and G2, comprise the N-terminus of the proteoglycan, while another globular region, G3, makes up the C-terminus. G1 contains Link domains and thus consists of three disulfide-bonded loop structures designated as the A, B, B' motifs. G2 is similar to G1. The keratan sulfate (KS) and the chondroitin sulfate (CS) attachment domains lie between G2 and G3. |
| Post-translational modification | Contains mostly chondroitin sulfate, but also keratan sulfate chains, N-linked and O-linked oligosaccharides. The release of aggrecan fragments from articular cartilage into the synovial fluid at all stages of human osteoarthritis is the result of cleavage by aggrecanase. |
| Involvement in disease | Defects in ACAN are the cause of spondyloepiphyseal dysplasia type Kimberley (SEDK) [MIM:608361]. Spondyloepiphyseal dysplasias are a heterogeneous group of congenital chondrodysplasias that specifically affect epiphyses and vertebrae. The autosomal dominant SEDK is associated with premature degenerative arthropathy. Ref.9 |
| Sequence similarities | Belongs to the aggrecan/versican proteoglycan family. Contains 1 C-type lectin domain. Contains 1 EGF-like domain. Contains 1 Ig-like V-type (immunoglobulin-like) domain. Contains 4 Link domains. Contains 1 Sushi (CCP/SCR) domain. |
Ontologies
Alternative products
| This entry describes 3 isoforms produced by alternative splicing. [Align] [Select] Note: Additional isoforms seem to exist. | ||||||
| Isoform 1 (identifier: P16112-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Isoform 2 (identifier: P16112-2) The sequence of this isoform differs from the canonical sequence as follows: 2163-2200: Missing. | ||||||
| Isoform 3 (identifier: P16112-3) The sequence of this isoform differs from the canonical sequence as follows: 2163-2200: Missing. 2330-2390: Missing. |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 16 | 16 | Ref.2 | ||||||||
| Chain | 17 – 2415 | 2399 | Aggrecan core protein | PRO_0000017505 | |||||||
| Chain | 393 – 2415 | 2023 | Aggrecan core protein 2 | PRO_0000017506 | |||||||
Regions | |||||||||||
| Domain | 34 – 147 | 114 | Ig-like V-type | ||||||||
| Domain | 153 – 248 | 96 | Link 1 | ||||||||
| Domain | 254 – 350 | 97 | Link 2 | ||||||||
| Domain | 478 – 573 | 96 | Link 3 | ||||||||
| Domain | 579 – 674 | 96 | Link 4 | ||||||||
| Repeat | 772 – 777 | 6 | 1-1 | ||||||||
| Repeat | 778 – 783 | 6 | 1-2 | ||||||||
| Repeat | 784 – 789 | 6 | 1-3 | ||||||||
| Repeat | 790 – 795 | 6 | 1-4 | ||||||||
| Repeat | 796 – 801 | 6 | 1-5 | ||||||||
| Repeat | 802 – 807 | 6 | 1-6 | ||||||||
| Repeat | 808 – 813 | 6 | 1-7; approximate | ||||||||
| Repeat | 814 – 819 | 6 | 1-8; approximate | ||||||||
| Repeat | 820 – 825 | 6 | 1-9 | ||||||||
| Repeat | 826 – 831 | 6 | 1-10; approximate | ||||||||
| Repeat | 832 – 837 | 6 | 1-11 | ||||||||
| Repeat | 838 – 843 | 6 | 1-12 | ||||||||
| Repeat | 941 – 959 | 19 | 2-1 | ||||||||
| Repeat | 960 – 978 | 19 | 2-2 | ||||||||
| Repeat | 979 – 997 | 19 | 2-3 | ||||||||
| Repeat | 998 – 1016 | 19 | 2-4 | ||||||||
| Repeat | 1017 – 1035 | 19 | 2-5 | ||||||||
| Repeat | 1036 – 1054 | 19 | 2-6 | ||||||||
| Repeat | 1055 – 1073 | 19 | 2-7 | ||||||||
| Repeat | 1074 – 1092 | 19 | 2-8 | ||||||||
| Repeat | 1093 – 1111 | 19 | 2-9 | ||||||||
| Repeat | 1112 – 1130 | 19 | 2-10 | ||||||||
| Repeat | 1131 – 1149 | 19 | 2-11 | ||||||||
| Repeat | 1150 – 1168 | 19 | 2-12 | ||||||||
| Repeat | 1169 – 1187 | 19 | 2-13 | ||||||||
| Repeat | 1188 – 1206 | 19 | 2-14 | ||||||||
| Repeat | 1207 – 1225 | 19 | 2-15 | ||||||||
| Repeat | 1226 – 1244 | 19 | 2-16 | ||||||||
| Repeat | 1245 – 1263 | 19 | 2-17 | ||||||||
| Repeat | 1264 – 1282 | 19 | 2-18 | ||||||||
| Repeat | 1283 – 1301 | 19 | 2-19 | ||||||||
| Repeat | 1302 – 1320 | 19 | 2-20 | ||||||||
| Repeat | 1321 – 1339 | 19 | 2-21 | ||||||||
| Repeat | 1340 – 1358 | 19 | 2-22 | ||||||||
| Repeat | 1360 – 1378 | 19 | 2-23 | ||||||||
| Repeat | 1379 – 1397 | 19 | 2-24 | ||||||||
| Repeat | 1398 – 1416 | 19 | 2-25 | ||||||||
| Repeat | 1418 – 1436 | 19 | 2-26 | ||||||||
| Repeat | 1439 – 1457 | 19 | 2-27; approximate | ||||||||
| Repeat | 1459 – 1477 | 19 | 2-28 | ||||||||
| Repeat | 1479 – 1497 | 19 | 2-29 | ||||||||
| Domain | 2164 – 2199 | 36 | EGF-like | ||||||||
| Domain | 2201 – 2327 | 127 | C-type lectin | ||||||||
| Domain | 2330 – 2390 | 61 | Sushi | ||||||||
| Region | 48 – 141 | 94 | G1-A | ||||||||
| Region | 152 – 247 | 96 | G1-B | ||||||||
| Region | 253 – 349 | 97 | G1-B' | ||||||||
| Region | 477 – 571 | 95 | G2-B | ||||||||
| Region | 578 – 672 | 95 | G2-B' | ||||||||
| Region | 676 – 848 | 173 | KS | ||||||||
| Region | 772 – 843 | 72 | 12 X 6 AA approximate tandem repeats of E-[GVE]-P-[SFY]-[APT]-[TSP] | ||||||||
| Region | 851 – 1497 | 647 | CS-1 | ||||||||
| Region | 941 – 1497 | 557 | 29 X 19 AA approximate tandem repeats of E-[IVDG]-[LV]-[EV]-[GTI]-[STA]-[ATV]-[SP]-[GA]-[VIFAD]-[GEDL]-[DE]-[LVI]-[SG]-[GERK]-[LV]-P-S-G | ||||||||
| Region | 1498 – 2162 | 665 | CS-2 | ||||||||
| Region | 2163 – 2415 | 253 | G3 | ||||||||
Sites | |||||||||||
| Site | 392 – 393 | 2 | Cleavage; by aggrecanase | ||||||||
Amino acid modifications | |||||||||||
| Glycosylation | 126 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 239 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 333 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 371 | 1 | O-linked (Xyl...) (keratan sulfate) Probable | ||||||||
| Glycosylation | 376 | 1 | O-linked (Xyl...) (keratan sulfate) Probable | ||||||||
| Glycosylation | 387 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 434 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 602 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 657 | 1 | N-linked (GlcNAc...) Ref.10 | ||||||||
| Glycosylation | 737 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 1898 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Disulfide bond | 51 ↔ 133 | By similarity | |||||||||
| Disulfide bond | 175 ↔ 246 | By similarity | |||||||||
| Disulfide bond | 199 ↔ 220 | By similarity | |||||||||
| Disulfide bond | 273 ↔ 348 | By similarity | |||||||||
| Disulfide bond | 297 ↔ 318 | By similarity | |||||||||
| Disulfide bond | 500 ↔ 571 | By similarity | |||||||||
| Disulfide bond | 524 ↔ 545 | By similarity | |||||||||
| Disulfide bond | 598 ↔ 672 | By similarity | |||||||||
| Disulfide bond | 621 ↔ 642 | By similarity | |||||||||
| Disulfide bond | 2168 ↔ 2178 | By similarity | |||||||||
| Disulfide bond | 2173 ↔ 2187 | By similarity | |||||||||
| Disulfide bond | 2189 ↔ 2198 | By similarity | |||||||||
| Disulfide bond | 2205 ↔ 2216 | By similarity | |||||||||
| Disulfide bond | 2233 ↔ 2325 | By similarity | |||||||||
| Disulfide bond | 2301 ↔ 2317 | By similarity | |||||||||
| Disulfide bond | 2332 ↔ 2375 | By similarity | |||||||||
| Disulfide bond | 2361 ↔ 2388 | By similarity | |||||||||
Natural variations | |||||||||||
| Alternative sequence | 2163 – 2200 | 38 | Missing in isoform 2 and isoform 3. | VSP_003074 | |||||||
| Alternative sequence | 2330 – 2390 | 61 | Missing in isoform 3. | VSP_003075 | |||||||
| Natural variant | 102 | 1 | D → E: dbSNP rs16942318. | VAR_056152 | |||||||
| Natural variant | 275 | 1 | R → Q: dbSNP rs34949187. | VAR_056153 | |||||||
| Natural variant | 1943 | 1 | P → L: dbSNP rs35061438. | VAR_056154 | |||||||
| Natural variant | 2005 | 1 | S → R: dbSNP rs34153007. | VAR_056155 | |||||||
Experimental info | |||||||||||
| Sequence conflict | 766 | 1 | E → A in AAC60643. Ref.6 | ||||||||
| Sequence conflict | 847 | 1 | E → V in AAC60643. Ref.6 | ||||||||
| Sequence conflict | 1928 | 1 | E → A in CAA35463. Ref.7 | ||||||||
| Sequence conflict | 1964 | 1 | I → V in CAA35463. Ref.7 | ||||||||
| Sequence conflict | 1964 | 1 | I → V in AAA35726. Ref.8 | ||||||||
| Sequence conflict | 2070 | 1 | P → A in AAA35726. Ref.8 | ||||||||
| Sequence conflict | 2391 | 1 | A → P in CAA35463. Ref.7 | ||||||||
| Sequence conflict | 2391 | 1 | A → P in AAA35726. Ref.8 | ||||||||
Sequences
| ||||||||||||||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "Complete coding sequence and deduced primary structure of the human cartilage large aggregating proteoglycan, aggrecan. Human-specific repeats, and additional alternatively spliced forms." Doege K.J., Sasaki M., Kimura T., Yamada Y. J. Biol. Chem. 266:894-902(1991) [PubMed: 1985970] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 3). Tissue: Chondrocyte. |
| [2] | "Catabolism of aggrecan by explant cultures of human articular cartilage in the presence of retinoic acid." Ilic M.Z., Mok M.T., Williamson O.D., Campbell M.A., Hughes C.E., Handley C.J. Arch. Biochem. Biophys. 322:22-30(1995) [PubMed: 7574678] [Abstract] Cited for: PROTEIN SEQUENCE OF 17-27 AND 393-403, PROTEOLYTIC PROCESSING BY AGGRECANASE. |
| [3] | "The structure of aggrecan fragments in human synovial fluid. Evidence for the involvement in osteoarthritis of a novel proteinase which cleaves the Glu 373-Ala 374 bond of the interglobular domain." Sandy J.D., Flannery C.R., Neame P.J., Lohmander L.S. J. Clin. Invest. 89:1512-1516(1992) [PubMed: 1569188] [Abstract] Cited for: PROTEIN SEQUENCE OF 361-373 AND 393-409, PROTEOLYTIC PROCESSING, GLYCOSYLATION AT THR-371 AND THR-376. |
| [4] | "Analysis of aggrecan and tenascin gene expression in mouse skeletal tissues by northern and in situ hybridization using species specific cDNA probes." Glumoff V., Savontaus M., Vehanen J., Vuorio E. Biochim. Biophys. Acta 1219:613-622(1994) [PubMed: 7524681] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 380-497, TISSUE SPECIFICITY. |
| [5] | "The structure of aggrecan fragments in human synovial fluid. Evidence that aggrecanase mediates cartilage degradation in inflammatory joint disease, joint injury, and osteoarthritis." Lohmander L.S., Neame P.J., Sandy J.D. Arthritis Rheum. 36:1214-1222(1993) [PubMed: 8216415] [Abstract] Cited for: PROTEIN SEQUENCE OF 393-409. Tissue: Synovial fluid. |
| [6] | "Length variation in the keratan sulfate domain of mammalian aggrecan." Barry F.P., Neame P.J., Sasse J., Pearson D. Matrix Biol. 14:323-328(1994) [PubMed: 7827755] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 764-864. Tissue: Blood. |
| [7] | "Age-related changes in the content of the C-terminal region of aggrecan in human articular cartilage." Dudhia J., Davidson C.M., Wells T.M., Vynios D.H., Hardingham T.E., Bayliss M.T. Biochem. J. 313:933-940(1996) [PubMed: 8611178] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1778-2415 (ISOFORM 2). Tissue: Chondrocyte. |
| [8] | "A new epidermal growth factor-like domain in the human core protein for the large cartilage-specific proteoglycan. Evidence for alternative splicing of the domain." Baldwin C.T., Reginato A.M., Prockop D.J. J. Biol. Chem. 264:15747-15750(1989) [PubMed: 2789216] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1936-2415 (ISOFORM 1). |
| [9] | "A mutation in the variable repeat region of the aggrecan gene (AGC1) causes a form of spondyloepiphyseal dysplasia associated with severe, premature osteoarthritis." Gleghorn L., Ramesar R., Beighton P., Wallis G. Am. J. Hum. Genet. 77:484-490(2005) [PubMed: 16080123] [Abstract] Cited for: INVOLVEMENT IN SEDK. |
| [10] | "Human plasma N-glycoproteome analysis by immunoaffinity subtraction, hydrazide chemistry, and mass spectrometry." Liu T., Qian W.-J., Gritsenko M.A., Camp D.G. II, Monroe M.E., Moore R.J., Smith R.D. J. Proteome Res. 4:2070-2080(2005) [PubMed: 16335952] [Abstract] Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-657, MASS SPECTROMETRY. Tissue: Plasma. |
| [11] | "Interaction of cartilage oligomeric matrix protein/thrombospondin 5 with aggrecan." Chen F.-H., Herndon M.E., Patel N., Hecht J.T., Tuan R.S., Lawler J. J. Biol. Chem. 282:24591-24598(2007) [PubMed: 17588949] [Abstract] Cited for: INTERACTION WITH COMP. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| M55172 mRNA. Translation: AAA62824.1. X80278 mRNA. No translation available. S74659 Genomic DNA. Translation: AAC60643.2. X17406 mRNA. Translation: CAA35463.1. J05062 mRNA. Translation: AAA35726.1. | |
| IPI | IPI00291932. IPI00923424. IPI00923437. |
| PIR | A39086. |
| UniGene | Hs.2159 Hs.616395 Hs.711491 |
3D structure databases | |
| SMR | P16112. Positions 2203-2328. |
| ModBase | Search... |
Protein-protein interaction databases | |
| STRING | P16112. |
PTM databases | |
| PhosphoSite | P16112. |
Proteomic databases | |
| PRIDE | P16112. |
Genome annotation databases | |
| Ensembl | ENST00000268134; ENSP00000268134; ENSG00000157766; Homo sapiens. [Genome view] |
| KEGG | hsa:176. |
Organism-specific databases | |
| CTD | 176. |
| GeneCards | GC15P087148. |
| H-InvDB | HIX0026812. |
| HGNC | HGNC:319. ACAN. |
| HPA | CAB016377. |
| MIM | 155760. gene. 608361. phenotype. |
| Orphanet | 171866. Spondyloepimetaphyseal dysplasia, aggrecan type. 253. Spondyloepiphyseal dysplasia. 93283. Spondyloepiphyseal dysplasia, Kimberley type. |
| PharmGKB | PA24616. |
| GenAtlas | Search... |
Phylogenomic databases | |
| HOGENOM | P16112. |
| HOVERGEN | P16112. |
Gene expression databases | |
| ArrayExpress | P16112. |
| Bgee | P16112. |
| CleanEx | HS_ACAN. |
| Genevestigator | P16112. |
| GermOnline | ENSG00000157766. Homo sapiens. |
Family and domain databases | |
| InterPro | IPR001304. C-type_lectin. IPR016186. C-type_lectin-like. IPR018378. C-type_lectin_CS. IPR016187. C-type_lectin_fold. IPR016060. Complement_control_module. IPR006209. EGF. IPR006210. EGF-like. IPR013032. EGF-like_reg_CS. IPR000742. EGF_3. IPR007110. Ig-like. IPR013783. Ig-like_fold. IPR003006. Ig/MHC_CS. IPR003599. Ig_sub. IPR013106. Ig_V-set. IPR000538. Link. IPR000436. Sushi_SCR_CCP. [Graphical view] |
| Gene3D | G3DSA:3.10.100.10. C-type_lectin-like. 5 hits. G3DSA:2.10.70.10. Complement_control_module. 1 hit. G3DSA:2.60.40.10. Ig-like_fold. 1 hit. |
| Pfam | PF00008. EGF. 1 hit. PF00059. Lectin_C. 1 hit. PF00084. Sushi. 1 hit. PF07686. V-set. 1 hit. PF00193. Xlink. 4 hits. [Graphical view] |
| PRINTS | PR01265. LINKMODULE. |
| SMART | SM00032. CCP. 1 hit. SM00034. CLECT. 1 hit. SM00181. EGF. 1 hit. SM00409. IG. 1 hit. SM00445. LINK. 4 hits. [Graphical view] |
| PROSITE | PS00615. C_TYPE_LECTIN_1. 1 hit. PS50041. C_TYPE_LECTIN_2. 1 hit. PS00022. EGF_1. 1 hit. PS01186. EGF_2. 1 hit. PS50026. EGF_3. 1 hit. PS50835. IG_LIKE. 1 hit. PS00290. IG_MHC. 1 hit. PS01241. LINK_1. 3 hits. PS50963. LINK_2. 4 hits. PS50923. SUSHI. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Other Resources | |
| PMAP-CutDB | P16112. |
| SOURCE | Search... |
Entry information
| Entry name | PGCA_HUMAN | ||||||||
| Accession | Primary (citable) accession number: P16112 Secondary accession number(s): Q13650 Q9UDE0 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation project | HPI (Human Proteome Initiative) | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Human chromosome 15 Human chromosome 15: entries, gene names and cross-references to MIM |
| Human entries with polymorphisms or disease mutations List of human entries with polymorphisms or disease mutations |
| Human polymorphisms and disease mutations Index of human polymorphisms and disease mutations |
| MIM cross-references Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |

Clusters with


