Q14031 (CO4A6_HUMAN) Reviewed, UniProtKB/Swiss-Prot
Last modified
May 29, 2013.
Version 138.
History...
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Collagen alpha-6(IV) chain | ||
| Gene names |
| ||
| Organism | Homo sapiens (Human) [Reference proteome] | ||
| Taxonomic identifier | 9606 [NCBI] | ||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo![]() |
Protein attributes
| Sequence length | 1691 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at transcript level |
General annotation (Comments)
| Function | Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen. |
| Subunit structure | There are six type IV collagen isoforms, alpha 1(IV)-alpha 6(IV), each of which can form a triple helix structure with 2 other chains to generate type IV collagen network. |
| Subcellular location | Secreted › extracellular space › extracellular matrix › basement membrane. |
| Domain | Alpha chains of type IV collagen have a non-collagenous domain (NC1) at their C-terminus, frequent interruptions of the G-X-Y repeats in the long central triple-helical domain (which may cause flexibility in the triple helix), and a short N-terminal triple-helical 7S domain. |
| Post-translational modification | Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains. Type IV collagens contain numerous cysteine residues which are involved in inter- and intramolecular disulfide bonding. 12 of these, located in the NC1 domain, are conserved in all known type IV collagens. The trimeric structure of the NC1 domains is stabilized by covalent bonds between Lys and Met residues By similarity. |
| Involvement in disease | Deletions covering the N-terminal regions of COL4A5 and COL4A6, which are localized in a head-to-head manner, are found in the chromosome Xq22.3 centromeric deletion syndrome. This results in a phenotype with features of diffuse leiomyomatosis and Alport syndrome (DL-ATS). Ref.6 Ref.7 |
| Sequence similarities | Belongs to the type IV collagen family. Contains 1 collagen IV NC1 (C-terminal non-collagenous) domain. |
Ontologies
| Keywords | |
|---|---|
| Biological process | Cell adhesion |
| Cellular component | Basement membrane Extracellular matrix Secreted |
| Coding sequence diversity | Alternative splicing Chromosomal rearrangement Polymorphism |
| Domain | Collagen Repeat Signal |
| PTM | Disulfide bond Glycoprotein Hydroxylation |
| Technical term | Complete proteome Reference proteome |
| Gene Ontology (GO) | |
| Biological_process | cell adhesion Inferred from electronic annotation. Source: UniProtKB-KW cellular response to amino acid stimulusInferred from electronic annotation. Source: Compara collagen catabolic processTraceable author statement. Source: Reactome extracellular matrix disassemblyTraceable author statement. Source: Reactome |
| Cellular_component | collagen type IV Non-traceable author statement Ref.1. Source: UniProtKB endoplasmic reticulum lumenTraceable author statement. Source: Reactome |
| Molecular_function | extracellular matrix structural constituent Non-traceable author statement. Source: UniProtKB |
| Complete GO annotation... | |
Alternative products
| This entry describes 2 isoforms produced by alternative splicing. [Align] [Select] | ||||||
| Isoform A (identifier: Q14031-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Isoform B (identifier: Q14031-2) The sequence of this isoform differs from the canonical sequence as follows: 1-5: MLINK → MHPG |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 22 | 22 | Potential | ||||||||
| Chain | 23 – 1691 | 1669 | Collagen alpha-6(IV) chain | PRO_0000005853 | |||||||
Regions | |||||||||||
| Domain | 1467 – 1691 | 225 | Collagen IV NC1 | ||||||||
| Region | 23 – 46 | 24 | 7S domain | ||||||||
| Region | 47 – 1463 | 1417 | Triple-helical region | ||||||||
| Motif | 515 – 517 | 3 | Cell attachment site Potential | ||||||||
| Motif | 560 – 562 | 3 | Cell attachment site Potential | ||||||||
| Motif | 986 – 988 | 3 | Cell attachment site Potential | ||||||||
Amino acid modifications | |||||||||||
| Glycosylation | 127 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Disulfide bond | 1482 ↔ 1571 | Or C-1482 with C-1568 By similarity | |||||||||
| Disulfide bond | 1515 ↔ 1568 | Or C-1515 with C-1571 By similarity | |||||||||
| Disulfide bond | 1527 ↔ 1533 | By similarity | |||||||||
| Disulfide bond | 1590 ↔ 1687 | Or C-1590 with C-1684 By similarity | |||||||||
| Disulfide bond | 1624 ↔ 1684 | Or C-1624 with C-1687 By similarity | |||||||||
| Disulfide bond | 1636 ↔ 1643 | By similarity | |||||||||
Natural variations | |||||||||||
| Alternative sequence | 1 – 5 | 5 | MLINK → MHPG in isoform B. | VSP_001174 | |||||||
| Natural variant | 455 | 1 | S → A. Ref.3 Corresponds to variant rs1042065 [ dbSNP | Ensembl ]. | VAR_015216 | |||||||
| Natural variant | 455 | 1 | S → P. Corresponds to variant rs1042065 [ dbSNP | Ensembl ]. | VAR_059242 | |||||||
| Natural variant | 1110 | 1 | N → K. Ref.3 Corresponds to variant rs1042067 [ dbSNP | Ensembl ]. | VAR_015217 | |||||||
| Natural variant | 1126 | 1 | P → S. Corresponds to variant rs35179844 [ dbSNP | Ensembl ]. | VAR_032972 | |||||||
| Natural variant | 1130 | 1 | G → E in a colorectal cancer sample; somatic mutation. Ref.8 | VAR_035748 | |||||||
| Natural variant | 1162 | 1 | I → V. Corresponds to variant rs34466065 [ dbSNP | Ensembl ]. | VAR_032973 | |||||||
| Natural variant | 1362 | 1 | L → P. Corresponds to variant rs35363062 [ dbSNP | Ensembl ]. | VAR_032974 | |||||||
Experimental info | |||||||||||
| Sequence conflict | 170 | 1 | M → I in AAA19569. Ref.2 | ||||||||
| Sequence conflict | 170 | 1 | M → I in AAA16338. Ref.5 | ||||||||
| Sequence conflict | 272 – 273 | 2 | IS → LR in AAB19038. Ref.3 | ||||||||
| Sequence conflict | 272 – 273 | 2 | IS → LR in AAB19039. Ref.3 | ||||||||
| Sequence conflict | 366 | 1 | V → D in AAB19038. Ref.3 | ||||||||
| Sequence conflict | 366 | 1 | V → D in AAB19039. Ref.3 | ||||||||
| Sequence conflict | 518 | 1 | R → Q in AAB19038. Ref.3 | ||||||||
| Sequence conflict | 518 | 1 | R → Q in AAB19039. Ref.3 | ||||||||
| Sequence conflict | 917 | 1 | P → S in BAA04809. Ref.1 | ||||||||
| Sequence conflict | 917 | 1 | P → S in AAB19038. Ref.3 | ||||||||
| Sequence conflict | 917 | 1 | P → S in AAB19039. Ref.3 | ||||||||
| Sequence conflict | 1302 – 1313 | 12 | Missing in BAA04809. Ref.1 | ||||||||
| Sequence conflict | 1356 | 1 | P → A in BAA04809. Ref.1 | ||||||||
| Sequence conflict | 1365 | 1 | D → H in AAB19038. Ref.3 | ||||||||
| Sequence conflict | 1365 | 1 | D → H in AAB19039. Ref.3 | ||||||||
Sequences
| ||||||||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "Identification of a new collagen IV chain, alpha 6(IV), by cDNA isolation and assignment of the gene to chromosome Xq22, which is the same locus for COL4A5." Oohashi T., Sugimoto M., Mattei M.-G., Ninomiya Y. J. Biol. Chem. 269:7520-7526(1994) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM B). Tissue: Eye and Kidney. |
| [2] | "Complete primary structure of the sixth chain of human basement membrane collagen, alpha 6(IV). Isolation of the cDNAs for alpha 6(IV) and comparison with five other type IV collagen chains." Zhou J., Ding M., Zhao Z., Reeders S.T. J. Biol. Chem. 269:13193-13199(1994) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM A). |
| [3] | "Structure of the human type IV collagen COL4A6 gene, which is mutated in Alport syndrome-associated leiomyomatosis." Zhang X., Zhou J., Reeders S.T., Tryggvason K. Genomics 33:473-479(1996) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] (ISOFORMS A AND B), VARIANTS ALA-455 AND LYS-1110. |
| [4] | "The DNA sequence of the human X chromosome." Ross M.T., Grafham D.V., Coffey A.J., Scherer S., McLay K., Muzny D., Platzer M., Howell G.R., Burrows C., Bird C.P., Frankish A., Lovell F.L., Howe K.L., Ashurst J.L., Fulton R.S., Sudbrak R., Wen G., Jones M.C. Bentley D.R.Nature 434:325-337(2005) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. |
| [5] | Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., Turner R. Venter J.C.Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. |
| [6] | "Deletion of the paired alpha 5(IV) and alpha 6(IV) collagen genes in inherited smooth muscle tumors." Zhou J., Mochizuki T., Smeets H., Antignac C., Laurila P., de Paepe A., Tryggvason K., Reeders S.T. Science 261:1167-1169(1993) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1-542 (ISOFORM A), INVOLVEMENT IN DL-ATS. |
| [7] | "Alport syndrome with diffuse leiomyomatosis." Anker M.C., Arnemann J., Neumann K., Ahrens P., Schmidt H., Koenig R. Am. J. Med. Genet. A 119:381-385(2003) [PubMed] [Europe PMC] [Abstract] Cited for: INVOLVEMENT IN DL-ATS. |
| [8] | "The consensus coding sequences of human breast and colorectal cancers." Sjoeblom T., Jones S., Wood L.D., Parsons D.W., Lin J., Barber T.D., Mandelker D., Leary R.J., Ptak J., Silliman N., Szabo S., Buckhaults P., Farrell C., Meeh P., Markowitz S.D., Willis J., Dawson D., Willson J.K.V. Velculescu V.E.Science 314:268-274(2006) [PubMed] [Europe PMC] [Abstract] Cited for: VARIANT [LARGE SCALE ANALYSIS] GLU-1130. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | D21337 mRNA. Translation: BAA04809.1. U04845 mRNA. Translation: AAA19569.2. U47004 U47003 Genomic DNA. Translation: AAB19038.1.U47004 U47003 Genomic DNA. Translation: AAB19039.1.AL136080 AL109943 Genomic DNA. Translation: CAI40756.1.AL136080 AL109943 Genomic DNA. Translation: CAI40758.1.AL034369 AL136080 Genomic DNA. Translation: CAI42045.1.AL034369 AL136080 Genomic DNA. Translation: CAI42047.1.AL109943 AL136080 Genomic DNA. Translation: CAI42993.1.AL109943 AL136080 Genomic DNA. Translation: CAI42995.1.AL031177 AL136080 Genomic DNA. Translation: CAI43139.1.AL031177 AL136080 Genomic DNA. Translation: CAI43140.1.CH471120 Genomic DNA. Translation: EAX02688.1. L22763 mRNA. Translation: AAA16338.1. |
| IPI | IPI00472200. IPI00514030. |
| PIR | CGHU6B. A54122. |
| RefSeq | NP_001838.2. NM_001847.2. NP_378667.1. NM_033641.2. |
| UniGene | Hs.145586. |
3D structure databases | |
| ProteinModelPortal | Q14031. |
| ModBase | Search... |
Protein-protein interaction databases | |
| IntAct | Q14031. 1 interaction. |
PTM databases | |
| PhosphoSite | Q14031. |
Polymorphism databases | |
| DMDM | 116241307. |
Proteomic databases | |
| PaxDb | Q14031. |
| PRIDE | Q14031. |
Protocols and materials databases | |
| DNASU | 1288. |
| StructuralBiologyKnowledgebase | Search... |
Genome annotation databases | |
| Ensembl | ENST00000334504; ENSP00000334733; ENSG00000197565. ENST00000372216; ENSP00000361290; ENSG00000197565. |
| GeneID | 1288. |
| KEGG | hsa:1288. |
| UCSC | uc004env.4. human. uc004enw.4. human. |
Organism-specific databases | |
| CTD | 1288. |
| GeneCards | GC0XM107386. |
| H-InvDB | HIX0056214. |
| HGNC | HGNC:2208. COL4A6. |
| MIM | 303631. gene. |
| neXtProt | NX_Q14031. |
| Orphanet | 1018. X-linked diffuse leiomyomatosis - Alport syndrome. |
| PharmGKB | PA26723. |
| GenAtlas | Search... |
Phylogenomic databases | |
| eggNOG | NOG12793. |
| HOVERGEN | HBG004933. |
| KO | K06237. |
| OrthoDB | EOG4P2Q1K. |
Enzyme and pathway databases | |
| Reactome | REACT_118779. Extracellular matrix organization. |
Gene expression databases | |
| ArrayExpress | Q14031. |
| Bgee | Q14031. |
| Genevestigator | Q14031. |
| GermOnline | ENSG00000197565. Homo sapiens. |
Family and domain databases | |
| Gene3D | 2.170.240.10. 1 hit. |
| InterPro | IPR016187. C-type_lectin_fold. IPR008160. Collagen. IPR001442. Collagen_VI_NC. [Graphical view] |
| Pfam | PF01413. C4. 2 hits. PF01391. Collagen. 22 hits. [Graphical view] |
| SMART | SM00111. C4. 2 hits. [Graphical view] |
| SUPFAM | SSF56436. C-type_lectin_fold. 2 hits. |
| PROSITE | PS51403. NC1_IV. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Other | |
| GenomeRNAi | 1288. |
| NextBio | 5213. |
| PMAP-CutDB | Q5JYH8. |
| SOURCE | Search... |
Entry information
| Entry name | CO4A6_HUMAN | ||||||||
| Accession | Primary (citable) accession number: Q14031 Secondary accession number(s): Q12823 Q9Y4L4 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Human chromosome X Human chromosome X: entries, gene names and cross-references to MIM |
| Human entries with polymorphisms or disease mutations List of human entries with polymorphisms or disease mutations |
| Human polymorphisms and disease mutations Index of human polymorphisms and disease mutations |
| MIM cross-references Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |

Clusters with
