Reviewed, UniProtKB/Swiss-Prot P17140 (CO4A2_CAEEL). Last modified November 3, 2009. Version 100.
Names and origin
| Protein names | Recommended name: Collagen alpha-2(IV) chain. Alternative name(s): Lethal protein 2 |
| Gene names | let-2 (ORF name: F01G12.5) |
| Organism | Caenorhabditis elegans [Complete proteome] |
| Taxonomic identifier | 6239 [NCBI] |
| Taxonomic lineage | Eukaryota › Metazoa › Nematoda › Chromadorea › Rhabditida › Rhabditoidea › Rhabditidae › Peloderinae › Caenorhabditis |
Protein attributes
| Sequence length | 1758 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level. |
General annotation (Comments)
| Function | Collagen type IV is specific for basement membranes. Vital for embryonic development. Ref.1 |
| Subunit structure | Trimers of two alpha 1(IV) and one alpha 2(IV) chain. Type IV collagen forms a mesh-like network linked through intermolecular interactions between 7S domains and between NC1 domains. |
| Subcellular location | Secreted › extracellular space › extracellular matrix › basement membrane. |
| Developmental stage | Isoform I is predominant in embryos and isoform II is predominant in the larvae and adults. |
| Domain | Alpha chains of type IV collagen have a non-collagenous domain (NC1) at their C-terminus, frequent interruptions of the G-X-Y repeats in the long central triple-helical domain (which may cause flexibility in the triple helix), and a short N-terminal triple-helical 7S domain. |
| Post-translational modification | Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains. Type IV collagens contain numerous cysteine residues which are involved in inter- and intramolecular disulfide bonding. 12 of these, located in the NC1 domain, are conserved in all known type IV collagens. The trimeric structure of the NC1 domains is stabilized by covalent bonds between Lys and Met residues (by similarity). |
| Sequence similarities | Belongs to the type IV collagen family. Contains 1 collagen IV NC1 (C-terminal non-collagenous) domain. |
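
The Domain comment mentions frequent interruptions of the G-X-Y repeats. As a rough illustration only, the sketch below scans a segment in fixed triplets and reports those that do not begin with glycine. The function name `gxy_breaks` is hypothetical, and the fixed reading frame is an assumption: real interruptions can insert or delete residues and shift the frame, which this does not model. The test segment is residues 229-264 of isoform I as listed under Alternative products.

```python
def gxy_breaks(segment: str) -> list[int]:
    """Return 0-based offsets of triplets that do not start with glycine.

    Assumption: the repeat frame starts at offset 0 and never shifts.
    """
    return [i for i in range(0, len(segment) - 2, 3) if segment[i] != "G"]


# Residues 229-264 of isoform I, copied from the Alternative products section.
SEGMENT = "GDLGSVGPPGPPGPREFTGSGSIVGPRGNPGEKGDK"

breaks = gxy_breaks(SEGMENT)
print(breaks)                              # [15, 21]
print([SEGMENT[i:i + 3] for i in breaks])  # ['EFT', 'SIV']
```

Under the fixed-frame assumption, two triplets in this 36-residue stretch lack a leading glycine, consistent with the annotation that the triple-helical region is not a perfect repeat.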
Ontologies
Alternative products
| This entry describes 2 isoforms produced by alternative splicing. |
| Isoform I (identifier: P17140-1) Also known as: a; This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. |
| Isoform II (identifier: P17140-2) Also known as: b; The sequence of this isoform differs from the canonical sequence as follows: 229-264: GDLGSVGPPGPPGPREFTGSGSIVGPRGNPGEKGDK → GDIGAMGPAGPPGPIASTMSKGTIIGPKGDLGEKGEK |
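
The isoform II description is a positional substitution against the canonical sequence. A minimal sketch of applying it is shown below. The helper `apply_variant` is hypothetical, and because the full 1758-residue sequence is not reproduced in this entry text, the canonical sequence is a placeholder padded with `X` around the real 229-264 segment.

```python
def apply_variant(canonical: str, start: int, end: int, replacement: str) -> str:
    """Replace residues start..end (1-based, inclusive) with replacement."""
    if not 1 <= start <= end <= len(canonical):
        raise ValueError("variant range lies outside the sequence")
    return canonical[:start - 1] + replacement + canonical[end:]


ORIGINAL = "GDLGSVGPPGPPGPREFTGSGSIVGPRGNPGEKGDK"      # canonical 229-264
REPLACEMENT = "GDIGAMGPAGPPGPIASTMSKGTIIGPKGDLGEKGEK"  # isoform II

# Placeholder canonical sequence: only positions 229-264 are real residues.
canonical = "X" * 228 + ORIGINAL + "X" * (1758 - 264)
isoform_ii = apply_variant(canonical, 229, 264, REPLACEMENT)

print(len(canonical), len(isoform_ii))  # 1758 1759
```

The replacement string as listed is one residue longer than the 36 residues it replaces, so with these strings isoform II comes out one residue longer than the canonical sequence.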
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Feature identifier |
|---|---|---|---|---|
| Molecule processing | | | | |
| Signal peptide | 1 – 26 | 26 | Potential | |
| Chain | 27 – 1758 | 1732 | Collagen alpha-2(IV) chain | PRO_0000005829 |
| Regions | | | | |
| Domain | 1531 – 1754 | 224 | Collagen IV NC1 | |
| Region | 27 – 42 | 16 | 7S domain | |
| Region | 42 – 1527 | 1486 | Triple-helical region | |
| Amino acid modifications | | | | |
| Glycosylation | 248 | 1 | O-linked (Xyl...) (glycosaminoglycan) Potential | |
| Disulfide bond | 1546 ↔ 1635 | | By similarity | |
| Disulfide bond | 1579 ↔ 1632 | | By similarity | |
| Disulfide bond | 1591 ↔ 1597 | | By similarity | |
| Disulfide bond | 1654 ↔ 1750 | | By similarity | |
| Disulfide bond | 1688 ↔ 1747 | | By similarity | |
| Disulfide bond | 1700 ↔ 1707 | | By similarity | |
| Natural variations | | | | |
| Alternative sequence | 229 – 264 | 36 | GDLGS…EKGDK → GDIGAMGPAGPPGPIASTMSKGTIIGPKGDLGEKGEK in isoform II. | VSP_001160 |
| Experimental info | | | | |
| Mutagenesis | 48 | 1 | G → E in MN114; 73% lethal. | |
| Mutagenesis | 366 | 1 | A → T in MN126; 100% lethal. | |
| Mutagenesis | 570 | 1 | G → E in MN109; 37% lethal. | |
| Mutagenesis | 588 | 1 | G → R in MN103 and MN151; 96% lethal. | |
| Mutagenesis | 597 | 1 | G → R in MN152; 50% lethal. | |
| Mutagenesis | 690 | 1 | G → E in MN129; 100% lethal. | |
| Mutagenesis | 690 | 1 | G → R in MN101; 100% lethal. | |
| Mutagenesis | 737 | 1 | G → E in MN143; 100% lethal. | |
| Mutagenesis | 877 | 1 | G → R in G30; 90% lethal. | |
| Mutagenesis | 904 | 1 | G → R in E1470; 94% lethal. | |
| Mutagenesis | 1003 | 1 | G → E in MN139; 20% lethal. | |
| Mutagenesis | 1125 | 1 | G → D in G25; 2% lethal. | |
| Mutagenesis | 1152 | 1 | G → D in MN147; 7% lethal. | |
| Mutagenesis | 1286 | 1 | G → D in G37 and B246; 9% lethal. | |
| Sequence conflict | 1604 | 1 | E → D in AAA96215. Ref.2 | |
| Sequence conflict | 1604 | 1 | E → D in AAA96216. Ref.2 | |
| Sequence conflict | 1682 | 1 | P → L in AAA96215. Ref.2 | |
| Sequence conflict | 1682 | 1 | P → L in AAA96216. Ref.2 | |
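
The feature coordinates above are 1-based and inclusive, so each length is end minus start plus one. The sketch below re-checks the listed lengths and confirms that the twelve disulfide-bonded cysteines all fall inside the NC1 domain, as the post-translational modification comment states. The variable and function names are illustrative and the values are transcribed by hand from this table.

```python
# (start, end, listed length), transcribed from the feature table.
FEATURES = {
    "Signal peptide": (1, 26, 26),
    "Chain": (27, 1758, 1732),
    "Collagen IV NC1": (1531, 1754, 224),
    "7S domain": (27, 42, 16),
    "Triple-helical region": (42, 1527, 1486),
    "Alternative sequence": (229, 264, 36),
}
DISULFIDES = [
    (1546, 1635), (1579, 1632), (1591, 1597),
    (1654, 1750), (1688, 1747), (1700, 1707),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


# Features whose listed length disagrees with their coordinates.
mismatches = [
    name for name, (start, end, listed) in FEATURES.items()
    if span_length(start, end) != listed
]

# Cysteine positions taking part in disulfide bonds, and any outside NC1.
nc1_start, nc1_end, _ = FEATURES["Collagen IV NC1"]
cysteines = sorted(pos for bond in DISULFIDES for pos in bond)
outside_nc1 = [pos for pos in cysteines if not nc1_start <= pos <= nc1_end]

print(mismatches)      # []
print(len(cysteines))  # 12
print(outside_nc1)     # []
```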
Sequences
References
| [1] | "Genetic identification, sequence, and alternative splicing of the Caenorhabditis elegans alpha 2(IV) collagen gene." Sibley M.H., Johnson J.J., Mello C.C., Kramer J.M. J. Cell Biol. 123:255-264(1993) [PubMed: 7691828] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], ALTERNATIVE SPLICING, FUNCTION. Strain: Bristol N2. |
| [2] | "Genome sequence of the nematode C. elegans: a platform for investigating biology." The C. elegans sequencing consortium. Science 282:2012-2018(1998) [PubMed: 9851916] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], ALTERNATIVE SPLICING. Strain: Bristol N2. |
| [3] | "The two Caenorhabditis elegans basement membrane (type IV) collagen genes are located on separate chromosomes." Guo X., Kramer J.M. J. Biol. Chem. 264:17574-17582(1989) [PubMed: 2793871] Cited for: PRELIMINARY NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1495-1758. Strain: Bristol N2. |
| [4] | "Mutations in the alpha 2(IV) basement membrane collagen gene of Caenorhabditis elegans produce phenotypes of differing severities." Sibley M.H., Graham P.L., von Mende N., Kramer J.M. EMBO J. 13:3278-3285(1994) [PubMed: 8045258] Cited for: MUTAGENESIS. |
Cross-references
Sequence databases | |
|---|---|
| Z22964 Genomic DNA. Translation: CAA80536.1. Z22964 Genomic DNA. Translation: CAA80537.1. U22327 Genomic DNA. Translation: AAA64312.1. Sequence problems. U53342 Genomic DNA. Translation: AAA96215.1. U53342 Genomic DNA. Translation: AAA96216.1. J05066 Genomic DNA. Translation: AAA27989.1. | |
| PIR | A34476. T29350. T29351. |
| RefSeq | NP_510663.1. NP_510664.1. |
| UniGene | Cel.17195 |
3D structure databases | |
| HSSP | HSSP built from PDB template 1LI1 based on UniProtKB P08572. |
| SMR | P17140. Positions 1531-1753. |
Protein-protein interaction databases | |
| STRING | P17140. |
Proteomic databases | |
| PRIDE | P17140. |
Genome annotation databases | |
| Ensembl | F01G12.5a; F01G12.5a; F01G12.5; Caenorhabditis elegans. F01G12.5b.1; F01G12.5b.1; F01G12.5; Caenorhabditis elegans. F01G12.5b.2; F01G12.5b.2; F01G12.5; Caenorhabditis elegans. |
| GeneID | 181708. |
| KEGG | cel:F01G12.5. |
| UCSC | F01G12.5b.1. C. elegans. |
Organism-specific databases | |
| WormBase | WBGene00002280. let-2. |
| WormPep | F01G12.5a. CE04334. F01G12.5b. CE04335. |
Gene expression databases | |
| ArrayExpress | P17140. |
Family and domain databases | |
| InterPro | IPR008160. Collagen. IPR001442. Procollagn4_C. |
| Gene3D | G3DSA:2.170.240.10. Procollagn4_C. 1 hit. |
| Pfam | PF01413. C4. 2 hits. PF01391. Collagen. 22 hits. |
| ProDom | PD000007. Clg_helix. 14 hits. PD003923. Procollagn4_C. 2 hits. PD003992. XGLTT_domain. 2 hits. |
| SMART | SM00111. C4. 2 hits. |
| PROSITE | PS51403. NC1_IV. 1 hit. |
Entry information
| Entry name | CO4A2_CAEEL |
| Accession | Primary (citable) accession number: P17140. Secondary accession number(s): Q19098, Q19099 |
| Entry history | |
| Entry status | Reviewed (UniProtKB/Swiss-Prot) |
| Annotation project | Caenorhabditis annotation project |
Relevant documents
| Caenorhabditis elegans | Caenorhabditis elegans: entries, gene names and cross-references to WormPep |
| SIMILARITY comments | Index of protein domains and families |
