UniProtKB - P17139 (CO4A1_CAEEL)
Collagen alpha-1(IV) chain
emb-9
Function
Collagen type IV is specific for basement membranes (Probable). Required to restrict presynaptic growth at the neuromuscular junctions (NMJ) in late larval stage and in adult motor neurons (PubMed:25080592).
May play a role in axon regeneration in embryos following injury in D-type motor neurons (PubMed:27984580).
GO - Molecular function
- extracellular matrix structural constituent Source: GO_Central
GO - Biological process
- collagen-activated tyrosine kinase receptor signaling pathway Source: GO_Central
- embryo development ending in birth or egg hatching Source: WormBase
- extracellular matrix organization Source: WormBase
- muscle organ development Source: WormBase
- nematode larval development Source: WormBase
- positive regulation of axon regeneration Source: UniProtKB
- positive regulation of protein secretion Source: WormBase
Enzyme and pathway databases
Reactome | R-CEL-1442490, Collagen degradation; R-CEL-1474244, Extracellular matrix organization; R-CEL-186797, Signaling by PDGF; R-CEL-2022090, Assembly of collagen fibrils and other multimeric structures; R-CEL-216083, Integrin cell surface interactions; R-CEL-2243919, Crosslinking of collagen fibrils; R-CEL-3000157, Laminin interactions; R-CEL-419037, NCAM1 interactions |
SignaLink | P17139 |
Names & Taxonomy
Protein names | Recommended name: Collagen alpha-1(IV) chain |
Gene names | ORF Names: K04H4.1 |
Organism | Caenorhabditis elegans |
Taxonomic identifier | 6239 [NCBI] |
Taxonomic lineage | Eukaryota › Metazoa › Ecdysozoa › Nematoda › Chromadorea › Rhabditida › Rhabditina › Rhabditomorpha › Rhabditoidea › Rhabditidae › Peloderinae › Caenorhabditis |
Organism-specific databases
WormBase | K04H4.1a; CE48054; WBGene00001263; emb-9. K04H4.1b; CE36390; WBGene00001263; emb-9 |
Subcellular location
Extracellular region or secreted
- extracellular space Source: GO_Central
Other locations
- basement membrane Source: WormBase
- collagen type IV trimer Source: GO_Central
- extracellular matrix Source: GO_Central
Keywords - Cellular component
Basement membrane, Extracellular matrix, Secreted
Pathology & Biotech
Disruption phenotype
Mutagenesis
Feature key | Position(s) | Description | Length |
---|---|---|---|
Mutagenesis | 141 | P → L in ju1197; suppresses the lethal phenotypes of the pxn-2 tm3464 mutant. 1 Publication | 1 |
Mutagenesis | 213 | G → E in b189; temperature-sensitive mutant. At the subrestrictive temperature of 22 degrees Celsius, causes the formation of ectopic presynaptic boutons on the ventral cord axons. Enhances the lethality of the pxn-2 ju436 mutant at 20 degrees Celsius. 2 Publications | 1 |
Mutagenesis | 258 | G → E in xd51; formation of ectopic presynaptic boutons on the ventral cord axons associated with a disruption of synapse basement membrane. 1 Publication | 1 |
Mutagenesis | 403 | G → E in allele g34; temperature-sensitive lethality during late embryogenesis. 1 Publication | 1 |
Mutagenesis | 409 | G → E in allele g23/hc70; temperature-sensitive lethality during late embryogenesis. 1 Publication | 1 |
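A substitution listed above can be checked against the canonical sequence before it is applied. The following Python sketch does this for the ju1197 allele, using only residues 101 to 150 as displayed in the sequence section of this entry. The helper name `apply_substitution` and the segment constants are illustrative assumptions, not part of any UniProt tooling.

```python
# Residues 101-150 of the canonical sequence, copied from this entry's sequence display.
SEGMENT_START = 101
SEGMENT = "SPGHPGLQGLDGLPGLKGEEGIPGCNGTDGFPGMPGLAGPPGQSGQNGNP"


def apply_substitution(segment, segment_start, position, wild_type, mutant):
    """Return the segment with one residue replaced, after checking the wild type.

    `position` is the 1-based position used throughout the entry.
    """
    index = position - segment_start  # convert entry position to 0-based index
    if not 0 <= index < len(segment):
        raise ValueError(f"position {position} lies outside the segment")
    if segment[index] != wild_type:
        raise ValueError(
            f"expected {wild_type} at {position}, found {segment[index]}"
        )
    return segment[:index] + mutant + segment[index + 1:]


# ju1197: P -> L at position 141 (hypothetical usage of the helper above)
mutated = apply_substitution(SEGMENT, SEGMENT_START, 141, "P", "L")
```

Checking the wild-type residue first guards against off-by-one errors when converting between the entry's 1-based positions and string indices.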
PTM / Processing
Molecule processing
Feature key | Position(s) | Description | Length |
---|---|---|---|
Signal peptide | 1 – 20 | Sequence analysis | 20 |
Propeptide (PRO_0000005752) | 21 – ?194 | N-terminal propeptide (7S domain) | 174 |
Chain (PRO_0000005753) | ?195 – 1759 | Collagen alpha-1(IV) chain | 1565 |
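The Length column follows from the inclusive positions as end minus start plus one. The short Python sketch below recomputes the three lengths and confirms that they add up to the full 1759-residue precursor. It takes the uncertain boundary marked `?194`/`?195` at face value, which is an assumption; the dictionary and function names are illustrative.

```python
def feature_length(start, end):
    """Length of a feature given inclusive 1-based start and end positions."""
    return end - start + 1


# Positions from the molecule processing table; the 194/195 boundary is
# flagged as uncertain ("?") in the entry and is used here as-is.
features = {
    "Signal peptide": (1, 20),
    "Propeptide": (21, 194),
    "Chain": (195, 1759),
}

lengths = {name: feature_length(s, e) for name, (s, e) in features.items()}
total = sum(lengths.values())
```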
Amino acid modifications
Feature key | Position(s) | Description |
---|---|---|
Disulfide bond | 1550 ↔ 1641 | Or C-1550 with C-1638 (PROSITE-ProRule annotation) |
Disulfide bond | 1583 ↔ 1638 | Or C-1583 with C-1641 (PROSITE-ProRule annotation) |
Disulfide bond | 1595 ↔ 1601 | PROSITE-ProRule annotation |
Cross-link | 1623 | S-Lysyl-methionine sulfilimine (Met-Lys) (interchain with K-1741) (By similarity) |
Disulfide bond | 1660 ↔ 1755 | Or C-1660 with C-1752 (PROSITE-ProRule annotation) |
Disulfide bond | 1694 ↔ 1752 | Or C-1694 with C-1755 (PROSITE-ProRule annotation) |
Disulfide bond | 1706 ↔ 1712 | PROSITE-ProRule annotation |
Cross-link | 1741 | S-Lysyl-methionine sulfilimine (Lys-Met) (interchain with M-1623) (By similarity) |
Post-translational modification
Keywords - PTM
Disulfide bond, Hydroxylation
Proteomic databases
EPD | P17139 |
PaxDb | P17139 |
PeptideAtlas | P17139 |
Expression
Gene expression databases
Bgee | WBGene00001263, Expressed in multi-cellular organism and 5 other tissues |
Interaction
Subunit structure
Trimers of two alpha 1(IV) and one alpha 2(IV) chain. Type IV collagen forms a mesh-like network linked through intermolecular interactions between 7S domains and between NC1 domains.
Protein-protein interaction databases
BioGRID | 41512, 6 interactors |
IntAct | P17139, 7 interactors |
STRING | 6239.K04H4.1a |
Family & Domains
Domains and Repeats
Feature key | Position(s) | Description | Length |
---|---|---|---|
Domain | 1535 – 1759 | Collagen IV NC1 (PROSITE-ProRule annotation) | 225 |
Region
Feature key | Position(s) | Description | Length |
---|---|---|---|
Region | 51 – 245 | Disordered (Sequence analysis) | 195 |
Region | 195 – 1530 | Triple-helical region | 1336 |
Region | 269 – 415 | Disordered (Sequence analysis) | 147 |
Region | 548 – 596 | Disordered (Sequence analysis) | 49 |
Region | 618 – 650 | Disordered (Sequence analysis) | 33 |
Region | 666 – 720 | Disordered (Sequence analysis) | 55 |
Region | 787 – 1522 | Disordered (Sequence analysis) | 736 |
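Several of the predicted disordered regions fall inside the triple-helical region. As a rough illustration, the Python sketch below computes how many triple-helical residues are also annotated as disordered, using inclusive interval overlap. The function and variable names are assumptions made for this example, and summing the overlaps relies on the disordered regions not overlapping one another, which holds for the positions listed here.

```python
def overlap(a, b):
    """Number of positions shared by two inclusive 1-based intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


TRIPLE_HELIX = (195, 1530)
DISORDERED = [
    (51, 245),
    (269, 415),
    (548, 596),
    (618, 650),
    (666, 720),
    (787, 1522),
]

# Residues annotated as both triple-helical and disordered.
covered = sum(overlap(region, TRIPLE_HELIX) for region in DISORDERED)
helix_length = TRIPLE_HELIX[1] - TRIPLE_HELIX[0] + 1
fraction = covered / helix_length
```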
Compositional bias
Feature key | Position(s) | Description | Length |
---|---|---|---|
Compositional bias | 300 – 314 | Basic and acidic residues (Sequence analysis) | 15 |
Domain
Sequence similarities
Keywords - Domain
Collagen, Repeat, Signal
Phylogenomic databases
eggNOG | KOG3544, Eukaryota |
GeneTree | ENSGT00940000153991 |
InParanoid | P17139 |
OMA | SCLMRFT |
OrthoDB | 63831at2759 |
PhylomeDB | P17139 |
Family and domain databases
Gene3D | 2.170.240.10, 1 hit |
InterPro | IPR008160, Collagen; IPR001442, Collagen_IV_NC; IPR036954, Collagen_IV_NC_sf; IPR016187, CTDL_fold |
Pfam | PF01413, C4, 2 hits; PF01391, Collagen, 15 hits |
SMART | SM00111, C4, 2 hits |
SUPFAM | SSF56436, SSF56436, 2 hits |
PROSITE | PS51403, NC1_IV, 1 hit |
Sequences (2)
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
This entry describes 2 isoforms produced by alternative splicing. This isoform has been chosen as the canonical sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
10 20 30 40 50
MSRLSLLGLT AAVVLLSSFC QDRIHVDAAA ACKGCAPPCV CPGTKGERGN
60 70 80 90 100
PGFGGEPGHP GAPGQDGPEG APGAPGMFGA EGDFGDMGSK GARGDRGLPG
110 120 130 140 150
SPGHPGLQGL DGLPGLKGEE GIPGCNGTDG FPGMPGLAGP PGQSGQNGNP
160 170 180 190 200
GRPGLSGPPG EGGVNSQGRK GVKGESGRSG VPGLPGNSGY PGLKGAKGDP
210 220 230 240 250
GPYGLPGFPG VSGLKGRMGV RTSGVKGEKG LPGPPGPPGQ PGSYPWASKP
260 270 280 290 300
IEMEVLQGPV GPAGVKGEKG RDGPVGPPGM LGLDGPPGYP GLKGQKGDLG
310 320 330 340 350
DAGQRGKRGK DGVPGNYGEK GSQGEQGLGG TPGYPGTKGG AGEPGYPGRP
360 370 380 390 400
GFEGDCGPEG PLGEGTGEAG PHGAQGFDGV QGGKGLPGHD GLPGPVGPRG
410 420 430 440 450
PVGAPGAPGQ PGIDGMPGYT EKGDRGEDGY PGFAGEPGLP GEPGDCGYPG
460 470 480 490 500
EDGLPGYDIQ GPPGLDGQSG RDGFPGIPGD IGDPGYSGEK GFPGTGVNKV
510 520 530 540 550
GPPGMTGLPG EPGMPGRIGV DGYPGPPGNN GERGEDCGYC PDGVPGNAGD
560 570 580 590 600
PGFPGMNGYP GPPGPNGDHG DCGMPGAPGK PGSAGSDGLS GSPGLPGIPG
610 620 630 640 650
YPGMKGEAGE IVGPMENPAG IPGLKGDHGL PGLPGRPGSD GLPGYPGGPG
660 670 680 690 700
QNGFPGLQGE PGLAGIDGKR GRQGSLGIPG LQGPPGDSFP GQPGTPGYKG
710 720 730 740 750
ERGADGLPGL PGAQGPRGIP APLRIVNQVA GQPGVDGMPG LPGDRGADGL
760 770 780 790 800
PGLPGPVGPD GYPGTPGERG MDGLPGFPGL HGEPGMRGQQ GEVGFNGIDG
810 820 830 840 850
DCGEPGLDGY PGAPGAPGAP GETGFGFPGQ VGYPGPNGDA GAAGLPGPDG
860 870 880 890 900
YPGRDGLPGT PGYPGEAGMN GQDGAPGQPG SRGESGLVGI DGKKGRDGTP
910 920 930 940 950
GTRGQDGGPG YSGEAGAPGQ NGMDGYPGAP GDQGYPGSPG QDGYPGPSGI
960 970 980 990 1000
PGEDGLVGFP GLRGEHGDNG LPGLEGECGE EGSRGLDGVP GYPGEHGTDG
1010 1020 1030 1040 1050
LPGLPGADGQ PGFVGEAGEP GTPGYRGQPG EPGNLAYPGQ PGDVGYPGPD
1060 1070 1080 1090 1100
GPPGLPGQDG LPGLNGERGD NGDSYPGNPG LSGQPGDAGY DGLDGVPGPP
1110 1120 1130 1140 1150
GYPGITGMPG LKGESGLPGL PGRQGNDGIP GQPGLEGECG EDGFPGSPGQ
1160 1170 1180 1190 1200
PGYPGQQGRE GEKGYPGIPG ENGLPGLRGQ DGQPGLKGEN GLDGQPGYPG
1210 1220 1230 1240 1250
SAGQLGTPGD VGYPGAPGEN GDNGNQGRDG QPGLRGESGQ PGQPGLPGRD
1260 1270 1280 1290 1300
GQPGPVGPPG DDGYPGAPGQ DIYGPPGQAG QDGYPGLDGL PGAPGLNGEP
1310 1320 1330 1340 1350
GSPGQYGMPG LPGGPGESGL PGYPGERGLP GLDGKRGHDG LPGAPGVPGV
1360 1370 1380 1390 1400
EGVPGLEGDC GEDGYPGAPG APGSNGYPGE RGLPGVPGQQ GRSGDNGYPG
1410 1420 1430 1440 1450
APGQPGIKGP RGDDGFPGRD GLDGLPGRPG REGLPGPMAM AVRNPPGQPG
1460 1470 1480 1490 1500
ENGYPGEKGY PGLPGDNGLS GPPGKAGYPG APGTDGYPGP PGLSGMPGHG
1510 1520 1530 1540 1550
GDQGFQGAAG RTGNPGLPGT PGYPGSPGGW APSRGFTFAK HSQTTAVPQC
1560 1570 1580 1590 1600
PPGASQLWEG YSLLYVQGNG RASGQDLGQP GSCLSKFNTM PFMFCNMNSV
1610 1620 1630 1640 1650
CHVSSRNDYS FWLSTDEPMT PMMNPVTGTA IRPYISRCAV CEVPTQIIAV
1660 1670 1680 1690 1700
HSQDTSVPQC PQGWSGMWTG YSFVMHTAAG AEGTGQSLQS PGSCLEEFRA
1710 1720 1730 1740 1750
VPFIECHGRG TCNYYATNHG FWLSIVDQDK QFRKPMSQTL KAGGLKDRVS
RCQVCLKNR
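The display format above interleaves position rulers with blocks of ten residues. A minimal Python sketch for turning such text into a plain residue string follows. It uses only the first two rows of the sequence as an excerpt, and the function name `parse_display_block` is a hypothetical choice for this example.

```python
def parse_display_block(text):
    """Strip position rulers and spaces from a UniProt-style sequence display."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines consist only of position numbers; skip them and blank lines.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the displayed sequence, copied from this entry.
EXCERPT = """\
10 20 30 40 50
MSRLSLLGLT AAVVLLSSFC QDRIHVDAAA ACKGCAPPCV CPGTKGERGN
60 70 80 90 100
PGFGGEPGHP GAPGQDGPEG APGAPGMFGA EGDFGDMGSK GARGDRGLPG
"""

sequence = parse_display_block(EXCERPT)
signal_peptide = sequence[:20]  # positions 1-20 per the molecule processing table
```

Applied to the full block, the same function should yield a string whose length matches the 1759 residues stated for the chain end position.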
Experimental Info
Feature key | Position(s) | Description | Length |
---|---|---|---|
Sequence conflict | 302 – 305 | AGQR → LDN in CAA40299 (PubMed:1996137). | 4 |
Sequence conflict | 769 | R → P in CAA40299 (PubMed:1996137). | 1 |
Sequence conflict | 831 | V → D in CAA40299 (PubMed:1996137). | 1 |
Sequence conflict | 1515 | P → Q in AAB59179 (PubMed:2793871). | 1 |
Sequence conflict | 1723 | L → P in CAA40299 (PubMed:1996137). | 1 |
Sequence conflict | 1723 | L → P in AAB59179 (PubMed:2793871). | 1 |
Alternative sequence
Feature key | Position(s) | Description | Length |
---|---|---|---|
Alternative sequence (VSP_011573) | 502 – 758 | Missing in isoform b. | 257 |
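Isoform b can be derived from the canonical sequence by deleting the inclusive span 502 to 758. The Python sketch below shows the index arithmetic on a stand-in list of position numbers rather than the real residues, so the junction is easy to inspect. The helper name `delete_span` is an assumption for illustration.

```python
def delete_span(sequence, start, end):
    """Remove the inclusive 1-based span [start, end] from a sequence."""
    return sequence[:start - 1] + sequence[end:]


CANONICAL_LENGTH = 1759

# Stand-in for the canonical sequence: each element is its own 1-based position.
positions = list(range(1, CANONICAL_LENGTH + 1))

isoform_b_positions = delete_span(positions, 502, 758)
isoform_b_length = len(isoform_b_positions)
```

With this stand-in, canonical position 501 is followed directly by position 759 in the result, and the length drops by the 257 residues given in the table. The same function works unchanged on a residue string.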
Sequence databases
EMBL / GenBank / DDBJ | X56979 Genomic DNA, Translation: CAA40299.1; BX284603 Genomic DNA, Translation: CAA81584.5; BX284603 Genomic DNA, Translation: CAE52901.2; J05067 Genomic DNA, Translation: AAB59179.1 |
PIR | S40991 |
RefSeq | NP_001022662.2, NM_001027491.4 [P17139-1]; NP_001022663.1, NM_001027492.4 [P17139-2] |
Genome annotation databases
EnsemblMetazoa | K04H4.1a.1; K04H4.1a.1; WBGene00001263 [P17139-1]. K04H4.1b.1; K04H4.1b.1; WBGene00001263 [P17139-2] |
GeneID | 176314 |
KEGG | cel:CELE_K04H4.1 |
UCSC | K04H4.1b, c. elegans [P17139-1] |
Keywords - Coding sequence diversity
Alternative splicing
Cross-references
3D structure databases
SMR | P17139 |
ModBase | Search... |
Organism-specific databases
CTD | 176314 |
Miscellaneous databases
PRO | PR:P17139 |
Family and domain databases
MobiDB | Search... |
Entry information
Entry name | CO4A1_CAEEL |
Accession | Primary (citable) accession number: P17139; Secondary accession number(s): Q6LAD0 |
Entry history | Integrated into UniProtKB/Swiss-Prot: August 1, 1990; Last sequence update: July 24, 2013; Last modified: February 23, 2022. This is version 196 of the entry and version 5 of the sequence. |
Entry status | Reviewed (UniProtKB/Swiss-Prot) |
Annotation program | Caenorhabditis annotation project |
Miscellaneous
Keywords - Technical term
Reference proteome
Documents
- Caenorhabditis elegans: entries, gene names and cross-references to WormBase
- SIMILARITY comments: Index of protein domains and families