Protein: Collagen alpha-1(VI) chain
Gene: Col6a1
Organism: Mus musculus (Mouse)
Status: Reviewed. Annotation score: 5 out of 5. Experimental evidence at transcript level.

Function

Collagen VI acts as a cell-binding protein.

GO - Molecular function

  1. platelet-derived growth factor binding Source: MGI

GO - Biological process

  1. cell adhesion Source: UniProtKB-KW
  2. cellular response to amino acid stimulus Source: MGI
  3. endodermal cell differentiation Source: Ensembl
  4. osteoblast differentiation Source: MGI
  5. protein heterotrimerization Source: MGI

Keywords - Biological process

Cell adhesion

Enzyme and pathway databases

Reactome: REACT_280178. Signaling by PDGF.
REACT_285754. Collagen biosynthesis and modifying enzymes.
REACT_313067. Collagen degradation.
REACT_318656. Assembly of collagen fibrils and other multimeric structures.
REACT_319054. NCAM1 interactions.
REACT_319261. Integrin cell surface interactions.
REACT_354321. ECM proteoglycans.

Names & Taxonomy

Protein names
Recommended name: Collagen alpha-1(VI) chain
Gene names
Name: Col6a1
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes: UP000000589 Component: Chromosome 10

Organism-specific databases

MGI: MGI:88459. Col6a1.

Subcellular location

GO - Cellular component

  1. collagen trimer Source: UniProtKB-KW
  2. extracellular matrix Source: UniProtKB
  3. extracellular region Source: MGI
  4. extracellular vesicular exosome Source: MGI
  5. lysosomal membrane Source: MGI
  6. membrane Source: MGI
  7. proteinaceous extracellular matrix Source: MGI
  8. protein complex Source: MGI
  9. sarcolemma Source: MGI

Keywords - Cellular component

Extracellular matrix, Secreted

PTM / Processing

Molecule processing

Feature key     Position(s)  Length  Description                 Feature identifier
Signal peptide  1 – 19       19
Chain           20 – 1025    1006    Collagen alpha-1(VI) chain  PRO_0000005759

Amino acid modifications

Feature key    Position(s)  Length  Description           Evidence
Glycosylation  211 – 211    1       N-linked (GlcNAc...)  Sequence Analysis
Glycosylation  515 – 515    1       N-linked (GlcNAc...)  Sequence Analysis
Glycosylation  536 – 536    1       N-linked (GlcNAc...)  Sequence Analysis
Glycosylation  801 – 801    1       N-linked (GlcNAc...)  Sequence Analysis
Glycosylation  893 – 893    1       N-linked (GlcNAc...)  Sequence Analysis
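
The glycosylation sites above are annotated by sequence analysis rather than by experiment. As a rough illustration of what such an analysis looks for, the sketch below scans for the conventional N-linked sequon (N, then any residue except P, then S or T). The function name `find_sequons` and the choice of a 50-residue fragment (residues 201 to 250 of the sequence displayed in this entry) are assumptions made for this example; this is not the method UniProt itself uses.

```python
import re

# Conventional N-glycosylation sequon: N, any residue but P, then S or T.
# The lookahead keeps overlapping candidates from being skipped.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(fragment, first_position=1):
    """Return 1-based positions of candidate N-linked sites in a fragment.

    first_position is the entry coordinate of the fragment's first residue.
    """
    return [m.start() + first_position for m in SEQUON.finditer(fragment)]


# Residues 201-250, copied from the sequence shown in this entry.
FRAGMENT_201_250 = "IIATDHTYRRNFTAADWGHSRDAEEVISQTIDTIVDMIKNNVEQVCCSFE"

candidates = find_sequons(FRAGMENT_201_250, first_position=201)
print(candidates)
```

On this fragment the only candidate is position 211, which matches the first annotated site. A sequon match is only a prediction; it does not show that the site is glycosylated in vivo.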

Post-translational modification

Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains.
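
To make the G-X-Y statement concrete, the following sketch splits a short stretch of the triple-helical region into triplets and reports prolines that fall at the third (Y) position. The helper `y_position_prolines` is hypothetical, and the fragment (residues 256 to 270 of the displayed sequence) was chosen only because it contains no interruptions of the repeat. The output lists candidate positions; the entry does not state which prolines are actually hydroxylated.

```python
def y_position_prolines(helix, first_position):
    """Return entry positions of prolines at the Y position of G-X-Y triplets.

    helix must start on a glycine and contain an uninterrupted repeat;
    a triplet that does not start with G raises ValueError.
    """
    hits = []
    for i in range(0, len(helix) - 2, 3):
        g, _x, y = helix[i:i + 3]
        if g != "G":
            raise ValueError(f"triplet at offset {i} does not start with G")
        if y == "P":
            hits.append(first_position + i + 2)
    return hits


# Residues 256-270: GPP GPR GDP GYE GER
FRAGMENT_256_270 = "GPPGPRGDPGYEGER"

y_prolines = y_position_prolines(FRAGMENT_256_270, first_position=256)
print(y_prolines)
```

For this fragment the candidates are residues 258 and 264. The full triple-helical region contains interruptions of the repeat, so a production scan would need to re-synchronise on glycine instead of raising an error.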

Keywords - PTM

Glycoprotein, Hydroxylation

Proteomic databases

MaxQB: Q04857.
PaxDb: Q04857.
PRIDE: Q04857.

PTM databases

PhosphoSite: Q04857.

Expression

Gene expression databases

Bgee: Q04857.
CleanEx: MM_COL6A1.
ExpressionAtlas: Q04857. baseline and differential.
Genevestigator: Q04857.

Interaction

Subunit structure

Trimers composed of three different chains: alpha-1(VI), alpha-2(VI), and alpha-3(VI) or alpha-4(VI) or alpha-5(VI) or alpha-6(VI).
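
Read literally, the subunit statement describes four possible heterotrimers: alpha-1(VI) and alpha-2(VI) are always present, and the third chain is one of alpha-3(VI) to alpha-6(VI). The short sketch below enumerates them under that reading; the variable names are illustrative only.

```python
# Chains present in every trimer, per the subunit description.
FIXED_CHAINS = ("alpha-1(VI)", "alpha-2(VI)")

# Alternatives for the third chain.
THIRD_CHAIN_OPTIONS = ("alpha-3(VI)", "alpha-4(VI)", "alpha-5(VI)", "alpha-6(VI)")

trimers = [FIXED_CHAINS + (third,) for third in THIRD_CHAIN_OPTIONS]

for trimer in trimers:
    print(" / ".join(trimer))
```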

Protein-protein interaction databases

BioGrid: 198823. 1 interaction.
IntAct: Q04857. 1 interaction.
MINT: MINT-4091372.
STRING: 10090.ENSMUSP00000001147.

Structure

3D structure databases

ProteinModelPortal: Q04857.
SMR: Q04857. Positions 34-220, 613-972.
ModBase: Search...
MobiDB: Search...

Family & Domains

Domains and Repeats

Feature key  Position(s)  Length  Description  Evidence
Domain       36 – 234     199     VWFA 1       PROSITE-ProRule annotation
Domain       614 – 802    189     VWFA 2       PROSITE-ProRule annotation
Domain       826 – 1018   193     VWFA 3       PROSITE-ProRule annotation

Region

Feature key  Position(s)  Length  Description
Region       20 – 255     236     N-terminal globular domain
Region       256 – 591    336     Triple-helical region
Region       592 – 1025   434     C-terminal globular domain
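
Feature coordinates in this entry are 1-based and inclusive, so a feature's length is end minus start plus one. The sketch below checks that arithmetic for the signal peptide and the three regions, and confirms that the regions tile the mature chain (residues 20 to 1025) without gaps. The dictionary and function names are made up for this example; the coordinates are taken from the feature tables in this entry.

```python
# (start, end) pairs, 1-based inclusive, as listed in the feature tables.
FEATURES = {
    "Signal peptide": (1, 19),
    "N-terminal globular domain": (20, 255),
    "Triple-helical region": (256, 591),
    "C-terminal globular domain": (592, 1025),
}


def feature_length(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


lengths = {name: feature_length(*span) for name, span in FEATURES.items()}
chain_length = sum(n for name, n in lengths.items() if name != "Signal peptide")

for name, n in lengths.items():
    print(f"{name}: {n}")
print("Mature chain:", chain_length)
print("Precursor:", chain_length + lengths["Signal peptide"])
```

The three regions sum to 1006 residues, the length listed for the chain, and adding the 19-residue signal peptide gives the full 1,025-residue precursor.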

Motif

Feature key  Position(s)  Length  Description
Motif        261 – 263    3       Cell attachment site
Motif        441 – 443    3       Cell attachment site
Motif        477 – 479    3       Cell attachment site
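
In the sequence displayed in this entry, each of the three annotated cell attachment sites reads R-G-D, the tripeptide commonly associated with integrin-mediated attachment. The sketch below locates that tripeptide in a fragment and reports entry coordinates. The function name `find_motif` and the use of residues 251 to 300 as sample input are assumptions for illustration.

```python
def find_motif(fragment, motif="RGD", first_position=1):
    """Return (start, end) entry coordinates of every occurrence of motif."""
    spans = []
    index = fragment.find(motif)
    while index != -1:
        start = first_position + index
        spans.append((start, start + len(motif) - 1))
        # Step one residue forward so overlapping matches are not missed.
        index = fragment.find(motif, index + 1)
    return spans


# Residues 251-300, copied from the sequence shown in this entry.
FRAGMENT_251_300 = "CQAARGPPGPRGDPGYEGERGKPGLPGEKGEAGDPGRPGDLGPVGYQGMK"

rgd_sites = find_motif(FRAGMENT_251_300, first_position=251)
print(rgd_sites)
```

The single hit, 261 to 263, corresponds to the first motif row in the table above.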

Sequence similarities

Belongs to the type VI collagen family (curated).
Contains 3 VWFA domains (PROSITE-ProRule annotation).

Keywords - Domain

Collagen, Repeat, Signal

Phylogenomic databases

eggNOG: NOG256042.
GeneTree: ENSGT00760000119051.
HOGENOM: HOG000111863.
HOVERGEN: HBG095954.
InParanoid: Q04857.
KO: K06238.
OMA: VKENYAE.
OrthoDB: EOG71K628.
PhylomeDB: Q04857.
TreeFam: TF331207.

Family and domain databases

Gene3D: 3.40.50.410. 3 hits.
InterPro: IPR008160. Collagen.
IPR002035. VWF_A.
Pfam: PF01391. Collagen. 6 hits.
PF00092. VWA. 3 hits.
SMART: SM00327. VWA. 3 hits.
SUPFAM: SSF53300. SSF53300. 3 hits.
PROSITE: PS50234. VWFA. 3 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q04857-1 [UniParc]

        10         20         30         40         50
MRLAHALLPL LLQACWVATQ DIQGSKAIAF QDCPVDLFFV LDTSESVALR
        60         70         80         90        100
LKPYGALVDK VKSFTKRFID NLRDRYYRCD RNLVWNAGAL HYSDEVEIIR
       110        120        130        140        150
GLTRMPSGRD ELKASVDAVK YFGKGTYTDC AIKKGLEELL IGGSHLKENK
       160        170        180        190        200
YLIVVTDGHP LEGYKEPCGG LEDAVNEAKH LGIKVFSVAI TPDHLEPRLS
       210        220        230        240        250
IIATDHTYRR NFTAADWGHS RDAEEVISQT IDTIVDMIKN NVEQVCCSFE
       260        270        280        290        300
CQAARGPPGP RGDPGYEGER GKPGLPGEKG EAGDPGRPGD LGPVGYQGMK
       310        320        330        340        350
GEKGSRGEKG SRGPKGYKGE KGKRGIDGVD GMKGETGYPG LPGCKGSPGF
       360        370        380        390        400
DGIQGPPGPK GDAGAFGMKG EKGEAGADGE AGRPGNSGSP GDEGDPGEPG
       410        420        430        440        450
PPGEKGEAGD EGNAGPDGAP GERGGPGERG PRGTPGVRGP RGDPGEAGPQ
       460        470        480        490        500
GDQGREGPVG IPGDSGEAGP IGPKGYRGDE GPPGPEGLRG APGPVGPPGD
       510        520        530        540        550
PGLMGERGED GPPGNGTEGF PGFPGYPGNR GPPGLNGTKG YPGLKGDEGE
       560        570        580        590        600
VGDPGEDNND ISPRGVKGAK GYRGPEGPQG PPGHVGPPGP DECEILDIIM
       610        620        630        640        650
KMCSCCECTC GPIDILFVLD SSESIGLQNF EIAKDFIIKV IDRLSKDELV
       660        670        680        690        700
KFEPGQSHAG VVQYSHNQMQ EHVDMRSPNV RNAQDFKEAV KKLQWMAGGT
       710        720        730        740        750
FTGEALQYTR DRLLPPTQNN RIALVITDGR SDTQRDTTPL SVLCGADIQV
       760        770        780        790        800
VSVGIKDVFG FVAGSDQLNV ISCQGLSQGR PGISLVKENY AELLDDGFLK
       810        820        830        840        850
NITAQICIDK KCPDYTCPIT FSSPADITIL LDSSASVGSH NFETTKVFAK
       860        870        880        890        900
RLAERFLSAG RADPSQDVRV AVVQYSGQGQ QQPGRAALQF LQNYTVLASS
       910        920        930        940        950
VDSMDFINDA TDVNDALSYV TRFYREASSG ATKKRVLLFS DGNSQGATAE
       960        970        980        990       1000
AIEKAVQEAQ RAGIEIFVVV VGPQVNEPHI RVLVTGKTAE YDVAFGERHL
      1010       1020
FRVPNYQALL RGVLYQTVSR KVALG
Length: 1,025
Mass (Da): 108,489
Last modified: June 1, 1994 - v1
Checksum: 2A05DFED8771BBF7
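
The sequence is displayed in blocks of ten residues with position rulers between the rows, which is awkward to use directly. A minimal sketch of how the display could be flattened into a plain string is shown below; `parse_sequence_display` is a hypothetical helper, and only the first 100 residues are included as sample input.

```python
def parse_sequence_display(text):
    """Join a ruler-and-blocks sequence display into one residue string.

    Lines made up only of digits and spaces are treated as position rulers
    and skipped; spaces between ten-residue blocks are removed.
    """
    residues = []
    for line in text.splitlines():
        compact = line.replace(" ", "")
        if not compact or compact.isdigit():
            continue
        residues.append(compact)
    return "".join(residues)


# First two rows of the display in this entry (residues 1-100).
DISPLAY = """\
        10         20         30         40         50
MRLAHALLPL LLQACWVATQ DIQGSKAIAF QDCPVDLFFV LDTSESVALR
        60         70         80         90        100
LKPYGALVDK VKSFTKRFID NLRDRYYRCD RNLVWNAGAL HYSDEVEIIR
"""

sequence = parse_sequence_display(DISPLAY)
signal_peptide = sequence[:19]   # residues 1-19
mature_start = sequence[19]      # residue 20, first residue of the chain
print(len(sequence), signal_peptide, mature_start)
```

Python slices are 0-based and end-exclusive, so entry residues a to b correspond to `sequence[a - 1:b]`. Applied to the full display, the parsed string should have the listed length of 1,025 residues.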

Experimental Info

Feature key        Position(s)  Length  Description                               Evidence
Sequence conflict  674 – 675    2       DM → TL in CAA79152 (PubMed:8489506).    Curated
Sequence conflict  709 – 709    1       T → A in CAA79152 (PubMed:8489506).      Curated
Sequence conflict  943 – 943    1       Missing in CAA79152 (PubMed:8489506).    Curated
Sequence conflict  960 – 960    1       Q → R in CAA79152 (PubMed:8489506).      Curated
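
The conflicts above describe how the CAA79152 translation differs from the displayed sequence. The sketch below shows one way to apply such differences to a sequence string: substitutions replace a span, and a "Missing" conflict is expressed as an empty replacement. The function `apply_conflicts` and its tuple layout are assumptions for this example, and only the two conflicts that fall inside the sample fragment (residues 671 to 710) are applied.

```python
def apply_conflicts(fragment, conflicts, first_position=1):
    """Apply (start, end, expected, replacement) edits in entry coordinates.

    Edits are applied from the highest position down so that a deletion
    does not shift the coordinates of edits that are still pending.
    """
    result = fragment
    for start, end, expected, replacement in sorted(
        conflicts, key=lambda c: c[0], reverse=True
    ):
        lo = start - first_position
        hi = end - first_position + 1
        if result[lo:hi] != expected:
            raise ValueError(f"expected {expected!r} at {start}-{end}")
        result = result[:lo] + replacement + result[hi:]
    return result


# Residues 671-710, copied from the sequence shown in this entry.
FRAGMENT_671_710 = "EHVDMRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYTR"

CONFLICTS = [
    (674, 675, "DM", "TL"),  # DM -> TL
    (709, 709, "T", "A"),    # T -> A
]

variant = apply_conflicts(FRAGMENT_671_710, CONFLICTS, first_position=671)
print(variant)
```

Checking the expected residues before editing guards against off-by-one errors when the fragment offset is wrong.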

Sequence databases

EMBL / GenBank / DDBJ: X66405 mRNA. Translation: CAA47032.1.
X66406 Genomic DNA. Translation: CAA47033.1.
Z18271 mRNA. Translation: CAA79152.1.
CCDS: CCDS23952.1.
PIR: S34839.
RefSeq: NP_034063.1. NM_009933.4.
UniGene: Mm.2509.

Genome annotation databases

Ensembl: ENSMUST00000001147; ENSMUSP00000001147; ENSMUSG00000001119.
GeneID: 12833.
KEGG: mmu:12833.
UCSC: uc007fux.1. mouse.

Cross-references

Protocols and materials databases

Structural Biology Knowledgebase: Search...

Organism-specific databases

CTD: 1291.
MGI: MGI:88459. Col6a1.

Miscellaneous databases

NextBio: 282342.
PRO: Q04857.
SOURCE: Search...

Family and domain databases

ProtoNet: Search...

Publications

  1. "Murine alpha 1(VI) collagen chain. Complete amino acid sequence and identification of the gene promoter region."
    Bonaldo P., Piccolo S., Marvulli D., Volpin D., Bressan G.M.
    Matrix 13:223-233(1993)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
  2. "Cloning and sequence analysis of cDNAs encoding the alpha 1, alpha 2 and alpha 3 chains of mouse collagen VI."
    Zhang R.Z., Pan T.C., Timpl R., Chu M.-L.
    Biochem. J. 291:787-792(1993)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 442-1025.

Entry information

Entry name: CO6A1_MOUSE
Accession: Primary (citable) accession number: Q04857
Entry history
Integrated into UniProtKB/Swiss-Prot: June 1, 1994
Last sequence update: June 1, 1994
Last modified: April 1, 2015
This is version 125 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
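
The UniRef90 and UniRef50 criteria described above reduce to two thresholds each: a minimum sequence identity to the seed sequence and a minimum overlap with it. The sketch below encodes only those stated thresholds as a membership test. The function name `meets_cluster_criteria` and the representation of identity and overlap as fractions are assumptions for this example; it does not reproduce the alignment or clustering procedure that UniRef actually runs.

```python
# Minimum identity to the seed sequence, as stated for each cluster level.
MIN_IDENTITY = {"UniRef90": 0.90, "UniRef50": 0.50}

# Minimum overlap with the seed sequence, the same for both levels.
MIN_OVERLAP = 0.80


def meets_cluster_criteria(level, identity, overlap):
    """Return True if a sequence satisfies the stated thresholds for level.

    identity and overlap are fractions between 0 and 1, measured against
    the seed (longest) sequence of the cluster.
    """
    return identity >= MIN_IDENTITY[level] and overlap >= MIN_OVERLAP


print(meets_cluster_criteria("UniRef90", identity=0.93, overlap=0.85))
print(meets_cluster_criteria("UniRef90", identity=0.93, overlap=0.70))
print(meets_cluster_criteria("UniRef50", identity=0.60, overlap=0.80))
```

The second call fails on overlap alone: high identity over a short aligned stretch is not enough to join a cluster.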