Skip Header

Contribute Send feedback
Read comments (?) or add your own

A2AX52 (CO6A4_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 39. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Collagen alpha-4(VI) chain
Gene names
Name:Col6a4
Synonyms:Dvwa
OrganismMus musculus (Mouse)
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus

Protein attributes

Sequence length2309 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Collagen VI acts as a cell-binding protein By similarity.

Subunit structure

Trimers composed of three different chains: alpha-1(VI), alpha-2(VI), and alpha-3(VI) or alpha-4(VI) or alpha-5(VI) or alpha-6(VI). Ref.2

Subcellular location

Secretedextracellular spaceextracellular matrix By similarity.

Tissue specificity

In newborn, it is expressed in lung, kidney, brain, intestine, skin, sternum and, at weak level, calvaria. In adult, it is almost absent with some weak expression in ovary and very weak expression in spleen, lung, uterus and brain. Ref.2

Post-translational modification

Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains By similarity.

Sequence similarities

Belongs to the type VI collagen family.

Contains 8 VWFA domains.

Ontologies

Keywords
   Biological processCell adhesion
   Cellular componentExtracellular matrix
Secreted
   DomainCollagen
Repeat
Signal
   PTMGlycoprotein
Hydroxylation
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Biological processcell adhesion

Inferred from electronic annotation. Source: UniProtKB-KW

protein heterotrimerization

Inferred from direct assay. Source: MGI

   Cellular componentcollagen

Inferred from electronic annotation. Source: UniProtKB-KW

protein complex

Inferred from direct assay. Source: MGI

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 2222 Potential
Chain23 – 23092287Collagen alpha-4(VI) chain
PRO_5000214197

Regions

Domain34 – 206173VWFA 1
Domain235 – 413179VWFA 2
Domain430 – 653224VWFA 3
Domain634 – 811178VWFA 4
Domain849 – 1018170VWFA 5
Domain1030 – 1199170VWFA 6
Domain1776 – 1957182VWFA 7
Domain1982 – 2187206VWFA 8
Region21 – 14101390Nonhelical region
Region1411 – 1744334Triple-helical region
Region1745 – 2309565Nonhelical region
Motif1527 – 15293Cell attachment site Potential
Motif2208 – 22103Cell attachment site Potential

Amino acid modifications

Glycosylation1881N-linked (GlcNAc...) Potential
Glycosylation7541N-linked (GlcNAc...) Potential
Glycosylation11141N-linked (GlcNAc...) Potential

Experimental info

Sequence conflict771S → G in BAF95091. Ref.1
Sequence conflict1471T → A in BAF95091. Ref.1
Sequence conflict2151G → A in BAF95091. Ref.1
Sequence conflict2231R → Q in BAF95091. Ref.1
Sequence conflict2631S → L in BAF95091. Ref.1
Sequence conflict3861N → S in BAF95091. Ref.1
Sequence conflict4251T → N in BAF95091. Ref.1
Sequence conflict4431Q → P in BAF95091. Ref.1
Sequence conflict4681Q → R in BAF95091. Ref.1
Sequence conflict6461R → G in BAF95091. Ref.1
Sequence conflict6991R → S in BAF95091. Ref.1
Sequence conflict7071R → Q in BAF95091. Ref.1
Sequence conflict7131A → T in BAF95091. Ref.1
Sequence conflict7471I → M in BAF95091. Ref.1
Sequence conflict7741T → I in BAF95091. Ref.1
Sequence conflict8221G → R in BAF95091. Ref.1
Sequence conflict9221V → E in BAF95091. Ref.1
Sequence conflict9451I → V in BAF95091. Ref.1
Sequence conflict10791Q → R in BAF95091. Ref.1
Sequence conflict1281 – 12833DET → NEI in BAF95091. Ref.1
Sequence conflict1353 – 13542IG → VS in BAF95091. Ref.1
Sequence conflict14341P → L in BAF95091. Ref.1
Sequence conflict16291L → P in BAF95091. Ref.1
Sequence conflict17031R → H in BAF95091. Ref.1
Sequence conflict17751T → M in BAF95091. Ref.1
Sequence conflict17801T → A in BAF95091. Ref.1
Sequence conflict17881A → S in BAF95091. Ref.1
Sequence conflict18091C → S in BAF95091. Ref.1
Sequence conflict19781K → D in BAF95091. Ref.1
Sequence conflict22221F → L in BAF95091. Ref.1

Sequences

Sequence LengthMass (Da)Tools
A2AX52 [UniParc].

Last modified May 20, 2008. Version 2.
Checksum: 31F0282FC77B0DE8

FASTA2,309250,798
        10         20         30         40         50         60 
MGTWKTFWLI ISLAAGLGFV KSQRIVCREA SVGDIVFLVH NSINPQHAHS VRNFLYILAN 

        70         80         90        100        110        120 
SLQVGRDNIR VGLAQYSDTP TSEFLLSVYH RKGDVLKHIR GLQFKPGGNR MGQALQFILE 

       130        140        150        160        170        180 
HHFREGAGSR ASQGVPQVAV VVSSGLTEDH IREPAEALRR AGILVYAIGV KDASQAELRE 

       190        200        210        220        230        240 
ISSSPKDNFT FFVPNFPGLP GLAQKLRPEL CSTLGKAAQY TERESPACSE ASPADIVFLV 

       250        260        270        280        290        300 
DSSTSIGLQN FQKVKHFLHS VVSGLDVRSD QVQVGLVQYS DNIYPAFPLK QSSLKSAVLD 

       310        320        330        340        350        360 
RIRNLPYSMG GTSTGSALEF IRANSLTEMS GSRAKDGVPQ IVVLVTDGES SDEVQDVADQ 

       370        380        390        400        410        420 
LKRDGVFVFV VGINIQDVQE LQKIANEPFE EFLFTTENFS ILQALSGTLL QALCSTVERQ 

       430        440        450        460        470        480 
MKKSTKTYAD VVFLIDTSQG TSQASFQWMQ NFISRIIGIL EVGQDKYQIG LAQYSDQGHT 

       490        500        510        520        530        540 
EFLFNTHKTR NEMVAHIHEL LVFQGGSRKT GQGLRFLHRT FFQEAAGSRL LQGVPQYVVV 

       550        560        570        580        590        600 
ITSGKSEDEV GEVAQILRKR GVDIVSVGLQ DFDRAELEGI GPVVLVSDLQ GEDRIRQLML 

       610        620        630        640        650        660 
DVNMFIQGSP KPPRVMTDVA KDAVEECLVP VPADLVFLVE DFSSARQPNF QRVVHFLTTT 

       670        680        690        700        710        720 
VHSLNIHPDT TRVSLVFYSE KPRLEFSLDM YQSAAQVLRH LDRLTFRARR GRAKAGAALD 

       730        740        750        760        770        780 
FLRKEVFLPE KGSRPHRGVQ QIAVVIIESP SLDNVSTPAS YLRRAGVTIY AAGTQPASES 

       790        800        810        820        830        840 
KDLEKIVTYP PWKHAIRLES FLQLSVVGNK LKKKLCPEML SGMPPLMSFI PESTRQSTQE 

       850        860        870        880        890        900 
GCESVEKADI YFLIDGSGSI KPNDFIEMKD FMKEVIKMFH IGPDRVRFGV VQYSDKIISQ 

       910        920        930        940        950        960 
FFLTQYASMA GLSAAIDNIQ QVGGGTTTGK ALSKMVPVFQ NTARIDVARY LIVITDGQST 

       970        980        990       1000       1010       1020 
DPVAEAAQGL RDIGVNIYAI GVRDANTTEL EEIASKKMFF IYEFDSLKSI HQEVIRDICS 

      1030       1040       1050       1060       1070       1080 
SENCKSQKAD IIFLIDGSES IAPKDFEKMK DFMERMVNQS NIGADEIQIG LLQFSSNPQE 

      1090       1100       1110       1120       1130       1140 
EFRLNRYSSK VDMCRAILSV QQMSDGTHTG KALNFTLPFF DSSRGGRPRV HQYLIVITDG 

      1150       1160       1170       1180       1190       1200 
VSQDNVAPPA KALRDRNIII FAIGVGNVQR AQLLEITNDQ DKVFQEENFE SLQSLEKEIL 

      1210       1220       1230       1240       1250       1260 
SEVCSSQGCN IDLSVGVDTS TSSERAQQEL RRLLPELMQQ LAFLSNISCE APGQMEPRFR 

      1270       1280       1290       1300       1310       1320 
YVVPGSSDQP VFDSGFEKYS DETIQKFLVH QGSVNNRMDV DFLQSLGETA IHLSLAKVKV 

      1330       1340       1350       1360       1370       1380 
LLVFTDGLDE DLERLRRTSE FLRSRGLSGL LLIGLGGAHK LEELQELEFG RGFAYRQPLS 

      1390       1400       1410       1420       1430       1440 
SSLPSLPSVL LKQLDTIVER TCCNMYAKCY GDDGIRGEPG SRGEQGERGL DGLPGHPGEE 

      1450       1460       1470       1480       1490       1500 
GDHGQRGPRG LPGLRGEEGC PGVRGPKGAR GFSGEKGNPG EEGVGGLDGE QGDRGAAGPS 

      1510       1520       1530       1540       1550       1560 
GEKGSSGSRG LTGLPGPAGP RGEPGLRGDP GDPGIDNLIQ GPKGEKGRRG HQGSPGFHGP 

      1570       1580       1590       1600       1610       1620 
LGEAGSVGPR GSLGRHGLPG LKGVLGETGE LGSRGEPGHP GPQGPRGRQG PPGFFGQKGD 

      1630       1640       1650       1660       1670       1680 
PGTQGNPGLP GPSGSKGPDG PRGLKGEVGP AGERGPRGQQ GPRGQPGLFG PDGHGYPGRK 

      1690       1700       1710       1720       1730       1740 
GRKGEPGFPG YPGVQGEDGN PGRGGEKGAK GIRGKRGNSG FPGLAGTPGD QGPPGKMGTK 

      1750       1760       1770       1780       1790       1800 
GSKGLADRTP CEIVDFVRGN CPCSTGISRC PAFPTEVVFT LDMSNDVAPS DFERMRNILL 

      1810       1820       1830       1840       1850       1860 
SLLMKLEMCE SNCPTGARVA IVSYNTRTDY LVRLSDHRGK AALLQAVRKI PLERSSGSRN 

      1870       1880       1890       1900       1910       1920 
LGATMRFVAR HVFKRVRSGL LVRKVAVFFQ AGRNYDTASV STATLELHAA DIATAVVTFT 

      1930       1940       1950       1960       1970       1980 
EEHNLPEAGL VDGPNEFHLF TWETEGQQDV ERLASCTLCY DKCRPALGCQ LRAPGPQKLD 

      1990       2000       2010       2020       2030       2040 
MDLVFLVDSS QGVSRDIYLG ALRLVDSVLK DLEVAAQPGT SWHGARAALL THTTPGFWPG 

      2050       2060       2070       2080       2090       2100 
VDQAPVLEYF HLTSHGHRTE MQRQIREAAS GLLQGGPALG HALEWTLENV LLTAVLPRRS 

      2110       2120       2130       2140       2150       2160 
RVLYAIVASE TSIWDREKLR TLSQEAKCKG IALFVLAVGP GVGAQELAEL AKVASAPWEQ 

      2170       2180       2190       2200       2210       2220 
HLLRLEGVSE AEVAYASRFT EAFLNLLNSG INQYPPPELV KECGGPNRGD TLLHFFTSAK 

      2230       2240       2250       2260       2270       2280 
RFSRSQSGTS AAFANDSEAL KSQGIFLGER KSRVASVALQ EALGSHGKDR ADTEDIDQET 

      2290       2300 
PAKGRHLGPT HGPCPMGPEE GECLNYVLK 

« Hide

References

« Hide 'large scale' references
[1]"Cloning and characterization of osteoarthritis-associated gene DVWA."
Nakajima M., Miyamoto Y., Ikegawa S.
Submitted (DEC-2007) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
[2]"Three novel collagen VI chains with high homology to the alpha 3 chain."
Gara S.K., Grumati P., Urciuolo A., Bonaldo P., Kobbe B., Koch M., Paulsson M., Wagener R.
J. Biol. Chem. 283:10658-10670(2008) [PubMed: 18276594] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA], SUBUNIT, TISSUE SPECIFICITY.
Strain: C57BL/6J.
Tissue: Brain and Uterus.
[3]"Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S. expand/collapse author list , Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009) [PubMed: 19468303] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AB370265 mRNA. Translation: BAF95091.1.
AM231151 mRNA. Translation: CAJ77150.1.
AM231152 mRNA. Translation: CAJ77151.1.
AM231153 mRNA. Translation: CAJ77152.1.
AC120386 Genomic DNA. No translation available.
IPIIPI00828867.
RefSeqNP_081039.2. NM_026763.2.
UniGeneMm.28854.

3D structure databases

ProteinModelPortalA2AX52.
SMRA2AX52. Positions 32-212, 225-583, 631-819, 837-1205, 1420-1454, 1681-1714.
ModBaseSearch...

Protein-protein interaction databases

STRINGA2AX52.

PTM databases

PhosphoSiteA2AX52.

Proteomic databases

PRIDEA2AX52.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENSMUST00000121963; ENSMUSP00000112472; ENSMUSG00000032572.
GeneID68553.
KEGGmmu:68553.
UCSCuc012gzq.1. mouse.

Organism-specific databases

CTD68553.
MGIMGI:1915803. Col6a4.

Phylogenomic databases

HOGENOMHBG444500.
HOVERGENHBG107742.
InParanoidA2AX52.
OrthoDBEOG476JZ9.

Gene expression databases

BgeeA2AX52.
GenevestigatorA2AX52.

Family and domain databases

InterProIPR008160. Collagen.
IPR002035. VWF_A.
[Graphical view]
KOK06238.
PfamPF01391. Collagen. 2 hits.
PF00092. VWA. 7 hits.
[Graphical view]
SMARTSM00327. VWA. 9 hits.
[Graphical view]
PROSITEPS50234. VWFA. 8 hits.
[Graphical view]
ProtoNetSearch...

Other

NextBio327440.
SOURCESearch...

Entry information

Entry nameCO6A4_MOUSE
AccessionPrimary (citable) accession number: A2AX52
Secondary accession number(s): A2AX53, A2AX54, A9CR35
Entry history
Integrated into UniProtKB/Swiss-Prot: May 20, 2008
Last sequence update: May 20, 2008
Last modified: January 25, 2012
This is version 39 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families