Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot A2AX52 (CO6A4_MOUSE)

Last modified December 15, 2009. Version 24. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Collagen alpha-4(VI) chain
Gene names
Name: Col6a4
OrganismMus musculus (Mouse)
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMus

Protein attributes

Sequence length2309 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Collagen VI acts as a cell-binding protein By similarity.

Subunit structure

Trimers composed of three different chains: alpha-1(VI), alpha-2(VI), and alpha-3(VI) or alpha-4(VI) or alpha-5(VI) or alpha-6(VI). Ref.1

Subcellular location

Secretedextracellular spaceextracellular matrix By similarity.

Tissue specificity

In newborn, it is expressed in lung, kidney, brain, intestine, skin, sternum and, at weak level, calvaria. In adult, it is almost absent with some weak expression in ovary and very weak expression in spleen, lung, uterus and brain. Ref.1

Post-translational modification

Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains By similarity.

Miscellaneous

The human orthologous protein seems not to exist.

Sequence similarities

Belongs to the type VI collagen family.

Contains 8 VWFA domains.

Ontologies

Keywords
   Biological processCell adhesion
   Cellular componentExtracellular matrix
Secreted
   DomainCollagen
Repeat
Signal
   PTMGlycoprotein
Hydroxylation
Gene Ontology (GO)
   Biological processcell adhesion

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentproteinaceous extracellular matrix

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionprotein binding

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 2222 Potential
Chain23 – 23092287Collagen alpha-4(VI) chain
PRO_5000214197

Regions

Domain34 – 206173VWFA 1
Domain235 – 413179VWFA 2
Domain430 – 653224VWFA 3
Domain634 – 811178VWFA 4
Domain849 – 1018170VWFA 5
Domain1030 – 1199170VWFA 6
Domain1776 – 1957182VWFA 7
Domain1982 – 2187206VWFA 8
Region21 – 14101390Nonhelical region
Region1411 – 1744334Triple-helical region
Region1745 – 2309565Nonhelical region
Motif1527 – 15293Cell attachment site Potential
Motif2208 – 22103Cell attachment site Potential

Amino acid modifications

Glycosylation1881N-linked (GlcNAc...) Potential
Glycosylation7541N-linked (GlcNAc...) Potential
Glycosylation11141N-linked (GlcNAc...) Potential

Sequences

Sequence LengthMass (Da)Tools
A2AX52-1 [UniParc].

Last modified May 20, 2008. Version 2.
Checksum: 31F0282FC77B0DE8

FASTA2,309250,798
        10         20         30         40         50         60 
MGTWKTFWLI ISLAAGLGFV KSQRIVCREA SVGDIVFLVH NSINPQHAHS VRNFLYILAN 

        70         80         90        100        110        120 
SLQVGRDNIR VGLAQYSDTP TSEFLLSVYH RKGDVLKHIR GLQFKPGGNR MGQALQFILE 

       130        140        150        160        170        180 
HHFREGAGSR ASQGVPQVAV VVSSGLTEDH IREPAEALRR AGILVYAIGV KDASQAELRE 

       190        200        210        220        230        240 
ISSSPKDNFT FFVPNFPGLP GLAQKLRPEL CSTLGKAAQY TERESPACSE ASPADIVFLV 

       250        260        270        280        290        300 
DSSTSIGLQN FQKVKHFLHS VVSGLDVRSD QVQVGLVQYS DNIYPAFPLK QSSLKSAVLD 

       310        320        330        340        350        360 
RIRNLPYSMG GTSTGSALEF IRANSLTEMS GSRAKDGVPQ IVVLVTDGES SDEVQDVADQ 

       370        380        390        400        410        420 
LKRDGVFVFV VGINIQDVQE LQKIANEPFE EFLFTTENFS ILQALSGTLL QALCSTVERQ 

       430        440        450        460        470        480 
MKKSTKTYAD VVFLIDTSQG TSQASFQWMQ NFISRIIGIL EVGQDKYQIG LAQYSDQGHT 

       490        500        510        520        530        540 
EFLFNTHKTR NEMVAHIHEL LVFQGGSRKT GQGLRFLHRT FFQEAAGSRL LQGVPQYVVV 

       550        560        570        580        590        600 
ITSGKSEDEV GEVAQILRKR GVDIVSVGLQ DFDRAELEGI GPVVLVSDLQ GEDRIRQLML 

       610        620        630        640        650        660 
DVNMFIQGSP KPPRVMTDVA KDAVEECLVP VPADLVFLVE DFSSARQPNF QRVVHFLTTT 

       670        680        690        700        710        720 
VHSLNIHPDT TRVSLVFYSE KPRLEFSLDM YQSAAQVLRH LDRLTFRARR GRAKAGAALD 

       730        740        750        760        770        780 
FLRKEVFLPE KGSRPHRGVQ QIAVVIIESP SLDNVSTPAS YLRRAGVTIY AAGTQPASES 

       790        800        810        820        830        840 
KDLEKIVTYP PWKHAIRLES FLQLSVVGNK LKKKLCPEML SGMPPLMSFI PESTRQSTQE 

       850        860        870        880        890        900 
GCESVEKADI YFLIDGSGSI KPNDFIEMKD FMKEVIKMFH IGPDRVRFGV VQYSDKIISQ 

       910        920        930        940        950        960 
FFLTQYASMA GLSAAIDNIQ QVGGGTTTGK ALSKMVPVFQ NTARIDVARY LIVITDGQST 

       970        980        990       1000       1010       1020 
DPVAEAAQGL RDIGVNIYAI GVRDANTTEL EEIASKKMFF IYEFDSLKSI HQEVIRDICS 

      1030       1040       1050       1060       1070       1080 
SENCKSQKAD IIFLIDGSES IAPKDFEKMK DFMERMVNQS NIGADEIQIG LLQFSSNPQE 

      1090       1100       1110       1120       1130       1140 
EFRLNRYSSK VDMCRAILSV QQMSDGTHTG KALNFTLPFF DSSRGGRPRV HQYLIVITDG 

      1150       1160       1170       1180       1190       1200 
VSQDNVAPPA KALRDRNIII FAIGVGNVQR AQLLEITNDQ DKVFQEENFE SLQSLEKEIL 

      1210       1220       1230       1240       1250       1260 
SEVCSSQGCN IDLSVGVDTS TSSERAQQEL RRLLPELMQQ LAFLSNISCE APGQMEPRFR 

      1270       1280       1290       1300       1310       1320 
YVVPGSSDQP VFDSGFEKYS DETIQKFLVH QGSVNNRMDV DFLQSLGETA IHLSLAKVKV 

      1330       1340       1350       1360       1370       1380 
LLVFTDGLDE DLERLRRTSE FLRSRGLSGL LLIGLGGAHK LEELQELEFG RGFAYRQPLS 

      1390       1400       1410       1420       1430       1440 
SSLPSLPSVL LKQLDTIVER TCCNMYAKCY GDDGIRGEPG SRGEQGERGL DGLPGHPGEE 

      1450       1460       1470       1480       1490       1500 
GDHGQRGPRG LPGLRGEEGC PGVRGPKGAR GFSGEKGNPG EEGVGGLDGE QGDRGAAGPS 

      1510       1520       1530       1540       1550       1560 
GEKGSSGSRG LTGLPGPAGP RGEPGLRGDP GDPGIDNLIQ GPKGEKGRRG HQGSPGFHGP 

      1570       1580       1590       1600       1610       1620 
LGEAGSVGPR GSLGRHGLPG LKGVLGETGE LGSRGEPGHP GPQGPRGRQG PPGFFGQKGD 

      1630       1640       1650       1660       1670       1680 
PGTQGNPGLP GPSGSKGPDG PRGLKGEVGP AGERGPRGQQ GPRGQPGLFG PDGHGYPGRK 

      1690       1700       1710       1720       1730       1740 
GRKGEPGFPG YPGVQGEDGN PGRGGEKGAK GIRGKRGNSG FPGLAGTPGD QGPPGKMGTK 

      1750       1760       1770       1780       1790       1800 
GSKGLADRTP CEIVDFVRGN CPCSTGISRC PAFPTEVVFT LDMSNDVAPS DFERMRNILL 

      1810       1820       1830       1840       1850       1860 
SLLMKLEMCE SNCPTGARVA IVSYNTRTDY LVRLSDHRGK AALLQAVRKI PLERSSGSRN 

      1870       1880       1890       1900       1910       1920 
LGATMRFVAR HVFKRVRSGL LVRKVAVFFQ AGRNYDTASV STATLELHAA DIATAVVTFT 

      1930       1940       1950       1960       1970       1980 
EEHNLPEAGL VDGPNEFHLF TWETEGQQDV ERLASCTLCY DKCRPALGCQ LRAPGPQKLD 

      1990       2000       2010       2020       2030       2040 
MDLVFLVDSS QGVSRDIYLG ALRLVDSVLK DLEVAAQPGT SWHGARAALL THTTPGFWPG 

      2050       2060       2070       2080       2090       2100 
VDQAPVLEYF HLTSHGHRTE MQRQIREAAS GLLQGGPALG HALEWTLENV LLTAVLPRRS 

      2110       2120       2130       2140       2150       2160 
RVLYAIVASE TSIWDREKLR TLSQEAKCKG IALFVLAVGP GVGAQELAEL AKVASAPWEQ 

      2170       2180       2190       2200       2210       2220 
HLLRLEGVSE AEVAYASRFT EAFLNLLNSG INQYPPPELV KECGGPNRGD TLLHFFTSAK 

      2230       2240       2250       2260       2270       2280 
RFSRSQSGTS AAFANDSEAL KSQGIFLGER KSRVASVALQ EALGSHGKDR ADTEDIDQET 

      2290       2300 
PAKGRHLGPT HGPCPMGPEE GECLNYVLK 

« Hide

References

[1]"Three novel collagen VI chains with high homology to the alpha 3 chain."
Gara S.K., Grumati P., Urciuolo A., Bonaldo P., Kobbe B., Koch M., Paulsson M., Wagener R.
J. Biol. Chem. 283:10658-10670(2008) [PubMed: 18276594] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA], SUBUNIT, TISSUE SPECIFICITY.
Strain: C57BL/6J.
Tissue: Brain and Uterus.
+Additional computationally mapped references.

Cross-references

Sequence databases

AM231151 mRNA. Translation: CAJ77150.1.
AM231152 mRNA. Translation: CAJ77151.1.
AM231153 mRNA. Translation: CAJ77152.1.
IPIIPI00828867.
RefSeqNP_081039.2.
UniGeneMm.28854

3D structure databases

ModBaseSearch...

Proteomic databases

PRIDEA2AX52.

Genome annotation databases

EnsemblENSMUST00000121963; ENSMUSP00000112472; ENSMUSG00000032572; Mus musculus. [Genome view]
GeneID68553.
KEGGmmu:68553.

Organism-specific databases

CTD68553.
MGIMGI:1915803. 1110001D15Rik.

Phylogenomic databases

HOGENOMHBG444500.
HOVERGENA2AX52.
InParanoidA2AX52.

Gene expression databases

BgeeA2AX52.
GenevestigatorA2AX52.

Family and domain databases

InterProIPR008160. Collagen.
IPR002035. VWF_A.
[Graphical view]
PfamPF01391. Collagen. 5 hits.
PF00092. VWA. 8 hits.
[Graphical view]
SMARTSM00327. VWA. 9 hits.
[Graphical view]
PROSITEPS50234. VWFA. 8 hits.
[Graphical view]
ProtoNetSearch...

Other Resources

NextBio327440.
SOURCESearch...

Entry information

Entry nameCO6A4_MOUSE
AccessionPrimary (citable) accession number: A2AX52
Secondary accession number(s): A2AX53, A2AX54
Entry history
Integrated into UniProtKB/Swiss-Prot: May 20, 2008
Last sequence update: May 20, 2008
Last modified: December 15, 2009
This is version 24 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents