Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein
Submitted name:

Collagen alpha-1(III) chain

Gene

Col3a1

Organism
Mus musculus (Mouse)
Status
Unreviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at protein leveli

Functioni

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Names & Taxonomyi

Protein namesi
Submitted name:
Collagen alpha-1(III) chainImported
Submitted name:
Procollagen, type III, alpha 1Imported
Gene namesi
Name:Col3a1Imported
ORF Names:mCG_114960Imported
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 1

Organism-specific databases

MGIiMGI:88453. Col3a1.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Extracellular matrixSAAS annotation, Secreted

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Signal peptidei1 – 2323Sequence analysisAdd
BLAST
Chaini24 – 14641441Sequence analysisPRO_5007319706Add
BLAST

Interactioni

Protein-protein interaction databases

STRINGi10090.ENSMUSP00000085192.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini33 – 9058VWFCInterPro annotationAdd
BLAST
Domaini1230 – 1464235Fibrillar collagen NC1InterPro annotationAdd
BLAST

Keywords - Domaini

CollagenImported, SignalSequence analysis

Phylogenomic databases

eggNOGiKOG3544. Eukaryota.
ENOG410XNMM. LUCA.
GeneTreeiENSGT00840000129673.
HOVERGENiHBG004933.
KOiK19720.
OMAiAEKKHVW.

Family and domain databases

InterProiIPR008160. Collagen.
IPR000885. Fib_collagen_C.
IPR001007. VWF_dom.
[Graphical view]
PfamiPF01410. COLFI. 1 hit.
PF01391. Collagen. 4 hits.
PF00093. VWC. 1 hit.
[Graphical view]
ProDomiPD002078. Fib_collagen_C. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00038. COLFI. 1 hit.
SM00214. VWC. 1 hit.
[Graphical view]
PROSITEiPS51461. NC1_FIB. 1 hit.
PS01208. VWFC_1. 1 hit.
PS50184. VWFC_2. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Q3TVI5-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MMSFVQSGTW FLLTLLHPTL ILAQQSNVDE LGCSHLGQSY ESRDVWKPEP
60 70 80 90 100
CQICVCDSGS VLCDDIICDE EPLDCPNPEI PFGECCAICP QPSTPAPVLP
110 120 130 140 150
DGHGPQGPKG DPGPPGIPGR NGDPGLPGQP GLPGPPGSPG ICESCPTGGQ
160 170 180 190 200
NYSPQFDSYD VKSGVGGMGG YPGPAGPPGP PGPPGSSGHP GSPGSPGYQG
210 220 230 240 250
PPGEPGQAGP AGPPGPPGAL GPAGPAGKDG ESGRPGRPGE RGLPGPPGIK
260 270 280 290 300
GPAGMPGFPG MKGHRGFDGR NGEKGETGAP GLKGENGLPG DNGAPGPMGP
310 320 330 340 350
RGAPGERGRP GLPGAAGARG NDGARGSDGQ PGPPGPPGTA GFPGSPGAKG
360 370 380 390 400
EVGPAGSPGS NGSPGQRGEP GPQGHAGAQG PPGPPGNNGS PGGKGEMGPA
410 420 430 440 450
GIPGAPGLIG ARGPPGPAGT NGIPGTRGPS GEPGKNGAKG EPGARGERGE
460 470 480 490 500
AGSPGIPGPK GEDGKDGSPG EPGANGLPGA AGERGPSGFR GPAGPNGIPG
510 520 530 540 550
EKGPPGERGG PGPAGPRGVA GEPGRDGTPG GPGIRGMPGS PGGPGNDGKP
560 570 580 590 600
GPPGSQGESG RPGPPGPSGP RGQPGVMGFP GPKGNDGAPG KNGERGGPGG
610 620 630 640 650
PGLPGPAGKN GETGPQGPPG PTGPAGDKGD SGPPGPQGLQ GIPGTGGPPG
660 670 680 690 700
ENGKPGEPGP KGEVGAPGAP GGKGDSGAPG ERGPPGTAGI PGARGGAGPP
710 720 730 740 750
GPEGGKGPAG PPGPPGASGS PGLQGMPGER GGPGSPGPKG EKGEPGGAGA
760 770 780 790 800
DGVPGKDGPR GPAGPIGPPG PAGQPGDKGE GGSPGLPGIA GPRGGPGERG
810 820 830 840 850
EHGPPGPAGF PGAPGQNGEP GAKGERGAPG EKGEGGPPGP AGPTGSSGPA
860 870 880 890 900
GPPGPQGVKG ERGSPGGPGT AGFPGGRGLP GPPGNNGNPG PPGPSGAPGK
910 920 930 940 950
DGPPGPAGNS GSPGNPGIAG PKGDAGQPGE KGPPGAQGPP GSPGPLGIAG
960 970 980 990 1000
LTGARGLAGP PGMPGPRGSP GPQGIKGESG KPGASGHNGE RGPPGPQGLP
1010 1020 1030 1040 1050
GQPGTAGEPG RDGNPGSDGQ PGRDGSPGGK GDRGENGSPG APGAPGHPGP
1060 1070 1080 1090 1100
PGPVGPSGKS GDRGETGPAG PSGAPGPAGA RGAPGPQGPR GDKGETGERG
1110 1120 1130 1140 1150
SNGIKGHRGF PGNPGPPGSP GAAGHQGAIG SPGPAGPRGP VGPHGPPGKD
1160 1170 1180 1190 1200
GTSGHPGPIG PPGPRGNRGE RGSEGSPGHP GQPGPPGPPG APGPCCGGGA
1210 1220 1230 1240 1250
AAIAGVGGEK SGGFSPYYGD DPMDFKINTE EIMSSLKSVN GQIESLISPD
1260 1270 1280 1290 1300
GSRKNPARNC RDLKFCHPEL KSGEYWVDPN QGCKMDAIKV FCNMETGETC
1310 1320 1330 1340 1350
INASPMTVPR KHWWTDSGAE KKHVWFGESM NGGFQFSYGP PDLPEDVVDV
1360 1370 1380 1390 1400
QLAFLRLLSS RASQNITYHC KNSIAYMDQA SGNVKKSLKL MGSNEGEFKA
1410 1420 1430 1440 1450
EGNSKFTYTV LEDGCTKHTG EWSKTVFEYQ TRKAMRLPII DIAPYDIGGP
1460
DQEFGVDIGP VCFL
Length:1,464
Mass (Da):138,943
Last modified:October 11, 2005 - v1
Checksum:i2104EC27A886090B
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AC125167 Genomic DNA. No translation available.
AK157762 mRNA. Translation: BAE34185.1.
AK160107 mRNA. Translation: BAE35633.1.
CH466589 Genomic DNA. Translation: EDK96880.1.
RefSeqiNP_034060.2. NM_009930.2.
UniGeneiMm.249555.

Genome annotation databases

EnsembliENSMUST00000087883; ENSMUSP00000085192; ENSMUSG00000026043.
GeneIDi12825.
KEGGimmu:12825.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AC125167 Genomic DNA. No translation available.
AK157762 mRNA. Translation: BAE34185.1.
AK160107 mRNA. Translation: BAE35633.1.
CH466589 Genomic DNA. Translation: EDK96880.1.
RefSeqiNP_034060.2. NM_009930.2.
UniGeneiMm.249555.

3D structure databases

ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi10090.ENSMUSP00000085192.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000087883; ENSMUSP00000085192; ENSMUSG00000026043.
GeneIDi12825.
KEGGimmu:12825.

Organism-specific databases

CTDi1281.
MGIiMGI:88453. Col3a1.

Phylogenomic databases

eggNOGiKOG3544. Eukaryota.
ENOG410XNMM. LUCA.
GeneTreeiENSGT00840000129673.
HOVERGENiHBG004933.
KOiK19720.
OMAiAEKKHVW.

Miscellaneous databases

ChiTaRSiCol3a1. mouse.
SOURCEiSearch...

Family and domain databases

InterProiIPR008160. Collagen.
IPR000885. Fib_collagen_C.
IPR001007. VWF_dom.
[Graphical view]
PfamiPF01410. COLFI. 1 hit.
PF01391. Collagen. 4 hits.
PF00093. VWC. 1 hit.
[Graphical view]
ProDomiPD002078. Fib_collagen_C. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00038. COLFI. 1 hit.
SM00214. VWC. 1 hit.
[Graphical view]
PROSITEiPS51461. NC1_FIB. 1 hit.
PS01208. VWFC_1. 1 hit.
PS50184. VWFC_2. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. "High-efficiency full-length cDNA cloning."
    Carninci P., Hayashizaki Y.
    Methods Enzymol. 303:19-44(1999) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: C57BL/6JImported.
    Tissue: ParthenogenoteImported.
  2. "Normalization and subtraction of cap-trapper-selected cDNAs to prepare full-length cDNA libraries for rapid discovery of new genes."
    Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M., Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.
    Genome Res. 10:1617-1630(2000) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: C57BL/6JImported.
    Tissue: ParthenogenoteImported.
  3. Cited for: NUCLEOTIDE SEQUENCE.
    Strain: C57BL/6JImported.
    Tissue: ParthenogenoteImported.
  4. Cited for: NUCLEOTIDE SEQUENCE.
    Strain: C57BL/6JImported.
    Tissue: ParthenogenoteImported.
  5. "Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs."
    The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I and II Team
    Nature 420:563-573(2002) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: C57BL/6JImported.
    Tissue: ParthenogenoteImported.
  6. "A comparison of whole-genome shotgun-derived mouse chromosome 16 and the human genome."
    Mural R.J., Adams M.D., Myers E.W., Smith H.O., Miklos G.L., Wides R., Halpern A., Li P.W., Sutton G.G., Nadeau J., Salzberg S.L., Holt R.A., Kodira C.D., Lu F., Chen L., Deng Z., Evangelista C.C., Gan W.
    , Heiman T.J., Li J., Li Z., Merkulov G.V., Milshina N.V., Naik A.K., Qi R., Shue B.C., Wang A., Wang J., Wang X., Yan X., Ye J., Yooseph S., Zhao Q., Zheng L., Zhu S.C., Biddick K., Bolanos R., Delcher A.L., Dew I.M., Fasulo D., Flanigan M.J., Huson D.H., Kravitz S.A., Miller J.R., Mobarry C.M., Reinert K., Remington K.A., Zhang Q., Zheng X.H., Nusskern D.R., Lai Z., Lei Y., Zhong W., Yao A., Guan P., Ji R.R., Gu Z., Wang Z.Y., Zhong F., Xiao C., Chiang C.C., Yandell M., Wortman J.R., Amanatides P.G., Hladun S.L., Pratts E.C., Johnson J.E., Dodson K.L., Woodford K.J., Evans C.A., Gropman B., Rusch D.B., Venter E., Wang M., Smith T.J., Houck J.T., Tompkins D.E., Haynes C., Jacob D., Chin S.H., Allen D.R., Dahlke C.E., Sanders R., Li K., Liu X., Levitsky A.A., Majoros W.H., Chen Q., Xia A.C., Lopez J.R., Donnelly M.T., Newman M.H., Glodek A., Kraft C.L., Nodell M., Ali F., An H.J., Baldwin-Pitts D., Beeson K.Y., Cai S., Carnes M., Carver A., Caulk P.M., Center A., Chen Y.H., Cheng M.L., Coyne M.D., Crowder M., Danaher S., Davenport L.B., Desilets R., Dietz S.M., Doup L., Dullaghan P., Ferriera S., Fosler C.R., Gire H.C., Gluecksmann A., Gocayne J.D., Gray J., Hart B., Haynes J., Hoover J., Howland T., Ibegwam C., Jalali M., Johns D., Kline L., Ma D.S., MacCawley S., Magoon A., Mann F., May D., McIntosh T.C., Mehta S., Moy L., Moy M.C., Murphy B.J., Murphy S.D., Nelson K.A., Nuri Z., Parker K.A., Prudhomme A.C., Puri V.N., Qureshi H., Raley J.C., Reardon M.S., Regier M.A., Rogers Y.H., Romblad D.L., Schutz J., Scott J.L., Scott R., Sitter C.D., Smallwood M., Sprague A.C., Stewart E., Strong R.V., Suh E., Sylvester K., Thomas R., Tint N.N., Tsonis C., Wang G., Wang G., Williams M.S., Williams S.M., Windsor S.M., Wolfe K., Wu M.M., Zaveri J., Chaturvedi K., Gabrielian A.E., Ke Z., Sun J., Subramanian G., Venter J.C., Pfannkoch C.M., Barnstead M., Stephenson L.D.
    Science 296:1661-1671(2002) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: MixedImported.
  7. Cited for: NUCLEOTIDE SEQUENCE.
    Strain: C57BL/6JImported.
    Tissue: ParthenogenoteImported.
  8. "The Transcriptional Landscape of the Mammalian Genome."
    The FANTOM Consortium, Riken Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group)
    Science 309:1559-1563(2005)
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: C57BL/6JImported.
    Tissue: ParthenogenoteImported.
  9. Cited for: NUCLEOTIDE SEQUENCE.
    Strain: C57BL/6JImported.
    Tissue: ParthenogenoteImported.
  10. Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.
    Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: MixedImported.
  11. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: C57BL/6JImported.
  12. Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
  13. Ensembl
    Submitted (AUG-2015) to UniProtKB
    Cited for: IDENTIFICATION.
    Strain: C57BL/6JImported.

Entry informationi

Entry nameiQ3TVI5_MOUSE
AccessioniPrimary (citable) accession number: Q3TVI5
Entry historyi
Integrated into UniProtKB/TrEMBL: October 11, 2005
Last sequence update: October 11, 2005
Last modified: June 8, 2016
This is version 103 of the entry and version 1 of the sequence. [Complete history]
Entry statusiUnreviewed (UniProtKB/TrEMBL)

Miscellaneousi

Keywords - Technical termi

Complete proteome, Proteomics identificationCombined sources, Reference proteomeImported

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.