P13941 (CO3A1_RAT) Reviewed, UniProtKB/Swiss-Prot

Last modified February 19, 2014. Version 118.

Names and origin

Protein names: Recommended name: Collagen alpha-1(III) chain
Gene names: Name: Col3a1
Organism: Rattus norvegicus (Rat) [Reference proteome]
Taxonomic identifier: 10116 [NCBI]
Taxonomic lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea; Muridae; Murinae; Rattus

Protein attributes

Sequence length: 1463 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

Collagen type III occurs in most soft connective tissues along with type I collagen. Involved in regulation of cortical development. It is the major ligand of Gpr56 in the developing brain; binding to Gpr56 inhibits neuronal migration and activates the RhoA pathway by coupling Gpr56 to Gna13 and possibly Gna12 (by similarity).

Subunit structure

Trimers of identical alpha 1(III) chains. The chains are linked to each other by interchain disulfide bonds. Trimers are also cross-linked via hydroxylysines.

Subcellular location

Secreted, extracellular space, extracellular matrix (by similarity).

Domain

The C-terminal propeptide, also known as the COLFI domain, has crucial roles in tissue growth and repair by controlling both the intracellular assembly of procollagen molecules and the extracellular assembly of collagen fibrils. It binds a calcium ion which is essential for its function (by similarity).

Post-translational modification

O-glycosylated (by similarity).

Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains.
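
As a rough illustration of the G-X-Y rule, the Python sketch below reads a fragment in triplet frame and lists the prolines that fall in the third (Y) position. The function name `y_position_prolines` is hypothetical. The fragment is residues 170-190 of the sequence displayed in this entry, and the sketch assumes the triplet frame starts at residue 170, the first residue of the annotated triple-helical region. It only lists candidate sites: the entry says hydroxylation occurs in some or all of the chains, so nothing here confirms that a given proline is modified.

```python
def y_position_prolines(fragment, start):
    """Return 1-based positions of prolines in the Y slot of G-X-Y triplets.

    `fragment` is read in frame from its first residue; `start` is the
    sequence position of that first residue.
    """
    hits = []
    for i in range(0, len(fragment) - 2, 3):
        g, _x, y = fragment[i:i + 3]
        if g == "G" and y == "P":
            hits.append(start + i + 2)
    return hits


# Residues 170-190 of the displayed sequence (start of the triple helix).
fragment = "GYPGPAGPPGPPGPPGSSGHP"
print(y_position_prolines(fragment, 170))  # [172, 178, 181, 184, 190]
```

Triplets that do not begin with glycine are skipped rather than treated as errors, so the same function can be pointed at a longer stretch as long as the frame is known.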

Sequence similarities

Belongs to the fibrillar collagen family.

Contains 1 fibrillar collagen NC1 domain.

Contains 1 VWFC domain.

Ontologies

Keywords
   Cellular component: Extracellular matrix; Secreted
   Domain: Collagen; Repeat; Signal
   Ligand: Calcium; Metal-binding
   PTM: Disulfide bond; Glycoprotein; Hydroxylation
   Technical term: Complete proteome; Reference proteome
Gene Ontology (GO)
   Biological_process
      aging: Inferred from expression pattern (PubMed 20610530). Source: RGD
      blood vessel development: Inferred from electronic annotation. Source: Ensembl
      cell-matrix adhesion: Inferred from electronic annotation. Source: Ensembl
      cellular response to amino acid stimulus: Inferred from electronic annotation. Source: Ensembl
      cerebral cortex development: Inferred from sequence or structural similarity. Source: UniProtKB
      collagen biosynthetic process: Inferred from electronic annotation. Source: Ensembl
      collagen fibril organization: Inferred from electronic annotation. Source: Ensembl
      digestive tract development: Inferred from electronic annotation. Source: Ensembl
      extracellular fibril organization: Inferred from electronic annotation. Source: Ensembl
      heart development: Inferred from electronic annotation. Source: Ensembl
      integrin-mediated signaling pathway: Inferred from electronic annotation. Source: Ensembl
      negative regulation of immune response: Inferred from electronic annotation. Source: Ensembl
      negative regulation of neuron migration: Inferred from sequence or structural similarity. Source: UniProtKB
      peptide cross-linking: Inferred from electronic annotation. Source: Ensembl
      positive regulation of Rho protein signal transduction: Inferred from sequence or structural similarity. Source: UniProtKB
      response to cytokine: Inferred from electronic annotation. Source: Ensembl
      response to mechanical stimulus: Inferred from expression pattern (PubMed 20839322). Source: RGD
      response to radiation: Inferred from electronic annotation. Source: Ensembl
      skeletal system development: Inferred from expression pattern (PubMed 10373016). Source: RGD
      skin development: Inferred from electronic annotation. Source: Ensembl
      transforming growth factor beta receptor signaling pathway: Inferred from electronic annotation. Source: Ensembl
      wound healing: Inferred from electronic annotation. Source: Ensembl

   Cellular_component
      collagen type III: Inferred from electronic annotation. Source: Ensembl
      extracellular matrix: Inferred from direct assay (PubMed 20610530). Source: RGD
      extracellular space: Inferred from electronic annotation. Source: Ensembl

   Molecular_function
      extracellular matrix structural constituent: Inferred from electronic annotation. Source: Ensembl
      metal ion binding: Inferred from electronic annotation. Source: UniProtKB-KW

Sequence annotation (Features)

Feature key        Position(s)    Length  Description                            Feature identifier

Molecule processing

Signal peptide     1 – 23         23      (by similarity)
Propeptide         24 – 154       131     N-terminal propeptide (by similarity)  PRO_0000005746
Chain              155 – 1218     1064    Collagen alpha-1(III) chain            PRO_0000005747
Propeptide         1219 – 1463    245     C-terminal propeptide (by similarity)  PRO_0000043408

Regions

Domain             31 – 90        60      VWFC
Domain             1229 – 1463    235     Fibrillar collagen NC1
Region             155 – 169      15      Nonhelical region (N-terminal)
Region             170 – 1195     1026    Triple-helical region
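
The Length value of each ranged feature above follows from its inclusive positions as end - start + 1. The short Python sketch below re-derives the lengths from the positions listed in this entry and also checks that the four processing segments (signal peptide, two propeptides, chain) add up to the 1463-residue precursor. The names `features` and `span_length` are illustrative only and are not part of any UniProt tooling.

```python
def span_length(start, end):
    """Length of an inclusive 1-based residue range."""
    return end - start + 1


# (label, start, end, length as listed in the entry)
features = [
    ("Signal peptide", 1, 23, 23),
    ("Propeptide, N-terminal", 24, 154, 131),
    ("Chain", 155, 1218, 1064),
    ("Propeptide, C-terminal", 1219, 1463, 245),
    ("Domain VWFC", 31, 90, 60),
    ("Domain Fibrillar collagen NC1", 1229, 1463, 235),
    ("Region, nonhelical (N-terminal)", 155, 169, 15),
    ("Region, triple-helical", 170, 1195, 1026),
]

# Any row whose listed length disagrees with its positions ends up here.
mismatches = [
    label for label, start, end, listed in features
    if span_length(start, end) != listed
]

# The first four rows are the processing segments of the precursor.
processed_total = sum(span_length(s, e) for _, s, e, _ in features[:4])

print(mismatches)       # []
print(processed_total)  # 1463
```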

Sites

Metal binding      1277           1       Calcium (by similarity)
Metal binding      1279           1       Calcium (by similarity)
Metal binding      1280           1       Calcium; via carbonyl oxygen (by similarity)
Metal binding      1282           1       Calcium; via carbonyl oxygen (by similarity)
Metal binding      1285           1       Calcium (by similarity)

Amino acid modifications

Modified residue   262            1       5-hydroxylysine; alternate (by similarity)
Modified residue   283            1       5-hydroxylysine (by similarity)
Modified residue   859            1       5-hydroxylysine (by similarity)
Modified residue   976            1       5-hydroxylysine (by similarity)
Modified residue   1093           1       5-hydroxylysine (by similarity)
Modified residue   1105           1       5-hydroxylysine (by similarity)
Glycosylation      262            1       O-linked (Gal...); alternate (by similarity)
Disulfide bond     1195                   Interchain (by similarity)
Disulfide bond     1196                   Interchain (by similarity)
Disulfide bond     1259 ↔ 1291            (by similarity)
Disulfide bond     1265                   Interchain (with C-1282) (by similarity)
Disulfide bond     1282                   Interchain (with C-1265) (by similarity)
Disulfide bond     1299 ↔ 1461            (by similarity)
Disulfide bond     1369 ↔ 1414            (by similarity)

Experimental info

Sequence conflict  1167           1       N → D in CAA06510. Ref.3
Sequence conflict  1256           1       A → G in CAA06510. Ref.3

Sequences

P13941 [UniParc].

Last modified December 6, 2005. Version 3.
Checksum: 63C218CD2BCA47B6

Length: 1,463 AA. Mass: 138,936 Da.
        10         20         30         40         50         60 
MMSFVQCGTW FLLTLLHPSL ILAQQSNVDE LGCNYLGQSY ESRDVWKPEP CQICVCDSGS 

        70         80         90        100        110        120 
VLCDDIMCDD EPLDCPNPEI PFGECCAICP QPSTPAPVIP DGNRPQGPKG DPGPPGIPGR 

       130        140        150        160        170        180 
NGDPGLPGQP GLPGPPGSPG ICESCPTGGQ NYSPQFDSYD VKSGVGGMGG YPGPAGPPGP 

       190        200        210        220        230        240 
PGPPGSSGHP GSPGSPGYQG PPGEPGQAGP AGPPGPPGAI GPSGPAGKDG ESGRPGRPGE 

       250        260        270        280        290        300 
RGLPGPPGIK GPAGIPGFPG MKGHRGFDGR NGEKGETGAP GLKGENGLPG DNGAPGPMGP 

       310        320        330        340        350        360 
RGAPGERGRP GLPGAAGARG NDGARGSDGQ PGPPGPPGTA GFPGSPGAKG EVGPAGSPGS 

       370        380        390        400        410        420 
NGSPGQRGEP GPQGHAGAQG PPGPPGNNGS PGGKGEMGPA GIPGAPGLLG ARGPPGPAGA 

       430        440        450        460        470        480 
NGAPGQRGPS GEPGKNGAKG EPGARGERGE AGSPGIPGPK GEDGKDGSPG EPGANGVPGN 

       490        500        510        520        530        540 
PGERGAPGFR GPAGPNGAPG EKGPAGERGG PGPAGPRGVA GEPGRDGTPG GPGIRGMPGS 

       550        560        570        580        590        600 
PGGPGNDGKP GPPGSQGESG RPGPPGPSGP RGQPGVMGFP GPKGNDGAPG KNGERGGPGG 

       610        620        630        640        650        660 
PGLPGPAGKN GETGPQGPPG PTGAPGDKGD AGPPGPQGLQ GIPGTSGPPG ENGKPGEPGP 

       670        680        690        700        710        720 
KGEAGAPGVP GGKGDSGAPG ERGPPGTAGT PGLRGGAGPP GPEGGKGPAG PPGPPGTSGP 

       730        740        750        760        770        780 
PGLQGMPGER GGPGSPGPKG EKGEPGGAGA DGVPGKDGPR GPAGPIGPPG PAGQPGDKGE 

       790        800        810        820        830        840 
GGAPGLPGIA GPRGGPGERG EHGPPGPAGF PGAPGQNGEP GAKGERGAPG EKGEGGPPGA 

       850        860        870        880        890        900 
AGPPGGSGPA GPPGPQGVKG ERGSPGGPGA AGFPGGRGLP GPPGNNGNPG PPGPSGAPGK 

       910        920        930        940        950        960 
DGPPGPAGNS GSPGNPGVAG PKGDAGQPGE KGPPGAQGPP GSPGPLGIAG LTGARGLAGP 

       970        980        990       1000       1010       1020 
PGMPGPRGSP GPQGIKGESG KPGASGHNGE RGPPGPQGLP GQPGTAGEPG RDGNPGSDGQ 

      1030       1040       1050       1060       1070       1080 
PGRDGSPGGK GDRGENGSPG APGAPGHPGP PGPVGPSGKN GDRGETGPAG PSGAPGPAGA 

      1090       1100       1110       1120       1130       1140 
RGAPGPQGPR GDKGETGERG SNGIKGHRGF PGNPGPPGSP GAAGHQGAVG SPGPAGPRGP 

      1150       1160       1170       1180       1190       1200 
VGPHGPPGKD GSSGHPGPIG PPGPRGNRGE RGSEGSPGHP GQPGPPGPPG APGPCCGGGA 

      1210       1220       1230       1240       1250       1260 
AIAGVGGEKS GGFSPYYGDD PMDFKINTEE IMSSLKSVNG QIESLISPDG SRKNPARNCR 

      1270       1280       1290       1300       1310       1320 
DLKFCHPELK SGEYWVDPNQ GCKMDAIKVF CNMETGETCI NASPMTVPRK HWWTDAGAEK 

      1330       1340       1350       1360       1370       1380 
KHVWFGESMN GGFQFSYGNP DLPEDVLDVQ LAFLRLLSSR ASQNITYHCK NSIAYMDQAN 

      1390       1400       1410       1420       1430       1440 
GNVKKSLKLM GSNEGEFKAE GNSKFTYTVL EDGCTKHTGE WSKTVFEYQT RKAMRLPIID 

      1450       1460 
IAPYDIGGPD QEFGVDIGPV CFL 
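
The sequence above is laid out in blocks of ten residues under a ruler of position numbers, so it has to be joined before it can be indexed by position. The Python sketch below shows one way to do that; the helper name `join_sequence_blocks` is hypothetical, and it assumes ruler lines contain only digits and whitespace. It is run on the last two rows of the display (residues 1381-1463) rather than on the whole sequence.

```python
def join_sequence_blocks(text):
    """Join a ruler-and-blocks sequence display into one residue string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines made only of position numbers.
        if not tokens or all(t.isdigit() for t in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# Last two rows of the displayed sequence, residues 1381-1463.
excerpt = """
      1390       1400       1410       1420       1430       1440
GNVKKSLKLM GSNEGEFKAE GNSKFTYTVL EDGCTKHTGE WSKTVFEYQT RKAMRLPIID

      1450       1460
IAPYDIGGPD QEFGVDIGPV CFL
"""

tail = join_sequence_blocks(excerpt)
print(len(tail))          # 83
print(tail[1461 - 1381])  # C
```

The second print converts position 1461 to an offset within the excerpt and finds a cysteine there, which agrees with the 1299 ↔ 1461 disulfide bond listed in the feature table.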

References

[1] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Lung.
[2] "Cloning of cDNA for rat pro alpha 1(III) collagen mRNA. Different expression patterns of type I and type III collagen and fibronectin genes in experimental granulation tissue."
Glumoff V., Maekelae J.K., Vuorio E.
Biochim. Biophys. Acta 1217:41-48(1994)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 828-1463.
[3] Wurtz T., Ellerstroem C., Lundmark C., Christersson C.
Submitted (APR-1998) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 900-1463.
Strain: Sprague-Dawley.
Tissue: Fibroblast.
[4] "Regulation of alpha 2(I), alpha 1(III), and alpha 2(V) collagen mRNAs by estradiol in the immature rat uterus."
Frankel F.R., Hsu C.-Y.J., Meyers J.C., Lin E., Lyttle C.R., Komm B., Mohn K.
DNA 7:347-354(1988)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1135-1309.

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: BC087039 mRNA. Translation: AAH87039.1.
                   X70369 mRNA. Translation: CAA49832.1.
                   AJ005395 mRNA. Translation: CAA06510.1.
                   M21354 mRNA. Translation: AAA40942.1.
PIR: S41067.
RefSeq: NP_114474.1. NM_032085.1.
UniGene: Rn.3247.

3D structure databases

ProteinModelPortal: P13941.

Protein-protein interaction databases

IntAct: P13941. 1 interaction.
STRING: 10116.ENSRNOP00000004956.

Proteomic databases

PaxDb: P13941.
PRIDE: P13941.

Genome annotation databases

Ensembl: ENSRNOT00000004956; ENSRNOP00000004956; ENSRNOG00000003357.
GeneID: 84032.
KEGG: rno:84032.
UCSC: RGD:71029. rat.

Organism-specific databases

CTD: 1281.
RGD: 71029. Col3a1.

Phylogenomic databases

eggNOG: NOG12793.
GeneTree: ENSGT00740000114967.
HOGENOM: HOG000085654.
HOVERGEN: HBG004933.
InParanoid: P13941.
KO: K06236.
OMA: KYVWFGE.
OrthoDB: EOG7TJ3HH.
TreeFam: TF344135.

Gene expression databases

Genevestigator: P13941.

Family and domain databases

InterPro: IPR008160. Collagen.
          IPR000885. Fib_collagen_C.
          IPR001007. VWF_C.
Pfam: PF01410. COLFI. 1 hit.
      PF01391. Collagen. 4 hits.
      PF00093. VWC. 1 hit.
ProDom: PD002078. Fib_collagen_C. 1 hit.
SMART: SM00038. COLFI. 1 hit.
       SM00214. VWC. 1 hit.
PROSITE: PS51461. NC1_FIB. 1 hit.
         PS01208. VWFC_1. 1 hit.
         PS50184. VWFC_2. 1 hit.

Other

NextBio: 616623.
PRO: P13941.

Entry information

Entry name: CO3A1_RAT
Accession: Primary (citable) accession number: P13941
Secondary accession number(s): O70604, Q5PQT6
Entry history:
Integrated into UniProtKB/Swiss-Prot: January 1, 1990
Last sequence update: December 6, 2005
Last modified: February 19, 2014
This is version 118 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families