Protein
Submitted name:

Procollagen, type XVIII, alpha 1, isoform CRA_a

Gene

Col18a1

Organism
Rattus norvegicus (Rat)
Status
Unreviewed; annotation score: 3 out of 5; experimental evidence at protein level

Function

GO - Molecular function

GO - Biological process

Enzyme and pathway databases

Reactome: R-RNO-1442490. Collagen degradation.
R-RNO-1592389. Activation of Matrix Metalloproteinases.
R-RNO-1650814. Collagen biosynthesis and modifying enzymes.
R-RNO-2022090. Assembly of collagen fibrils and other multimeric structures.
R-RNO-216083. Integrin cell surface interactions.
R-RNO-3000157. Laminin interactions.

Names & Taxonomy

Protein names
Submitted name: Procollagen, type XVIII, alpha 1, isoform CRA_a (Imported)
Submitted name: Protein Col18a1 (Imported)
Gene names
Name: Col18a1 (Imported)
ORF Names: rCG_60896 (Imported)
Organism: Rattus norvegicus (Rat) (Imported)
Taxonomic identifier: 10116 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Rattus
Proteomes
  • UP000002494 Component: Chromosome 20

Organism-specific databases

RGD: 70936. Col18a1.

Subcellular location

GO - Cellular component

  • collagen type XVIII trimer (Source: RGD)
  • extracellular exosome (Source: Ensembl)
  • extracellular space (Source: RGD)

Keywords - Cellular component

Extracellular matrix (SAAS annotation), Secreted

PTM / Processing

Molecule processing

Feature key     Position(s)  Length  Description        Feature identifier
Signal peptide  1 – 25       25      Sequence analysis
Chain           26 – 1311    1286    Sequence analysis  PRO_5007913950

Keywords - PTM

Disulfide bond (SAAS annotation)

Expression

Gene expression databases

Bgee: ENSRNOG00000001229.

Interaction

Protein-protein interaction databases

STRING: 10116.ENSRNOP00000047962.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Length  Description
Domain       33 – 221     189     LAM_G_DOMAIN; InterPro annotation
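The Length column in the feature tables of this entry follows from the positions, which are 1-based and inclusive. A minimal sketch of that arithmetic, using the positions listed in this entry; the helper name `feature_length` is hypothetical and not part of any UniProt tool:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    if start < 1 or end < start:
        raise ValueError(f"invalid feature range: {start}-{end}")
    return end - start + 1


# Positions copied from the feature tables of this entry.
features = {
    "Signal peptide": (1, 25),
    "Chain": (26, 1311),
    "Domain": (33, 221),
}

lengths = {name: feature_length(start, end) for name, (start, end) in features.items()}

for name, length in lengths.items():
    print(f"{name}: {length}")
```

The signal peptide and the chain do not overlap and together cover all 1,311 residues of the sequence.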

Keywords - Domain

Collagen (SAAS annotation, Imported), Repeat (SAAS annotation), Signal (Sequence analysis)

Phylogenomic databases

eggNOG: KOG3546. Eukaryota.
ENOG410XQ04. LUCA.
GeneTree: ENSGT00710000106713.
KO: K06823.
OMA: GQWTRFA.
OrthoDB: EOG091G013X.

Family and domain databases

Gene3D: 2.60.120.200. 1 hit.
3.10.100.10. 1 hit.
InterPro: IPR016186. C-type_lectin-like.
IPR016187. C-type_lectin_fold.
IPR008160. Collagen.
IPR010515. Collagenase_NC10/endostatin.
IPR013320. ConA-like_dom.
IPR001791. Laminin_G.
Pfam: PF01391. Collagen. 5 hits.
PF06482. Endostatin. 1 hit.
SMART: SM00210. TSPN. 1 hit.
SUPFAM: SSF49899. SSF49899. 1 hit.
SSF56436. SSF56436. 1 hit.

Sequence

Sequence status: Complete.

F1LR02-1 [UniParc]

        10         20         30         40         50
MAPRWHLLDM LTSLVLLLVA RVSWAEPENV AEDVGLLQLL GDPLPQKISQ
        60         70         80         90        100
VEDPHVGLAY VFGPYSKSSQ MAQYHFPKLF FRDFSLLFEV RPTTEAAGVL
       110        120        130        140        150
FAITDAAQVV VSLGVKLSEV RDGQQNISLL YTEPGASQTQ TGASFRLPAF
       160        170        180        190        200
VGQWTHFALS VDGSSVALYV DCEEFQRVPF ARSPHGLELE RGAGLFVGQA
       210        220        230        240        250
GAADPDKFQG MISELRVRKT PRVSPVHCLD EEDDDDDRAS GDFGSGLEES
       260        270        280        290        300
SNLHRQETYL RPGLPQPPPV TSPPLAGGSA TEDSRTEEKE EEATVDSKGA
       310        320        330        340        350
DTLPVTDSSG VWDGDVQNPG GGLIKGGLKG QKGEPGAQGP PGPAGPQGPA
       360        370        380        390        400
GPAVQSPSSQ PVPGAQGPPG PQGPPGKDGI PGRDGEPGDP GEDGRPGDTG
       410        420        430        440        450
PQGFPGTPGD VGPKGEKGDP GIGPRGPPGP PGPPGPSFRQ DKLTFIDMEG
       460        470        480        490        500
SGGFSGDLES LRGPRGFPGP PGPPGVPGLP GEPGRFGVNS SYAPGPAGLP
       510        520        530        540        550
GVPGKEGPPG FPGPPGPPGK EGPPGVAGQK GSVGDAGSPG PKGSKGDLGP
       560        570        580        590        600
IGMPGKSGLP GLPGPVGPPG PPGPPGPPGP GFAAGFDDME GSGTPLWSTA
       610        620        630        640        650
RSSDGLQGDP GVTGPPGAKG EVGADGVQGI PGLPGREGVA GPPGPKGEKG
       660        670        680        690        700
TQGEKGNPGK DGVGRPGLPG PPGPPGPVIY VSNEDRAVVS TPGPEGKPGY
       710        720        730        740        750
AGFPGPAGPK GDLGSKGEQG LPGPKGEKGE PGSIFSPDGT ALGQAQKGAK
       760        770        780        790        800
GEPGFRGPPG PYGRPGYKGE IGFPGRPGRP GTNGLKGEKG EPGEASLGFS
       810        820        830        840        850
MRGLPGPPGP PGPPGPPGVP VYDSNAFVES GRPGLPGQQG VQGPPGPKGD
       860        870        880        890        900
KGEVGPPGPP GQFPIDLFHL EAEMKGDKGD RGDAGRKGER GEPGAPGGGF
       910        920        930        940        950
FSSSVPGPPG PPGYPGIPGP KGESIRGPPG PPGPQGPPGI GYEGRQGPPG
       960        970        980        990       1000
PPGPPGPPSF PGPHRQTVSV PGPPGPPGPP GPPGAMGASA GQVRIWATYQ
      1010       1020       1030       1040       1050
TMLDKIREVP EGWLIFVAER EELYVRVRNG FRKVLLEART ALPHGTDNEV
      1060       1070       1080       1090       1100
AALQPPLVQL HEGSSYTRRE HSYPTARPWR ADDILANPPR LPDRQPYPGV
      1110       1120       1130       1140       1150
PHHHHHHHHH HSSHEHRPPA HPSPSPAHTH QDFHPVLHLV ALNTPLSGGM
      1160       1170       1180       1190       1200
RGIRGADFQC FQQARAVGLS GTFRAFLSSR LQDLYSIVRR ADRSSVPIVN
      1210       1220       1230       1240       1250
LKDEVLSPSW DTLFSGSQGQ LHSGARIFSF DGRDVLRHPA WPQKSVWHGS
      1260       1270       1280       1290       1300
DPSGRRLMES YCETWRTEAT GVTGQASSLL SGRLLEQKAE SCHNSYIVLC
      1310
IENSFMTSFS K
Length: 1,311
Mass (Da): 134,646
Last modified: April 3, 2013 - v2
Checksum: 7079C49C842C7745
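
The sequence listing groups residues in blocks of ten, with position rulers between rows. A minimal sketch of turning that layout into one contiguous string, then slicing out the signal peptide (positions 1 to 25) reported in this entry. The function name `parse_grouped_sequence` is hypothetical, and only the first two rows of the listing are used as sample input:

```python
import re


def parse_grouped_sequence(text: str) -> str:
    """Join a ruler-annotated, 10-residue-grouped listing into one string."""
    rows = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip blank lines and ruler lines, which hold only digits and spaces.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        rows.append(stripped.replace(" ", ""))
    return "".join(rows)


# First two rows of the listing in this entry (residues 1-100).
sample = """\
        10         20         30         40         50
MAPRWHLLDM LTSLVLLLVA RVSWAEPENV AEDVGLLQLL GDPLPQKISQ
        60         70         80         90        100
VEDPHVGLAY VFGPYSKSSQ MAQYHFPKLF FRDFSLLFEV RPTTEAAGVL
"""

seq = parse_grouped_sequence(sample)

# Feature positions are 1-based and inclusive, so 1-25 maps to the slice [0:25].
signal_peptide = seq[0:25]

print(len(seq))
print(signal_peptide)
```

Applied to the full listing, the same function should yield a string whose length matches the Length field, provided the listing was copied without loss.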

Sequence databases

EMBL/GenBank/DDBJ: AABR07044596. Genomic DNA. No translation available.
CH473988. Genomic DNA. Translation: EDL97114.1.
RefSeq: NP_445941.2. NM_053489.2.
UniGene: Rn.12030.

Genome annotation databases

Ensembl: ENSRNOT00000045315; ENSRNOP00000047962; ENSRNOG00000001229.
GeneID: 85251.
KEGG: rno:85251.

Cross-references

Organism-specific databases

CTD: 80781.
RGD: 70936. Col18a1.

Entry information

Entry name: F1LR02_RAT
Accession: Primary (citable) accession number: F1LR02
Entry history
Integrated into UniProtKB/TrEMBL: May 3, 2011
Last sequence update: April 3, 2013
Last modified: September 7, 2016
This is version 51 of the entry and version 2 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Keywords - Technical term

Complete proteome, Proteomics identification (Combined sources), Reference proteome (Imported)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
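
The UniRef90 and UniRef50 descriptions both reduce to a pair of thresholds: a minimum sequence identity and a minimum overlap with the seed sequence. The sketch below illustrates only that threshold test. It is not the UniRef clustering procedure itself, which the text does not describe; the names `THRESHOLDS` and `meets_threshold` and the example values are hypothetical.

```python
# (minimum identity, minimum overlap) as fractions, taken from the
# UniRef90 and UniRef50 descriptions in this entry.
THRESHOLDS = {
    "UniRef90": (0.90, 0.80),
    "UniRef50": (0.50, 0.80),
}


def meets_threshold(level: str, identity: float, overlap: float) -> bool:
    """Check a candidate against the identity and overlap cut-offs for a level.

    identity and overlap are fractions between 0 and 1, both measured
    against the seed (longest) sequence of the cluster.
    """
    min_identity, min_overlap = THRESHOLDS[level]
    return identity >= min_identity and overlap >= min_overlap


# Hypothetical candidates, for illustration only.
print(meets_threshold("UniRef90", identity=0.93, overlap=0.85))
print(meets_threshold("UniRef90", identity=0.93, overlap=0.70))
print(meets_threshold("UniRef50", identity=0.55, overlap=0.80))
```

Both conditions must hold, so a candidate with high identity but a short aligned region is rejected at either level.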