Protein
Submitted name:

Collagen alpha-1(XVIII) chain isoform A

Gene

COL18A1

Organism
Alligator mississippiensis (American alligator)
Status
Unreviewed - Annotation score: - Protein predicted

Function

Caution

The sequence shown here is derived from an EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is preliminary data. (Imported)

GO - Molecular function

Names & Taxonomy

Protein names
Submitted name: Collagen alpha-1(XVIII) chain isoform A (Imported)
Gene names
Name: COL18A1 (Imported)
ORF Names: Y1Q_0008757 (Imported)
Organism: Alligator mississippiensis (American alligator) (Imported)
Taxonomic identifier: 8496 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Archelosauria > Archosauria > Crocodylia > Alligatoridae > Alligatorinae > Alligator
Proteomes
  • UP000050525 Component: Unassembled WGS sequence

Subcellular location

[Figure: subcellular compartment diagram. Graphics by Christian Stolte; Source: COMPARTMENTS]

PTM / Processing

Molecule processing

Feature key      Position(s)   Description                          Length
Signal peptide   1 – 27        (Sequence analysis)                  27
Chain            28 – 1387     PRO_5007585885 (Sequence analysis)   1360
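
The Length column is consistent with inclusive, 1-based positions, so each length is end minus start plus one. A minimal sketch of that arithmetic follows; the function name `feature_length` and the `features` dictionary are hypothetical, introduced only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based start and end positions."""
    if start < 1 or end < start:
        raise ValueError(f"invalid feature range: {start}-{end}")
    return end - start + 1


# Positions copied from the Molecule processing table above.
features = {
    "Signal peptide": (1, 27),
    "Chain": (28, 1387),
}

for name, (start, end) in features.items():
    print(f"{name}: {feature_length(start, end)}")
```

The two features are contiguous, so their lengths should add up to the full sequence length reported for the entry.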

Keywords - PTM

Disulfide bond (SAAS annotation)

Family & Domains

Domains and Repeats

Feature key   Position(s)   Description                          Length
Domain        38 – 226      LAM_G_DOMAIN (InterPro annotation)   189
Domain        87 – 225      LAM_G_DOMAIN (InterPro annotation)   139

Keywords - Domain

Collagen (SAAS annotation, Imported), Signal (Sequence analysis)

Family and domain databases

CDD: cd00247 Endostatin-like, 1 hit
Gene3D: 3.10.100.10, 1 hit
InterPro:
  • IPR016186 C-type_lectin-like/link_sf
  • IPR039075 COL18A1
  • IPR008160 Collagen
  • IPR010515 Collagenase_NC10/endostatin
  • IPR013320 ConA-like_dom_sf
  • IPR016187 CTDL_fold
  • IPR001791 Laminin_G
PANTHER: PTHR45362, 1 hit
Pfam:
  • PF01391 Collagen, 6 hits
  • PF06482 Endostatin, 1 hit
SMART:
  • SM00282 LamG, 1 hit
  • SM00210 TSPN, 1 hit
SUPFAM:
  • SSF49899, 1 hit
  • SSF56436, 2 hits

Sequence (1+)

Sequence status: Complete.

This entry has 1 described isoform and 1 potential isoform that is computationally mapped.

A0A151NAH2-1 [UniParc]
        10         20         30         40         50
MRGRCPQRRR RPRPQLLPGL FVLLAAAASQ ERENFSTEVG LLQLIGDPPP
        60         70         80         90        100
EQITKIYGPD NNPGYVFGPD ANTGQVARYH LPSPFYRDFS LLFHIQPTTN
       110        120        130        140        150
KAGVLFAITD SSQAVIYIGV KLSDAKEGKQ HIIFYYTEPG SQSSYAAATF
       160        170        180        190        200
TVSSLLNQWT RFAVSVEDEE VVLFLDCEEY DRIRFERSPD EMELEDGSGL
       210        220        230        240        250
FVGQAGGADP DKYQGVIADL KIKNDPHAAV LQCEEDEDDT EEMSGDFGSG
       260        270        280        290        300
VEEKLHSSGK ERGIPTVSGL PKPPPVTSPP IAGRPVEKVS GSSQLQTEHI
       310        320        330        340        350
KAEETPPVST GARVGQKGEK GEKGERGPKG DSRTGGVPST GGVKGEKGEK
       360        370        380        390        400
GELGVKGSAG FGYPGSKGQK GEPGAPGDPG PPGPPGPSGT ILQHQDGSVV
       410        420        430        440        450
EHVTGPPGAM GPPGLPGKDG LPGKDGEPGD PGEDGKPGDT GPQGFPGTPG
       460        470        480        490        500
EPGLKGEKGD PGVSVKGSPG PPGPPGPPGI PGLSSKQDKL TFIDMEGSGF
       510        520        530        540        550
AGDLESLRGP RGPPGPPGPP GVPGLPGEPG RFGMNHTAIP GPPGLPGKDG
       560        570        580        590        600
SPGLQGPSGP PGPPGREGLP GQPGFRGEKG DSGDLGLPGA PGPKGSKGET
       610        620        630        640        650
GPQGSPGETG LAGLPGPIGP RGQPGPPGPP GPGYEAGFGD MEGSGIPIVS
       660        670        680        690        700
SVPGPRGPEG PQGPPGLPGL KGDIGSLGQP GVPGQKGDPG TPGTNGQPGL
       710        720        730        740        750
EGFPGPQGPK GDQGSPGVKG ERGQDGVGLP GPPGPPGPPG EIVYASNKTL
       760        770        780        790        800
AVLPGPEGRP GHAGFPGPVG PKGEAGSPGL QGFPGLKGEK GEPGVIIGPD
       810        820        830        840        850
GTVTALDTKG EKGEPGLRGP VGPLGPHGRP GQKGEIGFPG RPGRPGMNGL
       860        870        880        890        900
KGEKGDPADP SGGFSVPGLP GPPGPPGPPG SPGSIVPIYD GNAFTESGPP
       910        920        930        940        950
GPPGLPGYQG VPGHKGEKGD TGAPGPPGQF PYDLSRFGAA FRGEKGEQGD
       960        970        980        990       1000
SGLKGEKGEP GGGTLYGPGV GQPGLPGPQG YPGLPGPKGD SIVGPPGPPG
      1010       1020       1030       1040       1050
PQGPPGIGYE GRQGPPGPPG PPGPPSFPGP HRQTNVVGPP GPPGPPGPPG
      1060       1070       1080       1090       1100
TSGTSASSGL KILPSYQAML STAHEVPEGW LVFIVDREEL YVRVRNGFRR
      1110       1120       1130       1140       1150
ILLEDHTVIS STALDNEVYD KPPSIHFARG SSPSSGSQPH VPVHPHRDYN
      1160       1170       1180       1190       1200
TYATVRPWGG DDIIANHHRL PKQPSIHQGA QHQQENLHDF YPNRRPEDTP
      1210       1220       1230       1240       1250
SAMHTHHDFQ PALHLVALNT PLSGSMRGIR GADFQCFQQA REVGLTGTFR
      1260       1270       1280       1290       1300
AFLSSRLQDL YSIVRRADRS SVPIVNLRDE VLFNNWESLF SGTEAQLRAA
      1310       1320       1330       1340       1350
THILSFNGKD ILRDSTWPQK SVWHGSDSKG RRLTENYCET WRTDSSVVTG
      1360       1370       1380
QASSLSSGKL LEQKASSCQN AFIVLCIENS FMTSSKK
Length: 1,387
Mass (Da): 142,088
Last modified: June 8, 2016 - v1
Checksum: 9625A35BC8371996
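
The sequence above is laid out in blocks of ten residues under position rulers. To use it programmatically, the rulers and spacing have to be stripped first. The following is a minimal sketch under that assumption; `parse_blocked_sequence` is a hypothetical helper, not part of any UniProt tooling, and the sample holds only the first displayed row of the entry.

```python
def parse_blocked_sequence(text: str) -> str:
    """Collapse a ruler-and-blocks sequence listing into one plain string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines (every token is a position number).
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First displayed row of the entry (positions 1-50), copied from above.
sample = """
        10         20         30         40         50
MRGRCPQRRR RPRPQLLPGL FVLLAAAASQ ERENFSTEVG LLQLIGDPPP
"""

sequence = parse_blocked_sequence(sample)
signal_peptide = sequence[:27]  # positions 1-27 (1-based, inclusive)
print(len(sequence), signal_peptide)
```

Applied to the full listing, the parsed string should have the length reported for the entry; the slice uses 0-based Python indexing, so positions 1 to 27 correspond to `sequence[:27]`.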

Computationally mapped potential isoform sequences

There is 1 potential isoform mapped to this entry.

Entry: A0A151N9V7
Entry name: A0A151N9V7_ALLMI
Protein names: Collagen alpha-1(XVIII) chain isofo...
Gene names: COL18A1 Y1Q_0008757
Length: 1,523
Annotation score:

Sequence databases

EMBL / GenBank / DDBJ: AKHW03003682 Genomic DNA. Translation: KYO33599.1
RefSeq: XP_019347547.1, XM_019492002.1

Genome annotation databases

GeneID: 102563131

Similar proteins

Entry information

Entry name: A0A151NAH2_ALLMI
Accession: Primary (citable) accession number: A0A151NAH2
Entry history: Integrated into UniProtKB/TrEMBL: June 8, 2016
Last sequence update: June 8, 2016
Last modified: September 12, 2018
This is version 13 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)
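
The entry name for this unreviewed record is the accession followed by an underscore and a species code (ALLMI). A short sketch of checking an accession and splitting an entry name is given below. The regular expression is the accession pattern published in UniProt's help pages; the helper names `is_uniprot_accession` and `split_entry_name` are hypothetical.

```python
import re

# Accession pattern as documented by UniProt (6- or 10-character accessions).
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9]([A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_uniprot_accession(value: str) -> bool:
    """True if the whole string matches the UniProt accession pattern."""
    return ACCESSION_RE.fullmatch(value) is not None


def split_entry_name(entry_name: str) -> tuple[str, str]:
    """Split an entry name such as 'A0A151NAH2_ALLMI' at its last underscore."""
    prefix, _, species = entry_name.rpartition("_")
    if not prefix or not species:
        raise ValueError(f"not an entry name: {entry_name!r}")
    return prefix, species


prefix, species = split_entry_name("A0A151NAH2_ALLMI")
print(prefix, species, is_uniprot_accession(prefix))
```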

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome (Imported)
