Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P30754 (CAFF_RIFPA)

Last modified November 24, 2009. Version 50. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information

Names and origin

Protein namesRecommended name:
    Fibril-forming collagen alpha chain
OrganismRiftia pachyptila (Tube worm)
Taxonomic identifier6426 [NCBI]
Taxonomic lineageEukaryotaMetazoaPogonophoraVestimentiferaRiftiidaRiftiidaeRiftia

Protein attributes

Sequence length1027 AA.
Sequence statusFragment.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Fibril-forming collagen.

Subunit structure

Homotetramer.

Subcellular location

Secretedextracellular spaceextracellular matrix By similarity.

Ontologies

Keywords
   Cellular componentExtracellular matrix
Secreted
   DomainCollagen
Repeat
   PTMGlycoprotein
Hydroxylation
   Technical termDirect protein sequencing
Gene Ontology (GO)
   Cellular componentproteinaceous extracellular matrix

Inferred from electronic annotation. Source: UniProtKB-SubCell

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain‹1 – ›1027›1027Fibril-forming collagen alpha chain
PRO_0000059394

Regions

Region1 – 1212Nonhelical region (N-terminal)
Region13 – 10231011Triple-helical region
Region1024 – 10274Nonhelical region (C-terminal)

Sites

Site6101Imperfection in the GAA repeat

Amino acid modifications

Modified residue2114-hydroxyproline; partial Ref.1
Modified residue2414-hydroxyproline; partial
Modified residue2714-hydroxyproline Ref.1
Modified residue3914-hydroxyproline Ref.1
Modified residue5313-hydroxyproline; partial
Modified residue5414-hydroxyproline Ref.1
Modified residue7214-hydroxyproline; partial Ref.1
Modified residue9014-hydroxyproline Ref.1
Modified residue9314-hydroxyproline Ref.1
Modified residue9615-hydroxylysine Probable
Modified residue10815-hydroxylysine Probable
Modified residue12314-hydroxyproline; partial Ref.1
Modified residue12814-hydroxyproline; partial Ref.1
Modified residue15014-hydroxyproline Ref.1
Modified residue16113-hydroxyproline; partial
Modified residue16214-hydroxyproline Ref.1
Modified residue16414-hydroxyproline; partial
Modified residue16513-hydroxyproline
Modified residue17414-hydroxyproline Ref.1
Modified residue17714-hydroxyproline Ref.1
Modified residue18014-hydroxyproline Ref.1
Modified residue18315-hydroxylysine Ref.1
Modified residue19215-hydroxylysine Probable
Modified residue20714-hydroxyproline Ref.1
Modified residue21614-hydroxyproline Ref.1
Modified residue21914-hydroxyproline Ref.1
Modified residue22814-hydroxyproline Ref.1
Modified residue23714-hydroxyproline Ref.1
Modified residue24314-hydroxyproline; partial Ref.1
Modified residue24914-hydroxyproline Ref.1
Modified residue25514-hydroxyproline Ref.1
Modified residue26115-hydroxylysine Probable
Modified residue27314-hydroxyproline; partial Ref.1
Modified residue27614-hydroxyproline; partial Ref.1
Modified residue27915-hydroxylysine Probable
Modified residue28514-hydroxyproline; partial Ref.1
Modified residue29114-hydroxyproline; partial Ref.1
Modified residue30314-hydroxyproline; partial Ref.1
Modified residue30614-hydroxyproline Ref.1
Modified residue31214-hydroxyproline Ref.1
Modified residue32114-hydroxyproline Ref.1
Modified residue32714-hydroxyproline Ref.1
Modified residue33914-hydroxyproline Ref.1
Modified residue34215-hydroxylysine Ref.1
Modified residue34814-hydroxyproline; partial Ref.1
Modified residue35115-hydroxylysine; partial Ref.1
Modified residue36614-hydroxyproline Ref.1
Modified residue37214-hydroxyproline Ref.1
Modified residue37514-hydroxyproline Ref.1
Modified residue38114-hydroxyproline; partial Ref.1
Modified residue38714-hydroxyproline Ref.1
Modified residue41613-hydroxyproline; partial
Modified residue41714-hydroxyproline Ref.1
Modified residue42314-hydroxyproline Ref.1
Modified residue42914-hydroxyproline Ref.1
Modified residue43214-hydroxyproline Ref.1
Modified residue45314-hydroxyproline Ref.1
Modified residue46514-hydroxyproline Ref.1
Modified residue48314-hydroxyproline Ref.1
Modified residue50014-hydroxyproline; partial
Modified residue50314-hydroxyproline; partial
Modified residue50614-hydroxyproline; partial
Modified residue51314-hydroxyproline Ref.1
Modified residue52514-hydroxyproline Ref.1
Modified residue53314-hydroxyproline; partial
Modified residue53614-hydroxyproline; partial
Modified residue54014-hydroxyproline Ref.1
Modified residue54615-hydroxylysine Ref.1
Modified residue55113-hydroxyproline; partial
Modified residue55214-hydroxyproline Ref.1
Modified residue56114-hydroxyproline Ref.1
Modified residue56715-hydroxylysine
Modified residue57315-hydroxylysine Probable
Modified residue60314-hydroxyproline Ref.1
Modified residue61215-hydroxylysine Probable
Modified residue62114-hydroxyproline; partial Ref.1
Modified residue62714-hydroxyproline Ref.1
Modified residue64514-hydroxyproline; partial Ref.1
Modified residue64713-hydroxyproline; partial
Modified residue64814-hydroxyproline Ref.1
Modified residue65715-hydroxylysine Probable
Modified residue66314-hydroxyproline Ref.1
Modified residue70814-hydroxyproline Ref.1
Modified residue71114-hydroxyproline Ref.1
Modified residue71414-hydroxyproline Ref.1
Modified residue71714-hydroxyproline Ref.1
Modified residue72314-hydroxyproline Ref.1
Modified residue73815-hydroxylysine Probable
Modified residue74414-hydroxyproline Ref.1
Modified residue75914-hydroxyproline Ref.1
Modified residue76515-hydroxylysine Probable
Modified residue77313-hydroxyproline; partial
Modified residue77414-hydroxyproline Ref.1
Modified residue78314-hydroxyproline Ref.1
Modified residue79214-hydroxyproline Ref.1
Modified residue81015-hydroxylysine Probable
Modified residue81513-hydroxyproline; partial
Modified residue81614-hydroxyproline Ref.1
Modified residue84314-hydroxyproline Ref.1
Modified residue84914-hydroxyproline Ref.1
Modified residue85514-hydroxyproline Ref.1
Modified residue86114-hydroxyproline Ref.1
Modified residue86714-hydroxyproline Ref.1
Modified residue88814-hydroxyproline Ref.1
Modified residue89414-hydroxyproline Ref.1
Modified residue90314-hydroxyproline
Modified residue91514-hydroxyproline
Modified residue92715-hydroxylysine Probable
Modified residue93315-hydroxylysine; partial
Modified residue93615-hydroxylysine Probable
Modified residue93915-hydroxylysine
Modified residue94514-hydroxyproline
Modified residue95414-hydroxyproline; partial
Modified residue96314-hydroxyproline
Modified residue96614-hydroxyproline
Modified residue98414-hydroxyproline
Modified residue99014-hydroxyproline
Modified residue101013-hydroxyproline; partial
Modified residue101114-hydroxyproline
Modified residue101313-hydroxyproline; partial
Modified residue101414-hydroxyproline
Modified residue101613-hydroxyproline; partial
Modified residue101714-hydroxyproline
Modified residue101913-hydroxyproline; partial
Modified residue102014-hydroxyproline
Glycosylation961O-linked (Gal...) Probable
Glycosylation1081O-linked (Gal...) Probable
Glycosylation1921O-linked (Gal...) Probable
Glycosylation2611O-linked (Gal...) Probable
Glycosylation2791O-linked (Gal...) Probable
Glycosylation5731O-linked (Gal...) Probable
Glycosylation6121O-linked (Gal...) Probable
Glycosylation6571O-linked (Gal...) Probable
Glycosylation7381O-linked (Gal...) Probable
Glycosylation7651O-linked (Gal...) Probable
Glycosylation8101O-linked (Gal...) Probable
Glycosylation9271O-linked (Gal...) Probable
Glycosylation9361O-linked (Gal...) Probable

Natural variations

Natural variant9031P → A

Experimental info

Sequence uncertainty961
Sequence uncertainty1081
Sequence uncertainty1921
Sequence uncertainty2611
Sequence uncertainty2791
Sequence uncertainty5731
Sequence uncertainty6121
Sequence uncertainty6571
Sequence uncertainty7381
Sequence uncertainty7651
Sequence uncertainty8101
Sequence uncertainty9271
Sequence uncertainty9361
Non-terminal residue11
Non-terminal residue10271

Sequences

Sequence LengthMass (Da)Tools
P30754-1 [UniParc].

Last modified August 16, 2004. Version 2.
Checksum: 0555C50E880FDAF6

FASTA1,02794,587
        10         20         30         40         50         60 
YRAGPRYIQA QVGPIGPRGP PGPPGSPGQQ GYQGLRGEPG DSGPMGPIGK RGPPGPAGIA 

        70         80         90        100        110        120 
GKSGDDGRDG EPGPRGGIGP MGPRGAGGMP GMPGPKGHRG FRGLSGSKGE QGKSGNQGPD 

       130        140        150        160        170        180 
GGPGPAGPSG PIGPRGQTGE RGRDGKSGLP GLRGVDGLAG PPGPPGPIGS TGSPGFPGTP 

       190        200        210        220        230        240 
GSKGDRGQSG IKGAQGLQGP VGLSGQPGVA GENGHPGMPG MDGANGEPGA SGESGLPGPS 

       250        260        270        280        290        300 
GFPGPRGMPG TAGSPGQAGA KGDGGPTGEQ GRPGAPGVKG SSGPPGDVGA PGHAGEAGKR 

       310        320        330        340        350        360 
GSPGSPGPAG SPGPQGDRGL PGSRGLPGMT GASGAMGIPG EKGPSGEPGA KGPTGDTGRQ 

       370        380        390        400        410        420 
GNQGTPGIAG LPGNPGSDGR PGKDGRPGIR GKDGKQGEQG PQGPQGLAGL QGRAGPPGAR 

       430        440        450        460        470        480 
GEPGKNGAPG EPGAHGEQGD AGKDGETGAA GPPGAAGPTG ARGPPGPRGQ QGFQGLAGAQ 

       490        500        510        520        530        540 
GTPGEAGKTG ERGAVGATGP SGPAGPGGER GAPGDRGNVG PRGMPGERGA TGPAGPTGSP 

       550        560        570        580        590        600 
GVAGAKGQGG PPGPAGLVGL PGERGPKGVG GSKGSRGDIG PRGKAGERGK DGERGERGEN 

       610        620        630        640        650        660 
GLPGPSGLAA SKGERGDMGS PGERGSPGPA GERGPAGSQG IQGQPGPPGD AGPAGTKGDI 

       670        680        690        700        710        720 
GFPGERGTRG ATGKQGARGP RGLAGKRGLR GAGGSRGETG AQGEIGLPGS PGQPGLPGPS 

       730        740        750        760        770        780 
GQPGPSGPAG TAGKQGVKGA RGSPGLVGKQ GDRGSDGEPG RDGTKGERGE DGPPGVSGPT 

       790        800        810        820        830        840 
GAPGQQGERG MPGMVGLRGE TGPMGGQGMK GDGGPPGPSG DRGERGNAGP QGPTGPSGQA 

       850        860        870        880        890        900 
GAPGQEGAPG KDGLPGLAGR PGERGEPGVA GRAGSQGLAG LMGQRGLPGA AGPPGDRGER 

       910        920        930        940        950        960 
GEPGGQGVQG PVGAPGSQGP AGIMGMKGEA GGKGAKGDKG WTGLPGLQGL QGTPGHSGES 

       970        980        990       1000       1010       1020 
GPPGAPGPRG ARGEAGGRGS QGPPGKDGQP GPSGRVGPRG PSGDDGRSGP PGPPGPPGPP 


GNSDYGA 

« Hide

References

[1]"Amino-acid sequence and cell-adhesion activity of a fibril-forming collagen from the tube worm Riftia pachyptila living at deep sea hydrothermal vents."
Mann K., Gaill F., Timpl R.
Eur. J. Biochem. 210:839-847(1992) [PubMed: 1483468] [Abstract]
Cited for: PROTEIN SEQUENCE.
[2]"Molecular characterization of cuticle and interstitial collagens from worms collected at deep sea hydrothermal vents."
Gaill F., Wiedemann H., Mann K., Kuhn K., Timpl R., Engel J.
J. Mol. Biol. 221:209-223(1991) [PubMed: 1920405] [Abstract]
Cited for: PROTEIN SEQUENCE OF 8-45; 525-618 AND 810-882.
Tissue: Cuticle.

Cross-references

Sequence databases

PIRS28774.

3D structure databases

ModBaseSearch...

Family and domain databases

InterProIPR008160. Collagen.
[Graphical view]
PfamPF01391. Collagen. 15 hits.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameCAFF_RIFPA
AccessionPrimary (citable) accession number: P30754
Entry history
Integrated into UniProtKB/Swiss-Prot: July 1, 1993
Last sequence update: August 16, 2004
Last modified: November 24, 2009
This is version 50 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information