Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein
Submitted name:

Collagen alpha-1(II) chain

Gene

COL2A1

Organism
Bos taurus (Bovine)
Status
Unreviewed-Annotation score: Annotation score: 1 out of 5-Protein predictedi

Functioni

GO - Molecular functioni

Complete GO annotation...

Names & Taxonomyi

Protein namesi
Submitted name:
Collagen alpha-1(II) chainImported
Gene namesi
Name:COL2A1Imported
OrganismiBos taurus (Bovine)Imported
Taxonomic identifieri9913 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaLaurasiatheriaCetartiodactylaRuminantiaPecoraBovidaeBovinaeBos
ProteomesiUP000009136 Componenti: Chromosome 5

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

PTM / Processingi

Proteomic databases

PRIDEiF1MSR8.

Expressioni

Gene expression databases

ExpressionAtlasiF1MSR8. baseline.

Family & Domainsi

Phylogenomic databases

GeneTreeiENSGT00780000121837.
KOiK06236.
OrthoDBiEOG7TJ3HH.

Family and domain databases

InterProiIPR008160. Collagen.
IPR000885. Fib_collagen_C.
[Graphical view]
PfamiPF01410. COLFI. 1 hit.
PF01391. Collagen. 8 hits.
[Graphical view]
ProDomiPD002078. Fib_collagen_C. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00038. COLFI. 1 hit.
[Graphical view]
PROSITEiPS51461. NC1_FIB. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

F1MSR8-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MIRLGAPQTL VLLTLLVAAV LRCHGQDVRQ PGPKGQKGEP GDIKDIVGPK
60 70 80 90 100
GPPGPQGPAG EQGPRGDRGD KGEKGAPGPR GRDGEPGTPG NPGPPGPPGP
110 120 130 140 150
PGPPGLGGNF AAQMAGGFDE KAGGAQMGVM QGPMGPMGPR GPPGPAGAPG
160 170 180 190 200
PQGFQGNPGE PGEPGVSGPM GPRGPPGPPG KPGDDGEAGK PGKSGERGPP
210 220 230 240 250
GPQGARGFPG TPGLPGVKGH RGYPGLDGAK GEAGAPGVKG ESGSPGENGS
260 270 280 290 300
PGPMGPRGLP GERGRTGPAG AAGARGNDGQ PGPAGPPGPV GPAGGPGFPG
310 320 330 340 350
APGAKGEAGP TGARGPEGAQ GPRGEPGTPG SPGPAGAAGN PGTDGIPGAK
360 370 380 390 400
GSAGAPGIAG APGFPGPRGP PGPQGATGPL GPKGQTGEPG IAGFKGEQGP
410 420 430 440 450
KGEPGPAGPQ GAPGPAGEEG KRGARGEPGG AGPAGPPGER GAPGNRGFPG
460 470 480 490 500
QDGLAGPKGA PGERGPSGLA GPKGANGDPG RPGEPGLPGA RGLTGRPGDA
510 520 530 540 550
GPQGKVGPSG APGEDGRPGP PGPQGARGQP GVMGFPGPKG ANGEPGKAGE
560 570 580 590 600
KGLPGAPGLR GLPGKDGETG AAGPPGPAGP AGERGEQGAP GPSGFQGLPG
610 620 630 640 650
PPGPPGEGGK PGDQGVPGEA GAPGLVGPRG ERGFPGERGS PGSQGLQGAR
660 670 680 690 700
GLPGTPGTDG PKGAAGPAGP PGAQGPPGLQ GMPGERGAAG IAGPKGDRGD
710 720 730 740 750
VGEKGPEGAP GKDGGRGLTG PIGPPGPAGA NGEKGEVGPP GPAGTAGARG
760 770 780 790 800
APGERGETGP PGPAGFAGPP GADGQPGAKG EQGEAGQKGD AGAPGPQGPS
810 820 830 840 850
GAPGPQGPTG VTGPKGARGA QGPPGATGFP GAAGRVGPPG SNGNPGPPGP
860 870 880 890 900
PGPSGKDGPK GARGDSGPPG RAGDPGLQGP AGPPGEKGEP GDDGPSGPDG
910 920 930 940 950
PPGPQGLAGQ RGIVGLPGQR GERGFPGLPG PSGEPGKQGA PGASGDRGPP
960 970 980 990 1000
GPVGPPGLTG PAGEPGREGS PGADGPPGRD GAAGVKGDRG ETGAVGAPGA
1010 1020 1030 1040 1050
PGPPGSPGPA GPIGKQGDRG EAGAQGPMGP AGPAGARGMP GPQGPRGDKG
1060 1070 1080 1090 1100
ETGEAGERGL KGHRGFTGLQ GLPGPPGPSG DQGASGPAGP SGPRGPPGPV
1110 1120 1130 1140 1150
GPSGKDGANG IPGPIGPPGP RGRSGETGPA GPPGNPGPPG PPGPPGPGID
1160 1170 1180 1190 1200
MSAFAGLGQR EKGPDPLQYM RADEAAGNLR QHDAEVDATL KSLNNQIESL
1210 1220 1230 1240 1250
RSPEGSRKNP ARTCRDLKLC HPEWKSGDYW IDPNQGCTLD AMKVFCNMET
1260 1270 1280 1290 1300
GETCVYPNPA SVPKKNWWSS KSKDKKHIWF GETINGGFHF SYGDDNLAPN
1310 1320 1330 1340 1350
TANVQMTFLR LLSTEGSQNI TYHCKNSIAY LDEAAGNLKK ALLIQGSNDV
1360 1370 1380 1390 1400
EIRAEGNSRF TYTVLKDGCT KHTGKWGKTM IEYRSQKTSR LPIIDIAPMD
1410
IGGPEQEFGV DIGPVCFL
Length:1,418
Mass (Da):134,427
Last modified:November 16, 2011 - v2
Checksum:i95045FA20C8B2A39
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
DAAA02012985 Genomic DNA. No translation available.
DAAA02012986 Genomic DNA. No translation available.
RefSeqiNP_001106695.1. NM_001113224.1.
UniGeneiBt.21390.

Genome annotation databases

EnsembliENSBTAT00000017509; ENSBTAP00000017509; ENSBTAG00000013155.
GeneIDi407142.
KEGGibta:407142.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
DAAA02012985 Genomic DNA. No translation available.
DAAA02012986 Genomic DNA. No translation available.
RefSeqiNP_001106695.1. NM_001113224.1.
UniGeneiBt.21390.

3D structure databases

ModBaseiSearch...
MobiDBiSearch...

Proteomic databases

PRIDEiF1MSR8.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSBTAT00000017509; ENSBTAP00000017509; ENSBTAG00000013155.
GeneIDi407142.
KEGGibta:407142.

Organism-specific databases

CTDi1280.

Phylogenomic databases

GeneTreeiENSGT00780000121837.
KOiK06236.
OrthoDBiEOG7TJ3HH.

Miscellaneous databases

NextBioi20818406.

Gene expression databases

ExpressionAtlasiF1MSR8. baseline.

Family and domain databases

InterProiIPR008160. Collagen.
IPR000885. Fib_collagen_C.
[Graphical view]
PfamiPF01410. COLFI. 1 hit.
PF01391. Collagen. 8 hits.
[Graphical view]
ProDomiPD002078. Fib_collagen_C. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00038. COLFI. 1 hit.
[Graphical view]
PROSITEiPS51461. NC1_FIB. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: HerefordImported.
  2. Ensembl
    Submitted (JUL-2011) to UniProtKB
    Cited for: IDENTIFICATION.
    Strain: HerefordImported.

Entry informationi

Entry nameiF1MSR8_BOVIN
AccessioniPrimary (citable) accession number: F1MSR8
Entry historyi
Integrated into UniProtKB/TrEMBL: May 3, 2011
Last sequence update: November 16, 2011
Last modified: February 4, 2015
This is version 23 of the entry and version 2 of the sequence. [Complete history]
Entry statusiUnreviewed (UniProtKB/TrEMBL)

Miscellaneousi

Caution

The sequence shown here is derived from an Ensembl automatic analysis pipeline and should be considered as preliminary data.Imported

Keywords - Technical termi

Complete proteome, Reference proteomeImported

External Data

Dasty 3

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into Uniref entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.