UniProtKB - H9G925 (H9G925_ANOCA)
Protein
Fibrillar collagen NC1 domain-containing protein
Gene
N/A
Organism
Anolis carolinensis (Green anole) (American chameleon)
Status
Functioni
GO - Molecular functioni
- extracellular matrix structural constituent Source: GO_Central
GO - Biological processi
- extracellular matrix organization Source: GO_Central
Keywordsi
Molecular function | ToxinARBA annotation |
Names & Taxonomyi
Protein namesi | Recommended name: Fibrillar collagen NC1 domain-containing proteinInterPro annotation |
Organismi | Anolis carolinensis (Green anole) (American chameleon)Imported |
Taxonomic identifieri | 28377 [NCBI] |
Taxonomic lineagei | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Lepidosauria › Squamata › Bifurcata › Unidentata › Episquamata › Toxicofera › Iguania › Dactyloidae › Anolis |
Proteomesi |
|
Subcellular locationi
Extracellular region or secreted
- extracellular matrix Source: GO_Central
- extracellular space Source: GO_Central
Keywords - Cellular componenti
Extracellular matrixARBA annotation, SecretedExpressioni
Gene expression databases
Bgeei | ENSACAG00000004055, Expressed in skeletal muscle tissue and 10 other tissues |
Family & Domainsi
Domains and Repeats
Feature key | Position(s) | DescriptionActions | Graphical view | Length |
---|---|---|---|---|
Domaini | 829 – 1058 | Fibrillar collagen NC1InterPro annotationAdd BLAST | 230 |
Region
Feature key | Position(s) | DescriptionActions | Graphical view | Length |
---|---|---|---|---|
Regioni | 1 – 819 | DisorderedSequence analysisAdd BLAST | 819 |
Compositional bias
Feature key | Position(s) | DescriptionActions | Graphical view | Length |
---|---|---|---|---|
Compositional biasi | 192 – 211 | Pro-richSequence analysisAdd BLAST | 20 | |
Compositional biasi | 491 – 505 | Pro-richSequence analysisAdd BLAST | 15 | |
Compositional biasi | 615 – 631 | Pro-richSequence analysisAdd BLAST | 17 | |
Compositional biasi | 758 – 773 | Pro-richSequence analysisAdd BLAST | 16 |
Phylogenomic databases
eggNOGi | KOG3544, Eukaryota |
GeneTreei | ENSGT00940000154535 |
HOGENOMi | CLU_001074_2_3_1 |
InParanoidi | H9G925 |
OMAi | CINTIAW |
TreeFami | TF344135 |
Family and domain databases
InterProi | View protein in InterPro IPR008160, Collagen IPR000885, Fib_collagen_C |
Pfami | View protein in Pfam PF01410, COLFI, 1 hit PF01391, Collagen, 5 hits |
SMARTi | View protein in SMART SM00038, COLFI, 1 hit |
PROSITEi | View protein in PROSITE PS51461, NC1_FIB, 1 hit |
i Sequence
Sequence statusi: Complete.
H9G925-1 [UniParc]FASTAAdd to basket
10 20 30 40 50
GHPGREGPAG EKGVQGPSGA AGPIGYPGPR GVKGAFGVRG LKGSKGEKGE
60 70 80 90 100
DGFPGFKGDM GPKGDRGDAG PLGVRGEDGP EGLKGQAGPL GEAGSPGLAG
110 120 130 140 150
EKGKLGVPGL PGYPGRQGPK GSTGFPGPLG LIGEKGKRGK AGQAGQTGQR
160 170 180 190 200
GPSGPPGERG LPGSTGKPGP KGDSGHDGAP GIGGKGKGPP GPQGPNGFPG
210 220 230 240 250
PKGPPGPAGK DGLPGHPGQR GEPGFHGKTG PPGPIGVVGP QGNSGETGPT
260 270 280 290 300
GERGHPGSPG PPGEHGLPGA AGREGAKGDP GPSGLPGKDG PPGMKGFTGA
310 320 330 340 350
RGPAGEAGPT GLKGAEGPLG PQGSIGPPGE RGPSGAAGGI GLPGRGGSQG
360 370 380 390 400
PPGPAGEKGA PGERGPIGPA GHDGVLGPVG LPGPPGPPGL SGEDGDKGEM
410 420 430 440 450
GGPGQKGSKG DKGDSGPPGP TGIQGPIGHP GPAGADGESG RRGQQGMYGQ
460 470 480 490 500
KGDEGSRGFP GVSGPPGLQG MPGLPGEKGE SGDVGPMGPP GAHGPRGPQG
510 520 530 540 550
PSGGDGPPGQ PGGVGQPGPV GEKGEAGEAG DPGPSGEPGR TGGKGDTGEK
560 570 580 590 600
GDSGQSGAAG PPGKKGPPGE DGAKGNLGPI GFPGDPGPSG EAGPPGIDGV
610 620 630 640 650
PGEKGDMGDP GLPGPPGASG EPGPPGSPGK RGPLGPAGRE GRQGEKGAKG
660 670 680 690 700
EPGTDGPPGK MGPVGPQGPP GRPGSEGLRG IPGPAGEQGL LGAPGQAGPP
710 720 730 740 750
GPMGPTGLPG LKGDPGYKGE KGHAGLIGLI GPPGEMGEKG DQGLPGNQGT
760 770 780 790 800
PGPKGDPGVL GPLGPPGPPG SPGLLGPQGQ KGNKGAPGPI GLKGDLGPAG
810 820 830 840 850
SPGPPGRPAQ LQDPMPLESN VDPDADGLAE VLAVLSSLRD EVEQMRCPLG
860 870 880 890 900
TPESPARVCK ELQFCHPHLT DGEYWIDPNQ GCSRDAFRVF CNFTAGGETC
910 920 930 940 950
VFPDKKFETV RLAAWNKEQP GNWYSTFKRG KKFSYIDADG NPIQVTQLTF
960 970 980 990 1000
LRLLSAIAHQ NFTLLCQNAA AWYEADTSSH NRALRFLAFN GVELMHNSTE
1010 1020 1030 1040 1050
APIRALYDGC QVRKGQERTV LQVLTPRVEQ LPLADVAVQD FGDTNQKFGF
ELGPVCFGG
Sequence databases
Select the link destinations: EMBLi GenBanki DDBJi Links Updated | AAWZ02011954 Genomic DNA No translation available. |
Genome annotation databases
Ensembli | ENSACAT00000004446; ENSACAP00000004346; ENSACAG00000004055 |
Similar proteinsi
Cross-referencesi
Sequence databases
Select the link destinations: EMBLi GenBanki DDBJi Links Updated | AAWZ02011954 Genomic DNA No translation available. |
3D structure databases
ModBasei | Search... |
SWISS-MODEL-Workspacei | Submit a new modelling project... |
Protein-protein interaction databases
STRINGi | 28377.ENSACAP00000004346 |
Genome annotation databases
Ensembli | ENSACAT00000004446; ENSACAP00000004346; ENSACAG00000004055 |
Phylogenomic databases
eggNOGi | KOG3544, Eukaryota |
GeneTreei | ENSGT00940000154535 |
HOGENOMi | CLU_001074_2_3_1 |
InParanoidi | H9G925 |
OMAi | CINTIAW |
TreeFami | TF344135 |
Gene expression databases
Bgeei | ENSACAG00000004055, Expressed in skeletal muscle tissue and 10 other tissues |
Family and domain databases
InterProi | View protein in InterPro IPR008160, Collagen IPR000885, Fib_collagen_C |
Pfami | View protein in Pfam PF01410, COLFI, 1 hit PF01391, Collagen, 5 hits |
SMARTi | View protein in SMART SM00038, COLFI, 1 hit |
PROSITEi | View protein in PROSITE PS51461, NC1_FIB, 1 hit |
ProtoNeti | Search... |
MobiDBi | Search... |
Entry informationi
Entry namei | H9G925_ANOCA | |
Accessioni | H9G925Primary (citable) accession number: H9G925 | |
Entry historyi | Integrated into UniProtKB/TrEMBL: | May 16, 2012 |
Last sequence update: | May 16, 2012 | |
Last modified: | February 10, 2021 | |
This is version 57 of the entry and version 1 of the sequence. See complete history. | ||
Entry statusi | Unreviewed (UniProtKB/TrEMBL) |