Protein
Submitted name:

Uncharacterized protein

Gene

20210948

Organism
Helobdella robusta (Californian leech)
Status
Unreviewed - Annotation score: - Protein predicted

Function

GO - Molecular function

  • calcium ion binding Source: InterPro
  • extracellular matrix structural constituent Source: GO_Central

GO - Biological process

Names & Taxonomy

Protein names
Submitted name: Uncharacterized protein (Imported)
Gene names
Name: 20210948 (Imported)
ORF Names: HELRODRAFT_188573 (Imported)
Organism: Helobdella robusta (Californian leech) (Imported)
Taxonomic identifier: 6412 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Lophotrochozoa > Annelida > Clitellata > Hirudinea > Hirudinida > Glossiphoniiformes > Glossiphoniidae > Helobdella
Proteomes
  • UP000015101 Component: Unassembled WGS sequence

Subcellular location

Topology

Feature key      Position(s)    Description    Evidence             Length
Transmembrane    2571 – 2593    Helical        Sequence analysis    23

Keywords - Cellular component

Membrane

PTM / Processing

Keywords - PTM

Disulfide bond (SAAS annotation)

Family & Domains

Domains and Repeats

Feature key    Position(s)    Description    Evidence               Length
Domain         2357 – 2403    EGF-like       InterPro annotation    47
Domain         2404 – 2444    EGF_CA         InterPro annotation    41
Domain         2407 – 2444    EGF-like       InterPro annotation    38
Domain         2460 – 2499    EGF-like       InterPro annotation    40
Domain         2500 – 2527    EGF_CA         InterPro annotation    28
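
The Length value of each feature follows from its residue positions, which are inclusive at both ends: length = end - start + 1. The sketch below recomputes the lengths for the transmembrane and domain features listed in this entry as a consistency check. The `Feature` tuple and the `feature_length` helper are hypothetical names introduced for illustration; they are not part of any UniProt API.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    key: str
    start: int        # first residue, 1-based, inclusive
    end: int          # last residue, inclusive
    description: str
    length: int       # length as reported in the entry


def feature_length(start: int, end: int) -> int:
    """Length of a feature whose start and end positions are both inclusive."""
    return end - start + 1


# Values copied from the Topology and Domains and Repeats tables of this entry.
FEATURES = [
    Feature("Transmembrane", 2571, 2593, "Helical", 23),
    Feature("Domain", 2357, 2403, "EGF-like", 47),
    Feature("Domain", 2404, 2444, "EGF_CA", 41),
    Feature("Domain", 2407, 2444, "EGF-like", 38),
    Feature("Domain", 2460, 2499, "EGF-like", 40),
    Feature("Domain", 2500, 2527, "EGF_CA", 28),
]

# Collect any feature whose reported length disagrees with its positions.
mismatches = [f for f in FEATURES if feature_length(f.start, f.end) != f.length]
print(mismatches)
```

An empty list means every reported length agrees with its position range.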

Keywords - Domain

EGF-like domain (SAAS annotation), Transmembrane, Transmembrane helix (Sequence analysis)

Phylogenomic databases

OMA: NGNTGFT
OrthoDB: EOG091F06QF

Family and domain databases

InterPro:
IPR008160 Collagen
IPR001881 EGF-like_Ca-bd_dom
IPR013032 EGF-like_CS
IPR000742 EGF-like_dom
IPR000152 EGF-type_Asp/Asn_hydroxyl_site
IPR018097 EGF_Ca-bd_CS
Pfam:
PF01391 Collagen, 13 hits
PF07645 EGF_CA, 1 hit
SMART:
SM00181 EGF, 3 hits
SM00179 EGF_CA, 2 hits
PROSITE:
PS00010 ASX_HYDROXYL, 1 hit
PS01186 EGF_2, 1 hit
PS01187 EGF_CA, 1 hit

Sequence

Sequence status: Complete.

T1FQ51-1 [UniParc]
        10         20         30         40         50
MAMNYLKSFF KLKILFEFAI KYFKGWTGST GSTGSTGITG FTGSTGPKGF
60 70 80 90 100
IGPQGNIGPM GLTGFQGPTG ATGFTGDTGQ PGTVGFTGPM GPQGIPGNTG
110 120 130 140 150
SPGATGTNGA QGSTGLRGDT GGTGNTGVTG EIGNTGLIGS TGFLGLIGNT
160 170 180 190 200
GRPGQVGQPG VAGRGGFTGE TGATGPQGPI GVTGSMGIQG ATGLPGNVAS
210 220 230 240 250
SGAPGVRGQS GQQGQQGSTG QSGTPGTNGA TGFTGYTGFT GATGFKGQTG
260 270 280 290 300
FTGNTGSTGQ QGVIGTQGAF GATGNTGSIG INGPIGPSET IAFTNKNITQ
310 320 330 340 350
SNIKYLFRPT QTTSHSIFFT SLYTRLIDFF RKPVISLSKG DIGSPGSIGS
360 370 380 390 400
TGDIGNPGVP GNTGSPGSPG NTGSTGTTGN TGYTGYVGFT GFTGTSGSQG
410 420 430 440 450
FTGRQGDTGF TGITGPQGST GFVGFSGQPG VQGQPGLVGT TGNTGLQGST
460 470 480 490 500
GNTGAPGNTG LSGNPGENGD PGLSGLPGIQ GSTGMMGVSG STGNVGNTGF
510 520 530 540 550
TGFTGSSGST GQTGMIGFSG QPGFSGSVGA TGVKGYTGNT GITGSTGNVG
560 570 580 590 600
DNGIQGPQGP IGFPGQNGLP GFSGSSGIKG STGASGSVGQ IGPNGIQGAT
610 620 630 640 650
GFTGNTGATG MKGDTGFAGQ PGILGLQGAT GNTGATGDPG GAGARGEQGG
660 670 680 690 700
TGQPGQKGAT GITGPFGPSG APGQRGPNGD PGFSGNTGPT GTTGVKGSKG
710 720 730 740 750
QTGPPGFIGP IGDTGSTGQT GSMGTQGNPG TIGNTGSSGS PGNQGAPGIV
760 770 780 790 800
GSTGSTGSSG FQGFSGSKGE PGLIGSSGPA GNKGSTGFSG LIGFTGATGS
810 820 830 840 850
TGLIGFSGSK GVNGSPGNTG ASGTIGQTGQ PGILGQQGAT GFTGNTGFTG
860 870 880 890 900
RSGDTGNSGV IGPDGLPGLP GIEGPRGSSG NFGQTGPFGQ QGFTGLIGSP
910 920 930 940 950
GIKGATGSAG TGGSQGFTGA TGFTGLQGDT GQNGLPGLLG FSGITGSTGS
960 970 980 990 1000
KGFPGQIGPT GSPGSIGTKG QTGFQGSIGE TGNSGSSGNP GQVGVPGAQG
1010 1020 1030 1040 1050
STGLPGIIGS PGNIGVTGFT GVQGSRGTQG YTGSTGSSGL QGDTGVMGST
1060 1070 1080 1090 1100
GFTGFPGLPG AQGVTGQMGA QGNTGEQGDP GNFGAKGQPG NTGSPGIFGF
1110 1120 1130 1140 1150
SGSTGLTGAT GPQGEQGFTG PIGSIGSPGQ TGLSGQQGTQ GPKGITGDTG
1160 1170 1180 1190 1200
QSGFVGMQGF TGGTGNTGNS GSIGFQGPKG QLGWTGERGD TGATGATGFS
1210 1220 1230 1240 1250
GTVGTQGLPG VMGYSGATGS TGVVGDTGFT GPSGVAGQSG ASGTMGSTGF
1260 1270 1280 1290 1300
NGLPGFIGYT GSTGASGVTG ATGTTGPKGS KGGTGLTGPT GMTGQTGLSG
1310 1320 1330 1340 1350
PSGSVGLPGQ PGLGGSPGSV GTVGASGPRG PTGLNGNTGF TGFSGITGPT
1360 1370 1380 1390 1400
GPNGSPGNLG ATGPPGSTGQ VGATGNSGSF GLQGFTGPRG LDGDSGFRGP
1410 1420 1430 1440 1450
TGTGGATGRF GDTGSSGNVG DTGSQGAPGW TGSSGNFGFT GSSGVKGQKG
1460 1470 1480 1490 1500
STGPSGPTGS SGNQGSVGNI GQDGVPGLPG PKGSQGPTGP IGNVGFPGTL
1510 1520 1530 1540 1550
GFTGASGNTG SPGPIGFTGS TGSSGQSGII GSMGLTGDDG NVGKFGPTGN
1560 1570 1580 1590 1600
TGESGSPGSP GIIGATGVQG PRGQPGANGQ PGASGAQGPA GVIGVMGSYG
1610 1620 1630 1640 1650
DTGSSGEQGI PGSIGFTGAN GFTGITGQPG LLGSKGYPGY TGNTGSPGFT
1660 1670 1680 1690 1700
GFQGDVGFPG IKGQPGFDGI IGPKGQNGNP GPSGPFGLAG LPGPTGITGF
1710 1720 1730 1740 1750
TGLVGNTGNT GITGPSGDTG NSGTVGLQGF TGATGPLGAT GPSGRPGFGA
1760 1770 1780 1790 1800
TPGRTGMIGA TGQTGYTGNT GAIGYTGQKG KDGAPGLLGN TGSFGAQGDT
1810 1820 1830 1840 1850
GFEGDVGPQG ASGQSGQPGI QGGPGQMGAT GFTGNSGAIG APGSMGATGF
1860 1870 1880 1890 1900
TGSNGNTGFT GYTGQLGPVG TNGKPGPTGN QGNDGSRGQQ GFTGPNGLPG
1910 1920 1930 1940 1950
ISGSSGVKGD IGAPGSQGSD GPPGDNGPQG LQGITGNSGL PGLPGSKGSL
1960 1970 1980 1990 2000
GPSGYTGPTG SIGFGGPPGD AGNTGFTGAT GYTGFTGARG LSGSDGSPGL
2010 2020 2030 2040 2050
AGDAGPQGAT GAIGNAGSTG FTGFIGDPGL RGPNGFKGNT GSSGLVGFTG
2060 2070 2080 2090 2100
YPGVPGTGGF TGFTGGTGFT GGIGFTGVTG NTGYTGQIGA TGPKGNNGLS
2110 2120 2130 2140 2150
GQIGPTGSTG NTGLKGLPGS PGFDGPRGIT GASGSVGAPG FNGVDGSNGA
2160 2170 2180 2190 2200
VGASGLQGAT GPNGMVGPVG STGKVGNVGN TGQIGAVGGK GPIGSQGQSG
2210 2220 2230 2240 2250
IPGVQGAAGT PGDTGTSGFV GMTGSTGVGG KTGDAGNTGS TGFLGPVGDS
2260 2270 2280 2290 2300
GIIGQMGNSG SQGLPGLSGP VGPTGQTGLA GQVGNTGYTG ATGLPGRTGV
2310 2320 2330 2340 2350
FGDTGNIGYT GATGSTGKPG FVGASGVPGA TGSTGSTGGP GMDSTNKLLN
2360 2370 2380 2390 2400
KDAVKIWCDQ LNCDQFCEVA QLSYPNGSIS YQPYCVCTVS YMKIRVAGGF
2410 2420 2430 2440 2450
KCTTLNECNY NNGFCEQICV DRMDGYQCSC RPGFTLGANK KSCTSTGGSS
2460 2470 2480 2490 2500
TVLINYTPIH CENLNGGCSD YCIKGTNGAA DVCACPKGYA LSMNNRDCID
2510 2520 2530 2540 2550
VDECALNQSI CPARLTCINS LGGYICVTAT LEQVVTQQQL QNMNDVLDKL
2560 2570 2580 2590 2600
PAINESMTSA KLNTQQLSSS YWGLIAFNTF VTIAITALIA LACKKWKQLS
2610 2620
NEENDYLNNF DSLSRTQSIG NIK
Length: 2,623
Mass (Da): 245,368
Last modified: October 16, 2013 - v1
Checksum: 329FEDCE181E1931
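
The reported length can be cross-checked against the sequence listing, which interleaves position rulers with rows of residues in space-separated groups of ten. The sketch below strips rulers and spaces and counts residues. For brevity it embeds only the first and last rows of the listing and assumes, rather than verifies, that each of the 52 full rows carries 50 residues. The `clean_sequence` helper is a hypothetical name, not a UniProt tool.

```python
import re


def clean_sequence(block: str) -> str:
    """Return residues only: skip ruler lines (digits and spaces) and drop whitespace."""
    residues = []
    for line in block.splitlines():
        stripped = line.strip()
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue  # blank line or position ruler
        residues.append(re.sub(r"\s+", "", stripped))
    return "".join(residues)


# First and last rows of the listing, each with its ruler line.
first_row = """\
        10         20         30         40         50
MAMNYLKSFF KLKILFEFAI KYFKGWTGST GSTGSTGITG FTGSTGPKGF
"""
last_row = """\
2610 2620
NEENDYLNNF DSLSRTQSIG NIK
"""

FULL_ROWS = 52        # rows ending at positions 50, 100, ..., 2600
RESIDUES_PER_ROW = 50  # assumption: every full row is complete

first_len = len(clean_sequence(first_row))
last_len = len(clean_sequence(last_row))
total = FULL_ROWS * RESIDUES_PER_ROW + last_len
print(first_len, last_len, total)
```

To check the whole record rather than the row arithmetic, pass the complete listing to `clean_sequence` and take the length of the result.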

Sequence databases

EMBL / GenBank / DDBJ:
AMQM01000795 Genomic DNA. No translation available.
KB096742 Genomic DNA. Translation: ESO02097.1
RefSeq: XP_009019505.1, XM_009021257.1

Genome annotation databases

EnsemblMetazoa: HelroT188573; HelroP188573; HelroG188573
GeneID: 20210948
KEGG: hro:HELRODRAFT_188573

Cross-references

Organism-specific databases

CTD: 20210948

Entry information

Entry name: T1FQ51_HELRO
Accession: Primary (citable) accession number: T1FQ51
Entry history: Integrated into UniProtKB/TrEMBL: October 16, 2013
Last sequence update: October 16, 2013
Last modified: November 7, 2018
This is version 34 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome (Imported)