Protein

Protein dachsous

Gene

ds

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed. Annotation score: 5 out of 5. Experimental evidence at protein level.

Function

Involved in morphogenesis. May also be involved in cell adhesion. (2 publications)

GO - Molecular function

GO - Biological process

  • cell morphogenesis involved in differentiation Source: UniProtKB
  • cell proliferation Source: UniProtKB
  • establishment of ommatidial planar polarity Source: UniProtKB
  • homophilic cell adhesion via plasma membrane adhesion molecules Source: InterPro
  • peptide cross-linking Source: UniProtKB

Keywords - Molecular function

Developmental protein

Keywords - Biological process

Cell adhesion

Keywords - Ligand

Calcium

Names & Taxonomy

Protein names
Recommended name:
Protein dachsous
Alternative name(s):
Adherin
Gene names
Name: ds
ORF Names: CG17941
Organism: Drosophila melanogaster (Fruit fly) (Imported)
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora
Proteomes
  • UP000000803 Component: Chromosome 2L

Organism-specific databases

FlyBase: FBgn0000497. ds.

Subcellular location

Topology

Feature key         Position(s)   Description    Evidence           Length
Topological domain  21 – 3045     Extracellular  Sequence analysis  3025
Transmembrane       3046 – 3066   Helical        Sequence analysis  21
Topological domain  3067 – 3503   Cytoplasmic    Sequence analysis  437
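
The topology coordinates can be turned into a simple position lookup. The following is a minimal Python sketch, not part of the UniProt entry: the `REGIONS` table copies the 1-based inclusive positions listed in this entry (signal peptide 1 – 20 from the Molecule processing table, plus the three topology rows), while `region_of` and `lengths` are hypothetical names introduced only for illustration.

```python
# Regions of Protein dachsous (Q24292), 1-based inclusive coordinates
# copied from the Topology and Molecule processing tables of this entry.
REGIONS = [
    (1, 20, "Signal peptide"),
    (21, 3045, "Extracellular"),
    (3046, 3066, "Transmembrane (helical)"),
    (3067, 3503, "Cytoplasmic"),
]


def region_of(position: int) -> str:
    """Return the annotated region containing a 1-based residue position."""
    for start, end, label in REGIONS:
        if start <= position <= end:
            return label
    raise ValueError(f"position {position} is outside 1-3503")


# An inclusive range spans end - start + 1 residues.
lengths = {label: end - start + 1 for start, end, label in REGIONS}

if __name__ == "__main__":
    # Phosphoserine positions listed under Amino acid modifications.
    for site in (236, 3465, 3469):
        print(site, region_of(site))
```

With these coordinates, Ser-236 falls in the extracellular region, while Ser-3465 and Ser-3469 fall in the cytoplasmic region, and the four region lengths add up to the full sequence length of 3,503.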

GO - Cellular component

Keywords - Cellular component

Cell membrane, Membrane

Pathology & Biotech

Mutagenesis

Feature key  Position(s)  Description                              Evidence       Length
Mutagenesis  236          S → A: Decreased phosphorylation by fj.  1 publication  1

PTM / Processing

Molecule processing

Feature key     Position(s)  Description                        Evidence           Length
Signal peptide  1 – 20       -                                  Sequence analysis  20
Chain           21 – 3503    Protein dachsous (PRO_0000004009)  -                  3483

Amino acid modifications

Feature key       Position(s)  Description           Evidence           Length
Glycosylation     220          N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     234          N-linked (GlcNAc...)  Sequence analysis  1
Modified residue  236          Phosphoserine         1 publication      1
Glycosylation     245          N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     381          N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     416          N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     564          N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     594          N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     743          N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     966          N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     991          N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1006         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1029         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1143         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1236         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1453         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1479         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1524         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1553         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1700         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1884         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     1940         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2115         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2211         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2212         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2421         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2511         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2520         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2547         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2588         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2678         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2845         N-linked (GlcNAc...)  Sequence analysis  1
Glycosylation     2967         N-linked (GlcNAc...)  Sequence analysis  1
Modified residue  3465         Phosphoserine         1 publication      1
Modified residue  3469         Phosphoserine         1 publication      1
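
The N-linked glycosylation sites in this entry are attributed to sequence analysis rather than experiment. A common heuristic for such predictions is the N-X-S/T sequon (asparagine, then any residue except proline, then serine or threonine). Whether this exact rule produced the annotations here is an assumption. The Python sketch below applies that heuristic to the first 250 residues, copied from the Sequence section of this entry. The function name `sequon_positions` is hypothetical.

```python
import re

# First 250 residues of Q24292, copied from the Sequence section of this
# entry. Spaces separate blocks of 10 residues and are removed below.
N_TERMINAL_BLOCKS = """
MLRSSLLILL AIVLLGSSQA ASHDQERERK LEVFEGVAVD YQIGYIGDFG
GIDSGPPYII VAEAGVETDL AIDRATGEIR TKVKLDRETR ASYSLVAIPL
SGRNIRVLVT VKDENDNAPT FPQTSMHIEF PENTPREVKR TLLPARDLDL
EPYNTQRYNI VSGNVNDAFR LSSHRERDGV LYLDLQISGF LDRETTPGYS
LLIEALDGGT PPLRGFMTVN ITIQDVNDNQ PIFNQSRYFA TVPENATVGT
"""
n_terminal = "".join(N_TERMINAL_BLOCKS.split())

# Assumed heuristic: Asn, then any residue except Pro, then Ser or Thr.
# The lookahead keeps each match one residue wide, so overlapping
# sequons are all reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def sequon_positions(sequence: str) -> list[int]:
    """Return 1-based positions of Asn residues that start an N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(sequence)]


if __name__ == "__main__":
    print(sequon_positions(n_terminal))  # [220, 234, 245]
```

On this fragment the heuristic returns positions 220, 234 and 245, which are the three glycosylation sites the entry lists within the first 250 residues. The serine of the second sequon (N234-Q-S236) is the annotated phosphoserine at position 236.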

Post-translational modification

Phosphorylated by fj on Ser/Thr of cadherin domains. (1 publication)

Keywords - PTM

Glycoprotein, Phosphoprotein

Proteomic databases

PaxDb: Q24292.
PRIDE: Q24292.

PTM databases

iPTMnet: Q24292.

Expression

Tissue specificity

Expressed in embryonic ectoderm. In larvae, expression is restricted to imaginal disks and brain. (1 publication)

Developmental stage

Expressed throughout embryogenesis, where it is first detected during gastrulation. Also expressed in larvae and adults. (1 publication)

Gene expression databases

Bgee: FBgn0000497.
Genevisible: Q24292. DM.

Interaction

Protein-protein interaction databases

BioGrid: 59500. 6 interactors.
DIP: DIP-49190N.
IntAct: Q24292. 4 interactors.
MINT: MINT-898703.
STRING: 7227.FBpp0077708.

Structure

3D structure databases

ProteinModelPortal: Q24292.
SMR: Q24292.

Family & Domains

Domains and Repeats

Feature key  Position(s)   Description  Evidence                    Length
Domain       22 – 121      Cadherin 1   PROSITE-ProRule annotation  100
Domain       122 – 233     Cadherin 2   PROSITE-ProRule annotation  112
Domain       234 – 340     Cadherin 3   PROSITE-ProRule annotation  107
Domain       345 – 451     Cadherin 4   PROSITE-ProRule annotation  107
Domain       452 – 558     Cadherin 5   PROSITE-ProRule annotation  107
Domain       559 – 662     Cadherin 6   PROSITE-ProRule annotation  104
Domain       663 – 774     Cadherin 7   PROSITE-ProRule annotation  112
Domain       775 – 878     Cadherin 8   PROSITE-ProRule annotation  104
Domain       879 – 983     Cadherin 9   PROSITE-ProRule annotation  105
Domain       984 – 1100    Cadherin 10  PROSITE-ProRule annotation  117
Domain       1101 – 1203   Cadherin 11  PROSITE-ProRule annotation  103
Domain       1205 – 1312   Cadherin 12  PROSITE-ProRule annotation  108
Domain       1313 – 1432   Cadherin 13  PROSITE-ProRule annotation  120
Domain       1433 – 1549   Cadherin 14  PROSITE-ProRule annotation  117
Domain       1556 – 1666   Cadherin 15  PROSITE-ProRule annotation  111
Domain       1667 – 1794   Cadherin 16  PROSITE-ProRule annotation  128
Domain       1796 – 1899   Cadherin 17  PROSITE-ProRule annotation  104
Domain       1900 – 2004   Cadherin 18  PROSITE-ProRule annotation  105
Domain       2005 – 2111   Cadherin 19  PROSITE-ProRule annotation  107
Domain       2114 – 2269   Cadherin 20  PROSITE-ProRule annotation  156
Domain       2270 – 2375   Cadherin 21  PROSITE-ProRule annotation  106
Domain       2375 – 2479   Cadherin 22  PROSITE-ProRule annotation  105
Domain       2489 – 2595   Cadherin 23  PROSITE-ProRule annotation  107
Domain       2596 – 2699   Cadherin 24  PROSITE-ProRule annotation  104
Domain       2701 – 2809   Cadherin 25  PROSITE-ProRule annotation  109
Domain       2810 – 2916   Cadherin 26  PROSITE-ProRule annotation  107
Domain       2919 – 3028   Cadherin 27  PROSITE-ProRule annotation  110
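
Because the domain coordinates are 1-based and inclusive, each length equals end minus start plus one, and the spacing between neighbouring domains follows from the same arithmetic. The Python sketch below illustrates this for five consecutive rows (Cadherin 19 to 23), copied from the table. The helper names `length` and `gaps` are hypothetical.

```python
# (name, start, end) for five consecutive rows of the domain table,
# 1-based inclusive coordinates as listed in this entry.
DOMAINS = [
    ("Cadherin 19", 2005, 2111),
    ("Cadherin 20", 2114, 2269),
    ("Cadherin 21", 2270, 2375),
    ("Cadherin 22", 2375, 2479),
    ("Cadherin 23", 2489, 2595),
]


def length(start: int, end: int) -> int:
    """Number of residues in an inclusive range."""
    return end - start + 1


def gaps(domains):
    """Residues between consecutive domains; a negative value means overlap."""
    return [
        (prev[0], nxt[0], nxt[1] - prev[2] - 1)
        for prev, nxt in zip(domains, domains[1:])
    ]


if __name__ == "__main__":
    for name, start, end in DOMAINS:
        print(name, length(start, end))
    for left, right, gap in gaps(DOMAINS):
        print(f"{left} -> {right}: {gap}")
```

For these rows the computed lengths match the Length column (107, 156, 106, 105, 107). The spacing values are 2, 0, -1 and 9, so in the coordinates as listed, Cadherin 21 and Cadherin 22 share residue 2375.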

Sequence similarities

Contains 27 cadherin domains. (PROSITE-ProRule annotation)

Keywords - Domain

Repeat, Signal, Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOG: KOG1219. Eukaryota.
ENOG410XPEI. LUCA.
InParanoid: Q24292.
KO: K16507.
OrthoDB: EOG091G0048.
PhylomeDB: Q24292.

Family and domain databases

Gene3D: 2.60.40.60. 27 hits.
InterPro: IPR002126. Cadherin.
IPR015919. Cadherin-like.
IPR020894. Cadherin_CS.
Pfam: PF00028. Cadherin. 23 hits.
PRINTS: PR00205. CADHERIN.
SMART: SM00112. CA. 27 hits.
SUPFAM: SSF49313. SSF49313. 27 hits.
PROSITE: PS00232. CADHERIN_1. 20 hits.
PS50268. CADHERIN_2. 27 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q24292-1

        10         20         30         40         50
MLRSSLLILL AIVLLGSSQA ASHDQERERK LEVFEGVAVD YQIGYIGDFG
60 70 80 90 100
GIDSGPPYII VAEAGVETDL AIDRATGEIR TKVKLDRETR ASYSLVAIPL
110 120 130 140 150
SGRNIRVLVT VKDENDNAPT FPQTSMHIEF PENTPREVKR TLLPARDLDL
160 170 180 190 200
EPYNTQRYNI VSGNVNDAFR LSSHRERDGV LYLDLQISGF LDRETTPGYS
210 220 230 240 250
LLIEALDGGT PPLRGFMTVN ITIQDVNDNQ PIFNQSRYFA TVPENATVGT
260 270 280 290 300
SVLQVYASDT DADENGLVEY AINRRQSDKE QMFRIDPRTG AIYINKALDF
310 320 330 340 350
ETKELHELVV VAKDHGEQPL ETTAFVSIRV TDVNDNQPTI NVIFLSDDAS
360 370 380 390 400
PKISESAQPG EFVARISVHD PDSKTEYANV NVTLNGGDGH FALTTRDNSI
410 420 430 440 450
YLVIVHLPLD REIVSNYTLS VVATDKGTPP LHASKSIFLR ITDVNDNPPE
460 470 480 490 500
FEQDLYHANV MEVADPGTSV LQVLAHDRDE GLNSALTYSL AETPETHAQW
510 520 530 540 550
FQIDPQTGLI TTRSHIDCET EPVPQLTVVA RDGGVPPLSS TATVLVTIHD
560 570 580 590 600
VNDNEPIFDQ SFYNVSVAEN EPVGRCILKV SASDPDCGVN AMVNYTIGEG
610 620 630 640 650
FKHLTEFEVR SASGEICIAG ELDFERRSSY EFPVLATDRG GLSTTAMIKM
660 670 680 690 700
QLTDVNDNRP VFYPREYKVS LRESPKASSQ ASSTPIVAVV ATDPDYGNFG
710 720 730 740 750
QVSYRIVAGN EAGIFRIDRS TGEIFVVRPD MLSVRTQPMH MLNISATDGG
760 770 780 790 800
NLRSNADAVV FLSIIDAMQR PPIFEKARYN YYVKEDIPRG TVVGSVIAAS
810 820 830 840 850
GDVAHRSPVR YSIYSGDPDG YFSIETNSGN IRIAKPLDHE AKSQVLLNIQ
860 870 880 890 900
ATLGEPPVYG HTQVNIEVED VNDNAPEFEA SMVRISVPES AELGAPLYAA
910 920 930 940 950
HAHDKDSGSS GQVTYSLVKE SGKGLFAIDA RSGHLILSQH LDYESSQRHT
960 970 980 990 1000
LIVTATDGGV PSLSTNLTIL VDVQDVNDNP PVFEKDEYSV NVSESRSINA
1010 1020 1030 1040 1050
QIIQVNASDL DTGNNARITY RIVDAGVDNV TNSISSSDVS QHFGIFPNSG
1060 1070 1080 1090 1100
WIYLRAPLDR ETRDRYQLTV LATDNGTPAA HAKTRVIVRV LDANDNDPKF
1110 1120 1130 1140 1150
QKSKYEFRIE ENLRRGSVVG VVTASDLDLG ENAAIRYSLL PINSSFQVHP
1160 1170 1180 1190 1200
VTGEISTREP LDRELRELYD LVVEARDQGT PVRSARVPVR IHVSDVNDNA
1210 1220 1230 1240 1250
PEIADPQEDV VSVREEQPPG TEVVRVRAVD RDHGQNASIT YSIVKGRDSD
1260 1270 1280 1290 1300
GHGLFSIDPT SGVIRTRVVL DHEERSIYRL GVAASDGGNP PRETVRMLRV
1310 1320 1330 1340 1350
EVLDLNDNRP TFTSSSLVFR VREDAALGHV VGSISPIERP ADVVRNSVEE
1360 1370 1380 1390 1400
SFEDLRVTYT LNPLTKDLIE AAFDIDRHSG NLVVARLLDR EVQSEFRLEI
1410 1420 1430 1440 1450
RALDTTASNN PQSSAITVKI EVADVNDNAP EWPQDPIDLQ VSEATPVGTI
1460 1470 1480 1490 1500
IHNFTATDAD TGTNGDLQYR LIRYFPQLNE SQEQAMSLFR MDSLTGALSL
1510 1520 1530 1540 1550
QAPLDFEAVQ EYLLIVQALD QSSNVTERLQ TSVTVRLRIL DANDHAPHFV
1560 1570 1580 1590 1600
SPNSSGGKTA SLFISDATRI GEVVAHIVAV DEDSGDNGQL TYEITGGNGE
1610 1620 1630 1640 1650
GRFRINSQTG IIELVKSLPP ATEDVEKGGR FNLIIGAKDH GQPEPKKSSL
1660 1670 1680 1690 1700
NLHLIVQGSH NNPPRFLQAV YRATILENVP SGSFVLQVTA KSLHGAENAN
1710 1720 1730 1740 1750
LSYEIPAGVA NDLFHVDWQR GIITTRGQFD RESQASYVLP VYVRDANRQS
1760 1770 1780 1790 1800
TLSSSAVRKQ RSSDSIGDTS NGQHFDVATI YITVGDVNDN SPEFRPGSCY
1810 1820 1830 1840 1850
GLSVPENSEP GVIHTVVASD LDEGPNADLI YSITGGNLGN KFSIDSSSGE
1860 1870 1880 1890 1900
LSARPLDREQ HSRYTLQIQA SDRGQPKSRQ GHCNITIFVE DQNDNAPRFK
1910 1920 1930 1940 1950
LSKYTGSVQE DAPLGTSVVQ ISAVDADLGV NARLVYSLAN ETQWQFAIDG
1960 1970 1980 1990 2000
QSGLITTVGK LDRELQASYN FMVLATDGGR YEVRSATVPV QINVLDINDN
2010 2020 2030 2040 2050
RPIFERYPYI GQVPALIQPG QTLLKVQALD ADLGANAEIV YSLNAENSAV
2060 2070 2080 2090 2100
SAKFRINPST GALSASQSLA SESGKLLHLE VVARDKGNPP QSSLGLIELL
2110 2120 2130 2140 2150
IGEAPQGTPV LRFQNETYRV MLKENSPSGT RLLQVVALRS DGRRQKVQFS
2160 2170 2180 2190 2200
FGAGNEDGIL SLDSLSGEIR VNKPHLLDYD RFSTPSMSAL SRGRALHYEE
2210 2220 2230 2240 2250
EIDESSEEDP NNSTRSQRAL TSSSFALTNS QPNEIRVVLV ARTADAPFLA
2260 2270 2280 2290 2300
SYAELVIELE DENDNSPKFS QKQFVATVSE GNNKGTFVAQ VHAFDSDAGS
2310 2320 2330 2340 2350
NARLRYHIVD GNHDNAFVIE PAFSGIVRTN IVLDREIRDI YKLKIIATDE
2360 2370 2380 2390 2400
GVPQMTGTAT IRVQIVDVND NQPTFPPNNL VTVSEATELG AVITSISAND
2410 2420 2430 2440 2450
VDTYPALTYR LGAESTVDIE NMSIFALDRY SGKLVLKRRL DYELQQEYEL
2460 2470 2480 2490 2500
DVIASDAAHE ARTVLTVRVN DENDNAPVFL AQQPPAYFAI LPAISEISES
2510 2520 2530 2540 2550
LSVDFDLLTV NATDADSEGN NSKVIYIIEP AQEGFSVHPS NGVVSVNMSR
2560 2570 2580 2590 2600
LQPAVSSSGD YFVRIIAKDA GKPALKSSTL LRVQANDNGS GRSQFLQNQY
2610 2620 2630 2640 2650
RAQISEAAPL GSVVLQLGQD ALDQSLAIIA GNEESAFELL QSKAIVLVKP
2660 2670 2680 2690 2700
LDRERNDLYK LRLVLSHPHG PPLISSLNSS SGISVIITIL DANDNFPIFD
2710 2720 2730 2740 2750
RSAKYEAEIS ELAPLRYSIA QLQAIDADQE NTPNSEVVYD ITSGNDEHMF
2760 2770 2780 2790 2800
TIDLVTGVLF VNNRLDYDSG AKSYELIIRA CDSHHQRPLC SLQPFRLELH
2810 2820 2830 2840 2850
DENDNEPKFP LTEYVHFLAE NEPVGSSVFR AHASDLDKGP FGQLNYSIGP
2860 2870 2880 2890 2900
APSDESSWKM FRVDSESGLV TSAFVFDYEQ RQRYDMELLA SDMGGKKASV
2910 2920 2930 2940 2950
AVRVEIESRD EFTPQFTERT YRFVLPAAVA LPQGYVVGQV TATDSDSGPD
2960 2970 2980 2990 3000
GRVVYQLSAP HSHFKVNRSS GAVLIKRKLK LDGDGDGNLY MDGRDISLVI
3010 3020 3030 3040 3050
SASSGRHNSL SSMAVVEIAL DPLAHPGTNL ASAGGSSSGS IGDWAIGLLV
3060 3070 3080 3090 3100
AFLLVLCAAA GIFLFIHMRS RKPRNAVKPH LATDNAGVGN TNSYVDPSAF
3110 3120 3130 3140 3150
DTIPIRGSIS GGAAGAASGQ FAPPKYDEIP PFGAHAGSSG AATTSELSGS
3160 3170 3180 3190 3200
EQSGSSGRGS AEDDGEDEEI RMINEGPLHH RNGGAGAGSD DGRISDISVQ
3210 3220 3230 3240 3250
NTQEYLARLG IVDHDPSGAG GGASSMAGSS HPMHLYHDDD ATARSDITNL
3260 3270 3280 3290 3300
IYAKLNDVTG AGSEIGSSAD DAGTTAGSIG TIGTAITHGH GVMSSYGEVP
3310 3320 3330 3340 3350
VPVPVVVGGS NVGGSLSSIV HSEEELTGSY NWDYLLDWGP QYQPLAHVFS
3360 3370 3380 3390 3400
EIARLKDDTL SEHSGSGASS SAKSKHSSSH SSAGAGSVVL KPPPSAPPTH
3410 3420 3430 3440 3450
IPPPLLTNVA PRAINLPMRL PPHLSLAPAH LPRSPIGHEA SGSFSTSSAM
3460 3470 3480 3490 3500
SPSFSPSLSP LATRSPSISP LGAGPPTHLP HVSLPRHGHA PQPSQRGNVG

TRM
Length: 3,503
Mass (Da): 379,780
Last modified: October 1, 2002 - v3
Checksum: 975B09F059F7EEF5
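
The sequence above is shown in a display layout: a ruler of positions on one line, then five blocks of ten residues on the next. To check a value such as Length, the ruler lines and spaces have to be stripped to recover a contiguous string. The Python sketch below shows one way to do this. The function name `parse_display_sequence` is hypothetical, and the sample holds only the first two sequence rows of this entry.

```python
def parse_display_sequence(text: str) -> str:
    """Join residue blocks, skipping ruler lines that contain only digits."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue  # blank line or position ruler
        residues.extend(tokens)
    return "".join(residues)


# First two rows of the display layout, copied from this entry.
SAMPLE = """
        10         20         30         40         50
MLRSSLLILL AIVLLGSSQA ASHDQERERK LEVFEGVAVD YQIGYIGDFG
60 70 80 90 100
GIDSGPPYII VAEAGVETDL AIDRATGEIR TKVKLDRETR ASYSLVAIPL
"""

sequence = parse_display_sequence(SAMPLE)

if __name__ == "__main__":
    print(len(sequence))
    print(sequence[:20])  # residues 1-20, annotated as the signal peptide
```

Applied to the full display block, the same function should return a string whose length can be compared with the stated Length of 3,503, provided the text was captured intact.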

Sequence caution

The sequence AAF51468 differs from that shown. Reason: Erroneous initiation. (Curated)

Experimental Info

Feature key        Position(s)  Description                               Evidence  Length
Sequence conflict  1070         V → I in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  1490         R → S in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  1636         G → S in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  1692         S → P in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  1804         V → I in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  2029         L → I in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  2210         P → A in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  2289         A → S in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  2536         S → T in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  2862         R → Q in AAA79329 (PubMed:7601355).       Curated   1
Sequence conflict  3038         S → G in AAA79329 (PubMed:7601355).       Curated   1

Sequence databases

EMBL / GenBank / DDBJ:
L08811 mRNA. Translation: AAA79329.2.
AE014134 Genomic DNA. Translation: AAF51468.3. Different initiation.
RefSeq: NP_001285551.1. NM_001298622.1.
NP_523446.2. NM_078722.3.

Genome annotation databases

GeneID: 33245.
KEGG: dme:Dmel_CG17941.
UCSC: CG17941-RA. d. melanogaster.

Cross-references

Organism-specific databases

CTD: 109661.
FlyBase: FBgn0000497. ds.

Miscellaneous databases

GenomeRNAi: 33245.
PRO: Q24292.

Entry information

Entry name: DS_DROME
Accession: Primary (citable) accession number: Q24292
Secondary accession number(s): Q9VPS4
Entry history
Integrated into UniProtKB/Swiss-Prot: November 15, 2002
Last sequence update: October 1, 2002
Last modified: November 30, 2016
This is version 136 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.