Protein

Probable transcription-associated protein 1

Gene

tra1

Organism
Dictyostelium discoideum (Slime mold)
Status
Reviewed - Annotation score: 3 out of 5 - Protein inferred from homology

Function

GO - Molecular function

  • ATP binding (Source: dictyBase)
  • histone acetyltransferase activity (Source: dictyBase)
  • protein kinase activity (Source: dictyBase)
  • transcription cofactor activity (Source: dictyBase)

GO - Biological process

  • histone acetylation (Source: dictyBase)
  • protein phosphorylation (Source: dictyBase)
  • regulation of nucleic acid-templated transcription (Source: GOC)

Names & Taxonomy

Protein names
Recommended name:
Probable transcription-associated protein 1
Gene names
Name: tra1
ORF Names: DDB_G0281947
Organism: Dictyostelium discoideum (Slime mold)
Taxonomic identifier: 44689 [NCBI]
Taxonomic lineage: Eukaryota > Amoebozoa > Mycetozoa > Dictyosteliida > Dictyostelium
Proteomes
  • UP000002195 Components: Chromosome 3, Unassembled WGS sequence

Organism-specific databases

dictyBase: DDB_G0281947. tra1.

Subcellular location

GO - Cellular component

  • NuA4 histone acetyltransferase complex (Source: dictyBase)
  • nucleus (Source: dictyBase)

PTM / Processing

Molecule processing

Feature key   Position(s)   Length   Description                                   Feature identifier
Chain         1 – 4582      4582     Probable transcription-associated protein 1   PRO_0000376004

Proteomic databases

PaxDb: Q54T85.
PRIDE: Q54T85.

Interaction

Protein-protein interaction databases

STRING: 44689.DDB0229338.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Length   Description
Domain        3185 – 3815   631      FAT (PROSITE-ProRule annotation)
Domain        4538 – 4582   45       FATC (PROSITE-ProRule annotation)
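
The Length column in these feature tables is consistent with positions that are 1-based and inclusive at both ends. A minimal sketch of that arithmetic follows; the helper names `parse_range` and `feature_length` are hypothetical and introduced only for illustration.

```python
import re

def parse_range(text: str) -> tuple[int, int]:
    """Parse a position range such as '3185 – 3815' into (start, end)."""
    start, end = (int(n) for n in re.findall(r"\d+", text))
    return start, end

def feature_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both 1-based and inclusive."""
    return end - start + 1

# Ranges copied from the Domains and Repeats table.
domains = {"FAT": "3185 – 3815", "FATC": "4538 – 4582"}
lengths = {name: feature_length(*parse_range(r)) for name, r in domains.items()}
print(lengths)  # {'FAT': 631, 'FATC': 45}
```

The same check applied to the chain (1 to 4582) gives 4582, matching the sequence length reported for this entry.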

Coiled coil

Feature key   Position(s)   Length   Description
Coiled coil   2944 – 2991   48       (Sequence analysis)

Compositional bias

Feature key          Position(s)   Length   Description
Compositional bias   43 – 50       8        Poly-Ser
Compositional bias   67 – 70       4        Poly-Asn
Compositional bias   209 – 217     9        Poly-Ser
Compositional bias   218 – 221     4        Poly-Thr
Compositional bias   226 – 232     7        Poly-Thr
Compositional bias   271 – 279     9        Poly-Thr
Compositional bias   280 – 283     4        Poly-Ala
Compositional bias   634 – 641     8        Poly-Thr
Compositional bias   804 – 810     7        Poly-Gly
Compositional bias   858 – 861     4        Poly-Leu
Compositional bias   939 – 942     4        Poly-Ser
Compositional bias   1030 – 1037   8        Poly-Asn
Compositional bias   1132 – 1138   7        Poly-Asn
Compositional bias   1151 – 1194   44       Poly-Asn
Compositional bias   1496 – 1504   9        Poly-Thr
Compositional bias   1554 – 1561   8        Poly-Thr
Compositional bias   1564 – 1567   4        Poly-Ser
Compositional bias   2345 – 2348   4        Poly-Ser
Compositional bias   2351 – 2357   7        Poly-Thr
Compositional bias   2536 – 2540   5        Poly-Thr
Compositional bias   2545 – 2548   4        Poly-Thr
Compositional bias   2558 – 2562   5        Poly-Pro
Compositional bias   2756 – 2762   7        Poly-Thr
Compositional bias   2765 – 2771   7        Poly-Thr
Compositional bias   2774 – 2777   4        Poly-Thr
Compositional bias   2780 – 2783   4        Poly-Thr
Compositional bias   2786 – 2793   8        Poly-Thr
Compositional bias   2948 – 2985   38       Poly-Gln
Compositional bias   3114 – 3119   6        Poly-Thr
Compositional bias   3489 – 3498   10       Poly-Thr
Compositional bias   3503 – 3509   7        Poly-Thr
Compositional bias   3661 – 3664   4        Poly-Ser
Compositional bias   3720 – 3730   11       Poly-Gln
Compositional bias   3829 – 3859   31       Poly-Thr
Compositional bias   3907 – 3910   4        Poly-Thr
Compositional bias   3918 – 3921   4        Poly-Pro
Compositional bias   4067 – 4073   7        Poly-Gln
Compositional bias   4201 – 4206   6        Poly-Asn
Compositional bias   4320 – 4333   14       Poly-Ser
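
Many of the shorter regions listed above coincide with uninterrupted runs of a single residue. As a rough illustration, the sketch below finds strict homopolymer runs in a sequence string. It is not the method used to produce the annotation: the function name `homopolymer_runs` and the default threshold of 4 are assumptions, and some annotated regions (for example 1151 – 1194, Poly-Asn) contain other residues, so a strict-run search will not reproduce every row.

```python
from itertools import groupby

def homopolymer_runs(seq: str, min_len: int = 4):
    """Yield (start, end, residue) for runs of one residue, 1-based inclusive."""
    pos = 1
    for residue, group in groupby(seq):
        n = len(list(group))
        if n >= min_len:
            yield pos, pos + n - 1, residue
        pos += n

# First 50 residues of the sequence in this entry.
first50 = "MSTNPPQPPPSSTIASNPPQPIATPMSTTNPSQPTITSSSAASSSSSSSS"
runs = list(homopolymer_runs(first50))
print(runs)  # [(43, 50, 'S')]
```

The single run reported for the first 50 residues corresponds to the 43 – 50 Poly-Ser row.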

Sequence similarities

Belongs to the PI3/PI4-kinase family. TRA1 subfamily. (Curated)
Contains 1 FAT domain. (PROSITE-ProRule annotation)
Contains 1 FATC domain. (PROSITE-ProRule annotation)

Keywords - Domain

Coiled coil

Phylogenomic databases

eggNOG: KOG0889. Eukaryota.
        COG5032. LUCA.
InParanoid: Q54T85.
KO: K08874.
OMA: ICASIID.
PhylomeDB: Q54T85.

Family and domain databases

InterPro: IPR016024. ARM-type_fold.
          IPR003152. FATC_dom.
          IPR011009. Kinase-like_dom.
          IPR003151. PIK-rel_kinase_FAT.
          IPR014009. PIK_FAT.
          IPR033317. TRA1/TRRAP.
PANTHER: PTHR11139:SF1. PTHR11139:SF1. 15 hits.
Pfam: PF02259. FAT. 1 hit.
SUPFAM: SSF48371. SSF48371. 12 hits.
        SSF56112. SSF56112. 2 hits.
PROSITE: PS51189. FAT. 1 hit.
         PS51190. FATC. 1 hit.

Sequence

Sequence status: Complete.

Q54T85-1

        10         20         30         40         50
MSTNPPQPPP SSTIASNPPQ PIATPMSTTN PSQPTITSSS AASSSSSSSS
60 70 80 90 100
GNPVNFESYA RRCFELNNNN EQTQLLALVT EIRDNIELVH TVEYPTFLNF
110 120 130 140 150
LFPVFYNILR QGAVQFNDGP EQKIRNTILD ILNKLPNNEL LRPHILVLLQ
160 170 180 190 200
LSMYLLEVDN EENALVCLRI IIELHKNYRN ALESEIQPFL NIVLKLYTDL
210 220 230 240 250
PSTIEKTFSS SSSASLSTTT TAISPTTTTT TTPATATTPA TTTATGNTIT
260 270 280 290 300
TPPPATPPST TATAISPTSS TTTTTTATTA AAATIATTTA TTTITPPLPP
310 320 330 340 350
YMIKSIESFK ILTECPIVVI LLFQLYNSYM SSNVPKFIPL IIETLSLQAP
360 370 380 390 400
ANSTVTHHSQ YVDFIAAQVK TLYLLAYVLK WHIEQIKQYS DRFPRSVIQL
410 420 430 440 450
LQNCPAHSSA IRKELLVTLR HILSSDFKSK FIVYLDLLLD EKIILGTSRT
460 470 480 490 500
SYESLRSMAY GSLADFIHNM RNELNINQIS KVVAIYSRHL HDQTNPVSIQ
510 520 530 540 550
IMSVKLIISL MDVIQRKQDP PEYKSRSIIY KVIESFINKF SSLKRSIPKL
560 570 580 590 600
LADQQKEKEK ELKDPQSLKD KLDGLSSANT TTSSTGEIII LDPVKDTRTL
610 620 630 640 650
IKTMTSSLRN IFWSLSACPI NKPGTGITTG AGATTTTTTN TNNTIIPPVR
660 670 680 690 700
IALPSIEESL LFIKLFKSTV KCFPIYGGCN PSPQEEKEMI ENFTASFMML
710 720 730 740 750
DQRTFQEVST FILPFLYQRS LNNPSLLLIP QGFLSVTQMN PTGVQINRVF
760 770 780 790 800
LEVLTPFLYE KIRNLQPTDK PDICMIKLIK LIFNAIQPNN NSGVGGSGGS
810 820 830 840 850
NSSGGGGGGG SNSSNNSTNS NTTTNIDSTC VQQVLSSMIL ILLKLITESK
860 870 880 890 900
QIDSIQYLLL LKTIFKSCTR PDQSKEITLL FPIILETLND LLLSSSHSTM
910 920 930 940 950
IPAVQQLLIE LSLSIPVQIA TLLPSLHLLV KPLMLALDSS SSELLSTTFR
960 970 980 990 1000
ILELIVDNAT GDFLLFTFRD NKSEFLQILS KHLRPAPYFY GPHAIRILGK
1010 1020 1030 1040 1050
MAGKSRSFSV LSPILSIDST SNSRSIPSSN KNNNNNNYYY NGSCSNSENY
1060 1070 1080 1090 1100
SKVFKLLLPC ETGDDKTKSI PLDKSIQSIK NILLYQLDDS YLQSNAYSLL
1110 1120 1130 1140 1150
KYYISLYLSS QDFLINQQSL LNELLNNLKQ SNNNNNNNSS TVNLNIIELD
1160 1170 1180 1190 1200
NENENENDNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNIKTFKT
1210 1220 1230 1240 1250
KEEYLNEIKN FKDLVYCLFL SITNDHLKEK FDSLKFLNNF IYHFVLYLST
1260 1270 1280 1290 1300
FKFNYSIISM KELDPKIFLE ALVDVMSMSS HNIINQSNIE DLQTISTSKF
1310 1320 1330 1340 1350
NKAHITSLLD MIFNCSNQIF SENSNSKKNN ELMTSSTDVK DGEKVEMETE
1360 1370 1380 1390 1400
DSLKKDEMSA AATSEIKKET NVVVENEQDK DTVLISPIFK YLVKLFIKCC
1410 1420 1430 1440 1450
YDKDFSVKGA GLIGIEYIIE NVKLSWIQPF QHLILKSLLF VCEDLSYSGY
1460 1470 1480 1490 1500
QPTIDYASEI IINLIKLCVP NLNIVPDSME IDQSTTTAST TETAATTTTT
1510 1520 1530 1540 1550
ETATPMVTES TAIVTEPTAT TPTATPTSTP TSTSTPTPTP IPTATTSSTT
1560 1570 1580 1590 1600
TAPTTTTTTT TNLSSSSTIN QKPHCKLNQL KLKDRELLKL ILEILMERIT
1610 1620 1630 1640 1650
SWSGHTRSLA QRMLTMISVE ITKIPMSQLI EDLKMTVQKL LPKTPLKSLS
1660 1670 1680 1690 1700
ISLQTGVIDG LTFCLSQKPS PLIEIGADTV RVLQECLNVA GDESSPTQQS
1710 1720 1730 1740 1750
QIKSSSAKSI SATNNLRVCG VEMVATAMTC PDFLQFECLE FKNRIIRMFF
1760 1770 1780 1790 1800
KVVTARNKEM AMAAKRGLAN SIQQQRLHRD LLQTCLRPVL SNITDPKSLS
1810 1820 1830 1840 1850
VPFLQGLSRL LELLSNCFNA ALGEKLFEYL KKFEEAGKLS YLANKYRDSE
1860 1870 1880 1890 1900
EVKICASIID IFHLLPPAAK LLDSTIILTI RLEQSLCKEV TSPYREPLIR
1910 1920 1930 1940 1950
FLAKYPQRTI EIFMGQLPQF NLIFRLILKH QPLSKPIVEE LANTYSIWLE
1960 1970 1980 1990 2000
AHLKSPSADI RFHTLSMVSI IRKQLPNWLP ENRKVLDILI EYWRPLSHMI
2010 2020 2030 2040 2050
QSASNPLDIS NQTLRETKII VKCFLQYCKA HSEETDLYFY MLSVLTLRAS
2060 2070 2080 2090 2100
MDFNFLRDYY QHDLAPSSTI EQKKKIIQTF LIFFKDQTIP SDNKVQAIQN
2110 2120 2130 2140 2150
LITPILTNYF HQTDRNSSSG GGIIEDSLFI QLTKQTLETE VKASYDDTLL
2160 2170 2180 2190 2200
IELLQLETLL VKNLSSVLVD CRKELIKFAW NHLKNEDLTC KQSAYILACG
2210 2220 2230 2240 2250
FIEAYETPHK IVLQVYVPLL RAYQPESKHL VKQALDILMP CFKTRLPGGD
2260 2270 2280 2290 2300
PKNSTWVKWT KKIIVEEGHT TAQLVHIIQL IVRHPQLFYP SRSQFVPHII
2310 2320 2330 2340 2350
LLLPKIALGS NLTAENKKLS IDIADTIIIW EKMRMSNLQQ SIKTSSSSLP
2360 2370 2380 2390 2400
TTTTTTTSSN KPTDSSSLPP NTPIAEGSIT TPSQGGVATP NVSDSTPTPG
2410 2420 2430 2440 2450
IHHGATNIDD EYRPPLSAIE HISLFLIRMA SNWYHINEKC SELLRQTLVI
2460 2470 2480 2490 2500
WPETNIKFSV FEKPMNTDQP QMISTCLSML NLIAEYQVNT FIPNNVVALQ
2510 2520 2530 2540 2550
QSLLQALNSD NAKISSLLGS LFKKILAAFP LPTNNTTTTT PVSSTTTTEQ
2560 2570 2580 2590 2600
SSDSSSLPPP PPVQVTKPIP NEMVSFYTFI GTQFEMILGA FDKNYNLSIL
2610 2620 2630 2640 2650
SNIKVFSDHS ESFIDPYISL IVKVLIRLTR NYLSQDSDGG TGSLANKPLS
2660 2670 2680 2690 2700
SSGSTSQTGG ASQTATSASN VVLKKSNSEI ISGLCKTYGF LKTKTTKLNS
2710 2720 2730 2740 2750
DQRNAFIQSL LVLIERSNDV ELLSEIIKVV DYLISISPSP SPSTTPVVTE
2760 2770 2780 2790 2800
TTIPSTTTTT TTAATTTTTT TPSTTTTAAT TTTAPTTTET TTTAATTTIT
2810 2820 2830 2840 2850
PFLTIKEKIN FLIKLGRVDQ LSNAELSLSY YKLVLSFYSE SNSSSKQELS
2860 2870 2880 2890 2900
QLEPCFMMGL RNTVDQGMRK SLFNILHKSI GTTPYQRLNY IIGVQQWDIL
2910 2920 2930 2940 2950
GTTYWIKHAL DLLLAILPND KFVKISNFCS KLPTSLKFAN RNGNDINQQQ
2960 2970 2980 2990 3000
QQQQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ QQQQQHHQQE QPMEIDENLV
3010 3020 3030 3040 3050
VEQSSSVNKG NEEFKKSLKL HTQWLESLKN EESLKFSEFN ENLRELIFID
3060 3070 3080 3090 3100
SHLVNDLWCH LFSDMWSDLT KEEQFKLSKS LTLLLSKDYT KKVPLVSKPI
3110 3120 3130 3140 3150
IPPTSIPISK PIITTTTSTS SSTSTTTPIT TIPLINNSQL ITIVTLTQQQ
3160 3170 3180 3190 3200
TNPIIVPSLR EPNVIKTWME TLGMCKPIPK VPIEVISFLG ENYNCWYYAI
3210 3220 3230 3240 3250
RMIEQQLIDR QKLLDSTDIN WDYLSYLYGA IGEKDLLYGI YRKRYQCDET
3260 3270 3280 3290 3300
KLGLLLEQFY MFQSSQEVFL SAMNKYSAVG CKPTPRSENL LWEDHWLECA
3310 3320 3330 3340 3350
KRLNQWNFVH EFSKEKNMYD LTIESAWKIP QWNSVKENMK KMMSQGDTSI
3360 3370 3380 3390 3400
RKILQGYFLT NEKRYHEVDP AIVTSNQLIL DKWVSLPERS FRSHTNSLVE
3410 3420 3430 3440 3450
MQQVVELQES VHILKEISNI TLSQQPADLS RSFLTSNYIK SIFNIWRERL
3460 3470 3480 3490 3500
PNKDEDLLIW FELMAWRQQV FNIIGTPSMN GGIGANPVTP TNTTTTITNP
3510 3520 3530 3540 3550
DGTTTTTTTP LPPPQQPINQ IEFASPRYMV LEMAWTMNKY SHIVRKHNII
3560 3570 3580 3590 3600
EVCLNSLSKM FDLQIELHDI FLNLKEQIKC YLQLPTHYDT GISIINSTNL
3610 3620 3630 3640 3650
DFFTPMQKGE FLQLKGEFLN RLGRYDEANQ SFASSVSQYE NSAKNWISWA
3660 3670 3680 3690 3700
HFCDNQFTNH SSSSITPSST PTTYDIKTQW AESAISCYIQ GIKCDPKYGS
3710 3720 3730 3740 3750
RYVPRIFWLL YLNGSGEVPQ QIQTQQQQQQ AAAQGGLPPQ PRKLTPAQSV
3760 3770 3780 3790 3800
FQSFLNSWTI LPQWIWLNYM PQLISGAANL LNFPGYGFLC WQMIGKICYL
3810 3820 3830 3840 3850
FPNSSYYHFR KLVLEMKSNA SKFTTSPPTT TTTATTTTTA TTTITTATTT
3860 3870 3880 3890 3900
STPTQTTPTQ NTTTPIKEES STTTATTTPA VPSTSTPTST SAPAPISTST
3910 3920 3930 3940 3950
NTPPNATTTT PQANTTSPPP PSSTFSPLKM TETLSLGLHQ YHSCLINEID
3960 3970 3980 3990 4000
MMLGSFSILS GSIPAVYQFN GSLNQILLEA FKLNKIEDSI YNSIRSLYKH
4010 4020 4030 4040 4050
YFVNEIKYQN SKEFLEAYKS EFKVDFIEFN LDDIVSDTIK KESDQETNVV
4060 4070 4080 4090 4100
EGTTNVSKEL ETTSEKQQQQ QQQLPTISIL LLIEKLIKWI DRKPDNSLIT
4110 4120 4130 4140 4150
VVNTDQSTNY YGIDTMICLE SICPQLVNFK PSILEIPGQY NTNRDPNIEN
4160 4170 4180 4190 4200
NVKVEKVGMF AKLIKHSNGM VCPRITLYGG NGKAYQFLIE SSPSLINGIT
4210 4220 4230 4240 4250
NSNNNNVARV YERKNQLLGS INSMLIKNRE TRRRGLTLNS YPTVVPIKNS
4260 4270 4280 4290 4300
LTMIQNIGND SIKQLAEVWY THSNQSNLFK PMLKYKEMLL NSNLHTELLS
4310 4320 4330 4340 4350
KKDQDGDLEF TNITEDNNIS SSSSSSSSSG SNSGENSPII DSSKLVVFRE
4360 4370 4380 4390 4400
MSKEIGDELM INYIQSTLLP TNYQDQYEFK LNFSNQFGLH SLLQYILFSD
4410 4420 4430 4440 4450
IGDIDPSKIY LTKSTGSVYY NDWSLKLTNR KLGFDLLQDN PYNQQQLLRL
4460 4470 4480 4490 4500
SPNIRNYLGP LYLEGSYLSS MISTCICLSD LKDQLVNSIN LFIFDEYMCM
4510 4520 4530 4540 4550
NNVEPLQQSE QNKDRNIHYE FIDKTTATVH QMLENRIDSL TPSSQPDKTC
4560 4570 4580
FISPIVKKVN QLIQNSLSSN ISQLDQLSCP WL
Length: 4,582
Mass (Da): 515,215
Last modified: May 2, 2006 - v2
Checksum: FD1EAC1BA288654B
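
The sequence display above interleaves position rulers with residue lines grouped in blocks of ten. A minimal sketch for collapsing such a display into a single residue string is shown below; the function name `parse_sequence_block` is hypothetical, and the sample is limited to the first two residue lines of this entry.

```python
def parse_sequence_block(block: str) -> str:
    """Join a ruler-annotated sequence display into one residue string.

    Lines made only of digits and spaces are treated as position rulers.
    """
    residues = []
    for line in block.splitlines():
        stripped = line.strip()
        if not stripped or stripped.replace(" ", "").isdigit():
            continue  # skip blank lines and ruler lines
        residues.append(stripped.replace(" ", ""))
    return "".join(residues)

sample = """\
        10         20         30         40         50
MSTNPPQPPP SSTIASNPPQ PIATPMSTTN PSQPTITSSS AASSSSSSSS
60 70 80 90 100
GNPVNFESYA RRCFELNNNN EQTQLLALVT EIRDNIELVH TVEYPTFLNF
"""
seq = parse_sequence_block(sample)
print(len(seq))  # 100
```

Applied to the full display, the resulting string length can be compared against the Length value reported for the entry as a quick integrity check.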

Sequence databases

EMBL/GenBank/DDBJ: AAFI02000043 Genomic DNA. Translation: EAL66533.2.
RefSeq: XP_640504.5. XM_635412.3.

Genome annotation databases

EnsemblProtists: DDB0229338; DDB0229338; DDB_G0281947.
GeneID: 8623319.
KEGG: ddi:DDB_G0281947.

Cross-references

Miscellaneous databases

PRO: Q54T85.

Publications

  1. "The genome of the social amoeba Dictyostelium discoideum."
    Eichinger L., Pachebat J.A., Gloeckner G., Rajandream M.A., Sucgang R., Berriman M., Song J., Olsen R., Szafranski K., Xu Q., Tunggal B., Kummerfeld S., Madera M., Konfortov B.A., Rivero F., Bankier A.T., Lehmann R., Hamlin N., Davies R., Gaudet P., Fey P., Pilcher K., Chen G., Saunders D., Sodergren E.J., Davis P., Kerhornou A., Nie X., Hall N., Anjard C., Hemphill L., Bason N., Farbrother P., Desany B., Just E., Morio T., Rost R., Churcher C.M., Cooper J., Haydock S., van Driessche N., Cronin A., Goodhead I., Muzny D.M., Mourier T., Pain A., Lu M., Harper D., Lindsay R., Hauser H., James K.D., Quiles M., Madan Babu M., Saito T., Buchrieser C., Wardroper A., Felder M., Thangavelu M., Johnson D., Knights A., Loulseged H., Mungall K.L., Oliver K., Price C., Quail M.A., Urushihara H., Hernandez J., Rabbinowitsch E., Steffen D., Sanders M., Ma J., Kohara Y., Sharp S., Simmonds M.N., Spiegler S., Tivey A., Sugano S., White B., Walker D., Woodward J.R., Winckler T., Tanaka Y., Shaulsky G., Schleicher M., Weinstock G.M., Rosenthal A., Cox E.C., Chisholm R.L., Gibbs R.A., Loomis W.F., Platzer M., Kay R.R., Williams J.G., Dear P.H., Noegel A.A., Barrell B.G., Kuspa A.
    Nature 435:43-57(2005)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: AX4.
  2. "The Dictyostelium kinome -- analysis of the protein kinases from a simple model organism."
    Goldberg J.M., Manning G., Liu A., Fey P., Pilcher K.E., Xu Y., Smith J.L.
    PLoS Genet. 2:E38-E38(2006)
    Cited for: GENE FAMILY, NOMENCLATURE.

Entry information

Entry name: TRA1_DICDI
Accession: Primary (citable) accession number: Q54T85
Entry history:
Integrated into UniProtKB/Swiss-Prot: May 26, 2009
Last sequence update: May 2, 2006
Last modified: May 11, 2016
This is version 72 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Dictyostelium discoideum
    Dictyostelium discoideum: entries, gene names and cross-references to dictyBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:

  • 100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
  • 90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
  • 50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
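
The UniRef90 and UniRef50 criteria quoted above can be read as a simple membership test against a seed sequence. The sketch below encodes only the stated thresholds; it is not UniRef's production clustering pipeline. The names `THRESHOLDS` and `joins_cluster` are hypothetical, and the identity and overlap fractions are assumed to have been computed beforehand by an alignment tool.

```python
# Thresholds as stated in the text: (minimum identity, minimum overlap with the seed).
THRESHOLDS = {"UniRef90": (0.90, 0.80), "UniRef50": (0.50, 0.80)}

def joins_cluster(level: str, identity: float, overlap: float) -> bool:
    """Return True if a sequence meets the stated criteria against a seed sequence."""
    min_identity, min_overlap = THRESHOLDS[level]
    return identity >= min_identity and overlap >= min_overlap

print(joins_cluster("UniRef90", identity=0.93, overlap=0.85))  # True
print(joins_cluster("UniRef90", identity=0.93, overlap=0.60))  # False
print(joins_cluster("UniRef50", identity=0.55, overlap=0.80))  # True
```

Both conditions must hold together: a highly similar fragment that covers too little of the seed does not join the cluster under this reading.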