Protein
Submitted name:

Protein Prkdc

Gene

Prkdc

Organism
Rattus norvegicus (Rat)
Status
Unreviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Catalytic activity

ATP + a protein = ADP + a phosphoprotein. (SAAS annotation)

GO - Molecular function

GO - Biological process

Keywords - Molecular function

Kinase (SAAS annotation), Transferase

Keywords - Ligand

ATP-binding (SAAS annotation), Nucleotide-binding

Enzyme and pathway databases

Reactome: R-RNO-5693571. Nonhomologous End-Joining (NHEJ).

Names & Taxonomy

Protein names
Submitted name: Protein Prkdc (Imported)
Gene names
Name: Prkdc (Imported)
Organism: Rattus norvegicus (Rat) (Imported)
Taxonomic identifier: 10116 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Rattus
Proteomes
  • UP000002494 Component: Chromosome 11

Organism-specific databases

RGD: 1308982. Prkdc.

Subcellular location

GO - Cellular component

PTM / Processing

Proteomic databases

PaxDb: D3ZTN0.
PeptideAtlas: D3ZTN0.
PRIDE: D3ZTN0.

Interaction

Protein-protein interaction databases

STRING: 10116.ENSRNOP00000037412.

Family & Domains

Domains and Repeats

Feature key   Position(s)    Length   Description
Domain        2882 – 3540    659      FAT (InterPro annotation)
Domain        3746 – 4126    381      PI3K/PI4K (InterPro annotation)
Domain        4094 – 4126    33       FATC (InterPro annotation)
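
The positions in the table above appear to be 1-based and inclusive, since end - start + 1 reproduces each listed length. The following minimal Python sketch works under that assumption; the names Domain, DOMAINS and slice_domain are hypothetical helpers introduced for illustration and are not part of any UniProt tooling.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    """A domain annotation with 1-based, inclusive coordinates (assumed)."""
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count as residues.
        return self.end - self.start + 1


# Values copied from the Domains and Repeats table of this entry.
SEQUENCE_LENGTH = 4126
DOMAINS = [
    Domain("FAT", 2882, 3540),
    Domain("PI3K/PI4K", 3746, 4126),
    Domain("FATC", 4094, 4126),
]


def slice_domain(sequence: str, domain: Domain) -> str:
    """Return the residues of `domain` from a 0-indexed Python string."""
    return sequence[domain.start - 1:domain.end]


for d in DOMAINS:
    # Every domain must lie inside the 4,126-residue sequence.
    assert 1 <= d.start <= d.end <= SEQUENCE_LENGTH
    print(f"{d.name}: {d.start}-{d.end} ({d.length} residues)")
```

Note that the FATC coordinates (4094 – 4126) fall inside the annotated PI3K/PI4K range (3746 – 4126), so the two features overlap at the C-terminus.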

Sequence similarities

Belongs to the PI3/PI4-kinase family. (SAAS annotation)
Contains FAT domain. (SAAS annotation)
Contains FATC domain. (SAAS annotation)
Contains PI3K/PI4K domain. (SAAS annotation)

Phylogenomic databases

eggNOG: KOG0891. Eukaryota.
    COG5032. LUCA.
GeneTree: ENSGT00830000128321.
InParanoid: D3ZTN0.
KO: K06642.
OrthoDB: EOG7DNNT7.
TreeFam: TF324494.

Family and domain databases

Gene3D: 1.10.1070.11. 3 hits.
    1.25.10.10. 4 hits.
InterPro: IPR011989. ARM-like.
    IPR016024. ARM-type_fold.
    IPR003152. FATC_dom.
    IPR011009. Kinase-like_dom.
    IPR012582. NUC194.
    IPR000403. PI3/4_kinase_cat_dom.
    IPR018936. PI3/4_kinase_CS.
    IPR003151. PIK-rel_kinase_FAT.
    IPR014009. PIK_FAT.
Pfam: PF02259. FAT. 1 hit.
    PF02260. FATC. 1 hit.
    PF08163. NUC194. 1 hit.
    PF00454. PI3_PI4_kinase. 1 hit.
SMART: SM01343. FATC. 1 hit.
    SM01344. NUC194. 1 hit.
    SM00146. PI3Kc. 1 hit.
SUPFAM: SSF48371. SSF48371. 11 hits.
    SSF56112. SSF56112. 2 hits.
PROSITE: PS51189. FAT. 1 hit.
    PS51190. FATC. 1 hit.
    PS00915. PI3_4_KINASE_1. 1 hit.
    PS00916. PI3_4_KINASE_2. 1 hit.
    PS50290. PI3_4_KINASE_3. 1 hit.

Sequence

Sequence status: Complete.

D3ZTN0-1 [UniParc]

        10         20         30         40         50
MADPGAGLRC WLLQLQEFVS AADRYNAAGA SYQLIRGLGQ ECVLSTCSAV
60 70 80 90 100
QALQISLVFS KEFGLLVFIR KSLSIDEFRD CREEALKFLC VFLEKIDQKV
110 120 130 140 150
LHYSFDIKNT CISVYTKDRT AKCKIPALDL LIKLLKILRS SRVMDEFKIG
160 170 180 190 200
ELFNKFYGEL VSKSKLPDTV LEKVYELLGV LGEVHPSEMV NHSENLFRAF
210 220 230 240 250
LGELKTQMTS TVREPKFPVL AGCLKGLSSL LCNFTKSMEE DPQTSKEIFG
260 270 280 290 300
FTLKAIRPQI EMKRYAVPLA GLRLLTLHAS QFTTCLLDNY ITLFEVLSKW
310 320 330 340 350
CNHTNVELKK AAHSALESFL KQISFTVAED AELHKSKLKY FMEQFYGIIR
360 370 380 390 400
NADSNNKDLA IAIRGYGLFA GPCKVINAKD VDFMYVELIQ RCKQMFLTQT
410 420 430 440 450
DATEDHVYQM PSFLQSIASV LLYLDTVPEV YTPVLEHLMV VQIDSFPQYS
460 470 480 490 500
PKMQLVCCKA LVKVFLSLTE KGPVHWNCIS AVVHQGLIRI CSKPVVLQKD
510 520 530 540 550
VESRSENRWA SEEVRMGRWK VPTYKDYVDL FQNLLGCDQM ADFILGDETF
560 570 580 590 600
LFVNSSLKSL NHLLYDEFIR SVLKIVEKLD LTLEKQTTGE QEDERTADVW
610 620 630 640 650
VIPTSDPAAN LHPIKPNDFS AFINLVEFCR EILPKKQVSF FEPWVYSFAY
660 670 680 690 700
ELILQSTRLP LISGFYKLLS IAVKNARKIK YFEGVSPKCL KHSPENTEKH
710 720 730 740 750
SCFALFAKFG KEVSVKMKQY KDELLASCLT FVLSLPPDII KLDVRAYVPA
760 770 780 790 800
LQMAFKLGLS HLPLAEIGLH ALKEWSVHID KSIMQPYYKD IVPCLDGYLN
810 820 830 840 850
TSTLSDETKS HWEVSALSRA ARKGFNRDVV KLLKRTRNVT SDEALSLEEI
860 870 880 890 900
RIRVVQILGS LGGQINKNLI TATSGERMKK YVAWDREKRL SFAVPFREMK
910 920 930 940 950
PVIYLDVFLP RVTELALSAS DRQTKVAACE LLHSMVMFML GRATQMPEGQ
960 970 980 990 1000
GLPPMYQLYK HMFPVLLKLA CDVDQVTRQL YEPLVMQLIH WFTNNKKFES
1010 1020 1030 1040 1050
QDTVALLEAI LDGIVDPVDS TLRDFCGQCV QEFLKWSIKQ TTPQQQEKSP
1060 1070 1080 1090 1100
VNSKSLFKRL YSLALHPNAF KRLGAALAFN HIYKEFREEG SLVEQFVFEA
1110 1120 1130 1140 1150
LVTYMESLAL AHADEKSLGT IQQCCDAIDH LRRIIEKKYI SLNKAKKRRL
1160 1170 1180 1190 1200
PRGFPPLTSL CLLDLVKWLF AHCGRPQTEC RHKSIELFYK FVPLLPGNKS
1210 1220 1230 1240 1250
PSLWLRDLIK AEGISFLINT FEGGASSSGQ PSGILAQPTL LHLQGPVSLR
1260 1270 1280 1290 1300
GMLQWLDHLL AALECYNTFI EEKTVQVQEV LGTEVQSSLL KSVAFFLESI
1310 1320 1330 1340 1350
ATCDARTVEQ RFGTGLTGPP SVQEEEKYNY SKCTVLVRIM EFTTTLLVTF
1360 1370 1380 1390 1400
PEDCKLLEKD LCNTKLMKVL VKMLCEPLSI GFNIGDVQVM NHLPSICVNL
1410 1420 1430 1440 1450
LKALRKSPYR DKLETHLKEK VTAQSVEELC AINLCSSAAC QERTKLLSLL
1460 1470 1480 1490 1500
SACKQLHKAG FSHIISASQS TALNHSIGMR LLSLVYNGIA PAQEKQYLQS
1510 1520 1530 1540 1550
LDPSCKSLAS GLLELAFAFG GLCEHLVSLL LNSSMLSTQY LGSSQRNISF
1560 1570 1580 1590 1600
SHGEYFYSLF PEVINTELLK SLDVTVSRLL ESSSDNPKMV STILNGMLDM
1610 1620 1630 1640 1650
SFRDRAVQKH QGLKLATAIL QNWRKCDSWW APDSAPESKT TVLSLLAKML
1660 1670 1680 1690 1700
QIDSTLSFDT NHSSFSEIFT TYANLLADTK LGLHLKGQAV ILLPFFTRVT
1710 1720 1730 1740 1750
EDRLGDLKHI LEKFIVLNFP MKSDEFPPDS LKYNNYVDCM KKFLDALELS
1760 1770 1780 1790 1800
QSPMLLQLMT DILCREQQHI MEELFQTTFK RIARRSPCVT QLNLLESVYT
1810 1820 1830 1840 1850
MFRKGGLLSD VTQAFVDRSL LTLLWHCDLD TLKEFFSRIV VEAIDVLKSR
1860 1870 1880 1890 1900
FTKLNDTFDT QITKKMCYYK MLAVMYSRLL KDDVHSKESK INQAFQGSCV
1910 1920 1930 1940 1950
AEGNELTKTL LKLCHDAFTE NMAGESQLLE KRRLYHCAAY NCAISLISRV
1960 1970 1980 1990 2000
FNELKFYQGF LFSEKPEKNL FIFENLIDLK RCYTFPIEVE VPMERKKKYV
2010 2020 2030 2040 2050
EIRKEARDAA NGASGNPRYM SSLSYLTESS LSEEMSQFDF STGVQSYSYS
2060 2070 2080 2090 2100
SQDRKPTTGH FQRREHQDSM AQDDIIELEM DELNQHECMA PMTSLIKHMQ
2110 2120 2130 2140 2150
RNVIAPKGQE GSISKDLPPW MKFLHDKLEN ASVSLNIRLF LAKLVINTEE
2160 2170 2180 2190 2200
VFRPYAKHWL GPLLQLAVCE NIGEGIHYMI VEIVATILSW TGLATPTGVP
2210 2220 2230 2240 2250
KYEVLANRLL YFLMKYVFHP KRAVFRHNLE IIKTLVECWK ECLSIPYRLI
2260 2270 2280 2290 2300
FEKFSNKDPN SKDNSVGIQL LGIVVANNLP PYDLKGDITS AMYFEALVNN
2310 2320 2330 2340 2350
MSFVKYKEVY AAAAEVLGLI LQHVTERKHV IAESVYELVV KQLKQHQNTM
2360 2370 2380 2390 2400
EDKFIVCLNK IVKGFPPLAD RFLNALFFLL PKFHGVMKTL CLEVVLCRAE
2410 2420 2430 2440 2450
AITGLYLQLK SKDFLQVMRH RDDERQKVCL DIIYKMVAKL KPTELRELLN
2460 2470 2480 2490 2500
PVVEFVSHPS PTCREQMYNI LMWIHDNYRD PESQMDEDSQ EIFKLAKDVL
2510 2520 2530 2540 2550
IQGLIDENLG LQLIIRNFWS HETRLPSNTL DRLLALNSLY SPKIEVHFLS
2560 2570 2580 2590 2600
LATNFLLEMT RMSPDYLNPI FEHPLSECEF QEYTIDPDWR FRSTVLTPMF
2610 2620 2630 2640 2650
IETQAFPSTL NTQTQEGSPS DRRQKPGQVR ATQQQYDFTP TQTSVERSSF
2660 2670 2680 2690 2700
DWLTGSSIDL MADHTVFSSE SLSSSLLFSH KKSEKSQRVS WKSVGPDFGT
2710 2720 2730 2740 2750
KKLGLPGDEV DNQVKSGTHS QTEILRLRRR FLKDQEKLSL LYAKRGLMEQ
2760 2770 2780 2790 2800
KLEKDIKSEL KMKQDAQVVL YRSYRHGDLP DIQIQHSSLI TPLQAVAQKD
2810 2820 2830 2840 2850
PIIAKQLFSS LFSGILKEMH KFRTTSEKNI IIQNLLQDFN RFLNTTFLFF
2860 2870 2880 2890 2900
PPFVSCIQEI SCQHVDLLTL DPAAVRVGCL ASLQQPGGIR LLEEALLHLL
2910 2920 2930 2940 2950
PKEPPTKRIR GKTCLPPDVL RWMELAKLYR SIGEYDVLHG IFSSELGTKQ
2960 2970 2980 2990 3000
DTQNALLAEA RSDYFQAANL YKEALNKLEW LDGEPTEAEK EFWEMASLDC
3010 3020 3030 3040 3050
YNNLSQWKEL EYRSTVNIGS ENSLDLSKMW SKPFYQETYL PYVIRSKLKL
3060 3070 3080 3090 3100
LLQGEDNQSL LTFVDEAMHK ELQQMVLELQ YSQELSLLYI LQDDIDRATY
3110 3120 3130 3140 3150
YIKNAIQIFM QNYSSIDVLL YRSRLAKLQS VQTLAEIEEF LNFIRKHGDL
3160 3170 3180 3190 3200
SSLGPLRRLL KTWTSRYPDT VTDPMHIWDD IITNRCFFLS KIEERLTVSP
3210 3220 3230 3240 3250
GDHSMSVDED EDSFAREGSE PKEDAHHILQ SCRFTMKMKM IESSWKQNNF
3260 3270 3280 3290 3300
SLSMKLLKEM HKESKIRESW RMQWIHSFCQ LNHCRSHTQS PQEQVLTMLK
3310 3320 3330 3340 3350
TITLLDESDI SNYLNKNIQA FCDQNILLGT SCRIMADALS REPACLSGLE
3360 3370 3380 3390 3400
KSKTESIFAL SGSNTENTER VISGLYQRAF HHLSMAVQSA EEETQLSCWG
3410 3420 3430 3440 3450
HEAAAERAHA YMTLVGFCDQ QLRKVEESAS TASQKISTEL EGYPALVVEK
3460 3470 3480 3490 3500
MLRALKLNSS EARLKFPRLL QIIEQYPEET VNIIIKEISS IPCWQFIGWI
3510 3520 3530 3540 3550
GHMVALLDKE EAIAVQHTVE EIADNYPQAI IYPFIISSES YSFQNTSSGH
3560 3570 3580 3590 3600
NNKAFVERIK SKLDQGGVIQ GFINALDQLS NPDMLFKDWV NDTKDELGKN
3610 3620 3630 3640 3650
PVNKKNIEKL YERMYAALGD FRAPGLGPFR RKFIQTFGKE FVKSFGNGGS
3660 3670 3680 3690 3700
KLLTMKLDEF RNITDSLFVR MRKDSKLPGN LKEYSPWMSE FTVKSELEIP
3710 3720 3730 3740 3750
GQYDGKSKPL PEYHVRISGF DERVKVMVSL RKPKRIVIRG HDEKEYPFLV
3760 3770 3780 3790 3800
KGGEDLRQDQ RIEQLFEVMN AILSQDAACS QRNMQLRTYR VVPMTSRLGL
3810 3820 3830 3840 3850
IEWIENTMTL KDLLLSNMSQ EEKVAYDSDS KAPIYDYRDW LIKVSGRSDV
3860 3870 3880 3890 3900
KGYTLMYSRA NRTETVTAFR RRENKVPADL LKRAFVKMST SPEAFLALRS
3910 3920 3930 3940 3950
HFASSHALLC ISHWLLGIGD RHLNNFMVAM ETGSVIGIDF GHAFGSATQF
3960 3970 3980 3990 4000
LPIPELMPFR LTRQFISLML PMKETGLVCT VMVHALRAFR SCAGLLTDTM
4010 4020 4030 4040 4050
EVFVKEPSFD WKGFEQTMLR KGGSWIQEIN VTEKNWYPQH KIRYAKRKLA
4060 4070 4080 4090 4100
GANPAVITCD ELRLGHEASP AFRSYVAVAR GNKDYNIRAQ EPESRLSEET
4110 4120
QVKCLVDQAT DPNILGRTWE GWEPWM
Length: 4,126
Mass (Da): 472,074
Last modified: July 22, 2015 - v3
Checksum: 8BB4E25C6D8D6505
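
The sequence listing above interleaves position rulers with rows of ten-residue groups, so it has to be cleaned before it can be used as a plain string. The sketch below is one way to do that in Python, assuming ruler lines contain only digits and whitespace; parse_sequence_block is a hypothetical helper name, and the excerpt holds only the first two rows of the listing.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Strip ruler lines and spacing from a grouped sequence listing."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip blank lines and ruler lines (digits and whitespace only).
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        # Remove the spaces that separate the ten-residue groups.
        residues.append(re.sub(r"\s+", "", stripped))
    return "".join(residues)


# First two rows of the listing, copied verbatim from this entry.
EXCERPT = """\
        10         20         30         40         50
MADPGAGLRC WLLQLQEFVS AADRYNAAGA SYQLIRGLGQ ECVLSTCSAV
60 70 80 90 100
QALQISLVFS KEFGLLVFIR KSLSIDEFRD CREEALKFLC VFLEKIDQKV
"""

sequence = parse_sequence_block(EXCERPT)
print(len(sequence))
```

Applied to the complete listing, the length of the returned string can be compared with the stated length of 4,126 as a quick check that no row was lost in copying.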

Sequence databases

EMBL / GenBank / DDBJ: AABR07034787 Genomic DNA. No translation available.
RefSeq: XP_003751148.1. XM_003751100.3.
    XP_003752582.1. XM_003752534.3.

Genome annotation databases

Ensembl: ENSRNOT00000035247; ENSRNOP00000037412; ENSRNOG00000025028.
GeneID: 360748.
KEGG: rno:360748.

Cross-references

Organism-specific databases

CTD: 5591.
RGD: 1308982. Prkdc.

Publications

  1. "Genome sequence of the Brown Norway rat yields insights into mammalian evolution."
    Rat Genome Sequencing Project Consortium
    Gibbs R.A., Weinstock G.M., Metzker M.L., Muzny D.M., Sodergren E.J., Scherer S., Scott G., Steffen D., Worley K.C., Burch P.E., Okwuonu G., Hines S., Lewis L., Deramo C., Delgado O., Dugan-Rocha S., Miner G., Morgan M., Hawes A., Gill R., Holt R.A., Adams M.D., Amanatides P.G., Baden-Tillson H., Barnstead M., Chin S., Evans C.A., Ferriera S., Fosler C., Glodek A., Gu Z., Jennings D., Kraft C.L., Nguyen T., Pfannkoch C.M., Sitter C., Sutton G.G., Venter J.C., Woodage T., Smith D., Lee H.-M., Gustafson E., Cahill P., Kana A., Doucette-Stamm L., Weinstock K., Fechtel K., Weiss R.B., Dunn D.M., Green E.D., Blakesley R.W., Bouffard G.G., De Jong P.J., Osoegawa K., Zhu B., Marra M., Schein J., Bosdet I., Fjell C., Jones S., Krzywinski M., Mathewson C., Siddiqui A., Wye N., McPherson J., Zhao S., Fraser C.M., Shetty J., Shatsman S., Geer K., Chen Y., Abramzon S., Nierman W.C., Havlak P.H., Chen R., Durbin K.J., Egan A., Ren Y., Song X.-Z., Li B., Liu Y., Qin X., Cawley S., Cooney A.J., D'Souza L.M., Martin K., Wu J.Q., Gonzalez-Garay M.L., Jackson A.R., Kalafus K.J., McLeod M.P., Milosavljevic A., Virk D., Volkov A., Wheeler D.A., Zhang Z., Bailey J.A., Eichler E.E., Tuzun E., Birney E., Mongin E., Ureta-Vidal A., Woodwark C., Zdobnov E., Bork P., Suyama M., Torrents D., Alexandersson M., Trask B.J., Young J.M., Huang H., Wang H., Xing H., Daniels S., Gietzen D., Schmidt J., Stevens K., Vitt U., Wingrove J., Camara F., Mar Alba M., Abril J.F., Guigo R., Smit A., Dubchak I., Rubin E.M., Couronne O., Poliakov A., Huebner N., Ganten D., Goesele C., Hummel O., Kreitler T., Lee Y.-A., Monti J., Schulz H., Zimdahl H., Himmelbauer H., Lehrach H., Jacob H.J., Bromberg S., Gullings-Handley J., Jensen-Seaman M.I., Kwitek A.E., Lazar J., Pasko D., Tonellato P.J., Twigger S., Ponting C.P., Duarte J.M., Rice S., Goodstadt L., Beatson S.A., Emes R.D., Winter E.E., Webber C., Brandt P., Nyakatura G., Adetobi M., Chiaromonte F., Elnitski L., Eswara P., Hardison R.C., Hou M., Kolbe D., Makova K., Miller W., Nekrutenko A., Riemer C., Schwartz S., Taylor J., Yang S., Zhang Y., Lindpaintner K., Andrews T.D., Caccamo M., Clamp M., Clarke L., Curwen V., Durbin R.M., Eyras E., Searle S.M., Cooper G.M., Batzoglou S., Brudno M., Sidow A., Stone E.A., Payseur B.A., Bourque G., Lopez-Otin C., Puente X.S., Chakrabarti K., Chatterji S., Dewey C., Pachter L., Bray N., Yap V.B., Caspi A., Tesler G., Pevzner P.A., Haussler D., Roskin K.M., Baertsch R., Clawson H., Furey T.S., Hinrichs A.S., Karolchik D., Kent W.J., Rosenbloom K.R., Trumbower H., Weirauch M., Cooper D.N., Stenson P.D., Ma B., Brent M., Arumugam M., Shteynberg D., Copley R.R., Taylor M.S., Riethman H., Mudunuri U., Peterson J., Guyer M., Felsenfeld A., Old S., Mockrin S., Collins F.S.
    Nature 428:493-521(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: Brown Norway (Imported).
  2. Ensembl
    Submitted (JUL-2011) to UniProtKB
    Cited for: IDENTIFICATION.
    Strain: Brown Norway (Imported).

Entry information

Entry name: D3ZTN0_RAT
Accession: Primary (citable) accession number: D3ZTN0
Entry history
Integrated into UniProtKB/TrEMBL: April 20, 2010
Last sequence update: July 22, 2015
Last modified: July 6, 2016
This is version 55 of the entry and version 3 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Caution

The sequence shown here is derived from an Ensembl automatic analysis pipeline and should be considered as preliminary data. (Imported)

Keywords - Technical term

Complete proteome, Proteomics identification (Combined sources), Reference proteome (Imported)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
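
The UniRef90 and UniRef50 membership criteria described above can be expressed as a simple threshold check. The Python sketch below illustrates only the stated cut-offs and is not UniRef's actual clustering procedure. It assumes that identity and overlap against the seed sequence have already been computed from an alignment and are given as fractions between 0 and 1; the names THRESHOLDS and can_join_cluster are hypothetical.

```python
# Minimum (identity, overlap) against the seed sequence, as stated in the text.
THRESHOLDS = {
    "UniRef90": (0.90, 0.80),
    "UniRef50": (0.50, 0.80),
}


def can_join_cluster(level: str, identity: float, overlap: float) -> bool:
    """Check whether a sequence meets the stated cut-offs for a cluster level.

    `identity` and `overlap` are fractions (0.0 to 1.0) measured against the
    cluster's seed sequence; computing them is outside this sketch.
    """
    min_identity, min_overlap = THRESHOLDS[level]
    return identity >= min_identity and overlap >= min_overlap


# Illustrative values only, not measurements for D3ZTN0.
print(can_join_cluster("UniRef90", identity=0.93, overlap=0.85))
print(can_join_cluster("UniRef90", identity=0.93, overlap=0.70))
```

Both conditions must hold at once: a highly identical fragment that covers too little of the seed sequence fails the overlap test and would not be placed in that cluster.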