Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Gamma-tubulin complex component 6

Gene

TUBGCP6

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Gamma-tubulin complex is necessary for microtubule nucleation at the centrosome.1 Publication

GO - Molecular functioni

  • gamma-tubulin binding Source: GO_Central
  • microtubule binding Source: UniProtKB
  • structural constituent of cytoskeleton Source: GO_Central

GO - Biological processi

Complete GO annotation...

Enzyme and pathway databases

ReactomeiR-HSA-380270. Recruitment of mitotic centrosome proteins and complexes.
R-HSA-380320. Recruitment of NuMA to mitotic centrosomes.

Names & Taxonomyi

Protein namesi
Recommended name:
Gamma-tubulin complex component 6
Short name:
GCP-6
Gene namesi
Name:TUBGCP6
Synonyms:GCP6, KIAA1669
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 22

Organism-specific databases

HGNCiHGNC:18127. TUBGCP6.

Subcellular locationi

GO - Cellular componenti

  • centrosome Source: UniProtKB
  • cytosol Source: Reactome
  • equatorial microtubule organizing center Source: GO_Central
  • extracellular exosome Source: UniProtKB
  • gamma-tubulin ring complex Source: UniProtKB
  • membrane Source: UniProtKB
  • microtubule Source: UniProtKB-KW
  • spindle pole Source: InterPro
Complete GO annotation...

Keywords - Cellular componenti

Cytoplasm, Cytoskeleton, Microtubule

Pathology & Biotechi

Involvement in diseasei

Microcephaly and chorioretinopathy, autosomal recessive, 1 (MCCRP1)1 Publication
The disease is caused by mutations affecting the gene represented in this entry.
Disease descriptionA syndrome characterized by microcephaly, cognitive impairment, underdeveloped retina and choroid, and epilepsy in some patients. The more anterior parts of the retina, near the periphery and pars plana, have a grayish hue and diminutive vasculature similar to retinopathy of prematurity. Visual impairment becomes evident during the first year of life.
See also OMIM:251270

Organism-specific databases

MalaCardsiTUBGCP6.
MIMi251270. phenotype.
Orphaneti2518. Autosomal recessive chorioretinopathy-microcephaly.
PharmGKBiPA38507.

Polymorphism and mutation databases

BioMutaiTUBGCP6.
DMDMi143811395.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 18191819Gamma-tubulin complex component 6PRO_0000078131Add
BLAST

Proteomic databases

EPDiQ96RT7.
MaxQBiQ96RT7.
PaxDbiQ96RT7.
PeptideAtlasiQ96RT7.
PRIDEiQ96RT7.

PTM databases

iPTMnetiQ96RT7.
PhosphoSiteiQ96RT7.

Expressioni

Gene expression databases

BgeeiQ96RT7.
CleanExiHS_TUBGCP6.
ExpressionAtlasiQ96RT7. baseline and differential.
GenevisibleiQ96RT7. HS.

Organism-specific databases

HPAiHPA043928.

Interactioni

Subunit structurei

Gamma-tubulin complex is composed of gamma-tubulin, TUBGCP2, TUBGCP3, TUBGCP4, TUBGCP5 and TUBGCP6.1 Publication

GO - Molecular functioni

  • gamma-tubulin binding Source: GO_Central
  • microtubule binding Source: UniProtKB

Protein-protein interaction databases

BioGridi124505. 33 interactions.
IntActiQ96RT7. 20 interactions.
MINTiMINT-4989271.
STRINGi9606.ENSP00000248846.

Structurei

3D structure databases

ProteinModelPortaliQ96RT7.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Repeati1027 – 1053271Add
BLAST
Repeati1054 – 1080272Add
BLAST
Repeati1081 – 1107273Add
BLAST
Repeati1108 – 1134274Add
BLAST
Repeati1135 – 1161275Add
BLAST
Repeati1162 – 1188276Add
BLAST
Repeati1189 – 1215277Add
BLAST
Repeati1216 – 1242278Add
BLAST
Repeati1243 – 1269279Add
BLAST

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni1027 – 12692439 X 27 AA tandem repeatsAdd
BLAST

Sequence similaritiesi

Belongs to the TUBGCP family.Curated

Keywords - Domaini

Repeat

Phylogenomic databases

eggNOGiKOG2000. Eukaryota.
ENOG410XQ5B. LUCA.
GeneTreeiENSGT00840000129856.
HOVERGENiHBG031569.
InParanoidiQ96RT7.
KOiK16573.
OMAiHASDAHI.
OrthoDBiEOG7JMGCP.
PhylomeDBiQ96RT7.
TreeFamiTF106321.

Family and domain databases

InterProiIPR007259. GCP.
[Graphical view]
PANTHERiPTHR19302. PTHR19302. 2 hits.
PfamiPF04130. Spc97_Spc98. 1 hit.
[Graphical view]

Sequences (3)i

Sequence statusi: Complete.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q96RT7-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MASITQLFDD LCEALLPAAK THLGQRSVNR KRAKRSLKKV AYNALFTNLF
60 70 80 90 100
QDETQQLQPD MSKLPARNKI LMLSFDLRVG GLGPKADRLE ELVEELEAAP
110 120 130 140 150
CCPLLEVGSV LDLLVQLAGS GPPQVLPRKR DYFLNNKHVG RNVPYSGYDC
160 170 180 190 200
DDLSVFEMDV QSLISREECL CHSMIQETLQ VMEAAPGTGL PTVGLFSFGD
210 220 230 240 250
PCGDRFERDT RVSLFGALVH SRTYDMDVRL GLPPVPDNAD LSGLAIKVPP
260 270 280 290 300
SVDQWEDEGF QSASNLTPDS QSEPSVTPDV DLWEAALTYE ASKRRCWERV
310 320 330 340 350
GCPPGHREEP YLTEAGRDAF DKFCRLHQGE LQLLAGGVLQ APQPVLVKEC
360 370 380 390 400
ELVKDVLNVL IGVVSATFSL CQPAQAFVVK RGVHVSGASP ESISSLLSEV
410 420 430 440 450
AEYGTCYTRL SHFSLQPVLD SLYSKGLVFQ AFTSGLRRYL QYYRACVLST
460 470 480 490 500
PPTLSLLTIG FLFKKLGRQL RYLAELCGVG AVLPGTCGGG PRAAFPTGVK
510 520 530 540 550
LLSYLYQEAL HNCSNEHYPV LLSLLKTSCE PYTRFIHDWV YSGVFRDAYG
560 570 580 590 600
EFMIQVNHEY LSFRDKLYWT HGYVLISKEV EDCVPVFLKH IAHDIYVCGK
610 620 630 640 650
TINLLKLCCP RHYLCWSDVP VPRISVIFSL EELKEIEKDC AVYVGRMERV
660 670 680 690 700
ARHSSVSKEE KELRMEIAKQ ELIAHAREAA SRVLSALSDR QMSERMALDA
710 720 730 740 750
RKREQFQRLK EQFVKDQERR QAARQEELDD DFSYARELRD RERRLKSLEE
760 770 780 790 800
ELERKARQAL VDHYSKLSAE AARREQKALW RIQRHRLESA RLRFLLEDEK
810 820 830 840 850
HIQEMLKAVS EAHQPQEPPD VLLSVHPQVT SPGPEHPEGG QGCDSGSAEQ
860 870 880 890 900
HSPAWDGWNR PGLLTPQPLK PLAVGAGGRG LQQAEGARPF SDSLSIGDFL
910 920 930 940 950
PVGPGAEPSV QTGMVPLLEV ALQTINLDLP PSAPGEAPAA ASTQPSRPQE
960 970 980 990 1000
YDFSTVLRPA VATSPAPGPL QAAECSLGSS GLQLWEDSCG KMDACGSASR
1010 1020 1030 1040 1050
ETLLPSHPPR RAALEEGSSQ PTERLFGQVS GGGLPTGDYA SEIAPTRPRW
1060 1070 1080 1090 1100
NTHGHVSDAS IRVGENVSDV APTQPRWNTH GHVSNASISL GESVSDVAPT
1110 1120 1130 1140 1150
RPRWNIHGHV SNASIRVGEN VSDVAPTRPR WNTHGHVSNA SIRVGENVSD
1160 1170 1180 1190 1200
VAPTRPRWNT HGHVSDASIS LGESVSDMAP ARPRWNTHGH VSDASISLGE
1210 1220 1230 1240 1250
SVSDMAPTRP RWNTHGHVSD TSIRVGENVS DVAPIRSRCN THGHVSDASI
1260 1270 1280 1290 1300
SLGEPVSDVV STRPRWNTHV PIPPPHMVLG ALSPEAEPNT PRPQQSPPGH
1310 1320 1330 1340 1350
TSQSALSLGA QSTVLDCGPR LPVEVGPSLS SPSSGCGEGS ISVGENVSDV
1360 1370 1380 1390 1400
APTQPWWPNT PGDSVSEELG PGRSGDTEDL SPNWPLNSQE DTAAQSSPGR
1410 1420 1430 1440 1450
GEEAEASAAE AQGGEQAYLA GLAGQYHLER YPDSYESMSE PPIAHLLRPV
1460 1470 1480 1490 1500
LPRAFAFPVD PQVQSAADET AVQLSELLTL PVLMKRSITA PLAAHISLVN
1510 1520 1530 1540 1550
KAAVDYFFVE LHLEAHYEAL RHFLLMEDGE FAQSLSDLLF EKLGAGQTPG
1560 1570 1580 1590 1600
ELLNPLVLNS VLSKALQCSL HGDTPHASNL SLALKYLPEV FAPNAPDVLS
1610 1620 1630 1640 1650
CLELRYKVDW PLNIVITEGC VSKYSGVFSF LLQLKLMMWA LKDVCFHLKR
1660 1670 1680 1690 1700
TALLSHMAGS VQFRQLQLFK HEMQHFVKVI QGYIANQILH VTWCEFRARL
1710 1720 1730 1740 1750
ATVGDLEEIQ RAHAEYLHKA VFRGLLTEKA APVMNVIHSI FSLVLKFRSQ
1760 1770 1780 1790 1800
LISQAWGPPG GPRGAEHPNF ALMQQSYNTF KYYSHFLFKV VTKLVNRGYQ
1810
PHLEDFLLRI NFNNYYQDA
Length:1,819
Mass (Da):200,498
Last modified:April 3, 2007 - v3
Checksum:i265F89F318996157
GO
Isoform 2 (identifier: Q96RT7-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1724-1757: Missing.

Note: No experimental confirmation available.
Show »
Length:1,785
Mass (Da):196,761
Checksum:i58972A0B8E54CEBF
GO
Isoform 3 (identifier: Q96RT7-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1652-1662: Missing.
     1724-1729: Missing.

Note: No experimental confirmation available.
Show »
Length:1,802
Mass (Da):198,761
Checksum:i14228549220D0E42
GO

Sequence cautioni

The sequence AAH42165.1 differs from that shown. Reason: Frameshift at positions 1651 and 1723. Curated
The sequence BAB33339.1 differs from that shown.Intron retention.Curated

Natural variant

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Natural varianti104 – 1041L → P.
Corresponds to variant rs8137873 [ dbSNP | Ensembl ].
VAR_055852
Natural varianti567 – 5671L → S.1 Publication
Corresponds to variant rs4838865 [ dbSNP | Ensembl ].
VAR_031594
Natural varianti624 – 6241I → M.
Corresponds to variant rs35573482 [ dbSNP | Ensembl ].
VAR_055853
Natural varianti884 – 8841A → V.
Corresponds to variant rs34455105 [ dbSNP | Ensembl ].
VAR_055854
Natural varianti1232 – 12321V → M.
Corresponds to variant rs17248287 [ dbSNP | Ensembl ].
VAR_055855
Natural varianti1364 – 13641S → C.
Corresponds to variant rs5771107 [ dbSNP | Ensembl ].
VAR_055856
Natural varianti1377 – 13771T → A.1 Publication
Corresponds to variant rs11703226 [ dbSNP | Ensembl ].
VAR_055857
Natural varianti1621 – 16211V → L.1 Publication
Corresponds to variant rs4838864 [ dbSNP | Ensembl ].
VAR_055858

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei1652 – 166211Missing in isoform 3. 1 PublicationVSP_017208Add
BLAST
Alternative sequencei1724 – 175734Missing in isoform 2. 1 PublicationVSP_001624Add
BLAST
Alternative sequencei1724 – 17296Missing in isoform 3. 1 PublicationVSP_017209

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF272887 mRNA. Translation: AAK82968.1.
AL022328 Genomic DNA. No translation available.
AB051456 mRNA. Translation: BAB33339.1. Sequence problems.
BC023573 mRNA. Translation: AAH23573.1.
BC042165 mRNA. Translation: AAH42165.1. Frameshift.
CCDSiCCDS14087.1. [Q96RT7-1]
RefSeqiNP_065194.2. NM_020461.3.
UniGeneiHs.336431.

Genome annotation databases

EnsembliENST00000248846; ENSP00000248846; ENSG00000128159. [Q96RT7-1]
GeneIDi85378.
KEGGihsa:85378.
UCSCiuc003bkb.2. human. [Q96RT7-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF272887 mRNA. Translation: AAK82968.1.
AL022328 Genomic DNA. No translation available.
AB051456 mRNA. Translation: BAB33339.1. Sequence problems.
BC023573 mRNA. Translation: AAH23573.1.
BC042165 mRNA. Translation: AAH42165.1. Frameshift.
CCDSiCCDS14087.1. [Q96RT7-1]
RefSeqiNP_065194.2. NM_020461.3.
UniGeneiHs.336431.

3D structure databases

ProteinModelPortaliQ96RT7.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi124505. 33 interactions.
IntActiQ96RT7. 20 interactions.
MINTiMINT-4989271.
STRINGi9606.ENSP00000248846.

PTM databases

iPTMnetiQ96RT7.
PhosphoSiteiQ96RT7.

Polymorphism and mutation databases

BioMutaiTUBGCP6.
DMDMi143811395.

Proteomic databases

EPDiQ96RT7.
MaxQBiQ96RT7.
PaxDbiQ96RT7.
PeptideAtlasiQ96RT7.
PRIDEiQ96RT7.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000248846; ENSP00000248846; ENSG00000128159. [Q96RT7-1]
GeneIDi85378.
KEGGihsa:85378.
UCSCiuc003bkb.2. human. [Q96RT7-1]

Organism-specific databases

CTDi85378.
GeneCardsiTUBGCP6.
HGNCiHGNC:18127. TUBGCP6.
HPAiHPA043928.
MalaCardsiTUBGCP6.
MIMi251270. phenotype.
610053. gene.
neXtProtiNX_Q96RT7.
Orphaneti2518. Autosomal recessive chorioretinopathy-microcephaly.
PharmGKBiPA38507.
HUGEiSearch...
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG2000. Eukaryota.
ENOG410XQ5B. LUCA.
GeneTreeiENSGT00840000129856.
HOVERGENiHBG031569.
InParanoidiQ96RT7.
KOiK16573.
OMAiHASDAHI.
OrthoDBiEOG7JMGCP.
PhylomeDBiQ96RT7.
TreeFamiTF106321.

Enzyme and pathway databases

ReactomeiR-HSA-380270. Recruitment of mitotic centrosome proteins and complexes.
R-HSA-380320. Recruitment of NuMA to mitotic centrosomes.

Miscellaneous databases

GeneWikiiTUBGCP6.
GenomeRNAii85378.
PROiQ96RT7.
SOURCEiSearch...

Gene expression databases

BgeeiQ96RT7.
CleanExiHS_TUBGCP6.
ExpressionAtlasiQ96RT7. baseline and differential.
GenevisibleiQ96RT7. HS.

Family and domain databases

InterProiIPR007259. GCP.
[Graphical view]
PANTHERiPTHR19302. PTHR19302. 2 hits.
PfamiPF04130. Spc97_Spc98. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), FUNCTION, SUBUNIT, SUBCELLULAR LOCATION, VARIANTS SER-567 AND ALA-1377.
  2. "The DNA sequence of human chromosome 22."
    Dunham I., Hunt A.R., Collins J.E., Bruskiewich R., Beare D.M., Clamp M., Smink L.J., Ainscough R., Almeida J.P., Babbage A.K., Bagguley C., Bailey J., Barlow K.F., Bates K.N., Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M.
    , Buck D., Burgess J., Burrill W.D., Burton J., Carder C., Carter N.P., Chen Y., Clark G., Clegg S.M., Cobley V.E., Cole C.G., Collier R.E., Connor R., Conroy D., Corby N.R., Coville G.J., Cox A.V., Davis J., Dawson E., Dhami P.D., Dockree C., Dodsworth S.J., Durbin R.M., Ellington A.G., Evans K.L., Fey J.M., Fleming K., French L., Garner A.A., Gilbert J.G.R., Goward M.E., Grafham D.V., Griffiths M.N.D., Hall C., Hall R.E., Hall-Tamlyn G., Heathcott R.W., Ho S., Holmes S., Hunt S.E., Jones M.C., Kershaw J., Kimberley A.M., King A., Laird G.K., Langford C.F., Leversha M.A., Lloyd C., Lloyd D.M., Martyn I.D., Mashreghi-Mohammadi M., Matthews L.H., Mccann O.T., Mcclay J., Mclaren S., McMurray A.A., Milne S.A., Mortimore B.J., Odell C.N., Pavitt R., Pearce A.V., Pearson D., Phillimore B.J.C.T., Phillips S.H., Plumb R.W., Ramsay H., Ramsey Y., Rogers L., Ross M.T., Scott C.E., Sehra H.K., Skuce C.D., Smalley S., Smith M.L., Soderlund C., Spragon L., Steward C.A., Sulston J.E., Swann R.M., Vaudin M., Wall M., Wallis J.M., Whiteley M.N., Willey D.L., Williams L., Williams S.A., Williamson H., Wilmer T.E., Wilming L., Wright C.L., Hubbard T., Bentley D.R., Beck S., Rogers J., Shimizu N., Minoshima S., Kawasaki K., Sasaki T., Asakawa S., Kudoh J., Shintani A., Shibuya K., Yoshizaki Y., Aoki N., Mitsuyama S., Roe B.A., Chen F., Chu L., Crabtree J., Deschamps S., Do A., Do T., Dorman A., Fang F., Fu Y., Hu P., Hua A., Kenton S., Lai H., Lao H.I., Lewis J., Lewis S., Lin S.-P., Loh P., Malaj E., Nguyen T., Pan H., Phan S., Qi S., Qian Y., Ray L., Ren Q., Shaull S., Sloan D., Song L., Wang Q., Wang Y., Wang Z., White J., Willingham D., Wu H., Yao Z., Zhan M., Zhang G., Chissoe S., Murray J., Miller N., Minx P., Fulton R., Johnson D., Bemis G., Bentley D., Bradshaw H., Bourne S., Cordes M., Du Z., Fulton L., Goela D., Graves T., Hawkins J., Hinds K., Kemp K., Latreille P., Layman D., Ozersky P., Rohlfing T., Scheet P., Walker C., Wamsley A., Wohldmann P., Pepin K., Nelson J., Korf I., Bedell J.A., Hillier L.W., Mardis E., Waterston R., Wilson R., Emanuel B.S., Shaikh T., Kurahashi H., Saitta S., Budarf M.L., McDermid H.E., Johnson A., Wong A.C.C., Morrow B.E., Edelmann L., Kim U.J., Shizuya H., Simon M.I., Dumanski J.P., Peyrard M., Kedra D., Seroussi E., Fransson I., Tapia I., Bruder C.E., O'Brien K.P., Wilkinson P., Bodenteich A., Hartman K., Hu X., Khan A.S., Lane L., Tilahun Y., Wright H.
    Nature 402:489-495(1999) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  3. "Identification of novel transcribed sequences on human chromosome 22 by expressed sequence tag mapping."
    Hirosawa M., Nagase T., Murahashi Y., Kikuno R., Ohara O.
    DNA Res. 8:1-9(2001) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 33-1819 (ISOFORM 2).
    Tissue: Brain.
  4. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 811-1819 (ISOFORM 1), NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1315-1819 (ISOFORM 3), VARIANT LEU-1621.
    Tissue: Brain and Kidney.
  5. Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
  6. Cited for: INVOLVEMENT IN MCCRP1.

Entry informationi

Entry nameiGCP6_HUMAN
AccessioniPrimary (citable) accession number: Q96RT7
Secondary accession number(s): Q5JZ80
, Q6PJ40, Q86YE9, Q9BY91, Q9UGX3, Q9UGX4
Entry historyi
Integrated into UniProtKB/Swiss-Prot: May 2, 2002
Last sequence update: April 3, 2007
Last modified: July 6, 2016
This is version 137 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 22
    Human chromosome 22: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.