Protein

Protein SOGA1

Gene

SOGA1

Organism
Homo sapiens (Human)
Status
Reviewed. Annotation score: 5 out of 5. Experimental evidence at protein level.

Function

Regulates autophagy by playing a role in the reduction of glucose production in an adiponectin- and insulin-dependent manner. (By similarity)

GO - Biological process

  • insulin receptor signaling pathway Source: UniProtKB
  • negative regulation of gluconeogenesis Source: UniProtKB
  • regulation of autophagy Source: UniProtKB

Names & Taxonomy

Protein names
Recommended name:
Protein SOGA1
Alternative name(s):
SOGA family member 1
Suppressor of glucose by autophagy
Suppressor of glucose, autophagy-associated protein 1
Cleaved into the following 2 chains:
C-terminal 80 kDa form
Short name:
80-kDa SOGA fragment
Gene names
Name: SOGA1
Synonyms: C20orf117, KIAA0889, SOGA
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo
Proteomes
  • UP000005640 Component: Chromosome 20

Organism-specific databases

HGNC: HGNC:16111. SOGA1.

Subcellular location

C-terminal 80 kDa form:
  • Secreted (By similarity)

  • Note: Secreted in primary hepatocyte-conditioned media. (By similarity)

GO - Cellular component

  • extracellular exosome Source: UniProtKB
  • extracellular space Source: UniProtKB

Keywords - Cellular component

Secreted

Pathology & Biotech

Organism-specific databases

PharmGKB: PA25657.

Polymorphism and mutation databases

BioMuta: SOGA1.

PTM / Processing

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Chain | 1 – 1423 | 1423 | Protein SOGA1 | PRO_0000050781
Chain | 1 – 688 | 688 | N-terminal form (By similarity) | PRO_0000418054
Chain | 689 – 1421 | 733 | C-terminal 80 kDa form (By similarity) | PRO_0000418055
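The Length column above follows from the 1-based, inclusive Position(s) column. A minimal sketch of that arithmetic, with the chain spans copied from the table; the names `CHAINS` and `feature_length` are illustrative and not part of any UniProt tooling.

```python
# Chain spans copied from the "Molecule processing" table
# (1-based, inclusive coordinates on the canonical sequence).
CHAINS = {
    "Protein SOGA1": (1, 1423),
    "N-terminal form": (1, 688),
    "C-terminal 80 kDa form": (689, 1421),
}


def feature_length(start: int, end: int) -> int:
    """Return the number of residues in a 1-based inclusive span."""
    if start < 1 or end < start:
        raise ValueError(f"invalid span: {start}-{end}")
    return end - start + 1


# Hypothetical helper output: chain name -> computed length.
chain_lengths = {name: feature_length(*span) for name, span in CHAINS.items()}

for name, length in chain_lengths.items():
    print(f"{name}: {length}")
```

The computed values can be compared directly against the Length column of the table.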

Amino acid modifications

Feature key | Position(s) | Length | Description
Modified residue | 931 – 931 | 1 | Phosphoserine (Combined sources)
Modified residue | 1017 – 1017 | 1 | Phosphoserine (Combined sources)
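Positional annotations such as these can be sanity-checked against the canonical sequence. The sketch below assumes two 50-residue rows copied from the isoform 1 display in this entry (the rows ending at positions 950 and 1050); `ROWS` and `residue_at` are hypothetical names used only for illustration.

```python
# Two 50-residue rows copied from the canonical sequence display,
# keyed by the 1-based position of the first residue in each row.
ROWS = {
    901: "RSEKIHDKEAVSEVELGGNGLKRTKSVSSMSEFESLLDCSPYLAGGDARG",
    1001: "QRSYTAPDKTGIRVYYSPPVARRLGVPVVHDKEGKIIIEPGFLFTTAKPK",
}


def residue_at(position: int) -> str:
    """Return the residue at a 1-based position, if a stored row covers it."""
    for start, row in ROWS.items():
        if start <= position < start + len(row):
            return row[position - start]
    raise KeyError(f"position {position} is outside the stored rows")


# Both annotated phosphorylation sites should be serine (S).
for pos in (931, 1017):
    print(pos, residue_at(pos))
```

A phosphoserine annotation is only plausible if the residue at that position is a serine, which is what this lookup checks.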

Post-translational modification

Proteolytically cleaved in primary hepatocytes into a C-terminal 80 kDa form (By similarity). Proteolytically cleaved into a C-terminal SOGA 25 kDa form that is detected in plasma. (By similarity, 1 Publication)

Sites

Feature key | Position(s) | Length | Description
Site | 688 – 689 | 2 | Cleavage (By similarity)

Keywords - PTM

Phosphoprotein

Proteomic databases

EPD: O94964.
MaxQB: O94964.
PaxDb: O94964.
PeptideAtlas: O94964.
PRIDE: O94964.

2D gel databases

UCD-2DPAGE: O94964.

PTM databases

iPTMnet: O94964.
PhosphoSite: O94964.

Expression

Induction

Up-regulated in plasma by adiponectin in healthy fasting females. (1 Publication)

Gene expression databases

Bgee: O94964.
CleanEx: HS_C20orf117.
ExpressionAtlas: O94964. baseline and differential.
Genevisible: O94964. HS.

Organism-specific databases

HPA: HPA043992.

Interaction

Subunit structure

The C-terminal 25 kDa form occurs as a monomer. (By similarity)

Protein-protein interaction databases

BioGrid: 126666. 28 interactions.
IntAct: O94964. 17 interactions.
STRING: 9606.ENSP00000237536.

Structure

3D structure databases

ProteinModelPortal: O94964.
ModBase: Search...
MobiDB: Search...

Family & Domains

Sequence similarities

Belongs to the SOGA family. (Curated)

Phylogenomic databases

eggNOG: KOG4787. Eukaryota.
ENOG410XUHJ. LUCA.
GeneTree: ENSGT00530000063889.
HOGENOM: HOG000231278.
HOVERGEN: HBG080205.
InParanoid: O94964.
OMA: YSRWLCN.
OrthoDB: EOG70GMF8.
PhylomeDB: O94964.
TreeFam: TF331853.

Family and domain databases

InterPro: IPR027882. DUF4482.
IPR027881. SOGA.
Pfam: PF11365. DUF3166. 2 hits.
PF14818. DUF4482. 1 hit.

Sequences (4)

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

This entry describes 4 isoforms produced by alternative splicing.

Isoform 1 (identifier: O94964-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MLEMRDVYME EDVYQLQELR QQLDQASKTC RILQYRLRKA ERRSLRAAQT
60 70 80 90 100
GQVDGELIRG LEQDVKVSKD ISMRLHKELE VVEKKRARLE EENEELRQRL
110 120 130 140 150
IETELAKQVL QTELERPREH SLKKRGTRSL GKADKKTLVQ EDSADLKCQL
160 170 180 190 200
HFAKEESALM CKKLTKLAKE NDSMKEELLK YRSLYGDLDS ALSAEELADA
210 220 230 240 250
PHSRETELKV HLKLVEEEAN LLSRRIVELE VENRGLRAEM DDMKDHGGGC
260 270 280 290 300
GGPEARLAFS ALGGGECGES LAELRRHLQF VEEEAELLRR SSAELEDQNK
310 320 330 340 350
LLLNELAKFR SEHELDVALS EDSCSVLSEP SQEELAAAKL QIGELSGKVK
360 370 380 390 400
KLQYENRVLL SNLQRCDLAS CQSTRPMLET DAEAGDSAQC VPAPLGETHE
410 420 430 440 450
SHAVRLCRAR EAEVLPGLRE QAALVSKAID VLVADANGFT AGLRLCLDNE
460 470 480 490 500
CADFRLHEAP DNSEGPRDTK LIHAILVRLS VLQQELNAFT RKADAVLGCS
510 520 530 540 550
VKEQQESFSS LPPLGSQGLS KEILLAKDLG SDFQPPDFRD LPEWEPRIRE
560 570 580 590 600
AFRTGDLDSK PDPSRSFRPY RAEDNDSYAS EIKELQLVLA EAHDSLRGLQ
610 620 630 640 650
EQLSQERQLR KEEADNFNQK MVQLKEDQQR ALLRREFELQ SLSLQRRLEQ
660 670 680 690 700
KFWSQEKNML VQESQQFKHN FLLLFMKLRW FLKRWRQGKV LPSEGDDFLE
710 720 730 740 750
VNSMKELYLL MEEEEINAQH SDNKACTGDS WTQNTPNEYI KTLADMKVTL
760 770 780 790 800
KELCWLLRDE RRGLTELQQQ FAKAKATWET ERAELKGHTS QMELKTGKGA
810 820 830 840 850
GERAGPDWKA ALQREREEQQ HLLAESYSAV MELTRQLQIS ERNWSQEKLQ
860 870 880 890 900
LVERLQGEKQ QVEQQVKELQ NRLSQLQKAA DPWVLKHSEL EKQDNSWKET
910 920 930 940 950
RSEKIHDKEA VSEVELGGNG LKRTKSVSSM SEFESLLDCS PYLAGGDARG
960 970 980 990 1000
KKLPNNPAFG FVSSEPGDPE KDTKEKPGLS SRDCNHLGAL ACQDPPGRQM
1010 1020 1030 1040 1050
QRSYTAPDKT GIRVYYSPPV ARRLGVPVVH DKEGKIIIEP GFLFTTAKPK
1060 1070 1080 1090 1100
ESAEADGLAE SSYGRWLCNF SRQRLDGGSA GSPSAAGPGF PAALHDFEMS
1110 1120 1130 1140 1150
GNMSDDMKEI TNCVRQAMRS GSLERKVKST SSQTVGLASV GTQTIRTVSV
1160 1170 1180 1190 1200
GLQTDPPRSS LHGKAWSPRS SSLVSVRSKQ ISSSLDKVHS RIERPCCSPK
1210 1220 1230 1240 1250
YGSPKLQRRS VSKLDSSKDR SLWNLHQGKQ NGSAWARSTT TRDSPVLRNI
1260 1270 1280 1290 1300
NDGLSSLFSV VEHSGSTESV WKLGMSETRA KPEPPKYGIV QEFFRNVCGR
1310 1320 1330 1340 1350
APSPTSSAGE EGTKKPEPLS PASYHQPEGV ARILNKKAAK LGSSEEVRLT
1360 1370 1380 1390 1400
MLPQVGKDGV LRDGDGAVVL PNEDAVCDCS TQSLTSCFAR SSRSAIRHSP
1410 1420
SKCRLHPSES SWGGEERALP PSE
Length: 1,423
Mass (Da): 159,760
Last modified: December 16, 2008 - v2
Checksum: EE50F4D144ABD972
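The canonical sequence above is laid out in rows of 50 residues, grouped in tens, with ruler lines that carry position numbers. The following is a minimal sketch of turning that layout back into a plain residue string; `parse_sequence_block` and `EXCERPT` are hypothetical names, and the excerpt holds only the first two rows of this entry rather than the full sequence.

```python
def parse_sequence_block(text: str) -> str:
    """Strip ruler lines and spacing from a ruler-formatted sequence display."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines consist only of integers (10, 20, ...); skip them,
        # along with blank lines.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the isoform 1 display, copied from this entry.
EXCERPT = """\
        10         20         30         40         50
MLEMRDVYME EDVYQLQELR QQLDQASKTC RILQYRLRKA ERRSLRAAQT
60 70 80 90 100
GQVDGELIRG LEQDVKVSKD ISMRLHKELE VVEKKRARLE EENEELRQRL
"""

first_100 = parse_sequence_block(EXCERPT)
print(len(first_100), first_100[:10])
```

Applied to the complete display, the length of the returned string would be expected to match the Length field of the isoform.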
Isoform 2 (identifier: O94964-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-1: M → MEAPAAEPPV...DEIEELRAEM

Note: No experimental confirmation available. Gene prediction based on EST data.
Length: 1,661
Mass (Da): 183,858
Checksum: 95230B6E556F3D03
Isoform 3 (identifier: O94964-3)

The sequence of this isoform differs from the canonical sequence as follows:
     1-871: Missing.
     872-922: RLSQLQKAAD...EVELGGNGLK → MWDWAPTTSL...NNDVSSSVLR
     1374-1423: DAVCDCSTQSLTSCFARSSRSAIRHSPSKCRLHPSESSWGGEERALPPSE → VGGWDLSFLLVGGVSI

Length: 518
Mass (Da): 55,761
Checksum: 9F9141EDD1CC6005
Isoform 4 (identifier: O94964-4)

The sequence of this isoform differs from the canonical sequence as follows:
     1000-1016: MQRSYTAPDKTGIRVYY → KLPFLLILAPPQPPPIL
     1017-1423: Missing.

Length: 1,016
Mass (Da): 115,827
Checksum: 34E50C9A191F15ED

Sequence caution

The sequence BAA74912.2 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened. (Curated)

Natural variant

Feature key | Position(s) | Length | Description | Feature identifier
Natural variant | 993 – 993 | 1 | Q → H. Corresponds to variant rs34459518 (dbSNP, Ensembl). | VAR_056848

Alternative sequence

Feature key | Position(s) | Length | Description | Feature identifier
Alternative sequence | 1 – 871 | 871 | Missing in isoform 3. (3 Publications) | VSP_035976
Alternative sequence | 1 – 1 | 1 | M → MEAPAAEPPVRGCGPQPAPA PAPAPERKKSHRAPSPARPK DVAGWSLAKGRRGPGPGSAV ACSAAFSSRPDKKGRAVAPG ARGAGVRVAGVRTGVRAKGR PRSGAGPRPPPPPPSLTDSS SEVSDCASEEARLLGLELAL SSDAESAAGGPAGVRTGQPA QPAPSAQQPPRPPASPDEPS VAASSVGSSRLPLSASLAFS DLTEEMLDCGPSGLVRELEE LRSENDYLKDEIEELRAEM in isoform 2. (Curated) | VSP_035977
Alternative sequence | 872 – 922 | 51 | RLSQL…GNGLK → MWDWAPTTSLQEVNKTVLVF ALTQHTDQGGRPECALSVIS TNNDVSSSVLR in isoform 3. (3 Publications) | VSP_035978
Alternative sequence | 1000 – 1016 | 17 | MQRSY…IRVYY → KLPFLLILAPPQPPPIL in isoform 4. (1 Publication) | VSP_040825
Alternative sequence | 1017 – 1423 | 407 | Missing in isoform 4. (1 Publication) | VSP_040826
Alternative sequence | 1374 – 1423 | 50 | DAVCD…LPPSE → VGGWDLSFLLVGGVSI in isoform 3. (3 Publications) | VSP_035979
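The isoform lengths reported in this entry can be reproduced from the alternative-sequence features alone: each feature removes a span of the canonical sequence and inserts a replacement (possibly empty). A minimal sketch under that assumption, using the isoform 3 and isoform 4 features listed in this entry; `ISOFORM_EDITS` and `isoform_length` are illustrative names.

```python
CANONICAL_LENGTH = 1423  # length of isoform 1 (O94964-1)

# (start, end, replacement) with 1-based inclusive coordinates on the
# canonical sequence; an empty replacement means the span is missing.
ISOFORM_EDITS = {
    "O94964-3": [
        (1, 871, ""),
        (872, 922, "MWDWAPTTSLQEVNKTVLVFALTQHTDQGGRPECALSVISTNNDVSSSVLR"),
        (1374, 1423, "VGGWDLSFLLVGGVSI"),
    ],
    "O94964-4": [
        (1000, 1016, "KLPFLLILAPPQPPPIL"),
        (1017, 1423, ""),
    ],
}


def isoform_length(canonical_length: int, edits) -> int:
    """Apply the net length change of each edit to the canonical length."""
    length = canonical_length
    for start, end, replacement in edits:
        removed = end - start + 1
        length += len(replacement) - removed
    return length


for isoform, edits in ISOFORM_EDITS.items():
    print(isoform, isoform_length(CANONICAL_LENGTH, edits))
```

Only lengths are computed here; building the actual isoform sequence would additionally require the full canonical sequence and applying the edits from the C-terminus backwards so earlier coordinates stay valid.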

Sequence databases

AB020696 mRNA. Translation: BAA74912.2. Different initiation.
AK022023 mRNA. Translation: BAB13954.1.
AK126630 mRNA. Translation: BAC86621.1.
AL391602, AL079335, AL132768 Genomic DNA. Translation: CAI12788.3.
AL132768, AL079335, AL391602 Genomic DNA. Translation: CAI21460.3.
AL079335, AL132768, AL391602 Genomic DNA. Translation: CAI42289.3.
BC113405 mRNA. Translation: AAI13406.1.
BC113433 mRNA. Translation: AAI13434.1.
CCDS: CCDS46598.1. [O94964-4]
CCDS54459.1. [O94964-2]
RefSeq: NP_542194.2. NM_080627.3. [O94964-2]
NP_954650.2. NM_199181.2.
UniGene: Hs.460807.

Genome annotation databases

Ensembl: ENST00000237536; ENSP00000237536; ENSG00000149639. [O94964-2]
GeneID: 140710.
KEGG: hsa:140710.
UCSC: uc021wcx.2. human. [O94964-1]

Keywords - Coding sequence diversity

Alternative splicing, Polymorphism

Cross-references

Protocols and materials databases

Structural Biology Knowledgebase: Search...

Organism-specific databases

CTD: 140710.
GeneCards: SOGA1.
HGNC: HGNC:16111. SOGA1.
HPA: HPA043992.
neXtProt: NX_O94964.
PharmGKB: PA25657.
HUGE: Search...
GenAtlas: Search...

Miscellaneous databases

ChiTaRS: SOGA1. human.
GeneWiki: C20orf117.
GenomeRNAi: 140710.
PRO: O94964.

Family and domain databases

InterPro: IPR027882. DUF4482.
IPR027881. SOGA.
Pfam: PF11365. DUF3166. 2 hits.
PF14818. DUF4482. 1 hit.
ProtoNet: Search...

Publications

  1. "Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro."
    Nagase T., Ishikawa K., Suyama M., Kikuno R., Hirosawa M., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
    DNA Res. 5:355-364(1998)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
    Tissue: Brain.
  2. "Complete sequencing and characterization of 21,243 full-length human cDNAs."
    Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.
    , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
    Nat. Genet. 36:40-45(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 3 AND 4).
    Tissue: Cerebellum and Embryo.
  3. "The DNA sequence and comparative analysis of human chromosome 20."
    Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., Beasley O.P., Bird C.P., Blakey S.E.
    , Bridgeman A.M., Brown A.J., Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., Rogers J.
    Nature 414:865-871(2001)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  4. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
    Tissue: Brain.
  5. Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-931 AND SER-1017, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Tissue: Cervix carcinoma.
  6. Cited for: PROTEOLYTIC PROCESSING, SUBCELLULAR LOCATION, INDUCTION.
  7. "Toward a comprehensive characterization of a human cancer cell phosphoproteome."
    Zhou H., Di Palma S., Preisinger C., Peng M., Polat A.N., Heck A.J., Mohammed S.
    J. Proteome Res. 12:260-271(2013)
    Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1017, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Tissue: Cervix carcinoma and Erythroleukemia.

Entry information

Entry name: SOGA1_HUMAN
Accession: Primary (citable) accession number: O94964
Secondary accession number(s): A6NK10, Q14DB2, Q5JW51, Q6ZTG8
Entry history
Integrated into UniProtKB/Swiss-Prot: November 25, 2002
Last sequence update: December 16, 2008
Last modified: July 6, 2016
This is version 120 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.
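The primary and secondary accession numbers of this entry follow the UniProtKB accession format. The sketch below checks them against a regular expression that, to the best of our knowledge, matches the pattern given in the UniProt help pages; treat the pattern as an assumption and confirm it against the current documentation before relying on it. `ACCESSION_RE` and `is_accession` are illustrative names.

```python
import re

# Assumed UniProtKB accession pattern (6 or 10 characters); verify against
# the UniProt help pages before using it for anything important.
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(text: str) -> bool:
    """Return True if the whole string looks like a UniProtKB accession."""
    return ACCESSION_RE.fullmatch(text) is not None


# Primary and secondary accessions listed in this entry.
ACCESSIONS = ["O94964", "A6NK10", "Q14DB2", "Q5JW51", "Q6ZTG8"]

for accession in ACCESSIONS:
    print(accession, is_accession(accession))
```

Note that the entry name (SOGA1_HUMAN) and isoform identifiers such as O94964-2 are different kinds of identifier and are intentionally not matched by this pattern.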

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 20
    Human chromosome 20: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
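The UniRef90 and UniRef50 descriptions above each combine two thresholds: a minimum sequence identity and a minimum overlap with the seed sequence. The sketch below expresses only that membership test; it is not the UniRef clustering procedure itself, and the `Alignment` type, its fields, and `joins_cluster` are hypothetical names that assume identity and overlap have already been computed by an alignment tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Pre-computed comparison of a candidate against a cluster seed."""
    identity: float  # fraction of identical residues, 0.0 to 1.0
    overlap: float   # fraction of the seed (longest) sequence covered


def joins_cluster(aln: Alignment, min_identity: float,
                  min_overlap: float = 0.80) -> bool:
    """Check the identity and overlap thresholds described for UniRef."""
    return aln.identity >= min_identity and aln.overlap >= min_overlap


# Illustrative values only, not taken from any real alignment.
candidate = Alignment(identity=0.93, overlap=0.85)
print(joins_cluster(candidate, min_identity=0.90))  # UniRef90-style check
print(joins_cluster(candidate, min_identity=0.50))  # UniRef50-style check
```

Both conditions must hold: a highly identical but short fragment fails the overlap requirement even if its identity is well above the threshold.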