Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Protein SOGA1

Gene

SOGA1

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: -Experimental evidence at protein leveli

Functioni

Regulates autophagy by playing a role in the reduction of glucose production in an adiponectin- and insulin-dependent manner.By similarity

GO - Biological processi

  • insulin receptor signaling pathway Source: UniProtKB
  • negative regulation of gluconeogenesis Source: UniProtKB
  • regulation of autophagy Source: UniProtKB

Names & Taxonomyi

Protein namesi
Recommended name:
Protein SOGA1
Alternative name(s):
SOGA family member 1
Suppressor of glucose by autophagy
Suppressor of glucose, autophagy-associated protein 1
Cleaved into the following 2 chains:
C-terminal 80 kDa form
Short name:
80-kDa SOGA fragment
Gene namesi
Name:SOGA1
Synonyms:C20orf117, KIAA0889, SOGA
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 20

Organism-specific databases

EuPathDBiHostDB:ENSG00000149639.14
HGNCiHGNC:16111 SOGA1
neXtProtiNX_O94964

Subcellular locationi

Extracellular region or secreted Cytosol Plasma membrane Cytoskeleton Lysosome Endosome Peroxisome ER Golgi apparatus Nucleus Mitochondrion Manual annotation Automatic computational assertionGraphics by Christian Stolte; Source: COMPARTMENTS

Keywords - Cellular componenti

Secreted

Pathology & Biotechi

Organism-specific databases

OpenTargetsiENSG00000149639
PharmGKBiPA25657

Polymorphism and mutation databases

BioMutaiSOGA1

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000507811 – 1423Protein SOGA1Add BLAST1423
ChainiPRO_00004180541 – 688N-terminal formBy similarityAdd BLAST688
ChainiPRO_0000418055689 – 1421C-terminal 80 kDa formBy similarityAdd BLAST733

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei931PhosphoserineCombined sources1
Modified residuei1017PhosphoserineCombined sources1

Post-translational modificationi

Proteolytically cleaved in primary hepatocytes into a C-terminal 80 kDa form (By similarity). Proteolytically cleaved into a C-terminal SOGA 25 kDa form that is detected in plasma.By similarity1 Publication

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sitei688 – 689CleavageBy similarity2

Keywords - PTMi

Phosphoprotein

Proteomic databases

EPDiO94964
MaxQBiO94964
PaxDbiO94964
PeptideAtlasiO94964
PRIDEiO94964
ProteomicsDBi50579
50580 [O94964-2]
50581 [O94964-3]
50582 [O94964-4]

2D gel databases

UCD-2DPAGEiO94964

PTM databases

iPTMnetiO94964
PhosphoSitePlusiO94964

Expressioni

Inductioni

Up-regulated in the plasma by adiponectin in healthy fasting female.1 Publication

Gene expression databases

BgeeiENSG00000149639 Expressed in 183 organ(s), highest expression level in cerebellum
CleanExiHS_C20orf117
ExpressionAtlasiO94964 baseline and differential
GenevisibleiO94964 HS

Organism-specific databases

HPAiHPA043992

Interactioni

Subunit structurei

The C-terminal 25 kDa form occurs as a monomer.By similarity

Protein-protein interaction databases

BioGridi126666, 33 interactors
IntActiO94964, 28 interactors
MINTiO94964
STRINGi9606.ENSP00000237536

Structurei

3D structure databases

ProteinModelPortaliO94964
SMRiO94964
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

Belongs to the SOGA family.Curated

Phylogenomic databases

eggNOGiKOG4787 Eukaryota
ENOG410XUHJ LUCA
GeneTreeiENSGT00530000063889
HOGENOMiHOG000231278
HOVERGENiHBG080205
InParanoidiO94964
OMAiYSRWLCN
OrthoDBiEOG091G009R
PhylomeDBiO94964
TreeFamiTF331853

Family and domain databases

InterProiView protein in InterPro
IPR027882 DUF4482
IPR027881 SOGA
PfamiView protein in Pfam
PF14818 DUF4482, 1 hit
PF11365 SOGA, 2 hits

Sequences (4+)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 4 isoformsi produced by alternative splicing. AlignAdd to basket

This entry has 4 described isoforms and 2 potential isoforms that are computationally mapped.Show allAlign All

Isoform 1 (identifier: O94964-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide
        10         20         30         40         50
MLEMRDVYME EDVYQLQELR QQLDQASKTC RILQYRLRKA ERRSLRAAQT
60 70 80 90 100
GQVDGELIRG LEQDVKVSKD ISMRLHKELE VVEKKRARLE EENEELRQRL
110 120 130 140 150
IETELAKQVL QTELERPREH SLKKRGTRSL GKADKKTLVQ EDSADLKCQL
160 170 180 190 200
HFAKEESALM CKKLTKLAKE NDSMKEELLK YRSLYGDLDS ALSAEELADA
210 220 230 240 250
PHSRETELKV HLKLVEEEAN LLSRRIVELE VENRGLRAEM DDMKDHGGGC
260 270 280 290 300
GGPEARLAFS ALGGGECGES LAELRRHLQF VEEEAELLRR SSAELEDQNK
310 320 330 340 350
LLLNELAKFR SEHELDVALS EDSCSVLSEP SQEELAAAKL QIGELSGKVK
360 370 380 390 400
KLQYENRVLL SNLQRCDLAS CQSTRPMLET DAEAGDSAQC VPAPLGETHE
410 420 430 440 450
SHAVRLCRAR EAEVLPGLRE QAALVSKAID VLVADANGFT AGLRLCLDNE
460 470 480 490 500
CADFRLHEAP DNSEGPRDTK LIHAILVRLS VLQQELNAFT RKADAVLGCS
510 520 530 540 550
VKEQQESFSS LPPLGSQGLS KEILLAKDLG SDFQPPDFRD LPEWEPRIRE
560 570 580 590 600
AFRTGDLDSK PDPSRSFRPY RAEDNDSYAS EIKELQLVLA EAHDSLRGLQ
610 620 630 640 650
EQLSQERQLR KEEADNFNQK MVQLKEDQQR ALLRREFELQ SLSLQRRLEQ
660 670 680 690 700
KFWSQEKNML VQESQQFKHN FLLLFMKLRW FLKRWRQGKV LPSEGDDFLE
710 720 730 740 750
VNSMKELYLL MEEEEINAQH SDNKACTGDS WTQNTPNEYI KTLADMKVTL
760 770 780 790 800
KELCWLLRDE RRGLTELQQQ FAKAKATWET ERAELKGHTS QMELKTGKGA
810 820 830 840 850
GERAGPDWKA ALQREREEQQ HLLAESYSAV MELTRQLQIS ERNWSQEKLQ
860 870 880 890 900
LVERLQGEKQ QVEQQVKELQ NRLSQLQKAA DPWVLKHSEL EKQDNSWKET
910 920 930 940 950
RSEKIHDKEA VSEVELGGNG LKRTKSVSSM SEFESLLDCS PYLAGGDARG
960 970 980 990 1000
KKLPNNPAFG FVSSEPGDPE KDTKEKPGLS SRDCNHLGAL ACQDPPGRQM
1010 1020 1030 1040 1050
QRSYTAPDKT GIRVYYSPPV ARRLGVPVVH DKEGKIIIEP GFLFTTAKPK
1060 1070 1080 1090 1100
ESAEADGLAE SSYGRWLCNF SRQRLDGGSA GSPSAAGPGF PAALHDFEMS
1110 1120 1130 1140 1150
GNMSDDMKEI TNCVRQAMRS GSLERKVKST SSQTVGLASV GTQTIRTVSV
1160 1170 1180 1190 1200
GLQTDPPRSS LHGKAWSPRS SSLVSVRSKQ ISSSLDKVHS RIERPCCSPK
1210 1220 1230 1240 1250
YGSPKLQRRS VSKLDSSKDR SLWNLHQGKQ NGSAWARSTT TRDSPVLRNI
1260 1270 1280 1290 1300
NDGLSSLFSV VEHSGSTESV WKLGMSETRA KPEPPKYGIV QEFFRNVCGR
1310 1320 1330 1340 1350
APSPTSSAGE EGTKKPEPLS PASYHQPEGV ARILNKKAAK LGSSEEVRLT
1360 1370 1380 1390 1400
MLPQVGKDGV LRDGDGAVVL PNEDAVCDCS TQSLTSCFAR SSRSAIRHSP
1410 1420
SKCRLHPSES SWGGEERALP PSE
Length:1,423
Mass (Da):159,760
Last modified:December 16, 2008 - v2
Checksum:iEE50F4D144ABD972
GO
Isoform 2 (identifier: O94964-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-1: M → MEAPAAEPPV...DEIEELRAEM

Note: No experimental confirmation available. Gene prediction based on EST data.
Show »
Length:1,661
Mass (Da):183,858
Checksum:i95230B6E556F3D03
GO
Isoform 3 (identifier: O94964-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-871: Missing.
     872-922: RLSQLQKAAD...EVELGGNGLK → MWDWAPTTSL...NNDVSSSVLR
     1374-1423: DAVCDCSTQSLTSCFARSSRSAIRHSPSKCRLHPSESSWGGEERALPPSE → VGGWDLSFLLVGGVSI

Show »
Length:518
Mass (Da):55,761
Checksum:i9F9141EDD1CC6005
GO
Isoform 4 (identifier: O94964-4) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1000-1016: MQRSYTAPDKTGIRVYY → KLPFLLILAPPQPPPIL
     1017-1423: Missing.

Show »
Length:1,016
Mass (Da):115,827
Checksum:i34E50C9A191F15ED
GO

Computationally mapped potential isoform sequencesi

There are 2 potential isoforms mapped to this entry.BLASTAlignShow allAdd to basket
EntryEntry nameProtein names
Gene namesLengthAnnotation
X6R3R3X6R3R3_HUMAN
Protein SOGA1
SOGA1
1,016Annotation score:
H0YDM2H0YDM2_HUMAN
Protein SOGA1
SOGA1
1,275Annotation score:

Sequence cautioni

The sequence BAA74912 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened.Curated

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural variantiVAR_056848993Q → H. Corresponds to variant dbSNP:rs34459518Ensembl.1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0359761 – 871Missing in isoform 3. 3 PublicationsAdd BLAST871
Alternative sequenceiVSP_0359771M → MEAPAAEPPVRGCGPQPAPA PAPAPERKKSHRAPSPARPK DVAGWSLAKGRRGPGPGSAV ACSAAFSSRPDKKGRAVAPG ARGAGVRVAGVRTGVRAKGR PRSGAGPRPPPPPPSLTDSS SEVSDCASEEARLLGLELAL SSDAESAAGGPAGVRTGQPA QPAPSAQQPPRPPASPDEPS VAASSVGSSRLPLSASLAFS DLTEEMLDCGPSGLVRELEE LRSENDYLKDEIEELRAEM in isoform 2. Curated1
Alternative sequenceiVSP_035978872 – 922RLSQL…GNGLK → MWDWAPTTSLQEVNKTVLVF ALTQHTDQGGRPECALSVIS TNNDVSSSVLR in isoform 3. 3 PublicationsAdd BLAST51
Alternative sequenceiVSP_0408251000 – 1016MQRSY…IRVYY → KLPFLLILAPPQPPPIL in isoform 4. 1 PublicationAdd BLAST17
Alternative sequenceiVSP_0408261017 – 1423Missing in isoform 4. 1 PublicationAdd BLAST407
Alternative sequenceiVSP_0359791374 – 1423DAVCD…LPPSE → VGGWDLSFLLVGGVSI in isoform 3. 3 PublicationsAdd BLAST50

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AB020696 mRNA Translation: BAA74912.2 Different initiation.
AK022023 mRNA Translation: BAB13954.1
AK126630 mRNA Translation: BAC86621.1
AL079335 Genomic DNA No translation available.
AL132768 Genomic DNA No translation available.
AL391602 Genomic DNA No translation available.
BC113405 mRNA Translation: AAI13406.1
BC113433 mRNA Translation: AAI13434.1
CCDSiCCDS46598.1 [O94964-4]
CCDS54459.1 [O94964-2]
RefSeqiNP_542194.2, NM_080627.3 [O94964-2]
NP_954650.2, NM_199181.2
UniGeneiHs.460807

Genome annotation databases

EnsembliENST00000237536; ENSP00000237536; ENSG00000149639 [O94964-2]
GeneIDi140710
KEGGihsa:140710
UCSCiuc021wcx.2 human [O94964-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Similar proteinsi

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AB020696 mRNA Translation: BAA74912.2 Different initiation.
AK022023 mRNA Translation: BAB13954.1
AK126630 mRNA Translation: BAC86621.1
AL079335 Genomic DNA No translation available.
AL132768 Genomic DNA No translation available.
AL391602 Genomic DNA No translation available.
BC113405 mRNA Translation: AAI13406.1
BC113433 mRNA Translation: AAI13434.1
CCDSiCCDS46598.1 [O94964-4]
CCDS54459.1 [O94964-2]
RefSeqiNP_542194.2, NM_080627.3 [O94964-2]
NP_954650.2, NM_199181.2
UniGeneiHs.460807

3D structure databases

ProteinModelPortaliO94964
SMRiO94964
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi126666, 33 interactors
IntActiO94964, 28 interactors
MINTiO94964
STRINGi9606.ENSP00000237536

PTM databases

iPTMnetiO94964
PhosphoSitePlusiO94964

Polymorphism and mutation databases

BioMutaiSOGA1

2D gel databases

UCD-2DPAGEiO94964

Proteomic databases

EPDiO94964
MaxQBiO94964
PaxDbiO94964
PeptideAtlasiO94964
PRIDEiO94964
ProteomicsDBi50579
50580 [O94964-2]
50581 [O94964-3]
50582 [O94964-4]

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000237536; ENSP00000237536; ENSG00000149639 [O94964-2]
GeneIDi140710
KEGGihsa:140710
UCSCiuc021wcx.2 human [O94964-1]

Organism-specific databases

CTDi140710
EuPathDBiHostDB:ENSG00000149639.14
GeneCardsiSOGA1
HGNCiHGNC:16111 SOGA1
HPAiHPA043992
neXtProtiNX_O94964
OpenTargetsiENSG00000149639
PharmGKBiPA25657
HUGEiSearch...
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG4787 Eukaryota
ENOG410XUHJ LUCA
GeneTreeiENSGT00530000063889
HOGENOMiHOG000231278
HOVERGENiHBG080205
InParanoidiO94964
OMAiYSRWLCN
OrthoDBiEOG091G009R
PhylomeDBiO94964
TreeFamiTF331853

Miscellaneous databases

ChiTaRSiSOGA1 human
GeneWikiiC20orf117
GenomeRNAii140710
PROiPR:O94964

Gene expression databases

BgeeiENSG00000149639 Expressed in 183 organ(s), highest expression level in cerebellum
CleanExiHS_C20orf117
ExpressionAtlasiO94964 baseline and differential
GenevisibleiO94964 HS

Family and domain databases

InterProiView protein in InterPro
IPR027882 DUF4482
IPR027881 SOGA
PfamiView protein in Pfam
PF14818 DUF4482, 1 hit
PF11365 SOGA, 2 hits
ProtoNetiSearch...

Entry informationi

Entry nameiSOGA1_HUMAN
AccessioniPrimary (citable) accession number: O94964
Secondary accession number(s): A6NK10
, Q14DB2, Q5JW51, Q6ZTG8
Entry historyiIntegrated into UniProtKB/Swiss-Prot: November 25, 2002
Last sequence update: December 16, 2008
Last modified: November 7, 2018
This is version 136 of the entry and version 2 of the sequence. See complete history.
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. Human chromosome 20
    Human chromosome 20: entries, gene names and cross-references to MIM
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again