Protein

Protein SOGA3

Gene

SOGA3

Organism
Homo sapiens (Human)
Status
Reviewed - Annotation score: 3 out of 5 - Protein inferred from homology

Function

Enzyme and pathway databases

BioCyc: ZFISH:G66-32870-MONOMER.

Names & Taxonomy

Protein names
Recommended name: Protein SOGA3
Gene names
Name: SOGA3
Synonyms: C6orf174
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 6

Organism-specific databases

HGNC: HGNC:21494. SOGA3.

Subcellular location

Topology

Feature key      Position(s)   Description                   Length
Transmembrane    915 – 935     Helical (Sequence analysis)   21

Keywords - Cellular component

Membrane

Pathology & Biotech

Organism-specific databases

OpenTargets: ENSG00000214338. ENSG00000255330.
PharmGKB: PA134983080.

Polymorphism and mutation databases

BioMuta: SOGA3.
DMDM: 74746351.

PTM / Processing

Molecule processing

Feature key      Position(s)   Description                      Length
Signal peptide   1 – 21        Sequence analysis                21
Chain            22 – 947      Protein SOGA3 (PRO_0000271352)   926
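
The Length values in this table follow from the positions, which are 1-based and inclusive: length = end - start + 1. A minimal Python sketch of that arithmetic, using the two rows above. The `feature_length` helper and the tuple layout are illustrative assumptions, not part of any UniProt tooling.

```python
# Hypothetical helper: feature positions are 1-based and inclusive,
# so a feature spanning start..end covers end - start + 1 residues.
def feature_length(start: int, end: int) -> int:
    if start < 1 or end < start:
        raise ValueError("positions must satisfy 1 <= start <= end")
    return end - start + 1

# Rows copied from the Molecule processing table.
features = [
    ("Signal peptide", 1, 21),
    ("Chain", 22, 947),
]

lengths = {name: feature_length(s, e) for name, s, e in features}

# The signal peptide and the mature chain together should tile the
# full precursor sequence.
total = sum(lengths.values())
```

Here `total` comes to 947, matching the sequence length reported for the entry.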

Amino acid modifications

Feature key        Position(s)   Description                     Length
Modified residue   569           Phosphoserine (By similarity)   1
Modified residue   781           Phosphoserine (By similarity)   1

Keywords - PTM

Phosphoprotein

Proteomic databases

EPD: Q5TF21.
MaxQB: Q5TF21.
PaxDb: Q5TF21.
PeptideAtlas: Q5TF21.
PRIDE: Q5TF21.

PTM databases

iPTMnet: Q5TF21.
PhosphoSitePlus: Q5TF21.

Expression

Gene expression databases

Bgee: ENSG00000214338.
CleanEx: HS_C6orf174.
ExpressionAtlas: Q5TF21. baseline and differential.
Genevisible: Q5TF21. HS.

Organism-specific databases

HPA: HPA035388. HPA035389.

Interaction

Protein-protein interaction databases

BioGrid: 132241. 7 interactors.
STRING: 9606.ENSP00000455908.

Structure

3D structure databases

ProteinModelPortal: Q5TF21.
SMR: Q5TF21.

Family & Domains

Coiled coil

Feature key   Position(s)   Description         Length
Coiled coil   342 – 726     Sequence analysis   385
Coiled coil   811 – 835     Sequence analysis   25

Compositional bias

Feature key          Position(s)   Description   Length
Compositional bias   72 – 81       Poly-Gln      10
Compositional bias   82 – 247      Gly-rich      166
Compositional bias   269 – 276     Poly-Ala      8
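
Feature positions such as 72 – 81 are 1-based and inclusive, whereas Python slices are 0-based and end-exclusive. The sketch below shows one way to map between the two, using residues 51-100 as they appear in the second row of this entry's sequence display. The `region` helper and `ROW_START` constant are hypothetical names introduced for illustration.

```python
# Residues 51-100 of the displayed sequence (second row); the grouping
# spaces are removed so that string indices map directly to residues.
ROW_START = 51  # 1-based position of the first residue in this fragment
row = "RQTIAATRSP VGAGTKLNSV RQQQLQQQQQ QGNKTGSRTG PPASIRGGGG".replace(" ", "")

def region(seq: str, seq_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position seq_start."""
    return seq[start - seq_start : end - seq_start + 1]

# The Poly-Gln compositional bias region from the table above.
poly_gln = region(row, ROW_START, 72, 81)
gln_fraction = poly_gln.count("Q") / len(poly_gln)
```

For this fragment `poly_gln` is `QQQLQQQQQQ`, so the annotated region is ten residues long and nine of them are glutamine.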

Sequence similarities

Belongs to the SOGA family. (Curated)

Keywords - Domain

Coiled coil, Signal, Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOG: KOG4787. Eukaryota.
eggNOG: ENOG410XUHJ. LUCA.
GeneTree: ENSGT00530000063889.
HOGENOM: HOG000111576.
HOVERGEN: HBG081115.
InParanoid: Q5TF21.
OMA: GHESARH.
OrthoDB: EOG091G03VQ.
PhylomeDB: Q5TF21.
TreeFam: TF331853.

Family and domain databases

InterPro: IPR027881. SOGA.
Pfam: PF11365. SOGA. 2 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q5TF21-1 [UniParc]

        10         20         30         40         50
MSQPPIGGAA PATAAASPAA AATEARLHPE GSSRKQQRAQ SPARPRDSSL
        60         70         80         90        100
RQTIAATRSP VGAGTKLNSV RQQQLQQQQQ QGNKTGSRTG PPASIRGGGG
       110        120        130        140        150
GAEKATPLAP KGAAPGAVQP VAGAEAAPAA TLAALGGRRP GPPEEPPREL
       160        170        180        190        200
ESVPSKLGEP PPLGEGGGGG GEGGGAGGGS GEREGGAPQP PPPRGWRGKG
       210        220        230        240        250
VRAQQRGGSG GEGASPSPSS SSAGKTPGTG SRNSGSGVAG GGSGGGGSYW
       260        270        280        290        300
KEGCLQSELI QFHLKKERAA AAAAAAQMHA KNGGGSSSRS SPVSGPPAVC
       310        320        330        340        350
ETLAVASASP MAAAAEGPQQ SAEGSASGGG MQAAAPPSSQ PHPQQLQEQE
       360        370        380        390        400
EMQEEMEKLR EENETLKNEI DELRTEMDEM RDTFFEEDAC QLQEMRHELE
       410        420        430        440        450
RANKNCRILQ YRLRKAERKR LRYAQTGEID GELLRSLEQD LKVAKDVSVR
       460        470        480        490        500
LHHELENVEE KRTTTEDENE KLRQQLIEVE IAKQALQNEL EKMKELSLKR
       510        520        530        540        550
RGSKDLPKSE KKAQQTPTEE DNEDLKCQLQ FVKEEAALMR KKMAKIDKEK
       560        570        580        590        600
DRFEHELQKY RSFYGDLDSP LPKGEAGGPP STREAELKLR LRLVEEEANI
       610        620        630        640        650
LGRKIVELEV ENRGLKAELD DLRGDDFNGS ANPLMREQSE SLSELRQHLQ
       660        670        680        690        700
LVEDETELLR RNVADLEEQN KRITAELNKY KYKSGGHDSA RHHDNAKTEA
       710        720        730        740        750
LQEELKAARL QINELSGKVM QLQYENRVLM SNMQRYDLAS HLGIRGSPRD
       760        770        780        790        800
SDAESDAGKK ESDDDSRPPH RKREGPIGGE SDSEEVRNIR CLTPTRSFYP
       810        820        830        840        850
APGPWPKSFS DRQQMKDIRS EAERLGKTID RLIADTSTII TEARIYVANG
       860        870        880        890        900
DLFGLMDEED DGSRIREHEL LYRINAQMKA FRKELQTFID RLEVPKSADD
       910        920        930        940
RGAEEPISVS QMFQPIILLI LILVLFSSLS YTTIFKLVFL FTLFFVL
Length: 947
Mass (Da): 103,199
Last modified: December 21, 2004 - v1
Checksum: C920C1D4369F1737
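
The sequence is displayed in groups of ten residues with position rulers between the rows, so it has to be flattened before any downstream use. The following is a minimal sketch of one way to do that in Python, shown on the first two rows only. The `parse_display` function is a hypothetical helper, and the rule that a ruler line contains only digits and whitespace is an assumption about this display format.

```python
import re

# First two rows of the sequence display, with their position rulers.
# Ruler spacing may vary; the parser below does not depend on it.
display = """\
        10         20         30         40         50
MSQPPIGGAA PATAAASPAA AATEARLHPE GSSRKQQRAQ SPARPRDSSL
60 70 80 90 100
RQTIAATRSP VGAGTKLNSV RQQQLQQQQQ QGNKTGSRTG PPASIRGGGG
"""

def parse_display(text: str) -> str:
    """Drop ruler lines (digits and whitespace only) and join the
    10-residue groups into one uninterrupted sequence string."""
    residues = []
    for line in text.splitlines():
        if not line.strip() or re.fullmatch(r"[\d\s]+", line):
            continue  # blank line or position ruler
        residues.append(line.replace(" ", ""))
    return "".join(residues)

fragment = parse_display(display)
```

For these two rows `fragment` holds 100 residues. Applied to the complete display, the result should contain the 947 residues reported under Length, which gives a quick sanity check on the copy.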

Sequence databases

EMBL/GenBank/DDBJ: AL096711 Genomic DNA. No translation available.
CCDS: CCDS43505.1.
RefSeq: NP_001012279.1. NM_001012279.2.
UniGene: Hs.319247.

Genome annotation databases

Ensembl: ENST00000525778; ENSP00000434570; ENSG00000214338.
GeneID: 387104.
KEGG: hsa:387104.
UCSC: uc003qbd.3. human.

Cross-references

Protocols and materials databases

DNASU: 387104.

Organism-specific databases

CTD: 387104.
GeneCards: SOGA3.
HGNC: HGNC:21494. SOGA3.
HPA: HPA035388. HPA035389.
neXtProt: NX_Q5TF21.
OpenTargets: ENSG00000214338. ENSG00000255330.
PharmGKB: PA134983080.

Miscellaneous databases

GenomeRNAi: 387104.
PRO: Q5TF21.

Entry information

Entry name: SOGA3_HUMAN
Accession: Primary (citable) accession number: Q5TF21
Entry history:
Integrated into UniProtKB/Swiss-Prot: January 9, 2007
Last sequence update: December 21, 2004
Last modified: November 2, 2016
This is version 94 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 6
    Human chromosome 6: entries, gene names and cross-references to MIM
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
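
The UniRef90 and UniRef50 descriptions above each combine two cut-offs: a minimum sequence identity and a minimum overlap with the seed sequence. The sketch below expresses only that membership test in Python. It is not the UniRef clustering procedure itself. The `can_join` function is a hypothetical name, and the identity and overlap fractions are assumed to have been computed beforehand from an alignment against the seed.

```python
# Thresholds taken from the UniRef descriptions above, as fractions.
THRESHOLDS = {
    "UniRef90": {"identity": 0.90, "overlap": 0.80},
    "UniRef50": {"identity": 0.50, "overlap": 0.80},
}

def can_join(level: str, identity: float, overlap: float) -> bool:
    """True if a sequence meets both the identity and the overlap
    cut-off relative to the seed (longest) sequence of a cluster."""
    t = THRESHOLDS[level]
    return identity >= t["identity"] and overlap >= t["overlap"]

# Example: high identity alone is not enough if the overlap is too short.
joins_90 = can_join("UniRef90", identity=0.93, overlap=0.85)
short_overlap = can_join("UniRef90", identity=0.93, overlap=0.70)
```

Keeping the two conditions separate makes it clear why a short fragment with near-perfect identity can still fall outside a UniRef90 or UniRef50 cluster.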