Protein

Protein SOGA3

Gene

SOGA3

Organism
Homo sapiens (Human)
Status
Reviewed - Annotation score: 3 out of 5 - Protein inferred from homology

Function

GO - Biological process

Names & Taxonomy

Protein names
Recommended name: Protein SOGA3
Gene names
Name: SOGA3
Synonyms: C6orf174
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 6

Organism-specific databases

HGNC: HGNC:21494. SOGA3.

Subcellular location

Topology

Feature key     Position(s)   Length   Description
Transmembrane   915 – 935     21       Helical (Sequence analysis)

GO - Cellular component

Keywords - Cellular component

Membrane

Pathology & Biotech

Organism-specific databases

PharmGKB: PA134983080.

Polymorphism and mutation databases

BioMuta: SOGA3.
DMDM: 74746351.

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description         Feature identifier
Signal peptide   1 – 21        21       Sequence analysis
Chain            22 – 947      926      Protein SOGA3       PRO_0000271352
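
The Length values in the feature tables of this entry follow from the positions, which are 1-based and inclusive at both ends. A minimal sketch of that arithmetic, using positions copied from this entry; the function name `feature_length` and the `FEATURES` table are illustrative and not part of any UniProt tool:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    if start < 1 or end < start:
        raise ValueError(f"invalid feature range: {start}-{end}")
    return end - start + 1


# Positions as listed in this entry (illustrative table, not an official format).
FEATURES = {
    "Signal peptide": (1, 21),
    "Chain": (22, 947),
    "Transmembrane": (915, 935),
    "Coiled coil 1": (342, 726),
    "Coiled coil 2": (811, 835),
}

if __name__ == "__main__":
    for name, (start, end) in FEATURES.items():
        print(f"{name}: {start}-{end}, length {feature_length(start, end)}")
```

The signal peptide (21 residues) and the chain (926 residues) together account for the full 947-residue sequence, which is consistent with the note that the displayed sequence is further processed into a mature form.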

Amino acid modifications

Feature key        Position(s)   Length   Description
Modified residue   569 – 569     1        Phosphoserine (By similarity)
Modified residue   781 – 781     1        Phosphoserine (By similarity)

Keywords - PTM

Phosphoprotein

Proteomic databases

EPD: Q5TF21.
MaxQB: Q5TF21.
PaxDb: Q5TF21.
PeptideAtlas: Q5TF21.
PRIDE: Q5TF21.

PTM databases

iPTMnet: Q5TF21.
PhosphoSite: Q5TF21.

Expression

Gene expression databases

Bgee: Q5TF21.
CleanEx: HS_C6orf174.
ExpressionAtlas: Q5TF21. baseline and differential.
Genevisible: Q5TF21. HS.

Organism-specific databases

HPA: HPA035388.
HPA: HPA035389.

Interaction

Protein-protein interaction databases

BioGrid: 132241. 7 interactions.
STRING: 9606.ENSP00000455908.

Structure

3D structure databases

ProteinModelPortal: Q5TF21.

Family & Domains

Coiled coil

Feature key   Position(s)   Length   Description
Coiled coil   342 – 726     385      Sequence analysis
Coiled coil   811 – 835     25       Sequence analysis

Compositional bias

Feature key          Position(s)   Length   Description
Compositional bias   72 – 81       10       Poly-Gln
Compositional bias   82 – 247      166      Gly-rich
Compositional bias   269 – 276     8        Poly-Ala

Sequence similarities

Belongs to the SOGA family. (Curated)

Keywords - Domain

Coiled coil, Signal, Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOG: KOG4787. Eukaryota.
eggNOG: ENOG410XUHJ. LUCA.
GeneTree: ENSGT00530000063889.
HOGENOM: HOG000111576.
HOVERGEN: HBG081115.
InParanoid: Q5TF21.
OMA: GHESARH.
OrthoDB: EOG70GMF8.
PhylomeDB: Q5TF21.
TreeFam: TF331853.

Family and domain databases

InterPro: IPR027881. SOGA.
Pfam: PF11365. DUF3166. 2 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q5TF21-1

        10         20         30         40         50
MSQPPIGGAA PATAAASPAA AATEARLHPE GSSRKQQRAQ SPARPRDSSL
        60         70         80         90        100
RQTIAATRSP VGAGTKLNSV RQQQLQQQQQ QGNKTGSRTG PPASIRGGGG
       110        120        130        140        150
GAEKATPLAP KGAAPGAVQP VAGAEAAPAA TLAALGGRRP GPPEEPPREL
       160        170        180        190        200
ESVPSKLGEP PPLGEGGGGG GEGGGAGGGS GEREGGAPQP PPPRGWRGKG
       210        220        230        240        250
VRAQQRGGSG GEGASPSPSS SSAGKTPGTG SRNSGSGVAG GGSGGGGSYW
       260        270        280        290        300
KEGCLQSELI QFHLKKERAA AAAAAAQMHA KNGGGSSSRS SPVSGPPAVC
       310        320        330        340        350
ETLAVASASP MAAAAEGPQQ SAEGSASGGG MQAAAPPSSQ PHPQQLQEQE
       360        370        380        390        400
EMQEEMEKLR EENETLKNEI DELRTEMDEM RDTFFEEDAC QLQEMRHELE
       410        420        430        440        450
RANKNCRILQ YRLRKAERKR LRYAQTGEID GELLRSLEQD LKVAKDVSVR
       460        470        480        490        500
LHHELENVEE KRTTTEDENE KLRQQLIEVE IAKQALQNEL EKMKELSLKR
       510        520        530        540        550
RGSKDLPKSE KKAQQTPTEE DNEDLKCQLQ FVKEEAALMR KKMAKIDKEK
       560        570        580        590        600
DRFEHELQKY RSFYGDLDSP LPKGEAGGPP STREAELKLR LRLVEEEANI
       610        620        630        640        650
LGRKIVELEV ENRGLKAELD DLRGDDFNGS ANPLMREQSE SLSELRQHLQ
       660        670        680        690        700
LVEDETELLR RNVADLEEQN KRITAELNKY KYKSGGHDSA RHHDNAKTEA
       710        720        730        740        750
LQEELKAARL QINELSGKVM QLQYENRVLM SNMQRYDLAS HLGIRGSPRD
       760        770        780        790        800
SDAESDAGKK ESDDDSRPPH RKREGPIGGE SDSEEVRNIR CLTPTRSFYP
       810        820        830        840        850
APGPWPKSFS DRQQMKDIRS EAERLGKTID RLIADTSTII TEARIYVANG
       860        870        880        890        900
DLFGLMDEED DGSRIREHEL LYRINAQMKA FRKELQTFID RLEVPKSADD
       910        920        930        940
RGAEEPISVS QMFQPIILLI LILVLFSSLS YTTIFKLVFL FTLFFVL
Length: 947
Mass (Da): 103,199
Last modified: December 21, 2004 - v1
Checksum: C920C1D4369F1737
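
The sequence above is displayed in blocks of ten residues with position rulers between the rows, so it has to be cleaned before the stated length can be checked. The following is a minimal sketch of that cleanup, assuming ruler rows contain only digits and whitespace; `clean_sequence` and `EXAMPLE` are illustrative names, and the example holds only the first and last rows of the displayed sequence rather than all 947 residues:

```python
def clean_sequence(block: str) -> str:
    """Drop ruler rows and whitespace from a block-formatted protein sequence."""
    residues = []
    for line in block.splitlines():
        compact = "".join(line.split())  # remove all spaces and tabs
        # Skip empty rows and ruler rows (digits only, e.g. "1020304050").
        if not compact or compact.isdigit():
            continue
        residues.append(compact)
    return "".join(residues)


# First and last rows of the displayed sequence, with their rulers.
EXAMPLE = """\
        10         20         30         40         50
MSQPPIGGAA PATAAASPAA AATEARLHPE GSSRKQQRAQ SPARPRDSSL
910 920 930 940
RGAEEPISVS QMFQPIILLI LILVLFSSLS YTTIFKLVFL FTLFFVL
"""

if __name__ == "__main__":
    seq = clean_sequence(EXAMPLE)
    print(len(seq), seq[:10])
```

Applied to the complete block, the cleaned string should have the 947 residues reported in the Length field.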

Sequence databases

EMBL / GenBank / DDBJ: AL096711 Genomic DNA. No translation available.
CCDS: CCDS43505.1.
RefSeq: NP_001012279.1. NM_001012279.2.
UniGene: Hs.319247.

Genome annotation databases

Ensembl: ENST00000525778; ENSP00000434570; ENSG00000214338.
GeneID: 387104.
KEGG: hsa:387104.
UCSC: uc003qbd.3. human.

Cross-references

Protocols and materials databases

DNASU: 387104.

Organism-specific databases

CTD: 387104.
GeneCards: SOGA3.
HGNC: HGNC:21494. SOGA3.
HPA: HPA035388.
HPA: HPA035389.
neXtProt: NX_Q5TF21.
PharmGKB: PA134983080.

Miscellaneous databases

GenomeRNAi: 387104.
PRO: Q5TF21.

Publications

  1. "The DNA sequence and analysis of human chromosome 6."
    Mungall A.J., Palmer S.A., Sims S.K., Edwards C.A., Ashurst J.L., Wilming L., Jones M.C., Horton R., Hunt S.E., Scott C.E., Gilbert J.G.R., Clamp M.E., Bethel G., Milne S., Ainscough R., Almeida J.P., Ambrose K.D., Andrews T.D., Ashwell R.I.S., Babbage A.K., Bagguley C.L., Bailey J., Banerjee R., Barker D.J., Barlow K.F., Bates K., Beare D.M., Beasley H., Beasley O., Bird C.P., Blakey S.E., Bray-Allen S., Brook J., Brown A.J., Brown J.Y., Burford D.C., Burrill W., Burton J., Carder C., Carter N.P., Chapman J.C., Clark S.Y., Clark G., Clee C.M., Clegg S., Cobley V., Collier R.E., Collins J.E., Colman L.K., Corby N.R., Coville G.J., Culley K.M., Dhami P., Davies J., Dunn M., Earthrowl M.E., Ellington A.E., Evans K.A., Faulkner L., Francis M.D., Frankish A., Frankland J., French L., Garner P., Garnett J., Ghori M.J., Gilby L.M., Gillson C.J., Glithero R.J., Grafham D.V., Grant M., Gribble S., Griffiths C., Griffiths M.N.D., Hall R., Halls K.S., Hammond S., Harley J.L., Hart E.A., Heath P.D., Heathcott R., Holmes S.J., Howden P.J., Howe K.L., Howell G.R., Huckle E., Humphray S.J., Humphries M.D., Hunt A.R., Johnson C.M., Joy A.A., Kay M., Keenan S.J., Kimberley A.M., King A., Laird G.K., Langford C., Lawlor S., Leongamornlert D.A., Leversha M., Lloyd C.R., Lloyd D.M., Loveland J.E., Lovell J., Martin S., Mashreghi-Mohammadi M., Maslen G.L., Matthews L., McCann O.T., McLaren S.J., McLay K., McMurray A., Moore M.J.F., Mullikin J.C., Niblett D., Nickerson T., Novik K.L., Oliver K., Overton-Larty E.K., Parker A., Patel R., Pearce A.V., Peck A.I., Phillimore B.J.C.T., Phillips S., Plumb R.W., Porter K.M., Ramsey Y., Ranby S.A., Rice C.M., Ross M.T., Searle S.M., Sehra H.K., Sheridan E., Skuce C.D., Smith S., Smith M., Spraggon L., Squares S.L., Steward C.A., Sycamore N., Tamlyn-Hall G., Tester J., Theaker A.J., Thomas D.W., Thorpe A., Tracey A., Tromans A., Tubby B., Wall M., Wallis J.M., West A.P., White S.S., Whitehead S.L., Whittaker H., Wild A., Willey D.J., Wilmer T.E., Wood J.M., Wray P.W., Wyatt J.C., Young L., Younger R.M., Bentley D.R., Coulson A., Durbin R.M., Hubbard T., Sulston J.E., Dunham I., Rogers J., Beck S.
    Nature 425:805-811(2003)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Entry information

Entry name: SOGA3_HUMAN
Accession: Primary (citable) accession number: Q5TF21
Entry history
Integrated into UniProtKB/Swiss-Prot: January 9, 2007
Last sequence update: December 21, 2004
Last modified: July 6, 2016
This is version 91 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 6
    Human chromosome 6: entries, gene names and cross-references to MIM
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
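
The UniRef90 and UniRef50 membership rules stated above reduce to two thresholds: a minimum sequence identity and a minimum overlap with the seed sequence. The sketch below encodes only those stated thresholds. It is not the UniRef clustering pipeline; the `Threshold` class, the `joins_cluster` function, and the identity and overlap inputs are hypothetical and would in practice come from a sequence alignment against the seed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threshold:
    min_identity: float  # fraction of identical residues, 0.0 to 1.0
    min_overlap: float   # fraction of the seed sequence covered, 0.0 to 1.0


# Values taken from the descriptions in this entry.
UNIREF90 = Threshold(min_identity=0.90, min_overlap=0.80)
UNIREF50 = Threshold(min_identity=0.50, min_overlap=0.80)


def joins_cluster(identity: float, overlap: float, threshold: Threshold) -> bool:
    """Return True if a sequence meets both thresholds relative to the seed."""
    return identity >= threshold.min_identity and overlap >= threshold.min_overlap


if __name__ == "__main__":
    # Hypothetical alignment result: 93% identity over 85% of the seed.
    print(joins_cluster(0.93, 0.85, UNIREF90))
    # Same identity but too little overlap with the seed.
    print(joins_cluster(0.93, 0.70, UNIREF90))
```

Both conditions must hold at once, so a short fragment that matches the seed perfectly still falls outside a UniRef90 or UniRef50 cluster if it covers less than 80% of the seed.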