Q9Y4B5 (SOGA2_HUMAN) Reviewed, UniProtKB/Swiss-Prot
Last modified
April 3, 2013.
Version 82.
Names and origin
| Protein names | Recommended name: Protein SOGA2 Alternative name(s): Coiled-coil domain-containing protein 165 | ||||
| Gene names | Name: SOGA2 | ||||
| Organism | Homo sapiens (Human) [Reference proteome] | ||||
| Taxonomic identifier | 9606 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo |
Protein attributes
| Sequence length | 1905 AA. |
| Sequence status | Complete. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Post-translational modification | Isoform 4 is phosphorylated upon DNA damage, probably by ATM or ATR. |
| Sequence similarities | Belongs to the SOGA family. |
| Sequence caution | The sequence BAA34522.2 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened. |
Ontologies
| Keywords | |
|---|---|
| Coding sequence diversity | Alternative splicing Polymorphism |
| Domain | Coiled coil |
| PTM | Phosphoprotein |
| Technical term | Complete proteome Reference proteome |
| Gene Ontology (GO) | |
| None. | |
Alternative products
| This entry describes 4 isoforms produced by alternative splicing. | ||||||
| Isoform 1 (identifier: Q9Y4B5-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Isoform 2 (identifier: Q9Y4B5-2) The sequence of this isoform differs from the canonical sequence as follows: 1-360: Missing. | ||||||
| Note: No experimental confirmation available. | ||||||
| Isoform 3 (identifier: Q9Y4B5-3) The sequence of this isoform differs from the canonical sequence as follows: 1-360: Missing. 989-989: E → ELRGPPVLPEQSVSIEELQGQLVQAARLHQEETETFTNKIHK | ||||||
| Isoform 4 (identifier: Q9Y4B5-4) The sequence of this isoform differs from the canonical sequence as follows: 1-1004: Missing. 1187-1187: Q → QNCCGYPRINIEEETLGFTRLPAGSTVKTLKSLGLQRLE 1273-1300: Missing. 1894-1905: NQTVLLTAPWGL → ELPCSALAPS...LHGLSQYNSL | ||||||
| Note: Contains a phosphoserine at position 941. Contains a phosphoserine at position 975. |
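The isoform descriptions above are lists of edits (deletions and replacements) expressed in canonical-sequence coordinates. A minimal sketch of how such edits could be applied is given below. The function name `apply_isoform_edits`, the tuple layout, and the toy sequence are illustrative assumptions, not part of the entry, and the sketch assumes the edited ranges do not overlap.

```python
from typing import List, Optional, Tuple

# Hypothetical representation of one isoform edit:
# (start, end, replacement), with 1-based inclusive positions on the
# canonical sequence. A replacement of None stands for "Missing".
Edit = Tuple[int, int, Optional[str]]


def apply_isoform_edits(canonical: str, edits: List[Edit]) -> str:
    """Build an isoform sequence from a canonical sequence and its edits."""
    seq = canonical
    # Apply edits from the C-terminus towards the N-terminus so that the
    # canonical coordinates of the remaining edits stay valid.
    for start, end, replacement in sorted(edits, key=lambda e: e[0], reverse=True):
        if not (1 <= start <= end <= len(canonical)):
            raise ValueError(f"edit {start}-{end} is outside the canonical sequence")
        seq = seq[: start - 1] + (replacement or "") + seq[end:]
    return seq


# Toy example (NOT the SOGA2 sequence): delete residues 1-3 and
# replace residue 8 (K) with KLL.
toy_canonical = "MKTAYIAKQR"
toy_isoform = apply_isoform_edits(toy_canonical, [(1, 3, None), (8, 8, "KLL")])
```

With the real canonical sequence in place of the toy string, isoform 2 would correspond to the single edit `(1, 360, None)`.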
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||
|---|---|---|---|---|---|---|---|---|---|
Molecule processing | |||||||||
| Chain | 1 – 1905 | 1905 | Protein SOGA2 | PRO_0000280113 | |||||
Regions | |||||||||
| Coiled coil | 330 – 404 | 75 | Potential | ||||||
| Coiled coil | 432 – 483 | 52 | Potential | ||||||
| Coiled coil | 513 – 718 | 206 | Potential | ||||||
| Coiled coil | 1143 – 1201 | 59 | Potential | ||||||
| Coiled coil | 1238 – 1278 | 41 | Potential | ||||||
| Compositional bias | 42 – 319 | 278 | Pro-rich | ||||||
| Compositional bias | 54 – 198 | 145 | Ala-rich | ||||||
Amino acid modifications | |||||||||
| Modified residue | 77 | 1 | Phosphoserine Ref.12 | ||||||
| Modified residue | 263 | 1 | Phosphoserine Ref.12 | ||||||
| Modified residue | 549 | 1 | Phosphoserine Ref.9 Ref.10 Ref.11 | ||||||
| Modified residue | 618 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 621 | 1 | Phosphothreonine Ref.9 | ||||||
| Modified residue | 685 | 1 | Phosphoserine Ref.12 | ||||||
| Modified residue | 749 | 1 | Phosphoserine By similarity | ||||||
| Modified residue | 776 | 1 | Phosphoserine Ref.9 Ref.12 | ||||||
| Modified residue | 901 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 923 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1385 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1388 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1399 | 1 | Phosphoserine Ref.12 | ||||||
| Modified residue | 1417 | 1 | Phosphothreonine Ref.7 Ref.9 | ||||||
| Modified residue | 1421 | 1 | Phosphoserine Ref.7 Ref.9 | ||||||
| Modified residue | 1427 | 1 | Phosphotyrosine Ref.9 | ||||||
| Modified residue | 1561 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1578 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1583 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1592 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1661 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1667 | 1 | Phosphothreonine Ref.9 | ||||||
| Modified residue | 1675 | 1 | Phosphothreonine Ref.7 Ref.9 | ||||||
| Modified residue | 1679 | 1 | Phosphoserine Ref.7 | ||||||
| Modified residue | 1683 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1812 | 1 | Phosphoserine Ref.9 | ||||||
| Modified residue | 1814 | 1 | Phosphoserine Ref.9 | ||||||
Natural variations | |||||||||
| Alternative sequence | 1 – 1004 | 1004 | Missing in isoform 4. | VSP_023549 | |||||
| Alternative sequence | 1 – 360 | 360 | Missing in isoform 2 and isoform 3. | VSP_023550 | |||||
| Alternative sequence | 989 | 1 | E → ELRGPPVLPEQSVSIEELQGQLVQAARLHQEETETFTNKIHK in isoform 3. | VSP_023551 | |||||
| Alternative sequence | 1187 | 1 | Q → QNCCGYPRINIEEETLGFTRLPAGSTVKTLKSLGLQRLE in isoform 4. | VSP_023552 | |||||
| Alternative sequence | 1273 – 1300 | 28 | Missing in isoform 4. | VSP_023553 | |||||
| Alternative sequence | 1894 – 1905 | 12 | NQTVL…APWGL → ELPCSALAPSLEPCFSRPERPANRRPPSRWAPHSPTASQPQSPGDPTSLEEHGGEEPPEEQPHRDASLHGLSQYNSL in isoform 4. | VSP_023554 | |||||
| Natural variant | 602 | 1 | M → T. Corresponds to variant rs35739383. | VAR_055942 | |||||
| Natural variant | 861 | 1 | Q → R. Ref.1 Corresponds to variant rs1965665. | VAR_031073 | |||||
| Natural variant | 898 | 1 | D → G. Ref.3 Ref.9 Corresponds to variant rs3744979. | VAR_031074 | |||||
| Natural variant | 1097 | 1 | G → S. Ref.1 Corresponds to variant rs12386117. | VAR_031075 | |||||
| Natural variant | 1211 | 1 | K → Q. Corresponds to variant rs11874468. | VAR_031076 | |||||
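The "Alternative sequence" rows above are enough to work out the expected length of each isoform without the sequence itself: each row removes `end - start + 1` canonical residues and adds the length of its replacement. The sketch below does that arithmetic. The dictionary `ISOFORM_EDITS` and the function `isoform_length` are hypothetical names, and the values are transcribed from the table on the assumption that it is complete.

```python
CANONICAL_LENGTH = 1905  # "Sequence length" of isoform 1 in this entry

# (start, end, replacement) in canonical coordinates; "" means "Missing".
# Values transcribed from the "Alternative sequence" rows of the entry.
ISOFORM_EDITS = {
    "Q9Y4B5-2": [(1, 360, "")],
    "Q9Y4B5-3": [
        (1, 360, ""),
        (989, 989, "ELRGPPVLPEQSVSIEELQGQLVQAARLHQEETETFTNKIHK"),
    ],
    "Q9Y4B5-4": [
        (1, 1004, ""),
        (1187, 1187, "QNCCGYPRINIEEETLGFTRLPAGSTVKTLKSLGLQRLE"),
        (1273, 1300, ""),
        (1894, 1905, "ELPCSALAPSLEPCFSRPERPANRRPPSRWAPHSPTASQP"
                     "QSPGDPTSLEEHGGEEPPEEQPHRDASLHGLSQYNSL"),
    ],
}


def isoform_length(canonical_length: int, edits) -> int:
    """Length after applying non-overlapping edits to the canonical sequence."""
    length = canonical_length
    for start, end, replacement in edits:
        # Each edit removes (end - start + 1) residues and adds the replacement.
        length += len(replacement) - (end - start + 1)
    return length


lengths = {name: isoform_length(CANONICAL_LENGTH, edits)
           for name, edits in ISOFORM_EDITS.items()}
```

Under these assumptions isoform 4 comes out at 976 residues, which is consistent with the note that it carries phosphoserines at positions 941 and 975 in its own numbering.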
Sequences
References
| [1] | "Prediction of the coding sequences of unidentified human genes. XI. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro." Nagase T., Ishikawa K., Suyama M., Kikuno R., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O. DNA Res. 5:277-286(1998) Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2), VARIANTS ARG-861 AND SER-1097. Tissue: Brain. |
| [2] | Ohara O., Suyama M., Nagase T., Ishikawa K., Kikuno R. Submitted (JAN-2005) to the EMBL/GenBank/DDBJ databases Cited for: SEQUENCE REVISION. |
| [3] | "Complete sequencing and characterization of 21,243 full-length human cDNAs." Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. Sugano S. Nat. Genet. 36:40-45(2004) Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3), VARIANT GLY-898. Tissue: Cerebellum. |
| [4] | "DNA sequence and analysis of human chromosome 18." Nusbaum C., Zody M.C., Borowsky M.L., Kamal M., Kodira C.D., Taylor T.D., Whittaker C.A., Chang J.L., Cuomo C.A., Dewar K., FitzGerald M.G., Yang X., Abouelleil A., Allen N.R., Anderson S., Bloom T., Bugalter B., Butler J. Lander E.S. Nature 437:551-555(2005) Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. |
| [5] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4). Tissue: Testis. |
| [6] | "Global, in vivo, and site-specific phosphorylation dynamics in signaling networks." Olsen J.V., Blagoev B., Gnad F., Macek B., Kumar C., Mortensen P., Mann M. Cell 127:635-648(2006) Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. Tissue: Cervix carcinoma. |
| [7] | "A probability-based approach for high-throughput protein phosphorylation analysis and site localization." Beausoleil S.A., Villen J., Gerber S.A., Rush J., Gygi S.P. Nat. Biotechnol. 24:1285-1292(2006) Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-1417; SER-1421; THR-1675 AND SER-1679, MASS SPECTROMETRY. Tissue: Cervix carcinoma. |
| [8] | "Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient phosphoproteomic analysis." Cantin G.T., Yi W., Lu B., Park S.K., Xu T., Lee J.-D., Yates J.R. III J. Proteome Res. 7:1346-1351(2008) Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. Tissue: Cervix carcinoma. |
| [9] | "A quantitative atlas of mitotic phosphorylation." Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P. Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008) Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-549; SER-618; THR-621; SER-776; SER-901; SER-923; SER-1385; SER-1388; THR-1417; SER-1421; TYR-1427; SER-1561; SER-1578; SER-1583; SER-1592; SER-1661; THR-1667; THR-1675; SER-1683; SER-1812 AND SER-1814, PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-941 AND SER-975 (ISOFORM 4), VARIANT [LARGE SCALE ANALYSIS] GLY-898, MASS SPECTROMETRY. Tissue: Cervix carcinoma. |
| [10] | "Large-scale proteomics analysis of the human kinome." Oppermann F.S., Gnad F., Olsen J.V., Hornberger R., Greff Z., Keri G., Mann M., Daub H. Mol. Cell. Proteomics 8:1751-1764(2009) Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-549, MASS SPECTROMETRY. |
| [11] | "Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions." Mayya V., Lundgren D.H., Hwang S.-I., Rezaul K., Wu L., Eng J.K., Rodionov V., Han D.K. Sci. Signal. 2:RA46-RA46(2009) Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-549, MASS SPECTROMETRY. Tissue: Leukemic T-cell. |
| [12] | "Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis." Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L., Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., Mann M. Sci. Signal. 3:RA3-RA3(2010) Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-77; SER-263; SER-685; SER-776 AND SER-1399, PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-975 (ISOFORM 4), MASS SPECTROMETRY. Tissue: Cervix carcinoma. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | AB018345 mRNA. Translation: BAA34522.2. Different initiation. AK131528 mRNA. Translation: BAD18666.1. AP000864 Genomic DNA. No translation available. AP001531 Genomic DNA. No translation available. BC040542 mRNA. Translation: AAH40542.2. |
| IPI | IPI00375286. IPI00477620. IPI00792611. IPI00829780. |
| RefSeq | NP_056025.2. NM_015210.3. |
| UniGene | Hs.731797. |
3D structure databases | |
| ProteinModelPortal | Q9Y4B5. |
Protein-protein interaction databases | |
| IntAct | Q9Y4B5. 6 interactions. |
| STRING | 9606.ENSP00000352927. |
PTM databases | |
| PhosphoSite | Q9Y4B5. |
Polymorphism databases | |
| DMDM | 134048492. |
Proteomic databases | |
| PaxDb | Q9Y4B5. |
| PRIDE | Q9Y4B5. |
Genome annotation databases | |
| Ensembl | ENST00000306285; ENSP00000303670; ENSG00000168502. ENST00000306329; ENSP00000305027; ENSG00000168502. ENST00000359865; ENSP00000352927; ENSG00000168502. ENST00000400050; ENSP00000382924; ENSG00000168502. ENST00000517570; ENSP00000429556; ENSG00000168502. ENST00000518815; ENSP00000463465; ENSG00000168502. |
| GeneID | 23255. |
| KEGG | hsa:23255. |
| UCSC | uc002knq.2. human. uc002knr.2. human. uc002kns.2. human. |
Organism-specific databases | |
| CTD | 23255. |
| GeneCards | GC18P008708. |
| HGNC | HGNC:29121. SOGA2. |
| HPA | HPA046245. |
| neXtProt | NX_Q9Y4B5. |
| PharmGKB | PA128394616. |
Phylogenomic databases | |
| eggNOG | NOG80576. |
| HOVERGEN | HBG080205. |
Gene expression databases | |
| Bgee | Q9Y4B5. |
| CleanEx | HS_KIAA0802. |
| Genevestigator | Q9Y4B5. |
Family and domain databases | |
| InterPro | IPR021507. DUF3166. |
| Pfam | PF11365. DUF3166. 2 hits. |
Other | |
| GenomeRNAi | 23255. |
| NextBio | 44978. |
Entry information
| Entry name | SOGA2_HUMAN | ||||||||
| Accession | Primary (citable) accession number: Q9Y4B5 Secondary accession number(s): E9PAY7, Q6ZMQ9, Q8IWA9 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Human chromosome 18 Human chromosome 18: entries, gene names and cross-references to MIM |
| Human entries with polymorphisms or disease mutations List of human entries with polymorphisms or disease mutations |
| Human polymorphisms and disease mutations Index of human polymorphisms and disease mutations |
| SIMILARITY comments Index of protein domains and families |
