Skip Header

 
Contribute Send feedback
Read comments (1) or add your own

Reviewed, UniProtKB/Swiss-Prot P18583 (SON_HUMAN)

Last modified June 16, 2009. Version 100. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (5) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Binary interactions · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    SON protein
Alternative name(s):
    SON3
    Negative regulatory element-binding protein
      Short name=NRE-binding protein
    DBP-5
    Bax antagonist selected in saccharomyces 1
      Short name=BASS1
Gene names
Name: SON
Synonyms: C21orf50, DBP5, KIAA1019, NREBP
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length2426 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Represses hepatitis B virus (HBV) core promoter activity and transcription of HBV genes and production of HBV virions. Binds to the consensus DNA sequence: 5'-GA[GT]AN[CG][AG]CC-3'. Might protect cells from apoptosis. Might be involved in pre-mRNA splicing By similarity.

Subcellular location

Nucleus speckle.

Tissue specificity

Widely expressed, with the higher expression seen in leukocyte and heart.

Domain

Contains 8 types of repeats which are distributed in 3 regions.

Miscellaneous

Colocalizes with the pre-mRNA splicing factor SFRS2/SC-35.

Sequence similarities

Contains 1 DRBM (double-stranded RNA-binding) domain.

Contains 1 G-patch domain.

Sequence caution

The sequence CAA44793.1 differs from that shown. Reason: Frameshift at positions 2315, 2412 and 2417.

The sequence CAA45282.1 differs from that shown. Reason: Frameshift at position 2412.

Ontologies

Keywords
   Cellular componentNucleus
   Coding sequence diversityAlternative splicing
Polymorphism
   DomainRepeat
   LigandDNA-binding
RNA-binding
   PTMAcetylation
Phosphoprotein
Gene Ontology (GO)
   Biological processanti-apoptosis Ref.12

Inferred from direct assay. Source: UniProtKB

   Cellular componentnuclear speck

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionDNA binding

Traceable author statement. Source: ProtInc

double-stranded RNA binding

Inferred from electronic annotation. Source: InterPro

protein binding

Inferred from physical interaction. Source: IntAct

Complete GO annotation...

Binary interactions

With

Entry

#Exp.

IntAct

Notes

YWHAGP619811EBI-1053513,EBI-359832

Alternative products

This entry describes 10 isoforms produced by alternative splicing. [Align] [Select]

Note: Experimental confirmation may be lacking for some isoforms.
Isoform F (identifier: P18583-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform A (identifier: P18583-2)

The sequence of this isoform differs from the canonical sequence as follows:
     687-687: V → Q
     688-1006: Missing.
     2410-2426: RNGALTRPNCMFFLNRY → INGSAYQPSFASPNKKHAKATAATVVLQAMGLVPKDLMANATCFRSASRR
Isoform B (identifier: P18583-3)

The sequence of this isoform differs from the canonical sequence as follows:
     2220-2303: PVDISTAMSE...KKDQFLRAAP → GRVKRQGRVR...SRLYSSRFWW
     2304-2426: Missing.
Isoform C (identifier: P18583-4)

The sequence of this isoform differs from the canonical sequence as follows:
     2296-2325: DQFLRAAPVTGGMGAVLMRKMGWREGEGLG → GQILVAVFLPRSVPAVLFTTLLLPRPRISS
     2326-2426: Missing.
Note: May be produced at very low levels due to a premature stop codon in the mRNA, leading to nonsense-mediated mRNA decay.
Isoform D (identifier: P18583-5)

The sequence of this isoform differs from the canonical sequence as follows:
     2410-2426: RNGALTRPNCMFFLNRY → INGSAYQPSFASPNKKHAKATAATVVLQAMGLVPKDLMANATCFRSASRR
Isoform E (identifier: P18583-6)

The sequence of this isoform differs from the canonical sequence as follows:
     2108-2108: K → F
     2109-2426: Missing.
Note: May be produced at very low levels due to a premature stop codon in the mRNA, leading to nonsense-mediated mRNA decay.
Isoform G (identifier: P18583-7)

The sequence of this isoform differs from the canonical sequence as follows:
     748-787: Missing.
Isoform H (identifier: P18583-8)

The sequence of this isoform differs from the canonical sequence as follows:
     687-689: VAQ → NVP
     690-2416: Missing.
Isoform I (identifier: P18583-9)

The sequence of this isoform differs from the canonical sequence as follows:
     770-770: S → SMDSQMLASNTMDSQMLASNTMDSQMLASSTMDSQMLATSS
Isoform J (identifier: P18583-10)

The sequence of this isoform differs from the canonical sequence as follows:
     2257-2282: IDAWAQLNSIPGQFTGSTGVQVLTQE → VCSSFLKKIIIYHQPTHTNVPVLMSK
     2283-2426: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 24262426SON protein
PRO_0000072037

Regions

Repeat1006 – 101161-1
Repeat1014 – 101961-2
Repeat1021 – 102661-3
Repeat1030 – 103561-4
Repeat1038 – 104361-5
Repeat1046 – 105161-6
Repeat1055 – 106061-7
Repeat1063 – 106861-8
Repeat1071 – 107661-9
Repeat1080 – 108561-10
Repeat1089 – 109461-11
Repeat1100 – 110561-12
Repeat1111 – 111661-13
Repeat1121 – 112661-14
Repeat1925 – 193172-1
Repeat1934 – 1952193-1
Repeat1953 – 195972-2
Repeat1960 – 196672-3
Repeat1967 – 197372-4
Repeat1974 – 198072-5
Repeat1981 – 198772-6
Repeat1988 – 199472-7
Repeat1995 – 2013193-2
Domain2305 – 235147G-patch
Domain2371 – 242656DRBM
Region726 – 89517017 X 10 AA tandem repeats of L-A-[ST]-[NSG]-[TS]-MDSQM
Region912 – 9887711 X 7 AA tandem repeats of [DR]-P-Y-R-[LI][AG][QHP]
Region1006 – 112612114 X 6 AA repeats of [ED]-R-S-M-M-S
Region1147 – 1179333 X 11 AA tandem repats of P-P-L-P-P-E-E-P-P-[TME]-[MTG]
Region1359 – 1390324 X 8 AA tandem repeats of V-L-E-SS-[AVT]-VT
Region1925 – 1994707 X 7 AA repeats of P-S-R-R-S-R-[TS]
Region1934 – 2013802 X 19 AA repeats of P-S-R-R-R-R-S-R-S-V-V-R-R-R-S-F-S-I-S
Region2013 – 2039273 X tandem repeats of [ST]-P-[VLI]-R-[RL]-[RK]-[RF]-S-R

Amino acid modifications

Modified residue941Phosphoserine Ref.15
Modified residue1521Phosphoserine Ref.21
Modified residue1541Phosphoserine Ref.18
Modified residue2831Phosphoserine Ref.15 Ref.18 Ref.14 Ref.17
Modified residue3481Phosphoserine Ref.20
Modified residue9101Phosphoserine Ref.14
Modified residue10511Phosphoserine Ref.20
Modified residue15561Phosphoserine Ref.21 Ref.20
Modified residue16511Phosphoserine Ref.14
Modified residue16881Phosphoserine By similarity
Modified residue16971Phosphoserine Ref.15 Ref.21 Ref.18 Ref.14 Ref.20
Modified residue17661Phosphoserine Ref.14
Modified residue17691Phosphoserine Ref.14
Modified residue17821Phosphoserine Ref.21
Modified residue17831Phosphoserine Ref.21
Modified residue17841Phosphoserine By similarity
Modified residue19481Phosphoserine Ref.21
Modified residue19501Phosphoserine Ref.21
Modified residue19541Phosphoserine Ref.21
Modified residue19611Phosphoserine By similarity
Modified residue20091Phosphoserine Ref.15 Ref.21
Modified residue20111Phosphoserine Ref.15 Ref.21
Modified residue20131Phosphoserine Ref.15 Ref.21 Ref.14
Modified residue20291Phosphoserine Ref.15
Modified residue20311Phosphoserine Ref.15
Modified residue20551N6-acetyllysine Ref.16
Modified residue21291Phosphoserine Ref.15 Ref.19

Natural variations

Alternative sequence687 – 6893VAQ → NVP in isoform H.
VSP_004411
Alternative sequence6871V → Q in isoform A.
VSP_004401
Alternative sequence688 – 1006319Missing in isoform A.
VSP_004402
Alternative sequence690 – 24161727Missing in isoform H.
VSP_004412
Alternative sequence748 – 78740Missing in isoform G.
VSP_004410
Alternative sequence7701S → SMDSQMLASNTMDSQMLASN TMDSQMLASSTMDSQMLATS S in isoform I.
VSP_004413
Alternative sequence21081K → F in isoform E.
VSP_004408
Alternative sequence2109 – 2426318Missing in isoform E.
VSP_004409
Alternative sequence2220 – 230384PVDIS…LRAAP → GRVKRQGRVRRQMKQPAASH LTVTRCNSLCGTKPQSEKHR IAENSVITSLPNIGPSLHLW EGSPRYNYLASRFASRLYSS RFWW in isoform B.
VSP_004404
Alternative sequence2257 – 228226IDAWA…VLTQE → VCSSFLKKIIIYHQPTHTNV PVLMSK in isoform J.
VSP_004414
Alternative sequence2283 – 2426144Missing in isoform J.
VSP_004415
Alternative sequence2296 – 232530DQFLR…GEGLG → GQILVAVFLPRSVPAVLFTT LLLPRPRISS in isoform C.
VSP_004406
Alternative sequence2304 – 2426123Missing in isoform B.
VSP_004405
Alternative sequence2326 – 2426101Missing in isoform C.
VSP_004407
Alternative sequence2410 – 242617RNGAL…FLNRY → INGSAYQPSFASPNKKHAKA TAATVVLQAMGLVPKDLMAN ATCFRSASRR in isoform A and isoform D.
VSP_004403
Natural variant8701T → A: dbSNP rs11908823.
VAR_056990
Natural variant15751R → C: dbSNP rs13047599.
VAR_056991

Experimental info

Sequence conflict1261E → K Ref.4
Sequence conflict1291Y → K Ref.4
Sequence conflict4731S → P Ref.2
Sequence conflict4731S → P Ref.6
Sequence conflict14021Y → S Ref.7
Sequence conflict14951N → I Ref.7
Sequence conflict15381N → S Ref.7
Sequence conflict16431I → II Ref.7
Sequence conflict16921L → I Ref.7
Sequence conflict16921L → R Ref.11
Sequence conflict16931A → R Ref.11
Sequence conflict1820 – 18212SS → PH Ref.7
Sequence conflict1820 – 18212SS → PH Ref.11
Sequence conflict19391R → S Ref.12
Sequence conflict20901E → V Ref.2
Sequence conflict20901E → V Ref.7
Sequence conflict21481P → F Ref.2
Sequence conflict21481P → F Ref.7
Sequence conflict2413 – 24164ALTR → SPYQ Ref.2

Sequences

Sequence LengthMass (Da)Tools
Isoform F [UniParc].

Last modified January 23, 2002. Version 3.
Checksum: BBD0E3A604791E03

FASTA2,426263,846
        10         20         30         40         50         60 
MATNIEQIFR SFVVSKFREI QQELSSGRNE GQLNGETNTP IEGNQAGDAA ASARSLPNEE 

        70         80         90        100        110        120 
IVQKIEEVLS GVLDTELRYK PDLKEGSRKS RCVSVQTDPT DEIPTKKSKK HKKHKNKKKK 

       130        140        150        160        170        180 
KKKEKEKKYK RQPEESESKT KSHDDGNIDL ESDSFLKFDS EPSAVALELP TRAFGPSETN 

       190        200        210        220        230        240 
ESPAVVLEPP VVSMEVSEPH ILETLKPATK TAELSVVSTS VISEQSEQSV AVMPEPSMTK 

       250        260        270        280        290        300 
ILDSFAAAPV PTTTLVLKSS EPVVTMSVEY QMKSVLKSVE STSPEPSKIM LVEPPVAKVL 

       310        320        330        340        350        360 
EPSETLVVSS ETPTEVYPEP STSTTMDFPE SSAIEALRLP EQPVDVPSEI ADSSMTRPQE 

       370        380        390        400        410        420 
LPELPKTTAL ELQESSVASA MELPGPPATS MPELQGPPVT PVLELPGPSA TPVPELPGPL 

       430        440        450        460        470        480 
STPVPELPGP PATAVPELPG PSVTPVPQLS QELPGLPAPS MGLEPPQEVP EPSVMAQELP 

       490        500        510        520        530        540 
GLPLVTAAVE LPEQPAVTVA MELTEQPVTT TELEQPVGMT TVEHPGHPEV TTATGLLGQP 

       550        560        570        580        590        600 
EATMVLELPG QPVATTALEL PGQPSVTGVP ELPGLPSATR ALELSGQPVA TGALELPGPL 

       610        620        630        640        650        660 
MAAGALEFSG QSGAAGALEL LGQPLATGVL ELPGQPGAPE LPGQPVATVA LEISVQSVVT 

       670        680        690        700        710        720 
TSELSTMTVS QSLEVPSTTA LESYNTVAQE LPTTLVGETS VTVGVDPLMA PESHILASNT 

       730        740        750        760        770        780 
METHILASNT MDSQMLASNT MDSQMLASNT MDSQMLASST MDSQMLATSS MDSQMLATSS 

       790        800        810        820        830        840 
MDSQMLATST MDSQMLATSS MDSQMLATSS MDSQMLATSS MDSQMLATSS MDSQMLATST 

       850        860        870        880        890        900 
MDSQMLATST MDSQMLATSS MDSQMLASGT MDSQMLASGT MDAQMLASGT MDAQMLASST 

       910        920        930        940        950        960 
QDSAMLGSKS PDPYRLAQDP YRLAQDPYRL GHDPYRLGHD AYRLGQDPYR LGHDPYRLTP 

       970        980        990       1000       1010       1020 
DPYRMSPRPY RIAPRSYRIA PRPYRLAPRP LMLASRRSMM MSYAAERSMM SSYERSMMSY 

      1030       1040       1050       1060       1070       1080 
ERSMMSPMAE RSMMSAYERS MMSAYERSMM SPMAERSMMS AYERSMMSAY ERSMMSPMAD 

      1090       1100       1110       1120       1130       1140 
RSMMSMGADR SMMSSYSAAD RSMMSSYSAA DRSMMSSYTA DRSMMSMAAD SYTDSYTDTY 

      1150       1160       1170       1180       1190       1200 
TEAYMVPPLP PEEPPTMPPL PPEEPPMTPP LPPEEPPEGP ALPTEQSALT AENTWPTEVP 

      1210       1220       1230       1240       1250       1260 
SLPSEESVSQ PEPPVSQSEI SEPSAVPTDY SVSASDPSVL VSEAAVTVPE PPPEPESSIT 

      1270       1280       1290       1300       1310       1320 
LTPVESAVVA EEHEVVPERP VTCMVSETPA MSAEPTVLAS EPPVMSETAE TFDSMRASGH 

      1330       1340       1350       1360       1370       1380 
VASEVSTSLL VPAVTTPVLA ESILEPPAMA APESSAMAVL ESSAVTVLES STVTVLESST 

      1390       1400       1410       1420       1430       1440 
VTVLEPSVVT VPEPPVVAEP DYVTIPVPVV SALEPSVPVL EPAVSVLQPS MIVSEPSVSV 

      1450       1460       1470       1480       1490       1500 
QESTVTVSEP AVTVSEQTQV IPTEVAIEST PMILESSIMS SHVMKGINLS SGDQNLAPEI 

      1510       1520       1530       1540       1550       1560 
GMQEIALHSG EEPHAEEHLK GDFYESEHGI NIDLNINNHL IAKEMEHNTV CAAGTSPVGE 

      1570       1580       1590       1600       1610       1620 
IGEEKILPTS ETKQRTVLDT YPGVSEADAG ETLSSTGPFA LEPDATGTSK GIEFTTASTL 

      1630       1640       1650       1660       1670       1680 
SLVNKYDVDL SLTTQDTEHD MVISTSPSGG SEADIEGPLP AKDIHLDLPS NNNLVSKDTE 

      1690       1700       1710       1720       1730       1740 
EPLPVKESDQ TLAALLSPKE SSGGEKEVPP PPKETLPDSG FSANIEDINE ADLVRPLLPK 

      1750       1760       1770       1780       1790       1800 
DMERLTSLRA GIEGPLLASD VGRDRSAASP VVSSMPERAS ESSSEEKDDY EIFVKVKDTH 

      1810       1820       1830       1840       1850       1860 
EKSKKNKNRD KGEKEKKRDS SLRSRSKRSK SSEHKSRKRT SESRSRARKR SSKSKSHRSQ 

      1870       1880       1890       1900       1910       1920 
TRSRSRSRRR RRSSRSRSKS RGRRSVSKEK RKRSPKHRSK SRERKRKRSS SRDNRKTVRA 

      1930       1940       1950       1960       1970       1980 
RSRTPSRRSR SHTPSRRRRS RSVGRRRSFS ISPSRRSRTP SRRSRTPSRR SRTPSRRSRT 

      1990       2000       2010       2020       2030       2040 
PSRRSRTPSR RSRTPSRRRR SRSVVRRRSF SISPVRLRRS RTPLRRRFSR SPIRRKRSRS 

      2050       2060       2070       2080       2090       2100 
SERGRSPKRL TDLDKAQLLE IAKANAAAMC AKAGVPLPPN LKPAPPPTIE EKVAKKSGGA 

      2110       2120       2130       2140       2150       2160 
TIEELTEKCK QIAQSKEDDD VIVNKPHVSD EEEEEPPFYH HPFKLSEPKP IFFNLNIAAA 

      2170       2180       2190       2200       2210       2220 
KPTPPKSQVT LTKEFPVSSG SQHRKKEADS VYGEWVPVEK NGEENKDDDN VFSSNLPSEP 

      2230       2240       2250       2260       2270       2280 
VDISTAMSER ALAQKRLSEN AFDLEAMSML NRAQERIDAW AQLNSIPGQF TGSTGVQVLT 

      2290       2300       2310       2320       2330       2340 
QEQLANTGAQ AWIKKDQFLR AAPVTGGMGA VLMRKMGWRE GEGLGKNKEG NKEPILVDFK 

      2350       2360       2370       2380       2390       2400 
TDRKGLVAVG ERAQKRSGNF SAAMKDLSGK HPVSALMEIC NKRRWQPPEF LLVHDSGPDH 

      2410       2420 
RKHFLFRVLR NGALTRPNCM FFLNRY 

« Hide

Isoform A.

Checksum: 4CF239651020F78F
Show »

FASTA2,140232,323
Isoform B.

Checksum: B1D7B9CF021F057D
Show »

FASTA2,303250,404
Isoform C.

Checksum: 76F59AD2F29074CE
Show »

FASTA2,325252,270
Isoform D.

Checksum: 463FD501FEBA09F1
Show »

FASTA2,459267,051
Isoform E.

Checksum: 6F9D252454858B8D
Show »

FASTA2,108228,197
Isoform G.

Checksum: C8D378AE8E45292F
Show »

FASTA2,386259,610
Isoform H.

Checksum: 0D0EC41FB50DC72E
Show »

FASTA69973,874
Isoform I.

Checksum: AACA1B99E22ABE5B
Show »

FASTA2,466268,109
Isoform J.

Checksum: 13D1ECA78855B6D3
Show »

FASTA2,282247,839

References

« Hide 'large scale' references
[1]"From PREDs and open reading frames to cDNA isolation: revisiting the human chromosome 21 transcription map."
Reymond A., Friedli M., Neergaard Henrichsen C., Chapot F., Deutsch S., Ucla C., Rossier C., Lyle R., Guipponi M., Antonarakis S.E.
Genomics 78:46-54(2001) [PubMed: 11707072] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS A; B; C; D; E AND F).
[2]"Transcription repression of human hepatitis B virus genes by negative regulatory element-binding protein/SON."
Sun C.-T., Lo W.-Y., Wang I.-H., Lo Y.-H., Shiou S.-R., Lai C.-K., Ting L.-P.
J. Biol. Chem. 276:24059-24067(2001) [PubMed: 11306577] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM G), VARIANT CYS-1575.
Tissue: Liver.
[3]"Prediction of the coding sequences of unidentified human genes. XIV. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro."
Kikuno R., Nagase T., Ishikawa K., Hirosawa M., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
DNA Res. 6:197-205(1999) [PubMed: 10470851] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM B).
Tissue: Brain.
[4]"mRNA 5' region sequence incompleteness: a potential source of systematic errors in translation initiation codon assignment in human mRNAs."
Casadei R., Strippoli P., D'Addabbo P., Canaider S., Lenzi L., Vitale L., Giannone S., Frabetti F., Facchin F., Carinci P., Zannotti M.
Gene 321:185-193(2003) [PubMed: 14637006] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1-689 (ISOFORM H).
Tissue: Placenta.
[5]"Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. expand/collapse author list , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004) [PubMed: 14702039] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-130.
Tissue: Smooth muscle.
[6]"Human partial CDS from CD34+ stem cells."
Ye M., Zhang Q.-H., Zhou J., Shen Y., Wu X.-Y., Guan Z.Q., Wang L., Fan H.-Y., Mao Y.-F., Dai M., Huang Q.-H., Chen S.-J., Chen Z.
Submitted (MAY-1999) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-114, VARIANT CYS-1575.
Tissue: Umbilical cord blood.
[7]"Identification of a protein product of a novel human gene SON and the biological effect upon administering a changed form of this gene into mammalian cells."
Chumakov I.M., Berdichevskii F.B., Sokolova N.V., Reznikov M.V., Prasolov V.S.
Mol. Biol. (Mosk.) 25:731-740(1991) [PubMed: 1944255] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE OF 554-2426 (ISOFORM A).
[8]"The human SON gene: the large and small transcripts contains various 5'-terminal sequences."
Bliskovskii V.V., Kirillov A.V., Zakhariev V.M., Chumakov I.M.
Mol. Biol. (Mosk.) 26:807-812(1992) [PubMed: 1435774] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 709-1079 (ISOFORM I).
Tissue: Placenta.
[9]"Coding part of the son gene small transcript contains four areas of complete tandem repeats."
Bliskovskii V.V., Berdichevskii F.B., Tkachenko A.V., Belova M.E., Chumakov I.M.
Mol. Biol. (Mosk.) 26:793-806(1992) [PubMed: 1435773] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1009-1131.
Tissue: Placenta.
[10]"A cDNA clone for a novel nuclear protein with DNA binding activity."
Mattioni T., Hume C.R., Konigorski S., Hayes P., Osterweil Z., Lee J.S.
Chromosoma 101:618-624(1992) [PubMed: 1424986] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1145-2426 (ISOFORM F).
[11]"Decoding of the primary structure of the son3 region in human genome: identification of a new protein with unusual structure and homology with DNA-binding proteins."
Berdichevskii F.B., Chumakov I.M., Kiselev L.L.
Mol. Biol. (Mosk.) 22:794-801(1988) [PubMed: 3054499] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE OF 1692-2175 (ISOFORM A).
[12]"A selection system for human apoptosis inhibitors using yeast."
Greenhalf W., Lee J., Chaudhuri B.
Yeast 15:1307-1321(1999) [PubMed: 10509013] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1939-2426 (ISOFORM J).
Tissue: Cerebellum.
[13]"An unappreciated role for RNA surveillance."
Hillman R.T., Green R.E., Brenner S.E.
Genome Biol. 5:RESEARCH008.1-RESEARCH008.16(2004) [PubMed: 14759258] [Abstract]
Cited for: SPLICE ISOFORM(S) THAT ARE POTENTIAL NMD TARGET(S).
[14]"Large-scale characterization of HeLa cell nuclear phosphoproteins."
Beausoleil S.A., Jedrychowski M., Schwartz D., Elias J.E., Villen J., Li J., Cohn M.A., Cantley L.C., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 101:12130-12135(2004) [PubMed: 15302935] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-283; SER-910; SER-1651; SER-1697; SER-1766; SER-1769 AND SER-2013, MASS SPECTROMETRY.
Tissue: Epithelium.
[15]"Global, in vivo, and site-specific phosphorylation dynamics in signaling networks."
Olsen J.V., Blagoev B., Gnad F., Macek B., Kumar C., Mortensen P., Mann M.
Cell 127:635-648(2006) [PubMed: 17081983] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-94; SER-283; SER-1697; SER-2009; SER-2011; SER-2013; SER-2029; SER-2031 AND SER-2129, MASS SPECTROMETRY.
Tissue: Epithelium.
[16]"Substrate and functional diversity of lysine acetylation revealed by a proteomics survey."
Kim S.C., Sprung R., Chen Y., Xu Y., Ball H., Pei J., Cheng T., Kho Y., Xiao H., Xiao L., Grishin N.V., White M., Yang X.-J., Zhao Y.
Mol. Cell 23:607-618(2006) [PubMed: 16916647] [Abstract]
Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT LYS-2055, MASS SPECTROMETRY.
Tissue: Epithelium.
[17]"A probability-based approach for high-throughput protein phosphorylation analysis and site localization."
Beausoleil S.A., Villen J., Gerber S.A., Rush J., Gygi S.P.
Nat. Biotechnol. 24:1285-1292(2006) [PubMed: 16964243] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-283, MASS SPECTROMETRY.
Tissue: Epithelium.
[18]"Phosphoproteome analysis of the human mitotic spindle."
Nousiainen M., Sillje H.H.W., Sauer G., Nigg E.A., Koerner R.
Proc. Natl. Acad. Sci. U.S.A. 103:5391-5396(2006) [PubMed: 16565220] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-154; SER-283 AND SER-1697, MASS SPECTROMETRY.
Tissue: Epithelium.
[19]"Global proteomic profiling of phosphopeptides using electron transfer dissociation tandem mass spectrometry."
Molina H., Horn D.M., Tang N., Mathivanan S., Pandey A.
Proc. Natl. Acad. Sci. U.S.A. 104:2199-2204(2007) [PubMed: 17287340] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-2129, MASS SPECTROMETRY.
[20]"Kinase-selective enrichment enables quantitative phosphoproteomics of the kinome across the cell cycle."
Daub H., Olsen J.V., Bairlein M., Gnad F., Oppermann F.S., Korner R., Greff Z., Keri G., Stemmann O., Mann M.
Mol. Cell 31:438-448(2008) [PubMed: 18691976] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-348; SER-1051; SER-1556 AND SER-1697, MASS SPECTROMETRY.
[21]"A quantitative atlas of mitotic phosphorylation."
Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008) [PubMed: 18669648] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-152; SER-1556; SER-1697; SER-1782; SER-1783; SER-1948; SER-1950; SER-1954; SER-2009; SER-2011 AND SER-2013, MASS SPECTROMETRY.
+Additional computationally mapped references.

Cross-references

Sequence databases

AF380179 mRNA. Translation: AAL34497.1.
AF380180 mRNA. Translation: AAL34498.1.
AF380181 mRNA. Translation: AAL34499.1.
AF380182 mRNA. Translation: AAL34500.1.
AF380183 mRNA. Translation: AAL34501.1.
AF380184 mRNA. Translation: AAL34502.1.
AY026895 mRNA. Translation: AAK07692.1.
AB028942 mRNA. Translation: BAA82971.2. Different initiation.
AF435977 mRNA. Translation: AAL30810.1.
AK024752 mRNA. Translation: BAB14985.1.
AF161428 mRNA. Translation: AAF28988.1.
AF161430 mRNA. Translation: AAF28990.1.
X63751 mRNA. Translation: CAC69885.1.
X63753 mRNA. Translation: CAA45282.1. Frameshift.
X63071 mRNA. Translation: CAA44793.1. Frameshift.
M36428 Genomic DNA. Translation: AAA36624.1.
AF139897 mRNA. Translation: AAD50078.1.
IPIIPI00000192.
IPI00217930.
IPI00218617.
IPI00218618.
IPI00218619.
IPI00218620.
IPI00218621.
IPI00218622.
IPI00218624.
IPI00748258.
PIRS26650.
RefSeqNP_115571.1.
NP_620305.1.
UniGeneHs.517262

3D structure databases

ModBaseSearch...

Protein-protein interaction databases

IntActP18583. 1 interaction.

PTM databases

PhosphoSiteP18583.

Proteomic databases

PRIDEP18583.

Genome annotation databases

EnsemblENSG00000159140. Homo sapiens. [Contig view]
GeneID6651.
KEGGhsa:6651.

Organism-specific databases

GeneCardsGC21P033837.
HGNCHGNC:11183. SON.
MIM182465. gene.
PharmGKBPA27459.
HUGESearch...
GenAtlasSearch...

Phylogenomic databases

HOVERGENP18583.

Gene expression databases

ArrayExpressP18583.
BgeeP18583.
GermOnlineENSG00000159140. Homo sapiens.

Family and domain databases

InterProIPR001159. Ds-RNA_bd.
IPR014720. dsRNA-bd-like.
IPR000467. G_patch.
[Graphical view]
Gene3DG3DSA:3.30.160.20. dsRNA-bd-like. 1 hit.
PfamPF00035. dsrm. 1 hit.
PF01585. G-patch. 1 hit.
[Graphical view]
ProDomPD153432. Csurface_antigen. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTSM00443. G_patch. 1 hit.
[Graphical view]
PROSITEPS50137. DS_RBD. 1 hit.
PS50174. G_PATCH. 1 hit.
[Graphical view]
ProtoNetSearch...

Other Resources

NextBio25923.
SOURCESearch...

Entry information

Entry nameSON_HUMAN
AccessionPrimary (citable) accession number: P18583
Secondary accession number(s): O14487 expand/collapse secondary AC list , O95981, Q14120, Q9H7B1, Q9P070, Q9P072, Q9UKP9, Q9UPY0
Entry history
Integrated into UniProtKB/Swiss-Prot: November 1, 1990
Last sequence update: January 23, 2002
Last modified: June 16, 2009
This is version 100 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

Human chromosome 21

Human chromosome 21: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Binary interactions · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents