Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q9Y6X0 (SETBP_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified May 29, 2013. Version 102. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (5) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Web links·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
SET-binding protein

Short name=SEB
Gene names
Name:SETBP1
Synonyms:KIAA0437
OrganismHomo sapiens (Human) [Reference proteome]
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length1596 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Subunit structure

Interacts with SET. Ref.5

Subcellular location

Nucleus Ref.5.

Tissue specificity

Expressed in numerous tissues. Ref.5

Involvement in disease

Schinzel-Giedion midface retraction syndrome (SGMFS) [MIM:269150]: A disorder characterized by severe mental retardation, distinctive facial features, and multiple congenital malformations including skeletal abnormalities, genitourinary and renal malformations, cardiac defects, as well as a higher-than-normal prevalence of tumors, notably neuroepithelial neoplasia.
Note: The disease is caused by mutations affecting the gene represented in this entry. Ref.8

Sequence similarities

Contains 3 A.T hook DNA-binding domains.

Sequence caution

The sequence AAI46777.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

The sequence BAA24826.2 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened.

The sequence BAA82444.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

Ontologies

Keywords
   Cellular componentNucleus
   Coding sequence diversityAlternative splicing
Polymorphism
   DiseaseDisease mutation
   DomainRepeat
   LigandDNA-binding
   PTMPhosphoprotein
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Cellular_componentnucleus

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular_functionDNA binding

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q9Y6X0-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q9Y6X0-2)

The sequence of this isoform differs from the canonical sequence as follows:
     181-242: AYERPQKHST...QNCFISPESG → IKDSSKEEVW...SEPAVWAQEV
     243-1596: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 15961596SET-binding protein
PRO_0000097698

Regions

Repeat1520 – 152781
Repeat1528 – 153582
Repeat1536 – 154383
DNA binding584 – 59613A.T hook 1
DNA binding1016 – 102813A.T hook 2
DNA binding1451 – 146313A.T hook 3
Region1520 – 1543243 X 8 AA tandem repeats of P-P-L-P-P-P-P-P

Amino acid modifications

Modified residue12661Phosphoserine Ref.6
Modified residue12721Phosphoserine Ref.6

Natural variations

Alternative sequence181 – 24262AYERP…SPESG → IKDSSKEEVWKRRGGQGIPF KKQFLSQERAMCFSCPRNPF PAKPGSLTLPFHSEPAVWAQ EV in isoform 2.
VSP_039060
Alternative sequence243 – 15961354Missing in isoform 2.
VSP_039061
Natural variant2311V → L.
Corresponds to variant rs11082414 [ dbSNP | Ensembl ].
VAR_024347
Natural variant8681D → A in SGMFS. Ref.8
VAR_063806
Natural variant8681D → N in SGMFS. Ref.8
VAR_063807
Natural variant8701G → D in SGMFS. Ref.8
VAR_063808
Natural variant8701G → S in SGMFS. Ref.8
VAR_063809
Natural variant8711I → T in SGMFS. Ref.8
VAR_063810
Natural variant11011V → I. Ref.1 Ref.4 Ref.5
Corresponds to variant rs3744825 [ dbSNP | Ensembl ].
VAR_054646
Natural variant11301P → T.
Corresponds to variant rs1064204 [ dbSNP | Ensembl ].
VAR_020317
Natural variant11621R → W in a colorectal cancer sample; somatic mutation. Ref.7
VAR_035987

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified April 20, 2010. Version 3.
Checksum: 466A6E0A1A8EEF41

FASTA1,596175,008
        10         20         30         40         50         60 
MESRETLSSS RQRGGESDFL PVSSAKPPAA PGCAGEPLLS TPGPGKGIPV GGERMEPEEE 

        70         80         90        100        110        120 
DELGSGRDVD SNSNADSEKW VAGDGLEEQE FSIKEANFTE GSLKLKIQTT KRAKKPPKNL 

       130        140        150        160        170        180 
ENYICPPEIK ITIKQSGDQK VSRAGKNSKA TKEEERSHSK KKLLTASDLA ASDLKGFQPQ 

       190        200        210        220        230        240 
AYERPQKHST LHYDTGLPQD FTGDTLKPKH QQKSSSQNHM DWSTNSDSGP VTQNCFISPE 

       250        260        270        280        290        300 
SGRETASTSK IPALEPVASF AKAQGKKGSA GNTWSQLSNN NKDLLLGGVA PSPSSHSSPA 

       310        320        330        340        350        360 
PPSSSAECNG LQPLVDQDGG GTKEPPEPPT VGSKKKSSKK DVISQTIPNP DLDWVKNAQK 

       370        380        390        400        410        420 
AFDNTEGKRE GYSADSAQEA SPARQNVSSA SNPENDSSHV RITIPIKAPS LDPTNHKRKK 

       430        440        450        460        470        480 
RQSIKAVVEK IMPEKALASG ITMSSEVVNR ILSNSEGNKK DPRVPKLSKM IENESPSVGL 

       490        500        510        520        530        540 
ETGGNAEKVI PGGVSKPRKP PMVMTPPTCT DHSPSRKLPE IQHPKFAAKR RWTCSKPKPS 

       550        560        570        580        590        600 
TMLREAVMAT SDKLMLEPPS AYPITPSSPL YTNTDSLTVI TPVKKKRGRP KKQPLLTVET 

       610        620        630        640        650        660 
IHEGTSTSPV SPISREFPGT KKRKRRRNLA KLAQLVPGED KPMSEMKFHK KVGKLGVLDK 

       670        680        690        700        710        720 
KTIKTINKMK TLKRKNILNQ ILSCSSSVAL KAKAPPETSP GAAAIESKLG KQINVSKRGT 

       730        740        750        760        770        780 
IYIGKKRGRK PRAELPPPSE EPKTAIKHPR PVSSQPDVPA VPSNFQSLVA SSPAAMHPLS 

       790        800        810        820        830        840 
TQLGGSNGNL SPASTETNFS ELKTMPNLQP ISALPTKTQK GIHSGTWKLS PPRLMANSPS 

       850        860        870        880        890        900 
HLCEIGSLKE ITLSPVSESH SEETIPSDSG IGTDNNSTSD QAEKSSESRR RYSFDFCSLD 

       910        920        930        940        950        960 
NPEAIPSDTS TKNRHGHRQK HLIVDNFLAH ESLKKPKHKR KRKSLQNRDD LQFLADLEEL 

       970        980        990       1000       1010       1020 
ITKFQVFRIS HRSYTFYHEN PYPSIFRINF DHYYPVPYIQ YDPLLYLRRT SDLKSKKKRG 

      1030       1040       1050       1060       1070       1080 
RPAKTNDTMT KVPFLQGFSY PIPSGSYYAP YGMPYTSMPM MNLGYYGQYP APLYLSHTLG 

      1090       1100       1110       1120       1130       1140 
AASPFMRPTV PPPQFHTNSH VKMSGAAKHK AKHGVHLQGP VSMGLGDMQP SLNPPKVGSA 

      1150       1160       1170       1180       1190       1200 
SLSSGRLHKR KHKHKHKHKE DRILGTHDNL SGLFAGKATG FSSHILSERL SSADKELPLV 

      1210       1220       1230       1240       1250       1260 
SEKNKHKEKQ KHQHSEAGHK ASKNNFEVDT LSTLSLSDAQ HWTQAKEKGD LSSEPVDSCT 

      1270       1280       1290       1300       1310       1320 
KRYSGSGGDG GSTRSENLDV FSEMNPSNDK WDSDVSGSKR RSYEGFGTYR EKDIQAFKMN 

      1330       1340       1350       1360       1370       1380 
RKERSSYDSS MSPGMPSPHL KVDQTAVHSK NEGSVPTMMT RKKPAAVDSV TIPPAPVLSL 

      1390       1400       1410       1420       1430       1440 
LAASAATSDA VGSSLKKRFK RREIEAIQCE VRKMCNYTKI LSTKKNLDHV NKILKAKRLQ 

      1450       1460       1470       1480       1490       1500 
RQSKTGNNFV KKRRGRPRKQ PTQFDEDSRD QMPVLEKCID LPSKRGQKPS LSPLVLEPAA 

      1510       1520       1530       1540       1550       1560 
SQDTIMATIE AVIHMAREAP PLPPPPPPPL PPPPPPPLPP PPPLPKTPRG GKRKHKPQAP 

      1570       1580       1590 
AQPPQQSPPQ QPLPQEEEVK AKRQRKSRGS ESEVLP 

« Hide

Isoform 2 [UniParc].

Checksum: 185162E51CFF9310
Show »

FASTA24226,397

References

« Hide 'large scale' references
[1]"Prediction of the coding sequences of unidentified human genes. VIII. 78 new cDNA clones from brain which code for large proteins in vitro."
Ishikawa K., Nagase T., Nakajima D., Seki N., Ohira M., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
DNA Res. 4:307-313(1997) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), VARIANT ILE-1101.
Tissue: Brain.
[2]"Construction of expression-ready cDNA clones for KIAA genes: manual curation of 330 KIAA cDNA clones."
Nakajima D., Okazaki N., Yamakawa H., Kikuno R., Ohara O., Nagase T.
DNA Res. 9:99-106(2002) [PubMed] [Europe PMC] [Abstract]
Cited for: SEQUENCE REVISION.
[3]"DNA sequence and analysis of human chromosome 18."
Nusbaum C., Zody M.C., Borowsky M.L., Kamal M., Kodira C.D., Taylor T.D., Whittaker C.A., Chang J.L., Cuomo C.A., Dewar K., FitzGerald M.G., Yang X., Abouelleil A., Allen N.R., Anderson S., Bloom T., Bugalter B., Butler J. expand/collapse author list , Cook A., DeCaprio D., Engels R., Garber M., Gnirke A., Hafez N., Hall J.L., Norman C.H., Itoh T., Jaffe D.B., Kuroki Y., Lehoczky J., Lui A., Macdonald P., Mauceli E., Mikkelsen T.S., Naylor J.W., Nicol R., Nguyen C., Noguchi H., O'Leary S.B., Piqani B., Smith C.L., Talamas J.A., Topham K., Totoki Y., Toyoda A., Wain H.M., Young S.K., Zeng Q., Zimmer A.R., Fujiyama A., Hattori M., Birren B.W., Sakaki Y., Lander E.S.
Nature 437:551-555(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[4]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2), VARIANT ILE-1101.
Tissue: Brain.
[5]"Identification and characterization of SEB, a novel protein that binds to the acute undifferentiated leukemia-associated protein SET."
Minakuchi M., Kakazu N., Gorrin-Rivas M.J., Abe T., Copeland T.D., Ueda K., Adachi Y.
Eur. J. Biochem. 268:1340-1351(2001) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 54-1596 (ISOFORM 1), INTERACTION WITH SET, SUBCELLULAR LOCATION, TISSUE SPECIFICITY, VARIANT ILE-1101.
Tissue: Cervix carcinoma.
[6]"Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions."
Mayya V., Lundgren D.H., Hwang S.-I., Rezaul K., Wu L., Eng J.K., Rodionov V., Han D.K.
Sci. Signal. 2:RA46-RA46(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1266 AND SER-1272, MASS SPECTROMETRY.
Tissue: Leukemic T-cell.
[7]"The consensus coding sequences of human breast and colorectal cancers."
Sjoeblom T., Jones S., Wood L.D., Parsons D.W., Lin J., Barber T.D., Mandelker D., Leary R.J., Ptak J., Silliman N., Szabo S., Buckhaults P., Farrell C., Meeh P., Markowitz S.D., Willis J., Dawson D., Willson J.K.V. expand/collapse author list , Gazdar A.F., Hartigan J., Wu L., Liu C., Parmigiani G., Park B.H., Bachman K.E., Papadopoulos N., Vogelstein B., Kinzler K.W., Velculescu V.E.
Science 314:268-274(2006) [PubMed] [Europe PMC] [Abstract]
Cited for: VARIANT [LARGE SCALE ANALYSIS] TRP-1162.
[8]"De novo mutations of SETBP1 cause Schinzel-Giedion syndrome."
Hoischen A., van Bon B.W., Gilissen C., Arts P., van Lier B., Steehouwer M., de Vries P., de Reuver R., Wieskamp N., Mortier G., Devriendt K., Amorim M.Z., Revencu N., Kidd A., Barbosa M., Turner A., Smith J., Oley C. expand/collapse author list , Henderson A., Hayes I.M., Thompson E.M., Brunner H.G., de Vries B.B., Veltman J.A.
Nat. Genet. 42:483-485(2010) [PubMed] [Europe PMC] [Abstract]
Cited for: VARIANTS SGMFS ASN-868; ALA-868; ASP-870; SER-870 AND THR-871.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AB007897 mRNA. Translation: BAA24826.2. Different initiation.
AC015954 Genomic DNA. No translation available.
AC021766 Genomic DNA. No translation available.
AC090376 Genomic DNA. No translation available.
AC105074 Genomic DNA. No translation available.
AC120049 Genomic DNA. No translation available.
BC062338 mRNA. Translation: AAH62338.1.
BC146776 mRNA. Translation: AAI46777.1. Different initiation.
AB022660 mRNA. Translation: BAA82444.1. Different initiation.
IPIIPI00159049.
IPI00441195.
PIRT00063.
RefSeqNP_001123582.1. NM_001130110.1.
NP_056374.2. NM_015559.2.
UniGeneHs.435458.

3D structure databases

ProteinModelPortalQ9Y6X0.
ModBaseSearch...

Protein-protein interaction databases

IntActQ9Y6X0. 2 interactions.
STRING9606.ENSP00000282030.

PTM databases

PhosphoSiteQ9Y6X0.

Polymorphism databases

DMDM294862494.

Proteomic databases

PaxDbQ9Y6X0.
PRIDEQ9Y6X0.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000282030; ENSP00000282030; ENSG00000152217.
ENST00000426838; ENSP00000390687; ENSG00000152217.
GeneID26040.
KEGGhsa:26040.
UCSCuc002lay.3. human.
uc010dni.3. human.

Organism-specific databases

CTD26040.
GeneCardsGC18P042260.
HGNCHGNC:15573. SETBP1.
MIM269150. phenotype.
611060. gene.
neXtProtNX_Q9Y6X0.
Orphanet798. Schinzel-Giedion syndrome.
PharmGKBPA37982.
HUGESearch...
GenAtlasSearch...

Phylogenomic databases

eggNOGNOG317891.
HOGENOMHOG000154293.
HOVERGENHBG060433.
InParanoidQ9Y6X0.
OMAAVIHMAR.
OrthoDBEOG44TP74.

Gene expression databases

ArrayExpressQ9Y6X0.
BgeeQ9Y6X0.
CleanExHS_SETBP1.
GenevestigatorQ9Y6X0.
GermOnlineENSG00000152217. Homo sapiens.

Family and domain databases

InterProIPR017956. AT_hook_DNA-bd_motif.
[Graphical view]
SMARTSM00384. AT_hook. 3 hits.
[Graphical view]
ProtoNetSearch...

Other

ChiTaRSSETBP1. human.
GenomeRNAi26040.
NextBio47871.
SOURCESearch...

Entry information

Entry nameSETBP_HUMAN
AccessionPrimary (citable) accession number: Q9Y6X0
Secondary accession number(s): A6H8W5, Q6P6C3, Q9UEF3
Entry history
Integrated into UniProtKB/Swiss-Prot: April 13, 2004
Last sequence update: April 20, 2010
Last modified: May 29, 2013
This is version 102 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 18

Human chromosome 18: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families