Protein

Upstream-binding protein 1

Gene

Ubp1

Organism
Mus musculus (Mouse)
Status
Reviewed. Annotation score: 4 out of 5. Experimental evidence at protein level.

Function

Functions as a transcriptional activator in a promoter context-dependent manner. Involved in regulation of the alpha-globin gene in erythroid cells. Activation of the alpha-globin promoter in erythroid cells is via synergistic interaction with TFCP2. Functions as a trans-acting factor that regulates the domestic strain CYP2D9 gene through specific association with the regulatory element SDI-A1. Binding to SDI-A1 depends on the type of nucleotide at position 299; binding is abolished by a nucleotide substitution at this position. Modulates the placental expression of CYP11A1 (By similarity). 2 publications.

GO - Molecular function

GO - Biological process

  • angiogenesis Source: MGI
  • regulation of transcription from RNA polymerase II promoter Source: GO_Central
  • transcription, DNA-templated Source: UniProtKB-KW

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name: Upstream-binding protein 1
Alternative name(s): Nuclear factor 2d9
Short name: NF2d9
Gene names
Name: Ubp1
Synonyms: Cp2b, Nf2d9
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 9

Organism-specific databases

MGI: MGI:104889. Ubp1.

Subcellular location

GO - Cellular component

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key | Position(s) | Description | Length
Chain (PRO_0000229027) | 1 – 540 | Upstream-binding protein 1 | 540

Amino acid modifications

Feature key | Position(s) | Description | Length
Modified residue | 22 | Phosphoserine (by similarity) | 1
Modified residue | 390 | Phosphoserine (combined sources) | 1
Modified residue | 393 | Phosphoserine (combined sources) | 1

Keywords - PTM

Phosphoprotein

Proteomic databases

EPD: Q811S7.
PaxDb: Q811S7.
PeptideAtlas: Q811S7.
PRIDE: Q811S7.

PTM databases

iPTMnet: Q811S7.
PhosphoSitePlus: Q811S7.

Expression

Tissue specificity

Ubiquitous. Highly expressed in erythroid cells. 2 publications.

Gene expression databases

Bgee: ENSMUSG00000009741.
CleanEx: MM_UBP1.
Genevisible: Q811S7. MM.

Interaction

Subunit structure

Interacts with TFCP2 and PIAS1, and is probably part of a complex containing TFCP2, UBP1 and PIAS1. 2 publications.

Protein-protein interaction databases

BioGrid: 204421. 9 interactors.
IntAct: Q811S7. 2 interactors.
STRING: 10090.ENSMUSP00000081946.

Structure

3D structure databases

ProteinModelPortal: Q811S7.

Family & Domains

Region

Feature key | Position(s) | Description | Length
Region | 1 – 40 | Transcriptional activation | 40
Region | 274 – 309 | Erythroid-specific transcriptional activation | 36
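The Length values in the feature tables count residues inclusively, so a range a – b covers b − a + 1 positions. A minimal sketch of that arithmetic follows; `feature_length` is a hypothetical helper name used only for illustration and is not part of any UniProt tool.

```python
def feature_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive range start..end."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


# Ranges copied from the Region table of this entry.
regions = {
    "Transcriptional activation": (1, 40),
    "Erythroid-specific transcriptional activation": (274, 309),
}

for name, (start, end) in regions.items():
    print(name, feature_length(start, end))
```

The same rule gives 540 for the chain feature spanning 1 – 540.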

Sequence similarities

Belongs to the grh/CP2 family. CP2 subfamily. (Curated)

Phylogenomic databases

eggNOG: KOG4091. Eukaryota.
        ENOG410XNZ6. LUCA.
GeneTree: ENSGT00760000119235.
HOGENOM: HOG000230625.
HOVERGEN: HBG053805.
InParanoid: Q811S7.
KO: K09275.
OMA: WPDTPTA.
OrthoDB: EOG091G0YV2.
PhylomeDB: Q811S7.
TreeFam: TF314132.

Family and domain databases

Gene3D: 1.10.150.50. 1 hit.
InterPro: IPR007604. CP2.
          IPR013761. SAM/pointed.
Pfam: PF04516. CP2. 1 hit.
SUPFAM: SSF47769. SSF47769. 1 hit.

Sequences (2)

Sequence status: Complete.

This entry describes 2 isoforms produced by alternative splicing.

Isoform 1 (identifier: Q811S7-1)
Also known as: CP2b

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MAWVLSMDEV IESGLVHDFD SSLSGIGQEL GAGAYSMSDV LALPIFKQED
        60         70         80         90        100
SSLSLEDEAK HPPFQYVMCA ATSPAVKLHD ETLTYLNQGQ SYEIRMLDNR
       110        120        130        140        150
KMGDMPELSG KLVKSIIRVV FHDRRLQYTE HQQLEGWKWN RPGDRLLDLD
       160        170        180        190        200
IPMSVGIIDT RTNPSQLNAV EFLWDPAKRT SAFIQVHCIS TEFTPRKHGG
       210        220        230        240        250
EKGVPFRIQV DTFKQNENGE YTDHLHSASC QIKVFKPKGA DRKQKNDREK
       260        270        280        290        300
MEKRTAHEKE KYQPSYDTTI LTEMRLEPII EDAVEHEQKK SSKRTLPADY
       310        320        330        340        350
GDSLAKRGSC SPWPDTPTAY VNNSPSPAPT FTSSQPSTCS VPDSNSSSPN
       360        370        380        390        400
HQGDGAAQAS GEQIQPSATT QETQQWLLKN RFSSYTRLFS NFSGADLLKL
       410        420        430        440        450
TKEDLVQICG AADGIRLYNS LKSRSVRPRL TIYVCQEQPS STALQGQPQA
       460        470        480        490        500
AGSGGESGGG TPSVYHAIYL EEMVASEVAR KLASVFNIPF HQINQVYRQG
       510        520        530        540
PTGIHILVSD QMVQNFQDET CFLFSTVKAE NNDGIHIILK
Length: 540
Mass (Da): 60,212
Last modified: June 1, 2003 - v1
Checksum: A0D66024E36CCEB9
Isoform 2 (identifier: Q811S7-2)
Also known as: CP2a

The sequence of this isoform differs from the canonical sequence as follows:
     274-309: Missing.

Length: 504
Mass (Da): 56,173
Checksum: E0F79D8580F830D7
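Because isoform 2 is described only as a difference from the canonical sequence, its residues can be reconstructed by deleting positions 274-309 from isoform 1. The sketch below does that with plain string slicing; the sequence is copied from this entry, while `delete_range` is a hypothetical helper name introduced for illustration.

```python
# Canonical isoform 1 sequence (Q811S7-1), copied from this entry.
CANONICAL = (
    "MAWVLSMDEVIESGLVHDFDSSLSGIGQELGAGAYSMSDVLALPIFKQED"
    "SSLSLEDEAKHPPFQYVMCAATSPAVKLHDETLTYLNQGQSYEIRMLDNR"
    "KMGDMPELSGKLVKSIIRVVFHDRRLQYTEHQQLEGWKWNRPGDRLLDLD"
    "IPMSVGIIDTRTNPSQLNAVEFLWDPAKRTSAFIQVHCISTEFTPRKHGG"
    "EKGVPFRIQVDTFKQNENGEYTDHLHSASCQIKVFKPKGADRKQKNDREK"
    "MEKRTAHEKEKYQPSYDTTILTEMRLEPIIEDAVEHEQKKSSKRTLPADY"
    "GDSLAKRGSCSPWPDTPTAYVNNSPSPAPTFTSSQPSTCSVPDSNSSSPN"
    "HQGDGAAQASGEQIQPSATTQETQQWLLKNRFSSYTRLFSNFSGADLLKL"
    "TKEDLVQICGAADGIRLYNSLKSRSVRPRLTIYVCQEQPSSTALQGQPQA"
    "AGSGGESGGGTPSVYHAIYLEEMVASEVARKLASVFNIPFHQINQVYRQG"
    "PTGIHILVSDQMVQNFQDETCFLFSTVKAENNDGIHIILK"
)


def delete_range(seq: str, start: int, end: int) -> str:
    """Remove residues start..end (1-based, inclusive) from seq."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("range outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return seq[:start - 1] + seq[end:]


isoform2 = delete_range(CANONICAL, 274, 309)

# The entry lists lengths of 540 (isoform 1) and 504 (isoform 2).
print(len(CANONICAL), len(isoform2))
```

The lengths printed here can be compared against the Length fields given for each isoform. Mass and checksum are not recomputed in this sketch.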

Experimental Info

Feature key | Position(s) | Description | Length
Sequence conflict | 287 | E → K in BAE24526 (PubMed:16141072). (Curated) | 1

Alternative sequence

Feature key | Position(s) | Description | Length
Alternative sequence (VSP_017731) | 274 – 309 | Missing in isoform 2. 3 publications. | 36
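All positions in this entry use canonical (isoform 1) numbering, so residues downstream of the 274 – 309 deletion sit 36 positions earlier in isoform 2. The following sketch shows that coordinate shift under the assumption of a single deletion; the function name and the resulting isoform 2 coordinates are derived here for illustration and are not listed in the entry.

```python
from typing import Optional

CANONICAL_LENGTH = 540
DEL_START, DEL_END = 274, 309  # residues missing in isoform 2 (VSP_017731)


def canonical_to_isoform2(pos: int) -> Optional[int]:
    """Map a 1-based canonical position to isoform 2, or None if deleted."""
    if not 1 <= pos <= CANONICAL_LENGTH:
        raise ValueError("position outside the canonical sequence")
    if pos < DEL_START:
        return pos  # upstream of the deletion: unchanged
    if pos <= DEL_END:
        return None  # inside the deleted segment: absent from isoform 2
    return pos - (DEL_END - DEL_START + 1)  # downstream: shifted by 36


# Canonical positions mentioned in this entry (modified residues, conflict).
for pos in (22, 287, 390, 393):
    print(pos, canonical_to_isoform2(pos))
```

Under this mapping the sequence conflict at position 287 falls inside the deleted segment and therefore has no counterpart in isoform 2.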

Sequence databases

EMBL / GenBank / DDBJ:
U20086 mRNA. Translation: AAC52244.1.
AY182062 mRNA. Translation: AAO24130.1.
BC139828 mRNA. Translation: AAI39829.1.
AK079815 mRNA. Translation: BAC37754.1.
AK143282 mRNA. Translation: BAE25332.1.
AK140942 mRNA. Translation: BAE24526.1.
CCDS: CCDS40788.1. [Q811S7-1]
      CCDS40789.1. [Q811S7-2]
PIR: I49257.
RefSeq: NP_001076788.1. NM_001083319.1. [Q811S7-1]
        NP_038727.1. NM_013699.2. [Q811S7-2]
        XP_006512114.2. XM_006512051.3. [Q811S7-1]
        XP_006512117.2. XM_006512054.3. [Q811S7-2]
UniGene: Mm.28052.
         Mm.386808.

Genome annotation databases

Ensembl: ENSMUST00000009885; ENSMUSP00000009885; ENSMUSG00000009741. [Q811S7-2]
         ENSMUST00000084885; ENSMUSP00000081946; ENSMUSG00000009741. [Q811S7-1]
         ENSMUST00000116492; ENSMUSP00000112192; ENSMUSG00000009741. [Q811S7-2]
GeneID: 22221.
KEGG: mmu:22221.
UCSC: uc009rwx.1. mouse. [Q811S7-1]
      uc009rwy.1. mouse. [Q811S7-2]

Keywords - Coding sequence diversity

Alternative splicing

Cross-references

Organism-specific databases

CTD: 7342.
MGI: MGI:104889. Ubp1.

Miscellaneous databases

ChiTaRS: Ubp1. mouse.
PRO: Q811S7.

Entry information

Entry name: UBIP1_MOUSE
Accession: Primary (citable) accession number: Q811S7
Secondary accession number(s): A4QPG4, Q3UPR3, Q3US11, Q60786, Q8C514
Entry history
Integrated into UniProtKB/Swiss-Prot: March 21, 2006
Last sequence update: June 1, 2003
Last modified: November 30, 2016
This is version 111 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Miscellaneous

Present in both domestic and wild mouse strains. Recognizes the genetic difference at position 299 in the SDI-A1 element.

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.