Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Heterogeneous nuclear ribonucleoprotein U

Gene

HNRNPU

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Component of the CRD-mediated complex that promotes MYC mRNA stabilization. Binds to pre-mRNA. Has high affinity for scaffold-attached region (SAR) DNA. Binds to double- and single-stranded DNA and RNA. Plays a role in the circadian regulation of the core clock component ARNTL/BMAL1 transcription (By similarity).By similarity1 Publication

Regions

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Nucleotide bindingi504 – 5118ATPSequence analysis

GO - Molecular functioni

  • ATP binding Source: UniProtKB-KW
  • core promoter binding Source: UniProtKB
  • DNA binding Source: ProtInc
  • enhancer binding Source: Ensembl
  • poly(A) RNA binding Source: UniProtKB
  • poly(G) binding Source: Ensembl
  • ribonucleoprotein complex binding Source: Ensembl
  • RNA binding Source: ProtInc
  • telomerase RNA binding Source: BHF-UCL

GO - Biological processi

  • cellular response to dexamethasone stimulus Source: Ensembl
  • circadian regulation of gene expression Source: UniProtKB
  • CRD-mediated mRNA stabilization Source: UniProtKB
  • gene expression Source: Reactome
  • mRNA splicing, via spliceosome Source: UniProtKB
  • negative regulation of telomere maintenance via telomerase Source: BHF-UCL
  • osteoblast differentiation Source: UniProtKB
  • positive regulation of DNA topoisomerase (ATP-hydrolyzing) activity Source: Ensembl
  • positive regulation of gene expression Source: Ensembl
  • RNA processing Source: ProtInc
Complete GO annotation...

Keywords - Molecular functioni

Ribonucleoprotein

Keywords - Biological processi

Biological rhythms, mRNA processing, mRNA splicing

Keywords - Ligandi

ATP-binding, DNA-binding, Nucleotide-binding, RNA-binding

Enzyme and pathway databases

ReactomeiR-HSA-72163. mRNA Splicing - Major Pathway.
R-HSA-72203. Processing of Capped Intron-Containing Pre-mRNA.
SIGNORiQ00839.

Names & Taxonomyi

Protein namesi
Recommended name:
Heterogeneous nuclear ribonucleoprotein U
Short name:
hnRNP U
Alternative name(s):
Scaffold attachment factor A
Short name:
SAF-A
p120
pp120
Gene namesi
Name:HNRNPU
Synonyms:HNRPU, SAFA, U21.1
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 1

Organism-specific databases

HGNCiHGNC:5048. HNRNPU.

Subcellular locationi

GO - Cellular componenti

  • catalytic step 2 spliceosome Source: UniProtKB
  • cell surface Source: UniProtKB-SubCell
  • CRD-mediated mRNA stability complex Source: UniProtKB
  • cytoplasmic ribonucleoprotein granule Source: ParkinsonsUK-UCL
  • extracellular matrix Source: BHF-UCL
  • intracellular ribonucleoprotein complex Source: UniProtKB
  • membrane Source: UniProtKB
  • nucleoplasm Source: HPA
  • nucleus Source: UniProtKB
Complete GO annotation...

Keywords - Cellular componenti

Cytoplasm, Nucleus, Spliceosome

Pathology & Biotechi

Organism-specific databases

PharmGKBiPA162391486.

Polymorphism and mutation databases

BioMutaiHNRNPU.
DMDMi254763463.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Initiator methionineiRemovedCombined sources3 Publications
Chaini2 – 825824Heterogeneous nuclear ribonucleoprotein UPRO_0000081872Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei2 – 21N-acetylserine; partialCombined sources2 Publications
Modified residuei4 – 41PhosphoserineCombined sources
Modified residuei59 – 591PhosphoserineCombined sources
Modified residuei66 – 661PhosphoserineCombined sources
Modified residuei255 – 2551CitrullineBy similarity
Modified residuei265 – 2651N6-acetyllysine; alternateCombined sources
Cross-linki265 – 265Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO1); alternateCombined sources
Cross-linki265 – 265Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2); alternateCombined sources
Modified residuei267 – 2671PhosphoserineCombined sources
Modified residuei271 – 2711PhosphoserineCombined sources
Modified residuei286 – 2861PhosphothreonineCombined sources
Modified residuei352 – 3521N6-acetyllysineCombined sources
Modified residuei516 – 5161N6-acetyllysineCombined sources
Modified residuei524 – 5241N6-acetyllysineCombined sources
Modified residuei532 – 5321PhosphothreonineCombined sources
Modified residuei551 – 5511N6-acetyllysineCombined sources
Modified residuei565 – 5651N6-acetyllysineCombined sources
Modified residuei582 – 5821PhosphothreonineCombined sources
Cross-linki609 – 609Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)Combined sources
Modified residuei635 – 6351N6-acetyllysineCombined sources
Cross-linki670 – 670Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)Combined sources
Modified residuei739 – 7391Dimethylated arginine; in A2780 ovarian carcinoma cell line1 Publication
Modified residuei814 – 8141N6-acetyllysineCombined sources
Isoform Short (identifier: Q00839-2)
Modified residuei215 – 2151N6-acetyllysineCombined sources

Post-translational modificationi

Extensively phosphorylated.1 Publication
Arg-739 is dimethylated, probably to asymmetric dimethylarginine.1 Publication
Citrullinated by PADI4.By similarity

Keywords - PTMi

Acetylation, Citrullination, Isopeptide bond, Methylation, Phosphoprotein, Ubl conjugation

Proteomic databases

EPDiQ00839.
MaxQBiQ00839.
PaxDbiQ00839.
PeptideAtlasiQ00839.
PRIDEiQ00839.

PTM databases

iPTMnetiQ00839.
PhosphoSiteiQ00839.
SwissPalmiQ00839.

Expressioni

Gene expression databases

BgeeiENSG00000153187.
CleanExiHS_HNRNPU.
ExpressionAtlasiQ00839. baseline and differential.
GenevisibleiQ00839. HS.

Organism-specific databases

HPAiCAB011532.
HPA041057.
HPA058707.

Interactioni

Subunit structurei

Identified in the spliceosome C complex. Component of the coding region determinant (CRD)-mediated complex, composed of DHX9, HNRNPU, IGF2BP1, SYNCRIP and YBX1. Identified in a IGF2BP1-dependent mRNP granule complex containing untranslated mRNAs. Interacts with IGF2BP1, IGF2BP2, IGF2BP3 and ERBB4. Ligand for CR2. Interacts with CRY1 (By similarity). Interacts with UBQLN2 (PubMed:25616961).By similarity7 Publications

Binary interactionsi

WithEntry#Exp.IntActNotes
CDKN2AP427712EBI-351126,EBI-375053
UBQLN2Q9UHD93EBI-351126,EBI-947187

Protein-protein interaction databases

BioGridi109433. 522 interactions.
DIPiDIP-684N.
IntActiQ00839. 305 interactions.
MINTiMINT-1654412.
STRINGi9606.ENSP00000283179.

Structurei

Secondary structure

1
825
Legend: HelixTurnBeta strand
Show more details
Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Turni24 – 252

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
1ZRJNMR-A23-42[»]
ProteinModelPortaliQ00839.
SMRiQ00839. Positions 3-48, 298-470.
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiQ00839.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini8 – 4235SAPPROSITE-ProRule annotationAdd
BLAST
Domaini267 – 464198B30.2/SPRYPROSITE-ProRule annotationAdd
BLAST

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni714 – 73926RNA-binding RGG-box1 PublicationAdd
BLAST

Compositional bias

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Compositional biasi2 – 160159Asp/Glu-rich (acidic)Add
BLAST
Compositional biasi84 – 9411Poly-GluAdd
BLAST
Compositional biasi161 – 20949Gln-richAdd
BLAST
Compositional biasi703 – 825123Gly-richAdd
BLAST
Compositional biasi740 – 75011Poly-GlyAdd
BLAST

Sequence similaritiesi

Contains 1 B30.2/SPRY domain.PROSITE-ProRule annotation
Contains 1 SAP domain.PROSITE-ProRule annotation

Phylogenomic databases

eggNOGiENOG410IR1W. Eukaryota.
ENOG410Y1WQ. LUCA.
GeneTreeiENSGT00390000020210.
HOGENOMiHOG000253920.
HOVERGENiHBG061101.
InParanoidiQ00839.
KOiK12888.
OMAiRQMADTG.
OrthoDBiEOG091G041T.
PhylomeDBiQ00839.
TreeFamiTF317301.

Family and domain databases

Gene3Di1.10.720.30. 1 hit.
3.40.50.300. 1 hit.
InterProiIPR001870. B30.2/SPRY.
IPR013320. ConA-like_dom.
IPR026745. hnRNP_U.
IPR027417. P-loop_NTPase.
IPR003034. SAP_dom.
IPR003877. SPRY_dom.
[Graphical view]
PANTHERiPTHR12381:SF11. PTHR12381:SF11. 3 hits.
PfamiPF02037. SAP. 1 hit.
PF00622. SPRY. 1 hit.
[Graphical view]
SMARTiSM00513. SAP. 1 hit.
SM00449. SPRY. 1 hit.
[Graphical view]
SUPFAMiSSF49899. SSF49899. 1 hit.
SSF52540. SSF52540. 1 hit.
PROSITEiPS50188. B302_SPRY. 1 hit.
PS50800. SAP. 1 hit.
[Graphical view]

Sequences (2)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform Long (identifier: Q00839-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MSSSPVNVKK LKVSELKEEL KKRRLSDKGL KAELMERLQA ALDDEEAGGR
60 70 80 90 100
PAMEPGNGSL DLGGDSAGRS GAGLEQEAAA GGDEEEEEEE EEEEGISALD
110 120 130 140 150
GDQMELGEEN GAAGAADSGP MEEEEAASED ENGDDQGFQE GEDELGDEEE
160 170 180 190 200
GAGDENGHGE QQPQPPATQQ QQPQQQRGAA KEAAGKSSGP TSLFAVTVAP
210 220 230 240 250
PGARQGQQQA GGKKKAEGGG GGGRPGAPAA GDGKTEQKGG DKKRGVKRPR
260 270 280 290 300
EDHGRGYFEY IEENKYSRAK SPQPPVEEED EHFDDTVVCL DTYNCDLHFK
310 320 330 340 350
ISRDRLSASS LTMESFAFLW AGGRASYGVS KGKVCFEMKV TEKIPVRHLY
360 370 380 390 400
TKDIDIHEVR IGWSLTTSGM LLGEEEFSYG YSLKGIKTCN CETEDYGEKF
410 420 430 440 450
DENDVITCFA NFESDEVELS YAKNGQDLGV AFKISKEVLA GRPLFPHVLC
460 470 480 490 500
HNCAVEFNFG QKEKPYFPIP EEYTFIQNVP LEDRVRGPKG PEEKKDCEVV
510 520 530 540 550
MMIGLPGAGK TTWVTKHAAE NPGKYNILGT NTIMDKMMVA GFKKQMADTG
560 570 580 590 600
KLNTLLQRAP QCLGKFIEIA ARKKRNFILD QTNVSAAAQR RKMCLFAGFQ
610 620 630 640 650
RKAVVVCPKD EDYKQRTQKK AEVEGKDLPE HAVLKMKGNF TLPEVAECFD
660 670 680 690 700
EITYVELQKE EAQKLLEQYK EESKKALPPE KKQNTGSKKS NKNKSGKNQF
710 720 730 740 750
NRGGGHRGRG GFNMRGGNFR GGAPGNRGGY NRRGNMPQRG GGGGGSGGIG
760 770 780 790 800
YPYPRAPVFP GRGSYSNRGN YNRGGMPNRG NYNQNFRGRG NNRGYKNQSQ
810 820
GYNQWQQGQF WGQKPWSQHY HQGYY
Length:825
Mass (Da):90,584
Last modified:July 28, 2009 - v6
Checksum:i5D4EC4188436831F
GO
Isoform Short (identifier: Q00839-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     213-231: Missing.

Show »
Length:806
Mass (Da):88,980
Checksum:iBCE1AD365501EDC1
GO

Sequence cautioni

The sequence AAC19382 differs from that shown.Aberrant splicing.Curated

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti603 – 6031A → V in AAH07950 (PubMed:15489334).Curated

Natural variant

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Natural varianti712 – 7121F → L.2 Publications
Corresponds to variant rs1052660 [ dbSNP | Ensembl ].
VAR_014712

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei213 – 23119Missing in isoform Short. 3 PublicationsVSP_005846Add
BLAST

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
X65488 mRNA. Translation: CAA46472.1.
AF068846 mRNA. Translation: AAC19382.1. Sequence problems.
BX323046 Genomic DNA. Translation: CAI11058.1.
BC003367 mRNA. Translation: AAH03367.1.
BC003621 mRNA. Translation: AAH03621.1.
BC007950 mRNA. Translation: AAH07950.2.
BC024767 mRNA. Translation: AAH24767.1.
BC034925 mRNA. Translation: AAH34925.1.
CCDSiCCDS31081.1. [Q00839-2]
CCDS41479.1. [Q00839-1]
PIRiS22765.
RefSeqiNP_004492.2. NM_004501.3. [Q00839-2]
NP_114032.2. NM_031844.2. [Q00839-1]
UniGeneiHs.106212.
Hs.411490.
Hs.743421.

Genome annotation databases

EnsembliENST00000283179; ENSP00000283179; ENSG00000153187. [Q00839-1]
ENST00000444376; ENSP00000393151; ENSG00000153187. [Q00839-2]
GeneIDi3192.
KEGGihsa:3192.
UCSCiuc001iaz.2. human. [Q00839-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
X65488 mRNA. Translation: CAA46472.1.
AF068846 mRNA. Translation: AAC19382.1. Sequence problems.
BX323046 Genomic DNA. Translation: CAI11058.1.
BC003367 mRNA. Translation: AAH03367.1.
BC003621 mRNA. Translation: AAH03621.1.
BC007950 mRNA. Translation: AAH07950.2.
BC024767 mRNA. Translation: AAH24767.1.
BC034925 mRNA. Translation: AAH34925.1.
CCDSiCCDS31081.1. [Q00839-2]
CCDS41479.1. [Q00839-1]
PIRiS22765.
RefSeqiNP_004492.2. NM_004501.3. [Q00839-2]
NP_114032.2. NM_031844.2. [Q00839-1]
UniGeneiHs.106212.
Hs.411490.
Hs.743421.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
1ZRJNMR-A23-42[»]
ProteinModelPortaliQ00839.
SMRiQ00839. Positions 3-48, 298-470.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi109433. 522 interactions.
DIPiDIP-684N.
IntActiQ00839. 305 interactions.
MINTiMINT-1654412.
STRINGi9606.ENSP00000283179.

PTM databases

iPTMnetiQ00839.
PhosphoSiteiQ00839.
SwissPalmiQ00839.

Polymorphism and mutation databases

BioMutaiHNRNPU.
DMDMi254763463.

Proteomic databases

EPDiQ00839.
MaxQBiQ00839.
PaxDbiQ00839.
PeptideAtlasiQ00839.
PRIDEiQ00839.

Protocols and materials databases

DNASUi3192.
Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000283179; ENSP00000283179; ENSG00000153187. [Q00839-1]
ENST00000444376; ENSP00000393151; ENSG00000153187. [Q00839-2]
GeneIDi3192.
KEGGihsa:3192.
UCSCiuc001iaz.2. human. [Q00839-1]

Organism-specific databases

CTDi3192.
GeneCardsiHNRNPU.
HGNCiHGNC:5048. HNRNPU.
HPAiCAB011532.
HPA041057.
HPA058707.
MIMi602869. gene.
neXtProtiNX_Q00839.
PharmGKBiPA162391486.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiENOG410IR1W. Eukaryota.
ENOG410Y1WQ. LUCA.
GeneTreeiENSGT00390000020210.
HOGENOMiHOG000253920.
HOVERGENiHBG061101.
InParanoidiQ00839.
KOiK12888.
OMAiRQMADTG.
OrthoDBiEOG091G041T.
PhylomeDBiQ00839.
TreeFamiTF317301.

Enzyme and pathway databases

ReactomeiR-HSA-72163. mRNA Splicing - Major Pathway.
R-HSA-72203. Processing of Capped Intron-Containing Pre-mRNA.
SIGNORiQ00839.

Miscellaneous databases

ChiTaRSiHNRNPU. human.
EvolutionaryTraceiQ00839.
GeneWikiiHNRPU.
GenomeRNAii3192.
PROiQ00839.
SOURCEiSearch...

Gene expression databases

BgeeiENSG00000153187.
CleanExiHS_HNRNPU.
ExpressionAtlasiQ00839. baseline and differential.
GenevisibleiQ00839. HS.

Family and domain databases

Gene3Di1.10.720.30. 1 hit.
3.40.50.300. 1 hit.
InterProiIPR001870. B30.2/SPRY.
IPR013320. ConA-like_dom.
IPR026745. hnRNP_U.
IPR027417. P-loop_NTPase.
IPR003034. SAP_dom.
IPR003877. SPRY_dom.
[Graphical view]
PANTHERiPTHR12381:SF11. PTHR12381:SF11. 3 hits.
PfamiPF02037. SAP. 1 hit.
PF00622. SPRY. 1 hit.
[Graphical view]
SMARTiSM00513. SAP. 1 hit.
SM00449. SPRY. 1 hit.
[Graphical view]
SUPFAMiSSF49899. SSF49899. 1 hit.
SSF52540. SSF52540. 1 hit.
PROSITEiPS50188. B302_SPRY. 1 hit.
PS50800. SAP. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiHNRPU_HUMAN
AccessioniPrimary (citable) accession number: Q00839
Secondary accession number(s): O75507
, Q8N174, Q96HY9, Q9BQ09
Entry historyi
Integrated into UniProtKB/Swiss-Prot: April 1, 1993
Last sequence update: July 28, 2009
Last modified: September 7, 2016
This is version 187 of the entry and version 6 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Human chromosome 1
    Human chromosome 1: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  6. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.