Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

U1 small nuclear ribonucleoprotein 70 kDa

Gene

RNU1

Organism
Arabidopsis thaliana (Mouse-ear cress)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Mediates the splicing of pre-mRNA by binding to the loop I region of U1-snRNA.

GO - Molecular functioni

GO - Biological processi

  • mRNA splicing, via spliceosome Source: TAIR
Complete GO annotation...

Keywords - Molecular functioni

Ribonucleoprotein

Keywords - Biological processi

mRNA processing, mRNA splicing

Keywords - Ligandi

RNA-binding

Enzyme and pathway databases

ReactomeiR-ATH-72163. mRNA Splicing - Major Pathway.

Names & Taxonomyi

Protein namesi
Recommended name:
U1 small nuclear ribonucleoprotein 70 kDa
Short name:
U1 snRNP 70 kDa
Short name:
U1-70K
Short name:
snRNP70
Gene namesi
Name:RNU1
Ordered Locus Names:At3g50670
ORF Names:T3A5.50
OrganismiArabidopsis thaliana (Mouse-ear cress)
Taxonomic identifieri3702 [NCBI]
Taxonomic lineageiEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonsGunneridaePentapetalaerosidsmalvidsBrassicalesBrassicaceaeCamelineaeArabidopsis
Proteomesi
  • UP000006548 Componenti: Chromosome 3

Organism-specific databases

TAIRiAT3G50670.

Subcellular locationi

GO - Cellular componenti

  • commitment complex Source: GO_Central
  • nuclear speck Source: UniProtKB-SubCell
  • nucleus Source: TAIR
  • precatalytic spliceosome Source: GO_Central
  • U1 snRNP Source: GO_Central
  • U2-type prespliceosome Source: GO_Central
Complete GO annotation...

Keywords - Cellular componenti

Nucleus, Spliceosome

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000818851 – 427U1 small nuclear ribonucleoprotein 70 kDaAdd BLAST427

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei282PhosphoserineCombined sources1

Post-translational modificationi

Phosphorylated. The association and dissociation with SR45 is not affected by the phosphorylation status (PubMed:18414657).1 Publication

Keywords - PTMi

Phosphoprotein

Proteomic databases

PaxDbiQ42404.

PTM databases

iPTMnetiQ42404.

Expressioni

Tissue specificityi

Ubiquitous.

Gene expression databases

GenevisibleiQ42404. AT.

Interactioni

Subunit structurei

Component of the spliceosome. Interacts with CYP63, U2AF35A, U2AF35B, SRZ21, RSZ22, SR34, SR45, SR45A and SCL33.6 Publications

Binary interactionsi

WithEntry#Exp.IntActNotes
RSZ21O811274EBI-1633812,EBI-927172
RSZ22O811264EBI-1633812,EBI-1633829
SCL33Q9SEU43EBI-1633812,EBI-927103
SR45Q9SEE94EBI-1633812,EBI-1792008

Protein-protein interaction databases

BioGridi9548. 12 interactors.
IntActiQ42404. 9 interactors.
STRINGi3702.AT3G50670.1.

Structurei

3D structure databases

ProteinModelPortaliQ42404.
SMRiQ42404.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini138 – 216RRMPROSITE-ProRule annotationAdd BLAST79

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi256 – 280Arg/Glu-rich (mixed charge)Add BLAST25
Compositional biasi287 – 427Arg/Asp/Glu-rich (mixed charge)Add BLAST141

Sequence similaritiesi

Contains 1 RRM (RNA recognition motif) domain.PROSITE-ProRule annotation

Phylogenomic databases

eggNOGiKOG0113. Eukaryota.
COG0724. LUCA.
HOGENOMiHOG000236289.
InParanoidiQ42404.
KOiK11093.
OMAiFDYQNTA.
OrthoDBiEOG09360JBB.
PhylomeDBiQ42404.

Family and domain databases

Gene3Di3.30.70.330. 1 hit.
InterProiIPR012677. Nucleotide-bd_a/b_plait.
IPR000504. RRM_dom.
IPR022023. U1snRNP70_N.
[Graphical view]
PfamiPF00076. RRM_1. 1 hit.
PF12220. U1snRNP70_N. 1 hit.
[Graphical view]
SMARTiSM00360. RRM. 1 hit.
[Graphical view]
SUPFAMiSSF54928. SSF54928. 1 hit.
PROSITEiPS50102. RRM. 1 hit.
[Graphical view]

Sequences (2)i

Sequence statusi: Complete.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform Long (identifier: Q42404-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MGDSGDPFLR NPNAAVQARA KVQNRANVLQ LKLMGQSHPT GLTNNLLKLF
60 70 80 90 100
EPRPPLEYKP PPEKRKCPPY TGMAQFVSNF AEPGDPEYAP PKPEVELPSQ
110 120 130 140 150
KRERIHKLRL EKGVEKAAED LKKYDPNNDP NATGDPYKTL FVSRLNYESS
160 170 180 190 200
ESKIKREFES YGPIKRVHLV TDQLTNKPKG YAFIEYMHTR DMKAAYKQAD
210 220 230 240 250
GQKIDGRRVL VDVERGRTVP NWRPRRLGGG LGTSRVGGGE EIVGEQQPQG
260 270 280 290 300
RTSQSEEPSR PREEREKSRE KGKERERSRE LSHEQPRERS RDRPREDKHH
310 320 330 340 350
RDRDQGGRDR DRDSRRDRDR TRDRGDRDRR DRDRGRDRTS RDHDRDRSRK
360 370 380 390 400
KERDYEGGEY EHEGGGRSRE RDAEYRGEPE ETRGYYEDDQ GDTDRYSHRY
410 420
DKMEEDDFRY EREYKRSKRS ESREYVR
Length:427
Mass (Da):50,388
Last modified:November 1, 1996 - v1
Checksum:i2B35E824EF35A341
GO
Isoform Short (identifier: Q42404-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     168-204: HLVTDQLTNKPKGYAFIEYMHTRDMKAAYKQADGQKI → GYSEHSLAGSVRICVMASLSRALCSICFILSTKVFQG
     205-427: Missing.

Show »
Length:204
Mass (Da):22,849
Checksum:i129A38D92AF9A9A1
GO

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_005853168 – 204HLVTD…DGQKI → GYSEHSLAGSVRICVMASLS RALCSICFILSTKVFQG in isoform Short. 1 PublicationAdd BLAST37
Alternative sequenceiVSP_005854205 – 427Missing in isoform Short. 1 PublicationAdd BLAST223

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M93439 mRNA. Translation: AAD12773.1.
U52909 Genomic DNA. Translation: AAD12774.1.
U52909 Genomic DNA. Translation: AAD12775.1.
U52910 mRNA. Translation: AAD12776.1.
AL132979 Genomic DNA. Translation: CAB62436.1.
CP002686 Genomic DNA. Translation: AEE78692.1.
CP002686 Genomic DNA. Translation: AEE78693.1.
AY039874 mRNA. Translation: AAK63978.1.
AY094003 mRNA. Translation: AAM16264.1.
PIRiS71367.
RefSeqiNP_190636.1. NM_114927.4. [Q42404-1]
NP_850676.1. NM_180345.1. [Q42404-2]
UniGeneiAt.25571.

Genome annotation databases

EnsemblPlantsiAT3G50670.1; AT3G50670.1; AT3G50670. [Q42404-1]
GeneIDi824230.
GrameneiAT3G50670.1; AT3G50670.1; AT3G50670.
KEGGiath:AT3G50670.

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M93439 mRNA. Translation: AAD12773.1.
U52909 Genomic DNA. Translation: AAD12774.1.
U52909 Genomic DNA. Translation: AAD12775.1.
U52910 mRNA. Translation: AAD12776.1.
AL132979 Genomic DNA. Translation: CAB62436.1.
CP002686 Genomic DNA. Translation: AEE78692.1.
CP002686 Genomic DNA. Translation: AEE78693.1.
AY039874 mRNA. Translation: AAK63978.1.
AY094003 mRNA. Translation: AAM16264.1.
PIRiS71367.
RefSeqiNP_190636.1. NM_114927.4. [Q42404-1]
NP_850676.1. NM_180345.1. [Q42404-2]
UniGeneiAt.25571.

3D structure databases

ProteinModelPortaliQ42404.
SMRiQ42404.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi9548. 12 interactors.
IntActiQ42404. 9 interactors.
STRINGi3702.AT3G50670.1.

PTM databases

iPTMnetiQ42404.

Proteomic databases

PaxDbiQ42404.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblPlantsiAT3G50670.1; AT3G50670.1; AT3G50670. [Q42404-1]
GeneIDi824230.
GrameneiAT3G50670.1; AT3G50670.1; AT3G50670.
KEGGiath:AT3G50670.

Organism-specific databases

TAIRiAT3G50670.

Phylogenomic databases

eggNOGiKOG0113. Eukaryota.
COG0724. LUCA.
HOGENOMiHOG000236289.
InParanoidiQ42404.
KOiK11093.
OMAiFDYQNTA.
OrthoDBiEOG09360JBB.
PhylomeDBiQ42404.

Enzyme and pathway databases

ReactomeiR-ATH-72163. mRNA Splicing - Major Pathway.

Miscellaneous databases

PROiQ42404.

Gene expression databases

GenevisibleiQ42404. AT.

Family and domain databases

Gene3Di3.30.70.330. 1 hit.
InterProiIPR012677. Nucleotide-bd_a/b_plait.
IPR000504. RRM_dom.
IPR022023. U1snRNP70_N.
[Graphical view]
PfamiPF00076. RRM_1. 1 hit.
PF12220. U1snRNP70_N. 1 hit.
[Graphical view]
SMARTiSM00360. RRM. 1 hit.
[Graphical view]
SUPFAMiSSF54928. SSF54928. 1 hit.
PROSITEiPS50102. RRM. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiRU17_ARATH
AccessioniPrimary (citable) accession number: Q42404
Secondary accession number(s): Q42378
Entry historyi
Integrated into UniProtKB/Swiss-Prot: April 27, 2001
Last sequence update: November 1, 1996
Last modified: November 30, 2016
This is version 137 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programPlant Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Arabidopsis thaliana
    Arabidopsis thaliana: entries and gene names
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.