Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Splicing factor U2AF 35 kDa subunit

Gene

U2af1

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Experimental evidence at protein leveli

Functioni

Plays a critical role in both constitutive and enhancer-dependent splicing by mediating protein-protein interactions and protein-RNA interactions required for accurate 3'-splice site selection. Recruits U2 snRNP to the branch point. Directly mediates interactions between U2AF2 and proteins bound to the enhancers and thus may function as a bridge between U2AF2 and the enhancer complex to recruit it to the adjacent intron (By similarity).By similarity

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Zinc fingeri12 – 40C3H1-type 1PROSITE-ProRule annotationAdd BLAST29
Zinc fingeri149 – 176C3H1-type 2PROSITE-ProRule annotationAdd BLAST28

GO - Molecular functioni

  • metal ion binding Source: UniProtKB-KW
  • nucleotide binding Source: InterPro
  • pre-mRNA 3'-splice site binding Source: GO_Central
  • RS domain binding Source: MGI

GO - Biological processi

Complete GO annotation...

Keywords - Biological processi

mRNA processing, mRNA splicing

Keywords - Ligandi

Metal-binding, RNA-binding, Zinc

Enzyme and pathway databases

ReactomeiR-MMU-109688. Cleavage of Growing Transcript in the Termination Region.
R-MMU-159236. Transport of Mature mRNA derived from an Intron-Containing Transcript.
R-MMU-72163. mRNA Splicing - Major Pathway.
R-MMU-72187. mRNA 3'-end processing.

Names & Taxonomyi

Protein namesi
Recommended name:
Splicing factor U2AF 35 kDa subunit
Alternative name(s):
U2 auxiliary factor 35 kDa subunit
U2 snRNP auxiliary factor small subunit
Gene namesi
Name:U2af1
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 17

Organism-specific databases

MGIiMGI:98884. U2af1.

Subcellular locationi

GO - Cellular componenti

  • nuclear speck Source: UniProtKB
  • spliceosomal complex Source: GO_Central
  • U2AF Source: GO_Central
Complete GO annotation...

Keywords - Cellular componenti

Nucleus, Spliceosome

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Initiator methionineiRemovedBy similarity
ChainiPRO_00000819952 – 239Splicing factor U2AF 35 kDa subunitAdd BLAST238

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei2N-acetylalanineBy similarity1
Modified residuei39N6-methyllysineBy similarity1
Modified residuei61PhosphoserineBy similarity1
Modified residuei145PhosphoserineBy similarity1
Modified residuei165Omega-N-methylarginineBy similarity1

Keywords - PTMi

Acetylation, Methylation, Phosphoprotein

Proteomic databases

EPDiQ9D883.
MaxQBiQ9D883.
PaxDbiQ9D883.
PRIDEiQ9D883.

PTM databases

iPTMnetiQ9D883.
PhosphoSitePlusiQ9D883.
SwissPalmiQ9D883.

Expressioni

Tissue specificityi

Expressed in primary spermatocytes and elongating spermatids (at protein level).1 Publication

Gene expression databases

BgeeiENSMUSG00000061613.
ExpressionAtlasiQ9D883. baseline and differential.
GenevisibleiQ9D883. MM.

Interactioni

Subunit structurei

Identified in the spliceosome C complex. Heterodimer with U2AF2. Interacts with ZRANB2 (By similarity). Interacts (via RS domain) with PHF5A (via N-terminus).By similarity1 Publication

GO - Molecular functioni

  • RS domain binding Source: MGI

Protein-protein interaction databases

BioGridi223844. 3 interactors.
IntActiQ9D883. 3 interactors.
MINTiMINT-4139036.
STRINGi10090.ENSMUSP00000014684.

Structurei

3D structure databases

ProteinModelPortaliQ9D883.
SMRiQ9D883.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini65 – 147RRMPROSITE-ProRule annotationAdd BLAST83

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi179 – 238Arg/Gly/Ser-rich (RS domain)Add BLAST60

Domaini

The C-terminal SR-rich domain is required for interactions with SR proteins and the splicing regulators TRA and TRA2, and the N-terminal domain is required for formation of the U2AF1/U2AF2 heterodimer.By similarity

Sequence similaritiesi

Belongs to the splicing factor SR family.Curated
Contains 2 C3H1-type zinc fingers.PROSITE-ProRule annotation
Contains 1 RRM (RNA recognition motif) domain.PROSITE-ProRule annotation

Zinc finger

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Zinc fingeri12 – 40C3H1-type 1PROSITE-ProRule annotationAdd BLAST29
Zinc fingeri149 – 176C3H1-type 2PROSITE-ProRule annotationAdd BLAST28

Keywords - Domaini

Repeat, Zinc-finger

Phylogenomic databases

eggNOGiKOG2202. Eukaryota.
ENOG410Z5PX. LUCA.
GeneTreeiENSGT00530000063193.
HOGENOMiHOG000178619.
HOVERGENiHBG054605.
InParanoidiQ9D883.
KOiK12836.
OrthoDBiEOG091G0CUI.
PhylomeDBiQ9D883.

Family and domain databases

Gene3Di3.30.70.330. 1 hit.
InterProiIPR012677. Nucleotide-bd_a/b_plait.
IPR000504. RRM_dom.
IPR003954. RRM_dom_euk.
IPR009145. U2AF_small.
IPR000571. Znf_CCCH.
[Graphical view]
PANTHERiPTHR12620. PTHR12620. 1 hit.
PfamiPF00076. RRM_1. 1 hit.
PF00642. zf-CCCH. 2 hits.
[Graphical view]
PRINTSiPR01848. U2AUXFACTOR.
SMARTiSM00360. RRM. 1 hit.
SM00361. RRM_1. 1 hit.
SM00356. ZnF_C3H1. 2 hits.
[Graphical view]
SUPFAMiSSF54928. SSF54928. 1 hit.
PROSITEiPS50102. RRM. 1 hit.
PS50103. ZF_C3H1. 2 hits.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

Q9D883-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MAEYLASIFG TEKDKVNCSF YFKIGACRHG DRCSRLHNKP TFSQTIALLN
60 70 80 90 100
IYRNPQNSSQ SADGLRCAVS DVEMQEHYDE FFEEVFTEME EKYGEVEEMN
110 120 130 140 150
VCDNLGDHLV GNVYVKFRRE EDAEKAVIDL NNRWFNGQPI HAELSPVTDF
160 170 180 190 200
REACCRQYEM GECTRGGFCN FMHLKPISRE LRRELYGRRR KKHRSRSRSR
210 220 230
ERRSRSRDRG RGGGGGGGGG GGRERDRRRS RDRERSGRF
Length:239
Mass (Da):27,815
Last modified:January 23, 2007 - v4
Checksum:iDFF944210581244D
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti187G → R in BAB25609 (PubMed:16141072).Curated1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK008332 mRNA. Translation: BAB25609.1.
AK012849 mRNA. Translation: BAB28511.1.
AK088979 mRNA. Translation: BAC40683.1.
AK150872 mRNA. Translation: BAE29923.1.
BC002184 mRNA. Translation: AAH02184.1.
CCDSiCCDS28611.1.
RefSeqiNP_001157241.1. NM_001163769.1.
NP_077149.2. NM_024187.4.
UniGeneiMm.379289.

Genome annotation databases

EnsembliENSMUST00000014684; ENSMUSP00000014684; ENSMUSG00000061613.
GeneIDi108121.
KEGGimmu:108121.
UCSCiuc008bvo.2. mouse.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK008332 mRNA. Translation: BAB25609.1.
AK012849 mRNA. Translation: BAB28511.1.
AK088979 mRNA. Translation: BAC40683.1.
AK150872 mRNA. Translation: BAE29923.1.
BC002184 mRNA. Translation: AAH02184.1.
CCDSiCCDS28611.1.
RefSeqiNP_001157241.1. NM_001163769.1.
NP_077149.2. NM_024187.4.
UniGeneiMm.379289.

3D structure databases

ProteinModelPortaliQ9D883.
SMRiQ9D883.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi223844. 3 interactors.
IntActiQ9D883. 3 interactors.
MINTiMINT-4139036.
STRINGi10090.ENSMUSP00000014684.

PTM databases

iPTMnetiQ9D883.
PhosphoSitePlusiQ9D883.
SwissPalmiQ9D883.

Proteomic databases

EPDiQ9D883.
MaxQBiQ9D883.
PaxDbiQ9D883.
PRIDEiQ9D883.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000014684; ENSMUSP00000014684; ENSMUSG00000061613.
GeneIDi108121.
KEGGimmu:108121.
UCSCiuc008bvo.2. mouse.

Organism-specific databases

CTDi7307.
MGIiMGI:98884. U2af1.

Phylogenomic databases

eggNOGiKOG2202. Eukaryota.
ENOG410Z5PX. LUCA.
GeneTreeiENSGT00530000063193.
HOGENOMiHOG000178619.
HOVERGENiHBG054605.
InParanoidiQ9D883.
KOiK12836.
OrthoDBiEOG091G0CUI.
PhylomeDBiQ9D883.

Enzyme and pathway databases

ReactomeiR-MMU-109688. Cleavage of Growing Transcript in the Termination Region.
R-MMU-159236. Transport of Mature mRNA derived from an Intron-Containing Transcript.
R-MMU-72163. mRNA Splicing - Major Pathway.
R-MMU-72187. mRNA 3'-end processing.

Miscellaneous databases

ChiTaRSiU2af1. mouse.
PROiQ9D883.
SOURCEiSearch...

Gene expression databases

BgeeiENSMUSG00000061613.
ExpressionAtlasiQ9D883. baseline and differential.
GenevisibleiQ9D883. MM.

Family and domain databases

Gene3Di3.30.70.330. 1 hit.
InterProiIPR012677. Nucleotide-bd_a/b_plait.
IPR000504. RRM_dom.
IPR003954. RRM_dom_euk.
IPR009145. U2AF_small.
IPR000571. Znf_CCCH.
[Graphical view]
PANTHERiPTHR12620. PTHR12620. 1 hit.
PfamiPF00076. RRM_1. 1 hit.
PF00642. zf-CCCH. 2 hits.
[Graphical view]
PRINTSiPR01848. U2AUXFACTOR.
SMARTiSM00360. RRM. 1 hit.
SM00361. RRM_1. 1 hit.
SM00356. ZnF_C3H1. 2 hits.
[Graphical view]
SUPFAMiSSF54928. SSF54928. 1 hit.
PROSITEiPS50102. RRM. 1 hit.
PS50103. ZF_C3H1. 2 hits.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiU2AF1_MOUSE
AccessioniPrimary (citable) accession number: Q9D883
Secondary accession number(s): Q564E4, Q99LX2, Q9CZ98
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 16, 2001
Last sequence update: January 23, 2007
Last modified: November 30, 2016
This is version 140 of the entry and version 4 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.