Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

pre-mRNA 3' end processing protein WDR33

Gene

WDR33

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Essential for both cleavage and polyadenylation of pre-mRNA 3' ends.1 Publication

GO - Molecular functioni

  • poly(A) RNA binding Source: UniProtKB

GO - Biological processi

  • mRNA cleavage Source: GO_Central
  • mRNA processing Source: UniProtKB-KW
  • postreplication repair Source: UniProtKB
  • spermatogenesis Source: UniProtKB
Complete GO annotation...

Keywords - Biological processi

mRNA processing

Enzyme and pathway databases

BioCyciZFISH:ENSG00000136709-MONOMER.
SignaLinkiQ9C0J8.

Names & Taxonomyi

Protein namesi
Recommended name:
pre-mRNA 3' end processing protein WDR33
Alternative name(s):
WD repeat-containing protein 33
WD repeat-containing protein WDC146
Gene namesi
Name:WDR33
Synonyms:WDC146
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 2

Organism-specific databases

HGNCiHGNC:25651. WDR33.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Organism-specific databases

OpenTargetsiENSG00000136709.
PharmGKBiPA134943440.

Polymorphism and mutation databases

BioMutaiWDR33.
DMDMi209572695.

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Initiator methionineiRemovedCombined sources
ChainiPRO_00000513822 – 1336pre-mRNA 3' end processing protein WDR33Add BLAST1335

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei2N-acetylalanineCombined sources1
Modified residuei7PhosphoserineCombined sources1
Modified residuei46N6-acetyllysineCombined sources1
Cross-linki530Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)Combined sources
Modified residuei782Omega-N-methylarginineCombined sources1
Modified residuei915Asymmetric dimethylarginineCombined sources1
Modified residuei987Omega-N-methylarginineBy similarity1
Modified residuei1035Omega-N-methylarginineBy similarity1
Modified residuei1210PhosphoserineCombined sources1
Modified residuei1262Omega-N-methylarginineBy similarity1
Modified residuei1315Asymmetric dimethylarginine; alternateCombined sources1
Modified residuei1315Omega-N-methylarginine; alternateCombined sources1

Keywords - PTMi

Acetylation, Isopeptide bond, Methylation, Phosphoprotein, Ubl conjugation

Proteomic databases

EPDiQ9C0J8.
MaxQBiQ9C0J8.
PaxDbiQ9C0J8.
PeptideAtlasiQ9C0J8.
PRIDEiQ9C0J8.

PTM databases

iPTMnetiQ9C0J8.
PhosphoSitePlusiQ9C0J8.
SwissPalmiQ9C0J8.

Expressioni

Tissue specificityi

Most highly expressed in testis.1 Publication

Gene expression databases

BgeeiENSG00000136709.
CleanExiHS_WDR33.
ExpressionAtlasiQ9C0J8. baseline and differential.
GenevisibleiQ9C0J8. HS.

Organism-specific databases

HPAiHPA026897.
HPA046527.

Interactioni

Subunit structurei

Component of the cleavage and polyadenylation specificity factor (CPSF) module of the pre-mRNA 3'-end processing complex. Interacts with CPSF3/CPSF73.1 Publication

Protein-protein interaction databases

BioGridi120620. 58 interactors.
IntActiQ9C0J8. 28 interactors.
MINTiMINT-1371235.
STRINGi9606.ENSP00000325377.

Structurei

3D structure databases

ProteinModelPortaliQ9C0J8.
SMRiQ9C0J8.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati117 – 156WD 1Add BLAST40
Repeati159 – 198WD 2Add BLAST40
Repeati200 – 239WD 3Add BLAST40
Repeati242 – 283WD 4Add BLAST42
Repeati286 – 325WD 5Add BLAST40
Repeati329 – 369WD 6Add BLAST41
Repeati373 – 412WD 7Add BLAST40
Domaini618 – 770Collagen-likeAdd BLAST153

Sequence similaritiesi

Contains 1 collagen-like domain.Curated
Contains 7 WD repeats.PROSITE-ProRule annotation

Keywords - Domaini

Collagen, Repeat, WD repeat

Phylogenomic databases

eggNOGiKOG0284. Eukaryota.
COG2319. LUCA.
GeneTreeiENSGT00730000111130.
HOGENOMiHOG000126810.
HOVERGENiHBG062379.
KOiK15542.
OMAiQGFQQPH.
OrthoDBiEOG091G048U.
PhylomeDBiQ9C0J8.
TreeFamiTF317659.

Family and domain databases

Gene3Di2.130.10.10. 1 hit.
InterProiIPR015943. WD40/YVTN_repeat-like_dom.
IPR001680. WD40_repeat.
IPR017986. WD40_repeat_dom.
[Graphical view]
PfamiPF00400. WD40. 5 hits.
[Graphical view]
SMARTiSM00320. WD40. 7 hits.
[Graphical view]
SUPFAMiSSF50978. SSF50978. 1 hit.
PROSITEiPS50082. WD_REPEATS_2. 6 hits.
PS50294. WD_REPEATS_REGION. 1 hit.
[Graphical view]

Sequences (3)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q9C0J8-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MATEIGSPPR FFHMPRFQHQ APRQLFYKRP DFAQQQAMQQ LTFDGKRMRK
60 70 80 90 100
AVNRKTIDYN PSVIKYLENR IWQRDQRDMR AIQPDAGYYN DLVPPIGMLN
110 120 130 140 150
NPMNAVTTKF VRTSTNKVKC PVFVVRWTPE GRRLVTGASS GEFTLWNGLT
160 170 180 190 200
FNFETILQAH DSPVRAMTWS HNDMWMLTAD HGGYVKYWQS NMNNVKMFQA
210 220 230 240 250
HKEAIREASF SPTDNKFATC SDDGTVRIWD FLRCHEERIL RGHGADVKCV
260 270 280 290 300
DWHPTKGLVV SGSKDSQQPI KFWDPKTGQS LATLHAHKNT VMEVKLNLNG
310 320 330 340 350
NWLLTASRDH LCKLFDIRNL KEELQVFRGH KKEATAVAWH PVHEGLFASG
360 370 380 390 400
GSDGSLLFWH VGVEKEVGGM EMAHEGMIWS LAWHPLGHIL CSGSNDHTSK
410 420 430 440 450
FWTRNRPGDK MRDRYNLNLL PGMSEDGVEY DDLEPNSLAV IPGMGIPEQL
460 470 480 490 500
KLAMEQEQMG KDESNEIEMT IPGLDWGMEE VMQKDQKKVP QKKVPYAKPI
510 520 530 540 550
PAQFQQAWMQ NKVPIPAPNE VLNDRKEDIK LEEKKKTQAE IEQEMATLQY
560 570 580 590 600
TNPQLLEQLK IERLAQKQVE QIQPPPSSGT PLLGPQPFPG QGPMSQIPQG
610 620 630 640 650
FQQPHPSQQM PMNMAQMGPP GPQGQFRPPG PQGQMGPQGP PLHQGGGGPQ
660 670 680 690 700
GFMGPQGPQG PPQGLPRPQD MHGPQGMQRH PGPHGPLGPQ GPPGPQGSSG
710 720 730 740 750
PQGHMGPQGP PGPQGHIGPQ GPPGPQGHLG PQGPPGTQGM QGPPGPRGMQ
760 770 780 790 800
GPPHPHGIQG GPGSQGIQGP VSQGPLMGLN PRGMQGPPGP RENQGPAPQG
810 820 830 840 850
MIMGHPPQEM RGPHPPGGLL GHGPQEMRGP QEIRGMQGPP PQGSMLGPPQ
860 870 880 890 900
ELRGPPGSQS QQGPPQGSLG PPPQGGMQGP PGPQGQQNPA RGPHPSQGPI
910 920 930 940 950
PFQQQKTPLL GDGPRAPFNQ EGQSTGPPPL IPGLGQQGAQ GRIPPLNPGQ
960 970 980 990 1000
GPGPNKGDSR GPPNHHMGPM SERRHEQSGG PEHGPERGPF RGGQDCRGPP
1010 1020 1030 1040 1050
DRRGPHPDFP DDFSRPDDFH PDKRFGHRLR EFEGRGGPLP QEEKWRRGGP
1060 1070 1080 1090 1100
GPPFPPDHRE FSEGDGRGAA RGPPGAWEGR RPGDERFPRD PEDPRFRGRR
1110 1120 1130 1140 1150
EESFRRGAPP RHEGRAPPRG RDGFPGPEDF GPEENFDASE EAARGRDLRG
1160 1170 1180 1190 1200
RGRGTPRGGR KGLLPTPDEF PRFEGGRKPD SWDGNREPGP GHEHFRDTPR
1210 1220 1230 1240 1250
PDHPPHDGHS PASRERSSSL QGMDMASLPP RKRPWHDGPG TSEHREMEAP
1260 1270 1280 1290 1300
GGPSEDRGGK GRGGPGPAQR VPKSGRSSSL DGEHHDGYHR DEPFGGPPGS
1310 1320 1330
GTPSRGGRSG SNWGRGSNMN SGPPRRGASR GGGRGR
Length:1,336
Mass (Da):145,891
Last modified:October 14, 2008 - v2
Checksum:iDE45F510F93BB783
GO
Isoform 2 (identifier: Q9C0J8-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     209-326: SFSPTDNKFA...IRNLKEELQV → RFIHNIPFSV...YFIPNKEFSL
     327-1336: Missing.

Show »
Length:326
Mass (Da):38,294
Checksum:i1F1F704B877F7904
GO
Isoform 3 (identifier: Q9C0J8-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     242-257: GHGADVKCVDWHPTKG → DTCFHHCRCYFLSVKR
     258-326: Missing.
     327-1336: Missing.

Note: No experimental confirmation available.
Show »
Length:257
Mass (Da):30,288
Checksum:i2235A458B412633E
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti113T → A in BAD97039 (Ref. 3) Curated1
Sequence conflicti715G → S in BAB32435 (PubMed:11162572).Curated1
Isoform 2 (identifier: Q9C0J8-2)
Sequence conflicti274R → Q in BAD97039 (Ref. 3) Curated1
Sequence conflicti306F → S in AAH05401 (PubMed:15489334).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural variantiVAR_04671733A → S.Corresponds to variant rs11557686dbSNPEnsembl.1
Natural variantiVAR_053427711P → R.Corresponds to variant rs12615078dbSNPEnsembl.1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_041333209 – 326SFSPT…EELQV → RFIHNIPFSVVPIVMVKLFS KCILGAEMHGLCQFLGNFLH PINTIFFFVFTHSPFCWHLS EVVLSRYQPLQYVRDVLSAA FCTGFLFSFMINNVYTLFLF IIYCVRQEYFIPNKEFSL in isoform 2. 3 PublicationsAdd BLAST118
Alternative sequenceiVSP_042684242 – 257GHGAD…HPTKG → DTCFHHCRCYFLSVKR in isoform 3. 1 PublicationAdd BLAST16
Alternative sequenceiVSP_042685258 – 326Missing in isoform 3. 1 PublicationAdd BLAST69
Alternative sequenceiVSP_041334327 – 1336Missing in isoform 2 and isoform 3. 3 PublicationsAdd BLAST1010

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AB044749 mRNA. Translation: BAB32435.1.
AK002156 mRNA. Translation: BAA92113.1.
AK223319 mRNA. Translation: BAD97039.1.
AC006011 Genomic DNA. Translation: AAX82033.1.
CH471103 Genomic DNA. Translation: EAW95342.1.
CH471103 Genomic DNA. Translation: EAW95343.1.
BC005401 mRNA. Translation: AAH05401.1. Different termination.
BC013990 mRNA. Translation: AAH13990.1.
BC068484 mRNA. Translation: AAH68484.1.
AL834365 mRNA. Translation: CAH10688.1.
CCDSiCCDS2150.1. [Q9C0J8-1]
CCDS42746.1. [Q9C0J8-3]
CCDS46407.1. [Q9C0J8-2]
RefSeqiNP_001006623.1. NM_001006622.2. [Q9C0J8-2]
NP_001006624.1. NM_001006623.2. [Q9C0J8-3]
NP_060853.3. NM_018383.4. [Q9C0J8-1]
XP_011509738.1. XM_011511436.1. [Q9C0J8-1]
UniGeneiHs.554831.

Genome annotation databases

EnsembliENST00000322313; ENSP00000325377; ENSG00000136709. [Q9C0J8-1]
ENST00000393006; ENSP00000376730; ENSG00000136709. [Q9C0J8-3]
ENST00000409658; ENSP00000387186; ENSG00000136709. [Q9C0J8-2]
GeneIDi55339.
KEGGihsa:55339.
UCSCiuc002tpg.3. human. [Q9C0J8-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AB044749 mRNA. Translation: BAB32435.1.
AK002156 mRNA. Translation: BAA92113.1.
AK223319 mRNA. Translation: BAD97039.1.
AC006011 Genomic DNA. Translation: AAX82033.1.
CH471103 Genomic DNA. Translation: EAW95342.1.
CH471103 Genomic DNA. Translation: EAW95343.1.
BC005401 mRNA. Translation: AAH05401.1. Different termination.
BC013990 mRNA. Translation: AAH13990.1.
BC068484 mRNA. Translation: AAH68484.1.
AL834365 mRNA. Translation: CAH10688.1.
CCDSiCCDS2150.1. [Q9C0J8-1]
CCDS42746.1. [Q9C0J8-3]
CCDS46407.1. [Q9C0J8-2]
RefSeqiNP_001006623.1. NM_001006622.2. [Q9C0J8-2]
NP_001006624.1. NM_001006623.2. [Q9C0J8-3]
NP_060853.3. NM_018383.4. [Q9C0J8-1]
XP_011509738.1. XM_011511436.1. [Q9C0J8-1]
UniGeneiHs.554831.

3D structure databases

ProteinModelPortaliQ9C0J8.
SMRiQ9C0J8.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi120620. 58 interactors.
IntActiQ9C0J8. 28 interactors.
MINTiMINT-1371235.
STRINGi9606.ENSP00000325377.

PTM databases

iPTMnetiQ9C0J8.
PhosphoSitePlusiQ9C0J8.
SwissPalmiQ9C0J8.

Polymorphism and mutation databases

BioMutaiWDR33.
DMDMi209572695.

Proteomic databases

EPDiQ9C0J8.
MaxQBiQ9C0J8.
PaxDbiQ9C0J8.
PeptideAtlasiQ9C0J8.
PRIDEiQ9C0J8.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000322313; ENSP00000325377; ENSG00000136709. [Q9C0J8-1]
ENST00000393006; ENSP00000376730; ENSG00000136709. [Q9C0J8-3]
ENST00000409658; ENSP00000387186; ENSG00000136709. [Q9C0J8-2]
GeneIDi55339.
KEGGihsa:55339.
UCSCiuc002tpg.3. human. [Q9C0J8-1]

Organism-specific databases

CTDi55339.
GeneCardsiWDR33.
HGNCiHGNC:25651. WDR33.
HPAiHPA026897.
HPA046527.
neXtProtiNX_Q9C0J8.
OpenTargetsiENSG00000136709.
PharmGKBiPA134943440.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG0284. Eukaryota.
COG2319. LUCA.
GeneTreeiENSGT00730000111130.
HOGENOMiHOG000126810.
HOVERGENiHBG062379.
KOiK15542.
OMAiQGFQQPH.
OrthoDBiEOG091G048U.
PhylomeDBiQ9C0J8.
TreeFamiTF317659.

Enzyme and pathway databases

BioCyciZFISH:ENSG00000136709-MONOMER.
SignaLinkiQ9C0J8.

Miscellaneous databases

ChiTaRSiWDR33. human.
GeneWikiiWDR33.
GenomeRNAii55339.
PROiQ9C0J8.

Gene expression databases

BgeeiENSG00000136709.
CleanExiHS_WDR33.
ExpressionAtlasiQ9C0J8. baseline and differential.
GenevisibleiQ9C0J8. HS.

Family and domain databases

Gene3Di2.130.10.10. 1 hit.
InterProiIPR015943. WD40/YVTN_repeat-like_dom.
IPR001680. WD40_repeat.
IPR017986. WD40_repeat_dom.
[Graphical view]
PfamiPF00400. WD40. 5 hits.
[Graphical view]
SMARTiSM00320. WD40. 7 hits.
[Graphical view]
SUPFAMiSSF50978. SSF50978. 1 hit.
PROSITEiPS50082. WD_REPEATS_2. 6 hits.
PS50294. WD_REPEATS_REGION. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiWDR33_HUMAN
AccessioniPrimary (citable) accession number: Q9C0J8
Secondary accession number(s): Q05DP8
, Q53FG9, Q587J1, Q69YF7, Q6NUQ0, Q9NUL1
Entry historyi
Integrated into UniProtKB/Swiss-Prot: April 3, 2002
Last sequence update: October 14, 2008
Last modified: November 2, 2016
This is version 140 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 2
    Human chromosome 2: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.