Q9C0J8 (WDR33_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 93.

Names and origin

Protein names
   Recommended name: pre-mRNA 3' end processing protein WDR33
   Alternative name(s): WD repeat-containing protein 33
                        WD repeat-containing protein WDC146
Gene names
   Name: WDR33
   Synonyms: WDC146
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 1336 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Essential for both cleavage and polyadenylation of pre-mRNA 3' ends. Ref.11

Subunit structure

Component of the cleavage and polyadenylation specificity factor (CPSF) module of the pre-mRNA 3' end processing complex. Interacts with CPSF3/CPSF73. Ref.11

Subcellular location

Nucleus Ref.1 Ref.11.

Tissue specificity

Most highly expressed in testis. Ref.1

Sequence similarities

Contains 1 collagen-like domain.

Contains 7 WD repeats.

Ontologies

Keywords
   Biological process: mRNA processing
   Cellular component: Nucleus
   Coding sequence diversity: Alternative splicing; Polymorphism
   Domain: Collagen; Repeat; WD repeat
   PTM: Acetylation; Phosphoprotein
   Technical term: Complete proteome; Reference proteome
Gene Ontology (GO)
   Biological process:
      postreplication repair. Non-traceable author statement Ref.1. Source: UniProtKB
      spermatogenesis. Non-traceable author statement Ref.1. Source: UniProtKB
   Cellular component:
      collagen. Inferred from electronic annotation. Source: UniProtKB-KW
      nucleus. Inferred from direct assay Ref.1. Source: UniProtKB


Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q9C0J8-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q9C0J8-2)

The sequence of this isoform differs from the canonical sequence as follows:
     209-326: SFSPTDNKFA...IRNLKEELQV → RFIHNIPFSV...YFIPNKEFSL
     327-1336: Missing.
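
The two differences listed for isoform 2 can be applied mechanically to the canonical sequence. The following is a minimal sketch, not UniProt tooling: the helper name `apply_isoform_edits` and the toy sequence are hypothetical, and positions are assumed to be 1-based and inclusive, as they are elsewhere in this entry.

```python
def apply_isoform_edits(canonical, edits):
    """Apply (start, end, replacement) edits to a sequence.

    Positions are 1-based and inclusive. An empty replacement
    models a region described as 'Missing'.
    """
    # Work from the C-terminus backwards so earlier coordinates stay valid.
    for start, end, replacement in sorted(edits, reverse=True):
        canonical = canonical[:start - 1] + replacement + canonical[end:]
    return canonical


# Toy example (hypothetical sequence, not WDR33).
toy = "ABCDEFGHIJ"
toy_isoform = apply_isoform_edits(toy, [(3, 5, "xyz"), (6, 10, "")])
print(toy_isoform)  # ABxyz

# Length bookkeeping for isoform 2, using only numbers from this entry:
# residues 1-208 are kept, 209-326 are replaced, 327-1336 are missing.
kept = 208
replaced = 326 - 209 + 1
isoform2_length = kept + replaced
print(isoform2_length)  # 326
```

The computed length agrees with the 326 residues reported for isoform 2 in the Sequences section.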

Sequence annotation (Features)

Positions in this table refer to isoform 2.

Feature key           Position(s)   Length  Description
Sequence conflict     274           1       R → Q in BAD97039. Ref.3
Sequence conflict     306           1       F → S in AAH05401. Ref.6

Sequence annotation (Features)

Feature key           Position(s)   Length  Description  Feature identifier

Molecule processing

Initiator methionine  1             1       Removed
Chain                 2 – 1336      1335    pre-mRNA 3' end processing protein WDR33  PRO_0000051382

Regions

Repeat                117 – 156     40      WD 1
Repeat                159 – 198     40      WD 2
Repeat                200 – 239     40      WD 3
Repeat                242 – 283     42      WD 4
Repeat                286 – 325     40      WD 5
Repeat                329 – 369     41      WD 6
Repeat                373 – 412     40      WD 7
Domain                618 – 770     153     Collagen-like
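
Each Length value in the Regions rows should equal end minus start plus one, because positions are inclusive. The check below is a small illustrative sketch using the numbers from the rows above; the variable and function names are hypothetical.

```python
# (description, start, end, stated length) as listed under Regions.
regions = [
    ("WD 1", 117, 156, 40),
    ("WD 2", 159, 198, 40),
    ("WD 3", 200, 239, 40),
    ("WD 4", 242, 283, 42),
    ("WD 5", 286, 325, 40),
    ("WD 6", 329, 369, 41),
    ("WD 7", 373, 412, 40),
    ("Collagen-like", 618, 770, 153),
]


def span_length(start, end):
    """Length of an inclusive, 1-based interval."""
    return end - start + 1


# Collect any rows whose stated length disagrees with the coordinates.
mismatches = [name for name, start, end, stated in regions
              if span_length(start, end) != stated]
print(mismatches)  # []
```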

Amino acid modifications

Modified residue      2             1       N-acetylalanine Ref.10
Modified residue      7             1       Phosphoserine Ref.8 Ref.10
Modified residue      46            1       N6-acetyllysine Ref.12
Modified residue      1210          1       Phosphoserine Ref.9

Natural variations

Alternative sequence  209 – 326     118     SFSPT…EELQV → RFIHNIPFSVVPIVMVKLFS KCILGAEMHGLCQFLGNFLH PINTIFFFVFTHSPFCWHLS EVVLSRYQPLQYVRDVLSAA FCTGFLFSFMINNVYTLFLF IIYCVRQEYFIPNKEFSL in isoform 2.  VSP_041333
Alternative sequence  327 – 1336    1010    Missing in isoform 2.  VSP_041334
Natural variant       33            1       A → S. Corresponds to variant rs11557686.  VAR_046717
Natural variant       711           1       P → R. Corresponds to variant rs12615078.  VAR_053427

Experimental info

Sequence conflict     113           1       T → A in BAD97039. Ref.3
Sequence conflict     715           1       G → S in BAB32435. Ref.1
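
Feature positions are 1-based and refer to the canonical sequence, so a residue-level annotation can be sanity-checked by looking up the residue at that position. The sketch below does this for the features that fall within the first 60 residues of isoform 1 (copied from the Sequences section of this entry); `residue_at` is a hypothetical helper, and features further along the chain, such as the phosphoserine at 1210, would need the full sequence.

```python
# First 60 residues of the canonical sequence (isoform 1), copied from this entry.
N_TERMINUS = "MATEIGSPPRFFHMPRFQHQAPRQLFYKRPDFAQQQAMQQLTFDGKRMRKAVNRKTIDYN"


def residue_at(sequence, position):
    """Return the residue at a 1-based position."""
    if not 1 <= position <= len(sequence):
        raise IndexError(f"position {position} is outside 1..{len(sequence)}")
    return sequence[position - 1]


# (position, expected residue, annotation) taken from the feature table.
checks = [
    (2, "A", "N-acetylalanine"),
    (7, "S", "Phosphoserine"),
    (33, "A", "Natural variant A -> S"),
    (46, "K", "N6-acetyllysine"),
]
results = {pos: residue_at(N_TERMINUS, pos) == aa for pos, aa, _ in checks}
print(results)  # {2: True, 7: True, 33: True, 46: True}
```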

Sequences

Isoform 1

Last modified October 14, 2008. Version 2.
Checksum: DE45F510F93BB783
Length: 1,336 AA. Mass: 145,891 Da.
        10         20         30         40         50         60 
MATEIGSPPR FFHMPRFQHQ APRQLFYKRP DFAQQQAMQQ LTFDGKRMRK AVNRKTIDYN 

        70         80         90        100        110        120 
PSVIKYLENR IWQRDQRDMR AIQPDAGYYN DLVPPIGMLN NPMNAVTTKF VRTSTNKVKC 

       130        140        150        160        170        180 
PVFVVRWTPE GRRLVTGASS GEFTLWNGLT FNFETILQAH DSPVRAMTWS HNDMWMLTAD 

       190        200        210        220        230        240 
HGGYVKYWQS NMNNVKMFQA HKEAIREASF SPTDNKFATC SDDGTVRIWD FLRCHEERIL 

       250        260        270        280        290        300 
RGHGADVKCV DWHPTKGLVV SGSKDSQQPI KFWDPKTGQS LATLHAHKNT VMEVKLNLNG 

       310        320        330        340        350        360 
NWLLTASRDH LCKLFDIRNL KEELQVFRGH KKEATAVAWH PVHEGLFASG GSDGSLLFWH 

       370        380        390        400        410        420 
VGVEKEVGGM EMAHEGMIWS LAWHPLGHIL CSGSNDHTSK FWTRNRPGDK MRDRYNLNLL 

       430        440        450        460        470        480 
PGMSEDGVEY DDLEPNSLAV IPGMGIPEQL KLAMEQEQMG KDESNEIEMT IPGLDWGMEE 

       490        500        510        520        530        540 
VMQKDQKKVP QKKVPYAKPI PAQFQQAWMQ NKVPIPAPNE VLNDRKEDIK LEEKKKTQAE 

       550        560        570        580        590        600 
IEQEMATLQY TNPQLLEQLK IERLAQKQVE QIQPPPSSGT PLLGPQPFPG QGPMSQIPQG 

       610        620        630        640        650        660 
FQQPHPSQQM PMNMAQMGPP GPQGQFRPPG PQGQMGPQGP PLHQGGGGPQ GFMGPQGPQG 

       670        680        690        700        710        720 
PPQGLPRPQD MHGPQGMQRH PGPHGPLGPQ GPPGPQGSSG PQGHMGPQGP PGPQGHIGPQ 

       730        740        750        760        770        780 
GPPGPQGHLG PQGPPGTQGM QGPPGPRGMQ GPPHPHGIQG GPGSQGIQGP VSQGPLMGLN 

       790        800        810        820        830        840 
PRGMQGPPGP RENQGPAPQG MIMGHPPQEM RGPHPPGGLL GHGPQEMRGP QEIRGMQGPP 

       850        860        870        880        890        900 
PQGSMLGPPQ ELRGPPGSQS QQGPPQGSLG PPPQGGMQGP PGPQGQQNPA RGPHPSQGPI 

       910        920        930        940        950        960 
PFQQQKTPLL GDGPRAPFNQ EGQSTGPPPL IPGLGQQGAQ GRIPPLNPGQ GPGPNKGDSR 

       970        980        990       1000       1010       1020 
GPPNHHMGPM SERRHEQSGG PEHGPERGPF RGGQDCRGPP DRRGPHPDFP DDFSRPDDFH 

      1030       1040       1050       1060       1070       1080 
PDKRFGHRLR EFEGRGGPLP QEEKWRRGGP GPPFPPDHRE FSEGDGRGAA RGPPGAWEGR 

      1090       1100       1110       1120       1130       1140 
RPGDERFPRD PEDPRFRGRR EESFRRGAPP RHEGRAPPRG RDGFPGPEDF GPEENFDASE 

      1150       1160       1170       1180       1190       1200 
EAARGRDLRG RGRGTPRGGR KGLLPTPDEF PRFEGGRKPD SWDGNREPGP GHEHFRDTPR 

      1210       1220       1230       1240       1250       1260 
PDHPPHDGHS PASRERSSSL QGMDMASLPP RKRPWHDGPG TSEHREMEAP GGPSEDRGGK 

      1270       1280       1290       1300       1310       1320 
GRGGPGPAQR VPKSGRSSSL DGEHHDGYHR DEPFGGPPGS GTPSRGGRSG SNWGRGSNMN 

      1330 
SGPPRRGASR GGGRGR 
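
The display format above interleaves position rulers with residues in space-separated groups of ten. A minimal sketch for turning such a block back into one contiguous string follows; `parse_sequence_block` is a hypothetical helper, and only the first two rows of the sequence are included to keep the example short.

```python
def parse_sequence_block(text):
    """Join residue groups from a ruler-annotated sequence display."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines made only of position numbers.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the isoform 1 display, copied from this entry.
block = """
        10         20         30         40         50         60
MATEIGSPPR FFHMPRFQHQ APRQLFYKRP DFAQQQAMQQ LTFDGKRMRK AVNRKTIDYN

        70         80         90        100        110        120
PSVIKYLENR IWQRDQRDMR AIQPDAGYYN DLVPPIGMLN NPMNAVTTKF VRTSTNKVKC
"""
sequence = parse_sequence_block(block)
print(len(sequence))   # 120
print(sequence[:10])   # MATEIGSPPR
```

Applied to the full block, the same function should return a string whose length matches the stated 1,336 residues.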

Isoform 2

Checksum: 1F1F704B877F7904
Length: 326 AA. Mass: 38,294 Da.

References

[1]"A novel WD40 repeat protein, WDC146, highly expressed during spermatogenesis in a stage-specific manner."
Ito S., Sakai A., Nomura T., Miki Y., Ouchida M., Sasaki J., Shimizu K.
Biochem. Biophys. Res. Commun. 280:656-663(2001) [PubMed: 11162572] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), SUBCELLULAR LOCATION, TISSUE SPECIFICITY.
[2]"Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004) [PubMed: 14702039] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Tissue: Placenta.
[3]Suzuki Y., Sugano S., Totoki Y., Toyoda A., Takeda T., Sakaki Y., Tanaka A., Yokoyama S.
Submitted (APR-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Tissue: Small intestine.
[4]"Generation and annotation of the DNA sequences of human chromosomes 2 and 4."
Hillier L.W., Graves T.A., Fulton R.S., Fulton L.A., Pepin K.H., Minx P., Wagner-McPherson C., Layman D., Wylie K., Sekhon M., Becker M.C., Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Kremitzki C., Oddy L., Du H., Sun H., Bradshaw-Cordum H., Ali J., Carter J., Cordes M., Harris A., Isak A., van Brunt A., Nguyen C., Du F., Courtney L., Kalicki J., Ozersky P., Abbott S., Armstrong J., Belter E.A., Caruso L., Cedroni M., Cotton M., Davidson T., Desai A., Elliott G., Erb T., Fronick C., Gaige T., Haakenson W., Haglund K., Holmes A., Harkins R., Kim K., Kruchowski S.S., Strong C.M., Grewal N., Goyea E., Hou S., Levy A., Martinka S., Mead K., McLellan M.D., Meyer R., Randall-Maher J., Tomlinson C., Dauphin-Kohlberg S., Kozlowicz-Reilly A., Shah N., Swearengen-Shahid S., Snider J., Strong J.T., Thompson J., Yoakum M., Leonard S., Pearman C., Trani L., Radionenko M., Waligorski J.E., Wang C., Rock S.M., Tin-Wollam A.-M., Maupin R., Latreille P., Wendl M.C., Yang S.-P., Pohl C., Wallis J.W., Spieth J., Bieri T.A., Berkowicz N., Nelson J.O., Osborne J., Ding L., Meyer R., Sabo A., Shotland Y., Sinha P., Wohldmann P.E., Cook L.L., Hickenbotham M.T., Eldred J., Williams D., Jones T.A., She X., Ciccarelli F.D., Izaurralde E., Taylor J., Schmutz J., Myers R.M., Cox D.R., Huang X., McPherson J.D., Mardis E.R., Clifton S.W., Warren W.C., Chinwalla A.T., Eddy S.R., Marra M.A., Ovcharenko I., Furey T.S., Miller W., Eichler E.E., Bork P., Suyama M., Torrents D., Waterston R.H., Wilson R.K.
Nature 434:724-731(2005) [PubMed: 15815621] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[5]Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., Venter J.C.
Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[6]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Tissue: Brain and Pancreas.
[7]"The full-ORF clone resource of the German cDNA consortium."
Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A., Wiemann S., Schupp I.
BMC Genomics 8:399-399(2007) [PubMed: 17974005] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1187-1336 (ISOFORM 1).
Tissue: Melanoma.
[8]"Global, in vivo, and site-specific phosphorylation dynamics in signaling networks."
Olsen J.V., Blagoev B., Gnad F., Macek B., Kumar C., Mortensen P., Mann M.
Cell 127:635-648(2006) [PubMed: 17081983] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-7, MASS SPECTROMETRY.
Tissue: Cervix carcinoma.
[9]"A quantitative atlas of mitotic phosphorylation."
Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008) [PubMed: 18669648] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1210, MASS SPECTROMETRY.
Tissue: Cervix carcinoma.
[10]"Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach."
Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., Mohammed S.
Anal. Chem. 81:4493-4501(2009) [PubMed: 19413330] [Abstract]
Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT ALA-2, PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-7, MASS SPECTROMETRY.
Tissue: Embryonic kidney.
[11]"Molecular architecture of the human pre-mRNA 3' processing complex."
Shi Y., Di Giammartino D.C., Taylor D., Sarkeshik A., Rice W.J., Yates J.R. III, Frank J., Manley J.L.
Mol. Cell 33:365-376(2009) [PubMed: 19217410] [Abstract]
Cited for: FUNCTION, IDENTIFICATION IN THE 3' PRE-MRNA END PROCESSING COMPLEX, SUBCELLULAR LOCATION, INTERACTION WITH CPSF3.
[12]"Lysine acetylation targets protein complexes and co-regulates major cellular functions."
Choudhary C., Kumar C., Gnad F., Nielsen M.L., Rehman M., Walther T., Olsen J.V., Mann M.
Science 325:834-840(2009) [PubMed: 19608861] [Abstract]
Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT LYS-46, MASS SPECTROMETRY.
[13]"Initial characterization of the human central proteome."
Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., Buerckstuemmer T., Bennett K.L., Superti-Furga G., Colinge J.
BMC Syst. Biol. 5:17-17(2011) [PubMed: 21269460] [Abstract]
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
   AB044749 mRNA. Translation: BAB32435.1.
   AK002156 mRNA. Translation: BAA92113.1.
   AK223319 mRNA. Translation: BAD97039.1.
   AC006011 Genomic DNA. Translation: AAX82033.1.
   CH471103 Genomic DNA. Translation: EAW95342.1.
   CH471103 Genomic DNA. Translation: EAW95343.1.
   BC005401 mRNA. Translation: AAH05401.1. Different termination.
   BC013990 mRNA. Translation: AAH13990.1.
   AL834365 mRNA. Translation: CAH10688.1.
IPI:
   IPI00106567.
   IPI00385811.
RefSeq:
   NP_001006623.1. NM_001006622.2.
   NP_060853.3. NM_018383.4.
UniGene: Hs.554831.

3D structure databases

ProteinModelPortal: Q9C0J8.
SMR: Q9C0J8. Positions 117-411.

Protein-protein interaction databases

IntAct: Q9C0J8. 13 interactions.

PTM databases

PhosphoSite: Q9C0J8.

Polymorphism databases

DMDM: 209572695.

Proteomic databases

PRIDE: Q9C0J8.

Genome annotation databases

Ensembl: ENST00000322313; ENSP00000325377; ENSG00000136709.
GeneID: 55339.
KEGG: hsa:55339.

Organism-specific databases

CTD: 55339.
GeneCards: GC02M128557.
HGNC: HGNC:25651. WDR33.
HPA: HPA026897.
neXtProt: NX_Q9C0J8.
PharmGKB: PA134943440.

Phylogenomic databases

eggNOG: prNOG07493.
GeneTree: ENSGT00540000070914.
HOGENOM: HBG125697.
HOVERGEN: HBG054623.
InParanoid: Q9C0J8.
OMA: GGPQGFM.
OrthoDB: EOG4V6ZG7.
PhylomeDB: Q9C0J8.

Gene expression databases

ArrayExpress: Q9C0J8.
Bgee: Q9C0J8.
CleanEx: HS_WDR33.
Genevestigator: Q9C0J8.
GermOnline: ENSG00000136709. Homo sapiens.

Family and domain databases

InterPro:
   IPR008160. Collagen.
   IPR015943. WD40/YVTN_repeat-like_dom.
   IPR001680. WD40_repeat.
   IPR011046. WD40_repeat-like_dom.
   IPR017986. WD40_repeat_dom.
Gene3D: G3DSA:2.130.10.10. WD40/YVTN_repeat-like. 1 hit.
KO: K15542.
Pfam:
   PF01391. Collagen. 2 hits.
   PF00400. WD40. 6 hits.
SMART: SM00320. WD40. 7 hits.
SUPFAM: SSF50978. WD40_like. 1 hit.
PROSITE:
   PS00678. WD_REPEATS_1. False negative.
   PS50082. WD_REPEATS_2. 6 hits.
   PS50294. WD_REPEATS_REGION. 1 hit.

Entry information

Entry name: WDR33_HUMAN
Accession
   Primary (citable) accession number: Q9C0J8
   Secondary accession number(s): Q05DP8, Q53FG9, Q587J1, Q69YF7, Q9NUL1
Entry history
   Integrated into UniProtKB/Swiss-Prot: April 3, 2002
   Last sequence update: October 14, 2008
   Last modified: January 25, 2012
   This is version 93 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 2

Human chromosome 2: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

SIMILARITY comments

Index of protein domains and families