
Q8TBZ3 (WDR20_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 110.


Names and origin

Protein names
   Recommended name: WD repeat-containing protein 20
   Alternative name(s): Protein DMR
Gene names
   Name: WDR20
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 569 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level.

General annotation (Comments)

Sequence similarities

Contains 5 WD repeats.

Sequence caution

The sequence AAL56014.1 differs from that shown. Reason: Erroneous initiation.

Ontologies

Keywords
   Coding sequence diversity: Alternative splicing, Polymorphism
   Domain: Repeat, WD repeat
   PTM: Acetylation, Phosphoprotein
   Technical term: Complete proteome, Reference proteome
Gene Ontology (GO)
   None.

Alternative products

This entry describes 8 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q8TBZ3-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q8TBZ3-2)

The sequence of this isoform differs from the canonical sequence as follows:
     565-569: VSFNP → GSLSSPSQASSPGGTVV
Note: No experimental confirmation available.
Isoform 3 (identifier: Q8TBZ3-3)

The sequence of this isoform differs from the canonical sequence as follows:
     145-195: RLIDKSRVTC...TAPHYQLLKQ → NSCQHLWKVD...RMQGVLQDQN
     196-569: Missing.
Note: No experimental confirmation available.
Isoform 4 (identifier: Q8TBZ3-4)

The sequence of this isoform differs from the canonical sequence as follows:
     1-9: Missing.
     84-144: Missing.
Note: No experimental confirmation available.
Isoform 5 (identifier: Q8TBZ3-5)

The sequence of this isoform differs from the canonical sequence as follows:
     84-144: Missing.
Note: No experimental confirmation available.
Isoform 6 (identifier: Q8TBZ3-6)

The sequence of this isoform differs from the canonical sequence as follows:
     84-144: Missing.
     565-569: VSFNP → GSLSSPSQASSPGGTVV
Note: No experimental confirmation available.
Isoform 7 (identifier: Q8TBZ3-7)

The sequence of this isoform differs from the canonical sequence as follows:
     144-144: E → ENSCQHLWKVDWNEERQNEGSKTSEEALVTVQ
Isoform 8 (identifier: Q8TBZ3-8)

The sequence of this isoform differs from the canonical sequence as follows:
     84-569: AADLSKPIDK...RPGKVVSFNP → TIP
Note: No experimental confirmation available.
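
The positional differences listed for isoforms 2 to 8 can be applied mechanically to the canonical sequence. The Python sketch below is illustrative only. `CANONICAL` is the isoform 1 sequence copied from the Sequences section of this entry, and the full replacement strings for isoforms 3 and 7 are taken from the Natural variations table. The names `apply_edits` and `ISOFORM_EDITS` are hypothetical helpers, not part of any UniProt tool. Positions are 1-based and inclusive, as in the entry.

```python
# Canonical isoform 1 sequence (569 aa), copied from this entry.
CANONICAL = (
    "MATEGGGKEMNEIKTQFTTREGLYKLLPHSEYSRPNRVPFNSQGSNPVRVSFVNLNDQSG"
    "NGDRLCFNVGRELYFYIYKGVRKAADLSKPIDKRIYKGTQPTCHDFNHLTATAESVSLLV"
    "GFSAGQVQLIDPIKKETSKLFNEERLIDKSRVTCVKWVPGSESLFLVAHSSGNMYLYNVE"
    "HTCGTTAPHYQLLKQGESFAVHTCKSKSTRNPLLKWTVGEGALNEFAFSPDGKFLACVSQ"
    "DGFLRVFNFDSVELHGTMKSYFGGLLCVCWSPDGKYIVTGGEDDLVTVWSFVDCRVIARG"
    "HGHKSWVSVVAFDPYTTSVEEGDPMEFSGSDEDFQDLLHFGRDRANSTQSRLSKRNSTDS"
    "RPVSVTYRFGSVGQDTQLCLWDLTEDILFPHQPLSRARTHTNVMNATSPPAGSNGNSVTT"
    "PGNSVPPPLPRSNSLPHSAVSNAGSKSSVMDGAIASGVSKFATLSLHDRKERHHEKDHKR"
    "NHSMGHISSKSSDKLNLVTKTKTDPAKTLGTPLCPRMEDVPLLEPLICKKIAHERLTVLI"
    "FLEDCIVTACQEGFICTWGRPGKVVSFNP"
)

# Alternative C-terminus shared by isoforms 2 and 6.
TAIL = "GSLSSPSQASSPGGTVV"

# (start, end, replacement) per isoform; an empty replacement means
# the segment is missing in that isoform.
ISOFORM_EDITS = {
    2: [(565, 569, TAIL)],
    3: [(145, 195, "NSCQHLWKVDWNEERQNEGSKTSEEALVTVQPAEHFCRQEDRMQGVLQDQN"),
        (196, 569, "")],
    4: [(1, 9, ""), (84, 144, "")],
    5: [(84, 144, "")],
    6: [(84, 144, ""), (565, 569, TAIL)],
    7: [(144, 144, "ENSCQHLWKVDWNEERQNEGSKTSEEALVTVQ")],
    8: [(84, 569, "TIP")],
}


def apply_edits(seq, edits):
    """Apply edits from the C-terminus backwards so that the
    coordinates of edits closer to the N-terminus stay valid."""
    for start, end, replacement in sorted(edits, reverse=True):
        seq = seq[:start - 1] + replacement + seq[end:]
    return seq


isoforms = {n: apply_edits(CANONICAL, e) for n, e in ISOFORM_EDITS.items()}
for n, s in sorted(isoforms.items()):
    print(f"Isoform {n}: {len(s)} aa")
```

The resulting lengths can be compared with those listed per isoform in the Sequences section (581, 195, 499, 508, 520, 600 and 86 AA).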

Sequence annotation (Features)

Feature key           Position(s)  Length  Description  Feature identifier

Molecule processing

Initiator methionine  1            1       Removed. Ref.11
Chain                 2 – 569      568     WD repeat-containing protein 20  PRO_0000051366

Regions

Repeat                147 – 187    41      WD 1
Repeat                218 – 257    40      WD 2
Repeat                260 – 299    40      WD 3
Repeat                347 – 391    45      WD 4
Repeat                531 – 568    38      WD 5
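
The Length column follows from the positions as end minus start plus one, since positions are inclusive. A small Python sketch, using only the five repeat coordinates listed above, checks that arithmetic and adds up how much of the 569-residue canonical sequence the annotated WD repeats cover. The names `WD_REPEATS` and `span_length` are hypothetical.

```python
# (name, start, end, listed length) for the five WD repeats in this entry.
WD_REPEATS = [
    ("WD 1", 147, 187, 41),
    ("WD 2", 218, 257, 40),
    ("WD 3", 260, 299, 40),
    ("WD 4", 347, 391, 45),
    ("WD 5", 531, 568, 38),
]

SEQUENCE_LENGTH = 569  # canonical isoform 1


def span_length(start, end):
    """Length of a 1-based, inclusive span."""
    return end - start + 1


# Each listed length should equal end - start + 1.
for name, start, end, listed in WD_REPEATS:
    assert span_length(start, end) == listed, name

covered = sum(span_length(s, e) for _, s, e, _ in WD_REPEATS)
print(covered, round(100 * covered / SEQUENCE_LENGTH, 1))  # 204 35.9
```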

Amino acid modifications

Modified residue      2            1       N-acetylalanine. Ref.11
Modified residue      432          1       Phosphoserine. Ref.7 Ref.10
Modified residue      434          1       Phosphoserine. Ref.7 Ref.8

Natural variations

Alternative sequence  1 – 9        9       Missing in isoform 4.  VSP_045225
Alternative sequence  84 – 569     486     AADLS…VSFNP → TIP in isoform 8.  VSP_047064
Alternative sequence  84 – 144     61      Missing in isoform 4, isoform 5 and isoform 6.  VSP_045226
Alternative sequence  144          1       E → ENSCQHLWKVDWNEERQNEGSKTSEEALVTVQ in isoform 7.  VSP_047065
Alternative sequence  145 – 195    51      RLIDK…QLLKQ → NSCQHLWKVDWNEERQNEGSKTSEEALVTVQPAEHFCRQEDRMQGVLQDQN in isoform 3.  VSP_043412
Alternative sequence  196 – 569    374     Missing in isoform 3.  VSP_043413
Alternative sequence  565 – 569    5       VSFNP → GSLSSPSQASSPGGTVV in isoform 2 and isoform 6.  VSP_024387
Natural variant       159          1       P → H. Ref.4 Corresponds to variant rs17852545 [dbSNP | Ensembl].  VAR_031580
Natural variant       444          1       G → C. Corresponds to variant rs12888595 [dbSNP | Ensembl].  VAR_053425

Experimental info

Sequence conflict     195          1       Q → H in AAL56014. Ref.5
Sequence conflict     299          1       R → K in AAL56014. Ref.5
Sequence conflict     337          1       L → P in BAG60080. Ref.2
Sequence conflict     353          1       S → P in BAG60080. Ref.2
Sequence conflict     474          1       H → L in CAB63713. Ref.6

Sequences

Isoform 1 [UniParc]. Length: 569 AA. Mass: 62,893 Da.
Last modified April 3, 2007. Version 2.
Checksum: 5779A68D94C34CAF

        10         20         30         40         50         60 
MATEGGGKEM NEIKTQFTTR EGLYKLLPHS EYSRPNRVPF NSQGSNPVRV SFVNLNDQSG 

        70         80         90        100        110        120 
NGDRLCFNVG RELYFYIYKG VRKAADLSKP IDKRIYKGTQ PTCHDFNHLT ATAESVSLLV 

       130        140        150        160        170        180 
GFSAGQVQLI DPIKKETSKL FNEERLIDKS RVTCVKWVPG SESLFLVAHS SGNMYLYNVE 

       190        200        210        220        230        240 
HTCGTTAPHY QLLKQGESFA VHTCKSKSTR NPLLKWTVGE GALNEFAFSP DGKFLACVSQ 

       250        260        270        280        290        300 
DGFLRVFNFD SVELHGTMKS YFGGLLCVCW SPDGKYIVTG GEDDLVTVWS FVDCRVIARG 

       310        320        330        340        350        360 
HGHKSWVSVV AFDPYTTSVE EGDPMEFSGS DEDFQDLLHF GRDRANSTQS RLSKRNSTDS 

       370        380        390        400        410        420 
RPVSVTYRFG SVGQDTQLCL WDLTEDILFP HQPLSRARTH TNVMNATSPP AGSNGNSVTT 

       430        440        450        460        470        480 
PGNSVPPPLP RSNSLPHSAV SNAGSKSSVM DGAIASGVSK FATLSLHDRK ERHHEKDHKR 

       490        500        510        520        530        540 
NHSMGHISSK SSDKLNLVTK TKTDPAKTLG TPLCPRMEDV PLLEPLICKK IAHERLTVLI 

       550        560 
FLEDCIVTAC QEGFICTWGR PGKVVSFNP 
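
The numbered display above interleaves position rulers with blocks of ten residues. A minimal Python sketch of how such a display could be collapsed back into a plain sequence string follows. It assumes that ruler lines contain only integers, and `parse_numbered_block` is a hypothetical helper name. The excerpt is the last two rows of the display (residues 481 to 569).

```python
def parse_numbered_block(text):
    """Join residue blocks into one string, skipping ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines hold only position numbers (490, 500, ...).
        if not tokens or all(t.isdigit() for t in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


excerpt = """\
       490        500        510        520        530        540
NHSMGHISSK SSDKLNLVTK TKTDPAKTLG TPLCPRMEDV PLLEPLICKK IAHERLTVLI

       550        560
FLEDCIVTAC QEGFICTWGR PGKVVSFNP
"""

tail = parse_numbered_block(excerpt)
print(len(tail))   # 89
print(tail[-5:])   # VSFNP
```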


Isoform 2 [UniParc]. Length: 581 AA. Mass: 63,848 Da.
Checksum: BBCBE3C8CBEF8F75

Isoform 3 [UniParc]. Length: 195 AA. Mass: 22,205 Da.
Checksum: F21E71C2C629E6CB

Isoform 4 [UniParc]. Length: 499 AA. Mass: 55,348 Da.
Checksum: A8E7A7927C37544A

Isoform 5 [UniParc]. Length: 508 AA. Mass: 56,208 Da.
Checksum: 88D69FEEBB379AE0

Isoform 6 [UniParc]. Length: 520 AA. Mass: 57,163 Da.
Checksum: 3052279BDE983E1D

Isoform 7 [UniParc]. Length: 600 AA. Mass: 66,521 Da.
Checksum: CD7F36221219CDEA

Isoform 8 [UniParc]. Length: 86 AA. Mass: 9,820 Da.
Checksum: 9C7D52794DEBA5E3

References

[1] "Full-length cDNA libraries and normalization."
Li W.B., Gruber C., Jessee J., Polayes D.
Submitted (FEB-2003) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4).
Tissue: Neuroblastoma.
[2] "Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 7).
[3] "The DNA sequence and analysis of human chromosome 14."
Heilig R., Eckenberg R., Petit J.-L., Fonknechten N., Da Silva C., Cattolico L., Levy M., Barbe V., De Berardinis V., Ureta-Vidal A., Pelletier E., Vico V., Anthouard V., Rowen L., Madan A., Qin S., Sun H., Du H., Pepin K., Artiguenave F., Robert C., Cruaud C., Bruels T., Jaillon O., Friedlander L., Samson G., Brottier P., Cure S., Segurens B., Aniere F., Samain S., Crespeau H., Abbasi N., Aiach N., Boscus D., Dickhoff R., Dors M., Dubois I., Friedman C., Gouyvenoux M., James R., Madan A., Mairey-Estrada B., Mangenot S., Martins N., Menard M., Oztas S., Ratcliffe A., Shaffer T., Trask B., Vacherie B., Bellemere C., Belser C., Besnard-Gonnet M., Bartol-Mavel D., Boutard M., Briez-Silla S., Combette S., Dufosse-Laurent V., Ferron C., Lechaplais C., Louesse C., Muselet D., Magdelenat G., Pateau E., Petit E., Sirvain-Trukniewicz P., Trybou A., Vega-Czarny N., Bataille E., Bluet E., Bordelais I., Dubois M., Dumont C., Guerin T., Haffray S., Hammadi R., Muanga J., Pellouin V., Robert D., Wunderle E., Gauguet G., Roy A., Sainte-Marthe L., Verdier J., Verdier-Discala C., Hillier L.W., Fulton L., McPherson J., Matsuda F., Wilson R., Scarpelli C., Gyapay G., Wincker P., Saurin W., Quetier F., Waterston R., Hood L., Weissenbach J.
Nature 421:601-607(2003)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[4] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 3), VARIANT HIS-159.
Tissue: Testis.
[5] Li N., Chen T., Wan T., Zhang W., Cao X.
Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 2-569 (ISOFORM 2).
[6] "The full-ORF clone resource of the German cDNA consortium."
Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A., Wiemann S., Schupp I.
BMC Genomics 8:399-399(2007)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 232-569 (ISOFORM 1).
Tissue: Testis.
[7] "A quantitative atlas of mitotic phosphorylation."
Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008)
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-432 AND SER-434, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
Tissue: Cervix carcinoma.
[8] "Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions."
Mayya V., Lundgren D.H., Hwang S.-I., Rezaul K., Wu L., Eng J.K., Rodionov V., Han D.K.
Sci. Signal. 2:RA46-RA46(2009)
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-434, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
Tissue: Leukemic T-cell.
[9] "Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis."
Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L., Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., Mann M.
Sci. Signal. 3:RA3-RA3(2010)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
Tissue: Cervix carcinoma.
[10] "System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation."
Rigbolt K.T., Prokhorova T.A., Akimov V., Henningsen J., Johansen P.T., Kratchmarova I., Kassem M., Mann M., Olsen J.V., Blagoev B.
Sci. Signal. 4:RS3-RS3(2011)
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-432, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
[11] "N-terminal acetylome analyses and functional insights of the N-terminal acetyltransferase NatB."
Van Damme P., Lasa M., Polevoda B., Gazquez C., Elosegui-Artola A., Kim D.S., De Juan-Pardo E., Demeyer K., Hole K., Larrea E., Timmerman E., Prieto J., Arnesen T., Sherman F., Gevaert K., Aldabe R.
Proc. Natl. Acad. Sci. U.S.A. 109:12449-12454(2012)
Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT ALA-2, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS], CLEAVAGE OF INITIATOR METHIONINE [LARGE SCALE ANALYSIS].

Cross-references

Sequence databases

EMBL/GenBank/DDBJ:
   BX248274 mRNA. Translation: CAD62602.1.
   AK297727 mRNA. Translation: BAG60080.1.
   AL133223 Genomic DNA. No translation available.
   AL359402 Genomic DNA. No translation available.
   BC028387 mRNA. Translation: AAH28387.1.
   BC030654 mRNA. Translation: AAH30654.1.
   AF327354 mRNA. Translation: AAL56014.1. Different initiation.
   AL133558 mRNA. Translation: CAB63713.1.
CCDS:
   CCDS55942.1. [Q8TBZ3-8]
   CCDS55943.1. [Q8TBZ3-7]
   CCDS55944.1. [Q8TBZ3-3]
   CCDS55945.1. [Q8TBZ3-6]
   CCDS9968.1. [Q8TBZ3-2]
   CCDS9969.1. [Q8TBZ3-1]
   CCDS9970.1. [Q8TBZ3-5]
PIR: T43440.
RefSeq:
   NP_001229343.1. NM_001242414.1. [Q8TBZ3-8]
   NP_001229344.1. NM_001242415.1. [Q8TBZ3-3]
   NP_001229345.1. NM_001242416.1. [Q8TBZ3-6]
   NP_001229346.1. NM_001242417.1. [Q8TBZ3-7]
   NP_001229347.1. NM_001242418.1.
   NP_653175.2. NM_144574.3. [Q8TBZ3-1]
   NP_851808.1. NM_181291.2. [Q8TBZ3-2]
   NP_851825.1. NM_181308.2. [Q8TBZ3-5]
UniGene: Hs.36859.

3D structure databases

ProteinModelPortal: Q8TBZ3.
SMR: Q8TBZ3. Positions 223-385, 531-561.

Protein-protein interaction databases

BioGrid: 124884. 21 interactions.
IntAct: Q8TBZ3. 17 interactions.
STRING: 9606.ENSP00000335434.

PTM databases

PhosphoSite: Q8TBZ3.

Polymorphism databases

DMDM: 143811476.

Proteomic databases

MaxQB: Q8TBZ3.
PaxDb: Q8TBZ3.
PRIDE: Q8TBZ3.

Protocols and materials databases

DNASU: 91833.

Genome annotation databases

Ensembl:
   ENST00000299135; ENSP00000299135; ENSG00000140153. [Q8TBZ3-8]
   ENST00000322340; ENSP00000314209; ENSG00000140153. [Q8TBZ3-3]
   ENST00000335263; ENSP00000335434; ENSG00000140153. [Q8TBZ3-2]
   ENST00000342702; ENSP00000341037; ENSG00000140153. [Q8TBZ3-1]
   ENST00000454394; ENSP00000406084; ENSG00000140153. [Q8TBZ3-7]
   ENST00000555879; ENSP00000452470; ENSG00000140153. [Q8TBZ3-8]
   ENST00000556511; ENSP00000451633; ENSG00000140153. [Q8TBZ3-5]
   ENST00000556807; ENSP00000450636; ENSG00000140153. [Q8TBZ3-6]
GeneID: 91833.
KEGG: hsa:91833.
UCSC:
   uc001ykz.3. human. [Q8TBZ3-1]
   uc001ylc.3. human. [Q8TBZ3-3]
   uc001yld.3. human. [Q8TBZ3-2]

Organism-specific databases

CTD: 91833.
GeneCards: GC14P102606.
HGNC: HGNC:19667. WDR20.
HPA: HPA007720.
neXtProt: NX_Q8TBZ3.
PharmGKB: PA134936678.

Phylogenomic databases

eggNOG: COG2319.
HOGENOM: HOG000148775.
HOVERGEN: HBG011270.
OMA: TAQILWA.
OrthoDB: EOG7G1V5P.
PhylomeDB: Q8TBZ3.
TreeFam: TF314961.

Enzyme and pathway databases

SignaLink: Q8TBZ3.

Gene expression databases

ArrayExpress: Q8TBZ3.
Bgee: Q8TBZ3.
CleanEx: HS_WDR20.
Genevestigator: Q8TBZ3.

Family and domain databases

Gene3D: 2.130.10.10. 3 hits.
InterPro:
   IPR015943. WD40/YVTN_repeat-like_dom.
   IPR001680. WD40_repeat.
   IPR017986. WD40_repeat_dom.
Pfam: PF00400. WD40. 2 hits.
SMART: SM00320. WD40. 4 hits.
SUPFAM: SSF50978. SSF50978. 3 hits.
PROSITE:
   PS50082. WD_REPEATS_2. 1 hit.
   PS50294. WD_REPEATS_REGION. 1 hit.

Other

GenomeRNAi: 91833.
NextBio: 77478.
PRO: Q8TBZ3.

Entry information

Entry name: WDR20_HUMAN
Accession
   Primary (citable) accession number: Q8TBZ3
   Secondary accession number(s): B4DN18, E7EUY8, F8W9S4, G3V2F8, G3V5R0, H0YJJ1, Q86TU2, Q8NCN7, Q8WXX2, Q9UF86
Entry history
   Integrated into UniProtKB/Swiss-Prot: February 12, 2003
   Last sequence update: April 3, 2007
   Last modified: July 9, 2014
   This is version 110 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human chromosome 14

Human chromosome 14: entries, gene names and cross-references to MIM