Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Wilms tumor protein homolog

Gene

WT1

Organism
Sus scrofa (Pig)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at transcript leveli

Functioni

Transcription factor that plays an important role in cellular development and cell survival. Regulates the expression of numerous target genes, including EPO. Plays an essential role for development of the urogenital system. Recognizes and binds to the DNA sequence 5'-CGCCCCCGC-3'. It has a tumor suppressor as well as an oncogenic role in tumor formation. Function may be isoform-specific: isoforms lacking the KTS motif may act as transcription factors. Isoforms containing the KTS motif may bind mRNA and play a role in mRNA metabolism or splicing. Isoform 1 has lower affinity for DNA, and can bind RNA (By similarity).By similarity

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sitei424 – 4241Important for interaction with target DNABy similarity
Sitei430 – 4301Important for interaction with target DNABy similarity

Regions

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Zinc fingeri323 – 34725C2H2-type 1PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri353 – 37725C2H2-type 2PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri383 – 40523C2H2-type 3PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri414 – 43825C2H2-type 4PROSITE-ProRule annotationAdd
BLAST

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Biological processi

Transcription, Transcription regulation

Keywords - Ligandi

DNA-binding, Metal-binding, RNA-binding, Zinc

Names & Taxonomyi

Protein namesi
Recommended name:
Wilms tumor protein homolog
Gene namesi
Name:WT1
OrganismiSus scrofa (Pig)
Taxonomic identifieri9823 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaLaurasiatheriaCetartiodactylaSuinaSuidaeSus
Proteomesi
  • UP000008227 Componenti: Unplaced

Subcellular locationi

Isoform 1 :
Isoform 4 :

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Cytoplasm, Nucleus

Pathology & Biotechi

Keywords - Diseasei

Tumor suppressor

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 449449Wilms tumor protein homologPRO_0000047133Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Cross-linki73 – 73Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO)By similarity
Cross-linki177 – 177Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO)By similarity

Keywords - PTMi

Isopeptide bond, Ubl conjugation

Proteomic databases

PaxDbiO62651.
PeptideAtlasiO62651.

Expressioni

Developmental stagei

Expressed during kidney development.

Interactioni

Subunit structurei

Interacts with ZNF224 via the zinc-finger region. Interacts with WTAP, AMER1 and SRY. Homodimer. Interacts with WTIP. Interacts with actively translating polysomes. Detected in nuclear ribonucleoprotein (mRNP) particles. Interacts with U2AF2. Interacts with HNRNPU via the zinc-finger region. Interacts with CITED2. Interacts with RBM4 (By similarity).By similarity

GO - Molecular functioni

Protein-protein interaction databases

STRINGi9823.ENSSSCP00000019417.

Structurei

3D structure databases

ProteinModelPortaliO62651.
SMRiO62651. Positions 318-438.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni367 – 38115Important for interaction with target DNABy similarityAdd
BLAST
Regioni393 – 4019Important for interaction with target DNABy similarity

Motif

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Motifi408 – 4103KTS motifBy similarity

Compositional bias

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Compositional biasi28 – 8356Pro-richAdd
BLAST

Sequence similaritiesi

Contains 4 C2H2-type zinc fingers.PROSITE-ProRule annotation

Zinc finger

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Zinc fingeri323 – 34725C2H2-type 1PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri353 – 37725C2H2-type 2PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri383 – 40523C2H2-type 3PROSITE-ProRule annotationAdd
BLAST
Zinc fingeri414 – 43825C2H2-type 4PROSITE-ProRule annotationAdd
BLAST

Keywords - Domaini

Repeat, Zinc-finger

Phylogenomic databases

eggNOGiKOG1721. Eukaryota.
COG5048. LUCA.
HOVERGENiHBG006960.
InParanoidiO62651.
KOiK09234.

Family and domain databases

Gene3Di3.30.160.60. 4 hits.
InterProiIPR017987. Wilms_tumour.
IPR000976. Wilms_tumour_N.
IPR007087. Znf_C2H2.
IPR015880. Znf_C2H2-like.
IPR013087. Znf_C2H2/integrase_DNA-bd.
[Graphical view]
PfamiPF02165. WT1. 1 hit.
PF00096. zf-C2H2. 1 hit.
[Graphical view]
PRINTSiPR00049. WILMSTUMOUR.
SMARTiSM00355. ZnF_C2H2. 4 hits.
[Graphical view]
PROSITEiPS00028. ZINC_FINGER_C2H2_1. 4 hits.
PS50157. ZINC_FINGER_C2H2_2. 4 hits.
[Graphical view]

Sequences (4)i

Sequence statusi: Complete.

This entry describes 4 isoformsi produced by alternative splicing. AlignAdd to basket

Note: Experimental confirmation may be lacking for some isoforms.
Isoform 1 (identifier: O62651-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MGSDVRDLNA LLPAVPSLGG GGGCALPVSG AAEWAPVLDF APPGASAYGS
60 70 80 90 100
LGGPAPPPAP PPPPPPPPHS FIKQEPSWGG AEPHEEQCLS AFTVHFSGQF
110 120 130 140 150
TGTAGACRYE PFGPPPPSQA SSGQARMFPN APYLPSCLES QPAIRNQGYS
160 170 180 190 200
TVTFDGTPSY GHTPSHHAAQ FPNHSFKHED PMGQQGSLGE QQYSVPPPVY
210 220 230 240 250
GCHTSTDSCT GSQALLLRTP YSSDNLYQMT SQLECMTWNQ MNLGATLKGV
260 270 280 290 300
AAGTSSSMKW TEGQSNHGAG YESDSHATPI LCGAQYRIHT HGVFRGIQDV
310 320 330 340 350
RRVPGVAPTL VRSASETSEK RPFMCAYPGC NKRYFKLSHL QMHSRKHTGE
360 370 380 390 400
KPYQCDFKDC ERRFSRSDQL KRHQRRHTGV KPFQCKTCQR KFSRSDHLKT
410 420 430 440
HTRTHTGKTS EKPFSCRWPS CQKKFARSDE LVRHHNMHQR NMSKLQLAL
Length:449
Mass (Da):49,166
Last modified:August 1, 1998 - v1
Checksum:i9C3E557B96F5A7B3
GO
Isoform 2 (identifier: O62651-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     250-266: Missing.
     408-410: Missing.

Show »
Length:429
Mass (Da):47,127
Checksum:iBB1FC9A497B09469
GO
Isoform 3 (identifier: O62651-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     250-266: Missing.

Show »
Length:432
Mass (Da):47,443
Checksum:i891551FAB065275F
GO
Isoform 4 (identifier: O62651-4) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     408-410: Missing.

Show »
Length:446
Mass (Da):48,850
Checksum:iAA3959247BAA0A56
GO

RNA editingi

Edited at position 281.By similarity
Partially edited.By similarity

Natural variant

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Natural varianti281 – 2811L → P in RNA edited version.

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei250 – 26617Missing in isoform 2 and isoform 3. 1 PublicationVSP_006870Add
BLAST
Alternative sequencei408 – 4103Missing in isoform 2 and isoform 4. 1 PublicationVSP_006871

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AB010969 mRNA. Translation: BAA28147.1.
RefSeqiNP_001001264.1. NM_001001264.1. [O62651-1]
UniGeneiSsc.20977.

Genome annotation databases

GeneIDi397338.
KEGGissc:397338.

Keywords - Coding sequence diversityi

Alternative splicing, RNA editing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AB010969 mRNA. Translation: BAA28147.1.
RefSeqiNP_001001264.1. NM_001001264.1. [O62651-1]
UniGeneiSsc.20977.

3D structure databases

ProteinModelPortaliO62651.
SMRiO62651. Positions 318-438.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi9823.ENSSSCP00000019417.

Proteomic databases

PaxDbiO62651.
PeptideAtlasiO62651.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

GeneIDi397338.
KEGGissc:397338.

Organism-specific databases

CTDi7490.

Phylogenomic databases

eggNOGiKOG1721. Eukaryota.
COG5048. LUCA.
HOVERGENiHBG006960.
InParanoidiO62651.
KOiK09234.

Family and domain databases

Gene3Di3.30.160.60. 4 hits.
InterProiIPR017987. Wilms_tumour.
IPR000976. Wilms_tumour_N.
IPR007087. Znf_C2H2.
IPR015880. Znf_C2H2-like.
IPR013087. Znf_C2H2/integrase_DNA-bd.
[Graphical view]
PfamiPF02165. WT1. 1 hit.
PF00096. zf-C2H2. 1 hit.
[Graphical view]
PRINTSiPR00049. WILMSTUMOUR.
SMARTiSM00355. ZnF_C2H2. 4 hits.
[Graphical view]
PROSITEiPS00028. ZINC_FINGER_C2H2_1. 4 hits.
PS50157. ZINC_FINGER_C2H2_2. 4 hits.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiWT1_PIG
AccessioniPrimary (citable) accession number: O62651
Entry historyi
Integrated into UniProtKB/Swiss-Prot: December 15, 1998
Last sequence update: August 1, 1998
Last modified: July 6, 2016
This is version 109 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Miscellaneous

Presence of the KTS motif hinders interactions between DNA and zinc-finger 4.By similarity

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.