Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Piwi-like protein 4

Gene

Piwil4

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Plays a central role during spermatogenesis by repressing transposable elements and preventing their mobilization, which is essential for the germline integrity. Acts via the piRNA metabolic process, which mediates the repression of transposable elements during meiosis by forming complexes composed of piRNAs and Piwi proteins and governs the methylation and subsequent repression of transposons. Directly binds piRNAs, a class of 24 to 30 nucleotide RNAs that are generated by a Dicer-independent mechanism and are primarily derived from transposons and other repeated sequence elements. Associates with secondary piRNAs antisense and PIWIL2/MILI is required for such association. The piRNA process acts upstream of known mediators of DNA methylation. Participates in a piRNA amplification loop. Besides their function in transposable elements repression, piRNAs are probably involved in other processes during meiosis such as translation regulation.4 Publications

GO - Molecular functioni

  • piRNA binding Source: UniProtKB

GO - Biological processi

  • cell differentiation Source: UniProtKB-KW
  • DNA methylation involved in gamete generation Source: UniProtKB
  • gene silencing by RNA Source: UniProtKB
  • meiotic cell cycle Source: UniProtKB-KW
  • multicellular organism development Source: UniProtKB-KW
  • negative regulation of transposition Source: MGI
  • piRNA metabolic process Source: UniProtKB
  • regulation of translation Source: UniProtKB-KW
  • spermatogenesis Source: UniProtKB
Complete GO annotation...

Keywords - Molecular functioni

Developmental protein

Keywords - Biological processi

Differentiation, Meiosis, RNA-mediated gene silencing, Spermatogenesis, Translation regulation

Keywords - Ligandi

RNA-binding

Enzyme and pathway databases

ReactomeiR-MMU-5601884. PIWI-interacting RNA (piRNA) biogenesis.

Names & Taxonomyi

Protein namesi
Recommended name:
Piwi-like protein 4
Short name:
mAgo5
Gene namesi
Name:Piwil4
Synonyms:Miwi2
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 9

Organism-specific databases

MGIiMGI:3041167. Piwil4.

Subcellular locationi

GO - Cellular componenti

  • cytoplasm Source: UniProtKB
  • cytosol Source: Reactome
  • nucleoplasm Source: Reactome
  • nucleus Source: UniProtKB
  • P granule Source: UniProtKB
  • piP-body Source: UniProtKB
Complete GO annotation...

Keywords - Cellular componenti

Cytoplasm, Nucleus

Pathology & Biotechi

Disruption phenotypei

Males show spermatogenesis defects due to demethylation and subsequent derepression of transposable elements. Meiotic-progression is impaired during early prophase of meiosis I, followed by a marked and progressive loss of germ cells with age.2 Publications

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00002345731 – 848Piwi-like protein 4Add BLAST848

Post-translational modificationi

Arginine methylation by PRMT5 is required for the interaction with Tudor domain-containing protein (TDRD1, TDRKH/TDRD2 and TDRD9) and subsequent localization to the meiotic nuage, also named P granule.1 Publication

Keywords - PTMi

Methylation

Proteomic databases

PeptideAtlasiQ8CGT6.
PRIDEiQ8CGT6.

PTM databases

iPTMnetiQ8CGT6.
PhosphoSitePlusiQ8CGT6.

Expressioni

Tissue specificityi

Detected in male germ cells beginning around 14.5-15.5 dpc but is absent from female germ cells. From 15.5 dpc and until birth, it is present in male germ cells. Not detected in somatic cells of the embryonic gonad. Expression declines soon after birth, reaching undetectable levels in 4 day-old mice (at protein level).1 Publication

Gene expression databases

BgeeiENSMUSG00000036912.
CleanExiMM_PIWIL4.
ExpressionAtlasiQ8CGT6. baseline and differential.
GenevisibleiQ8CGT6. MM.

Interactioni

Subunit structurei

Interacts with PRMT5 and WDR77 (PubMed:19584108). Interacts (when methylated on arginine residues) with TDRD1, TDRKH/TDRD2 and TDRD9 (PubMed:19584108, PubMed:23714778). Interacts with MOV10L1 (PubMed:20534472, PubMed:20547853).4 Publications

Structurei

3D structure databases

ProteinModelPortaliQ8CGT6.
SMRiQ8CGT6.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini263 – 379PAZPROSITE-ProRule annotationAdd BLAST117
Domaini541 – 834PiwiPROSITE-ProRule annotationAdd BLAST294

Sequence similaritiesi

Belongs to the argonaute family. Piwi subfamily.Curated
Contains 1 PAZ domain.PROSITE-ProRule annotation
Contains 1 Piwi domain.PROSITE-ProRule annotation

Phylogenomic databases

GeneTreeiENSGT00760000119148.
HOGENOMiHOG000254789.
HOVERGENiHBG049411.
InParanoidiQ8CGT6.
KOiK02156.
OMAiQATSDFQ.
PhylomeDBiQ8CGT6.
TreeFamiTF354206.

Family and domain databases

Gene3Di3.30.420.10. 1 hit.
InterProiIPR003100. PAZ_dom.
IPR003165. Piwi.
IPR012337. RNaseH-like_dom.
[Graphical view]
PfamiPF02170. PAZ. 1 hit.
PF02171. Piwi. 1 hit.
[Graphical view]
SMARTiSM00949. PAZ. 1 hit.
SM00950. Piwi. 1 hit.
[Graphical view]
SUPFAMiSSF101690. SSF101690. 1 hit.
SSF53098. SSF53098. 1 hit.
PROSITEiPS50821. PAZ. 1 hit.
PS50822. PIWI. 1 hit.
[Graphical view]

Sequences (3)i

Sequence statusi: Complete.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q8CGT6-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MSGRARVRAR GITTGHSARE VGRSSRDLMV TSASPGDSEA GGGTSVISQP
60 70 80 90 100
YELGVSSGDG GRTFMERRGK GRQDFEELGV CTREKLTHVK DCKTGSSGIP
110 120 130 140 150
VRLVTNLFNL DLPQDWQLYQ YHVTYSPDLA SRRLRIALLY NHSILSDKAK
160 170 180 190 200
AFDGASLFLS EKLDQKVTEL TSETQRGETI KITLTLTSKL FPNSPVCIQF
210 220 230 240 250
FNVIFRKILK NLSMYQIGRN FYKPSEPVEI PQYKLSLWPG FAISVSHFES
260 270 280 290 300
KLLFNADVNY KVLRNETVLD FMTDLCLRTG MSCFTEMCHK QLVGLVVLTR
310 320 330 340 350
YNNKTYRIDD IDWSVKPTQA FQKRDGSEVT YVDYYKQQYD ITLSDLNQPV
360 370 380 390 400
LVSLLKRKRN DNSEPQMVHL MPELCFLTGL SSQATSDFRL MKAVAEETRL
410 420 430 440 450
SPVGRQQQLA RLVDDIQRNP VARFELETWG LHFGSQLSLT GRVVPSEKIL
460 470 480 490 500
LQDHTCQPAF AADWSKDMRS CKVLSSQPLN RWLIVCCNRA EHLIEAFLSC
510 520 530 540 550
LRRVGGSMGF NVGYPKIIKV DETPAAFLRA IQVHGDPDVQ LVMCILPSNQ
560 570 580 590 600
KNYYDSIKKY LSSDCPVPSQ CVLTRTLNKQ GTMLSVATKI AMQMTCKLGG
610 620 630 640 650
ELWSVEIPLK SLMVVGIDIC RDALNKNVVV VGFVASINSR ITRWFSRCVL
660 670 680 690 700
QRTAADIADC LKVCMTGALN RWYRHNHDLP ARIVVYRDGV GNGQLKAVLE
710 720 730 740 750
YEVPQLLKSV TECGSDARSC RLSVVVVRKR CLLRLFASTD HTVQNPPLGT
760 770 780 790 800
VVDSEATRPE WYDFYLISQT ANRGTVSPTH YNVIYDDNAL KPDHMQRLTF
810 820 830 840
KLCHLYYNWQ GLISVPAPCQ YAHKLTFLVA QSVHKEPSLE LANNLFYL
Note: No experimental confirmation available.
Length:848
Mass (Da):95,773
Last modified:March 24, 2009 - v3
Checksum:i907DB987D47947FD
GO
Isoform 2 (identifier: Q8CGT6-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     811-848: GLISVPAPCQYAHKLTFLVAQSVHKEPSLELANNLFYL → SCHGIQAALKPVTVPLPQLLSTGTMQ

Note: No experimental confirmation available.
Show »
Length:836
Mass (Da):94,252
Checksum:i6E060595D0C1117F
GO
Isoform 3 (identifier: Q8CGT6-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     761-761: Missing.

Show »
Length:847
Mass (Da):95,587
Checksum:i08D0A952655C0974
GO

Sequence cautioni

The sequence AAN75583 differs from that shown. Reason: Erroneous gene model prediction.Curated
The sequence BAC28462 differs from that shown. Reason: Frameshift at position 745.Curated
The sequence CAO77951 differs from that shown. Reason: Erroneous gene model prediction.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti644W → R in BAF65667 (PubMed:18381894).Curated1
Sequence conflicti808N → D in BAF65667 (PubMed:18381894).Curated1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_036666761Missing in isoform 3. 1 Publication1
Alternative sequenceiVSP_018376811 – 848GLISV…NLFYL → SCHGIQAALKPVTVPLPQLL STGTMQ in isoform 2. 1 PublicationAdd BLAST38

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AY135692 Genomic DNA. Translation: AAN75583.1. Sequence problems.
AB258534 mRNA. Translation: BAF65667.1.
CT030247 Genomic DNA. Translation: CAO77949.1.
CT030247 Genomic DNA. Translation: CAO77951.1. Sequence problems.
AK033746 mRNA. Translation: BAC28462.1. Frameshift.
RefSeqiNP_808573.2. NM_177905.3.
UniGeneiMm.334969.

Genome annotation databases

EnsembliENSMUST00000115644; ENSMUSP00000111308; ENSMUSG00000036912. [Q8CGT6-1]
GeneIDi330890.
KEGGimmu:330890.
UCSCiuc009oet.1. mouse. [Q8CGT6-1]
uc009oev.1. mouse. [Q8CGT6-3]

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AY135692 Genomic DNA. Translation: AAN75583.1. Sequence problems.
AB258534 mRNA. Translation: BAF65667.1.
CT030247 Genomic DNA. Translation: CAO77949.1.
CT030247 Genomic DNA. Translation: CAO77951.1. Sequence problems.
AK033746 mRNA. Translation: BAC28462.1. Frameshift.
RefSeqiNP_808573.2. NM_177905.3.
UniGeneiMm.334969.

3D structure databases

ProteinModelPortaliQ8CGT6.
SMRiQ8CGT6.
ModBaseiSearch...
MobiDBiSearch...

PTM databases

iPTMnetiQ8CGT6.
PhosphoSitePlusiQ8CGT6.

Proteomic databases

PeptideAtlasiQ8CGT6.
PRIDEiQ8CGT6.

Protocols and materials databases

DNASUi330890.
Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000115644; ENSMUSP00000111308; ENSMUSG00000036912. [Q8CGT6-1]
GeneIDi330890.
KEGGimmu:330890.
UCSCiuc009oet.1. mouse. [Q8CGT6-1]
uc009oev.1. mouse. [Q8CGT6-3]

Organism-specific databases

CTDi143689.
MGIiMGI:3041167. Piwil4.

Phylogenomic databases

GeneTreeiENSGT00760000119148.
HOGENOMiHOG000254789.
HOVERGENiHBG049411.
InParanoidiQ8CGT6.
KOiK02156.
OMAiQATSDFQ.
PhylomeDBiQ8CGT6.
TreeFamiTF354206.

Enzyme and pathway databases

ReactomeiR-MMU-5601884. PIWI-interacting RNA (piRNA) biogenesis.

Miscellaneous databases

PROiQ8CGT6.
SOURCEiSearch...

Gene expression databases

BgeeiENSMUSG00000036912.
CleanExiMM_PIWIL4.
ExpressionAtlasiQ8CGT6. baseline and differential.
GenevisibleiQ8CGT6. MM.

Family and domain databases

Gene3Di3.30.420.10. 1 hit.
InterProiIPR003100. PAZ_dom.
IPR003165. Piwi.
IPR012337. RNaseH-like_dom.
[Graphical view]
PfamiPF02170. PAZ. 1 hit.
PF02171. Piwi. 1 hit.
[Graphical view]
SMARTiSM00949. PAZ. 1 hit.
SM00950. Piwi. 1 hit.
[Graphical view]
SUPFAMiSSF101690. SSF101690. 1 hit.
SSF53098. SSF53098. 1 hit.
PROSITEiPS50821. PAZ. 1 hit.
PS50822. PIWI. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiPIWL4_MOUSE
AccessioniPrimary (citable) accession number: Q8CGT6
Secondary accession number(s): A6P4B9
, A6X963, A6X964, Q8CC75
Entry historyi
Integrated into UniProtKB/Swiss-Prot: May 16, 2006
Last sequence update: March 24, 2009
Last modified: November 30, 2016
This is version 102 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.