Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

YLP motif-containing protein 1

Gene

YLPM1

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Plays a role in the reduction of telomerase activity during differentiation of embryonic stem cells by binding to the core promoter of TERT and controlling its down-regulation.By similarity

GO - Molecular functioni

  • poly(A) RNA binding Source: UniProtKB

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Repressor

Keywords - Biological processi

Transcription, Transcription regulation

Enzyme and pathway databases

BioCyciZFISH:ENSG00000119596-MONOMER.

Names & Taxonomyi

Protein namesi
Recommended name:
YLP motif-containing protein 1
Alternative name(s):
Nuclear protein ZAP3
ZAP113
Gene namesi
Name:YLPM1
Synonyms:C14orf170, ZAP3
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 14

Organism-specific databases

HGNCiHGNC:17798. YLPM1.

Subcellular locationi

GO - Cellular componenti

  • cytoplasm Source: HPA
  • nuclear speck Source: UniProtKB-SubCell
  • nucleoplasm Source: HPA
  • nucleus Source: UniProtKB
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Organism-specific databases

DisGeNETi56252.
OpenTargetsiENSG00000119596.
PharmGKBiPA134962086.

Polymorphism and mutation databases

BioMutaiYLPM1.
DMDMi57015374.

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000662981 – 1951YLP motif-containing protein 1Add BLAST1951

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei540N6-methyllysineCombined sources1
Modified residuei561PhosphoserineCombined sources1
Modified residuei619Omega-N-methylarginineCombined sources1
Modified residuei634PhosphoserineCombined sources1
Modified residuei636Omega-N-methylarginineCombined sources1
Cross-linki788Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO1)Combined sources
Cross-linki858Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO1)Combined sources
Cross-linki858Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)Combined sources
Modified residuei905PhosphoserineCombined sources1
Modified residuei924PhosphoserineCombined sources1
Modified residuei1207PhosphoserineCombined sources1

Keywords - PTMi

Isopeptide bond, Methylation, Phosphoprotein, Ubl conjugation

Proteomic databases

EPDiP49750.
MaxQBiP49750.
PaxDbiP49750.
PeptideAtlasiP49750.
PRIDEiP49750.

PTM databases

iPTMnetiP49750.
PhosphoSitePlusiP49750.

Expressioni

Tissue specificityi

Expressed in neuronal, neuroblastoma and embryonic kidney cell lines (at protein level).1 Publication

Gene expression databases

BgeeiENSG00000119596.
CleanExiHS_YLPM1.
ExpressionAtlasiP49750. baseline and differential.
GenevisibleiP49750. HS.

Organism-specific databases

HPAiHPA048070.
HPA061123.

Interactioni

Subunit structurei

Interacts with PPP1CA and NCOA5. Forms a complex with ILF2, ILF3, KHDRBS1, RBMX, NCOA5 and PPP1CA (By similarity).By similarity

Binary interactionsi

WithEntry#Exp.IntActNotes
CACNA1AO005552EBI-712871,EBI-766279

Protein-protein interaction databases

BioGridi121117. 48 interactors.
DIPiDIP-53676N.
IntActiP49750. 27 interactors.
STRINGi9606.ENSP00000324463.

Structurei

3D structure databases

ProteinModelPortaliP49750.
SMRiP49750.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni1901 – 1908Involved in interaction with PPP1CABy similarity8

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi15 – 205Pro-richAdd BLAST191
Compositional biasi382 – 430Gln-richAdd BLAST49
Compositional biasi807 – 1209Arg-richAdd BLAST403
Compositional biasi1488 – 1577Arg-richAdd BLAST90

Phylogenomic databases

eggNOGiKOG2400. Eukaryota.
ENOG410YIR5. LUCA.
GeneTreeiENSGT00440000039837.
HOGENOMiHOG000168351.
HOVERGENiHBG079363.
InParanoidiP49750.
KOiK17602.
OMAiMETQMDK.
OrthoDBiEOG091G01X4.
PhylomeDBiP49750.
TreeFamiTF329361.

Family and domain databases

Gene3Di3.40.50.300. 1 hit.
InterProiIPR027417. P-loop_NTPase.
IPR026314. YLP_motif_con_p1.
[Graphical view]
PANTHERiPTHR13413. PTHR13413. 1 hit.
SUPFAMiSSF52540. SSF52540. 1 hit.

Sequences (3)i

Sequence statusi: Complete.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: P49750-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MYPNWGRYGG SSHYPPPPVP PPPPVALPEA SPGPGYSSST TPAAPSSSGF
60 70 80 90 100
MSFREQHLAQ LQQLQQMHQK QMQCVLQPHH LPPPPLPPPP VMPGGGYGDW
110 120 130 140 150
QPPPPPMPPP PGPALSYQKQ QQYKHQMLHH QRDGPPGLVP MELESPPESP
160 170 180 190 200
PVPPGSYMPP SQSYMPPPQP PPSYYPPTSS QPYLPPAQPS PSQSPPSQSY
210 220 230 240 250
LAPTPSYSSS SSSSQSYLSH SQSYLPSSQA SPSRPSQGHS KSQLLAPPPP
260 270 280 290 300
SAPPGNKTTV QQEPLESGAK NKSTEQQQAA PEPDPSTMTP QEQQQYWYRQ
310 320 330 340 350
HLLSLQQRTK VHLPGHKKGP VVAKDTPEPV KEEVTVPATS QVPESPSSEE
360 370 380 390 400
PPLPPPNEEV PPPLPPEEPQ SEDPEEDARL KQLQAAAAHW QQHQQHRVGF
410 420 430 440 450
QYQGIMQKHT QLQQILQQYQ QIIQPPPHIQ ATTPPPGIPP PGVPQGIPPQ
460 470 480 490 500
LTAAPVPPAS SSQSSQVPEK PRPALLPTPV SFGSAPPTTY HPPLQSAGPS
510 520 530 540 550
EQVNSKAPLS KSALPYSSFS SDQGLGESSA APSQPITAVK DMPVRSGGLL
560 570 580 590 600
PDPPRSSYLE SPRGPRFDGP RRFEDLGSRC EGPRPKGPRF EGNRPDGPRP
610 620 630 640 650
RYEGHPAEGT KSKWGMIPRG PASQFYITPS TSLSPRQSGP QWKGPKPAFG
660 670 680 690 700
QQHQQQPKSQ AEPLSGNKEP LADTSSNQQK NFKMQSAAFS IAADVKDVKA
710 720 730 740 750
AQSNENLSDS QQEPPKSEVS EGPVEPSNWD QNVQSMETQI DKAQAVTQPV
760 770 780 790 800
PLANKPVPAQ STFPSKTGGM EGGTAVATSS LTADNDFKPV GIGLPHSENN
810 820 830 840 850
QDKGLPRPDN RDNRLEGNRG NSSSYRGPGQ SRMEDTRDKG LVNRGRGQAI
860 870 880 890 900
SRGPGLVKQE DFRDKMMGRR EDSREKMNRG EGSRDRGLVR PGSSREKVPG
910 920 930 940 950
GLQGSQDRGA AGSRERGPPR RAGSQERGPL RRAGSRERIP PRRAGSRERG
960 970 980 990 1000
PPRGPGSRER GLGRSDFGRD RGPFRPEPGD GGEKMYPYHR DEPPRAPWNH
1010 1020 1030 1040 1050
GEERGHEEFP LDGRNAPMER ERLDDWDRER YWRECERDYQ DDTLELYNRE
1060 1070 1080 1090 1100
DRFSAPPSRS HDGDRRGPWW DDWERDQDMD EDYNREMERD MDRDVDRISR
1110 1120 1130 1140 1150
PMDMYDRSLD NEWDRDYGRP LDEQESQFRE RDIPSLPPLP PLPPLPPLDR
1160 1170 1180 1190 1200
YRDDRWREER NREHGYDRDF RDRGELRIRE YPERGDTWRE KRDYVPDRMD
1210 1220 1230 1240 1250
WERERLSDRW YPSDVDRHSP MAEHMPSSHH SSEMMGSDAS LDSDQGLGGV
1260 1270 1280 1290 1300
MVLSQRQHEI ILKAAQELKM LREQKEQLQK MKDFGSEPQM ADHLPPQESR
1310 1320 1330 1340 1350
LQNTSSRPGM YPPPGSYRPP PPMGKPPGSI VRPSAPPARS SVPVTRPPVP
1360 1370 1380 1390 1400
IPPPPPPPPL PPPPPVIKPQ TSAVEQERWD EDSFYGLWDT NDEQGLNSEF
1410 1420 1430 1440 1450
KSETAAIPSA PVLPPPPVHS SIPPPGPVPM GMPPMSKPPP VQQTVDYGHG
1460 1470 1480 1490 1500
RDISTNKVEQ IPYGERITLR PDPLPERSTF ETEHAGQRDR YDRERDREPY
1510 1520 1530 1540 1550
FDRQSNVIAD HRDFKRDRET HRDRDRDRGV IDYDRDRFDR ERRPRDDRAQ
1560 1570 1580 1590 1600
SYRDKKDHSS SRRGGFDRPS YDRKSDRPVY EGPSMFGGER RTYPEERMPL
1610 1620 1630 1640 1650
PAPSLSHQPP PAPRVEKKPE SKNVDDILKP PGRESRPERI VVIMRGLPGS
1660 1670 1680 1690 1700
GKTHVAKLIR DKEVEFGGPA PRVLSLDDYF ITEVEKEEKD PDSGKKVKKK
1710 1720 1730 1740 1750
VMEYEYEAEM EETYRTSMFK TFKKTLDDGF FPFIILDAIN DRVRHFDQFW
1760 1770 1780 1790 1800
SAAKTKGFEV YLAEMSADNQ TCGKRNIHGR KLKEINKMAD HWETAPRHMM
1810 1820 1830 1840 1850
RLDIRSLLQD AAIEEVEMED FDANIEEQKE EKKDAEEEES ELGYIPKSKW
1860 1870 1880 1890 1900
EMDTSEAKLD KLDGLRTGTK RKRDWEAIAS RMEDYLQLPD DYDTRASEPG
1910 1920 1930 1940 1950
KKRVRWADLE EKKDADRKRA IGFVVGQTDW EKITDESGHL AEKALNRTKY

I
Note: No experimental confirmation available.
Length:1,951
Mass (Da):219,985
Last modified:January 4, 2005 - v3
Checksum:i11A41E942E59FFE0
GO
Isoform 3 (identifier: P49750-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1843-1862: GYIPKSKWEMDTSEAKLDKL → VGDRPTTLNSVSLLKFLKKV
     1863-1951: Missing.

Show »
Length:1,862
Mass (Da):209,483
Checksum:i98C60B71D7C50896
GO
Isoform 4 (identifier: P49750-4) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     431-432: AT → TMSVDMQLRH...VMPLPPLSSA

Show »
Length:2,146
Mass (Da):241,645
Checksum:i2C8F694D43061047
GO

Sequence cautioni

The sequence AAC42008 differs from that shown. Reason: Frameshift at position 1723.Curated
The sequence AAF61275 differs from that shown. Reason: Erroneous gene model prediction.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti421Q → R in AK095760 (PubMed:14702039).Curated1
Sequence conflicti621P → S in AAC42006 (PubMed:7596406).Curated1
Sequence conflicti1404T → I in AAC42008 (PubMed:7596406).Curated1
Isoform 4 (identifier: P49750-4)
Sequence conflicti524S → P in AK095760 (PubMed:14702039).Curated1
Isoform 3 (identifier: P49750-3)
Sequence conflicti1861K → E in AAC42008 (PubMed:7596406).Curated1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_040649431 – 432AT → TMSVDMQLRHYEMQQQQFQH LYQEWEREFQLWEEQLHSYP HKDQLQEYEKQWKTWQGHMK ATQSYLQEKVNSFQNMKNQY MGNMSMPPPFVPYSQMPPPL PTMPPPVLPPSLPPPVMPPA LPATVPPPGMPPPVMPPSLP TSVPPPGMPPSLSSAGPPPV LPPPSLSSAGPPPVLPPPSL SSTAPPPVMPLPPLSSA in isoform 4. 1 Publication2
Alternative sequenceiVSP_0125381843 – 1862GYIPK…KLDKL → VGDRPTTLNSVSLLKFLKKV in isoform 3. 1 PublicationAdd BLAST20
Alternative sequenceiVSP_0125391863 – 1951Missing in isoform 3. 1 PublicationAdd BLAST89

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AC007956 Genomic DNA. Translation: AAF61275.1. Sequence problems.
AK095760 mRNA. No translation available.
L40403 mRNA. Translation: AAC42008.1. Frameshift.
L40400 mRNA. Translation: AAC42006.1.
BC007792 mRNA. Translation: AAH07792.1.
CCDSiCCDS45135.1. [P49750-4]
RefSeqiNP_062535.2. NM_019589.2. [P49750-4]
UniGeneiHs.531111.

Genome annotation databases

EnsembliENST00000325680; ENSP00000324463; ENSG00000119596. [P49750-4]
GeneIDi56252.
KEGGihsa:56252.
UCSCiuc001xqj.5. human. [P49750-1]

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AC007956 Genomic DNA. Translation: AAF61275.1. Sequence problems.
AK095760 mRNA. No translation available.
L40403 mRNA. Translation: AAC42008.1. Frameshift.
L40400 mRNA. Translation: AAC42006.1.
BC007792 mRNA. Translation: AAH07792.1.
CCDSiCCDS45135.1. [P49750-4]
RefSeqiNP_062535.2. NM_019589.2. [P49750-4]
UniGeneiHs.531111.

3D structure databases

ProteinModelPortaliP49750.
SMRiP49750.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi121117. 48 interactors.
DIPiDIP-53676N.
IntActiP49750. 27 interactors.
STRINGi9606.ENSP00000324463.

PTM databases

iPTMnetiP49750.
PhosphoSitePlusiP49750.

Polymorphism and mutation databases

BioMutaiYLPM1.
DMDMi57015374.

Proteomic databases

EPDiP49750.
MaxQBiP49750.
PaxDbiP49750.
PeptideAtlasiP49750.
PRIDEiP49750.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000325680; ENSP00000324463; ENSG00000119596. [P49750-4]
GeneIDi56252.
KEGGihsa:56252.
UCSCiuc001xqj.5. human. [P49750-1]

Organism-specific databases

CTDi56252.
DisGeNETi56252.
GeneCardsiYLPM1.
HGNCiHGNC:17798. YLPM1.
HPAiHPA048070.
HPA061123.
neXtProtiNX_P49750.
OpenTargetsiENSG00000119596.
PharmGKBiPA134962086.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG2400. Eukaryota.
ENOG410YIR5. LUCA.
GeneTreeiENSGT00440000039837.
HOGENOMiHOG000168351.
HOVERGENiHBG079363.
InParanoidiP49750.
KOiK17602.
OMAiMETQMDK.
OrthoDBiEOG091G01X4.
PhylomeDBiP49750.
TreeFamiTF329361.

Enzyme and pathway databases

BioCyciZFISH:ENSG00000119596-MONOMER.

Miscellaneous databases

ChiTaRSiYLPM1. human.
GeneWikiiYLPM1.
GenomeRNAii56252.
PROiP49750.

Gene expression databases

BgeeiENSG00000119596.
CleanExiHS_YLPM1.
ExpressionAtlasiP49750. baseline and differential.
GenevisibleiP49750. HS.

Family and domain databases

Gene3Di3.40.50.300. 1 hit.
InterProiIPR027417. P-loop_NTPase.
IPR026314. YLP_motif_con_p1.
[Graphical view]
PANTHERiPTHR13413. PTHR13413. 1 hit.
SUPFAMiSSF52540. SSF52540. 1 hit.
ProtoNetiSearch...

Entry informationi

Entry nameiYLPM1_HUMAN
AccessioniPrimary (citable) accession number: P49750
Secondary accession number(s): P49752, Q96I64, Q9P1V7
Entry historyi
Integrated into UniProtKB/Swiss-Prot: October 1, 1996
Last sequence update: January 4, 2005
Last modified: November 2, 2016
This is version 133 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 14
    Human chromosome 14: entries, gene names and cross-references to MIM

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.