Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q5XJE5 (LEO1_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 79. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
RNA polymerase-associated protein LEO1
Gene names
Name:Leo1
Synonyms:Gm185
OrganismMus musculus (Mouse) [Reference proteome]
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus

Protein attributes

Sequence length667 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Component of the PAF1 complex (PAF1C) which has multiple functions during transcription by RNA polymerase II and is implicated in regulation of development and maintenance of embryonic stem cell pluripotency. PAF1C associates with RNA polymerase II through interaction with POLR2A CTD non-phosphorylated and 'Ser-2'- and 'Ser-5'-phosphorylated forms and is involved in transcriptional elongation, acting both indepentently and synergistically with TCEA1 and in cooperation with the DSIF complex and HTATSF1. PAF1C is required for transcription of Hox and Wnt target genes. PAF1C is involved in hematopoiesis and stimulates transcriptional activity of MLL1. PAF1C is involved in histone modifications such as ubiquitination of histone H2B and methylation on histone H3 'Lys-4' (H3K4me3). PAF1C recruits the RNF20/40 E3 ubiquitin-protein ligase complex and the E2 enzyme UBE2A or UBE2B to chromatin which mediate monoubiquitination of 'Lys-120' of histone H2B (H2BK120ub1); UB2A/B-mediated H2B ubiquitination is proposed to be coupled to transcription. PAF1C is involved in mRNA 3' end formation probably through association with cleavage and poly(A) factors. Involved in polyadenylation of mRNA precursors. Connects PAF1C to Wnt signaling By similarity. Ref.7

Subunit structure

Component of the PAF1 complex, which consists of CDC73, PAF1, LEO1, CTR9, RTF1 and WDR61. Interacts with TCEA1, SUPT5H and CTNNB1 By similarity.

Subcellular location

Nucleus By similarity.

Sequence similarities

Belongs to the LEO1 family.

Ontologies

Keywords
   Biological processTranscription
Transcription regulation
Wnt signaling pathway
   Cellular componentNucleus
   Coding sequence diversityAlternative splicing
   Molecular functionActivator
   PTMAcetylation
Phosphoprotein
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Biological_processWnt receptor signaling pathway

Inferred from electronic annotation. Source: UniProtKB-KW

endodermal cell fate commitment

Inferred from mutant phenotype Ref.7. Source: UniProtKB

histone H2B ubiquitination

Inferred from electronic annotation. Source: Compara

histone monoubiquitination

Inferred from electronic annotation. Source: Compara

mRNA polyadenylation

Inferred from sequence or structural similarity. Source: UniProtKB

negative regulation of myeloid cell differentiation

Inferred from sequence or structural similarity. Source: UniProtKB

positive regulation of mRNA 3'-end processing

Inferred from sequence or structural similarity. Source: UniProtKB

positive regulation of transcription elongation from RNA polymerase II promoter

Inferred from sequence or structural similarity. Source: UniProtKB

positive regulation of transcription from RNA polymerase II promoter

Inferred from sequence or structural similarity. Source: UniProtKB

stem cell maintenance

Inferred from mutant phenotype Ref.7. Source: UniProtKB

transcription, DNA-dependent

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular_componentCdc73/Paf1 complex

Inferred from sequence or structural similarity. Source: UniProtKB

nucleolus

Inferred from electronic annotation. Source: Compara

Complete GO annotation...

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q5XJE5-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q5XJE5-2)

The sequence of this isoform differs from the canonical sequence as follows:
     308-324: DNNGTMDLFGGADDISS → GSPFTLYAGLLHSSLCL
     325-667: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Initiator methionine11Removed By similarity
Chain2 – 667666RNA polymerase-associated protein LEO1
PRO_0000247820

Regions

Compositional bias26 – 330305Asp-rich

Amino acid modifications

Modified residue21N-acetylalanine By similarity
Modified residue101Phosphoserine By similarity
Modified residue141Phosphoserine By similarity
Modified residue1511Phosphoserine By similarity
Modified residue1541Phosphoserine By similarity
Modified residue1621Phosphoserine By similarity
Modified residue1711Phosphoserine By similarity
Modified residue1791Phosphoserine By similarity
Modified residue1891Phosphothreonine By similarity
Modified residue1981Phosphoserine By similarity
Modified residue2061Phosphoserine By similarity
Modified residue2131Phosphoserine By similarity
Modified residue2211Phosphoserine By similarity
Modified residue2301Phosphoserine By similarity
Modified residue2391Phosphoserine By similarity
Modified residue2471Phosphoserine Ref.6
Modified residue2551Phosphoserine Ref.6
Modified residue2781Phosphoserine Ref.5
Modified residue2801Phosphoserine Ref.5
Modified residue2951Phosphoserine Ref.6
Modified residue2971Phosphoserine Ref.6
Modified residue3011Phosphoserine By similarity
Modified residue6081Phosphoserine Ref.5
Modified residue6091Phosphoserine Ref.5
Modified residue6111Phosphoserine Ref.5
Modified residue6151Phosphoserine Ref.5
Modified residue6311Phosphoserine Ref.3 Ref.6
Modified residue6591Phosphoserine Ref.4

Natural variations

Alternative sequence308 – 32417DNNGT…DDISS → GSPFTLYAGLLHSSLCL in isoform 2.
VSP_020054
Alternative sequence325 – 667343Missing in isoform 2.
VSP_020055

Experimental info

Sequence conflict5001A → D in AAH83358. Ref.2

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified July 27, 2011. Version 2.
Checksum: 73A9E04DB6ACFFDB

FASTA66775,597
        10         20         30         40         50         60 
MADMEDLFGS EAESEAERKD SESESDSDSD QDNGASGSNA SGSESDQDDR GDSGQPSNKE 

        70         80         90        100        110        120 
LFGDDSEEEG ASHHSGSDNH SERSDNRSEA SERSDHEDNE PSDEDQHSGS EAHNDDDDEG 

       130        140        150        160        170        180 
HRSDEGSRHS EAEGSEKAQS DDEKWDGEDK SDQSDDEKLQ NSDDEDREQG SDEDKLQNSD 

       190        200        210        220        230        240 
DDEEKMQNTD DEDRAQISDD DRQQLSEEEK GNSDDEHPVA SDNDEEKQNS DDEDQPQVSD 

       250        260        270        280        290        300 
EEKMQNSDDE RPQVSDEDGR RSDGEEEQDQ KSESARGSDS EDEVLRLKRK NAIPSDSEAD 

       310        320        330        340        350        360 
SDTEVPKDNN GTMDLFGGAD DISSGSDGED KPPTPGQPVD ENGLPQDQQE EEPIPETRIE 

       370        380        390        400        410        420 
VEIPKVNTDL GNDLYFVKLP NFLSVEPRPF DPQYYEDEFE DEEMLDEEGR TRLKLKVENT 

       430        440        450        460        470        480 
IRWRIRRDEE GNEIKESNAR IVKWSDGSMS LHLGNEVFDV YKAPLQGDHN HLFIRQGTGL 

       490        500        510        520        530        540 
QGQAVFKTKL TFRPHSTDSA THRKMTLSLA DRCSKTQKIR ILPMAGRDPE CQRTEMIKKE 

       550        560        570        580        590        600 
EERLRASIRR ESQQRRMREK QHQRGLSASY LEPDRYDEEE EGEESVSLAA IKNRYKGGIR 

       610        620        630        640        650        660 
EERARIYSSD SDEGSEEDKA QRLLKAKKLN SDEEGESSGK RKAEDDDKAN KKHKKYVISD 


EEEEEDD 

« Hide

Isoform 2 [UniParc].

Checksum: DCDE6326BA1C6305
Show »

FASTA32435,869

References

« Hide 'large scale' references
[1]"Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S. expand/collapse author list , Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[2]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
Strain: C57BL/6.
Tissue: Brain.
[3]"Comprehensive identification of phosphorylation sites in postsynaptic density preparations."
Trinidad J.C., Specht C.G., Thalhammer A., Schoepfer R., Burlingame A.L.
Mol. Cell. Proteomics 5:914-922(2006) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-631, MASS SPECTROMETRY.
Tissue: Brain.
[4]"Protein phosphorylation and expression profiling by Yin-yang multidimensional liquid chromatography (Yin-yang MDLC) mass spectrometry."
Dai J., Jin W.-H., Sheng Q.-H., Shieh C.-H., Wu J.-R., Zeng R.
J. Proteome Res. 6:250-262(2007) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-659, MASS SPECTROMETRY.
Tissue: Liver.
[5]"A differential phosphoproteomic analysis of retinoic acid-treated P19 cells."
Smith J.C., Duchesne M.A., Tozzi P., Ethier M., Figeys D.
J. Proteome Res. 6:3174-3186(2007) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-278; SER-280; SER-608; SER-609; SER-611 AND SER-615, MASS SPECTROMETRY.
Tissue: Teratocarcinoma.
[6]"Solid tumor proteome and phosphoproteome analysis by high resolution mass spectrometry."
Zanivan S., Gnad F., Wickstroem S.A., Geiger T., Macek B., Cox J., Faessler R., Mann M.
J. Proteome Res. 7:5314-5326(2008) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-247; SER-255; SER-295; SER-297 AND SER-631, MASS SPECTROMETRY.
Tissue: Melanoma.
[7]"A genome-scale RNAi screen for Oct4 modulators defines a role of the Paf1 complex for embryonic stem cell identity."
Ding L., Paszkowski-Rogacz M., Nitzsche A., Slabicki M.M., Heninger A.K., de Vries I., Kittler R., Junqueira M., Shevchenko A., Schulz H., Hubner N., Doss M.X., Sachinidis A., Hescheler J., Iacone R., Anastassiadis K., Stewart A.F., Pisabarro M.T. expand/collapse author list , Caldarelli A., Poser I., Theis M., Buchholz F.
Cell Stem Cell 4:403-415(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: FUNCTION.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AC115880 Genomic DNA. No translation available.
BC082540 mRNA. Translation: AAH82540.1.
BC083358 mRNA. Translation: AAH83358.1.
IPIIPI00474486.
IPI00776281.
RefSeqNP_001034611.1. NM_001039522.1.
UniGeneMm.41508.

3D structure databases

ProteinModelPortalQ5XJE5.
ModBaseSearch...

Protein-protein interaction databases

IntActQ5XJE5. 3 interactions.
STRING10090.ENSMUSP00000046905.

PTM databases

PhosphoSiteQ5XJE5.

Proteomic databases

PaxDbQ5XJE5.
PRIDEQ5XJE5.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENSMUST00000048937; ENSMUSP00000046905; ENSMUSG00000042487.
GeneID235497.
KEGGmmu:235497.
UCSCuc009qsh.1. mouse.

Organism-specific databases

CTD123169.
MGIMGI:2685031. Leo1.

Phylogenomic databases

eggNOGNOG130430.
GeneTreeENSGT00550000074952.
HOGENOMHOG000253934.
HOVERGENHBG081913.
InParanoidQ5XJE5.
KOK15177.
OMANNGTMDL.
OrthoDBEOG4HQDJC.

Gene expression databases

BgeeQ5XJE5.
GenevestigatorQ5XJE5.
GermOnlineENSMUSG00000042487. Mus musculus.

Family and domain databases

InterProIPR007149. Leo1.
[Graphical view]
PfamPF04004. Leo1. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio382712.
SOURCESearch...

Entry information

Entry nameLEO1_MOUSE
AccessionPrimary (citable) accession number: Q5XJE5
Secondary accession number(s): E9QPK8, Q640R1
Entry history
Integrated into UniProtKB/Swiss-Prot: July 25, 2006
Last sequence update: July 27, 2011
Last modified: May 1, 2013
This is version 79 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families