
Q91YR7 (PRP6_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 111.


Names and origin

Protein names
  Recommended name: Pre-mRNA-processing factor 6
  Alternative name(s):
    PRP6 homolog
    U5 snRNP-associated 102 kDa protein (Short name: U5-102 kDa protein)
Gene names
  Name: Prpf6
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 941 AA.
Sequence status: Complete.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

Involved in pre-mRNA splicing as a component of the U4/U6-U5 tri-snRNP complex, one of the building blocks of the spliceosome (By similarity). Enhances dihydrotestosterone-induced transactivation activity of AR, as well as dexamethasone-induced transactivation activity of NR3C1, but does not affect estrogen-induced transactivation (By similarity).

Subunit structure

Associates with the U5 snRNP particle. Component of the U4/U6-U5 tri-snRNP complex composed of the U4, U6 and U5 snRNAs and at least PRPF3, PRPF4, PRPF6, PRPF8, PRPF31, SNRNP200, TXNL4A, SNRNP40, DDX23, CD2BP2, PPIH, NHP2L1, EFTUD2, SART1 and USP39, LSm proteins LSm2-8 and Sm proteins. Interacts with ARAF1. Identified in the spliceosome C complex (By similarity). Interacts with AR and NR3C1, but not ESR1, independently of the presence of hormones (By similarity).

Subcellular location

Nucleus, nucleoplasm (By similarity). Nucleus speckle (By similarity). Note: Localized in splicing speckles (By similarity).

Sequence similarities

Contains 9 HAT repeats.

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q91YR7-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q91YR7-2)

The sequence of this isoform differs from the canonical sequence as follows:
     550-570: CVAHNALECARAIYAYALQVF → VSFLACFPACSLDRNSGPINL
     571-941: Missing.
Note: No experimental confirmation available.
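
The isoform 2 description above can be read as two edits against the canonical sequence, using 1-based inclusive positions. The following is a minimal sketch of that reading, not UniProt's own tooling: the function name `apply_isoform_edits` and the placeholder sequence are assumptions made for illustration, and only residues 550-570 of the placeholder are real.

```python
def apply_isoform_edits(canonical: str, edits: list[tuple[int, int, str]]) -> str:
    """Apply (start, end, replacement) edits to a canonical sequence.

    Coordinates are 1-based and inclusive, as in the entry.
    An empty replacement means the range is missing from the isoform.
    """
    result = canonical
    # Work from the C-terminus backwards so earlier coordinates stay valid.
    for start, end, replacement in sorted(edits, reverse=True):
        result = result[:start - 1] + replacement + result[end:]
    return result


# Hypothetical stand-in for the 941-residue canonical sequence:
# only positions 550-570 are real, everything else is 'X' padding.
CANONICAL_PLACEHOLDER = "X" * 549 + "CVAHNALECARAIYAYALQVF" + "X" * 371

ISOFORM_2_EDITS = [
    (550, 570, "VSFLACFPACSLDRNSGPINL"),  # replaced segment
    (571, 941, ""),                        # missing in isoform 2
]

isoform_2 = apply_isoform_edits(CANONICAL_PLACEHOLDER, ISOFORM_2_EDITS)
print(len(isoform_2))   # 570
print(isoform_2[-21:])  # VSFLACFPACSLDRNSGPINL
```

The resulting length of 570 agrees with the length listed for isoform 2 in the Sequences section. Substituting the real isoform 1 sequence for the placeholder should give the isoform 2 sequence under the same assumptions.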

Sequence annotation (Features)

Feature key           Position(s)  Length  Description                                           Feature identifier

Molecule processing

Chain                 1 – 941      941     Pre-mRNA-processing factor 6                          PRO_0000205760

Regions

Repeat                384 – 416    33      HAT 1
Repeat                418 – 444    27      HAT 2
Repeat                445 – 476    32      HAT 3
Repeat                554 – 586    33      HAT 4
Repeat                588 – 620    33      HAT 5
Repeat                622 – 654    33      HAT 6
Repeat                689 – 721    33      HAT 7
Repeat                723 – 755    33      HAT 8
Repeat                855 – 887    33      HAT 9
Compositional bias    67 – 72      6       Poly-Asp

Amino acid modifications

Modified residue      266          1       Phosphothreonine (By similarity)
Modified residue      275          1       Phosphothreonine (By similarity)
Modified residue      279          1       Phosphoserine (By similarity)

Natural variations

Alternative sequence  550 – 570    21      CVAHN…ALQVF → VSFLACFPACSLDRNSGPINL in isoform 2.     VSP_002063
Alternative sequence  571 – 941    371     Missing in isoform 2.                                 VSP_002064

Experimental info

Sequence conflict     232          1       T → A in BAE26451. Ref.1
Sequence conflict     390          1       K → R in BAE26451. Ref.1
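
The Length column in the feature table follows from the positions as end - start + 1, since both ends are inclusive. As a rough consistency check, the sketch below recomputes each length from the ranged features listed above and sums the residues covered by the nine HAT repeats. The tuple layout and the helper name `span_length` are assumptions made for this example only.

```python
# (feature key, start, end, stated length, description), copied from the feature table.
FEATURES = [
    ("Chain", 1, 941, 941, "Pre-mRNA-processing factor 6"),
    ("Repeat", 384, 416, 33, "HAT 1"),
    ("Repeat", 418, 444, 27, "HAT 2"),
    ("Repeat", 445, 476, 32, "HAT 3"),
    ("Repeat", 554, 586, 33, "HAT 4"),
    ("Repeat", 588, 620, 33, "HAT 5"),
    ("Repeat", 622, 654, 33, "HAT 6"),
    ("Repeat", 689, 721, 33, "HAT 7"),
    ("Repeat", 723, 755, 33, "HAT 8"),
    ("Repeat", 855, 887, 33, "HAT 9"),
    ("Compositional bias", 67, 72, 6, "Poly-Asp"),
    ("Alternative sequence", 550, 570, 21, "Replaced in isoform 2"),
    ("Alternative sequence", 571, 941, 371, "Missing in isoform 2"),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive position range."""
    return end - start + 1


# Features whose stated length disagrees with their positions (expected: none).
mismatches = [
    (key, start, end)
    for key, start, end, stated, _ in FEATURES
    if span_length(start, end) != stated
]

# Total number of residues covered by the HAT repeats.
hat_total = sum(
    span_length(start, end)
    for _, start, end, _, desc in FEATURES
    if desc.startswith("HAT")
)

print(mismatches)  # []
print(hat_total)   # 290
```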

Sequences

Isoform 1 [UniParc].

Last modified December 1, 2001. Version 1.
Checksum: A08C3AC49AE1B9E9

Length: 941 AA. Mass: 106,722 Da.
        10         20         30         40         50         60 
MNKKKKPFLG MPAPLGYVPG LGRGATGFTT RSDIGPARDA NDPVDDRHAP PGKRTVGDQM 

        70         80         90        100        110        120 
KKNQAADDDD EDLNDTNYDE FNGYAGSLFS SGPYEKDDEE ADAIYAALDK RMDERRKERR 

       130        140        150        160        170        180 
EQREKEEIEK YRMERPKIQQ QFSDLKRKLA EVTEEEWLSI PEVGDARNKR QRNPRYEKLT 

       190        200        210        220        230        240 
PVPDSFFAKH LQTGENHTSV DPRQTQFGGL NTPYPGGLNT PYPGGMTPGL MTPGTGELDM 

       250        260        270        280        290        300 
RKIGQARNTL MDMRLSQVSD SVSGQTVVDP KGYLTDLNSM IPTHGGDIND IKKARLLLKS 

       310        320        330        340        350        360 
VRETNPHHPP AWIASARLEE VTGKLQVARN LIMKGTEMCP KSEDVWLEAA RLQPGDTAKA 

       370        380        390        400        410        420 
VVAQAVRHLP QSVRIYIRAA ELETDIRAKK RVLRKALEHV PNSVRLWKAA VELEEPEDAR 

       430        440        450        460        470        480 
IMLSRAVECC PTSVELWLAL ARLETYENAR KVLNKARENI PTDRHIWITA AKLEEANGNT 

       490        500        510        520        530        540 
QMVEKIIDRA ITSLRANGVE INREQWIQDA EECDRAGSVA TCQAVMRAVI GIGIEEEDRK 

       550        560        570        580        590        600 
HTWMEDADSC VAHNALECAR AIYAYALQVF PSKKSVWLRA AYFEKNHGTR ESLEALLQRA 

       610        620        630        640        650        660 
VAHCPKAEVL WLMGAKSKWL AGDVPAARSI LALAFQANPN SEEIWLAAVK LESENNEYER 

       670        680        690        700        710        720 
ARRLLAKARS SAPTARVFMK SVKLEWVLGN ISAAQELCEE ALRHYEDFPK LWMMKGQIEE 

       730        740        750        760        770        780 
QGELMEKARE AYNQGLKKCP HSTPLWLLLS RLEEKIGQLT RARAILEKSR LKNPKNPGLW 

       790        800        810        820        830        840 
LESVRLEYRA GLKNIANTLM AKALQECPNS GILWSEAVFL EARPQRKTKS VDALKKCEHD 

       850        860        870        880        890        900 
PHVLLAVAKL FWSERKITKA REWFHRTVKI DSDLGDAWAF FYKFELQHGT EEQQEEVRKR 

       910        920        930        940 
CENAEPRHGE LWCAVSKDIT NWQRKIGEIL VLVAARIKNT F 
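
The sequence above is displayed in groups of ten residues under ruler lines of position numbers. To work with it programmatically, the rulers and spaces have to be stripped first. The following is a minimal sketch of one way to do that; the function name `parse_sequence_block` is an assumption for this example, and the sample holds only the first 120 residues of isoform 1.

```python
def parse_sequence_block(text: str) -> str:
    """Join a ruler-annotated sequence display into one residue string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines that contain only position numbers.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


# First two display rows of isoform 1, copied from the entry.
SAMPLE = """
        10         20         30         40         50         60
MNKKKKPFLG MPAPLGYVPG LGRGATGFTT RSDIGPARDA NDPVDDRHAP PGKRTVGDQM

        70         80         90        100        110        120
KKNQAADDDD EDLNDTNYDE FNGYAGSLFS SGPYEKDDEE ADAIYAALDK RMDERRKERR
"""

sequence = parse_sequence_block(SAMPLE)
print(len(sequence))    # 120
# Positions are 1-based and inclusive, so 67-72 maps to the slice [66:72].
print(sequence[66:72])  # DDDDED
```

The slice for positions 67-72 returns the Asp-rich stretch annotated as Poly-Asp in the feature table, which is a quick way to confirm that the 1-based coordinates were converted correctly.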


Isoform 2 [UniParc].

Checksum: 16E5B26A91A0C60B

Length: 570 AA. Mass: 63,960 Da.

References

[1] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Strain: C57BL/6J.
Tissue: Embryo and Embryonic head.
[2] "Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[3] Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.
Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[4] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
Strain: FVB/N.
Tissue: Mammary gland.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  BC005801 mRNA. Translation: AAH05801.1.
  BC014869 mRNA. Translation: AAH14869.1.
  BC023691 mRNA. Translation: AAH23691.2.
  BC025030 mRNA. Translation: AAH25030.1.
  AK011639 mRNA. Translation: BAB27751.1.
  AK145461 mRNA. Translation: BAE26451.1.
  AK081998 mRNA. Translation: BAC38390.1.
  AL844529 Genomic DNA. Translation: CAX15563.1.
  CH466626 Genomic DNA. Translation: EDL07438.1.
CCDS: CCDS38381.1. [Q91YR7-1]
RefSeq: NP_598462.1. NM_133701.2. [Q91YR7-1]
UniGene: Mm.292001.

3D structure databases

ProteinModelPortal: Q91YR7.
SMR: Q91YR7. Positions 293-346, 444-478, 726-789.

Protein-protein interaction databases

BioGrid: 213097. 2 interactions.
IntAct: Q91YR7. 2 interactions.
MINT: MINT-4108694.

PTM databases

PhosphoSite: Q91YR7.

Proteomic databases

MaxQB: Q91YR7.
PaxDb: Q91YR7.
PRIDE: Q91YR7.

Genome annotation databases

Ensembl:
  ENSMUST00000002529; ENSMUSP00000002529; ENSMUSG00000002455. [Q91YR7-1]
  ENSMUST00000136481; ENSMUSP00000121340; ENSMUSG00000002455. [Q91YR7-1]
GeneID: 68879.
KEGG: mmu:68879.
UCSC:
  uc008omy.1. mouse. [Q91YR7-2]
  uc008omz.1. mouse. [Q91YR7-1]

Organism-specific databases

CTD: 24148.
MGI: MGI:1922946. Prpf6.

Phylogenomic databases

eggNOG: COG0457.
GeneTree: ENSGT00550000075016.
HOGENOM: HOG000116247.
HOVERGEN: HBG023330.
InParanoid: Q3ULJ7.
KO: K12855.
OMA: RQRFYAV.
OrthoDB: EOG7RV9FB.
PhylomeDB: Q91YR7.
TreeFam: TF105743.

Gene expression databases

ArrayExpress: Q91YR7.
Bgee: Q91YR7.
CleanEx: MM_PRPF6.
Genevestigator: Q91YR7.

Family and domain databases

Gene3D: 1.25.40.10. 4 hits.
InterPro:
  IPR003107. HAT.
  IPR010491. PRP1_N.
  IPR013026. TPR-contain_dom.
  IPR011990. TPR-like_helical.
  IPR019734. TPR_repeat.
Pfam:
  PF06424. PRP1_N. 1 hit.
  PF13181. TPR_8. 1 hit.
SMART: SM00386. HAT. 13 hits.

Other

ChiTaRS: PRPF6. mouse.
NextBio: 328105.
PRO: Q91YR7.

Entry information

Entry name: PRP6_MOUSE
Accession
  Primary (citable) accession number: Q91YR7
  Secondary accession number(s): Q3ULJ7, Q542P0, Q8CIK9, Q8R3M8, Q99JN1, Q9CSZ0
Entry history
  Integrated into UniProtKB/Swiss-Prot: October 19, 2002
  Last sequence update: December 1, 2001
  Last modified: July 9, 2014
  This is version 111 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments: Index of protein domains and families
MGD cross-references: Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot