Protein

THO complex subunit 5 homolog

Gene

Thoc5

Organism
Mus musculus (Mouse)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Acts as a component of the THO subcomplex of the TREX complex, which is thought to couple mRNA transcription, processing and nuclear export, and which specifically associates with spliced mRNA and not with unspliced pre-mRNA. TREX is recruited to spliced mRNAs by a transcription-independent mechanism, binds to mRNA upstream of the exon-junction complex (EJC) and is recruited in a splicing- and cap-dependent manner to a region near the 5' end of the mRNA, where it functions in mRNA export to the cytoplasm via the TAP/NXF1 pathway. THOC5 in conjunction with ALYREF/THOC4 functions in NXF1-NXT1 mediated nuclear export of HSP70 mRNA; both proteins enhance the RNA binding activity of NXF1 and are required for NXF1 localization to the nuclear rim. Involved in transcription elongation and genome stability. Involved in alternative polyadenylation site choice by recruiting CPSF6 to the 5' region of target genes; probably mediates association of the TREX and CFIm complexes.
Regulates the expression of myeloid transcription factors CEBPA, CEBPB and GAB2 by enhancing the levels of phosphatidylinositol 3,4,5-trisphosphate. May be involved in the differentiation of granulocytes and adipocytes. Essential for hematopoietic primitive cell survival and plays an integral role in monocytic development.

GO - Molecular function

  • mRNA binding Source: MGI

GO - Biological process

  • blastocyst development Source: MGI
  • cell morphogenesis Source: MGI
  • monocyte differentiation Source: UniProtKB
  • mRNA export from nucleus Source: MGI
  • mRNA processing Source: UniProtKB-KW
  • negative regulation of DNA damage checkpoint Source: UniProtKB
  • negative regulation of macrophage differentiation Source: MGI
  • positive regulation of DNA-templated transcription, elongation Source: UniProtKB
  • primitive hemopoiesis Source: UniProtKB
  • regulation of gene expression Source: MGI
  • regulation of mRNA export from nucleus Source: MGI
  • regulation of stem cell division Source: MGI
  • RNA splicing Source: UniProtKB-KW
  • stem cell division Source: MGI
  • viral mRNA export from host cell nucleus Source: MGI

Keywords - Biological process

Differentiation, mRNA processing, mRNA splicing, mRNA transport, Transport

Keywords - Ligand

RNA-binding

Names & Taxonomy

Protein names
Recommended name: THO complex subunit 5 homolog
Alternative name(s): Fms-interacting protein
Short name: FMIP (1 Publication)
Gene names
Name: Thoc5
Synonyms: Fmip, Kiaa0983
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Unplaced

Organism-specific databases

MGI: MGI:1351333. Thoc5.

Subcellular location

  • Nucleus (1 Publication)
  • Cytoplasm (1 Publication)

  • Note: Shuttles between nucleus and cytoplasm. By similarity

GO - Cellular component

  • cytoplasm Source: UniProtKB-SubCell
  • nuclear chromosome Source: GO_Central
  • nucleoplasm Source: MGI
  • nucleus Source: CACAO
  • THO complex Source: MGI
  • THO complex part of transcription export complex Source: MGI
  • transcription export complex Source: MGI

Keywords - Cellular component

Cytoplasm, Nucleus

Pathology & Biotech

Disruption phenotype

Embryonic lethality seen before day 5.5 of embryonic development (E5.5). 1 Publication

Mutagenesis

Feature key   Position(s)   Length   Description                                 Evidence
Mutagenesis   5 – 6         2        SS → AA: Enhances nuclear localization.     1 Publication
Mutagenesis   5 – 6         2        SS → EE: Abolishes nuclear localization.    1 Publication
Mutagenesis   8 – 9         2        KR → TG: Abolishes nuclear localization.    1 Publication
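
The substitutions in the mutagenesis table can be reproduced on the first ten residues of the sequence given in this entry (MSSESSKKRK). The following is only an illustrative sketch: `apply_substitution` is a hypothetical helper written for this example, not part of any UniProt tool, and it assumes the 1-based, inclusive positions used in the feature tables.

```python
def apply_substitution(seq: str, start: int, end: int, original: str, variant: str) -> str:
    """Replace residues start..end (1-based, inclusive) after checking the wild type."""
    found = seq[start - 1:end]
    if found != original:
        raise ValueError(f"expected {original!r} at {start}-{end}, found {found!r}")
    return seq[:start - 1] + variant + seq[end:]


# First ten residues of the sequence listed in this entry.
N_TERMINUS = "MSSESSKKRK"

# The three variants from the mutagenesis table; the dictionary keys are
# informal labels chosen for this example only.
MUTANTS = {
    "SS5-6AA": apply_substitution(N_TERMINUS, 5, 6, "SS", "AA"),
    "SS5-6EE": apply_substitution(N_TERMINUS, 5, 6, "SS", "EE"),
    "KR8-9TG": apply_substitution(N_TERMINUS, 8, 9, "KR", "TG"),
}
```

Checking the wild-type residues before substituting guards against off-by-one errors when positions are copied from a table.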

PTM / Processing

Molecule processing

Feature key            Position(s)   Length   Description                     Feature identifier
Initiator methionine                          Removed (By similarity)
Chain                  2 – 683       682      THO complex subunit 5 homolog   PRO_0000310555

Amino acid modifications

Feature key        Position(s)   Length   Description        Evidence
Modified residue   2 – 2         1        N-acetylserine     By similarity
Modified residue   5 – 5         1        Phosphoserine      1 Publication
Modified residue   6 – 6         1        Phosphoserine      1 Publication
Modified residue   225 – 225     1        Phosphotyrosine    1 Publication
Modified residue   307 – 307     1        Phosphoserine      Combined sources
Modified residue   312 – 312     1        Phosphoserine      Combined sources
Modified residue   314 – 314     1        Phosphoserine      Combined sources
Modified residue   328 – 328     1        Phosphothreonine   Combined sources, 1 Publication

Post-translational modification

Phosphorylated on tyrosine upon binding to activated CSF1R, which causes dissociation of the two proteins. Phosphorylation on Ser-5 and/or Ser-6 is required for nuclear export. Phosphorylated on Thr-328 in insulin-stimulated adipocytes. 4 Publications

Keywords - PTM

Acetylation, Phosphoprotein

Proteomic databases

EPD: Q8BKT7.
MaxQB: Q8BKT7.
PaxDb: Q8BKT7.
PeptideAtlas: Q8BKT7.
PRIDE: Q8BKT7.

PTM databases

iPTMnet: Q8BKT7.
PhosphoSite: Q8BKT7.

Expression

Tissue specificity

Ubiquitously expressed, with highest levels in testis, liver and heart. 2 Publications

Induction

Up-regulated following CSF1 stimulation. 1 Publication

Gene expression databases

Bgee: ENSMUSG00000034274.
CleanEx: MM_THOC5.

Interaction

Subunit structure

Interacts with phosphorylated CSF1R (PubMed:10597251). Component of the THO complex, which is composed of THOC1, THOC2, THOC3, THOC5, THOC6 and THOC7; together with at least ALYREF/THOC4, DDX39B, SARNP/CIP29 and CHTOP, THO forms the transcription/export (TREX) complex, which seems to have a dynamic structure involving ATP-dependent remodeling. Interacts with ALYREF/THOC4 and THOC7. Interacts (via N-terminus) with the NTF2 domain of NXF1 (By similarity). Forms a complex with CEBPB (PubMed:19015024). Interacts with CPSF6; indicative of an association with the cleavage factor Im (CFIm) complex (By similarity). Interacts with THOC1 (PubMed:16909111). Interacts with LUZP4 (By similarity). Interacts with NCBP3 (By similarity). By similarity, 2 Publications

Protein-protein interaction databases

BioGrid: 223611. 4 interactions.
IntAct: Q8BKT7. 1 interaction.
MINT: MINT-4111331.
STRING: 10090.ENSMUSP00000045580.

Family & Domains

Region

Feature key   Position(s)   Length   Description              Evidence
Region        2 – 199       198      Interaction with THOC7   By similarity
Region        2 – 144       143      Interaction with CSF1R   1 Publication

Motif

Feature key   Position(s)   Length   Description
Motif         7 – 10        4        Nuclear localization signal

Sequence similarities

Belongs to the THOC5 family. Curated

Phylogenomic databases

eggNOG: KOG2216. Eukaryota.
        ENOG410YCE5. LUCA.
HOGENOM: HOG000007514.
HOVERGEN: HBG051271.
InParanoid: Q8BKT7.
KO: K13174.
PhylomeDB: Q8BKT7.
TreeFam: TF314812.

Family and domain databases

InterPro: IPR019163. THO_Thoc5.
Pfam: PF09766. FimP. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q8BKT7-1

        10         20         30         40         50
MSSESSKKRK PKVIRSDGTP TEGKRNRSDT EQEGKYYSEE AEVDLRDPGR
        60         70         80         90        100
DYELYKYTCQ ELQRLMAEIQ DLKSKGSKDV AIEIEERRIQ SCVHFMTLKK
       110        120        130        140        150
LNRLAHIRLK KGRDQTHEAK QKVDAYHLQL QNLLYEVMHL QKEITKCLEF
       160        170        180        190        200
KSKHEEIDLV SLEEFYSEAP PSISKAEITM GDPHQQTLAR LDWELEQRKR
       210        220        230        240        250
LAEKYRECLS NKEKILKEIE VKRDYLSSLQ PRLNSIMQAS LPVQEYLFMP
       260        270        280        290        300
FDQAHKQYET ARHLPPPLYV LFVQATAYGQ ACDKTLSVAI EGSVDEAKAL
       310        320        330        340        350
FKPPEDSQDD ESDSDAEEEQ TTKRRRPTLG VQLDDKRKEM LKRHPLSVLL
       360        370        380        390        400
DLKCKDNSVL HLTFYYLMNL NIMTVKAKVT TAVELITPIS AGDLLSPDSV
       410        420        430        440        450
LSCLYPGDHG KKTPNPANQY QFDKVGILTL RDYVLELGHP YLWVQKLGGL
       460        470        480        490        500
HFPKEQPQQT VMPDHSQSAS HMETTMKLLK TRVQSRLALH KQFASLEHGI
       510        520        530        540        550
VPVTSDCQDL FPAKVVSRLV KWVIITHEDY MELHFTKDIV EAGLAGDTNL
       560        570        580        590        600
YYLALIERGT AKLQAAVVLN PGYSSIPPVF RLCLNWKGEK TNSNDDNIRA
       610        620        630        640        650
MESEVNVCYK ELCGPRPSHQ LLTNQLQRLC VLLDVYLETE SHDDSFEGPK
       660        670        680
EFPQEKMCLR LFRGPSRMKP FKYNHPQGFF SHR
Length: 683
Mass (Da): 78,686
Last modified: November 13, 2007 - v2
Checksum: 08B795B688AE09CA

Experimental Info

Feature key         Position(s)   Length   Description                                  Evidence
Sequence conflict   80 – 80       1        V → L in BAC34398 (PubMed:16141072).         Curated
Sequence conflict   526 – 526     1        T → A in AAH39758 (PubMed:15489334).         Curated
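
The feature tables in this entry use 1-based, inclusive positions, so a feature spanning `start – end` has length `end - start + 1`, and each annotated residue can be checked against the displayed sequence. The sketch below is a minimal consistency check under that assumption; the helper names (`residues`, `feature_length`, `mismatches`) and the labels in `FEATURES` are made up for this example and are not UniProt identifiers.

```python
# Sequence copied from this entry; whitespace between 10-residue groups is removed.
SEQUENCE = "".join("""
MSSESSKKRK PKVIRSDGTP TEGKRNRSDT EQEGKYYSEE AEVDLRDPGR
DYELYKYTCQ ELQRLMAEIQ DLKSKGSKDV AIEIEERRIQ SCVHFMTLKK
LNRLAHIRLK KGRDQTHEAK QKVDAYHLQL QNLLYEVMHL QKEITKCLEF
KSKHEEIDLV SLEEFYSEAP PSISKAEITM GDPHQQTLAR LDWELEQRKR
LAEKYRECLS NKEKILKEIE VKRDYLSSLQ PRLNSIMQAS LPVQEYLFMP
FDQAHKQYET ARHLPPPLYV LFVQATAYGQ ACDKTLSVAI EGSVDEAKAL
FKPPEDSQDD ESDSDAEEEQ TTKRRRPTLG VQLDDKRKEM LKRHPLSVLL
DLKCKDNSVL HLTFYYLMNL NIMTVKAKVT TAVELITPIS AGDLLSPDSV
LSCLYPGDHG KKTPNPANQY QFDKVGILTL RDYVLELGHP YLWVQKLGGL
HFPKEQPQQT VMPDHSQSAS HMETTMKLLK TRVQSRLALH KQFASLEHGI
VPVTSDCQDL FPAKVVSRLV KWVIITHEDY MELHFTKDIV EAGLAGDTNL
YYLALIERGT AKLQAAVVLN PGYSSIPPVF RLCLNWKGEK TNSNDDNIRA
MESEVNVCYK ELCGPRPSHQ LLTNQLQRLC VLLDVYLETE SHDDSFEGPK
EFPQEKMCLR LFRGPSRMKP FKYNHPQGFF SHR
""".split())

# (start, end, expected residues), 1-based inclusive, taken from the feature tables.
FEATURES = {
    "N-acetylserine 2": (2, 2, "S"),
    "Phosphoserine 5": (5, 5, "S"),
    "Phosphoserine 6": (6, 6, "S"),
    "Phosphotyrosine 225": (225, 225, "Y"),
    "Phosphoserine 307": (307, 307, "S"),
    "Phosphoserine 312": (312, 312, "S"),
    "Phosphoserine 314": (314, 314, "S"),
    "Phosphothreonine 328": (328, 328, "T"),
    "Nuclear localization signal 7-10": (7, 10, "KKRK"),
    "Sequence conflict 80 (V -> L)": (80, 80, "V"),
    "Sequence conflict 526 (T -> A)": (526, 526, "T"),
}


def residues(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


def feature_length(start: int, end: int) -> int:
    """Length of a feature whose positions are both inclusive."""
    return end - start + 1


def mismatches(seq: str, features: dict) -> list:
    """Names of features whose expected residues differ from the sequence."""
    return [
        name
        for name, (start, end, expected) in features.items()
        if residues(seq, start, end) != expected
    ]
```

For example, `feature_length(2, 683)` gives the chain length of 682 listed under Molecule processing, and `mismatches(SEQUENCE, FEATURES)` returns an empty list when every annotated residue agrees with the displayed sequence.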

Sequence databases

EMBL / GenBank / DDBJ:
AK050734 mRNA. Translation: BAC34398.1.
BC039758 mRNA. Translation: AAH39758.1.
AK173078 mRNA. Translation: BAD32356.1.
CCDS: CCDS24393.1.
RefSeq: NP_766026.1. NM_172438.3.
UniGene: Mm.28969.
         Mm.446885.
Genome annotation databases

GeneID: 107829.
KEGG: mmu:107829.
UCSC: uc007hvl.1. mouse.

Cross-references

Organism-specific databases

CTD: 8563.

Miscellaneous databases

PRO: Q8BKT7.
Entry information

Entry name: THOC5_MOUSE
Accession: Primary (citable) accession number: Q8BKT7
Secondary accession number(s): Q69ZU0, Q8CHR3
Entry history
Integrated into UniProtKB/Swiss-Prot: November 13, 2007
Last sequence update: November 13, 2007
Last modified: September 7, 2016
This is version 92 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.