Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein
Submitted name:

THO Complex (Transcription factor/nuclear export) subunit

Gene

thoc-3

Organism
Caenorhabditis elegans
Status
Unreviewed-Annotation score: Annotation score: 1 out of 5-Experimental evidence at protein leveli

Functioni

GO - Biological processi

Enzyme and pathway databases

ReactomeiR-CEL-109688. Cleavage of Growing Transcript in the Termination Region.
R-CEL-159236. Transport of Mature mRNA derived from an Intron-Containing Transcript.
R-CEL-72187. mRNA 3'-end processing.
SignaLinkiP91867.

Names & Taxonomyi

Protein namesi
Submitted name:
THO Complex (Transcription factor/nuclear export) subunitImported
Gene namesi
Name:thoc-3Imported
ORF Names:CELE_F32H2.4Imported, F32H2.4Imported
OrganismiCaenorhabditis elegansImported
Taxonomic identifieri6239 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis
Proteomesi
  • UP000001940 Componenti: Chromosome I

Organism-specific databases

WormBaseiF32H2.4; CE42808; WBGene00009341; thoc-3.

Subcellular locationi

GO - Cellular componenti

  • THO complex Source: WormBase
  • THO complex part of transcription export complex Source: GO_Central

PTM / Processingi

Proteomic databases

EPDiP91867.
PaxDbiP91867.
PeptideAtlasiP91867.

Expressioni

Gene expression databases

BgeeiWBGene00009341.

Interactioni

Protein-protein interaction databases

STRINGi6239.F32H2.4.

Structurei

3D structure databases

ProteinModelPortaliP91867.
SMRiP91867.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini32 – 288WD_REPEATS_REGIONInterPro annotationAdd BLAST257
Repeati76 – 118WDPROSITE-ProRule annotationAdd BLAST43
Repeati205 – 239WDPROSITE-ProRule annotationAdd BLAST35
Repeati247 – 277WDPROSITE-ProRule annotationAdd BLAST31

Keywords - Domaini

WD repeatPROSITE-ProRule annotation

Phylogenomic databases

eggNOGiKOG1407. Eukaryota.
ENOG410XPZJ. LUCA.
GeneTreeiENSGT00880000137922.
HOGENOMiHOG000115395.
InParanoidiP91867.
KOiK12880.
OMAiLWDTTDW.
OrthoDBiEOG091G0BFM.
PhylomeDBiP91867.

Family and domain databases

Gene3Di2.130.10.10. 1 hit.
InterProiView protein in InterPro
IPR015943. WD40/YVTN_repeat-like_dom.
IPR001680. WD40_repeat.
IPR017986. WD40_repeat_dom.
PfamiView protein in Pfam
PF00400. WD40. 3 hits.
SMARTiView protein in SMART
SM00320. WD40. 5 hits.
SUPFAMiSSF50978. SSF50978. 1 hit.
PROSITEiView protein in PROSITE
PS50082. WD_REPEATS_2. 3 hits.
PS50294. WD_REPEATS_REGION. 1 hit.

Sequencei

Sequence statusi: Complete.

P91867-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MTVSSRERRC ARLLTAADAS TYFEKYKRIR STDMKVQQCQ SIAFNCDGTK
60 70 80 90 100
LVCGAFDKKV SVANVDGGRL RFSWVGSSHS SSVEQVACSE KQPNLFASAS
110 120 130 140 150
ADRNICVWDI RQSKPTHRIS NRVGNFFISW SPCDEYFIFL DKDNRINTVD
160 170 180 190 200
IRNYQVVNSY EMKTFSHELT FHPLSNHVFV AESGGKVEIL KFAGGALEPV
210 220 230 240 250
TSIQAHSHQV ECLAVSISKD GRKLAVGASD ASCSLWDLEE LICERVIPRH
260 270 280 290 300
DYGIRAVSFS CNGQLLASGS EDHSIDIAYV PDGSRCHEIK HTGETYSVAW
310 320 330
HPNSLLLAYT ASDSMDNREA AHVKTFGHST V
Length:331
Mass (Da):36,773
Last modified:September 2, 2008 - v3
Checksum:i477F3082476CFD6C
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
BX284601 Genomic DNA. Translation: CAB04240.3.
RefSeqiNP_492416.2. NM_060015.3.
UniGeneiCel.18802.

Genome annotation databases

EnsemblMetazoaiF32H2.4; F32H2.4; WBGene00009341.
GeneIDi172713.
KEGGicel:CELE_F32H2.4.
UCSCiF32H2.4. c. elegans.

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.

Entry informationi

Entry nameiP91867_CAEEL
AccessioniPrimary (citable) accession number: P91867
Entry historyiIntegrated into UniProtKB/TrEMBL: May 1, 1997
Last sequence update: September 2, 2008
Last modified: June 7, 2017
This is version 146 of the entry and version 3 of the sequence. See complete history.
Entry statusiUnreviewed (UniProtKB/TrEMBL)

Miscellaneousi

Keywords - Technical termi

Complete proteome, Proteomics identificationCombined sources, Reference proteomeImported