
O94842 (TOX4_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 115.

Names and origin

Protein names
Recommended name: TOX high mobility group box family member 4
Alternative name(s): Epidermal Langerhans cell protein LCP1
Gene names
Name: TOX4
Synonyms: C14orf92, KIAA0737
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 621 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Component of the PTW/PP1 phosphatase complex, which plays a role in the control of chromatin structure and cell cycle progression during the transition from mitosis into interphase. Ref.7

Subunit structure

Component of the PTW/PP1 phosphatase complex, composed of PPP1R10/PNUTS, TOX4, WDR82 and PPP1CA or PPP1CB or PPP1CC. Interacts with PPP1R10/PNUTS. Ref.7

Subcellular location

Nucleus (Probable). Note: Associated with chromatin. Ref.7

Sequence similarities

Contains 1 HMG box DNA-binding domain.

Sequence caution

The sequence BAA34457.2 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened.

Ontologies

Keywords
   Cellular component: Nucleus
   Coding sequence diversity: Alternative splicing
   Ligand: DNA-binding
   PTM: Phosphoprotein
   Technical term: Complete proteome, Reference proteome
Gene Ontology (GO)
   Cellular component:
      PTW/PP1 phosphatase complex. Inferred from direct assay Ref.7. Source: UniProtKB
      chromatin. Inferred from direct assay Ref.7. Source: UniProtKB
      nucleus. Inferred from direct assay. Source: HPA
   Molecular function:
      DNA binding. Inferred from electronic annotation. Source: UniProtKB-KW
      protein binding. Inferred from physical interaction Ref.7. Source: UniProtKB

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: O94842-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: O94842-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-24: MEFPGGNDNYLTITGPSHPFLSGA → M
Note: No experimental confirmation available.
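
The isoform 2 rule above (residues 1-24 replaced by a single M) can be sketched in Python as follows. This is only an illustration: the helper name `apply_replacement` is hypothetical, and just the first 30 residues of the canonical sequence are used rather than the full 621.

```python
def apply_replacement(seq: str, start: int, end: int, replacement: str) -> str:
    """Replace residues start..end (1-based, inclusive) of seq with replacement."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("range lies outside the sequence")
    return seq[:start - 1] + replacement + seq[end:]


# First 30 residues of the canonical isoform (O94842-1), copied from this entry.
canonical_prefix = "MEFPGGNDNYLTITGPSHPFLSGAETFHTP"

# The segment being replaced should match the one quoted in the isoform rule.
assert canonical_prefix[:24] == "MEFPGGNDNYLTITGPSHPFLSGA"

isoform2_prefix = apply_replacement(canonical_prefix, 1, 24, "M")
print(isoform2_prefix)  # METFHTP

# Length bookkeeping: 24 residues are removed and 1 is added.
isoform2_length = 621 - 24 + 1
print(isoform2_length)  # 598
```

The computed length of 598 agrees with the length listed for isoform 2 in the Sequences section.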

Sequence annotation (Features)

Feature key           Position(s)  Length  Description                                  Feature identifier

Molecule processing

Chain                 1 – 621      621     TOX high mobility group box family member 4  PRO_0000048568

Regions

DNA binding           223 – 291    69      HMG box
Motif                 213 – 218    6       Nuclear localization signal (Potential)
Compositional bias    401 – 534    134     Gln/Pro-rich
Compositional bias    426 – 434    9       Poly-Ala

Amino acid modifications

Modified residue      178          1       Phosphoserine Ref.5
Modified residue      182          1       Phosphoserine Ref.5
Modified residue      313          1       Phosphothreonine Ref.8
Modified residue      315          1       Phosphoserine Ref.8

Natural variations

Alternative sequence  1 – 24       24      MEFPG…FLSGA → M in isoform 2.                VSP_053873
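
Feature positions in this entry are 1-based and inclusive, so a feature's length is end - start + 1. The short Python sketch below re-derives the Length column for the Regions features and shows the equivalent 0-based slice. The `Feature` class and its method names are assumptions made for illustration and are not part of any UniProt tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    key: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    description: str

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count.
        return self.end - self.start + 1

    def as_slice(self) -> slice:
        # Convert to a 0-based, end-exclusive Python slice.
        return slice(self.start - 1, self.end)


# Coordinates copied from the Regions rows of this entry.
REGIONS = [
    Feature("DNA binding", 223, 291, "HMG box"),
    Feature("Motif", 213, 218, "Nuclear localization signal"),
    Feature("Compositional bias", 401, 534, "Gln/Pro-rich"),
    Feature("Compositional bias", 426, 434, "Poly-Ala"),
]

for feature in REGIONS:
    print(f"{feature.key:<20}{feature.start}-{feature.end}  length {feature.length}")
```

The printed lengths (69, 6, 134, 9) match the Length column of the table.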

Sequences

Isoform 1 [UniParc].

Last modified May 1, 1999. Version 1.
Checksum: D5EEAE6FA4756CB1
Length: 621
Mass (Da): 66,195
        10         20         30         40         50         60 
MEFPGGNDNY LTITGPSHPF LSGAETFHTP SLGDEEFEIP PISLDSDPSL AVSDVVGHFD 

        70         80         90        100        110        120 
DLADPSSSQD GSFSAQYGVQ TLDMPVGMTH GLMEQGGGLL SGGLTMDLDH SIGTQYSANP 

       130        140        150        160        170        180 
PVTIDVPMTD MTSGLMGHSQ LTTIDQSELS SQLGLSLGGG TILPPAQSPE DRLSTTPSPT 

       190        200        210        220        230        240 
SSLHEDGVED FRRQLPSQKT VVVEAGKKQK APKKRKKKDP NEPQKPVSAY ALFFRDTQAA 

       250        260        270        280        290        300 
IKGQNPNATF GEVSKIVASM WDSLGEEQKQ VYKRKTEAAK KEYLKALAAY KDNQECQATV 

       310        320        330        340        350        360 
ETVELDPAPP SQTPSPPPMA TVDPASPAPA SIEPPALSPS IVVNSTLSSY VANQASSGAG 

       370        380        390        400        410        420 
GQPNITKLII TKQMLPSSIT MSQGGMVTVI PATVVTSRGL QLGQTSTATI QPSQQAQIVT 

       430        440        450        460        470        480 
RSVLQAAAAA AAAASMQLPP PRLQPPPLQQ MPQPPTQQQV TILQQPPPLQ AMQQPPPQKV 

       490        500        510        520        530        540 
RINLQQQPPP LQIKSVPLPT LKMQTTLVPP TVESSPERPM NNSPEAHTVE APSPETICEM 

       550        560        570        580        590        600 
ITDVVPEVES PSQMDVELVS GSPVALSPQP RCVRSGCENP PIVSKDWDNE YCSNECVVKH 

       610        620 
CRDVFLAWVA SRNSNTVVFV K 
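
The grouped display above (position rulers plus blocks of 10 residues) can be collapsed into a plain sequence string and checked against the annotations in this entry. The following is a minimal Python sketch; the function names `parse_sequence_block` and `residue_at` are hypothetical.

```python
RAW = """
        10         20         30         40         50         60
MEFPGGNDNY LTITGPSHPF LSGAETFHTP SLGDEEFEIP PISLDSDPSL AVSDVVGHFD
        70         80         90        100        110        120
DLADPSSSQD GSFSAQYGVQ TLDMPVGMTH GLMEQGGGLL SGGLTMDLDH SIGTQYSANP
       130        140        150        160        170        180
PVTIDVPMTD MTSGLMGHSQ LTTIDQSELS SQLGLSLGGG TILPPAQSPE DRLSTTPSPT
       190        200        210        220        230        240
SSLHEDGVED FRRQLPSQKT VVVEAGKKQK APKKRKKKDP NEPQKPVSAY ALFFRDTQAA
       250        260        270        280        290        300
IKGQNPNATF GEVSKIVASM WDSLGEEQKQ VYKRKTEAAK KEYLKALAAY KDNQECQATV
       310        320        330        340        350        360
ETVELDPAPP SQTPSPPPMA TVDPASPAPA SIEPPALSPS IVVNSTLSSY VANQASSGAG
       370        380        390        400        410        420
GQPNITKLII TKQMLPSSIT MSQGGMVTVI PATVVTSRGL QLGQTSTATI QPSQQAQIVT
       430        440        450        460        470        480
RSVLQAAAAA AAAASMQLPP PRLQPPPLQQ MPQPPTQQQV TILQQPPPLQ AMQQPPPQKV
       490        500        510        520        530        540
RINLQQQPPP LQIKSVPLPT LKMQTTLVPP TVESSPERPM NNSPEAHTVE APSPETICEM
       550        560        570        580        590        600
ITDVVPEVES PSQMDVELVS GSPVALSPQP RCVRSGCENP PIVSKDWDNE YCSNECVVKH
       610        620
CRDVFLAWVA SRNSNTVVFV K
"""


def parse_sequence_block(text: str) -> str:
    """Keep only letters, dropping ruler digits and all whitespace."""
    return "".join(ch for ch in text if ch.isalpha())


def residue_at(seq: str, position: int) -> str:
    """Return the residue at a 1-based position."""
    return seq[position - 1]


seq = parse_sequence_block(RAW)
assert len(seq) == 621  # matches the stated sequence length

print(seq[212:218])  # motif 213-218: KKRKKK
print(seq[425:434])  # compositional bias 426-434: AAAAAAAAA

# Annotated phosphorylation sites should fall on Ser or Thr residues.
for position, expected in [(178, "S"), (182, "S"), (313, "T"), (315, "S")]:
    assert residue_at(seq, position) == expected
```

Stripping everything that is not a letter is enough here because the display contains only residues, digits, and whitespace. A parser for mixed content would need stricter validation.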

Isoform 2 [UniParc].

Checksum: F157F071E5DEF8ED
Length: 598
Mass (Da): 63,821

References

[1] "Prediction of the coding sequences of unidentified human genes. XI. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro."
Nagase T., Ishikawa K., Suyama M., Kikuno R., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
DNA Res. 5:277-286(1998)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Tissue: Brain.
[2] "Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
[3] "The DNA sequence and analysis of human chromosome 14."
Heilig R., Eckenberg R., Petit J.-L., Fonknechten N., Da Silva C., Cattolico L., Levy M., Barbe V., De Berardinis V., Ureta-Vidal A., Pelletier E., Vico V., Anthouard V., Rowen L., Madan A., Qin S., Sun H., Du H., Pepin K., Artiguenave F., Robert C., Cruaud C., Bruels T., Jaillon O., Friedlander L., Samson G., Brottier P., Cure S., Segurens B., Aniere F., Samain S., Crespeau H., Abbasi N., Aiach N., Boscus D., Dickhoff R., Dors M., Dubois I., Friedman C., Gouyvenoux M., James R., Madan A., Mairey-Estrada B., Mangenot S., Martins N., Menard M., Oztas S., Ratcliffe A., Shaffer T., Trask B., Vacherie B., Bellemere C., Belser C., Besnard-Gonnet M., Bartol-Mavel D., Boutard M., Briez-Silla S., Combette S., Dufosse-Laurent V., Ferron C., Lechaplais C., Louesse C., Muselet D., Magdelenat G., Pateau E., Petit E., Sirvain-Trukniewicz P., Trybou A., Vega-Czarny N., Bataille E., Bluet E., Bordelais I., Dubois M., Dumont C., Guerin T., Haffray S., Hammadi R., Muanga J., Pellouin V., Robert D., Wunderle E., Gauguet G., Roy A., Sainte-Marthe L., Verdier J., Verdier-Discala C., Hillier L.W., Fulton L., McPherson J., Matsuda F., Wilson R., Scarpelli C., Gyapay G., Wincker P., Saurin W., Quetier F., Waterston R., Hood L., Weissenbach J.
Nature 421:601-607(2003)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[4] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Tissue: Lung.
[5] "A quantitative atlas of mitotic phosphorylation."
Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008)
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-178 AND SER-182, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
Tissue: Cervix carcinoma.
[6] "Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach."
Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., Mohammed S.
Anal. Chem. 81:4493-4501(2009)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
[7] "Identification and characterization of a novel human PP1 phosphatase complex."
Lee J.H., You J., Dobrota E., Skalnik D.G.
J. Biol. Chem. 285:24466-24476(2010)
Cited for: IDENTIFICATION IN THE PTW/PP1 PHOSPHATASE COMPLEX, FUNCTION, SUBCELLULAR LOCATION, INTERACTION WITH PPP1R10/PNUTS.
[8] "Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis."
Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L., Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., Mann M.
Sci. Signal. 3:RA3-RA3(2010)
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-313 AND SER-315, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
Tissue: Cervix carcinoma.
[9] "System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation."
Rigbolt K.T., Prokhorova T.A., Akimov V., Henningsen J., Johansen P.T., Kratchmarova I., Kassem M., Mann M., Olsen J.V., Blagoev B.
Sci. Signal. 4:RS3-RS3(2011)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
AB018280 mRNA. Translation: BAA34457.2. Different initiation.
AK298555 mRNA. Translation: BAG60750.1.
BC013689 mRNA. Translation: AAH13689.1.
CCDS: CCDS32043.1.
RefSeq: NP_055643.1. NM_014828.2. [O94842-1]
UniGene: Hs.555910.

3D structure databases

ProteinModelPortal: O94842.
SMR: O94842. Positions 219-291.

Protein-protein interaction databases

BioGrid: 115209. 28 interactions.
IntAct: O94842. 18 interactions.
MINT: MINT-1193640.
STRING: 9606.ENSP00000262709.

PTM databases

PhosphoSite: O94842.

Proteomic databases

MaxQB: O94842.
PaxDb: O94842.
PeptideAtlas: O94842.
PRIDE: O94842.

Protocols and materials databases

DNASU: 9878.

Genome annotation databases

Ensembl:
ENST00000262709; ENSP00000262709; ENSG00000092203.
ENST00000405508; ENSP00000385102; ENSG00000092203.
ENST00000448790; ENSP00000393080; ENSG00000092203.
GeneID: 9878.
KEGG: hsa:9878.
UCSC: uc001way.3. human. [O94842-1]

Organism-specific databases

CTD: 9878.
GeneCards: GC14P021944.
HGNC: HGNC:20161. TOX4.
HPA: HPA017880. HPA027551.
MIM: 614032. gene.
neXtProt: NX_O94842.
PharmGKB: PA162406753.

Phylogenomic databases

eggNOG: NOG284736.
HOGENOM: HOG000230949.
HOVERGEN: HBG051013.
InParanoid: O94842.
OMA: PPTLKMQ.
OrthoDB: EOG7R834J.
PhylomeDB: O94842.
TreeFam: TF106481.

Gene expression databases

ArrayExpress: O94842.
Bgee: O94842.
CleanEx: HS_TOX4.
Genevestigator: O94842.

Family and domain databases

Gene3D: 1.10.30.10. 1 hit.
InterPro: IPR009071. HMG_box_dom.
Pfam: PF00505. HMG_box. 1 hit.
SMART: SM00398. HMG. 1 hit.
SUPFAM: SSF47095. SSF47095. 1 hit.
PROSITE: PS50118. HMG_BOX_2. 1 hit.

Other

ChiTaRS: TOX4. human.
GeneWiki: TOX4.
GenomeRNAi: 9878.
NextBio: 35474003.
PRO: O94842.

Entry information

Entry name: TOX4_HUMAN
Accession
Primary (citable) accession number: O94842
Secondary accession number(s): B4DPY8, E7EV69
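
The primary and secondary accession numbers above can be checked for well-formedness with a regular expression. The sketch below uses the accession pattern as it appears in the UniProt help pages, to the best of our knowledge. Treat the pattern as an assumption to confirm against current UniProt documentation. The helper name `is_accession` is hypothetical.

```python
import re

# Assumed UniProtKB accession pattern (6- or 10-character accessions).
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(text: str) -> bool:
    """True if the whole string matches the assumed accession pattern."""
    return ACCESSION_RE.fullmatch(text) is not None


# Accessions from this entry, plus the entry name as a negative example.
for candidate in ["O94842", "B4DPY8", "E7EV69", "TOX4_HUMAN"]:
    print(candidate, is_accession(candidate))
```

Only the three accession numbers match. The entry name TOX4_HUMAN is a separate identifier with its own format and is rejected.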
Entry history
Integrated into UniProtKB/Swiss-Prot: July 19, 2003
Last sequence update: May 1, 1999
Last modified: July 9, 2014
This is version 115 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

Human chromosome 14

Human chromosome 14: entries, gene names and cross-references to MIM