Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Centromere protein U

Gene

CENPU

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Component of the CENPA-NAC (nucleosome-associated) complex, a complex that plays a central role in assembly of kinetochore proteins, mitotic progression and chromosome segregation. The CENPA-NAC complex recruits the CENPA-CAD (nucleosome distal) complex and may be involved in incorporation of newly synthesized CENPA into centromeres. Plays an important role in the correct PLK1 localization to the mitotic kinetochores. A scaffold protein responsible for the initial recruitment and maintenance of the kinetochore PLK1 population until its degradation. Involved in transcriptional repression.3 Publications

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Repressor

Keywords - Biological processi

Host-virus interaction, Transcription, Transcription regulation

Enzyme and pathway databases

ReactomeiR-HSA-2467813. Separation of Sister Chromatids.
R-HSA-2500257. Resolution of Sister Chromatid Cohesion.
R-HSA-5663220. RHO GTPases Activate Formins.
R-HSA-606279. Deposition of new CENPA-containing nucleosomes at the centromere.
R-HSA-68877. Mitotic Prometaphase.
SIGNORiQ71F23.

Names & Taxonomyi

Protein namesi
Recommended name:
Centromere protein U
Short name:
CENP-U
Alternative name(s):
Centromere protein of 50 kDa
Short name:
CENP-50
Interphase centromere complex protein 24
KSHV latent nuclear antigen-interacting protein 1
MLF1-interacting protein
Polo-box-interacting protein 1
Gene namesi
Name:CENPU
Synonyms:ICEN24, KLIP1, MLF1IP, PBIP1
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 4

Organism-specific databases

HGNCiHGNC:21348. CENPU.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Centromere, Chromosome, Cytoplasm, Kinetochore, Nucleus

Pathology & Biotechi

Mutagenesis

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Mutagenesisi77S → A: Insensitive to PLK1-induced degradation. 1 Publication1
Mutagenesisi78T → A: Insensitive to PLK1-induced degradation. 1 Publication1
Mutagenesisi78T → D: Failed to enhance the PLK1-dependent degradation. 1 Publication1
Mutagenesisi78T → E: Failed to enhance the PLK1-dependent degradation. 1 Publication1

Organism-specific databases

DisGeNETi79682.
OpenTargetsiENSG00000151725.
PharmGKBiPA134893791.

Polymorphism and mutation databases

BioMutaiMLF1IP.
DMDMi74712714.

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00002476721 – 418Centromere protein UAdd BLAST418

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei78Phosphothreonine; by PLK11 Publication1
Modified residuei98PhosphothreonineCombined sources1
Modified residuei108PhosphoserineCombined sources1
Modified residuei110PhosphothreonineCombined sources1
Modified residuei111PhosphoserineCombined sources1
Modified residuei116PhosphoserineBy similarity1
Modified residuei120PhosphoserineBy similarity1
Modified residuei136PhosphoserineCombined sources1
Modified residuei139PhosphoserineCombined sources1
Modified residuei141PhosphoserineCombined sources1
Modified residuei190PhosphoserineBy similarity1
Modified residuei194PhosphoserineCombined sources1
Modified residuei232PhosphoserineCombined sources1

Post-translational modificationi

Phosphorylated by PLK1 at Thr-78, creating a self-tethering site that specifically interacts with the polo-box domain of PLK1.1 Publication

Keywords - PTMi

Phosphoprotein

Proteomic databases

EPDiQ71F23.
MaxQBiQ71F23.
PaxDbiQ71F23.
PeptideAtlasiQ71F23.
PRIDEiQ71F23.

PTM databases

iPTMnetiQ71F23.
PhosphoSitePlusiQ71F23.

Expressioni

Tissue specificityi

Expressed at high levels in the testis, fetal liver, thymus, bone marrow and at lower levels in the lymph nodes, placenta, colon and spleen. Present in all cell lines examined, including B-cells, T-cells, epithelial cells and fibroblast cells. Expressed at high levels in glioblastoma cell lines.3 Publications

Gene expression databases

BgeeiENSG00000151725.
CleanExiHS_MLF1IP.
ExpressionAtlasiQ71F23. baseline and differential.
GenevisibleiQ71F23. HS.

Organism-specific databases

HPAiHPA022048.

Interactioni

Subunit structurei

Component of the CENPA-NAC complex, at least composed of CENPA, CENPC, CENPH, CENPM, CENPN, CENPT and CENPU. The CENPA-NAC complex interacts with the CENPA-CAD complex, composed of CENPI, CENPK, CENPL, CENPO, CENPP, CENPQ, CENPR and CENPS. Interacts with the N-terminal domain of Kaposi's sarcoma-associated herpesvirus latent nuclear antigen (LNA). Interacts with MLF1. Interacts with PLK1.5 Publications

Binary interactionsi

WithEntry#Exp.IntActNotes
NUP62P371983EBI-2515234,EBI-347978
TFIP11Q9UBB93EBI-2515234,EBI-1105213

Protein-protein interaction databases

BioGridi122805. 63 interactors.
DIPiDIP-48539N.
IntActiQ71F23. 42 interactors.
MINTiMINT-4992596.
STRINGi9606.ENSP00000281453.

Structurei

3D structure databases

ProteinModelPortaliQ71F23.
SMRiQ71F23.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Coiled coil

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Coiled coili297 – 356Sequence analysisAdd BLAST60
Coiled coili397 – 417Sequence analysisAdd BLAST21

Motif

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Motifi6 – 23Nuclear localization signalSequence analysisAdd BLAST18
Motifi303 – 320Nuclear localization signalSequence analysisAdd BLAST18

Keywords - Domaini

Coiled coil

Phylogenomic databases

eggNOGiENOG410J00R. Eukaryota.
ENOG410Z48V. LUCA.
GeneTreeiENSGT00390000015511.
HOGENOMiHOG000236255.
HOVERGENiHBG081090.
InParanoidiQ71F23.
KOiK11513.
OMAiKQLHQDY.
OrthoDBiEOG091G09Q1.
PhylomeDBiQ71F23.
TreeFamiTF330780.

Family and domain databases

InterProiIPR025214. CENP-U.
[Graphical view]
PfamiPF13097. CENP-U. 1 hit.
[Graphical view]

Sequences (3)i

Sequence statusi: Complete.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q71F23-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MAPRGRRRPR PHRSEGARRS KNTLERTHSM KDKAGQKCKP IDVFDFPDNS
60 70 80 90 100
DVSSIGRLGE NEKDEETYET FDPPLHSTAI YADEEEFSKH CGLSLSSTPP
110 120 130 140 150
GKEAKRSSDT SGNEASEIES VKISAKKPGR KLRPISDDSE SIEESDTRRK
160 170 180 190 200
VKSAEKISTQ RHEVIRTTAS SELSEKPAES VTSKKTGPLS AQPSVEKENL
210 220 230 240 250
AIESQSKTQK KGKISHDKRK KSRSKAIGSD TSDIVHIWCP EGMKTSDIKE
260 270 280 290 300
LNIVLPEFEK THLEHQQRIE SKVCKAAIAT FYVNVKEQFI KMLKESQMLT
310 320 330 340 350
NLKRKNAKMI SDIEKKRQRM IEVQDELLRL EPQLKQLQTK YDELKERKSS
360 370 380 390 400
LRNAAYFLSN LKQLYQDYSD VQAQEPNVKE TYDSSSLPAL LFKARTLLGA
410
ESHLRNINHQ LEKLLDQG
Length:418
Mass (Da):47,522
Last modified:July 5, 2004 - v1
Checksum:iA99BC012EF7188F9
GO
Isoform 2 (identifier: Q71F23-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-242: Missing.

Show »
Length:176
Mass (Da):20,710
Checksum:iC5731EE5F7ADCE1D
GO
Isoform 3 (identifier: Q71F23-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     331-337: EPQLKQL → WTGAGLW
     338-418: Missing.

Show »
Length:337
Mass (Da):38,102
Checksum:i24A4BBC85B3E98E1
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti163E → G in AAI07745 (PubMed:15489334).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural variantiVAR_04869216G → R.Corresponds to variant rs902174dbSNPEnsembl.1
Natural variantiVAR_02714416G → S.Corresponds to variant rs902174dbSNPEnsembl.1
Natural variantiVAR_027145157I → T.Corresponds to variant rs6552804dbSNPEnsembl.1
Natural variantiVAR_027146214I → M.Corresponds to variant rs4616798dbSNPEnsembl.1
Natural variantiVAR_048693279A → T.Corresponds to variant rs34007339dbSNPEnsembl.1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0200301 – 242Missing in isoform 2. 3 PublicationsAdd BLAST242
Alternative sequenceiVSP_053526331 – 337EPQLKQL → WTGAGLW in isoform 3. 1 Publication7
Alternative sequenceiVSP_053527338 – 418Missing in isoform 3. 1 PublicationAdd BLAST81

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF469667 mRNA. Translation: AAQ05290.1.
AF516710 mRNA. Translation: AAQ08228.1.
AK027121 mRNA. Translation: BAB15665.1.
CR457376 mRNA. Translation: CAG33657.1.
DQ907910 mRNA. Translation: ABI49142.1.
AC079257 Genomic DNA. No translation available.
CH471056 Genomic DNA. Translation: EAX04666.1.
BC107744 mRNA. Translation: AAI07745.1.
BC031520 mRNA. Translation: AAH31520.1.
BC131556 mRNA. Translation: AAI31557.1.
CCDSiCCDS3838.1. [Q71F23-1]
RefSeqiNP_078905.2. NM_024629.3. [Q71F23-1]
UniGeneiHs.575032.

Genome annotation databases

EnsembliENST00000281453; ENSP00000281453; ENSG00000151725. [Q71F23-1]
GeneIDi79682.
KEGGihsa:79682.
UCSCiuc003iwq.4. human. [Q71F23-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF469667 mRNA. Translation: AAQ05290.1.
AF516710 mRNA. Translation: AAQ08228.1.
AK027121 mRNA. Translation: BAB15665.1.
CR457376 mRNA. Translation: CAG33657.1.
DQ907910 mRNA. Translation: ABI49142.1.
AC079257 Genomic DNA. No translation available.
CH471056 Genomic DNA. Translation: EAX04666.1.
BC107744 mRNA. Translation: AAI07745.1.
BC031520 mRNA. Translation: AAH31520.1.
BC131556 mRNA. Translation: AAI31557.1.
CCDSiCCDS3838.1. [Q71F23-1]
RefSeqiNP_078905.2. NM_024629.3. [Q71F23-1]
UniGeneiHs.575032.

3D structure databases

ProteinModelPortaliQ71F23.
SMRiQ71F23.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi122805. 63 interactors.
DIPiDIP-48539N.
IntActiQ71F23. 42 interactors.
MINTiMINT-4992596.
STRINGi9606.ENSP00000281453.

PTM databases

iPTMnetiQ71F23.
PhosphoSitePlusiQ71F23.

Polymorphism and mutation databases

BioMutaiMLF1IP.
DMDMi74712714.

Proteomic databases

EPDiQ71F23.
MaxQBiQ71F23.
PaxDbiQ71F23.
PeptideAtlasiQ71F23.
PRIDEiQ71F23.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000281453; ENSP00000281453; ENSG00000151725. [Q71F23-1]
GeneIDi79682.
KEGGihsa:79682.
UCSCiuc003iwq.4. human. [Q71F23-1]

Organism-specific databases

CTDi79682.
DisGeNETi79682.
GeneCardsiCENPU.
HGNCiHGNC:21348. CENPU.
HPAiHPA022048.
MIMi611511. gene.
neXtProtiNX_Q71F23.
OpenTargetsiENSG00000151725.
PharmGKBiPA134893791.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiENOG410J00R. Eukaryota.
ENOG410Z48V. LUCA.
GeneTreeiENSGT00390000015511.
HOGENOMiHOG000236255.
HOVERGENiHBG081090.
InParanoidiQ71F23.
KOiK11513.
OMAiKQLHQDY.
OrthoDBiEOG091G09Q1.
PhylomeDBiQ71F23.
TreeFamiTF330780.

Enzyme and pathway databases

ReactomeiR-HSA-2467813. Separation of Sister Chromatids.
R-HSA-2500257. Resolution of Sister Chromatid Cohesion.
R-HSA-5663220. RHO GTPases Activate Formins.
R-HSA-606279. Deposition of new CENPA-containing nucleosomes at the centromere.
R-HSA-68877. Mitotic Prometaphase.
SIGNORiQ71F23.

Miscellaneous databases

GeneWikiiMLF1IP.
GenomeRNAii79682.
PROiQ71F23.
SOURCEiSearch...

Gene expression databases

BgeeiENSG00000151725.
CleanExiHS_MLF1IP.
ExpressionAtlasiQ71F23. baseline and differential.
GenevisibleiQ71F23. HS.

Family and domain databases

InterProiIPR025214. CENP-U.
[Graphical view]
PfamiPF13097. CENP-U. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiCENPU_HUMAN
AccessioniPrimary (citable) accession number: Q71F23
Secondary accession number(s): A2RRD9
, Q09GN2, Q32Q71, Q9H5G1
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 25, 2006
Last sequence update: July 5, 2004
Last modified: November 2, 2016
This is version 112 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 4
    Human chromosome 4: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.