Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Entry version 194 (31 Jul 2019)
Sequence version 3 (16 Jan 2019)
Previous versions | rss
Other tutorials and videosHelp videoFeedback
Protein

Centromere protein F

Gene

CENPF

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score:

Annotation score:5 out of 5

<p>The annotation score provides a heuristic measure of the annotation content of a UniProtKB entry or proteome. This score <strong>cannot</strong> be used as a measure of the accuracy of the annotation as we cannot define the ‘correct annotation’ for any given protein.<p><a href='/help/annotation_score' target='_top'>More...</a></p>
-Experimental evidence at protein leveli <p>This indicates the type of evidence that supports the existence of the protein. Note that the ‘protein existence’ evidence does not give information on the accuracy or correctness of the sequence(s) displayed.<p><a href='/help/protein_existence' target='_top'>More...</a></p>

<p>This section provides any useful information about the protein, mostly biological knowledge.<p><a href='/help/function_section' target='_top'>More...</a></p>Functioni

Required for kinetochore function and chromosome segregation in mitosis. Required for kinetochore localization of dynein, LIS1, NDE1 and NDEL1. Regulates recycling of the plasma membrane by acting as a link between recycling vesicles and the microtubule network though its association with STX4 and SNAP25. Acts as a potential inhibitor of pocket protein-mediated cellular processes during development by regulating the activity of RB proteins during cell division and proliferation. May play a regulatory or permissive role in the normal embryonic cardiomyocyte cell cycle and in promoting continued mitosis in transformed, abnormally dividing neonatal cardiomyocytes. Interaction with RB directs embryonic stem cells toward a cardiac lineage. Involved in the regulation of DNA synthesis and hence cell cycle progression, via its C-terminus. Has a potential role regulating skeletal myogenesis and in cell differentiation in embryogenesis. Involved in dendritic cell regulation of T-cell immunity against chlamydia.4 Publications

<p>The <a href="http://www.geneontology.org/">Gene Ontology (GO)</a> project provides a set of hierarchical controlled vocabulary split into 3 categories:<p><a href='/help/gene_ontology' target='_top'>More...</a></p>GO - Molecular functioni

GO - Biological processi

<p>UniProtKB Keywords constitute a <a href="http://www.uniprot.org/keywords">controlled vocabulary</a> with a hierarchical structure. Keywords summarise the content of a UniProtKB entry and facilitate the search for proteins of interest.<p><a href='/help/keywords' target='_top'>More...</a></p>Keywordsi

Molecular functionDevelopmental protein
Biological processCell cycle, Cell division, Differentiation, DNA synthesis, Mitosis, Myogenesis

Enzyme and pathway databases

Reactome - a knowledgebase of biological pathways and processes

More...
Reactomei
R-HSA-141444 Amplification of signal from unattached kinetochores via a MAD2 inhibitory signal
R-HSA-156711 Polo-like kinase mediated events
R-HSA-2467813 Separation of Sister Chromatids
R-HSA-2500257 Resolution of Sister Chromatid Cohesion
R-HSA-5663220 RHO GTPases Activate Formins
R-HSA-68877 Mitotic Prometaphase

SIGNOR Signaling Network Open Resource

More...
SIGNORi
P49454

<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Names & Taxonomyi

<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section provides an exhaustive list of all names of the protein, from commonly used to obsolete, to allow unambiguous identification of a protein.<p><a href='/help/protein_names' target='_top'>More...</a></p>Protein namesi
Recommended name:
Centromere protein F
Short name:
CENP-F
Alternative name(s):
AH antigen
Kinetochore protein CENPF
Mitosin
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section indicates the name(s) of the gene(s) that code for the protein sequence(s) described in the entry. Four distinct tokens exist: ‘Name’, ‘Synonyms’, ‘Ordered locus names’ and ‘ORF names’.<p><a href='/help/gene_name' target='_top'>More...</a></p>Gene namesi
Name:CENPF
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section provides information on the name(s) of the organism that is the source of the protein sequence.<p><a href='/help/organism-name' target='_top'>More...</a></p>OrganismiHomo sapiens (Human)
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section shows the unique identifier assigned by the NCBI to the source organism of the protein. This is known as the ‘taxonomic identifier’ or ‘taxid’.<p><a href='/help/taxonomic_identifier' target='_top'>More...</a></p>Taxonomic identifieri9606 [NCBI]
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section contains the taxonomic hierarchical classification lineage of the source organism. It lists the nodes as they appear top-down in the taxonomic tree, with the more general grouping listed first.<p><a href='/help/taxonomic_lineage' target='_top'>More...</a></p>Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section is present for entries that are part of a <a href="http://www.uniprot.org/proteomes">proteome</a>, i.e. of a set of proteins thought to be expressed by organisms whose genomes have been completely sequenced.<p><a href='/help/proteomes_manual' target='_top'>More...</a></p>Proteomesi
  • UP000005640 <p>A UniProt <a href="http://www.uniprot.org/manual/proteomes_manual">proteome</a> can consist of several components. <br></br>The component name refers to the genomic component encoding a set of proteins.<p><a href='/help/proteome_component' target='_top'>More...</a></p> Componenti: Chromosome 1

Organism-specific databases

Human Gene Nomenclature Database

More...
HGNCi
HGNC:1857 CENPF

Online Mendelian Inheritance in Man (OMIM)

More...
MIMi
600236 gene

neXtProt; the human protein knowledge platform

More...
neXtProti
NX_P49454

<p>This section provides information on the location and the topology of the mature protein in the cell.<p><a href='/help/subcellular_location_section' target='_top'>More...</a></p>Subcellular locationi

Extracellular region or secreted Cytosol Plasma membrane Cytoskeleton Lysosome Endosome Peroxisome ER Golgi apparatus Nucleus Mitochondrion Manual annotation Automatic computational assertionGraphics by Christian Stolte & Seán O’Donoghue; Source: COMPARTMENTS

Keywords - Cellular componenti

Centromere, Chromosome, Cytoplasm, Cytoskeleton, Kinetochore, Nucleus

<p>This section provides information on the disease(s) and phenotype(s) associated with a protein.<p><a href='/help/pathology_and_biotech_section' target='_top'>More...</a></p>Pathology & Biotechi

<p>This subsection of the ‘Pathology and Biotech’ section provides information on the disease(s) associated with genetic variations in a given protein. The information is extracted from the scientific literature and diseases that are also described in the <a href="http://www.ncbi.nlm.nih.gov/sites/entrez?db=omim">OMIM</a> database are represented with a <a href="http://www.uniprot.org/diseases">controlled vocabulary</a> in the following way:<p><a href='/help/involvement_in_disease' target='_top'>More...</a></p>Involvement in diseasei

Stromme syndrome (STROMS)2 Publications
The disease is caused by mutations affecting the gene represented in this entry.
Disease descriptionAn autosomal recessive congenital disorder characterized by intestinal atresia, ocular anomalies, microcephaly, and renal and cardiac abnormalities in some patients. The disease has features of a ciliopathy, and lethality in early childhood is observed in severe cases.
Related information in OMIM

Keywords - Diseasei

Ciliopathy, Primary ciliary dyskinesia

Organism-specific databases

DisGeNET

More...
DisGeNETi
1063

MalaCards human disease database

More...
MalaCardsi
CENPF
MIMi243605 phenotype

Open Targets

More...
OpenTargetsi
ENSG00000117724

Orphanet; a database dedicated to information on rare diseases and orphan drugs

More...
Orphaneti
444069 Lethal fetal brain malformation-duodenal atresia-bilateral renal hypoplasia syndrome
506307 Stromme syndrome

The Pharmacogenetics and Pharmacogenomics Knowledge Base

More...
PharmGKBi
PA26401

Polymorphism and mutation databases

BioMuta curated single-nucleotide variation and disease association database

More...
BioMutai
CENPF

Domain mapping of disease mutations (DMDM)

More...
DMDMi
156630875

<p>This section describes post-translational modifications (PTMs) and/or processing events.<p><a href='/help/ptm_processing_section' target='_top'>More...</a></p>PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘PTM / Processing’ section describes the extent of a polypeptide chain in the mature protein following processing.<p><a href='/help/chain' target='_top'>More...</a></p>ChainiPRO_00000894771 – 3111Centromere protein FAdd BLAST3111
<p>This subsection of the <a href="http://www.uniprot.org/help/ptm_processing_section">PTM / Processing</a> section describes a propeptide, which is a part of a protein that is cleaved during maturation or activation. Once cleaved, a propeptide generally has no independent biological function.<p><a href='/help/propep' target='_top'>More...</a></p>PropeptideiPRO_00003967443112 – 3114Removed in mature formCurated3

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘PTM / Processing’ section specifies the position and type of each modified residue excluding <a href="http://www.uniprot.org/manual/lipid">lipids</a>, <a href="http://www.uniprot.org/manual/carbohyd">glycans</a> and <a href="http://www.uniprot.org/manual/crosslnk">protein cross-links</a>.<p><a href='/help/mod_res' target='_top'>More...</a></p>Modified residuei106PhosphoserineCombined sources1
Modified residuei144PhosphothreonineCombined sources1
Modified residuei151PhosphothreonineCombined sources1
Modified residuei154PhosphothreonineCombined sources1
Modified residuei158PhosphotyrosineCombined sources1
Modified residuei242PhosphoserineCombined sources1
Modified residuei276PhosphoserineCombined sources1
Modified residuei773PhosphoserineCombined sources1
Modified residuei783PhosphoserineCombined sources1
Modified residuei821PhosphoserineCombined sources1
Modified residuei834PhosphoserineCombined sources1
Modified residuei838PhosphoserineCombined sources1
Modified residuei876PhosphoserineCombined sources1
Modified residuei1248PhosphoserineCombined sources1
Modified residuei1255PhosphoserineCombined sources1
Modified residuei1259PhosphoserineCombined sources1
Modified residuei1651PhosphoserineCombined sources1
Modified residuei1652PhosphoserineCombined sources1
Modified residuei1654PhosphoserineCombined sources1
Modified residuei1726PhosphoserineCombined sources1
Modified residuei1862PhosphothreonineCombined sources1
Modified residuei1868PhosphoserineCombined sources1
Modified residuei1892PhosphoserineCombined sources1
Modified residuei2416PhosphoserineCombined sources1
Modified residuei2417PhosphoserineCombined sources1
Modified residuei2779N6-acetyllysineCombined sources1
Modified residuei2900PhosphoserineCombined sources1
Modified residuei2911PhosphoserineCombined sources1
Modified residuei2922PhosphoserineCombined sources1
Modified residuei2936PhosphoserineCombined sources1
Modified residuei2949PhosphothreonineCombined sources1
Modified residuei2952PhosphoserineCombined sources1
Modified residuei2998PhosphoserineCombined sources1
Modified residuei3023PhosphoserineCombined sources1
Modified residuei3026PhosphoserineCombined sources1
Modified residuei3054PhosphoserineCombined sources1
Modified residuei3079PhosphoserineCombined sources1
Modified residuei3083PhosphoserineCombined sources1
Modified residuei3111Cysteine methyl esterCurated1
<p>This subsection of the <a href="http://www.uniprot.org/help/ptm_processing_section">PTM / Processing</a> section specifies the position(s) and the type of covalently attached lipid group(s).<p><a href='/help/lipid' target='_top'>More...</a></p>Lipidationi3111S-farnesyl cysteine1 Publication1

<p>This subsection of the <a href="http://www.uniprot.org/help/ptm_processing_section">PTM/processing</a> section describes post-translational modifications (PTMs). This subsection <strong>complements</strong> the information provided at the sequence level or describes modifications for which <strong>position-specific data is not yet available</strong>.<p><a href='/help/post-translational_modification' target='_top'>More...</a></p>Post-translational modificationi

Hyperphosphorylated during mitosis.

Keywords - PTMi

Acetylation, Lipoprotein, Methylation, Phosphoprotein, Prenylation

Proteomic databases

Encyclopedia of Proteome Dynamics

More...
EPDi
P49454

jPOST - Japan Proteome Standard Repository/Database

More...
jPOSTi
P49454

MaxQB - The MaxQuant DataBase

More...
MaxQBi
P49454

PaxDb, a database of protein abundance averages across all three domains of life

More...
PaxDbi
P49454

PeptideAtlas

More...
PeptideAtlasi
P49454

PRoteomics IDEntifications database

More...
PRIDEi
P49454

ProteomicsDB human proteome resource

More...
ProteomicsDBi
56015

PTM databases

CarbonylDB database of protein carbonylation sites

More...
CarbonylDBi
P49454

iPTMnet integrated resource for PTMs in systems biology context

More...
iPTMneti
P49454

Comprehensive resource for the study of protein post-translational modifications (PTMs) in human, mouse and rat.

More...
PhosphoSitePlusi
P49454

<p>This section provides information on the expression of a gene at the mRNA or protein level in cells or in tissues of multicellular organisms.<p><a href='/help/expression_section' target='_top'>More...</a></p>Expressioni

<p>This subsection of the ‘Expression’ section provides information on the expression of the gene product at various stages of a cell, tissue or organism development. By default, the information is derived from experiments at the mRNA level, unless specified ‘at the protein level’.<p><a href='/help/developmental_stage' target='_top'>More...</a></p>Developmental stagei

Gradually accumulates during the cell cycle, reaching peak levels in G2 and M phase, and is rapidly degraded upon completion of mitosis.1 Publication

Gene expression databases

Bgee dataBase for Gene Expression Evolution

More...
Bgeei
ENSG00000117724 Expressed in 151 organ(s), highest expression level in heart

ExpressionAtlas, Differential and Baseline Expression

More...
ExpressionAtlasi
P49454 baseline and differential

Genevisible search portal to normalized and curated expression data from Genevestigator

More...
Genevisiblei
P49454 HS

Organism-specific databases

Human Protein Atlas

More...
HPAi
CAB070134
HPA052382
HPA064308

<p>This section provides information on the quaternary structure of a protein and on interaction(s) with other proteins or protein complexes.<p><a href='/help/interaction_section' target='_top'>More...</a></p>Interactioni

<p>This subsection of the <a href="http://www.uniprot.org/help/interaction_section">'Interaction'</a> section provides information about the protein quaternary structure and interaction(s) with other proteins or protein complexes (with the exception of physiological receptor-ligand interactions which are annotated in the <a href="http://www.uniprot.org/help/function_section">'Function'</a> section).<p><a href='/help/subunit_structure' target='_top'>More...</a></p>Subunit structurei

Interacts with and STX4 (via C-terminus) (By similarity).

Interacts (via N-terminus) with RBL1, RBL2 and SNAP25 (By similarity). Self-associates.

Interacts with CENP-E and BUBR1 (via C-terminus).

Interacts (via C-terminus) with NDE1, NDEL1 and RB1.

By similarity4 Publications

<p>This subsection of the '<a href="http://www.uniprot.org/help/interaction_section%27">Interaction</a> section provides information about binary protein-protein interactions. The data presented in this section are a quality-filtered subset of binary interactions automatically derived from the <a href="http://www.ebi.ac.uk/intact/">IntAct database</a>. It is updated on a monthly basis. Each binary interaction is displayed on a separate line.<p><a href='/help/binary_interactions' target='_top'>More...</a></p>Binary interactionsi

GO - Molecular functioni

Protein-protein interaction databases

The Biological General Repository for Interaction Datasets (BioGrid)

More...
BioGridi
107492, 55 interactors

Protein interaction database and analysis system

More...
IntActi
P49454, 34 interactors

Molecular INTeraction database

More...
MINTi
P49454

STRING: functional protein association networks

More...
STRINGi
9606.ENSP00000355922

<p>This section provides information on the tertiary and secondary structure of a protein.<p><a href='/help/structure_section' target='_top'>More...</a></p>Structurei

3D structure databases

SWISS-MODEL Repository - a database of annotated 3D protein structure models

More...
SMRi
P49454

Database of comparative protein structure models

More...
ModBasei
Search...

<p>This section provides information on sequence similarities with other proteins and the domain(s) present in a protein.<p><a href='/help/family_and_domains_section' target='_top'>More...</a></p>Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Family and Domains’ section indicates the positions and types of repeated sequence motifs or repeated domains within the protein.<p><a href='/help/repeat' target='_top'>More...</a></p>Repeati2111 – 22901Add BLAST180
Repeati2293 – 24722Add BLAST180

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Family and Domains’ section describes a region of interest that cannot be described in other subsections.<p><a href='/help/region' target='_top'>More...</a></p>Regioni1 – 481Interaction with SNAP25 and required for localization to the cytoplasmBy similarityAdd BLAST481
Regioni2026 – 2351Interaction with NDE1 and NDEL11 PublicationAdd BLAST326
Regioni2111 – 24722 X 177 AA tandem repeatsAdd BLAST362
Regioni2392 – 3017Sufficient for centromere localizationAdd BLAST626
Regioni2392 – 2829Sufficient for self-associationAdd BLAST438
Regioni2831 – 3017Sufficient for nuclear localizationAdd BLAST187

Coiled coil

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Family and domains’ section denotes the positions of regions of coiled coil within the protein.<p><a href='/help/coiled' target='_top'>More...</a></p>Coiled coili13 – 131Sequence analysisAdd BLAST119
Coiled coili280 – 685Sequence analysisAdd BLAST406
Coiled coili899 – 989Sequence analysisAdd BLAST91
Coiled coili1196 – 1244Sequence analysisAdd BLAST49
Coiled coili1549 – 1646Sequence analysisAdd BLAST98
Coiled coili1890 – 2078Sequence analysisAdd BLAST189
Coiled coili2107 – 2891Sequence analysisAdd BLAST785

Motif

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Family and Domains’ section describes a short (usually not more than 20 amino acids) conserved sequence motif of biological significance.<p><a href='/help/motif' target='_top'>More...</a></p>Motifi2919 – 2936Nuclear localization signalSequence analysisAdd BLAST18

<p>This subsection of the ‘Family and domains’ section provides information about the sequence similarity with other proteins.<p><a href='/help/sequence_similarities' target='_top'>More...</a></p>Sequence similaritiesi

Belongs to the centromere protein F family.Curated

Keywords - Domaini

Coiled coil, Repeat

Phylogenomic databases

evolutionary genealogy of genes: Non-supervised Orthologous Groups

More...
eggNOGi
ENOG410IGJF Eukaryota
ENOG410XS5F LUCA

Ensembl GeneTree

More...
GeneTreei
ENSGT00730000111187

InParanoid: Eukaryotic Ortholog Groups

More...
InParanoidi
P49454

KEGG Orthology (KO)

More...
KOi
K11499

Database for complete collections of gene phylogenies

More...
PhylomeDBi
P49454

TreeFam database of animal gene trees

More...
TreeFami
TF101133

Family and domain databases

Integrated resource of protein families, domains and functional sites

More...
InterProi
View protein in InterPro
IPR018302 CenpF/LEK1_Rb-prot-bd
IPR019513 Centromere_CenpF_leu-rich_rpt
IPR018463 Centromere_CenpF_N

Pfam protein domain database

More...
Pfami
View protein in Pfam
PF10490 CENP-F_C_Rb_bdg, 1 hit
PF10473 CENP-F_leu_zip, 3 hits
PF10481 CENP-F_N, 1 hit

<p>This section displays by default the canonical protein sequence and upon request all isoforms described in the entry. It also includes information pertinent to the sequence(s), including <a href="http://www.uniprot.org/help/sequence_length">length</a> and <a href="http://www.uniprot.org/help/sequences">molecular weight</a>. The information is filed in different subsections. The current subsections and their content are listed below:<p><a href='/help/sequences_section' target='_top'>More...</a></p>Sequence (1+)i

<p>This subsection of the <a href="http://www.uniprot.org/help/sequences_section">Sequence</a> section indicates if the <a href="http://www.uniprot.org/help/canonical_and_isoforms">canonical sequence</a> displayed by default in the entry is complete or not.<p><a href='/help/sequence_status' target='_top'>More...</a></p>Sequence statusi: Complete.

<p>This subsection of the <a href="http://www.uniprot.org/help/sequences_section">Sequence</a> section indicates if the <a href="http://www.uniprot.org/help/canonical_and_isoforms">canonical sequence</a> displayed by default in the entry is in its mature form or if it represents the precursor.<p><a href='/help/sequence_processing' target='_top'>More...</a></p>Sequence processingi: The displayed sequence is further processed into a mature form.

This entry has 1 described isoform and 1 potential isoform that is computationally mapped.Show allAlign All

P49454-1 [UniParc]FASTAAdd to basket
« Hide
        10         20         30         40         50
MSWALEEWKE GLPTRALQKI QELEGQLDKL KKEKQQRQFQ LDSLEAALQK
60 70 80 90 100
QKQKVENEKT EGTNLKRENQ RLMEICESLE KTKQKISHEL QVKESQVNFQ
110 120 130 140 150
EGQLNSGKKQ IEKLEQELKR CKSELERSQQ AAQSADVSLN PCNTPQKIFT
160 170 180 190 200
TPLTPSQYYS GSKYEDLKEK YNKEVEERKR LEAEVKALQA KKASQTLPQA
210 220 230 240 250
TMNHRDIARH QASSSVFSWQ QEKTPSHLSS NSQRTPIRRD FSASYFSGEQ
260 270 280 290 300
EVTPSRSTLQ IGKRDANSSF FDNSSSPHLL DQLKAQNQEL RNKINELELR
310 320 330 340 350
LQGHEKEMKG QVNKFQELQL QLEKAKVELI EKEKVLNKCR DELVRTTAQY
360 370 380 390 400
DQASTKYTAL EQKLKKLTED LSCQRQNAES ARCSLEQKIK EKEKEFQEEL
410 420 430 440 450
SRQQRSFQTL DQECIQMKAR LTQELQQAKN MHNVLQAELD KLTSVKQQLE
460 470 480 490 500
NNLEEFKQKL CRAEQAFQAS QIKENELRRS MEEMKKENNL LKSHSEQKAR
510 520 530 540 550
EVCHLEAELK NIKQCLNQSQ NFAEEMKAKN TSQETMLRDL QEKINQQENS
560 570 580 590 600
LTLEKLKLAV ADLEKQRDCS QDLLKKREHH IEQLNDKLSK TEKESKALLS
610 620 630 640 650
ALELKKKEYE ELKEEKTLFS CWKSENEKLL TQMESEKENL QSKINHLETC
660 670 680 690 700
LKTQQIKSHE YNERVRTLEM DRENLSVEIR NLHNVLDSKS VEVETQKLAY
710 720 730 740 750
MELQQKAEFS DQKHQKEIEN MCLKTSQLTG QVEDLEHKLQ LLSNEIMDKD
760 770 780 790 800
RCYQDLHAEY ESLRDLLKSK DASLVTNEDH QRSLLAFDQQ PAMHHSFANI
810 820 830 840 850
IGEQGSMPSE RSECRLEADQ SPKNSAILQN RVDSLEFSLE SQKQMNSDLQ
860 870 880 890 900
KQCEELVQIK GEIEENLMKA EQMHQSFVAE TSQRISKLQE DTSAHQNVVA
910 920 930 940 950
ETLSALENKE KELQLLNDKV ETEQAEIQEL KKSNHLLEDS LKELQLLSET
960 970 980 990 1000
LSLEKKEMSS IISLNKREIE ELTQENGTLK EINASLNQEK MNLIQKSESF
1010 1020 1030 1040 1050
ANYIDEREKS ISELSDQYKQ EKLILLQRCE ETGNAYEDLS QKYKAAQEKN
1060 1070 1080 1090 1100
SKLECLLNEC TSLCENRKNE LEQLKEAFAK EHQEFLTKLA FAEERNQNLM
1110 1120 1130 1140 1150
LELETVQQAL RSEMTDNQNN SKSEAGGLKQ EIMTLKEEQN KMQKEVNDLL
1160 1170 1180 1190 1200
QENEQLMKVM KTKHECQNLE SEPIRNSVKE RESERNQCNF KPQMDLEVKE
1210 1220 1230 1240 1250
ISLDSYNAQL VQLEAMLRNK ELKLQESEKE KECLQHELQT IRGDLETSNL
1260 1270 1280 1290 1300
QDMQSQEISG LKDCEIDAEE KYISGPHELS TSQNDNAHLQ CSLQTTMNKL
1310 1320 1330 1340 1350
NELEKICEIL QAEKYELVTE LNDSRSECIT ATRKMAEEVG KLLNEVKILN
1360 1370 1380 1390 1400
DDSGLLHGEL VEDIPGGEFG EQPNEQHPVS LAPLDESNSY EHLTLSDKEV
1410 1420 1430 1440 1450
QMHFAELQEK FLSLQSEHKI LHDQHCQMSS KMSELQTYVD SLKAENLVLS
1460 1470 1480 1490 1500
TNLRNFQGDL VKEMQLGLEE GLVPSLSSSC VPDSSSLSSL GDSSFYRALL
1510 1520 1530 1540 1550
EQTGDMSLLS NLEGAVSANQ CSVDEVFCSS LQEENLTRKE TPSAPAKGVE
1560 1570 1580 1590 1600
ELESLCEVYR QSLEKLEEKM ESQGIMKNKE IQELEQLLSS ERQELDCLRK
1610 1620 1630 1640 1650
QYLSENEQWQ QKLTSVTLEM ESKLAAEKKQ TEQLSLELEV ARLQLQGLDL
1660 1670 1680 1690 1700
SSRSLLGIDT EDAIQGRNES CDISKEHTSE TTERTPKHDV HQICDKDAQQ
1710 1720 1730 1740 1750
DLNLDIEKIT ETGAVKPTGE CSGEQSPDTN YEPPGEDKTQ GSSECISELS
1760 1770 1780 1790 1800
FSGPNALVPM DFLGNQEDIH NLQLRVKETS NENLRLLHVI EDRDRKVESL
1810 1820 1830 1840 1850
LNEMKELDSK LHLQEVQLMT KIEACIELEK IVGELKKENS DLSEKLEYFS
1860 1870 1880 1890 1900
CDHQELLQRV ETSEGLNSDL EMHADKSSRE DIGDNVAKVN DSWKERFLDV
1910 1920 1930 1940 1950
ENELSRIRSE KASIEHEALY LEADLEVVQT EKLCLEKDNE NKQKVIVCLE
1960 1970 1980 1990 2000
EELSVVTSER NQLRGELDTM SKKTTALDQL SEKMKEKTQE LESHQSECLH
2010 2020 2030 2040 2050
CIQVAEAEVK EKTELLQTLS SDVSELLKDK THLQEKLQSL EKDSQALSLT
2060 2070 2080 2090 2100
KCELENQIAQ LNKEKELLVK ESESLQARLS ESDYEKLNVS KALEAALVEK
2110 2120 2130 2140 2150
GEFALRLSST QEEVHQLRRG IEKLRVRIEA DEKKQLHIAE KLKEREREND
2160 2170 2180 2190 2200
SLKDKVENLE RELQMSEENQ ELVILDAENS KAEVETLKTQ IEEMARSLKV
2210 2220 2230 2240 2250
FELDLVTLRS EKENLTKQIQ EKQGQLSELD KLLSSFKSLL EEKEQAEIQI
2260 2270 2280 2290 2300
KEESKTAVEM LQNQLKELNE AVAALCGDQE IMKATEQSLD PPIEEEHQLR
2310 2320 2330 2340 2350
NSIEKLRARL EADEKKQLCV LQQLKESEHH ADLLKGRVEN LERELEIART
2360 2370 2380 2390 2400
NQEHAALEAE NSKGEVETLK AKIEGMTQSL RGLELDVVTI RSEKENLTNE
2410 2420 2430 2440 2450
LQKEQERISE LEIINSSFEN ILQEKEQEKV QMKEKSSTAM EMLQTQLKEL
2460 2470 2480 2490 2500
NERVAALHND QEACKAKEQN LSSQVECLEL EKAQLLQGLD EAKNNYIVLQ
2510 2520 2530 2540 2550
SSVNGLIQEV EDGKQKLEKK DEEISRLKNQ IQDQEQLVSK LSQVEGEHQL
2560 2570 2580 2590 2600
WKEQNLELRN LTVELEQKIQ VLQSKNASLQ DTLEVLQSSY KNLENELELT
2610 2620 2630 2640 2650
KMDKMSFVEK VNKMTAKETE LQREMHEMAQ KTAELQEELS GEKNRLAGEL
2660 2670 2680 2690 2700
QLLLEEIKSS KDQLKELTLE NSELKKSLDC MHKDQVEKEG KVREEIAEYQ
2710 2720 2730 2740 2750
LRLHEAEKKH QALLLDTNKQ YEVEIQTYRE KLTSKEECLS SQKLEIDLLK
2760 2770 2780 2790 2800
SSKEELNNSL KATTQILEEL KKTKMDNLKY VNQLKKENER AQGKMKLLIK
2810 2820 2830 2840 2850
SCKQLEEEKE ILQKELSQLQ AAQEKQKTGT VMDTKVDELT TEIKELKETL
2860 2870 2880 2890 2900
EEKTKEADEY LDKYCSLLIS HEKLEKAKEM LETQVAHLCS QQSKQDSRGS
2910 2920 2930 2940 2950
PLLGPVVPGP SPIPSVTEKR LSSGQNKASG KRQRSSGIWE NGRGPTPATP
2960 2970 2980 2990 3000
ESFSKKSKKA VMSGIHPAED TEGTEFEPEG LPEVVKKGFA DIPTGKTSPY
3010 3020 3030 3040 3050
ILRRTTMATR TSPRLAAQKL ALSPLSLGKE NLAESSKPTA GGSRSQKVKV
3060 3070 3080 3090 3100
AQRSPVDSGT ILREPTTKSV PVNNLPERSP TDSPREGLRV KRGRLVPSPK
3110
AGLESNGSEN CKVQ
Length:3,114
Mass (Da):357,527
Last modified:January 16, 2019 - v3
<p>The checksum is a form of redundancy check that is calculated from the sequence. It is useful for tracking sequence updates.</p> <p>It should be noted that while, in theory, two different sequences could have the same checksum value, the likelihood that this would happen is extremely low.</p> <p>However UniProtKB may contain entries with identical sequences in case of multiple genes (paralogs).</p> <p>The checksum is computed as the sequence 64-bit Cyclic Redundancy Check value (CRC64) using the generator polynomial: x<sup>64</sup> + x<sup>4</sup> + x<sup>3</sup> + x + 1. The algorithm is described in the ISO 3309 standard. </p> <p class="publication">Press W.H., Flannery B.P., Teukolsky S.A. and Vetterling W.T.<br /> <strong>Cyclic redundancy and other checksums</strong><br /> <a href="http://www.nrbook.com/b/bookcpdf.php">Numerical recipes in C 2nd ed., pp896-902, Cambridge University Press (1993)</a>)</p> Checksum:i1FC81972F947B375
GO

<p>In eukaryotic reference proteomes, unreviewed entries that are likely to belong to the same gene are computationally mapped, based on gene identifiers from Ensembl, EnsemblGenomes and model organism databases.<p><a href='/help/gene_centric_isoform_mapping' target='_top'>More...</a></p>Computationally mapped potential isoform sequencesi

There is 1 potential isoform mapped to this entry.BLASTAlignShow allAdd to basket
EntryEntry nameProtein names
Gene namesLengthAnnotation
A0A087WTY4A0A087WTY4_HUMAN
Centromere protein F
CENPF
175Annotation score:

Annotation score:1 out of 5

<p>The annotation score provides a heuristic measure of the annotation content of a UniProtKB entry or proteome. This score <strong>cannot</strong> be used as a measure of the accuracy of the annotation as we cannot define the ‘correct annotation’ for any given protein.<p><a href='/help/annotation_score' target='_top'>More...</a></p>

<p>This subsection of the ‘Sequence’ section reports difference(s) between the protein sequence shown in the UniProtKB entry and other available protein sequences derived from the same gene.<p><a href='/help/sequence_caution' target='_top'>More...</a></p>Sequence cautioni

The sequence AAA82889 differs from that shown. Probable cloning artifact.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Sequence’ section reports difference(s) between the canonical sequence (displayed by default in the entry) and the different sequence submissions merged in the entry. These various submissions may originate from different sequencing projects, different types of experiments, or different biological samples. Sequence conflicts are usually of unknown origin.<p><a href='/help/conflict' target='_top'>More...</a></p>Sequence conflicti16A → T in AAA82889 (PubMed:7542657).Curated1
Sequence conflicti48L → P in AAA82889 (PubMed:7542657).Curated1
Sequence conflicti48L → P in AAA82935 (PubMed:7651420).Curated1
Sequence conflicti52K → T in AAA82889 (PubMed:7542657).Curated1
Sequence conflicti52K → T in AAA82935 (PubMed:7651420).Curated1
Sequence conflicti611Missing in AAA82935 (PubMed:7651420).Curated1
Sequence conflicti1515A → V in AAA82889 (PubMed:7542657).Curated1
Sequence conflicti1715V → L in AAA82935 (PubMed:7651420).Curated1
Sequence conflicti2146 – 2147ER → DG in AAA86889 (PubMed:7612011).Curated2
Sequence conflicti2239L → Q in AAA86889 (PubMed:7612011).Curated1
Sequence conflicti2396N → D in AAA82889 (PubMed:7542657).Curated1
Sequence conflicti2396N → D in AAA86889 (PubMed:7612011).Curated1
Sequence conflicti2449 – 2465ELNER…QEACK → SSMREWQPCIMTKKPVS in AAA86889 (PubMed:7612011).CuratedAdd BLAST17
Sequence conflicti2943R → G in AAA82889 (PubMed:7542657).Curated1
Sequence conflicti2943R → G in AAA82935 (PubMed:7651420).Curated1
Sequence conflicti2943R → G in AAA86889 (PubMed:7612011).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Sequence’ section describes natural variant(s) of the protein sequence.<p><a href='/help/variant' target='_top'>More...</a></p>Natural variantiVAR_055049250Q → L1 PublicationCorresponds to variant dbSNP:rs1050065Ensembl.1
Natural variantiVAR_055050272D → G1 PublicationCorresponds to variant dbSNP:rs1050066Ensembl.1
Natural variantiVAR_034712300R → C. Corresponds to variant dbSNP:rs17023281Ensembl.1
Natural variantiVAR_034713494H → Q. Corresponds to variant dbSNP:rs2070065Ensembl.1
Natural variantiVAR_034714701M → V. Corresponds to variant dbSNP:rs3795524Ensembl.1
Natural variantiVAR_034715754Q → E. Corresponds to variant dbSNP:rs3795523Ensembl.1
Natural variantiVAR_034716815R → H. Corresponds to variant dbSNP:rs3795522Ensembl.1
Natural variantiVAR_0347171018Y → D. Corresponds to variant dbSNP:rs3795519Ensembl.1
Natural variantiVAR_0347181033G → R. Corresponds to variant dbSNP:rs3795518Ensembl.1
Natural variantiVAR_0347191105T → I. Corresponds to variant dbSNP:rs12067133Ensembl.1
Natural variantiVAR_0347201412L → S. Corresponds to variant dbSNP:rs3795517Ensembl.1
Natural variantiVAR_0556381768D → N. Corresponds to variant dbSNP:rs3748692Ensembl.1
Natural variantiVAR_0347231915E → A. Corresponds to variant dbSNP:rs3790647Ensembl.1
Natural variantiVAR_0148393106N → K3 PublicationsCorresponds to variant dbSNP:rs7289Ensembl.1

Sequence databases

Select the link destinations:

EMBL nucleotide sequence database

More...
EMBLi

GenBank nucleotide sequence database

More...
GenBanki

DNA Data Bank of Japan; a nucleotide sequence database

More...
DDBJi
Links Updated
U19769 mRNA Translation: AAA82889.1 Sequence problems.
U30872 mRNA Translation: AAA82935.1
AL445666 Genomic DNA No translation available.
AL445305 Genomic DNA No translation available.
BC172232 mRNA Translation: AAI72232.1
U25725 mRNA Translation: AAA86889.1

The Consensus CDS (CCDS) project

More...
CCDSi
CCDS31023.1

Protein sequence database of the Protein Information Resource

More...
PIRi
PC4035

NCBI Reference Sequences

More...
RefSeqi
NP_057427.3, NM_016343.3
XP_016855575.1, XM_017000086.1

Genome annotation databases

Ensembl eukaryotic genome annotation project

More...
Ensembli
ENST00000366955; ENSP00000355922; ENSG00000117724

Database of genes from NCBI RefSeq genomes

More...
GeneIDi
1063

KEGG: Kyoto Encyclopedia of Genes and Genomes

More...
KEGGi
hsa:1063

UCSC genome browser

More...
UCSCi
uc001hkm.4 human

Keywords - Coding sequence diversityi

Polymorphism

<p>This section provides links to proteins that are similar to the protein sequence(s) described in this entry at different levels of sequence identity thresholds (100%, 90% and 50%) based on their membership in UniProt Reference Clusters (<a href="http://www.uniprot.org/help/uniref">UniRef</a>).<p><a href='/help/similar_proteins_section' target='_top'>More...</a></p>Similar proteinsi

<p>This section is used to point to information related to entries and found in data collections other than UniProtKB.<p><a href='/help/cross_references_section' target='_top'>More...</a></p>Cross-referencesi

<p>This subsection of the <a href="http://www.uniprot.org/manual/cross_references_section">Cross-references</a> section provides links to various web resources that are relevant for a specific protein.<p><a href='/help/web_resource' target='_top'>More...</a></p>Web resourcesi

Atlas of Genetics and Cytogenetics in Oncology and Haematology

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U19769 mRNA Translation: AAA82889.1 Sequence problems.
U30872 mRNA Translation: AAA82935.1
AL445666 Genomic DNA No translation available.
AL445305 Genomic DNA No translation available.
BC172232 mRNA Translation: AAI72232.1
U25725 mRNA Translation: AAA86889.1
CCDSiCCDS31023.1
PIRiPC4035
RefSeqiNP_057427.3, NM_016343.3
XP_016855575.1, XM_017000086.1

3D structure databases

SMRiP49454
ModBaseiSearch...

Protein-protein interaction databases

BioGridi107492, 55 interactors
IntActiP49454, 34 interactors
MINTiP49454
STRINGi9606.ENSP00000355922

PTM databases

CarbonylDBiP49454
iPTMnetiP49454
PhosphoSitePlusiP49454

Polymorphism and mutation databases

BioMutaiCENPF
DMDMi156630875

Proteomic databases

EPDiP49454
jPOSTiP49454
MaxQBiP49454
PaxDbiP49454
PeptideAtlasiP49454
PRIDEiP49454
ProteomicsDBi56015

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000366955; ENSP00000355922; ENSG00000117724
GeneIDi1063
KEGGihsa:1063
UCSCiuc001hkm.4 human

Organism-specific databases

Comparative Toxicogenomics Database

More...
CTDi
1063
DisGeNETi1063

GeneCards: human genes, protein and diseases

More...
GeneCardsi
CENPF
HGNCiHGNC:1857 CENPF
HPAiCAB070134
HPA052382
HPA064308
MalaCardsiCENPF
MIMi243605 phenotype
600236 gene
neXtProtiNX_P49454
OpenTargetsiENSG00000117724
Orphaneti444069 Lethal fetal brain malformation-duodenal atresia-bilateral renal hypoplasia syndrome
506307 Stromme syndrome
PharmGKBiPA26401

GenAtlas: human gene database

More...
GenAtlasi
Search...

Phylogenomic databases

eggNOGiENOG410IGJF Eukaryota
ENOG410XS5F LUCA
GeneTreeiENSGT00730000111187
InParanoidiP49454
KOiK11499
PhylomeDBiP49454
TreeFamiTF101133

Enzyme and pathway databases

ReactomeiR-HSA-141444 Amplification of signal from unattached kinetochores via a MAD2 inhibitory signal
R-HSA-156711 Polo-like kinase mediated events
R-HSA-2467813 Separation of Sister Chromatids
R-HSA-2500257 Resolution of Sister Chromatid Cohesion
R-HSA-5663220 RHO GTPases Activate Formins
R-HSA-68877 Mitotic Prometaphase
SIGNORiP49454

Miscellaneous databases

ChiTaRS: a database of human, mouse and fruit fly chimeric transcripts and RNA-sequencing data

More...
ChiTaRSi
CENPF human

The Gene Wiki collection of pages on human genes and proteins

More...
GeneWikii
CENPF

Database of phenotypes from RNA interference screens in Drosophila and Homo sapiens

More...
GenomeRNAii
1063

Protein Ontology

More...
PROi
PR:P49454

The Stanford Online Universal Resource for Clones and ESTs

More...
SOURCEi
Search...

Gene expression databases

BgeeiENSG00000117724 Expressed in 151 organ(s), highest expression level in heart
ExpressionAtlasiP49454 baseline and differential
GenevisibleiP49454 HS

Family and domain databases

InterProiView protein in InterPro
IPR018302 CenpF/LEK1_Rb-prot-bd
IPR019513 Centromere_CenpF_leu-rich_rpt
IPR018463 Centromere_CenpF_N
PfamiView protein in Pfam
PF10490 CENP-F_C_Rb_bdg, 1 hit
PF10473 CENP-F_leu_zip, 3 hits
PF10481 CENP-F_N, 1 hit

ProtoNet; Automatic hierarchical classification of proteins

More...
ProtoNeti
Search...

MobiDB: a database of protein disorder and mobility annotations

More...
MobiDBi
Search...

<p>This section provides general information on the entry.<p><a href='/help/entry_information_section' target='_top'>More...</a></p>Entry informationi

<p>This subsection of the ‘Entry information’ section provides a mnemonic identifier for a UniProtKB entry, but it is not a stable identifier. Each reviewed entry is assigned a unique entry name upon integration into UniProtKB/Swiss-Prot.<p><a href='/help/entry_name' target='_top'>More...</a></p>Entry nameiCENPF_HUMAN
<p>This subsection of the ‘Entry information’ section provides one or more accession number(s). These are stable identifiers and should be used to cite UniProtKB entries. Upon integration into UniProtKB, each entry is assigned a unique accession number, which is called ‘Primary (citable) accession number’.<p><a href='/help/accession_numbers' target='_top'>More...</a></p>AccessioniPrimary (citable) accession number: P49454
Secondary accession number(s): Q13171, Q13246, Q5VVM7
<p>This subsection of the ‘Entry information’ section shows the date of integration of the entry into UniProtKB, the date of the last sequence update and the date of the last annotation modification (‘Last modified’). The version number for both the entry and the <a href="http://www.uniprot.org/help/canonical_and_isoforms">canonical sequence</a> are also displayed.<p><a href='/help/entry_history' target='_top'>More...</a></p>Entry historyiIntegrated into UniProtKB/Swiss-Prot: February 1, 1996
Last sequence update: January 16, 2019
Last modified: July 31, 2019
This is version 194 of the entry and version 3 of the sequence. See complete history.
<p>This subsection of the ‘Entry information’ section indicates whether the entry has been manually annotated and reviewed by UniProtKB curators or not, in other words, if the entry belongs to the Swiss-Prot section of UniProtKB (<strong>reviewed</strong>) or to the computer-annotated TrEMBL section (<strong>unreviewed</strong>).<p><a href='/help/entry_status' target='_top'>More...</a></p>Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

<p>This section contains any relevant information that doesn’t fit in any other defined sections<p><a href='/help/miscellaneous_section' target='_top'>More...</a></p>Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. Human chromosome 1
    Human chromosome 1: entries, gene names and cross-references to MIM
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again