Q61191 (HCFC1_MOUSE) Reviewed, UniProtKB/Swiss-Prot
Last modified
May 1, 2013.
Version 123.
History...
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Host cell factor 1 Short name=HCF Short name=HCF-1 Alternative name(s): C1 factor Cleaved into the following 12 chains: | ||
| Gene names |
| ||
| Organism | Mus musculus (Mouse) [Reference proteome] | ||
| Taxonomic identifier | 10090 [NCBI] | ||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus › Mus![]() |
Protein attributes
| Sequence length | 2045 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Function | Involved in control of the cell cycle. Also antagonizes transactivation by ZBTB17 and GABP2; represses ZBTB17 activation of the p15(INK4b) promoter and inhibits its ability to recruit p300. Coactivator for EGR2 and GABP2. Tethers the chromatin modifying Set1/Ash2 histone H3 'Lys-4' methyltransferase (H3K4me) and Sin3 histone deacetylase (HDAC) complexes (involved in the activation and repression of transcription respectively) together. As part of the NSL complex it may be involved in acetylation of nucleosomal histone H4 on several lysine residues. Ref.1 Ref.7 |
| Subunit structure | Composed predominantly of six polypeptides ranging from 110 to 150 kDa and a minor 300 kDa polypeptide. The majority of N- and C-terminal cleavage products remain tightly, albeit non-covalently, associated. Interacts with POU2F1, CREB3, ZBTB17, EGR2, E2F4, CREBZF, SP1, GABP2, Sin3 HDAC complex (SIN3A, HDAC1, HDAC2, SUDS3), SAP30, SIN3B and FHL2. Component of a MLL1 complex, composed of at least the core components MLL, ASH2L, HCFC1, WDR5 and RBBP5, as well as the facultative components BAP18, CHD8, DPY30, E2F6, HCFC2, HSP70, INO80C, KANSL1, LAS1L, MAX, MCRS1, MEN1, MGA, KAT8, PELP1, PHF20,. PRP31, RING2, RUVBL1, RUVBL2, SENP3, TAF1, TAF4, TAF6, TAF7, TAF9 and TEX10. Component of the MLL5-L complex, composed of at least MLL5, STK38, PPP1CA, PPP1CB, PPP1CC, HCFC1, ACTB and OGT. Interacts directly with OGT; the interaction, which requires the cleavage site domain, glycosylates and promotes proteolytic processing of HCFC1 and retains OGT in the nucleus. Interacts with TET2 and TET3. Interacts with HCFC1R1. Component of the SET1 complex, at least composed of the catalytic subunit (SETD1A or SETD1B), WDR5, WDR82, RBBP5, ASH2L, CXXC1, HCFC1 and DPY30. Interacts (via HBM motif) with SETD1A. Component of the NSL complex at least composed of MOF/KAT8, KANSL1, KANSL2, KANSL3, MCRS1, PHF20, OGT1/OGT, WDR5 and HCFC1 By similarity. Interacts with THAP11. Ref.1 Ref.9 UniProtKB P51610 |
| Subcellular location | Nucleus. Cytoplasm. Note: HCFC1R1 modulates its subcellular localization and overexpression of HCFC1R1 leads to accumulation of HCFC1 in the cytoplasm. Non-processed HCFC1 associates with chromatin. Colocalizes with CREB3 and CANX in the ER By similarity. Ref.7 |
| Tissue specificity | Expressed in liver, pituitary gland, skeletal muscle, kidney, eye and brain (at protein level). Also observed at low level in heart, spleen and lung. Ref.1 Ref.7 |
| Domain | The HCF repeat is a highly specific proteolytic cleavage signal By similarity. UniProtKB P51610 The kelch repeats fold into a 6-bladed kelch beta-propeller called the beta-propeller domain which mediates interaction with HCFC1R1 By similarity. |
| Post-translational modification | Proteolytically cleaved at one or several PPCE--THET sites within the HCF repeats. Cleavage is promoted by O-glycosylation By similarity. UniProtKB P51610 O-glycosylated. GlcNAcylation by OGT promotes proteolytic processing By similarity. Ubiquitinated. Lys-1817 and Lys-1818 are ubiquitinated both via 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains. BAP1 mediated deubiquitination of 'Lys-48'-linked polyubiquitin chains; deubiquitination by BAP1 does not seem to stabilize the protein By similarity. |
| Sequence similarities | Contains 5 Kelch repeats. |
| Sequence caution | The sequence CAF25305.1 differs from that shown. Reason: Frameshift at position 339. |
Ontologies
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||||||||||||||||||
Molecule processing | |||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Initiator methionine | 1 | 1 | Removed By similarity | ||||||||||||||||||||||||
| Chain | 2 – 1432 | 1431 | HCF N-terminal chain 6 By similarity | PRO_0000016635 | |||||||||||||||||||||||
| Chain | 2 – 1332 | 1331 | HCF N-terminal chain 5 By similarity | PRO_0000016636 | |||||||||||||||||||||||
| Chain | 2 – 1304 | 1303 | HCF N-terminal chain 4 By similarity | PRO_0000016637 | |||||||||||||||||||||||
| Chain | 2 – 1110 | 1109 | HCF N-terminal chain 3 By similarity | PRO_0000016638 | |||||||||||||||||||||||
| Chain | 2 – 1081 | 1080 | HCF N-terminal chain 2 By similarity | PRO_0000016639 | |||||||||||||||||||||||
| Chain | 2 – 1019 | 1018 | HCF N-terminal chain 1 By similarity | PRO_0000016640 | |||||||||||||||||||||||
| Chain | 1020 – 2045 | 1026 | HCF C-terminal chain 1 By similarity UniProtKB P51610 | PRO_0000016641 | |||||||||||||||||||||||
| Chain | 1082 – 2045 | 964 | HCF C-terminal chain 2 By similarity UniProtKB P51610 | PRO_0000016642 | |||||||||||||||||||||||
| Chain | 1111 – 2045 | 935 | HCF C-terminal chain 3 By similarity UniProtKB P51610 | PRO_0000016643 | |||||||||||||||||||||||
| Chain | 1305 – 2045 | 741 | HCF C-terminal chain 4 By similarity UniProtKB P51610 | PRO_0000016644 | |||||||||||||||||||||||
| Chain | 1333 – 2045 | 713 | HCF C-terminal chain 5 By similarity UniProtKB P51610 | PRO_0000016645 | |||||||||||||||||||||||
| Chain | 1433 – 2045 | 613 | HCF C-terminal chain 6 By similarity UniProtKB P51610 | PRO_0000016646 | |||||||||||||||||||||||
Regions | |||||||||||||||||||||||||||
| Repeat | 44 – 89 | 46 | Kelch 1 | ||||||||||||||||||||||||
| Repeat | 93 – 140 | 48 | Kelch 2 | ||||||||||||||||||||||||
| Repeat | 148 – 194 | 47 | Kelch 3 | ||||||||||||||||||||||||
| Repeat | 217 – 265 | 49 | Kelch 4 | ||||||||||||||||||||||||
| Repeat | 266 – 313 | 48 | Kelch 5 | ||||||||||||||||||||||||
| Repeat | 1010 – 1035 | 26 | HCF repeat 1 | ||||||||||||||||||||||||
| Repeat | 1072 – 1097 | 26 | HCF repeat 2 | ||||||||||||||||||||||||
| Repeat | 1101 – 1126 | 26 | HCF repeat 3 | ||||||||||||||||||||||||
| Repeat | 1157 – 1182 | 26 | HCF repeat 4; degenerate | ||||||||||||||||||||||||
| Repeat | 1295 – 1320 | 26 | HCF repeat 5 | ||||||||||||||||||||||||
| Repeat | 1323 – 1348 | 26 | HCF repeat 6 | ||||||||||||||||||||||||
| Repeat | 1358 – 1383 | 26 | HCF repeat 7; degenerate | ||||||||||||||||||||||||
| Repeat | 1423 – 1448 | 26 | HCF repeat 8 | ||||||||||||||||||||||||
| Region | 500 – 550 | 51 | Required for interaction with OGT By similarity | ||||||||||||||||||||||||
| Region | 610 – 722 | 113 | Interaction with SIN3A By similarity | ||||||||||||||||||||||||
| Region | 750 – 902 | 153 | Interaction with ZBTB17 By similarity | ||||||||||||||||||||||||
| Region | 813 – 912 | 100 | Interaction with GABP2 By similarity | ||||||||||||||||||||||||
| Compositional bias | 574 – 1500 | 927 | Thr-rich | ||||||||||||||||||||||||
| Compositional bias | 1622 – 1674 | 53 | Ala-rich | ||||||||||||||||||||||||
| Compositional bias | 1684 – 1720 | 37 | Gln-rich | ||||||||||||||||||||||||
Sites | |||||||||||||||||||||||||||
| Site | 1019 – 1020 | 2 | Cleavage; by autolysis By similarity UniProtKB P51610 | ||||||||||||||||||||||||
| Site | 1081 – 1082 | 2 | Cleavage; by autolysis By similarity UniProtKB P51610 | ||||||||||||||||||||||||
| Site | 1110 – 1111 | 2 | Cleavage; by autolysis By similarity UniProtKB P51610 | ||||||||||||||||||||||||
| Site | 1304 – 1305 | 2 | Cleavage; by autolysis By similarity UniProtKB P51610 | ||||||||||||||||||||||||
| Site | 1332 – 1333 | 2 | Cleavage; by autolysis By similarity UniProtKB P51610 | ||||||||||||||||||||||||
| Site | 1432 – 1433 | 2 | Cleavage; by autolysis By similarity UniProtKB P51610 | ||||||||||||||||||||||||
Amino acid modifications | |||||||||||||||||||||||||||
| Modified residue | 2 | 1 | N-acetylalanine By similarity | ||||||||||||||||||||||||
| Modified residue | 6 | 1 | Phosphoserine By similarity | ||||||||||||||||||||||||
| Modified residue | 288 | 1 | N6-acetyllysine By similarity | ||||||||||||||||||||||||
| Modified residue | 411 | 1 | Phosphoserine By similarity | ||||||||||||||||||||||||
| Modified residue | 666 | 1 | Phosphoserine By similarity | ||||||||||||||||||||||||
| Modified residue | 669 | 1 | Phosphoserine By similarity | ||||||||||||||||||||||||
| Modified residue | 813 | 1 | N6-acetyllysine By similarity | ||||||||||||||||||||||||
| Modified residue | 1204 | 1 | Phosphoserine By similarity | ||||||||||||||||||||||||
| Modified residue | 1500 | 1 | Phosphothreonine By similarity | ||||||||||||||||||||||||
| Modified residue | 1516 | 1 | Phosphoserine By similarity | ||||||||||||||||||||||||
| Modified residue | 2015 | 1 | N6-acetyllysine By similarity | ||||||||||||||||||||||||
| Cross-link | 105 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity | |||||||||||||||||||||||||
| Cross-link | 163 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity | |||||||||||||||||||||||||
| Cross-link | 244 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity | |||||||||||||||||||||||||
| Cross-link | 363 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity | |||||||||||||||||||||||||
| Cross-link | 1817 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity | |||||||||||||||||||||||||
| Cross-link | 1818 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity | |||||||||||||||||||||||||
Experimental info | |||||||||||||||||||||||||||
| Sequence conflict | 483 | 1 | V → R in AAB01163. Ref.1 | ||||||||||||||||||||||||
| Sequence conflict | 483 | 1 | V → R in AAD09225. Ref.1 | ||||||||||||||||||||||||
| Sequence conflict | 520 | 1 | P → S in CAF25305. Ref.6 | ||||||||||||||||||||||||
| Sequence conflict | 567 – 568 | 2 | VL → AW in AAB01163. Ref.1 | ||||||||||||||||||||||||
| Sequence conflict | 567 – 568 | 2 | VL → AW in AAD09225. Ref.1 | ||||||||||||||||||||||||
| Sequence conflict | 711 | 1 | Q → H in AAD09225. Ref.1 | ||||||||||||||||||||||||
| Sequence conflict | 1104 | 1 | C → S in AAB01163. Ref.1 | ||||||||||||||||||||||||
| Sequence conflict | 1104 | 1 | C → S in AAD09225. Ref.1 | ||||||||||||||||||||||||
| Sequence conflict | 1230 | 1 | G → A in AAB01163. Ref.1 | ||||||||||||||||||||||||
| Sequence conflict | 1230 | 1 | G → A in AAD09225. Ref.1 | ||||||||||||||||||||||||
Secondary structure | |||||||||||||||||||||||||||
Helix Strand Turn | |||||||||||||||||||||||||||
| Beta strand | 1897 – 1899 | 3 | |||||||||||||||||||||||||
| Beta strand | 1905 – 1912 | 8 | |||||||||||||||||||||||||
| Beta strand | 1915 – 1921 | 7 | |||||||||||||||||||||||||
| Beta strand | 1924 – 1926 | 3 | |||||||||||||||||||||||||
| Beta strand | 1930 – 1938 | 9 | |||||||||||||||||||||||||
| Beta strand | 1965 – 1968 | 4 | |||||||||||||||||||||||||
| Helix | 1973 – 1977 | 5 | |||||||||||||||||||||||||
| Beta strand | 1984 – 1997 | 14 | |||||||||||||||||||||||||
| Beta strand | 2005 – 2010 | 6 | |||||||||||||||||||||||||
Sequences
| ||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "The mouse homologue of the human transcription factor C1 (host cell factor). Conservation of forms and function." Kristie T.M. J. Biol. Chem. 272:26749-26755(1997) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], FUNCTION, SUBUNIT, TISSUE SPECIFICITY. Tissue: Fetal liver. |
| [2] | "Lineage-specific biology revealed by a finished genome assembly of the mouse." Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S. Ponting C.P.PLoS Biol. 7:E1000112-E1000112(2009) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. Strain: C57BL/6J. |
| [3] | Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C. Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. |
| [4] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. Tissue: Limb mesenchyme. |
| [5] | "The transcriptional landscape of the mammalian genome." Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J. Hayashizaki Y.Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-209. Strain: NOD. Tissue: Thymus. |
| [6] | "Towards functional annotation of all Xq28 genes: expression and intracellular localization analyses reveal novel candidates for disease genes." Kolb A.A., Mehrle A., Bechtel S., Wellenreuther R., Simpson J., Pepperkok R., Wiemann S., Poustka A. Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 165-520. Strain: NMRI. Tissue: Embryo. |
| [7] | "Nuclear localization of the C1 factor (host cell factor) in sensory neurons correlates with reactivation of herpes simplex virus from latency." Kristie T.M., Vogel J.L., Sears A.E. Proc. Natl. Acad. Sci. U.S.A. 96:1229-1233(1999) [PubMed] [Europe PMC] [Abstract] Cited for: FUNCTION, SUBCELLULAR LOCATION, TISSUE SPECIFICITY. Strain: BALB/c. |
| [8] | "O-linked N-acetylglucosamine proteomics of postsynaptic density preparations using lectin weak affinity chromatography and mass spectrometry." Vosseller K., Trinidad J.C., Chalkley R.J., Specht C.G., Thalhammer A., Lynn A.J., Snedecor J.O., Guan S., Medzihradszky K.F., Maltby D.A., Schoepfer R., Burlingame A.L. Mol. Cell. Proteomics 5:923-934(2006) [PubMed] [Europe PMC] [Abstract] Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS], MASS SPECTROMETRY. Tissue: Brain. |
| [9] | "Ronin is essential for embryogenesis and the pluripotency of mouse embryonic stem cells." Dejosez M., Krumenacker J.S., Zitur L.J., Passeri M., Chu L.-F., Songyang Z., Thomson J.A., Zwaka T.P. Cell 133:1162-1174(2008) [PubMed] [Europe PMC] [Abstract] Cited for: INTERACTION WITH THAP11. |
| [10] | Erratum Dejosez M., Krumenacker J.S., Zitur L.J., Passeri M., Chu L.-F., Songyang Z., Thomson J.A., Zwaka T.P. Cell 134:692-692(2008) |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EMBL GenBank DDBJ | U53925 Genomic DNA. Translation: AAB01163.1. U80821 Genomic DNA. Translation: AAD09225.1. AL672002 Genomic DNA. Translation: CAM18726.1. CH466650 Genomic DNA. Translation: EDL29851.1. BC053742 mRNA. Translation: AAH53742.1. AK088827 mRNA. Translation: BAC40597.1. AJ627036 mRNA. Translation: CAF25305.1. Frameshift. | ||||||||||||
| IPI | IPI00828543. | ||||||||||||
| RefSeq | NP_032250.2. NM_008224.3. | ||||||||||||
| UniGene | Mm.248353. Mm.439140. | ||||||||||||
3D structure databases | |||||||||||||
| PDBe RCSB PDB PDBj |
| ||||||||||||
| ProteinModelPortal | Q61191. | ||||||||||||
| SMR | Q61191. Positions 360-400, 1821-2013. | ||||||||||||
| ModBase | Search... | ||||||||||||
Protein-protein interaction databases | |||||||||||||
| IntAct | Q61191. 2 interactions. | ||||||||||||
| STRING | 10090.ENSMUSP00000110012. | ||||||||||||
PTM databases | |||||||||||||
| PhosphoSite | Q61191. | ||||||||||||
Proteomic databases | |||||||||||||
| PaxDb | Q61191. | ||||||||||||
| PRIDE | Q61191. | ||||||||||||
Protocols and materials databases | |||||||||||||
| StructuralBiologyKnowledgebase | Search... | ||||||||||||
Genome annotation databases | |||||||||||||
| Ensembl | ENSMUST00000033761; ENSMUSP00000033761; ENSMUSG00000031386. | ||||||||||||
| GeneID | 15161. | ||||||||||||
| KEGG | mmu:15161. | ||||||||||||
Organism-specific databases | |||||||||||||
| CTD | 3054. | ||||||||||||
| MGI | MGI:105942. Hcfc1. | ||||||||||||
Phylogenomic databases | |||||||||||||
| eggNOG | NOG12793. | ||||||||||||
| GeneTree | ENSGT00700000104199. | ||||||||||||
| HOGENOM | HOG000293192. | ||||||||||||
| HOVERGEN | HBG051888. | ||||||||||||
| KO | K14966. | ||||||||||||
| OrthoDB | EOG49GKGP. | ||||||||||||
Gene expression databases | |||||||||||||
| ArrayExpress | Q61191. | ||||||||||||
| Bgee | Q61191. | ||||||||||||
| CleanEx | MM_HCFC1. | ||||||||||||
| Genevestigator | Q61191. | ||||||||||||
| GermOnline | ENSMUSG00000031386. Mus musculus. | ||||||||||||
Family and domain databases | |||||||||||||
| Gene3D | 2.120.10.80. 2 hits. 2.60.40.10. 2 hits. | ||||||||||||
| InterPro | IPR003961. Fibronectin_type3. IPR013783. Ig-like_fold. IPR015915. Kelch-typ_b-propeller. IPR006652. Kelch_1. IPR011498. Kelch_2. [Graphical view] | ||||||||||||
| Pfam | PF01344. Kelch_1. 2 hits. PF07646. Kelch_2. 1 hit. [Graphical view] | ||||||||||||
| SMART | SM00060. FN3. 2 hits. [Graphical view] | ||||||||||||
| SUPFAM | SSF49265. FN_III-like. 2 hits. | ||||||||||||
| ProtoNet | Search... | ||||||||||||
Other | |||||||||||||
| ChiTaRS | HCFC1. mouse. | ||||||||||||
| NextBio | 287652. | ||||||||||||
| SOURCE | Search... | ||||||||||||
Entry information
| Entry name | HCFC1_MOUSE | ||||||||
| Accession | Primary (citable) accession number: Q61191 Secondary accession number(s): B1AUX1 Q9QWH2 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
Relevant documents
| MGD cross-references Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot |
| PDB cross-references Index of Protein Data Bank (PDB) cross-references |
| SIMILARITY comments Index of protein domains and families |

Clusters with
