Protein

Host cell factor 2

Gene

Hcfc2

Organism
Rattus norvegicus (Rat)
Status
Reviewed. Annotation score: 2 out of 5. Experimental evidence at transcript level.

Function

GO - Biological process

  • negative regulation of transcription from RNA polymerase II promoter (Source: UniProtKB)

Names & Taxonomy

Protein names
Recommended name: Host cell factor 2
Short name: HCF-2
Alternative name(s): C2 factor
Gene names
Name: Hcfc2
Organism: Rattus norvegicus (Rat)
Taxonomic identifier: 10116 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Rattus
Proteomes
  • UP000002494 Component: Unplaced

Organism-specific databases

RGD: 1307385. Hcfc2.

Subcellular location

Keywords - Cellular component

Cytoplasm, Nucleus

PTM / Processing

Molecule processing

Feature key   Position(s)   Length   Description          Feature identifier
Chain         1 – 723       723      Host cell factor 2   PRO_0000119074

Proteomic databases

PaxDb: Q5RKG2.
PRIDE: Q5RKG2.

Interaction

Subunit structure

Binds KMT2A/MLL1. Component of the MLL1/MLL complex, at least composed of KMT2A/MLL1, ASH2L, RBBP5, DPY30, WDR5, MEN1, HCFC1 and HCFC2 (By similarity).

Protein-protein interaction databases

STRING: 10116.ENSRNOP00000014951.

Structure

3D structure databases

ProteinModelPortal: Q5RKG2.
SMR: Q5RKG2. Positions 614-723.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Length   Description              Evidence
Repeat        34 – 79       46       Kelch 1
Repeat        83 – 130      48       Kelch 2
Repeat        207 – 255     49       Kelch 3
Repeat        257 – 305     49       Kelch 4
Domain        357 – 436     80       Fibronectin type-III 1   PROSITE-ProRule annotation
Domain        516 – 606     91       Fibronectin type-III 2   PROSITE-ProRule annotation
Domain        608 – 720     113      Fibronectin type-III 3   PROSITE-ProRule annotation
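
The Length value of each feature follows from its inclusive, 1-based residue coordinates (end - start + 1). The sketch below recomputes it for the repeats and domains listed above. The tuple layout and the helper name `feature_length` are illustrative assumptions and are not part of any UniProt tool.

```python
# Features copied from the entry: (description, start, end, listed length).
FEATURES = [
    ("Kelch 1", 34, 79, 46),
    ("Kelch 2", 83, 130, 48),
    ("Kelch 3", 207, 255, 49),
    ("Kelch 4", 257, 305, 49),
    ("Fibronectin type-III 1", 357, 436, 80),
    ("Fibronectin type-III 2", 516, 606, 91),
    ("Fibronectin type-III 3", 608, 720, 113),
]


def feature_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based residue range (hypothetical helper)."""
    return end - start + 1


for name, start, end, listed in FEATURES:
    computed = feature_length(start, end)
    status = "ok" if computed == listed else "MISMATCH"
    print(f"{name}: {start}-{end} -> {computed} ({status})")
```

The same arithmetic applies to the chain feature: positions 1 to 723 give a length of 723.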

Sequence similarities

Contains 3 fibronectin type-III domains (PROSITE-ProRule annotation).
Contains 4 Kelch repeats (Curated).

Keywords - Domain

Kelch repeat, Repeat

Phylogenomic databases

eggNOG: KOG4152. Eukaryota.
eggNOG: ENOG410Y5AC. LUCA.
HOGENOM: HOG000021205.
HOVERGEN: HBG051889.
InParanoid: Q5RKG2.
KO: K14966.

Family and domain databases

Gene3D: 2.120.10.80. 1 hit.
Gene3D: 2.130.10.80. 1 hit.
Gene3D: 2.60.40.10. 3 hits.
InterPro: IPR003961. FN3_dom.
InterPro: IPR015916. Gal_Oxidase_b-propeller.
InterPro: IPR013783. Ig-like_fold.
InterPro: IPR015915. Kelch-typ_b-propeller.
InterPro: IPR006652. Kelch_1.
Pfam: PF01344. Kelch_1. 1 hit.
SMART: SM00060. FN3. 2 hits.
SUPFAM: SSF49265. SSF49265. 2 hits.
PROSITE: PS50853. FN3. 2 hits.

Sequence

Sequence status: Complete.

Q5RKG2-1 [UniParc]

        10         20         30         40         50
MAAPSLLNWR RVSSFTGPVP RARHGHRAVA IRELMIIFGG GNEGIADELH
        60         70         80         90        100
VYNTVTNQWF LPAVRGDIPP GCAAHGFVCD GTRILVFGGM VEYGRYSNEL
       110        120        130        140        150
YELQASRWLW KKVKPQPPPS GLPPCPRLGH SFSLYGNKCY LFAGLANESE
       160        170        180        190        200
DSNNNVPRYL NDFYELELQH GSGVVGWSVP ATKGTVPSPR ESHTAVIYCK
       210        220        230        240        250
RDSGSPKMYV FGGMCGARLD DLWQLDLETM SWSKPETKGT VPLPRSLHTA
       260        270        280        290        300
SVIGNKMYIF GGWVPHKGEN TENSPHDCEW RCTSSFSYLN LDTAEWTTLV
       310        320        330        340        350
SDSQEDKKNS RPRPRAGHCA VAIGTRLYFW SGRDGYKKAL NSQVCCKDLW
       360        370        380        390        400
YLDTEKPPAP SQVQLIKATT NSFHVKWDEV PTVEGYLLQL NTDLTHQAAS
       410        420        430        440        450
PDASAAPNTL GGRTDPHRQG SNSILHNSVS DPANCTKPEH TAVAARGMSL
       460        470        480        490        500
KSKPDSRAAD SSVALHSPLA PNTSNNNSCM ADMLWKSEVD EICALPATKI
       510        520        530        540        550
SRVEAHAAAT PFSKETPSNP VAILKAEQWC DVGIFKNNTA LVSQFYLLPK
       560        570        580        590        600
GKQSMSKVGN ADVPDYSLLK KQDLVPGTVY KFRVAAINGC GIGPFSKLSE
       610        620        630        640        650
FKTCIPGFPG APSTVRISKN VEGIHLSWEP PTSPSGNILE YSAYLAIRTA
       660        670        680        690        700
QVQDNPSQLV FMRIYCGLKT SCIVTAGQLA NAHIDYTSRP AIVFRISAKN
       710        720
EKGYGPATQV RWLQGNSKKA PLS
Length: 723
Mass (Da): 79,149
Last modified: December 21, 2004 - v1
Checksum: FC09D7455BEEAD1F

Sequence databases

EMBL/GenBank/DDBJ: BC085951 mRNA. Translation: AAH85951.1.
RefSeq: NP_001008358.1. NM_001008357.1.
UniGene: Rn.90662.

Genome annotation databases

GeneID: 314704.
KEGG: rno:314704.
UCSC: RGD:1307385. rat.

Cross-references

Organism-specific databases

CTD: 29915.
RGD: 1307385. Hcfc2.

Miscellaneous databases

NextBio: 668111.
PRO: Q5RKG2.

Publications

  1. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Tissue: Testis.

Entry information

Entry name: HCFC2_RAT
Accession: Primary (citable) accession number: Q5RKG2
Entry history
Integrated into UniProtKB/Swiss-Prot: April 26, 2005
Last sequence update: December 21, 2004
Last modified: November 11, 2015
This is version 75 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
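
The identity and overlap thresholds quoted above can be sketched as a simple check. This is a simplified illustration only: it tests a single pairwise comparison against a seed sequence and does not reproduce the actual UniRef clustering procedure. The function names and the fractional inputs (0.0 to 1.0) are assumptions made for this example.

```python
# Minimum sequence identity per cluster level, as stated in the text above.
UNIREF_MIN_IDENTITY = {"UniRef90": 0.90, "UniRef50": 0.50}
# Minimum overlap with the seed (longest) sequence, as stated in the text above.
MIN_OVERLAP = 0.80


def meets_threshold(identity: float, overlap: float, min_identity: float) -> bool:
    """True if a comparison against the seed passes both cut-offs (hypothetical helper)."""
    return identity >= min_identity and overlap >= MIN_OVERLAP


def candidate_levels(identity: float, overlap: float) -> list[str]:
    """Cluster levels whose thresholds a single seed comparison would satisfy."""
    return [
        level
        for level, min_identity in UNIREF_MIN_IDENTITY.items()
        if meets_threshold(identity, overlap, min_identity)
    ]


print(candidate_levels(0.95, 0.85))
print(candidate_levels(0.60, 0.90))
print(candidate_levels(0.95, 0.50))
```

UniRef100 is left out of the sketch because it groups identical sequences and sub-fragments of 11 or more residues rather than applying an identity percentage.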