ID E0VC46_PEDHC Unreviewed; 3311 AA. AC E0VC46; DT 02-NOV-2010, integrated into UniProtKB/TrEMBL. DT 02-NOV-2010, sequence version 1. DT 27-MAR-2024, entry version 95. DE RecName: Full=DNA-directed RNA polymerase II subunit RPB7 {ECO:0000256|ARBA:ARBA00015928}; GN Name=8231358 {ECO:0000313|EnsemblMetazoa:PHUM079870-PA}; GN ORFNames=Phum_PHUM079870 {ECO:0000313|EMBL:EEB10952.1}; OS Pediculus humanus subsp. corporis (Body louse). OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; OC Neoptera; Paraneoptera; Psocodea; Phthiraptera; Anoplura; Pediculidae; OC Pediculus. OX NCBI_TaxID=121224; RN [1] {ECO:0000313|EMBL:EEB10952.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=USDA {ECO:0000313|EMBL:EEB10952.1}; RA Kirkness E., Hannick L., Hass B., Bruggner R., Lawson D., Bidwell S., RA Joardar V., Caler E., Walenz B., Inman J., Schobel S., Galinsky K., RA Amedeo P., Strausberg R.; RT "Annotation of Pediculus humanus corporis strain USDA."; RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases. RN [2] {ECO:0000313|EMBL:EEB10952.1} RP NUCLEOTIDE SEQUENCE. RC STRAIN=USDA {ECO:0000313|EMBL:EEB10952.1}; RG The Human Body Louse Genome Consortium; RA Kirkness E., Walenz B., Hass B., Bruggner R., Strausberg R.; RT "The genome of the human body louse."; RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases. RN [3] {ECO:0000313|EnsemblMetazoa:PHUM079870-PA} RP IDENTIFICATION. RC STRAIN=USDA {ECO:0000313|EnsemblMetazoa:PHUM079870-PA}; RG EnsemblMetazoa; RL Submitted (FEB-2021) to UniProtKB. CC -!- SUBUNIT: Component of the RNA polymerase II (Pol II) complex consisting CC of 12 subunits. RPB4 and RPB7 form a subcomplex that protrudes from the CC 10-subunit Pol II core complex. {ECO:0000256|ARBA:ARBA00025894}. CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; AAZO01000953; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; DS235047; EEB10952.1; -; Genomic_DNA. DR RefSeq; XP_002423690.1; XM_002423645.1. DR STRING; 121224.E0VC46; -. DR EnsemblMetazoa; PHUM079870-RA; PHUM079870-PA; PHUM079870. DR GeneID; 8231358; -. DR KEGG; phu:Phum_PHUM079870; -. DR CTD; 8231358; -. DR VEuPathDB; VectorBase:PHUM079870; -. DR eggNOG; KOG1084; Eukaryota. DR eggNOG; KOG3298; Eukaryota. DR HOGENOM; CLU_225836_0_0_1; -. DR InParanoid; E0VC46; -. DR OMA; DTKMMEC; -. DR OrthoDB; 5490909at2759; -. DR Proteomes; UP000009046; Unassembled WGS sequence. DR GO; GO:0000428; C:DNA-directed RNA polymerase complex; IEA:UniProtKB-KW. DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell. DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro. DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt. DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:UniProt. DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro. DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro. DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro. DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW. DR CDD; cd15506; PHD1_KMT2A_like; 1. DR CDD; cd15508; PHD3_KMT2A_like; 1. DR CDD; cd15489; PHD_SF; 1. DR CDD; cd04329; RNAP_II_Rpb7_N; 1. DR CDD; cd04462; S1_RNAPII_Rpb7; 1. DR CDD; cd19170; SET_KMT2A_2B; 1. DR Gene3D; 3.30.160.360; -; 1. DR Gene3D; 1.20.920.10; Bromodomain-like; 1. DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1. DR Gene3D; 3.30.1490.120; RNA polymerase Rpb7-like, N-terminal domain; 1. DR Gene3D; 2.170.270.10; SET domain; 1. DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 3. DR InterPro; IPR036427; Bromodomain-like_sf. DR InterPro; IPR034732; EPHD. DR InterPro; IPR003889; FYrich_C. DR InterPro; IPR003888; FYrich_N. DR InterPro; IPR047219; KMT2A_2B_SET. DR InterPro; IPR012340; NA-bd_OB-fold. DR InterPro; IPR003616; Post-SET_dom. DR InterPro; IPR036898; RNA_pol_Rpb7-like_N_sf. DR InterPro; IPR005576; Rpb7-like_N. DR InterPro; IPR003029; S1_domain. DR InterPro; IPR001214; SET_dom. DR InterPro; IPR046341; SET_dom_sf. DR InterPro; IPR011011; Znf_FYVE_PHD. DR InterPro; IPR001628; Znf_hrmn_rcpt. DR InterPro; IPR001965; Znf_PHD. DR InterPro; IPR019787; Znf_PHD-finger. DR InterPro; IPR013083; Znf_RING/FYVE/PHD. DR PANTHER; PTHR45838:SF4; HISTONE-LYSINE N-METHYLTRANSFERASE TRITHORAX; 1. DR PANTHER; PTHR45838; HISTONE-LYSINE-N-METHYLTRANSFERASE 2 KMT2 FAMILY MEMBER; 1. DR Pfam; PF05965; FYRC; 1. DR Pfam; PF05964; FYRN; 1. DR Pfam; PF00628; PHD; 1. DR Pfam; PF00575; S1; 1. DR Pfam; PF00856; SET; 1. DR Pfam; PF03876; SHS2_Rpb7-N; 1. DR Pfam; PF13771; zf-HC5HC2H; 1. DR SMART; SM00542; FYRC; 1. DR SMART; SM00249; PHD; 4. DR SMART; SM00317; SET; 1. DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 3. DR SUPFAM; SSF88798; N-terminal, heterodimerisation domain of RBP7 (RpoE); 1. DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 1. DR SUPFAM; SSF82199; SET domain; 1. DR PROSITE; PS51805; EPHD; 1. DR PROSITE; PS51543; FYRC; 1. DR PROSITE; PS51542; FYRN; 1. DR PROSITE; PS51030; NUCLEAR_REC_DBD_2; 1. DR PROSITE; PS50868; POST_SET; 1. DR PROSITE; PS50280; SET; 1. DR PROSITE; PS50016; ZF_PHD_2; 3. PE 4: Predicted; KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853}; KW DNA-binding {ECO:0000256|ARBA:ARBA00023125}; KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}; KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603, KW ECO:0000313|EMBL:EEB10952.1}; Nucleus {ECO:0000256|ARBA:ARBA00023242}; KW Reference proteome {ECO:0000313|Proteomes:UP000009046}; KW Repeat {ECO:0000256|ARBA:ARBA00022737}; KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691}; KW Transcription {ECO:0000256|ARBA:ARBA00023163}; KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}; KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000313|EMBL:EEB10952.1}; KW Zinc {ECO:0000256|ARBA:ARBA00022833}; KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE- KW ProRule:PRU00146}. FT DOMAIN 563..663 FT /note="Nuclear receptor" FT /evidence="ECO:0000259|PROSITE:PS51030" FT DOMAIN 920..973 FT /note="PHD-type" FT /evidence="ECO:0000259|PROSITE:PS50016" FT DOMAIN 970..1021 FT /note="PHD-type" FT /evidence="ECO:0000259|PROSITE:PS50016" FT DOMAIN 1049..1110 FT /note="PHD-type" FT /evidence="ECO:0000259|PROSITE:PS50016" FT DOMAIN 1484..1592 FT /note="PHD-type" FT /evidence="ECO:0000259|PROSITE:PS51805" FT DOMAIN 3171..3289 FT /note="SET" FT /evidence="ECO:0000259|PROSITE:PS50280" FT DOMAIN 3295..3311 FT /note="Post-SET" FT /evidence="ECO:0000259|PROSITE:PS50868" FT REGION 803..826 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1296..1317 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2011..2042 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2898..2917 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2930..2951 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 2012..2042 FT /note="Polar residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT COMPBIAS 2898..2912 FT /note="Basic and acidic residues" FT /evidence="ECO:0000256|SAM:MobiDB-lite" SQ SEQUENCE 3311 AA; 374070 MW; 4FAE8C5D6519026D CRC64; MKISLEHEIL LHPRYFGPQL LETVKQKLYA EVEGTCTGKY GFVIAVTLID NIGAGLILPG QGFVVYPVKY RAIVFRPFKG EVLDAVVTQV NKVGMFAEIG PLSCFISHHS IPADMQFCPN FNPPCYKSKD EDVVIQADDE IRLKIVGTRV DASGIFAIGT LMDDYLGLVG LTSVVAKMAT DRSLADRGIV EIKMGRSKFP GKPTKIGNRK RVNLSSCSLP EDNDSSTAAK NIYLGLSMFN ETFGDNETIT PGPFYGFSSN EIDEAMAEAR LQQKEAEVAH KKQGIDINFW DSFQSELFSV SSFTKHKLLP DNNCEKESKT IVSFDGGGLP FDFPFSHDNS DNEPSRTLNV NVVENNKVHK INKKVRISSK LEKTSSKIKK LRAPRFKFNK NKLQCSPIAN RSLQSQAAKK LLNKAKMTNL LEINKNLPAD KKFILPSRST HSSRVIKPNK RFFEDNIIKK KYLNSQSCLN ATVNNKQNLL EESRKKFETR KLHWENKSSE NLNLSESSNS SSSKFILREA RLQLDKDKTQ TSVLEGPFSS PNLSSLCSNT QLKINLMQST PGNKECAVCG IVIFCKFVEK PKEFGIFCCE NCQKFIWKII KSLESKKDFK LQCLKGNGLC EIVNIMKAQH FNVIESSYST KCQACWLKLC LKNFKMPNKL KQNLLKVLPV NMQINLSLDY GPTSSDTSSS KFRFSAKNMK HFKWFNENNS QEIRAVNKNN IIDLPWSNIS ESNILNKKVL RKRTKNGENT VQSNIFLGQN EKIKSVNDKN RQSIVLKGPR VKHVCRSASV ALGQPLATFP TDLGSKKKKT NESSEDCNKC TTSPSPESNG VECQEVEFFS ENVVDSKEEG KEKKLNTFKE TTCNLKEVQV SRLNNGVKSK KKTNIKIENP VSMDFWESYD PDEICETGFC LIGSEDFSVR ALCFLCGSSG QEKLIHCASC CEPYHEFCID EAQLKLQNNT WKFDWVCPRC TVCFTCGKTS GQQLKCVKCD NSYHIECVDR VGGRLLHSPD RPWVCSICLR CKSCNGVDVS VFVGNLPLCR ACFVLRQKGN FCPLCQRCYN DDDYDSKMME CGQCKCWVHA KCEGLSDEKY QVLSFLPESV EYVCRMCCVM PPAPWWIAVE AELKSGYLGV LKALSKNRKA CTMLKWSPRK QCTCRPVEYT SRAFDLDVLN RTSKMLDVEL EEISNSEKEI NEKKKIHEEK EMIEKYQPAI LSERKTSIEV INHPTESLKL TLKIMKNCDT ETENITESTT KDKSVKDIFD DEKLQENSER VKNLIANQKF SDNSQSVNRK ISPFEYNKVN SSNSGNTLDS DAGNSAKDRS SIYDQNCFQL SSLETNKANG DISVNLLEGT RECVCFLDVE SYNSRSKPFS PNLMSIKKKV TTSEYSSLHQ FHQDMERMIA TSHSTELMDL YHQTIREIFP WFDPKFSKVQ NSSQKVSSIK SAESTPVKST PKKIIPEEEC KIPKYKLYDQ PLDYFYSDYT VQDTRICVMC KVVGDGLPGH TGRLLYCGQN EWVHCNCALW SGEVFEEIDG SLQNVHSAIS RSRSIKCPEC NLKGASIGCC ARSCQETYHF SCAKKLGCAF MDDKTMYCLA HLKDVNNKLV QNEKEFEIRR PVYVELDKRK MKTADSCQIR FVVGSLHVNN LGKFAGKLSD ETKSIIPDQF SCTRLFWSCF EPWRIVRYHF SVRLVEAENT QGVDTGINLT VDHSKDPELV ELKLKQLKYF QENNSFLEIN EKLVENQLGE KRKLVELKAK CDKIKISDSN SPKKDKKNQK VVRKILDQYD FKDDENCLSD NQNTADLLPP DVEEAIFKDL PHDILDGISM QDIFMDIKNY EYNSKENSEE LYEGDDNLFN SLNKIHKKNE NIQNESIWTE SIDCKLEVIG ETSLVSKSRN VKRNRTDDSL LLPSKKSTTL KTYQKSTNLN RNYKCENFNN KWKKHSKCHV TVNLLEPQER GNMLQELKFT DGLVSITTGA IGSMKDLKSR LEDNKRSIQE EGKENKFLAW QTRLQPRLLQ VDGGVDSSSS GSEAGESPQR MTEESVTSKL VDSSSQILDP LYSIKSSNVP EFQNFSTSEN SHHSNKSSVS FSKGNTDSNS FILPQVDGVE DCNSDVSDSE IQNCKKYSLK EIQNFNKVMV RKRDLINVFD LNYKIVQCDG GVDSECEEET FKNCNLSDGS VYPLVAKVEE EPVKCGKCRC TYRTKISYQR HLESCTSDFI LSSSEAEMSD EESPKRHFNN ISLKQKSDNI NALAIPAKTT NKEHSKKNIS YNSHLANDSL QISSHLNPIN FEKPEVKKEK VKKTYTRRKV PTATKKNSPV NNKQQVQQIL LQQQSAAAAP ALIVQDVTGS NLMSTAYIDS TPTLGYLAGI ESQHFLNKTQ LISTPGGVIS GNFHVPSPSE SQVLGGLCLN TQNVQNNFPG FIIQQQPNIQ GPVLSSQNSP VILSNEQLMV GSTDSYNVFQ DPRTGGMFLT SRNAPVFYNM ETIVSNTVMQ TNQFVSNVLG TTFSQTSTQI FQASEMEQIV NVPSNYILVN SNHAGANRLF PSNDLLCVQT QPTFIQNQTP IQLGSSNQTV QPSQIFNPSI PTAQILNFDP KTLPIQNNSF RCLQPVCPPI ILGKVSNGLN NNVVNSTVPQ VLQKTPASTP TIVNPVVTSK SVPTIKPQQL NGFSSSQMTK LPVKIPEKPA IPLKSNPQSQ IQSRSRALVD YINNFRSEAN ETLLKSAQVQ NKTKNNICDK SSSQSYNLNS SNKSINLNTC ISTQQKVMQN QNNFHKSSSL STELVNPSCK TEVKSTNNLK IPKSASSELL VTKAEIQSPA SLKSEKIILP AKPLAANEPI PSVKNFKIKN EKITSCTKTE ESCPRRESSL ISTSEETEKS PCAGVKIFQN LPPSNFEMQE KEDGSTVPDS KNIEISMHED SRPVKDEYPK QDNNTNHVLP LQENCQKINI SPKNNNVKSS PKGHSKDGEN LFLKQEENYK IKRKINFLDQ NSFDSKSLGI NDKFESPPDK DASEVVADEN ISFSKKSIEE KGPKLIFDVI SDDGFKKSSS SMAELWANIC DSVQEMRAML KIPSLPLTKK DYEILGMQNH GLQHCLEQLK GVENCVNYKF KYFTRDDSRR DSEDEVKENP SGCARTEKFT NRKEYDMFAW LASKHRKLPK LTESAEETLC SIRRANNLPM AMKYRNLKET SKLYVGVYRS QIHGRGLFCL REIEAGEMVI EYAGEVIRAN LSDKREKYYT EKGIGCYMFR IDDHFVVDAT MKGNAARFIN HSCEPNCYSR VVDILGKKHI VIFALRKINI MEELTYDYKF PFEDEKISCH CLSKKCKKYL N //