P0C6W3 (R1AB_BCHK4) Reviewed, UniProtKB/Swiss-Prot
Last modified October 16, 2013. Version 49. History...
Names and origin
|Protein names||Recommended name:|
Replicase polyprotein 1ab
Cleaved into the following 15 chains:
|Organism||Bat coronavirus HKU4 (BtCoV) (BtCoV/HKU4/2004) [Complete proteome]|
|Taxonomic identifier||694007 [NCBI]|
|Taxonomic lineage||Viruses › ssRNA positive-strand viruses, no DNA stage › Nidovirales › Coronaviridae › Coronavirinae › Betacoronavirus|
|Virus host||Tylonycteris pachypus (Lesser bamboo bat) [TaxID: 258959]|
|Sequence length||7119 AA.|
|Sequence processing||The displayed sequence is further processed into a mature form.|
|Protein existence||Inferred from homology|
General annotation (Comments)
The replicase polyprotein of coronaviruses is a multifunctional protein: it contains the activities necessary for the transcription of negative stranded RNA, leader RNA, subgenomic mRNAs and progeny virion RNA as well as proteinases responsible for the cleavage of the polyprotein into functional products By similarity.
The papain-like proteinase (PL-PRO) is responsible for the cleavages located at the N-terminus of replicase polyprotein. In addition, PL-PRO possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. Antagonizes innate immune induction of type I interferon by blocking the phosphorylation, dimerization and subsequent nuclear translocation of host IRF-3 By similarity.
The main proteinase 3CL-PRO is responsible for the majority of cleavages as it cleaves the C-terminus of replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function By similarity.
The helicase which contains a zinc finger structure displays RNA and DNA duplex-unwinding activities with 5' to 3' polarity. Its ATPase activity is strongly stimulated by poly(U), poly(dT), poly(C), poly(dA), but not by poly(G). Activity of helicase is dependent on magnesium By similarity.
The exoribonuclease acts on both ssRNA and dsRNA in a 3' to 5' direction By similarity.
Nsp7-nsp8 hexadecamer may possibly confer processivity to the polymerase, maybe by binding to dsRNA or by producing primers utilized by the latter By similarity.
Nsp9 is a ssRNA-binding protein By similarity.
NendoU is a Mn2+-dependent, uridylate-specific enzyme, which leaves 2'-3'-cyclic phosphates 5' to the cleaved bond By similarity.
Non-structural protein 1: binds to the 40S ribosomal subunit and inhibits host translation. The nsp1-40S ribosome complex further induces an endonucleolytic cleavage near the 5'UTR of host mRNAs, targeting them for degradation. By suppressing host gene expression, nsp1 facilitates efficient viral gene expression in infected cells and evasion from host immune response By similarity.
Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1).
ATP + H2O = ADP + phosphate.
TSAVLQ-|-SGFRK-NH2 and SGVTFQ-|-GKFKK the two peptides corresponding to the two self-cleavage sites of the SARS 3C-like proteinase are the two most reactive peptide substrates. The enzyme exhibits a strong preference for substrates containing Gln at P1 position and Leu at P2 position.
Thiol-dependent hydrolysis of ester, thioester, amide, peptide and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-residue protein attached to proteins as an intracellular targeting signal).
3CL-PRO exists as monomer and homodimer. Eight copies of nsp7 and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is a dimer. Nsp10 forms a dodecamer By similarity.
Non-structural protein 7: Host cytoplasm › host perinuclear region By similarity. Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes By similarity.
Non-structural protein 8: Host cytoplasm › host perinuclear region By similarity. Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes By similarity.
Non-structural protein 9: Host cytoplasm › host perinuclear region By similarity. Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes By similarity.
Non-structural protein 10: Host cytoplasm › host perinuclear region By similarity. Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes By similarity.
Helicase: Host endoplasmic reticulum-Golgi intermediate compartment Potential. Note: The helicase interacts with the N protein in membranous complexes and colocalizes with sites of synthesis of new viral RNA By similarity.
The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane By similarity.
Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. 3CL-PRO and PL-PRO proteinases are autocatalytically processed By similarity.
Belongs to the coronaviruses polyprotein 1ab family.
Contains 1 (+)RNA virus helicase ATP-binding domain.
Contains 1 (+)RNA virus helicase C-terminal domain.
Contains 1 CV MBD (coronavirus metal-binding) domain.
Contains 1 Macro domain.
Contains 1 peptidase C16 domain.
Contains 1 peptidase C30 domain.
Contains 1 RdRp catalytic domain.
|This entry describes 2 isoforms produced by ribosomal frameshifting. [Align] [Select]|
|Isoform Replicase polyprotein 1ab (identifier: P0C6W3-1) |
Also known as: pp1ab;
This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
|Note: Produced by -1 ribosomal frameshifting at the 1a-1b genes boundary.|
|Isoform Replicase polyprotein 1a (identifier: P0C6T4-1) |
Also known as: pp1a; ORF1a polyprotein;
The sequence of this isoform can be found in the external entry P0C6T4.
Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.
|Note: Produced by conventional translation.|
Sequence annotation (Features)
|Feature key||Position(s)||Length||Description||Graphical view||Feature identifier|
|Chain||1 – 195||195||Non-structural protein 1 Potential||PRO_0000290288|
|Chain||196 – 847||652||Non-structural protein 2 Potential||PRO_0000290289|
|Chain||848 – 2784||1937||Non-structural protein 3 Potential||PRO_0000290290|
|Chain||2785 – 3291||507||Non-structural protein 4 Potential||PRO_0000290291|
|Chain||3292 – 3597||306||3C-like proteinase Potential||PRO_0000290292|
|Chain||3598 – 3889||292||Non-structural protein 6 Potential||PRO_0000290293|
|Chain||3890 – 3972||83||Non-structural protein 7 Potential||PRO_0000290294|
|Chain||3973 – 4171||199||Non-structural protein 8 Potential||PRO_0000290295|
|Chain||4172 – 4281||110||Non-structural protein 9 Potential||PRO_0000290296|
|Chain||4282 – 4420||139||Non-structural protein 10 Potential||PRO_0000290297|
|Chain||4421 – 5354||934||RNA-directed RNA polymerase Potential||PRO_0000290298|
|Chain||5355 – 5952||598||Helicase Potential||PRO_0000290299|
|Chain||5953 – 6475||523||Exoribonuclease Potential||PRO_0000290300|
|Chain||6476 – 6817||342||Uridylate-specific endoribonuclease Potential||PRO_0000290301|
|Chain||6818 – 7119||302||Putative 2'-O-methyl transferase Potential||PRO_0000290302|
|Transmembrane||2112 – 2132||21||Helical; Potential|
|Transmembrane||2145 – 2165||21||Helical; Potential|
|Transmembrane||2222 – 2242||21||Helical; Potential|
|Transmembrane||2326 – 2346||21||Helical; Potential|
|Transmembrane||2350 – 2370||21||Helical; Potential|
|Transmembrane||2375 – 2395||21||Helical; Potential|
|Transmembrane||2800 – 2820||21||Helical; Potential|
|Transmembrane||3072 – 3092||21||Helical; Potential|
|Transmembrane||3105 – 3125||21||Helical; Potential|
|Transmembrane||3149 – 3169||21||Helical; Potential|
|Transmembrane||3603 – 3623||21||Helical; Potential|
|Transmembrane||3637 – 3657||21||Helical; Potential|
|Transmembrane||3662 – 3682||21||Helical; Potential|
|Transmembrane||3707 – 3727||21||Helical; Potential|
|Transmembrane||3735 – 3755||21||Helical; Potential|
|Transmembrane||3784 – 3804||21||Helical; Potential|
|Transmembrane||3808 – 3828||21||Helical; Potential|
|Domain||1152 – 1321||170||Macro|
|Domain||1593 – 1864||272||Peptidase C16|
|Domain||3292 – 3597||306||Peptidase C30|
|Domain||5034 – 5196||163||RdRp catalytic|
|Domain||5355 – 5438||84||CV MBD|
|Domain||5611 – 5792||182||(+)RNA virus helicase ATP-binding|
|Domain||5793 – 5967||175||(+)RNA virus helicase C-terminal|
|Zinc finger||1714 – 1751||38||C4-type By similarity|
|Zinc finger||4355 – 4371||17||By similarity|
|Zinc finger||4397 – 4410||14||By similarity|
|Nucleotide binding||5636 – 5643||8||ATP By similarity|
|Region||2112 – 2395||284||HD1 By similarity|
|Region||2800 – 3169||370||HD2 By similarity|
|Region||3603 – 3828||226||HD3 By similarity|
|Compositional bias||960 – 1049||90||Glu-rich|
|Compositional bias||4161 – 4166||6||Poly-Ser|
|Compositional bias||5359 – 5384||26||Cys-rich|
|Active site||1634||1||For PL-PRO activity By similarity|
|Active site||1800||1||For PL-PRO activity By similarity|
|Active site||3332||1||For 3CL-PRO activity By similarity|
|Active site||3439||1||For 3CL-PRO activity By similarity|
|Site||195 – 196||2||Cleavage Potential|
|Site||847 – 848||2||Cleavage; by PL-PRO Potential|
|Site||2784 – 2785||2||Cleavage; by PL-PRO Potential|
|Site||3291 – 3292||2||Cleavage; by 3CL-PRO Potential|
|Site||3597 – 3598||2||Cleavage; by 3CL-PRO Potential|
|Site||3889 – 3890||2||Cleavage; by 3CL-PRO Potential|
|Site||3972 – 3973||2||Cleavage; by 3CL-PRO Potential|
|Site||4171 – 4172||2||Cleavage; by 3CL-PRO Potential|
|Site||4281 – 4282||2||Cleavage; by 3CL-PRO Potential|
|Site||4420 – 4421||2||Cleavage; by 3CL-PRO Potential|
|Site||5354 – 5355||2||Cleavage; by 3CL-PRO Potential|
|Site||5952 – 5953||2||Cleavage; by 3CL-PRO Potential|
|Site||6475 – 6476||2||Cleavage; by 3CL-PRO Potential|
|Site||6817 – 6818||2||Cleavage; by 3CL-PRO Potential|
|||"Comparative analysis of twelve genomes of three novel group 2c and group 2d coronaviruses reveals unique group and subgroup features."|
Woo P.C.Y., Wang M., Lau S.K.P., Xu H.F., Poon R.W.S., Guo R., Wong B.H.L., Gao K., Tsoi H.-W., Huang Y., Li K.S.M., Lam C.S.F., Chan K.-H., Zheng B.-J., Yuen K.-Y.
J. Virol. 81:1574-1585(2007) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
Strain: Isolate HKU4-1.
|EF065505 Genomic RNA. Translation: ABN10838.1.|
|RefSeq||YP_001039952.1. NC_009019.1. |
3D structure databases
|SMR||P0C6W3. Positions 3889-3972, 4221-4281, 4291-4411. |
Protocols and materials databases
Genome annotation databases
Family and domain databases
|InterPro||IPR027351. (+)RNA_virus_helicase_core_dom. |
|Pfam||PF06478. Corona_RPol_N. 1 hit. |
PF01661. Macro. 1 hit.
PF09401. NSP10. 1 hit.
PF06471. NSP11. 1 hit.
PF06460. NSP13. 1 hit.
PF08716. nsp7. 1 hit.
PF08717. nsp8. 1 hit.
PF08710. nsp9. 1 hit.
PF05409. Peptidase_C30. 1 hit.
PF00680. RdRP_1. 1 hit.
PF11633. SUD-M. 1 hit.
PF08715. Viral_protease. 1 hit.
|SMART||SM00506. A1pp. 1 hit. |
|SUPFAM||SSF101816. SSF101816. 1 hit. |
SSF144246. SSF144246. 1 hit.
SSF50494. SSF50494. 1 hit.
SSF52540. SSF52540. 1 hit.
|PROSITE||PS51653. CV_MBD. 1 hit. |
PS51442. M_PRO. 1 hit.
PS51154. MACRO. 1 hit.
PS51124. PEPTIDASE_C16. 1 hit.
PS51657. PSRV_HELICASE. 1 hit.
PS50507. RDRP_SSRNA_POS. 1 hit.
|Accession||Primary (citable) accession number: P0C6W3|
Secondary accession number(s): A3EX93
|Entry status||Reviewed (UniProtKB/Swiss-Prot)|
|Annotation program||Viral Protein Annotation Program|