P0C6T4 (R1A_BCHK4) Reviewed, UniProtKB/Swiss-Prot
Last modified October 16, 2013. Version 39.
Names and origin
|Protein names||Recommended name:|
Replicase polyprotein 1a
Cleaved into the following 11 chains:
|Organism||Bat coronavirus HKU4 (BtCoV) (BtCoV/HKU4/2004) [Complete proteome]|
|Taxonomic identifier||694007 [NCBI]|
|Taxonomic lineage||Viruses › ssRNA positive-strand viruses, no DNA stage › Nidovirales › Coronaviridae › Coronavirinae › Betacoronavirus|
|Virus host||Tylonycteris pachypus (Lesser bamboo bat) [TaxID: 258959]|
|Sequence length||4434 AA.|
|Sequence processing||The displayed sequence is further processed into a mature form.|
|Protein existence||Inferred from homology|
General annotation (Comments)
The papain-like proteinase (PL-PRO) is responsible for the cleavages located at the N-terminus of replicase polyprotein. In addition, PL-PRO possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. Antagonizes innate immune induction of type I interferon by blocking the phosphorylation, dimerization and subsequent nuclear translocation of host IRF-3 (By similarity).
The main proteinase 3CL-PRO is responsible for the majority of cleavages as it cleaves the C-terminus of replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function (By similarity).
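The core recognition sequence [ILMVF]-Q-|-[SGACN] can be expressed as a simple pattern scan. The following is a minimal sketch, not a cleavage predictor: the function name `find_candidate_sites` and the toy sequence are hypothetical, and a motif match alone does not establish that a site is actually cleaved.

```python
import re

# Core 3CL-PRO recognition motif from the annotation: [ILMVF]-Q-|-[SGACN].
# The lookahead keeps the P1' residue out of the match, so closely spaced
# candidate sites are all reported.
CORE_MOTIF = re.compile(r"[ILMVF]Q(?=[SGACN])")

def find_candidate_sites(sequence):
    """Return the 1-based position of the P1 Gln for every motif match."""
    # match.end() is the 0-based index just past Q, which equals the
    # 1-based position of Q itself. The putative cut lies after it.
    return [m.end() for m in CORE_MOTIF.finditer(sequence)]

toy = "TSAVLQSGFRKAAVQAGG"  # hypothetical toy sequence, not from this entry
sites = find_candidate_sites(toy)
```

Each returned position `p` marks a candidate bond between residues `p` and `p + 1`, matching the way the Site rows of this entry give cleavage positions as residue pairs.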
The nsp7-nsp8 hexadecamer may confer processivity to the polymerase, possibly by binding to dsRNA or by producing primers utilized by the latter (By similarity).
Nsp9 is an ssRNA-binding protein (By similarity).
Non-structural protein 1: binds to the 40S ribosomal subunit and inhibits host translation. The nsp1-40S ribosome complex further induces an endonucleolytic cleavage near the 5'UTR of host mRNAs, targeting them for degradation. By suppressing host gene expression, nsp1 facilitates efficient viral gene expression in infected cells and evasion from host immune response (By similarity).
TSAVLQ-|-SGFRK-NH2 and SGVTFQ-|-GKFKK, the two peptides corresponding to the two self-cleavage sites of the SARS 3C-like proteinase, are the two most reactive peptide substrates. The enzyme exhibits a strong preference for substrates containing Gln at the P1 position and Leu at the P2 position.
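The P1 and P2 positions count residues backwards from the scissile bond, which the notation above marks with `-|-`. A small sketch of reading those positions off the two peptides follows; the helper names are hypothetical, and the C-terminal `-NH2` modifier is omitted from the strings for simplicity.

```python
def split_at_scissile_bond(peptide, marker="-|-"):
    """Split a peptide written like 'TSAVLQ-|-SGFRK' into its two sides."""
    left, right = peptide.split(marker)
    return left, right

def p_site(peptide, n):
    """Return the residue at position Pn (n >= 1), counting back from the bond."""
    left, _ = split_at_scissile_bond(peptide)
    return left[-n]

# The two substrate peptides quoted in the annotation (without the -NH2 cap).
substrates = ["TSAVLQ-|-SGFRK", "SGVTFQ-|-GKFKK"]
p1_p2 = {s: (p_site(s, 1), p_site(s, 2)) for s in substrates}
```

Both peptides carry Gln at P1. P2 is Leu in the first peptide and Phe in the second, and both residues fall within the [ILMVF] set of the core recognition sequence.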
Thiol-dependent hydrolysis of ester, thioester, amide, peptide and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-residue protein attached to proteins as an intracellular targeting signal).
3CL-PRO exists as a monomer and as a homodimer. Eight copies of nsp7 and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is a dimer. Nsp10 forms a dodecamer (By similarity).
Non-structural protein 7: Host cytoplasm › host perinuclear region (By similarity). Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).
Non-structural protein 8: Host cytoplasm › host perinuclear region (By similarity). Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).
Non-structural protein 9: Host cytoplasm › host perinuclear region (By similarity). Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).
Non-structural protein 10: Host cytoplasm › host perinuclear region (By similarity). Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).
The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane (By similarity).
Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. 3CL-PRO and PL-PRO proteinases are autocatalytically processed (By similarity).
Belongs to the coronaviruses polyprotein 1ab family.
Contains 1 Macro domain.
Contains 1 peptidase C16 domain.
Contains 1 peptidase C30 domain.
|This entry describes 2 isoforms produced by ribosomal frameshifting.|
|Isoform Replicase polyprotein 1a (identifier: P0C6T4-1) |
Also known as: pp1a; ORF1a polyprotein;
This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
|Note: Produced by conventional translation.|
|Isoform Replicase polyprotein 1ab (identifier: P0C6W3-1) |
Also known as: pp1ab;
The sequence of this isoform can be found in the external entry P0C6W3.
Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.
|Note: Produced by -1 ribosomal frameshifting at the 1a-1b genes boundary.|
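As an illustration of what a -1 frameshift does to the reading frame, the sketch below reads a toy RNA in frame 0 up to a chosen point, then steps back one nucleotide and continues. The function names, the toy sequence, and the `shift_at` index are all hypothetical; the actual frameshift position in the HKU4 genome is not given in this entry.

```python
def read_codons(rna, start=0):
    """Split rna into complete codons beginning at index start."""
    return [rna[i:i + 3] for i in range(start, len(rna) - 2, 3)]

def read_with_minus1_shift(rna, shift_at):
    """Read codons in frame 0 up to index shift_at, then slip back one
    nucleotide and continue reading in the -1 frame."""
    if shift_at < 3 or shift_at % 3 != 0:
        raise ValueError("shift_at must be a positive multiple of 3")
    before = read_codons(rna[:shift_at])
    # The nucleotide at shift_at - 1 is read twice: once as the last base
    # of the final frame-0 codon and again as the first base after the slip.
    after = read_codons(rna, start=shift_at - 1)
    return before + after

toy_rna = "AUGAAACCCGGGUUU"  # hypothetical toy sequence
conventional = read_codons(toy_rna)
shifted = read_with_minus1_shift(toy_rna, shift_at=6)
```

Conventional translation corresponds to `conventional` (the 1a-style reading), while `shifted` shows how every codon downstream of the slip changes, which is why the 1ab isoform diverges from 1a after the frameshift point.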
Sequence annotation (Features)
|Feature key||Position(s)||Length||Description||Feature identifier|
|Chain||1 – 4434||4434||Replicase polyprotein 1a||PRO_0000338086|
|Chain||1 – 195||195||Non-structural protein 1 Potential||PRO_0000338087|
|Chain||196 – 847||652||Non-structural protein 2 Potential||PRO_0000338088|
|Chain||848 – 2784||1937||Non-structural protein 3 Potential||PRO_0000338089|
|Chain||2785 – 3291||507||Non-structural protein 4 Potential||PRO_0000338090|
|Chain||3292 – 3597||306||3C-like proteinase Potential||PRO_0000338091|
|Chain||3598 – 3889||292||Non-structural protein 6 Potential||PRO_0000338092|
|Chain||3890 – 3972||83||Non-structural protein 7 Potential||PRO_0000338093|
|Chain||3973 – 4171||199||Non-structural protein 8 Potential||PRO_0000338094|
|Chain||4172 – 4281||110||Non-structural protein 9 Potential||PRO_0000338095|
|Chain||4282 – 4420||139||Non-structural protein 10 Potential||PRO_0000338096|
|Chain||4421 – 4434||14||Non-structural protein 11 Potential||PRO_0000338097|
|Transmembrane||2145 – 2165||21||Helical; Potential|
|Transmembrane||2222 – 2242||21||Helical; Potential|
|Transmembrane||2326 – 2346||21||Helical; Potential|
|Transmembrane||2350 – 2370||21||Helical; Potential|
|Transmembrane||2375 – 2395||21||Helical; Potential|
|Transmembrane||2800 – 2820||21||Helical; Potential|
|Transmembrane||3072 – 3092||21||Helical; Potential|
|Transmembrane||3105 – 3125||21||Helical; Potential|
|Transmembrane||3149 – 3169||21||Helical; Potential|
|Transmembrane||3603 – 3623||21||Helical; Potential|
|Transmembrane||3637 – 3657||21||Helical; Potential|
|Transmembrane||3662 – 3682||21||Helical; Potential|
|Transmembrane||3707 – 3727||21||Helical; Potential|
|Transmembrane||3735 – 3755||21||Helical; Potential|
|Transmembrane||3784 – 3804||21||Helical; Potential|
|Transmembrane||3808 – 3828||21||Helical; Potential|
|Domain||1152 – 1321||170||Macro|
|Domain||1593 – 1864||272||Peptidase C16|
|Domain||3292 – 3597||306||Peptidase C30|
|Zinc finger||1714 – 1751||38||C4-type By similarity|
|Zinc finger||4355 – 4371||17||By similarity|
|Zinc finger||4397 – 4410||14||By similarity|
|Region||2112 – 2395||284||HD1 By similarity|
|Region||2800 – 3169||370||HD2 By similarity|
|Region||3603 – 3828||226||HD3 By similarity|
|Compositional bias||960 – 1049||90||Glu-rich|
|Compositional bias||4161 – 4166||6||Poly-Ser|
|Active site||1634||1||For PL-PRO activity By similarity|
|Active site||1800||1||For PL-PRO activity By similarity|
|Active site||3332||1||For 3CL-PRO activity By similarity|
|Active site||3439||1||For 3CL-PRO activity By similarity|
|Site||195 – 196||2||Cleavage Potential|
|Site||847 – 848||2||Cleavage; by PL-PRO Potential|
|Site||2784 – 2785||2||Cleavage; by PL-PRO Potential|
|Site||3291 – 3292||2||Cleavage; by 3CL-PRO Potential|
|Site||3597 – 3598||2||Cleavage; by 3CL-PRO Potential|
|Site||3889 – 3890||2||Cleavage; by 3CL-PRO Potential|
|Site||3972 – 3973||2||Cleavage; by 3CL-PRO Potential|
|Site||4171 – 4172||2||Cleavage; by 3CL-PRO Potential|
|Site||4281 – 4282||2||Cleavage; by 3CL-PRO Potential|
|Site||4420 – 4421||2||Cleavage; by 3CL-PRO Potential|
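The chain boundaries follow directly from the cleavage sites: each Site row names the last residue of one chain and the first residue of the next. A minimal sketch of that derivation, using the Site positions listed above and 1-based inclusive coordinates, is shown here; the function and variable names are hypothetical.

```python
# Last residue before each cut, taken from the Site rows of this entry.
CLEAVAGE_AFTER = [195, 847, 2784, 3291, 3597, 3889, 3972, 4171, 4281, 4420]
POLYPROTEIN_LENGTH = 4434  # sequence length stated in the entry

def chains_from_cleavage_sites(cut_after, total_length):
    """Return (start, end, length) per chain, with 1-based inclusive
    coordinates, so that length = end - start + 1."""
    starts = [1] + [c + 1 for c in cut_after]
    ends = list(cut_after) + [total_length]
    return [(s, e, e - s + 1) for s, e in zip(starts, ends)]

chains = chains_from_cleavage_sites(CLEAVAGE_AFTER, POLYPROTEIN_LENGTH)
for start, end, length in chains:
    print(f"{start} - {end}\t{length}")
```

Ten cleavage sites yield eleven chains, consistent with the "Cleaved into the following 11 chains" statement, and the chain lengths sum to the full polyprotein length.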
"Comparative analysis of twelve genomes of three novel group 2c and group 2d coronaviruses reveals unique group and subgroup features."
Woo P.C.Y., Wang M., Lau S.K.P., Xu H.F., Poon R.W.S., Guo R., Wong B.H.L., Gao K., Tsoi H.-W., Huang Y., Li K.S.M., Lam C.S.F., Chan K.-H., Zheng B.-J., Yuen K.-Y.
J. Virol. 81:1574-1585(2007)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
Strain: Isolate HKU4-1.
|EF065505 Genomic RNA. No translation available.|
3D structure databases
|SMR||P0C6T4. Positions 3889-3972, 4221-4281, 4291-4411. |
Protocols and materials databases
Family and domain databases
|InterPro||IPR002589. Macro_dom. |
|Pfam||PF01661. Macro. 1 hit. |
PF09401. NSP10. 1 hit.
PF08716. nsp7. 1 hit.
PF08717. nsp8. 1 hit.
PF08710. nsp9. 1 hit.
PF05409. Peptidase_C30. 1 hit.
PF11633. SUD-M. 1 hit.
PF08715. Viral_protease. 1 hit.
|SMART||SM00506. A1pp. 1 hit. |
|SUPFAM||SSF101816. SSF101816. 1 hit. |
SSF144246. SSF144246. 1 hit.
SSF50494. SSF50494. 1 hit.
|PROSITE||PS51442. M_PRO. 1 hit. |
PS51154. MACRO. 1 hit.
PS51124. PEPTIDASE_C16. 1 hit.
|Accession||Primary (citable) accession number: P0C6T4|
Secondary accession number(s): A3EX93
|Entry status||Reviewed (UniProtKB/Swiss-Prot)|
|Annotation program||Viral Protein Annotation Program|