P0C6U5 (R1A_CVHN5) Reviewed, UniProtKB/Swiss-Prot
Last modified December 11, 2013. Version 38.
Names and origin
|Protein names||Recommended name:|
Replicase polyprotein 1a
Cleaved into the following 11 chains:
|Organism||Human coronavirus HKU1 (isolate N5) (HCoV-HKU1) [Complete proteome]|
|Taxonomic identifier||443241 [NCBI]|
|Taxonomic lineage||Viruses › ssRNA positive-strand viruses, no DNA stage › Nidovirales › Coronaviridae › Coronavirinae › Betacoronavirus ›|
|Virus host||Homo sapiens (Human) [TaxID: 9606]|
|Sequence length||4421 AA.|
|Sequence processing||The displayed sequence is further processed into a mature form.|
|Protein existence||Inferred from homology|
General annotation (Comments)
The papain-like proteinase 1 (PL1-PRO) and papain-like proteinase 2 (PL2-PRO) are responsible for the cleavages located at the N-terminus of the replicase polyprotein. In addition, PL2-PRO possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. It antagonizes innate immune induction of type I interferon by blocking the phosphorylation, dimerization and subsequent nuclear translocation of host IRF-3 (By similarity).
The main proteinase 3CL-PRO is responsible for the majority of cleavages, as it cleaves the C-terminal part of the replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Inhibited by the substrate analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function (By similarity).
The nsp7-nsp8 hexadecamer may confer processivity to the polymerase, possibly by binding to dsRNA or by producing primers utilized by the latter (By similarity).
Nsp9 is a ssRNA-binding protein (By similarity).
Non-structural protein 1: binds to the 40S ribosomal subunit and inhibits host translation. The nsp1-40S ribosome complex further induces an endonucleolytic cleavage near the 5'UTR of host mRNAs, targeting them for degradation. By suppressing host gene expression, nsp1 facilitates efficient viral gene expression in infected cells and evasion of the host immune response (By similarity).
TSAVLQ-|-SGFRK-NH2 and SGVTFQ-|-GKFKK, the two peptides corresponding to the two self-cleavage sites of the SARS 3C-like proteinase, are the two most reactive peptide substrates. The enzyme exhibits a strong preference for substrates containing Gln at the P1 position and Leu at the P2 position.
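The core recognition sequence given above for 3CL-PRO, [ILMVF]-Q-|-[SGACN], can be expressed as a simple pattern scan. The following is a minimal Python sketch and not part of the database record: the function name `find_candidate_3cl_sites` is hypothetical, and a match only marks a candidate position, since the motif alone does not establish that cleavage actually occurs there. The two example peptides are the ones quoted in this entry, written without the -NH2 terminal group.

```python
import re

# Core 3CL-PRO recognition sequence from this entry: [ILMVF]-Q-|-[SGACN].
# The lookahead keeps the P1' residue out of the match, so match.end()
# equals the 1-based position of the P1 Gln.
CORE_MOTIF = re.compile(r"[ILMVF]Q(?=[SGACN])")


def find_candidate_3cl_sites(sequence):
    """Return 1-based (P1, P1') position pairs for every motif match.

    Hypothetical helper for illustration only.
    """
    sites = []
    for match in CORE_MOTIF.finditer(sequence.upper()):
        p1 = match.end()           # position of the Gln before the scissile bond
        sites.append((p1, p1 + 1))  # the bond lies between P1 and P1'
    return sites


for peptide in ("TSAVLQSGFRK", "SGVTFQGKFKK"):
    print(peptide, find_candidate_3cl_sites(peptide))
```

In both peptides the scan reports a single candidate bond between residues 6 and 7, which is where the `-|-` marker sits in the notation above.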
Thiol-dependent hydrolysis of ester, thioester, amide, peptide and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-residue protein attached to proteins as an intracellular targeting signal).
3CL-PRO exists as a monomer and as a homodimer. Eight copies of nsp7 and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is a dimer. Nsp10 forms a dodecamer (By similarity).
Non-structural protein 7: Host cytoplasm › host perinuclear region (By similarity). Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes.
Non-structural protein 8: Host cytoplasm › host perinuclear region (By similarity). Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes.
Non-structural protein 9: Host cytoplasm › host perinuclear region (By similarity). Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes.
Non-structural protein 10: Host cytoplasm › host perinuclear region (By similarity). Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes.
The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane.
Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. 3CL-PRO and PL-PRO proteinases are autocatalytically processed (By similarity).
Isolate N5 belongs to genotype C. Genotype C probably arose from recombination between genotypes A and B.
Belongs to the coronaviruses polyprotein 1ab family.
Contains 1 Macro domain.
Contains 2 peptidase C16 domains.
Contains 1 peptidase C30 domain.
|This entry describes 2 isoforms produced by ribosomal frameshifting.|
|Isoform Replicase polyprotein 1a (identifier: P0C6U5-1) |
Also known as: pp1a; ORF1a polyprotein;
This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
|Note: Produced by conventional translation.|
|Isoform Replicase polyprotein 1ab (identifier: P0C6X4-1) |
Also known as: pp1ab;
The sequence of this isoform can be found in the external entry P0C6X4.
Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.
|Note: Produced by -1 ribosomal frameshifting at the 1a-1b genes boundary.|
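The difference between conventional translation and a -1 ribosomal frameshift can be illustrated with a toy reading-frame example. This is a minimal Python sketch under stated assumptions: the RNA string and the slip position are invented for illustration and are not taken from the HKU1 genome, and the function names are hypothetical. The sketch only shows how the codon boundaries move; it does not translate codons into amino acids.

```python
def split_codons(rna, start=0):
    """Split rna into complete triplets beginning at index start."""
    return [rna[i:i + 3] for i in range(start, len(rna) - 2, 3)]


def read_with_minus1_frameshift(rna, slip_after):
    """Read codons in frame 0 up to slip_after, then shift back one base.

    slip_after is the 0-based index just past the last codon read in the
    original frame (hypothetical parameter, must be a positive multiple
    of 3). After the slip, the last nucleotide already read is read again
    as the first base of the next codon.
    """
    if slip_after < 3 or slip_after % 3 != 0:
        raise ValueError("slip_after must be a positive multiple of 3")
    upstream = split_codons(rna[:slip_after])
    downstream = split_codons(rna, start=slip_after - 1)
    return upstream + downstream


toy_rna = "AUGGCUUUAAACGGGUAA"  # invented sequence, for illustration only

print(split_codons(toy_rna))                     # conventional reading
print(read_with_minus1_frameshift(toy_rna, 12))  # reading with a -1 slip
```

With conventional reading the toy sequence ends at the in-frame UAA stop codon, whereas after the -1 slip the downstream codons are drawn from a different frame and that stop codon is no longer read as a codon.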
Sequence annotation (Features)
|Feature key||Position(s)||Length||Description||Feature identifier|
|Chain||1 – 4421||4421||Replicase polyprotein 1a||PRO_0000338218|
|Chain||1 – 222||222||Non-structural protein 1 By similarity||PRO_0000338219|
|Chain||223 – 809||587||Non-structural protein 2 By similarity||PRO_0000338220|
|Chain||810 – 2788||1979||Non-structural protein 3 By similarity||PRO_0000338221|
|Chain||2789 – 3284||496||Non-structural protein 4 By similarity||PRO_0000338222|
|Chain||3285 – 3587||303||3C-like proteinase By similarity||PRO_0000338223|
|Chain||3588 – 3874||287||Non-structural protein 6 By similarity||PRO_0000338224|
|Chain||3875 – 3966||92||Non-structural protein 7 By similarity||PRO_0000338225|
|Chain||3967 – 4160||194||Non-structural protein 8 By similarity||PRO_0000338226|
|Chain||4161 – 4270||110||Non-structural protein 9 By similarity||PRO_0000338227|
|Chain||4271 – 4407||137||Non-structural protein 10 By similarity||PRO_0000338228|
|Chain||4408 – 4421||14||Non-structural protein 11 Potential||PRO_0000338229|
|Transmembrane||2176 – 2196||21||Helical; Potential|
|Transmembrane||2237 – 2257||21||Helical; Potential|
|Transmembrane||2268 – 2288||21||Helical; Potential|
|Transmembrane||2351 – 2371||21||Helical; Potential|
|Transmembrane||2393 – 2413||21||Helical; Potential|
|Transmembrane||2794 – 2814||21||Helical; Potential|
|Transmembrane||3069 – 3089||21||Helical; Potential|
|Transmembrane||3101 – 3121||21||Helical; Potential|
|Transmembrane||3128 – 3148||21||Helical; Potential|
|Transmembrane||3153 – 3173||21||Helical; Potential|
|Transmembrane||3601 – 3621||21||Helical; Potential|
|Transmembrane||3626 – 3646||21||Helical; Potential|
|Transmembrane||3651 – 3671||21||Helical; Potential|
|Transmembrane||3694 – 3714||21||Helical; Potential|
|Transmembrane||3722 – 3742||21||Helical; Potential|
|Transmembrane||3750 – 3770||21||Helical; Potential|
|Transmembrane||3793 – 3813||21||Helical; Potential|
|Repeat||945 – 954||10||1|
|Repeat||955 – 964||10||2|
|Repeat||965 – 974||10||3|
|Repeat||975 – 984||10||4|
|Repeat||985 – 994||10||5|
|Repeat||995 – 1004||10||6|
|Repeat||1005 – 1014||10||7|
|Repeat||1015 – 1024||10||8|
|Repeat||1025 – 1034||10||9|
|Domain||1073 – 1323||251||Peptidase C16 1|
|Domain||1301 – 1472||172||Macro|
|Domain||1668 – 1928||261||Peptidase C16 2|
|Domain||3285 – 3587||303||Peptidase C30|
|Zinc finger||1188 – 1216||29||C4-type 1 By similarity|
|Zinc finger||1785 – 1821||37||C4-type 2 By similarity|
|Zinc finger||4344 – 4360||17||By similarity|
|Zinc finger||4386 – 4399||14||By similarity|
|Region||945 – 1034||90||9 X 10 AA tandem repeat of N-[DN]-D-E-D-V-V-T-G-D|
|Region||2176 – 2413||238||HD1 By similarity|
|Region||2794 – 3173||380||HD2 By similarity|
|Region||3601 – 3813||213||HD3 By similarity|
|Compositional bias||934 – 1066||133||Asp-rich|
|Compositional bias||2179 – 2271||93||Phe-rich|
|Compositional bias||2566 – 2569||4||Poly-Val|
|Active site||1111||1||For PL1-PRO activity By similarity|
|Active site||1262||1||For PL1-PRO activity By similarity|
|Active site||1707||1||For PL2-PRO activity By similarity|
|Active site||1864||1||For PL2-PRO activity By similarity|
|Active site||3325||1||For 3CL-PRO activity By similarity|
|Active site||3429||1||For 3CL-PRO activity By similarity|
|Site||222 – 223||2||Cleavage; by PL1-PRO By similarity|
|Site||809 – 810||2||Cleavage; by PL1-PRO By similarity|
|Site||2788 – 2789||2||Cleavage; by PL2-PRO By similarity|
|Site||3284 – 3285||2||Cleavage; by 3CL-PRO By similarity|
|Site||3587 – 3588||2||Cleavage; by 3CL-PRO By similarity|
|Site||3874 – 3875||2||Cleavage; by 3CL-PRO By similarity|
|Site||3966 – 3967||2||Cleavage; by 3CL-PRO By similarity|
|Site||4160 – 4161||2||Cleavage; by 3CL-PRO By similarity|
|Site||4270 – 4271||2||Cleavage; by 3CL-PRO By similarity|
|Site||4407 – 4408||2||Cleavage; by 3CL-PRO By similarity|
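The Chain and Site rows above are related by simple arithmetic: each length is end minus start plus one, the mature chains tile the polyprotein without gaps, and every cleavage site spans the last residue of one chain and the first residue of the next. The following is a minimal Python sketch of that consistency check. The coordinates are copied from the table; the variable and function names are hypothetical.

```python
# (name, start, end) for the 11 mature chains, copied from the Chain rows.
CHAINS = [
    ("Non-structural protein 1", 1, 222),
    ("Non-structural protein 2", 223, 809),
    ("Non-structural protein 3", 810, 2788),
    ("Non-structural protein 4", 2789, 3284),
    ("3C-like proteinase", 3285, 3587),
    ("Non-structural protein 6", 3588, 3874),
    ("Non-structural protein 7", 3875, 3966),
    ("Non-structural protein 8", 3967, 4160),
    ("Non-structural protein 9", 4161, 4270),
    ("Non-structural protein 10", 4271, 4407),
    ("Non-structural protein 11", 4408, 4421),
]

POLYPROTEIN_LENGTH = 4421  # from the "Sequence length" field


def chain_length(start, end):
    """Length of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def cleavage_sites(chains):
    """Return (last residue, first residue) pairs between adjacent chains."""
    return [
        (end, next_start)
        for (_, _, end), (_, next_start, _) in zip(chains, chains[1:])
    ]


total = sum(chain_length(start, end) for _, start, end in CHAINS)
print("chains cover", total, "of", POLYPROTEIN_LENGTH, "residues")

for left, right in cleavage_sites(CHAINS):
    # Adjacent chains must touch, otherwise the tiling has a gap or overlap.
    assert right == left + 1
    print(f"cleavage between {left} and {right}")
```

The derived pairs correspond one to one with the ten Site rows in the table, from 222 – 223 through 4407 – 4408.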
"Comparative analysis of 22 coronavirus HKU1 genomes reveals a novel genotype and evidence of natural recombination in coronavirus HKU1."
Woo P.C.Y., Lau S.K.P., Yip C.C.Y., Huang Y., Tsoi H.-W., Chan K.-H., Yuen K.-Y.
J. Virol. 80:7136-7145(2006)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
|DQ339101 Genomic RNA. No translation available.|
3D structure databases
|SMR||P0C6U5. Positions 4002-4152, 4276-4400. |
Family and domain databases
|InterPro||IPR022570. Coronavirus_NSP1. |
|Pfam||PF11963. DUF3477. 1 hit. |
PF01661. Macro. 1 hit.
PF09401. NSP10. 1 hit.
PF08716. nsp7. 1 hit.
PF08717. nsp8. 1 hit.
PF08710. nsp9. 1 hit.
PF01831. Peptidase_C16. 1 hit.
PF05409. Peptidase_C30. 1 hit.
PF08715. Viral_protease. 1 hit.
|SMART||SM00506. A1pp. 1 hit. |
|SUPFAM||SSF101816. SSF101816. 1 hit. |
SSF144246. SSF144246. 1 hit.
SSF50494. SSF50494. 1 hit.
|PROSITE||PS51442. M_PRO. 1 hit. |
PS51154. MACRO. 1 hit.
PS51124. PEPTIDASE_C16. 2 hits.
|Accession||Primary (citable) accession number: P0C6U5|
Secondary accession number(s): Q0ZME9
|Entry status||Reviewed (UniProtKB/Swiss-Prot)|
|Annotation program||Viral Protein Annotation Program|