Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Basket 0
(max 400 entries)x

Your basket is currently empty.

Select item(s) and click on "Add to basket" to create your own collection here
(400 entries max)

P0C6X2

- R1AB_CVHN1

UniProt

P0C6X2 - R1AB_CVHN1

Protein

Replicase polyprotein 1ab

Gene

rep

Organism
Human coronavirus HKU1 (isolate N1) (HCoV-HKU1)
Status
Reviewed - Annotation score: 5 out of 5- Protein inferred from homologyi
    • BLAST
    • Align
    • Format
    • Add to basket
    • History
      Entry version 57 (01 Oct 2014)
      Sequence version 1 (10 Jun 2008)
      Previous versions | rss
    • Help video
    • Feedback
    • Comment

    Functioni

    The replicase polyprotein of coronaviruses is a multifunctional protein: it contains the activities necessary for the transcription of negative stranded RNA, leader RNA, subgenomic mRNAs and progeny virion RNA as well as proteinases responsible for the cleavage of the polyprotein into functional products.
    The papain-like proteinase 1 (PL1-PRO) and papain-like proteinase 2 (PL2-PRO) are responsible for the cleavages located at the N-terminus of the replicase polyprotein. In addition, PLP2 possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. Antagonizes innate immune induction of type I interferon by blocking the phosphorylation, dimerization and subsequent nuclear translocation of host IRF-3 By similarity.By similarity
    The main proteinase 3CL-PRO is responsible for the majority of cleavages as it cleaves the C-terminus of replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function By similarity.PROSITE-ProRule annotation
    The helicase which contains a zinc finger structure displays RNA and DNA duplex-unwinding activities with 5' to 3' polarity. ATPase activity is strongly stimulated by poly(U), poly(dT), poly(C), poly(dA), but not by poly(G) By similarity.By similarity
    The exoribonuclease acts on both ssRNA and dsRNA in a 3' to 5' direction.By similarity
    Nsp7-nsp8 hexadecamer may possibly confer processivity to the polymerase, maybe by binding to dsRNA or by producing primers utilized by the latter.By similarity
    Nsp9 is a ssRNA-binding protein.By similarity
    NendoU is a Mn2+-dependent, uridylate-specific enzyme, which leaves 2'-3'-cyclic phosphates 5' to the cleaved bond.By similarity
    Non-structural protein 1: binds to the 40S ribosomal subunit and inhibits host translation. The nsp1-40S ribosome complex further induces an endonucleolytic cleavage near the 5'UTR of host mRNAs, targeting them for degradation. By suppressing host gene expression, nsp1 facilitates efficient viral gene expression in infected cells and evasion from host immune response By similarity.By similarity

    Catalytic activityi

    Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1).
    ATP + H2O = ADP + phosphate.
    TSAVLQ-|-SGFRK-NH2 and SGVTFQ-|-GKFKK the two peptides corresponding to the two self-cleavage sites of the SARS 3C-like proteinase are the two most reactive peptide substrates. The enzyme exhibits a strong preference for substrates containing Gln at P1 position and Leu at P2 position.
    Thiol-dependent hydrolysis of ester, thioester, amide, peptide and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-residue protein attached to proteins as an intracellular targeting signal).

    Sites

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Sitei222 – 2232Cleavage; by PL1-PROBy similarity
    Sitei809 – 8102Cleavage; by PL1-PROBy similarity
    Active sitei1161 – 11611For PL1-PRO activityPROSITE-ProRule annotation
    Active sitei1312 – 13121For PL1-PRO activityPROSITE-ProRule annotation
    Active sitei1757 – 17571For PL2-PRO activityPROSITE-ProRule annotation
    Active sitei1914 – 19141For PL2-PRO activityPROSITE-ProRule annotation
    Sitei2838 – 28392Cleavage; by PL2-PROBy similarity
    Sitei3334 – 33352Cleavage; by 3CL-PROBy similarity
    Active sitei3375 – 33751For 3CL-PRO activityPROSITE-ProRule annotation
    Active sitei3479 – 34791For 3CL-PRO activityPROSITE-ProRule annotation
    Sitei3637 – 36382Cleavage; by 3CL-PROBy similarity
    Sitei3924 – 39252Cleavage; by 3CL-PROBy similarity
    Sitei4016 – 40172Cleavage; by 3CL-PROBy similarity
    Sitei4210 – 42112Cleavage; by 3CL-PROBy similarity
    Sitei4320 – 43212Cleavage; by 3CL-PROBy similarity
    Sitei4457 – 44582Cleavage; by 3CL-PROBy similarity
    Sitei5385 – 53862Cleavage; by 3CL-PROBy similarity
    Sitei6509 – 65102Cleavage; by 3CL-PROBy similarity
    Sitei6883 – 68842Cleavage; by 3CL-PROBy similarity

    Regions

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Zinc fingeri1238 – 126629C4-type 1PROSITE-ProRule annotationAdd
    BLAST
    Zinc fingeri1835 – 187137C4-type 2PROSITE-ProRule annotationAdd
    BLAST
    Zinc fingeri4394 – 441017By similarityAdd
    BLAST
    Zinc fingeri4436 – 444914By similarityAdd
    BLAST
    Nucleotide bindingi5666 – 56738ATPBy similarity

    GO - Molecular functioni

    1. ATP binding Source: UniProtKB-KW
    2. cysteine-type endopeptidase activity Source: InterPro
    3. endonuclease activity Source: UniProtKB-KW
    4. exoribonuclease activity, producing 5'-phosphomonoesters Source: InterPro
    5. helicase activity Source: UniProtKB-KW
    6. methyltransferase activity Source: InterPro
    7. omega peptidase activity Source: InterPro
    8. RNA binding Source: UniProtKB-KW
    9. RNA-directed RNA polymerase activity Source: UniProtKB-KW
    10. zinc ion binding Source: InterPro

    GO - Biological processi

    1. induction by virus of catabolism of host mRNA Source: UniProtKB-KW
    2. induction by virus of host autophagy Source: UniProtKB-KW
    3. modulation by virus of host protein ubiquitination Source: UniProtKB-KW
    4. suppression by virus of host gene expression Source: UniProtKB-KW
    5. suppression by virus of host ISG15 activity Source: UniProtKB-KW
    6. suppression by virus of host type I interferon-mediated signaling pathway Source: UniProtKB-KW
    7. transcription, DNA-templated Source: InterPro
    8. viral genome replication Source: InterPro
    9. viral protein processing Source: InterPro

    Keywords - Molecular functioni

    Endonuclease, Exonuclease, Helicase, Hydrolase, Nuclease, Nucleotidyltransferase, Protease, RNA-directed RNA polymerase, Thiol protease, Transferase

    Keywords - Biological processi

    Activation of host autophagy by virus, Decay of host mRNAs by virus, Eukaryotic host gene expression shutoff by virus, Eukaryotic host translation shutoff by virus, Host gene expression shutoff by virus, Host mRNA suppression by virus, Host-virus interaction, Inhibition of host innate immune response by virus, Inhibition of host interferon signaling pathway by virus, Inhibition of host ISG15 by virus, Modulation of host ubiquitin pathway by viral deubiquitinase, Modulation of host ubiquitin pathway by virus, Ubl conjugation pathway, Viral immunoevasion, Viral RNA replication

    Keywords - Ligandi

    ATP-binding, Metal-binding, Nucleotide-binding, RNA-binding, Zinc

    Names & Taxonomyi

    Protein namesi
    Recommended name:
    Replicase polyprotein 1ab
    Short name:
    pp1ab
    Alternative name(s):
    ORF1ab polyprotein
    Cleaved into the following 15 chains:
    Non-structural protein 1
    Short name:
    nsp1
    Alternative name(s):
    p28
    Non-structural protein 2
    Short name:
    nsp2
    Alternative name(s):
    p65
    Alternative name(s):
    PL1-PRO/PL2-PRO
    PL1/PL2
    Papain-like proteinases 1/2
    p210
    Non-structural protein 4
    Short name:
    nsp4
    Alternative name(s):
    Peptide HD2
    p44
    3C-like proteinase (EC:3.4.22.-)
    Short name:
    3CL-PRO
    Short name:
    3CLp
    Alternative name(s):
    M-PRO
    nsp5
    p27
    Non-structural protein 6
    Short name:
    nsp6
    Non-structural protein 7
    Short name:
    nsp7
    Alternative name(s):
    p10
    Non-structural protein 8
    Short name:
    nsp8
    Alternative name(s):
    p22
    Non-structural protein 9
    Short name:
    nsp9
    Alternative name(s):
    p12
    Non-structural protein 10
    Short name:
    nsp10
    Alternative name(s):
    Growth factor-like peptide
    Short name:
    GFL
    p15
    RNA-directed RNA polymerase (EC:2.7.7.48)
    Short name:
    Pol
    Short name:
    RdRp
    Alternative name(s):
    nsp12
    p100
    Helicase (EC:3.6.4.12, EC:3.6.4.13)
    Short name:
    Hel
    Alternative name(s):
    nsp13
    p67
    Exoribonuclease (EC:3.1.13.-)
    Short name:
    ExoN
    Alternative name(s):
    nsp14
    Alternative name(s):
    NendoU
    nsp15
    p35
    Alternative name(s):
    nsp16
    Gene namesi
    Name:rep
    ORF Names:1a-1b
    OrganismiHuman coronavirus HKU1 (isolate N1) (HCoV-HKU1)
    Taxonomic identifieri443239 [NCBI]
    Taxonomic lineageiVirusesssRNA positive-strand viruses, no DNA stageNidoviralesCoronaviridaeCoronavirinaeBetacoronavirus
    Virus hostiHomo sapiens (Human) [TaxID: 9606]
    ProteomesiUP000008170: Genome

    Subcellular locationi

    Chain Non-structural protein 7 : Host cytoplasmhost perinuclear region By similarity
    Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes.
    Chain Non-structural protein 8 : Host cytoplasmhost perinuclear region By similarity
    Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes.
    Chain Non-structural protein 9 : Host cytoplasmhost perinuclear region By similarity
    Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes.
    Chain Non-structural protein 10 : Host cytoplasmhost perinuclear region By similarity
    Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes.
    Chain Helicase : Host endoplasmic reticulum-Golgi intermediate compartment Curated
    Note: The helicase interacts with the N protein in membranous complexes and colocalizes with sites of synthesis of new viral RNA.

    GO - Cellular componenti

    1. host cell endoplasmic reticulum-Golgi intermediate compartment Source: UniProtKB-SubCell
    2. host cell membrane Source: UniProtKB-SubCell
    3. host cell perinuclear region of cytoplasm Source: UniProtKB-SubCell
    4. integral component of membrane Source: UniProtKB-KW

    Keywords - Cellular componenti

    Host cytoplasm, Host membrane, Membrane

    PTM / Processingi

    Molecule processing

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Chaini1 – 222222Non-structural protein 1By similarityPRO_5000093333Add
    BLAST
    Chaini223 – 809587Non-structural protein 2By similarityPRO_5000093334Add
    BLAST
    Chaini810 – 28382029Non-structural protein 3By similarityPRO_5000093335Add
    BLAST
    Chaini2839 – 3334496Non-structural protein 4By similarityPRO_5000093336Add
    BLAST
    Chaini3335 – 36373033C-like proteinaseBy similarityPRO_5000093337Add
    BLAST
    Chaini3638 – 3924287Non-structural protein 6By similarityPRO_5000093338Add
    BLAST
    Chaini3925 – 401692Non-structural protein 7By similarityPRO_5000093339Add
    BLAST
    Chaini4017 – 4210194Non-structural protein 8By similarityPRO_5000093340Add
    BLAST
    Chaini4211 – 4320110Non-structural protein 9By similarityPRO_5000093341Add
    BLAST
    Chaini4321 – 4457137Non-structural protein 10By similarityPRO_5000093342Add
    BLAST
    Chaini4458 – 5385928RNA-directed RNA polymeraseBy similarityPRO_5000093343Add
    BLAST
    Chaini5386 – 5988603HelicaseBy similarityPRO_5000093344Add
    BLAST
    Chaini5989 – 6509521ExoribonucleaseBy similarityPRO_5000093345Add
    BLAST
    Chaini6510 – 6883374Uridylate-specific endoribonucleaseBy similarityPRO_5000093346Add
    BLAST
    Chaini6884 – 7182299Putative 2'-O-methyl transferaseBy similarityPRO_5000093347Add
    BLAST

    Post-translational modificationi

    Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. 3CL-PRO and PL-PRO proteinases are autocatalytically processed By similarity.By similarity

    Proteomic databases

    PRIDEiP0C6X2.

    Interactioni

    Subunit structurei

    3CL-PRO exists as monomer and homodimer. Eight copies of nsp7 and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is a dimer. Nsp10 forms a dodecamer By similarity.By similarity

    Structurei

    3D structure databases

    ProteinModelPortaliP0C6X2.
    SMRiP0C6X2. Positions 4052-4202, 4326-4450, 6510-6878.
    ModBaseiSearch...
    MobiDBiSearch...

    Transmembrane

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Transmembranei2226 – 224621HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2287 – 230721HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2318 – 233821HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2401 – 242121HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2443 – 246321HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2844 – 286421HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3119 – 313921HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3151 – 317121HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3178 – 319821HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3203 – 322321HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3651 – 367121HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3676 – 369621HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3701 – 372121HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3744 – 376421HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3772 – 379221HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3800 – 382021HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3843 – 386321HelicalSequence AnalysisAdd
    BLAST

    Family & Domainsi

    Domains and Repeats

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Repeati945 – 954101
    Repeati955 – 964102
    Repeati965 – 974103
    Repeati975 – 984104
    Repeati985 – 994105
    Repeati995 – 1004106
    Repeati1005 – 1014107
    Repeati1015 – 1024108
    Repeati1025 – 1034109
    Repeati1035 – 10441010
    Repeati1045 – 10541011
    Repeati1055 – 10641012
    Repeati1065 – 10741013
    Repeati1075 – 10841014
    Domaini1123 – 1373251Peptidase C16 1PROSITE-ProRule annotationAdd
    BLAST
    Domaini1351 – 1522172MacroPROSITE-ProRule annotationAdd
    BLAST
    Domaini1718 – 1978261Peptidase C16 2PROSITE-ProRule annotationAdd
    BLAST
    Domaini3335 – 3637303Peptidase C30PROSITE-ProRule annotationAdd
    BLAST
    Domaini5065 – 5227163RdRp catalyticAdd
    BLAST
    Domaini5386 – 546984CV MBDAdd
    BLAST
    Domaini5641 – 5822182(+)RNA virus helicase ATP-bindingAdd
    BLAST
    Domaini5823 – 5992170(+)RNA virus helicase C-terminalAdd
    BLAST

    Region

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Regioni945 – 108414014 X 10 AA tandem repeat of N-[DN]-D-E-D-V-V-T-G-DAdd
    BLAST
    Regioni2226 – 2463238HD1By similarityAdd
    BLAST
    Regioni2844 – 3223380HD2By similarityAdd
    BLAST
    Regioni3651 – 3863213HD3By similarityAdd
    BLAST

    Compositional bias

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Compositional biasi934 – 1116183Asp-richAdd
    BLAST
    Compositional biasi2229 – 232193Phe-richAdd
    BLAST
    Compositional biasi2616 – 26194Poly-Val
    Compositional biasi5390 – 541526Cys-richAdd
    BLAST

    Domaini

    The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane.

    Sequence similaritiesi

    Contains 1 Macro domain.PROSITE-ProRule annotation
    Contains 2 peptidase C16 domains.PROSITE-ProRule annotation
    Contains 1 peptidase C30 domain.PROSITE-ProRule annotation
    Contains 1 RdRp catalytic domain.Curated

    Zinc finger

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Zinc fingeri1238 – 126629C4-type 1PROSITE-ProRule annotationAdd
    BLAST
    Zinc fingeri1835 – 187137C4-type 2PROSITE-ProRule annotationAdd
    BLAST
    Zinc fingeri4394 – 441017By similarityAdd
    BLAST
    Zinc fingeri4436 – 444914By similarityAdd
    BLAST

    Keywords - Domaini

    Repeat, Transmembrane, Transmembrane helix, Zinc-finger

    Family and domain databases

    Gene3Di3.40.50.300. 2 hits.
    InterProiIPR027351. (+)RNA_virus_helicase_core_dom.
    IPR022570. Coronavirus_NSP1.
    IPR009461. Coronavirus_NSP16.
    IPR027352. CV_MBD_dom.
    IPR002589. Macro_dom.
    IPR009466. NSP11.
    IPR014828. NSP7.
    IPR014829. NSP8.
    IPR014822. NSP9.
    IPR027417. P-loop_NTPase.
    IPR002705. Pept_C30/C16_B_coronavir.
    IPR008740. Peptidase_C30.
    IPR013016. Peptidase_C30/C16.
    IPR001205. RNA-dir_pol_C.
    IPR009469. RNA_pol_N_coronovir.
    IPR018995. RNA_synth_NSP10_coronavirus.
    IPR029063. SAM-dependent_MTases-like.
    IPR009003. Trypsin-like_Pept_dom.
    IPR014827. Viral_protease.
    [Graphical view]
    PfamiPF06478. Corona_RPol_N. 1 hit.
    PF11963. DUF3477. 1 hit.
    PF01661. Macro. 1 hit.
    PF09401. NSP10. 1 hit.
    PF06471. NSP11. 1 hit.
    PF06460. NSP13. 1 hit.
    PF08716. nsp7. 1 hit.
    PF08717. nsp8. 1 hit.
    PF08710. nsp9. 1 hit.
    PF01831. Peptidase_C16. 1 hit.
    PF05409. Peptidase_C30. 1 hit.
    PF00680. RdRP_1. 1 hit.
    PF08715. Viral_protease. 1 hit.
    [Graphical view]
    SMARTiSM00506. A1pp. 1 hit.
    [Graphical view]
    SUPFAMiSSF101816. SSF101816. 1 hit.
    SSF144246. SSF144246. 1 hit.
    SSF50494. SSF50494. 1 hit.
    SSF52540. SSF52540. 1 hit.
    SSF53335. SSF53335. 1 hit.
    PROSITEiPS51653. CV_MBD. 1 hit.
    PS51442. M_PRO. 1 hit.
    PS51154. MACRO. 1 hit.
    PS51124. PEPTIDASE_C16. 2 hits.
    PS51657. PSRV_HELICASE. 1 hit.
    [Graphical view]

    Sequences (2)i

    Sequence statusi: Complete.

    Sequence processingi: The displayed sequence is further processed into a mature form.

    This entry describes 2 isoformsi produced by ribosomal frameshifting. Align

    Isoform Replicase polyprotein 1ab (identifier: P0C6X2-1) [UniParc]FASTAAdd to Basket

    Also known as: pp1ab

    This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

    « Hide

    MIKTSKYGLG FKWAPEFRWL LPDAAEELAS PMKSDEGGLC PSTGQAMESV     50
    GFVYDNHVKI DCRCILGQEW HVQSNLIRDI FVHEDLHVVE VLTKTAVKSG 100
    TAILIKSPLH SLGGFPKGYV MGLFRSYKTK RYVVHHLSMT TSTTNFGEDF 150
    LGWIVPFGFM PSYVHKWFQF CRLYIEESDL IISNFKFDDY DFSVEDAYAE 200
    VHAEPKGKYS QKAYALLRQY RGIKPVLFVD QYGCDYSGKL ADCLQAYGHY 250
    SLQDMRQKQS VWLANCDFDI VVAWHVVRDS RFVMRLQTIA TICGIKYVAQ 300
    PTEDVVDGDV VIREPVHLLS ADAIVLKLPS LMKVMTHMDD FSIKSIYNVD 350
    LCDCGFVMQY GYVDCFNDNC DFYGWVSGNM MDGFSCPLCC TVYDSSEVKA 400
    QSSGVIPENP VLFTNSTDTV NHDSFNLYGY SVTPFGSCIY WSPRPGLWIP 450
    IIKSSVKSYD DLVYSGVVGC KSIVKETALI THALYLDYVQ CKCGNLEQNH 500
    ILGVNNSWCR QLLLNRGDYN MLLKNIDLFV KRRADFACKF AVCGDGFVPF 550
    LLDGLIPRSY YLIQSGIFFT SLMSQFSQEV SDMCLKMCIL FMDRVSVATF 600
    YIEHYVNRLV TQFKLLGTTL VNKMVNWFNT MLDASAPATG WLLYQLLNGL 650
    FVVSQANFNF VALIPDYAKI LVNKFYTFFK LLLECVTVDV LKDMPVLKTI 700
    NGLVCIVGNK FYNVSTGLIP GFVLPCNAQE QQIYFFEGVA ESVIVEDDVI 750
    ENVKSSLSSY EYCQPPKSVE KICIIDNMYM GKCGDKFFPI VMNDKNICLL 800
    DQAWRFPCAG RKVNFNEKPV VMEIPSLMTV KVMFDLDSTF DDILGKVCSE 850
    FEVEKGVTVD DFVAVVCDAI ENALNSCKEH PVVGYQVRAF LNKLNENVVY 900
    LFDEAGDEAM ASRMYCTFAI EDVEDVISSE AVEDTIDGVV EDTINDDEDV 950
    VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV 1000
    VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV 1050
    VTGDNDDEDV VTGDNDDEDV VTGDNDDEDV VTGDNNDEEI VTGDNDDQIV 1100
    VTGDDVDDIE SIYDFDTYKA LLVFNDVYND ALFVSYGSSV ETETYFKVNG 1150
    LWSPTITHTN CWLRSVLLVM QKLPFKFKDL AIENMWLSYK VGYNQSFVDY 1200
    LLTTIPKAIV LPQGGFVADF AYWFLNQFDI NAYANWCCLK CGFSFDLNGL 1250
    DALFFYGDIV SHVCKCGHNM TLIAADLPCT LHFSLFDDNF CAFCTPKKIF 1300
    IAACAVDVNV CHSVAVIGDE QIDGKFVTKF SGDKFDFIVG YGMSFSMSSF 1350
    ELPQLYGLCI TPNVCFVKGD IINVARLVKA DVIVNPANGH MLHGGGVAKA 1400
    IAVAAGKKFS KETAAMVKSK GVCQVGDCYV STGGKLCKTI LNIVGPDARQ 1450
    DGRQSYVLLA RAYKHLNNYD CCLSTLISAG IFSVPADVSL TYLLGVVDKQ 1500
    VILVSNNKED FDIIQKCQIT SVVGTKALAV RLTANVGRVI KFETDAYKLF 1550
    LSGDDCFVSN SSVIQEVLLL RHDIQLNNDV RDYLLSKMTS LPKDWRLINK 1600
    FDVINGVKTV KYFECPNSIY ICSQGKDFGY VCDGSFYKAT VNQVCVLLAK 1650
    KIDVLLTVDG VNFKSISLTV GEVFGKILGN VFCDGIDVTK LKCSDFYADK 1700
    ILYQYENLSL ADISAVQSSF GFDQQQLLAY YNFLTVCKWS VVVNGPFFSF 1750
    EQSHNNCYVN VACLMLQHIN LKFNKWQWQE AWYEFRAGRP HRLVALVLAK 1800
    GHFKFDEPSD ATDFIRVVLK QADLSGAICE LELICDCGIK QESRVGVDAV 1850
    MHFGTLAKTD LFNGYKIGCN CAGRIVHCTK LNVPFLICSN TPLSKDLPDD 1900
    VVAANMFMGV GVGHYTHLKC GSPYQHYDAC SVKKYTGVSG CLTDCLYLKN 1950
    LTQTFTSMLT NYFLDDVEMV AYNPDLSQYY CDNGKYYTKP IIKAQFKPFA 2000
    KVDGVYTNFK LVGHDICAQL NDKLGFNVDL PFVEYKVTVW PVATGDVVLA 2050
    SDDLYVKRYF KGCETFGKPV IWFCHDEASL NSLTYFNKPS FKSENRYSVL 2100
    SVDSVSEESQ GNVVTSVMES QISTKEVKLK GVRKTVKIED AIIVNDENSS 2150
    IKVVKSLSLV DVWDMYLTGC DYVVWVANEL SRLVKSPTVR EYIRYGIKPI 2200
    TIPIDLLCLR DDNQTLLVPK IFKARAIEFY GFLKWLFIYV FSLLHFTNDK 2250
    TIFYTTEIAS KFTFNLFCLA LKNAFQTFRW SIFIKGFLVV ATVFLFWFNF 2300
    LYINVIFSDF YLPNISVFPI FVGRIVMWIK ATFGLVTICD FYSKLGVGFT 2350
    SHFCNGSFIC ELCHSGFDML DTYAAIDFVQ YEVDRRVLFD YVSLVKLIVE 2400
    LVIGYSLYTV WFYPLFCLIG LQLFTTWLPD LFMLETMHWL IRFIVFVANM 2450
    LPAFVLLRFY IVVTAMYKVV GFIRHIVYGC NKAGCLFCYK RNCSVRVKCS 2500
    TIVGGVIRYY DITANGGTGF CVKHQWNCFN CHSFKPGNTF ITVEAAIELS 2550
    KELKRPVNPT DASHYVVTDI KQVGCMMRLF YDRDGQRVYD DVDASLFVDI 2600
    NNLLHSKVKV VPNLYVVVVE SDADRANFLN AVVFYAQSLY RPILLVDKKL 2650
    ITTACNGISV TQTMFDVYVD TFMSHFDVDR KSFNNFVNIA HASLREGVQL 2700
    EKVLDTFVGC VRKCCSIDSD VETRFITKSM ISAVAAGLEF TDENYNNLVP 2750
    TYLKSDNIVA ADLGVLIQNG AKHVQGNVAK AANISCIWFI DAFNQLTADL 2800
    QHKLKKACVK TGLKLKLTFN KQEASVPILT TPFSLKGGVV LSNLLYILFF 2850
    VSLICFILLW ALLPTYSVYK SDIHLPAYAS FKVIDNGVVR DISVNDLCFA 2900
    NKFFQFDQWY ESTFGSVYYH NSMDCPIVVA VMDEDIGSTM FNVPTKVLRH 2950
    GFHVLHFLTY AFASDSVQCY TPHIQISYND FYASGCVLSS LCTMFKRGDG 3000
    TPHPYCYSDG VMKNASLYTS LVPHTRYSLA NSNGFIRFPD VISEGIVRIV 3050
    RTRSMTYCRV GACEYAEEGI CFNFNSSWVL NNDYYRSMPG TFCGRDLFDL 3100
    FYQFFSSLIR PIDFFSLTAS SIFGAILAIV VVLVFYYLIK LKRAFGDYTS 3150
    VVVINVVVWC INFLMLFVFQ VYPICACVYA CFYFYVTLYF PSEISVIMHL 3200
    QWIVMYGAIM PFWFCVTYVA MVIANHVLWL FSYCRKIGVN VCSDSTFEET 3250
    SLTTFMITKD SYCRLKNSVS DVAYNRYLSL YNKYRYYSGK MDTAAYREAA 3300
    CSQLAKAMET FNHNNGNDVL YQPPTASVST SFLQSGIVKM VSPTSKIEPC 3350
    IVSVTYGSMT LNGLWLDDKV YCPRHVICSS SNMNEPDYSA LLCRVTLGDF 3400
    TIMSGRMSLT VVSYQMQGCQ LVLTVSLQNP YTPKYTFGNV KPGETFTVLA 3450
    AYNGRPQGAF HVTMRSSYTI KGSFLCGSCG SVGYVLTGDS VKFVYMHQLE 3500
    LSTGCHTGTD FTGNFYGPYR DAQVVQLPVK DYVQTVNVIA WLYAAILNNC 3550
    AWFVQNDVCS TEDFNVWAMA NGFSQVKADL VLDALASMTG VSIETLLAAI 3600
    KRLYMGFQGR QILGSCTFED ELAPSDVYQQ LAGVKLQSKT KRFIKETIYW 3650
    ILISTFLFSC IISAFVKWTI FMYINTHMIG VTLCVLCFVS FMMLLVKHKH 3700
    FYLTMYIIPV LCTLFYVNYL VVYKEGFRGF TYVWLSYFVP AVNFTYVYEV 3750
    FYGCILCVFA IFITMHSINH DIFSLMFLVG RIVTLISMWY FGSNLEEDVL 3800
    LFITAFLGTY TWTTILSLAI AKIVANWLSV NIFYFTDVPY IKLILLSYLF 3850
    IGYILSCYWG FFSLLNSVFR MPMGVYNYKI SVQELRYMNA NGLRPPRNSF 3900
    EAILLNLKLL GIGGVPVIEV SQIQSKLTDV KCANVVLLNC LQHLHVASNS 3950
    KLWQYCSVLH NEILSTSDLS VAFDKLAQLL IVLFANPAAV DTKCLASIDE 4000
    VSDDYVQDST VLQALQSEFV NMASFVEYEV AKKNLADAKN SGSVNQQQIK 4050
    QLEKACNIAK SVYERDKAVA RKLERMADLA LTNMYKEARI NDKKSKVVSA 4100
    LQTMLFSMVR KLDNQALNSI LDNAVKGCVP LSAIPALAAN TLTIVIPDKQ 4150
    VFDKVVDNVY VTYAGSVWHI QTVQDADGIN KQLTDISVDS NWPLVIIANR 4200
    YNEVANAVMQ NNELMPHKLK IQVVNSGSDM NCNIPTQCYY NNGSSGRIVY 4250
    AVLSDVDGLK YTKIMKDDGN CVVLELDPPC KFSIQDVKGL KIKYLYFIKG 4300
    CNTLARGWVV GTLSSTIRLQ AGVATEYAAN SSILSLCAFS VDPKKTYLDY 4350
    IQQGGVPIIN CVKMLCDHAG TGMAITIKPE ATINQDSYGG ASVCIYCRAR 4400
    VEHPDVDGIC KLRGKFVQVP LGIKDPILYV LTHDVCQVCG FWRDGSCSCV 4450
    GSSVAVQSKD LNFLNRVRGT SVNARLVPCA SGLSTDVQLR AFDICNTNRA 4500
    GIGLYYKVNC CRFQRIDDDG NKLDKFFVVK RTNLEVYNKE KTYYELTKSC 4550
    GVVAEHDFFT FDIDGSRVPH IVRRNLSKYT MLDLCYALRH FDRNDCSILC 4600
    EILCEYADCK ESYFSKKDWY DFVENPDIIN IYKKLGPIFN RALLNTVIFA 4650
    DTLVEVGLVG VLTLDNQDLY GQWYDFGDFI QTAPGFGVAV ADSYYSYMMP 4700
    MLTMCHVLDC ELFVNDSYRQ FDLVQYDFTD YKLELFNKYF KYWGMKYHPN 4750
    TVDCDNDRCI IHCANFNILF SMVLPNTCFG PLVRQIFVDG VPFVVSIGYH 4800
    YKELGVVMNL DVDTHRYRLS LKDLLLYAAD PAMHVASASA LLDLRTCCFS 4850
    VAAITSGIKF QTVKPGNFNQ DFYEFVKSKG LFKEGSTVDL KHFFFTQDGN 4900
    AAITDYNYYK YNLPTMVDIK QLLFVLEVVY KYFEIYDGGC IPASQVIVNN 4950
    YDKSAGYPFN KFGKARLYYE ALSFEEQNEI YAYTKRNVLP TLTQMNLKYA 5000
    ISAKNRARTV AGVSILSTMT GRMFHQKCLK SIAATRGVPV VIGTTKFYGG 5050
    WDDMLRHLIK DVDNPVLMGW DYPKCDRAMP NILRIVSSLV LARKHEFCCS 5100
    HGDRFYRLAN ECAQVLSEIV MCGGCYYVKP GGTSSGDATT AFANSVFNIC 5150
    QAVTANVCSL MACNGHKIED LSIRNLQKRL YSNVYRTDYV DYTFVNEYYE 5200
    FLCKHFSMMI LSDDGVVCYN SDYASKGYIA NISVFQQVLY YQNNVFMSES 5250
    KCWVENDITN GPHEFCSQHT MLVKIDGDYV YLPYPDPSRI LGAGCFVDDL 5300
    LKTDSVLLIE RFVSLAIDAY PLVHHENEEY QKVFRVYLEY IKKLYNDLGT 5350
    QILDSYSVIL STCDGLKFTE ESFYKNMYLK SAVMQSVGAC VVCSSQTSLR 5400
    CGSCIRKPLL CCKCCYDHVM ATNHKYVLSV SPYVCNAPNC DVSDVTKLYL 5450
    GGMSYYCENH KPHYSFKLVM NGMVFGLYKQ SCTGSPYIDD FNKIASCKWT 5500
    EVDDYVLANE CIERLKLFAA ETQKATEEAF KQSYASATIQ EIVSDREVIL 5550
    CWETGKVKPP LNKNYVFTGY HFTSTGKTVL GEYVFDKSEL TNGVYYRATT 5600
    TYKLSIGDVF VLTSHSVASL SAPTLVPQEN YASIRFSSVY SVPLVFQNNV 5650
    ANYQHIGMKR YCTVQGPPGT GKSHLAIGLA VYYYTARVVY TAASHAAVDA 5700
    LCEKAYKFLN INDCTRIIPA KVRVDCYDKF KINDTTCKYV FTTINALPEL 5750
    VTDIVVVDEV SMLTNYELSV INARIKAKHY VYIGDPAQLP APRVLLSKGS 5800
    LEPRHFNSIT KIMCCLGPDI FLGNCYRCPK EIVETVSALV YDNKLKAKND 5850
    NSSLCFKVYF KGQTTHESSS AVNIQQIYLI SKFLKANPVW NSAVFISPYN 5900
    SQNYVAKRVL GVQTQTVDSA QGSEYDYVIY SQTAETAHSV NVNRFNVAIT 5950
    RAKKGIFCVM SNMQLFESLN FITLPLDKIQ NQTLPRLHCT TNLFKDCSKS 6000
    CLGYHPAHAP SFLAVDDKYK VNENLAVNLN ICEPVLTYSR LISLMGFKLD 6050
    LTLDGYSKLF ITKDEAIKRV RGWVGFDVEG AHATRENIGT NFPLQIGFST 6100
    GVDFVVEATG LFAERDCYTF KKTVAKAPPG EKFKHLIPLM SKGQKWDIVR 6150
    IRIVQMLSDY LLDLSDSVVF ITWSASFELT CLRYFAKLGR ELNCNVCSNR 6200
    ATCYNSRTGY YGCWRHSYTC DYVYNPLIVD IQQWGYTGSL TSNHDIICNV 6250
    HKGAHVASAD AIMTRCLAIY DCFCKSVNWN LEYPIISNEV SINTSCRLLQ 6300
    RVMLKAAMLC NRYNLCYDIG NPKGLACVKD YEFKFYDAFP VAKSVKQLFY 6350
    VYDVHKDNFK DGLCMFWNCN VDKYPSNSIV CRFDTRVLNK LNLPGCNGGS 6400
    LYVNKHAFHT NPFTRTVFEN LKPMPFFYYS DTPCVYVDGL ESKQVDYVPL 6450
    RSATCITRCN LGGAVCSKHA EEYCNYLESY NIVTTAGFTF WVYKNFDFYN 6500
    LWNTFTTLQS LENVIYNLVN VGHYDGRTGE LPCAIMNDKV VVKINNVDTV 6550
    IFKNNTSFPT NIAVELFTKR SIRHHPELKI LRNLNIDICW KHVLWDYVKD 6600
    SLFCSSTYGV CKYTDLKFIE NLNILFDGRD TGALEAFRKA RNGVFISTEK 6650
    LSRLSMIKGP QRADLNGVIV DKVGELKVEF WFAMRKDGDD VIFSRTDSLC 6700
    SSHYWSPQGN LGGNCAGNVI GNDALTRFTI FTQSRVLSSF EPRSDLERDF 6750
    IDMDDNLFIA KYGLEDYAFD HIVYGSFNHK VIGGLHLLIG LFRRKKKSNL 6800
    LIQEFLQYDS SIHSYFITDQ ECGSSKSVCT VIDLLLDDFV SIVKSLNLSC 6850
    VSKVVNINVD FKDFQFMLWC NDNKIMTFYP KMQATNDWKP GYSMPVLYKY 6900
    LNVPLERVSL WNYGKPINLP TGCMMNVAKY TQLCQYLNTT TLAVPVNMRV 6950
    LHLGAGSDKE VAPGSAVLRQ WLPSGSILVD NDLNPFVSDS LVTYFGDCMT 7000
    LPFDCHWDLI ISDMYDPLTK NIGDYNVSKD GFFTYICHLI RDKLSLGGSV 7050
    AIKITEFSWN ADLYKLMSCF AFWTVFCTNV NASSSEGFLI GINYLGKSSF 7100
    EIDGNVMHAN YLFWRNSTTW NGGAYSLFDM TKFSLKLAGT AVVNLRPDQL 7150
    NDLVYSLIER GKLLVRDTRK EIFVGDSLVN TC 7182

    Note: Produced by -1 ribosomal frameshifting at the 1a-1b genes boundary.

    Length:7,182
    Mass (Da):810,941
    Last modified:June 10, 2008 - v1
    Checksum:i663B34161D6248BC
    GO
    Isoform Replicase polyprotein 1a (identifier: P0C6U3-1) [UniParc]FASTAAdd to Basket

    Also known as: pp1a, ORF1a polyprotein

    The sequence of this isoform can be found in the external entry P0C6U3.
    Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.

    Note: Produced by conventional translation.

    Length:4,471
    Mass (Da):502,589
    GO

    Sequence databases

    Select the link destinations:
    EMBL
    GenBank
    DDBJ
    Links Updated
    AY597011 Genomic RNA. Translation: AAT98578.1.
    RefSeqiYP_173236.1. NC_006577.2.

    Genome annotation databases

    GeneIDi3200429.

    Keywords - Coding sequence diversityi

    Ribosomal frameshifting

    Cross-referencesi

    Sequence databases

    Select the link destinations:
    EMBL
    GenBank
    DDBJ
    Links Updated
    AY597011 Genomic RNA. Translation: AAT98578.1 .
    RefSeqi YP_173236.1. NC_006577.2.

    3D structure databases

    ProteinModelPortali P0C6X2.
    SMRi P0C6X2. Positions 4052-4202, 4326-4450, 6510-6878.
    ModBasei Search...
    MobiDBi Search...

    Proteomic databases

    PRIDEi P0C6X2.

    Protocols and materials databases

    Structural Biology Knowledgebase Search...

    Genome annotation databases

    GeneIDi 3200429.

    Family and domain databases

    Gene3Di 3.40.50.300. 2 hits.
    InterProi IPR027351. (+)RNA_virus_helicase_core_dom.
    IPR022570. Coronavirus_NSP1.
    IPR009461. Coronavirus_NSP16.
    IPR027352. CV_MBD_dom.
    IPR002589. Macro_dom.
    IPR009466. NSP11.
    IPR014828. NSP7.
    IPR014829. NSP8.
    IPR014822. NSP9.
    IPR027417. P-loop_NTPase.
    IPR002705. Pept_C30/C16_B_coronavir.
    IPR008740. Peptidase_C30.
    IPR013016. Peptidase_C30/C16.
    IPR001205. RNA-dir_pol_C.
    IPR009469. RNA_pol_N_coronovir.
    IPR018995. RNA_synth_NSP10_coronavirus.
    IPR029063. SAM-dependent_MTases-like.
    IPR009003. Trypsin-like_Pept_dom.
    IPR014827. Viral_protease.
    [Graphical view ]
    Pfami PF06478. Corona_RPol_N. 1 hit.
    PF11963. DUF3477. 1 hit.
    PF01661. Macro. 1 hit.
    PF09401. NSP10. 1 hit.
    PF06471. NSP11. 1 hit.
    PF06460. NSP13. 1 hit.
    PF08716. nsp7. 1 hit.
    PF08717. nsp8. 1 hit.
    PF08710. nsp9. 1 hit.
    PF01831. Peptidase_C16. 1 hit.
    PF05409. Peptidase_C30. 1 hit.
    PF00680. RdRP_1. 1 hit.
    PF08715. Viral_protease. 1 hit.
    [Graphical view ]
    SMARTi SM00506. A1pp. 1 hit.
    [Graphical view ]
    SUPFAMi SSF101816. SSF101816. 1 hit.
    SSF144246. SSF144246. 1 hit.
    SSF50494. SSF50494. 1 hit.
    SSF52540. SSF52540. 1 hit.
    SSF53335. SSF53335. 1 hit.
    PROSITEi PS51653. CV_MBD. 1 hit.
    PS51442. M_PRO. 1 hit.
    PS51154. MACRO. 1 hit.
    PS51124. PEPTIDASE_C16. 2 hits.
    PS51657. PSRV_HELICASE. 1 hit.
    [Graphical view ]
    ProtoNeti Search...

    Publicationsi

    1. "Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia."
      Woo P.C.Y., Lau S.K.P., Chu C.-M., Chan K.-H., Tsoi H.-W., Huang Y., Wong B.H.L., Poon R.W.S., Cai J.J., Luk W.-K., Poon L.L.M., Wong S.S.Y., Guan Y., Peiris J.S.M., Yuen K.-Y.
      J. Virol. 79:884-895(2005) [PubMed] [Europe PMC] [Abstract]
      Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].

    Entry informationi

    Entry nameiR1AB_CVHN1
    AccessioniPrimary (citable) accession number: P0C6X2
    Secondary accession number(s): Q5MQD2
    Entry historyi
    Integrated into UniProtKB/Swiss-Prot: June 10, 2008
    Last sequence update: June 10, 2008
    Last modified: October 1, 2014
    This is version 57 of the entry and version 1 of the sequence. [Complete history]
    Entry statusiReviewed (UniProtKB/Swiss-Prot)
    Annotation programViral Protein Annotation Program

    Miscellaneousi

    Miscellaneous

    Isolate N1 belongs to genotype A.

    Keywords - Technical termi

    Complete proteome

    Documents

    1. Peptidase families
      Classification of peptidase families and list of entries
    2. SIMILARITY comments
      Index of protein domains and families

    External Data

    Dasty 3