Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Basket 0
(max 400 entries)x

Your basket is currently empty.

Select item(s) and click on "Add to basket" to create your own collection here
(400 entries max)

P0C6U7

- R1A_CVHOC

UniProt

P0C6U7 - R1A_CVHOC

Protein

Replicase polyprotein 1a

Gene

1a

Organism
Human coronavirus OC43 (HCoV-OC43)
Status
Reviewed - Annotation score: 5 out of 5- Protein inferred from homologyi
    • BLAST
    • Align
    • Format
    • Add to basket
    • History
      Entry version 45 (01 Oct 2014)
      Sequence version 1 (10 Jun 2008)
      Previous versions | rss
    • Help video
    • Feedback
    • Comment

    Functioni

    The papain-like proteinase 1 (PL1-PRO) and papain-like proteinase 2 (PL2-PRO) are responsible for the cleavages located at the N-terminus of the replicase polyprotein. In addition, PLP2 possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. Antagonizes innate immune induction of type I interferon by blocking the phosphorylation, dimerization and subsequent nuclear translocation of host IRF-3 By similarity.By similarity
    The main proteinase 3CL-PRO is responsible for the majority of cleavages as it cleaves the C-terminus of replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function By similarity.PROSITE-ProRule annotation
    Nsp7-nsp8 hexadecamer may possibly confer processivity to the polymerase, maybe by binding to dsRNA or by producing primers utilized by the latter.By similarity
    Nsp9 is a ssRNA-binding protein.By similarity
    Non-structural protein 1: binds to the 40S ribosomal subunit and inhibits host translation. The nsp1-40S ribosome complex further induces an endonucleolytic cleavage near the 5'UTR of host mRNAs, targeting them for degradation. By suppressing host gene expression, nsp1 facilitates efficient viral gene expression in infected cells and evasion from host immune response By similarity.By similarity

    Catalytic activityi

    TSAVLQ-|-SGFRK-NH2 and SGVTFQ-|-GKFKK the two peptides corresponding to the two self-cleavage sites of the SARS 3C-like proteinase are the two most reactive peptide substrates. The enzyme exhibits a strong preference for substrates containing Gln at P1 position and Leu at P2 position.
    Thiol-dependent hydrolysis of ester, thioester, amide, peptide and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-residue protein attached to proteins as an intracellular targeting signal).

    Sites

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Sitei246 – 2472Cleavage; by PL1-PROBy similarity
    Sitei851 – 8522Cleavage; by PL1-PROBy similarity
    Active sitei1074 – 10741For PL1-PRO activityPROSITE-ProRule annotation
    Active sitei1225 – 12251For PL1-PRO activityPROSITE-ProRule annotation
    Active sitei1671 – 16711For PL2-PRO activityPROSITE-ProRule annotation
    Active sitei1828 – 18281For PL2-PRO activityPROSITE-ProRule annotation
    Sitei2750 – 27512Cleavage; by PL2-PROBy similarity
    Sitei3246 – 32472Cleavage; by 3CL-PROBy similarity
    Active sitei3287 – 32871For 3CL-PRO activityPROSITE-ProRule annotation
    Active sitei3391 – 33911For 3CL-PRO activityPROSITE-ProRule annotation
    Sitei3549 – 35502Cleavage; by 3CL-PROBy similarity
    Sitei3836 – 38372Cleavage; by 3CL-PROBy similarity
    Sitei3925 – 39262Cleavage; by 3CL-PROBy similarity
    Sitei4122 – 41232Cleavage; by 3CL-PROBy similarity
    Sitei4232 – 42332Cleavage; by 3CL-PROBy similarity
    Sitei4369 – 43702Cleavage; by 3CL-PROBy similarity

    Regions

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Zinc fingeri1151 – 117929C4-type 1PROSITE-ProRule annotationAdd
    BLAST
    Zinc fingeri1749 – 178537C4-type 2PROSITE-ProRule annotationAdd
    BLAST
    Zinc fingeri4306 – 432217By similarityAdd
    BLAST
    Zinc fingeri4348 – 436114By similarityAdd
    BLAST

    GO - Molecular functioni

    1. cysteine-type endopeptidase activity Source: InterPro
    2. omega peptidase activity Source: InterPro
    3. RNA binding Source: UniProtKB-KW
    4. RNA-directed RNA polymerase activity Source: InterPro
    5. zinc ion binding Source: InterPro

    GO - Biological processi

    1. induction by virus of catabolism of host mRNA Source: UniProtKB-KW
    2. induction by virus of host autophagy Source: UniProtKB-KW
    3. modulation by virus of host protein ubiquitination Source: UniProtKB-KW
    4. suppression by virus of host IRF3 activity Source: UniProtKB-KW
    5. suppression by virus of host ISG15 activity Source: UniProtKB-KW
    6. suppression by virus of host type I interferon-mediated signaling pathway Source: UniProtKB-KW
    7. viral genome replication Source: InterPro
    8. viral protein processing Source: InterPro

    Keywords - Molecular functioni

    Hydrolase, Protease, Thiol protease

    Keywords - Biological processi

    Activation of host autophagy by virus, Decay of host mRNAs by virus, Eukaryotic host gene expression shutoff by virus, Eukaryotic host translation shutoff by virus, Host gene expression shutoff by virus, Host mRNA suppression by virus, Host-virus interaction, Inhibition of host innate immune response by virus, Inhibition of host interferon signaling pathway by virus, Inhibition of host IRF3 by virus, Inhibition of host ISG15 by virus, Inhibition of host RLR pathway by virus, Modulation of host ubiquitin pathway by viral deubiquitinase, Modulation of host ubiquitin pathway by virus, Ubl conjugation pathway, Viral immunoevasion

    Keywords - Ligandi

    Metal-binding, RNA-binding, Zinc

    Names & Taxonomyi

    Protein namesi
    Recommended name:
    Replicase polyprotein 1a
    Short name:
    pp1a
    Alternative name(s):
    ORF1a polyprotein
    Cleaved into the following 11 chains:
    Non-structural protein 1
    Short name:
    nsp1
    Alternative name(s):
    p28
    Non-structural protein 2
    Short name:
    nsp2
    Alternative name(s):
    p65
    Alternative name(s):
    PL1-PRO/PL2-PRO
    PL1/PL2
    Papain-like proteinases 1/2
    p210
    Non-structural protein 4
    Short name:
    nsp4
    Alternative name(s):
    Peptide HD2
    p44
    3C-like proteinase (EC:3.4.22.-)
    Short name:
    3CL-PRO
    Short name:
    3CLp
    Alternative name(s):
    M-PRO
    nsp5
    p27
    Non-structural protein 6
    Short name:
    nsp6
    Non-structural protein 7
    Short name:
    nsp7
    Alternative name(s):
    p10
    Non-structural protein 8
    Short name:
    nsp8
    Alternative name(s):
    p22
    Non-structural protein 9
    Short name:
    nsp9
    Alternative name(s):
    p12
    Non-structural protein 10
    Short name:
    nsp10
    Alternative name(s):
    Growth factor-like peptide
    Short name:
    GFL
    p15
    Non-structural protein 11
    Short name:
    nsp11
    Gene namesi
    ORF Names:1a
    OrganismiHuman coronavirus OC43 (HCoV-OC43)
    Taxonomic identifieri31631 [NCBI]
    Taxonomic lineageiVirusesssRNA positive-strand viruses, no DNA stageNidoviralesCoronaviridaeCoronavirinaeBetacoronavirus
    Virus hostiHomo sapiens (Human) [TaxID: 9606]
    ProteomesiUP000007552: Genome

    Subcellular locationi

    Chain Non-structural protein 7 : Host cytoplasmhost perinuclear region By similarity
    Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes By similarity.By similarity
    Chain Non-structural protein 8 : Host cytoplasmhost perinuclear region By similarity
    Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes By similarity.By similarity
    Chain Non-structural protein 9 : Host cytoplasmhost perinuclear region By similarity
    Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes By similarity.By similarity
    Chain Non-structural protein 10 : Host cytoplasmhost perinuclear region By similarity
    Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes By similarity.By similarity

    GO - Cellular componenti

    1. host cell membrane Source: UniProtKB-SubCell
    2. host cell perinuclear region of cytoplasm Source: UniProtKB-SubCell
    3. integral component of membrane Source: UniProtKB-KW

    Keywords - Cellular componenti

    Host cytoplasm, Host membrane, Membrane

    PTM / Processingi

    Molecule processing

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Chaini1 – 43834383Replicase polyprotein 1aPRO_0000338242Add
    BLAST
    Chaini1 – 246246Non-structural protein 1By similarityPRO_0000338243Add
    BLAST
    Chaini247 – 851605Non-structural protein 2By similarityPRO_0000338244Add
    BLAST
    Chaini852 – 27501899Non-structural protein 3By similarityPRO_0000338245Add
    BLAST
    Chaini2751 – 3246496Non-structural protein 4By similarityPRO_0000338246Add
    BLAST
    Chaini3247 – 35493033C-like proteinaseBy similarityPRO_0000338247Add
    BLAST
    Chaini3550 – 3836287Non-structural protein 6By similarityPRO_0000338248Add
    BLAST
    Chaini3837 – 392589Non-structural protein 7By similarityPRO_0000338249Add
    BLAST
    Chaini3926 – 4122197Non-structural protein 8By similarityPRO_0000338250Add
    BLAST
    Chaini4123 – 4232110Non-structural protein 9By similarityPRO_0000338251Add
    BLAST
    Chaini4233 – 4369137Non-structural protein 10By similarityPRO_0000338252Add
    BLAST
    Chaini4370 – 438314Non-structural protein 11Sequence AnalysisPRO_0000338253Add
    BLAST

    Post-translational modificationi

    Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. 3CL-PRO and PL-PRO proteinases are autocatalytically processed By similarity.By similarity

    Interactioni

    Subunit structurei

    3CL-PRO exists as monomer and homodimer. Eight copies of nsp7 and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is a dimer. Nsp10 forms a dodecamer By similarity.By similarity

    Structurei

    3D structure databases

    ProteinModelPortaliP0C6U7.
    SMRiP0C6U7. Positions 3964-4114, 4238-4362.
    ModBaseiSearch...
    MobiDBiSearch...

    Transmembrane

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Transmembranei2138 – 215821HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2199 – 221921HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2221 – 224121HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2313 – 233321HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2343 – 236321HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2365 – 238521HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2752 – 277221HelicalSequence AnalysisAdd
    BLAST
    Transmembranei2824 – 284421HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3009 – 302921HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3031 – 305121HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3063 – 308321HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3090 – 311021HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3115 – 313521HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3558 – 357821HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3588 – 360821HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3615 – 363521HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3657 – 367721HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3684 – 370421HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3711 – 373121HelicalSequence AnalysisAdd
    BLAST
    Transmembranei3755 – 377521HelicalSequence AnalysisAdd
    BLAST

    Family & Domainsi

    Domains and Repeats

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Domaini1036 – 1274239Peptidase C16 1PROSITE-ProRule annotationAdd
    BLAST
    Domaini1275 – 1435161MacroPROSITE-ProRule annotationAdd
    BLAST
    Domaini1631 – 1892262Peptidase C16 2PROSITE-ProRule annotationAdd
    BLAST
    Domaini3247 – 3549303Peptidase C30PROSITE-ProRule annotationAdd
    BLAST

    Region

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Regioni2138 – 2385248HD1By similarityAdd
    BLAST
    Regioni2752 – 3135384HD2By similarityAdd
    BLAST
    Regioni3319 – 3775457HD3By similarityAdd
    BLAST

    Domaini

    The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane.

    Sequence similaritiesi

    Contains 1 Macro domain.PROSITE-ProRule annotation
    Contains 2 peptidase C16 domains.PROSITE-ProRule annotation
    Contains 1 peptidase C30 domain.PROSITE-ProRule annotation

    Zinc finger

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Zinc fingeri1151 – 117929C4-type 1PROSITE-ProRule annotationAdd
    BLAST
    Zinc fingeri1749 – 178537C4-type 2PROSITE-ProRule annotationAdd
    BLAST
    Zinc fingeri4306 – 432217By similarityAdd
    BLAST
    Zinc fingeri4348 – 436114By similarityAdd
    BLAST

    Keywords - Domaini

    Repeat, Transmembrane, Transmembrane helix, Zinc-finger

    Family and domain databases

    InterProiIPR022570. Coronavirus_NSP1.
    IPR002589. Macro_dom.
    IPR014828. NSP7.
    IPR014829. NSP8.
    IPR014822. NSP9.
    IPR002705. Pept_C30/C16_B_coronavir.
    IPR008740. Peptidase_C30.
    IPR013016. Peptidase_C30/C16.
    IPR018995. RNA_synth_NSP10_coronavirus.
    IPR009003. Trypsin-like_Pept_dom.
    IPR014827. Viral_protease.
    [Graphical view]
    PfamiPF11963. DUF3477. 1 hit.
    PF01661. Macro. 1 hit.
    PF09401. NSP10. 1 hit.
    PF08716. nsp7. 1 hit.
    PF08717. nsp8. 1 hit.
    PF08710. nsp9. 1 hit.
    PF01831. Peptidase_C16. 1 hit.
    PF05409. Peptidase_C30. 1 hit.
    PF08715. Viral_protease. 1 hit.
    [Graphical view]
    SMARTiSM00506. A1pp. 1 hit.
    [Graphical view]
    SUPFAMiSSF101816. SSF101816. 1 hit.
    SSF144246. SSF144246. 1 hit.
    SSF50494. SSF50494. 1 hit.
    PROSITEiPS51442. M_PRO. 1 hit.
    PS51154. MACRO. 1 hit.
    PS51124. PEPTIDASE_C16. 2 hits.
    [Graphical view]

    Sequences (2)i

    Sequence statusi: Complete.

    Sequence processingi: The displayed sequence is further processed into a mature form.

    This entry describes 2 isoformsi produced by ribosomal frameshifting. Align

    Isoform Replicase polyprotein 1a (identifier: P0C6U7-1) [UniParc]FASTAAdd to Basket

    Also known as: pp1a, ORF1a polyprotein

    This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

    « Hide

    MSKINKYGLE LHWAPEFPWM FEDAEEKLDN PSSSEVDMIC STTAQKLETD     50
    GICPENHVMV DCRRLLKQEC CVQSSLIREI VMNASPYHLE VLLQDALQSR 100
    EAVLVTTPLG MSLEACYVRG CNPKGWTMGL FRRRSVCNTG RCTVNKHVAY 150
    QLYMIDPAGV CLGAGQFVGW VIPLAFMPVQ SRKFIVPWVM YLRKRGEKGA 200
    YNKDHGCGGF GHVYDFKVED AYDQVHDEPK GKFSKKAYAL IRGYRGVKPL 250
    LYVDQYGCDY TGSLADGLEA YADKTLQEMK ALFPTWSQEL PFDVIVAWHV 300
    VRDPRYVMRL QSAATICSVA YVANPTEDLC DGSVVIKEPV HVYADDSIIL 350
    RQYNLFDIMS HFYMEADTVV NAFYGVALKD CGFVMQFGYI DCEQDSCDFK 400
    GWIPGNMIDG FACTTCGHVY EVGDLIAQSS GVLPVNPVLH TKSAAGYGGF 450
    GCKDSFTLYG QTVVYFGGCV YWSPARNIWI PILKSSVKSY DSLVYTGVLG 500
    CKAIVKETNL ICKALYLDYV QHKCGNLHQR ELLGVSDVWH KQLLINRGVY 550
    KPLLENIDYF NMRRAKFSLE TFTVCADGFM PFLLDDLVPR AYYLAVSGQA 600
    FCDYADKLCH AVVSKSKELL DVSLDSLGAA IHYLNSKIVD LAQHFSDFGT 650
    SFVSKIVHFF KTFTTSTALA FAWVLFHVLH GAYIVVESDI YFVKNIPRYA 700
    SAVAQAFQSV AKVVLDSLRV TFIDGLSCFK IGRRRICLSG RKIYEVERGL 750
    LHSSQLPLDV YDLTMPSQVQ KAKQKPIYLK GSGSDFSLAD SVVEVVTTSL 800
    TPCGYSEPPK VADKICIVDN VYMAKAGDKY YPVVVDDHVG LLDQAWRVPC 850
    AGRRVTFKEQ PTVKEIISMP KIIKVFYELD NDFNTILNTA CGVFEVDDTV 900
    DMEEFYAVVI DAIEEKLSPC KELEGVGAKV SAFLQKLEDN PLFLFDEAGE 950
    EVFAPKLYCA FTAPEDDDFL EESDVEEDDV EGEETDLTIT SAGQPCVASE 1000
    QEESSEVLED TLDDGPSVET SDSQVEEDVE MSDFVDLESV IQDYENVCFE 1050
    FYTTEPEFVK VLGLYVPKAT RNNCWLRSVL AVMQKLPCQF KDKNLQDLWV 1100
    LYKQQYSQLF VDTLVNKIPA NIVLPQGGYV ADFAYWFLTL CDWQCVAYWK 1150
    CIKCDLALKL KGLDAMFFYG DVVSHICKCG ESMVLIDVDV PFTAHFALKD 1200
    KLFCAFITKR IVYKAACVVD VNDSHSMAVV DGKQIDDHRI TSITSDKFDF 1250
    IIGHGMSFSM TTFEIAQLYG SCITPNVCFV KGDIIKVSKL VKAEVVVNPA 1300
    NGHMVHGGGV AKAIAVAAGQ QFVKETTNMV KSKGVCATGD CYVSTGGKLC 1350
    KTVLNVVGPD ARTQGKQSYV LLERVYKHFN NYDCVVTTLI SAGIFSVPSD 1400
    VSLTYLLGTA KKQVVLVSNN QEDFDLISKC QITAVEGTKK LAARLSFNVG 1450
    RSIVYETDAN KLILINDVAF VSTFNVLQDV LSLRHDIALD DDARTFVQSN 1500
    VDVLPEGWRV VNKFYQINGV RTVKYFECTG GIDICSQDKV FGYVQQGIFN 1550
    KATVAQIKAL FLDKVDILLT VDGVNFTNRF VPVGESFGKS LGNVFCDGVN 1600
    VTKHKCDINY KGKVFFQFDN LSSEDLKAVR SSFNFDQKEL LAYYNMLVNC 1650
    FKWQVVVNGK YFTFKQANNN CFVNVSCLML QSLHLTFKIV QWQEAWLEFR 1700
    SGRPARFVAL VLAKGGFKFG DPADSRDFLR VVFSQVDLTG AICDFEIACK 1750
    CGVKQEQRTG LDAVMHFGTL SREDLEIGYT VDCSCGKKLI HCVRFDVPFL 1800
    ICSNTPASVK LPKGVGSANI FIGDNVGHYV HVKCEQSYQL YDASNVKKVT 1850
    DVTGKLSDCL YLKNLKQTFK SVLTTYYLDD VKKIEYKPDL SQYYCDGGKY 1900
    YTQRIIKAQF KTFEKVDGVY TNFKLIGHTV CDSLNSKLGF DSSKEFVEYK 1950
    ITEWPTATGD VVLANDDLYV KRYERGCITF GKPVIWLSHE KASLNSLTYF 2000
    NRPLLVDDNK FDVLKVDDVD DSGDSSESGA KETKEINIIK LSGVKKPFKV 2050
    EDSVIVNDDT SETKYVKSLS IVDVYDMWLT GCKYVVRTAN ALSRAVNVPT 2100
    IRKFIKFGMT LVSIPIDLLN LREIKPAVNV VKAVRNKTSA CFNFIKWLFV 2150
    LLFGWIKISA DNKVIYTTEI ASKLTCKLVA LAFKNAFLTF KWSMVARGAC 2200
    IIATIFLLWF NFIYANVIFS DFYLPKIGFL PTFVGKIAQW IKNTFSLVTI 2250
    CDLYSIQDVG FKNQYCNGSI ACQFCLAGFD MLDNYKAIDV VQYEADRRAF 2300
    VDYTGVLKIV IELIVSYALY TAWFYPLFAL ISIQILTTWL PELFMLSTLH 2350
    WSFRLLVALA NMLPAHVFMR FYIIIASFIK LFSLFKHVAY GCSKSGCLFC 2400
    YKRNRSLRVK CSTIVGGMIR YYDVMANGGT GFCSKHQWNC IDCDSYKPGN 2450
    TFITVEAALD LSKELKRPIQ PTDVAYHTVT DVKQVGCSMR LFYDRDGQRI 2500
    YDDVNASLFV DYSNLLHSKV KSVPNMHVVV VENDADKANF LNAAVFYAQS 2550
    LFRPILMVDK NLITTANTGT SVTETMFDVY VDTFLSMFDV DKKSLNALIA 2600
    TAHSSIKQGT QIYKVLDTFL SCARKSCSID SDVDTKCLAD SVMSAVSAGL 2650
    ELTDESCNNL VPTYLKSDNI VAADLGVLIQ NSAKHVQGNV AKIAGVSCIW 2700
    SVDAFNQFSS DFQHKLKKAC CKTGLKLKLT YNKQMANVSV LTTPFSLKGG 2750
    AVFSYFVYVC FVLSLVCFIG LWCLMPTYTV HKSDFQLPVY ASYKVLDNGV 2800
    IRDVSVEDVC FANKFEQFDQ WYESTFGLSY YSNSMACPIV VAVIDQDFGS 2850
    TVFNVPTKVL RYGYHVLHFI THALSADGVQ CYTPHSQISY SNFYASGCVL 2900
    SSACTMFTMA DGSPQPYCYT DGLMQNASLY SSLVPHVRYN LANAKGFIRF 2950
    PEVLREGLVR VVRTRSMSYC RVGLCEEADE GICFNFNGSW VLNNDYYRSL 3000
    PGTFCGRDVF DLIYQLFKGL AQPVDFLALT ASSIAGAILA VIVVLVFYYL 3050
    IKLKRAFGDY TSVVFVNVIV WCVNFMMLFV FQVYPTLSCV YAICYFYATL 3100
    YFPSEISVIM HLQWLVMYGT IMPLWFCLLY IAVVVSNHAF WVFSYCRKLG 3150
    TSVRSDGTFE EMALTTFMIT KDSYCKLKNS LSDVAFNRYL SLYNKYRYYS 3200
    GKMDTAAYRE AACSQLAKAM DTFTNNNGSD VLYQPPTASV STSFLQSGIV 3250
    KMVNPTSKVE PCVVSVTYGN MTLNGLWLDD KVYCPRHVIC SASDMTNPDY 3300
    TNLLCRVTSS DFTVLFDRLS LTVMSYQMRG CMLVLTVTLQ NSRTPKYTFG 3350
    VVKPGETFTV LAAYNGKPQG AFHVTMRSSY TIKGSFLCGS CGSVGYVIMG 3400
    DCVKFVYMHQ LELSTGCHTG TDFNGDFYGP YKDAQVVQLP IQDYIQSVNF 3450
    LAWLYAAILN NCNWFIQSDK CSVEDFNVWA LSNGFSQVKS DLVIDALASM 3500
    TGVSLETLLA AIKRLKNGFQ GRQIMGSCSF EDELTPSDVY QQLAGIKLQS 3550
    KRTRLFKGTV CWIMASTFLF SCIITAFVKW TMFMYVTTNM FSITFCALCV 3600
    ISLAMLLVKH KHLYLTMYIT PVLFTLLYNN YLVVYKHTFR GYVYAWLSYY 3650
    VPSVEYTYTD EVIYGMLLLV GMVFVTLRSI NHDLFSFIMF VGRLISVFSL 3700
    WYKGSNLEEE ILLMLASLFG TYTWTTVLSM AVAKVIAKWV AVNVLYFTDI 3750
    PQIKIVLLCY LFIGYIISCY WGLFSLMNSL FRMPLGVYNY KISVQELRYM 3800
    NANGLRPPKN SFEALMLNFK LLGIGGVPII EVSQFQSKLT DVKCANVVLL 3850
    NCLQHLHVAS NSKLWHYCST LHNEILATSD LSVAFEKLAQ LLIVLFANPA 3900
    AVDSKCLTSI EEVCDDYAKD NTVLQALQSE FVNMASFVEY EVAKKNLDEA 3950
    RFSGSANQQQ LKQLEKACNI AKSAYERDRA VAKKLERMAD LALTNMYKEA 4000
    RINDKKSKVV SALQTMLFSM VRKLDNQALN SILDNAVKGC VPLNAIPSLA 4050
    ANTLNIIVPD KSVYDQIVDN IYVTYAGNVW QIQTIQDSDG TNKQLNEISD 4100
    DCNWPLVIIA NRYNEVSATV LQNNELMPAK LKIQVVNSGP DQTCNTPTQC 4150
    YYNNSNNGKI VYAILSDVDG LKYTKILKDD GNFVVLELDP PCKFTVQDAK 4200
    GLKIKYLYFV KGCNTLARGW VVGTISSTVR LQAGTATEYA SNSSILSLCA 4250
    FSVDPKKTYL DFIQQGGTPI ANCVKMLCDH AGTGMAITVK PDATTSQDSY 4300
    GGASVCIYCR ARVEHPDVDG LCKLRGKFVQ VPVGIKDPVS YVLTHDVCRV 4350
    CGFWRDGSCS CVSTDTTVQS KDTNFLNGFG VRV 4383

    Note: Produced by conventional translation.

    Length:4,383
    Mass (Da):491,305
    Last modified:June 10, 2008 - v1
    Checksum:i264040B0DDCDF54E
    GO
    Isoform Replicase polyprotein 1ab (identifier: P0C6X6-1) [UniParc]FASTAAdd to Basket

    Also known as: pp1ab

    The sequence of this isoform can be found in the external entry P0C6X6.
    Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.

    Note: Produced by -1 ribosomal frameshifting at the 1a-1b genes boundary.

    Length:7,095
    Mass (Da):797,914
    GO

    Experimental Info

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Sequence conflicti426 – 4261I → M(PubMed:15650185)Curated
    Sequence conflicti813 – 8131D → A(PubMed:15650185)Curated
    Sequence conflicti4376 – 43772LN → FK(PubMed:15280490)Curated

    Natural variant

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Natural varianti88 – 881H → D in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti207 – 2071C → R in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti291 – 2911P → L in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti317 – 3171C → R in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti356 – 3561F → V in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti545 – 5451I → L in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti762 – 7621D → N in strain: Isolate 87309 Belgium 2003.
    Natural varianti953 – 9531F → L in strain: ATCC VR-759, Isolate clinical OC43-Paris and Isolate 87309 Belgium 2003.
    Natural varianti989 – 9891I → V in strain: ATCC VR-759, Isolate clinical OC43-Paris and Isolate 87309 Belgium 2003.
    Natural varianti1305 – 13051V → A in strain: ATCC VR-759, Isolate clinical OC43-Paris and Isolate 87309 Belgium 2003.
    Natural varianti1328 – 13281N → D in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti1379 – 13791F → L in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti1504 – 15041L → V in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti1740 – 17401G → E in strain: Isolate 87309 Belgium 2003.
    Natural varianti1825 – 18251N → K in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti1936 – 19361S → A in strain: ATCC VR-759, Isolate clinical OC43-Paris and Isolate 87309 Belgium 2003.
    Natural varianti1965 – 19651N → T in strain: ATCC VR-759, Isolate clinical OC43-Paris.
    Natural varianti2004 – 20041L → S in strain: ATCC VR-759, Isolate clinical OC43-Paris and Isolate 87309 Belgium 2003.
    Natural varianti2022 – 20221S → G in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti2138 – 21403TSA → ISV in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti2256 – 22561I → M in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti2257 – 22571Q → H in strain: Isolate 87309 Belgium 2003.
    Natural varianti2386 – 23861K → R in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti2500 – 25001I → T in strain: ATCC VR-759, Isolate clinical OC43-Paris and Isolate 87309 Belgium 2003.
    Natural varianti2921 – 29211D → E in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti2961 – 29611V → I in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti3086 – 30861T → I in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti3440 – 34401P → L in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti3451 – 34511L → V in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti3466 – 34661I → V in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti4067 – 40671I → V in strain: ATCC VR-759 and Isolate clinical OC43-Paris.
    Natural varianti4071 – 40711I → V in strain: ATCC VR-759, Isolate clinical OC43-Paris and Isolate 87309 Belgium 2003.

    Sequence databases

    Select the link destinations:
    EMBL
    GenBank
    DDBJ
    Links Updated
    AY903459 Genomic RNA. No translation available.
    AY903460 Genomic RNA. No translation available.
    AY585228 Genomic RNA. No translation available.
    AY585229 Genomic RNA. No translation available.
    AY391777 Genomic RNA. No translation available.

    Keywords - Coding sequence diversityi

    Ribosomal frameshifting

    Cross-referencesi

    Sequence databases

    Select the link destinations:
    EMBL
    GenBank
    DDBJ
    Links Updated
    AY903459 Genomic RNA. No translation available.
    AY903460 Genomic RNA. No translation available.
    AY585228 Genomic RNA. No translation available.
    AY585229 Genomic RNA. No translation available.
    AY391777 Genomic RNA. No translation available.

    3D structure databases

    ProteinModelPortali P0C6U7.
    SMRi P0C6U7. Positions 3964-4114, 4238-4362.
    ModBasei Search...
    MobiDBi Search...

    Protocols and materials databases

    Structural Biology Knowledgebase Search...

    Family and domain databases

    InterProi IPR022570. Coronavirus_NSP1.
    IPR002589. Macro_dom.
    IPR014828. NSP7.
    IPR014829. NSP8.
    IPR014822. NSP9.
    IPR002705. Pept_C30/C16_B_coronavir.
    IPR008740. Peptidase_C30.
    IPR013016. Peptidase_C30/C16.
    IPR018995. RNA_synth_NSP10_coronavirus.
    IPR009003. Trypsin-like_Pept_dom.
    IPR014827. Viral_protease.
    [Graphical view ]
    Pfami PF11963. DUF3477. 1 hit.
    PF01661. Macro. 1 hit.
    PF09401. NSP10. 1 hit.
    PF08716. nsp7. 1 hit.
    PF08717. nsp8. 1 hit.
    PF08710. nsp9. 1 hit.
    PF01831. Peptidase_C16. 1 hit.
    PF05409. Peptidase_C30. 1 hit.
    PF08715. Viral_protease. 1 hit.
    [Graphical view ]
    SMARTi SM00506. A1pp. 1 hit.
    [Graphical view ]
    SUPFAMi SSF101816. SSF101816. 1 hit.
    SSF144246. SSF144246. 1 hit.
    SSF50494. SSF50494. 1 hit.
    PROSITEi PS51442. M_PRO. 1 hit.
    PS51154. MACRO. 1 hit.
    PS51124. PEPTIDASE_C16. 2 hits.
    [Graphical view ]
    ProtoNeti Search...

    Publicationsi

    1. "Circulation of genetically distinct contemporary human coronavirus OC43 strains."
      Vijgen L., Keyaerts E., Lemey P., Moes E., Li S., Vandamme A.M., Van Ranst M.
      Virology 337:85-92(2005) [PubMed] [Europe PMC] [Abstract]
      Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
      Strain: Isolate 19572 Belgium 2004 and Isolate 87309 Belgium 2003.
    2. "Human respiratory coronavirus OC43: genetic stability and neuroinvasion."
      St Jean J.R., Jacomy H., Desforges M., Vabret A., Freymuth F., Talbot P.J.
      J. Virol. 78:8824-8834(2004) [PubMed] [Europe PMC] [Abstract]
      Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
      Strain: Isolate ATCC VR-759 and Isolate clinical OC43-Paris.
    3. "Complete genomic sequence of human coronavirus OC43: molecular clock analysis suggests a relatively recent zoonotic coronavirus transmission event."
      Vijgen L., Keyaerts E., Moes E., Thoelen I., Wollants E., Lemey P., Vandamme A.M., Van Ranst M.
      J. Virol. 79:1595-1604(2005) [PubMed] [Europe PMC] [Abstract]
      Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
      Strain: Isolate ATCC VR-759.

    Entry informationi

    Entry nameiR1A_CVHOC
    AccessioniPrimary (citable) accession number: P0C6U7
    Secondary accession number(s): Q9WAC3
    Entry historyi
    Integrated into UniProtKB/Swiss-Prot: June 10, 2008
    Last sequence update: June 10, 2008
    Last modified: October 1, 2014
    This is version 45 of the entry and version 1 of the sequence. [Complete history]
    Entry statusiReviewed (UniProtKB/Swiss-Prot)
    Annotation programViral Protein Annotation Program

    Miscellaneousi

    Miscellaneous

    The sequence shown is that of isolate 19572 Belgium 2004.

    Keywords - Technical termi

    Complete proteome

    Documents

    1. Peptidase families
      Classification of peptidase families and list of entries
    2. SIMILARITY comments
      Index of protein domains and families

    External Data

    Dasty 3