Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Entry version 118 (31 Jul 2019)
Sequence version 2 (12 Apr 2005)
Previous versions | rss
Help videoAdd a publicationFeedback
Protein

Replicase polyprotein 1ab

Gene

rep

Organism
Lactate dehydrogenase elevating virus (strain Plagemann) (LDV)
Status
Reviewed-Annotation score:

Annotation score:5 out of 5

<p>The annotation score provides a heuristic measure of the annotation content of a UniProtKB entry or proteome. This score <strong>cannot</strong> be used as a measure of the accuracy of the annotation as we cannot define the ‘correct annotation’ for any given protein.<p><a href='/help/annotation_score' target='_top'>More...</a></p>
-Experimental evidence at protein leveli <p>This indicates the type of evidence that supports the existence of the protein. Note that the ‘protein existence’ evidence does not give information on the accuracy or correctness of the sequence(s) displayed.<p><a href='/help/protein_existence' target='_top'>More...</a></p>

<p>This section provides any useful information about the protein, mostly biological knowledge.<p><a href='/help/function_section' target='_top'>More...</a></p>Functioni

The replicase polyprotein 1ab is a multifunctional protein: it contains the activities necessary for the transcription of negative stranded RNA, leader RNA, subgenomic mRNAs and progeny virion RNA as well as proteinases responsible for the cleavage of the polyprotein into functional products.
The Nsp1 chain is essential for viral subgenomic mRNA synthesis.By similarity
The 3C-like serine proteinase chain is responsible for the majority of cleavages as it cleaves the C-terminus of the polyprotein.By similarity
The helicase chain, which contains a zinc finger structure, displays RNA and DNA duplex-unwinding activities with 5' to 3' polarity.By similarity

<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section describes the catalytic activity of an enzyme, i.e. a chemical reaction that the enzyme catalyzes.<p><a href='/help/catalytic_activity' target='_top'>More...</a></p>Catalytic activityi

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section is used for enzymes and indicates the residues directly involved in catalysis.<p><a href='/help/act_site' target='_top'>More...</a></p>Active sitei76For Nsp1-alpha papain-like cysteine proteinase activityPROSITE-ProRule annotation1 Publication1
Active sitei147For Nsp1-alpha papain-like cysteine proteinase activityPROSITE-ProRule annotation1 Publication1
Active sitei269For Nsp1-beta papain-like cysteine proteinase activityPROSITE-ProRule annotation1 Publication1
Active sitei340For Nsp1-beta papain-like cysteine proteinase activityPROSITE-ProRule annotation1 Publication1
Active sitei390For Nsp2 cysteine proteinase activityPROSITE-ProRule annotation1
Active sitei456For Nsp2 cysteine proteinase activityPROSITE-ProRule annotation1
Active sitei1551Charge relay system; for 3C-like serine proteinase activityPROSITE-ProRule annotation1
Active sitei1576Charge relay system; for 3C-like serine proteinase activityPROSITE-ProRule annotation1
Active sitei1628Charge relay system; for 3C-like serine proteinase activityPROSITE-ProRule annotation1
<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section indicates at which position the protein binds a given metal ion. The nature of the metal is indicated in the ‘Description’ field.<p><a href='/help/metal' target='_top'>More...</a></p>Metal bindingi2850Zinc 1PROSITE-ProRule annotation1
Metal bindingi2853Zinc 1PROSITE-ProRule annotation1
Metal bindingi2863Zinc 2PROSITE-ProRule annotation1
Metal bindingi2868Zinc 1PROSITE-ProRule annotation1
Metal bindingi2871Zinc 1PROSITE-ProRule annotation1
Metal bindingi2873Zinc 2PROSITE-ProRule annotation1
Metal bindingi2875Zinc 2PROSITE-ProRule annotation1
Metal bindingi2877Zinc 2PROSITE-ProRule annotation1
Metal bindingi2884Zinc 3PROSITE-ProRule annotation1
Metal bindingi2886Zinc 3PROSITE-ProRule annotation1
Metal bindingi2893Zinc 3PROSITE-ProRule annotation1
Metal bindingi2896Zinc 3PROSITE-ProRule annotation1

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section specifies the position(s) and type(s) of zinc fingers within the protein.<p><a href='/help/zn_fing' target='_top'>More...</a></p>Zinc fingeri8 – 28C4-type; atypicalAdd BLAST21
<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section describes a region in the protein which binds nucleotide phosphates. It always involves more than one amino acid and includes all residues involved in nucleotide-binding.<p><a href='/help/np_bind' target='_top'>More...</a></p>Nucleotide bindingi2992 – 2999ATPBy similarity8

<p>The <a href="http://www.geneontology.org/">Gene Ontology (GO)</a> project provides a set of hierarchical controlled vocabulary split into 3 categories:<p><a href='/help/gene_ontology' target='_top'>More...</a></p>GO - Molecular functioni

GO - Biological processi

<p>UniProtKB Keywords constitute a <a href="http://www.uniprot.org/keywords">controlled vocabulary</a> with a hierarchical structure. Keywords summarise the content of a UniProtKB entry and facilitate the search for proteins of interest.<p><a href='/help/keywords' target='_top'>More...</a></p>Keywordsi

Molecular functionHelicase, Hydrolase, Nucleotidyltransferase, Protease, RNA-directed RNA polymerase, Serine protease, Thiol protease, Transferase
Biological processViral RNA replication
LigandATP-binding, Metal-binding, Nucleotide-binding, Zinc

Protein family/group databases

MEROPS protease database

More...
MEROPSi
C31.001

<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Names & Taxonomyi

<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section provides an exhaustive list of all names of the protein, from commonly used to obsolete, to allow unambiguous identification of a protein.<p><a href='/help/protein_names' target='_top'>More...</a></p>Protein namesi
Recommended name:
Replicase polyprotein 1ab
Alternative name(s):
ORF1ab polyprotein
Cleaved into the following 15 chains:
Nsp1-alpha papain-like cysteine proteinase (EC:3.4.22.-)
Alternative name(s):
PCP1-alpha
Nsp1-beta papain-like cysteine proteinase (EC:3.4.22.-)
Alternative name(s):
PCP1-beta
Nsp2 cysteine proteinase (EC:3.4.22.-)
Alternative name(s):
CP2
Short name:
CP
Non-structural protein 3
Short name:
Nsp3
3C-like serine proteinase (EC:3.4.21.-)
Short name:
3CLSP
Alternative name(s):
Nsp4
Non-structural protein 5-6-7
Short name:
Nsp5-6-7
Non-structural protein 5
Short name:
Nsp5
Non-structural protein 6
Short name:
Nsp6
Non-structural protein 7-alpha
Short name:
Nsp7-alpha
Non-structural protein 7-beta
Short name:
Nsp7-beta
Non-structural protein 8
Short name:
Nsp8
RNA-directed RNA polymerase (EC:2.7.7.48)
Short name:
Pol
Short name:
RdRp
Alternative name(s):
Nsp9
Helicase (EC:3.6.4.12, EC:3.6.4.13)
Short name:
Hel
Alternative name(s):
Nsp10
Non-structural protein 11
Short name:
Nsp11
Non-structural protein 12
Short name:
Nsp12
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section indicates the name(s) of the gene(s) that code for the protein sequence(s) described in the entry. Four distinct tokens exist: ‘Name’, ‘Synonyms’, ‘Ordered locus names’ and ‘ORF names’.<p><a href='/help/gene_name' target='_top'>More...</a></p>Gene namesi
Name:rep
ORF Names:1a-1b
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section provides information on the name(s) of the organism that is the source of the protein sequence.<p><a href='/help/organism-name' target='_top'>More...</a></p>OrganismiLactate dehydrogenase elevating virus (strain Plagemann) (LDV)
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section shows the unique identifier assigned by the NCBI to the source organism of the protein. This is known as the ‘taxonomic identifier’ or ‘taxid’.<p><a href='/help/taxonomic_identifier' target='_top'>More...</a></p>Taxonomic identifieri300016 [NCBI]
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section contains the taxonomic hierarchical classification lineage of the source organism. It lists the nodes as they appear top-down in the taxonomic tree, with the more general grouping listed first.<p><a href='/help/taxonomic_lineage' target='_top'>More...</a></p>Taxonomic lineageiVirusesRiboviriaNidoviralesArnidovirineaeArteriviridaeVariarterivirinaeGammaarterivirusGammaarterivirus lacdeh
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section only exists in viral entries and indicates the host(s) either as a specific organism or taxonomic group of organisms that are susceptible to be infected by a virus.<p><a href='/help/virus_host' target='_top'>More...</a></p>Virus hostiMus musculus (Mouse) [TaxID: 10090]

<p>This section provides information on the location and the topology of the mature protein in the cell.<p><a href='/help/subcellular_location_section' target='_top'>More...</a></p>Subcellular locationi

Topology

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the <a href="http://www.uniprot.org/help/subcellular_location_section">'Subcellular location'</a> section describes the extent of a membrane-spanning region of the protein. It denotes the presence of both alpha-helical transmembrane regions and the membrane spanning regions of beta-barrel transmembrane proteins.<p><a href='/help/transmem' target='_top'>More...</a></p>Transmembranei942 – 962HelicalSequence analysisAdd BLAST21
Transmembranei977 – 997HelicalSequence analysisAdd BLAST21
Transmembranei1010 – 1030HelicalSequence analysisAdd BLAST21
Transmembranei1060 – 1080HelicalSequence analysisAdd BLAST21
Transmembranei1085 – 1105HelicalSequence analysisAdd BLAST21
Transmembranei1289 – 1309HelicalSequence analysisAdd BLAST21
Transmembranei1364 – 1384HelicalSequence analysisAdd BLAST21
Transmembranei1386 – 1406HelicalSequence analysisAdd BLAST21
Transmembranei1425 – 1445HelicalSequence analysisAdd BLAST21
Transmembranei1715 – 1735HelicalSequence analysisAdd BLAST21
Transmembranei1737 – 1757HelicalSequence analysisAdd BLAST21
Transmembranei1761 – 1781HelicalSequence analysisAdd BLAST21
Transmembranei1832 – 1852HelicalSequence analysisAdd BLAST21

GO - Cellular componenti

Keywords - Cellular componenti

Host cytoplasm, Host membrane, Membrane

<p>This section provides information on the disease(s) and phenotype(s) associated with a protein.<p><a href='/help/pathology_and_biotech_section' target='_top'>More...</a></p>Pathology & Biotechi

Mutagenesis

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the <a href="http://www.uniprot.org/manual/pathology_and_biotech_section">'Pathology and Biotech'</a> section describes the effect of the experimental mutation of one or more amino acid(s) on the biological properties of the protein.<p><a href='/help/mutagen' target='_top'>More...</a></p>Mutagenesisi76C → S: Complete loss of cleavage between nsp1-alpha and nsp1-beta. 1 Publication1
Mutagenesisi269C → A: Complete loss of cleavage between nsp1-beta and nsp2. 1 Publication1

<p>This section describes post-translational modifications (PTMs) and/or processing events.<p><a href='/help/ptm_processing_section' target='_top'>More...</a></p>PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘PTM / Processing’ section describes the extent of a polypeptide chain in the mature protein following processing.<p><a href='/help/chain' target='_top'>More...</a></p>ChainiPRO_00000366441 – 3616Replicase polyprotein 1abAdd BLAST3616
ChainiPRO_00000366461 – ?Nsp1-alpha papain-like cysteine proteinaseBy similarity
ChainiPRO_0000036647? – 380Nsp1-beta papain-like cysteine proteinaseBy similarity
ChainiPRO_0000036648381 – 1286Nsp2 cysteine proteinaseBy similarityAdd BLAST906
ChainiPRO_00000366491287 – 1512Non-structural protein 3By similarityAdd BLAST226
ChainiPRO_00000366501513 – 17143C-like serine proteinaseBy similarityAdd BLAST202
ChainiPRO_00000366511715 – 2161Non-structural protein 5-6-7By similarityAdd BLAST447
ChainiPRO_00004231141715 – 1878Non-structural protein 5By similarityAdd BLAST164
ChainiPRO_00004231151879 – 1894Non-structural protein 6By similarityAdd BLAST16
ChainiPRO_00004231161895 – 2026Non-structural protein 7-alphaBy similarityAdd BLAST132
ChainiPRO_00004231172027 – 2161Non-structural protein 7-betaBy similarityAdd BLAST135
ChainiPRO_00000366522162 – 2843RNA-directed RNA polymeraseBy similarityAdd BLAST682
ChainiPRO_00000366532162 – 2206Non-structural protein 8By similarityAdd BLAST45
ChainiPRO_00000366542844 – 3272HelicaseBy similarityAdd BLAST429
ChainiPRO_00000366553273 – 3494Non-structural protein 11By similarityAdd BLAST222
ChainiPRO_00000366563495 – 3616Non-structural protein 12By similarityAdd BLAST122

<p>This subsection of the <a href="http://www.uniprot.org/help/ptm_processing_section">PTM/processing</a> section describes post-translational modifications (PTMs). This subsection <strong>complements</strong> the information provided at the sequence level or describes modifications for which <strong>position-specific data is not yet available</strong>.<p><a href='/help/post-translational_modification' target='_top'>More...</a></p>Post-translational modificationi

Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. There are two alternative pathways for processing. Either nsp4-5 is cleaved, which represents the major pathway or the nsp5-6 and nsp6-7 are processed, which represents the minor pathway. The major pathway occurs when nsp2 acts as cofactor for nsp4 (By similarity).By similarity

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection describes interesting single amino acid sites on the sequence that are not defined in any other subsection. This subsection can be displayed in different sections (‘Function’, ‘PTM / Processing’, ‘Pathology and Biotech’) according to its content.<p><a href='/help/site' target='_top'>More...</a></p>Sitei181 – 182Cleavage; by autolysisSequence analysis2
Sitei381 – 382Cleavage; by autolysisSequence analysis2
Sitei1286 – 1287Cleavage; by CP2By similarity2
Sitei1512 – 1513Cleavage; by 3CLSPBy similarity2
Sitei1714 – 1715Cleavage; by 3CLSPBy similarity2
Sitei1878 – 1879Cleavage; by 3CLSPBy similarity2
Sitei1894 – 1895Cleavage; by 3CLSPBy similarity2
Sitei2026 – 2027Cleavage; by 3CLSPBy similarity2
Sitei2161 – 2162Cleavage; by 3CLSPBy similarity2
Sitei2843 – 2844Cleavage; by 3CLSPBy similarity2
Sitei3272 – 3273Cleavage; by 3CLSPBy similarity2
Sitei3494 – 3495Cleavage; by 3CLSPBy similarity2

Proteomic databases

PRoteomics IDEntifications database

More...
PRIDEi
Q83017

<p>This section provides information on sequence similarities with other proteins and the domain(s) present in a protein.<p><a href='/help/family_and_domains_section' target='_top'>More...</a></p>Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the <a href="http://www.uniprot.org/help/family_and_domains_section">Family and Domains</a> section describes the position and type of a domain, which is defined as a specific combination of secondary structures organized into a characteristic three-dimensional structure or fold.<p><a href='/help/domain' target='_top'>More...</a></p>Domaini69 – 181Peptidase C31PROSITE-ProRule annotationAdd BLAST113
Domaini262 – 381Peptidase C32PROSITE-ProRule annotationAdd BLAST120
Domaini381 – 486Peptidase C33PROSITE-ProRule annotationAdd BLAST106
Domaini1513 – 1714Peptidase S32PROSITE-ProRule annotationAdd BLAST202
Domaini2590 – 2724RdRp catalyticPROSITE-ProRule annotationAdd BLAST135
Domaini2844 – 2907AV ZBDPROSITE-ProRule annotationAdd BLAST64
Domaini2964 – 3116(+)RNA virus helicase ATP-bindingAdd BLAST153
Domaini3117 – 3248(+)RNA virus helicase C-terminalAdd BLAST132

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Family and Domains’ section describes a region of interest that cannot be described in other subsections.<p><a href='/help/region' target='_top'>More...</a></p>Regioni981 – 1105HD1Add BLAST125
Regioni1289 – 1448HD2Add BLAST160
Regioni1737 – 1852HD3Add BLAST116

<p>This subsection of the ‘Family and domains’ section provides general information on the biological role of a domain. The term ‘domain’ is intended here in its wide acceptation, it may be a structural domain, a transmembrane region or a functional domain. Several domains are described in this subsection.<p><a href='/help/domain_cc' target='_top'>More...</a></p>Domaini

The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane.By similarity

<p>This subsection of the ‘Family and domains’ section provides information about the sequence similarity with other proteins.<p><a href='/help/sequence_similarities' target='_top'>More...</a></p>Sequence similaritiesi

Belongs to the arteriviridae polyprotein family.Curated

Zinc finger

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Zinc fingeri8 – 28C4-type; atypicalAdd BLAST21

Keywords - Domaini

Transmembrane, Transmembrane helix, Zinc-finger

Family and domain databases

Gene3D Structural and Functional Annotation of Protein Families

More...
Gene3Di
1.20.58.950, 1 hit
3.30.40.20, 1 hit
3.90.70.60, 1 hit
3.90.70.70, 1 hit

Integrated resource of protein families, domains and functional sites

More...
InterProi
View protein in InterPro
IPR027351 (+)RNA_virus_helicase_core_dom
IPR031932 Arteri_nsp7a
IPR038451 Arteri_nsp7a_sf
IPR008743 Arterivirus_Nsp2_C33
IPR023338 Arterivirus_NSP4_peptidase
IPR008741 AV_PCPalpha
IPR038155 AV_PCPalpha_sf
IPR025773 AV_PCPbeta
IPR038154 AV_PCPbeta_sf
IPR027355 AV_ZBD
IPR023183 Chymotrypsin-like_C
IPR022230 DUF3756
IPR008760 EAV_peptidase_S32
IPR037227 EndoU-like
IPR027417 P-loop_NTPase
IPR009003 Peptidase_S1_PA
IPR001205 RNA-dir_pol_C
IPR007094 RNA-dir_pol_PSvirus

Pfam protein domain database

More...
Pfami
View protein in Pfam
PF16749 Arteri_nsp7a, 1 hit
PF12581 DUF3756, 1 hit
PF05410 Peptidase_C31, 1 hit
PF05411 Peptidase_C32, 1 hit
PF05412 Peptidase_C33, 1 hit
PF05579 Peptidase_S32, 1 hit
PF00680 RdRP_1, 1 hit
PF01443 Viral_helicase1, 1 hit

Superfamily database of structural and functional annotation

More...
SUPFAMi
SSF142877 SSF142877, 1 hit
SSF50494 SSF50494, 1 hit
SSF52540 SSF52540, 2 hits

PROSITE; a protein domain and family database

More...
PROSITEi
View protein in PROSITE
PS51538 AV_CP, 1 hit
PS51493 AV_NSP4_PRO, 1 hit
PS51539 AV_PCP_ALPHA, 1 hit
PS51540 AV_PCP_BETA, 1 hit
PS51652 AV_ZBD, 1 hit
PS51657 PSRV_HELICASE, 1 hit
PS50507 RDRP_SSRNA_POS, 1 hit

<p>This section displays by default the canonical protein sequence and upon request all isoforms described in the entry. It also includes information pertinent to the sequence(s), including <a href="http://www.uniprot.org/help/sequence_length">length</a> and <a href="http://www.uniprot.org/help/sequences">molecular weight</a>. The information is filed in different subsections. The current subsections and their content are listed below:<p><a href='/help/sequences_section' target='_top'>More...</a></p>Sequences (2)i

<p>This subsection of the <a href="http://www.uniprot.org/help/sequences_section">Sequence</a> section indicates if the <a href="http://www.uniprot.org/help/canonical_and_isoforms">canonical sequence</a> displayed by default in the entry is complete or not.<p><a href='/help/sequence_status' target='_top'>More...</a></p>Sequence statusi: Complete.

<p>This subsection of the <a href="http://www.uniprot.org/help/sequences_section">Sequence</a> section indicates if the <a href="http://www.uniprot.org/help/canonical_and_isoforms">canonical sequence</a> displayed by default in the entry is in its mature form or if it represents the precursor.<p><a href='/help/sequence_processing' target='_top'>More...</a></p>Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 2 <p>This subsection of the ‘Sequence’ section lists the alternative protein sequences (isoforms) that can be generated from the same gene by a single or by the combination of up to four biological events (alternative promoter usage, alternative splicing, alternative initiation and ribosomal frameshifting). Additionally, this section gives relevant information on each alternative protein isoform.<p><a href='/help/alternative_products' target='_top'>More...</a></p> isoformsi produced by ribosomal frameshifting. AlignAdd to basket
Isoform Replicase polyprotein 1ab (identifier: Q83017-1) [UniParc]FASTAAdd to basket
Also known as: pp1ab

This isoform has been chosen as the <div> <p><b>What is the canonical sequence?</b><p><a href='/help/canonical_and_isoforms' target='_top'>More...</a></p>canonicali sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide
        10         20         30         40         50
MQSGFDRCLC TPNARVFWEH GQVYCTRCLA ARPLLPLSQQ NPRLGALGLF
60 70 80 90 100
YRPATPLTWE APITYPTKEC RPGGLCWLSG IYPIARMTSG NHNFQARLNF
110 120 130 140 150
VASVVYRDGK LTSKHLEEEF EVYSRGCRWY PITGPVPGIA LYANAVHVSD
160 170 180 190 200
EPFPGCTHVL SNLPLPQQPL RKGLCPFSDA RAEVWRYKGN TIFVSEQGYL
210 220 230 240 250
WTTGSNDSVP EPWGEARRLC EKIIASLPAD HLVKIEFSNY PFDYSFTGGD
260 270 280 290 300
GAGYVLFPCK KNDTKFSKCW EKVFEDHSSW KVACEEADLA DRMGYRTPAG
310 320 330 340 350
VAGPYLARRL QYRGLRAVVK PEQNDYVVWA LGVPESYIRH ISRAGEPVEN
360 370 380 390 400
FFVRVGEFSI VSNCVATPYP KFRFQTRKYY GYSPPGDGAC GLHCISAIIN
410 420 430 440 450
DIFGDALCTK LTNCSRDSSE WLSDQDMYQL VMTARLPATL GHCPSATYKL
460 470 480 490 500
DCVNQHWTVT KRKGDRALGG LSPECVRGVC GGECKFVPTY PREINLELAA
510 520 530 540 550
KSPISALAFS LGVEPYCDCW NFTNSVLVND SLAVETARAG EAYRSAMGIP
560 570 580 590 600
KDDWVLLAEL MTENCLTRRE VLDKLQRGLR LHATSKPGSP ASVSPASSID
610 620 630 640 650
FSAAGLLLDG TESDKEAVVA VNNDCYTVLG FDKNSATKSE QELATGLFSE
660 670 680 690 700
LVEPMETSTS KHESRKILEA ASRALKSAKP KRKRNKKKKT SSPTPTPPET
710 720 730 740 750
PTREVPGAIE VVSGDEEAGA CESATIVPDK AQARPPPRPK RQALKKAEQG
760 770 780 790 800
FILKDIIWNP TESGVKCLTI VEDVRAFLKS ITPPGGALGT RARITAHIVE
810 820 830 840 850
QFHVIRESTP ELVLAHAEHQ AKNMHELLLS EKAKLILGIG EDTLKKLVSS
860 870 880 890 900
QRSLPRSIGF GAWLSDQQKT ADSCGEREFV EVPLKSGAEP TPSKRDLGVS
910 920 930 940 950
LGDQLSQDGA PRLSSSTACE IKERVPPIKD SGGGLGQKFM AWLNHQVFLL
960 970 980 990 1000
SSHLLAMWSV VLGSRQKLNW ADYVYTLFCL CCVLLCFHFP AIGFIPLAGC
1010 1020 1030 1040 1050
VFGSPWRVRL SVFSVWLCVA VVVFQEVLPE PGSVCSSASA ECAAALERYS
1060 1070 1080 1090 1100
GNGVHRPVNH IGVGLVGTVA GFVARVVGGP RHYWFYFLRL MVVLDLGLVF
1110 1120 1130 1140 1150
LAVALRGRCK KCFCKCVRVA PHEVHLRVFP LTKVARPTLE AVCDMYSAPR
1160 1170 1180 1190 1200
VDPILVATGI KGCWQGKVSP HQVTDKPVSY SNLEEKKISN KTVVPPPTDP
1210 1220 1230 1240 1250
QQAVKCLKVL QCGGSIQDVG VPEVKKVSKV PYKAPFFPNV SIDPECYIVV
1260 1270 1280 1290 1300
DPVTYSAAMR GGYGVSHLIV GTGDFAEVNG LRFVSGGHVA DFVCLGLYVM
1310 1320 1330 1340 1350
LNFLISAWLS SPVSCGRGTN DPWCKNPFSY PVVGQGVMCN SHLCISEDGL
1360 1370 1380 1390 1400
TSPMVLSYSL IDWALMIAVI ATVAIFIAKV SLLVDVICVF LCLLMYVFPP
1410 1420 1430 1440 1450
LSVIAFAFPF ALCKVHLHPV TLVWVQFFLL AVNFWAGVAV AVILISSWFL
1460 1470 1480 1490 1500
ARATSSTGLV TPYDVHLVTS TPRGASSLAS APEGTYLAAV RRSALTGRCC
1510 1520 1530 1540 1550
MFVPTNFGSV LEGSLRTRGC AKNVVSVFGS ASGSGGVFTI HGNPVVVTAT
1560 1570 1580 1590 1600
HLLSDGKARV SCVGFSQCLT FKSVGDYAFA RVAEWKGDAP KVELSDRRGR
1610 1620 1630 1640 1650
AYCSPQVEWS LVLLGPNTAF CFTKCGDSGS PVVDEDGNLI GVHTGSNKRG
1660 1670 1680 1690 1700
SGMITTHNGK TLGMSNVKLS EMCQHYGGSG VPVSTVRLPK HLIVDVEAVA
1710 1720 1730 1740 1750
SDLVAVVESL PTPEGALSSV QLLCVFFFLW RLIHVPFVPV IAVAFFFLNE
1760 1770 1780 1790 1800
ILPVVLARLM FSFALSLFSV FTGFSVQVLL LRLVIAALNR SAVSFGSFLL
1810 1820 1830 1840 1850
GQLFHCCLMP SHLETLGPVP GYFYPSTTEV ASKEIFVTLL AIHVLALLLS
1860 1870 1880 1890 1900
LFKRPMLADV LVGNGSFDAA FFLKYFAEGN LRDGVSDSCN MTPEGLTAAL
1910 1920 1930 1940 1950
AITLSDDDLE FLQRHSEFKC FVSASNMRNG AKEFIESAYA RALRAQLAAT
1960 1970 1980 1990 2000
DKIKASKSIL AKLESFAGGV VTKVEPGDVV VVLGKKIVGD LVEITINDVK
2010 2020 2030 2040 2050
HVIRVIETRV MAGTQFSVGT ICGDLENACE DPSGLVKTSK KQRRRQKRTG
2060 2070 2080 2090 2100
LGTEVVGTVE IDGVSYNKVW HKATGDVTYE GFLVSENSRL RTLGTSAIGR
2110 2120 2130 2140 2150
FQEFIRKHGS KVKTSVEKYP VGKNKHIEFA VTTYNLDGEE FDVPDHEPLE
2160 2170 2180 2190 2200
WTITIGDSDL EAERLTVDQA LRHMGHDSLL TPKEKEKLAR IIESLNGLQQ
2210 2220 2230 2240 2250
SSALNCLTTS GLERCSRGGV TVSKDAVKIV KYHSRTFSIG DVNLKVMSFD
2260 2270 2280 2290 2300
EYRRTMGKPG HLLVAKLTDG VVVMRKHEPS LVDVILTGED AEFFPRTHGP
2310 2320 2330 2340 2350
GNTGIHRFVW DFESPPVDLE LELSEQIITA CSMRRGDAPA LDLPYKLHPV
2360 2370 2380 2390 2400
RGDPYRHRGV LFNTRFGDIT YLIPEKTKEP LHAAACYNKG VPVSDSETLV
2410 2420 2430 2440 2450
ATTLPHGFEL YVPTLPPSVL EYLDSRPDTP RMLTKHGCAS AAEKDLQKFD
2460 2470 2480 2490 2500
LSRQGFVLPG VLYMVRRYLS RLIGVRRRLF MPSTYPAKNS MAGINGGRFP
2510 2520 2530 2540 2550
LTWLQSHPDI DALCKRACEE HWQTVTPCTL KKQYCSKSKT RTILGTNNFV
2560 2570 2580 2590 2600
ALGLRSALSG VTQGFMRKGI GTPICLGKNK FTPLPVRIGG RCLEADLASC
2610 2620 2630 2640 2650
DRSTPAIIRW FTTNLLFELA GAEEWIPSYV LNCCHDVVST MSGCFDKRGG
2660 2670 2680 2690 2700
LSSGDPVTSI SNTVYSLIIY AQHMVLSAFR CGHKIGGLFL QDSLEMEQLF
2710 2720 2730 2740 2750
ELQPLLVYSD DVVFYNESDE LPNYHFFVDH LDLMLGFKTD RSKTVITSEP
2760 2770 2780 2790 2800
KLPGCRISGG RVLVPQRDRI VAALAYQMKA SCVGEYFASA AAILMDACAC
2810 2820 2830 2840 2850
CDHDESWYFD LVCGIAECAG SPWFRFPGPS FFLDMWNRLS AEEKKKCRTC
2860 2870 2880 2890 2900
AHCGAPATLV SSCGLNLCDY HGHGHPHCPV VLPCGHAVGS GVCEQCSSSA
2910 2920 2930 2940 2950
MNLNTELDIL LMCVPYHPPK VELLSVNDKV SSLPPGAYQA RGGVVSVRRD
2960 2970 2980 2990 3000
ILGNVVDLPD GDYQVMKVAQ TCADISMVSV NSNILRSQFV TGAPGTGKTT
3010 3020 3030 3040 3050
YLLSVVRDDD VIYTPTHRTM LDVVKALKVC RFDPPKDTPL EFPVPGRTGP
3060 3070 3080 3090 3100
TVRLIGAGFV PGRVSYLDEA AYCNPLDVLK VLSKTPLVCV GDLNQLPPVG
3110 3120 3130 3140 3150
FNGPCFAFSL MPGRQLIEVF RFGPAVVNSI KKFYKEELVP RGPDTGVKFL
3160 3170 3180 3190 3200
KQYQPYGQVL TPYHRDRVDG AITIDSSQGC TYDVVTVYLP TPKSLNSARA
3210 3220 3230 3240 3250
LVALTRARHY VFIYDPYDQL QQYLQVFEHE PADAWAFWCG DQPKMIVGGV
3260 3270 3280 3290 3300
VKQLAGHSRT TDLKLQQLMG LEGTASPLPQ VGHNLGFYYS PDLIQFAKIP
3310 3320 3330 3340 3350
PELCKHWPVV TAQNRTEWPD RLVCGMNKMD KNSRAVFCAG YYVGPSIFLG
3360 3370 3380 3390 3400
VPGVVSYYLT KYLKGESVPL PDSIMSTGRI RLNVREYLDE NEIEFAKKCP
3410 3420 3430 3440 3450
QPFIGEVKGS NVGGCHHVTS RFLPPVLVPG SVVKVGVSCP GKAAKGLCTV
3460 3470 3480 3490 3500
TDVYLPELDS YLHPPSKSMD YKLLVDFQPV KLMVWKDATA YFHEGIRPME
3510 3520 3530 3540 3550
AMSRFLKVPE GEGVFFDLDE FVTNAKVSKL PCKYSVSAHQ FLTEVVLSMT
3560 3570 3580 3590 3600
PTSEAPPDYE LLFARAYCVP GLDVGTLNAY IYKRGPSTYT TSNFARLVKD
3610
TAVPVGCKGS GYMFPK
Note: Produced by -1 ribosomal frameshifting at the 1a-1b genes boundary.
Length:3,616
Mass (Da):396,434
Last modified:April 12, 2005 - v2
<p>The checksum is a form of redundancy check that is calculated from the sequence. It is useful for tracking sequence updates.</p> <p>It should be noted that while, in theory, two different sequences could have the same checksum value, the likelihood that this would happen is extremely low.</p> <p>However UniProtKB may contain entries with identical sequences in case of multiple genes (paralogs).</p> <p>The checksum is computed as the sequence 64-bit Cyclic Redundancy Check value (CRC64) using the generator polynomial: x<sup>64</sup> + x<sup>4</sup> + x<sup>3</sup> + x + 1. The algorithm is described in the ISO 3309 standard. </p> <p class="publication">Press W.H., Flannery B.P., Teukolsky S.A. and Vetterling W.T.<br /> <strong>Cyclic redundancy and other checksums</strong><br /> <a href="http://www.nrbook.com/b/bookcpdf.php">Numerical recipes in C 2nd ed., pp896-902, Cambridge University Press (1993)</a>)</p> Checksum:i6167A231CDE7B611
GO
Isoform Replicase polyprotein 1a (identifier: Q83017-2) [UniParc]FASTAAdd to basket
Also known as: pp1a, ORF1a polyprotein

The sequence of this isoform differs from the canonical sequence as follows:
     2207-3616: Missing.

Note: Produced by conventional translation.
Show »
Length:2,206
Mass (Da):240,639
Checksum:iF1D239FE11780422
GO

<p>This subsection of the ‘Sequence’ section reports difference(s) between the protein sequence shown in the UniProtKB entry and other available protein sequences derived from the same gene.<p><a href='/help/sequence_caution' target='_top'>More...</a></p>Sequence cautioni

The sequence AAA85664 differs from that shown. Reason: Erroneous initiation.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Sequence’ section reports difference(s) between the canonical sequence (displayed by default in the entry) and the different sequence submissions merged in the entry. These various submissions may originate from different sequencing projects, different types of experiments, or different biological samples. Sequence conflicts are usually of unknown origin.<p><a href='/help/conflict' target='_top'>More...</a></p>Sequence conflicti1185E → K (PubMed:1870216).Curated1
Sequence conflicti2748 – 2753SEPKLP → QNPNFL (PubMed:1870216).Curated6
Sequence conflicti2771 – 2777VAALAYQ → RRGASVS (PubMed:1870216).Curated7
Sequence conflicti2820 – 2823GSPW → RKEG (PubMed:1870216).Curated4
Sequence conflicti2843E → K (PubMed:1870216).Curated1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Sequence’ section describes the sequence of naturally occurring alternative protein isoform(s). The changes in the amino acid sequence may be due to alternative splicing, alternative promoter usage, alternative initiation, or ribosomal frameshifting.<p><a href='/help/var_seq' target='_top'>More...</a></p>Alternative sequenceiVSP_0328892207 – 3616Missing in isoform Replicase polyprotein 1a. CuratedAdd BLAST1410

Sequence databases

Select the link destinations:

EMBL nucleotide sequence database

More...
EMBLi

GenBank nucleotide sequence database

More...
GenBanki

DNA Data Bank of Japan; a nucleotide sequence database

More...
DDBJi
Links Updated
U15146 Genomic RNA Translation: AAA85663.1
U15146 Genomic RNA Translation: AAA85664.1 Different initiation.
S69379 Genomic RNA Translation: AAB30448.1
S50064 mRNA Translation: AAB19478.1
S50068 mRNA Translation: AAB19479.1

Protein sequence database of the Protein Information Resource

More...
PIRi
B40901
C40901
JQ1998
PQ0618

Keywords - Coding sequence diversityi

Ribosomal frameshifting

<p>This section provides links to proteins that are similar to the protein sequence(s) described in this entry at different levels of sequence identity thresholds (100%, 90% and 50%) based on their membership in UniProt Reference Clusters (<a href="http://www.uniprot.org/help/uniref">UniRef</a>).<p><a href='/help/similar_proteins_section' target='_top'>More...</a></p>Similar proteinsi

<p>This section is used to point to information related to entries and found in data collections other than UniProtKB.<p><a href='/help/cross_references_section' target='_top'>More...</a></p>Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U15146 Genomic RNA Translation: AAA85663.1
U15146 Genomic RNA Translation: AAA85664.1 Different initiation.
S69379 Genomic RNA Translation: AAB30448.1
S50064 mRNA Translation: AAB19478.1
S50068 mRNA Translation: AAB19479.1
PIRiB40901
C40901
JQ1998
PQ0618

3D structure databases

Database of comparative protein structure models

More...
ModBasei
Search...

SWISS-MODEL Interactive Workspace

More...
SWISS-MODEL-Workspacei
Submit a new modelling project...

Protein family/group databases

MEROPSiC31.001

Proteomic databases

PRIDEiQ83017

Family and domain databases

Gene3Di1.20.58.950, 1 hit
3.30.40.20, 1 hit
3.90.70.60, 1 hit
3.90.70.70, 1 hit
InterProiView protein in InterPro
IPR027351 (+)RNA_virus_helicase_core_dom
IPR031932 Arteri_nsp7a
IPR038451 Arteri_nsp7a_sf
IPR008743 Arterivirus_Nsp2_C33
IPR023338 Arterivirus_NSP4_peptidase
IPR008741 AV_PCPalpha
IPR038155 AV_PCPalpha_sf
IPR025773 AV_PCPbeta
IPR038154 AV_PCPbeta_sf
IPR027355 AV_ZBD
IPR023183 Chymotrypsin-like_C
IPR022230 DUF3756
IPR008760 EAV_peptidase_S32
IPR037227 EndoU-like
IPR027417 P-loop_NTPase
IPR009003 Peptidase_S1_PA
IPR001205 RNA-dir_pol_C
IPR007094 RNA-dir_pol_PSvirus
PfamiView protein in Pfam
PF16749 Arteri_nsp7a, 1 hit
PF12581 DUF3756, 1 hit
PF05410 Peptidase_C31, 1 hit
PF05411 Peptidase_C32, 1 hit
PF05412 Peptidase_C33, 1 hit
PF05579 Peptidase_S32, 1 hit
PF00680 RdRP_1, 1 hit
PF01443 Viral_helicase1, 1 hit
SUPFAMiSSF142877 SSF142877, 1 hit
SSF50494 SSF50494, 1 hit
SSF52540 SSF52540, 2 hits
PROSITEiView protein in PROSITE
PS51538 AV_CP, 1 hit
PS51493 AV_NSP4_PRO, 1 hit
PS51539 AV_PCP_ALPHA, 1 hit
PS51540 AV_PCP_BETA, 1 hit
PS51652 AV_ZBD, 1 hit
PS51657 PSRV_HELICASE, 1 hit
PS50507 RDRP_SSRNA_POS, 1 hit

ProtoNet; Automatic hierarchical classification of proteins

More...
ProtoNeti
Search...

MobiDB: a database of protein disorder and mobility annotations

More...
MobiDBi
Search...

<p>This section provides general information on the entry.<p><a href='/help/entry_information_section' target='_top'>More...</a></p>Entry informationi

<p>This subsection of the ‘Entry information’ section provides a mnemonic identifier for a UniProtKB entry, but it is not a stable identifier. Each reviewed entry is assigned a unique entry name upon integration into UniProtKB/Swiss-Prot.<p><a href='/help/entry_name' target='_top'>More...</a></p>Entry nameiRPOA_LDVP
<p>This subsection of the ‘Entry information’ section provides one or more accession number(s). These are stable identifiers and should be used to cite UniProtKB entries. Upon integration into UniProtKB, each entry is assigned a unique accession number, which is called ‘Primary (citable) accession number’.<p><a href='/help/accession_numbers' target='_top'>More...</a></p>AccessioniPrimary (citable) accession number: Q83017
Secondary accession number(s): Q83018
, Q83024, Q83025, Q86716
<p>This subsection of the ‘Entry information’ section shows the date of integration of the entry into UniProtKB, the date of the last sequence update and the date of the last annotation modification (‘Last modified’). The version number for both the entry and the <a href="http://www.uniprot.org/help/canonical_and_isoforms">canonical sequence</a> are also displayed.<p><a href='/help/entry_history' target='_top'>More...</a></p>Entry historyiIntegrated into UniProtKB/Swiss-Prot: April 12, 2005
Last sequence update: April 12, 2005
Last modified: July 31, 2019
This is version 118 of the entry and version 2 of the sequence. See complete history.
<p>This subsection of the ‘Entry information’ section indicates whether the entry has been manually annotated and reviewed by UniProtKB curators or not, in other words, if the entry belongs to the Swiss-Prot section of UniProtKB (<strong>reviewed</strong>) or to the computer-annotated TrEMBL section (<strong>unreviewed</strong>).<p><a href='/help/entry_status' target='_top'>More...</a></p>Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

<p>This section contains any relevant information that doesn’t fit in any other defined sections<p><a href='/help/miscellaneous_section' target='_top'>More...</a></p>Miscellaneousi

Documents

  1. SIMILARITY comments
    Index of protein domains and families
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again