Entry version 116 (11 Dec 2019)
Sequence version 2 (12 Apr 2005)
Protein: Replicase polyprotein 1ab
Gene: rep
Organism: Lactate dehydrogenase elevating virus (strain C) (LDV)
Status: Reviewed
Annotation score: 5 out of 5
Protein existence: Inferred from homology

Function

The replicase polyprotein 1ab is a multifunctional protein: it contains the activities necessary for the transcription of negative-stranded RNA, leader RNA, subgenomic mRNAs and progeny virion RNA, as well as proteinases responsible for the cleavage of the polyprotein into functional products.
The Nsp1 chain is essential for viral subgenomic mRNA synthesis. (By similarity)
The 3C-like serine proteinase chain is responsible for the majority of cleavages as it cleaves the C-terminus of the polyprotein. (By similarity)
The helicase chain, which contains a zinc finger structure, displays RNA and DNA duplex-unwinding activities with 5' to 3' polarity. (By similarity)

Sites

Feature key | Position(s) | Description | Evidence | Length
Active site | 76 | For Nsp1-alpha papain-like cysteine proteinase activity | PROSITE-ProRule annotation | 1
Active site | 147 | For Nsp1-alpha papain-like cysteine proteinase activity | PROSITE-ProRule annotation | 1
Active site | 269 | For Nsp1-beta papain-like cysteine proteinase activity | PROSITE-ProRule annotation | 1
Active site | 340 | For Nsp1-beta papain-like cysteine proteinase activity | PROSITE-ProRule annotation | 1
Active site | 390 | For Nsp2 cysteine proteinase activity | PROSITE-ProRule annotation | 1
Active site | 456 | For Nsp2 cysteine proteinase activity | PROSITE-ProRule annotation | 1
Active site | 1549 | Charge relay system; for 3C-like serine proteinase activity | PROSITE-ProRule annotation | 1
Active site | 1574 | Charge relay system; for 3C-like serine proteinase activity | PROSITE-ProRule annotation | 1
Active site | 1626 | Charge relay system; for 3C-like serine proteinase activity | PROSITE-ProRule annotation | 1
Metal binding | 2871 | Zinc 1 | PROSITE-ProRule annotation | 1
Metal binding | 2874 | Zinc 1 | PROSITE-ProRule annotation | 1
Metal binding | 2884 | Zinc 2 | PROSITE-ProRule annotation | 1
Metal binding | 2889 | Zinc 1 | PROSITE-ProRule annotation | 1
Metal binding | 2892 | Zinc 1 | PROSITE-ProRule annotation | 1
Metal binding | 2894 | Zinc 2 | PROSITE-ProRule annotation | 1
Metal binding | 2896 | Zinc 2 | PROSITE-ProRule annotation | 1
Metal binding | 2898 | Zinc 2 | PROSITE-ProRule annotation | 1
Metal binding | 2905 | Zinc 3 | PROSITE-ProRule annotation | 1
Metal binding | 2907 | Zinc 3 | PROSITE-ProRule annotation | 1
Metal binding | 2914 | Zinc 3 | PROSITE-ProRule annotation | 1
Metal binding | 2917 | Zinc 3 | PROSITE-ProRule annotation | 1
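
The metal-binding rows above interleave the ligands of three zinc ions. As a minimal sketch (the names `ZINC_LIGANDS` and `group_by_ion` are illustrative and not part of the entry), the positions can be grouped by the ion they coordinate:

```python
from collections import defaultdict

# (position, ion label) pairs copied from the metal-binding rows above.
ZINC_LIGANDS = [
    (2871, "Zinc 1"), (2874, "Zinc 1"), (2884, "Zinc 2"), (2889, "Zinc 1"),
    (2892, "Zinc 1"), (2894, "Zinc 2"), (2896, "Zinc 2"), (2898, "Zinc 2"),
    (2905, "Zinc 3"), (2907, "Zinc 3"), (2914, "Zinc 3"), (2917, "Zinc 3"),
]


def group_by_ion(ligands):
    """Return a dict mapping each ion label to its ligand positions, in input order."""
    grouped = defaultdict(list)
    for position, ion in ligands:
        grouped[ion].append(position)
    return dict(grouped)


zinc_sites = group_by_ion(ZINC_LIGANDS)
for ion, positions in sorted(zinc_sites.items()):
    print(ion, positions)
```

With the rows as listed, each of the three ions ends up with four annotated ligand positions.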

Regions

Feature key | Position(s) | Description | Evidence | Length
Zinc finger | 8 – 28 | C4-type; atypical | | 21
Nucleotide binding | 3013 – 3020 | ATP | By similarity | 8

Keywords

Molecular function: Helicase, Hydrolase, Nucleotidyltransferase, Protease, RNA-directed RNA polymerase, Serine protease, Thiol protease, Transferase
Biological process: Viral RNA replication
Ligand: ATP-binding, Metal-binding, Nucleotide-binding, Zinc

Names & Taxonomy

Protein names
Recommended name: Replicase polyprotein 1ab
Alternative name(s): ORF1ab polyprotein
Cleaved into the following 15 chains:
  1. Nsp1-alpha papain-like cysteine proteinase (EC:3.4.22.-). Alternative name(s): PCP1-alpha
  2. Nsp1-beta papain-like cysteine proteinase (EC:3.4.22.-). Alternative name(s): PCP1-beta
  3. Nsp2 cysteine proteinase (EC:3.4.22.-). Alternative name(s): CP2. Short name: CP
  4. Non-structural protein 3. Short name: Nsp3
  5. 3C-like serine proteinase (EC:3.4.21.-). Short name: 3CLSP. Alternative name(s): Nsp4
  6. Non-structural protein 5-6-7. Short name: Nsp5-6-7
  7. Non-structural protein 5. Short name: Nsp5
  8. Non-structural protein 6. Short name: Nsp6
  9. Non-structural protein 7-alpha. Short name: Nsp7-alpha
  10. Non-structural protein 7-beta. Short name: Nsp7-beta
  11. Non-structural protein 8. Short name: Nsp8
  12. RNA-directed RNA polymerase (EC:2.7.7.48). Short names: Pol, RdRp. Alternative name(s): Nsp9
  13. Helicase (EC:3.6.4.12, EC:3.6.4.13). Short name: Hel. Alternative name(s): Nsp10
  14. Non-structural protein 11. Short name: Nsp11
  15. Non-structural protein 12. Short name: Nsp12
Gene names
Name: rep
ORF names: 1a-1b
Organism: Lactate dehydrogenase elevating virus (strain C) (LDV)
Taxonomic identifier: 300015 [NCBI]
Taxonomic lineage: Viruses > Riboviria > Nidovirales > Arnidovirineae > Arteriviridae > Variarterivirinae > Gammaarterivirus > Gammaarterivirus lacdeh
Virus host: Mus musculus domesticus (western European house mouse) [TaxID: 10092]

Subcellular location

Topology

Feature key | Position(s) | Description | Evidence | Length
Transmembrane | 940 – 960 | Helical | Sequence analysis | 21
Transmembrane | 981 – 1001 | Helical | Sequence analysis | 21
Transmembrane | 1083 – 1103 | Helical | Sequence analysis | 21
Transmembrane | 1287 – 1307 | Helical | Sequence analysis | 21
Transmembrane | 1362 – 1382 | Helical | Sequence analysis | 21
Transmembrane | 1390 – 1410 | Helical | Sequence analysis | 21
Transmembrane | 1423 – 1443 | Helical | Sequence analysis | 21
Transmembrane | 1735 – 1755 | Helical | Sequence analysis | 21
Transmembrane | 1761 – 1781 | Helical | Sequence analysis | 21
Transmembrane | 1801 – 1821 | Helical | Sequence analysis | 21
Transmembrane | 1824 – 1844 | Helical | Sequence analysis | 21
Transmembrane | 1853 – 1873 | Helical | Sequence analysis | 21

Keywords - Cellular component: Host cytoplasm, Host membrane, Membrane

PTM / Processing

Molecule processing

Feature key | Identifier | Position(s) | Description | Evidence | Length
Chain | PRO_0000036631 | 1 – 3637 | Replicase polyprotein 1ab | | 3637
Chain | PRO_0000036633 | 1 – ? | Nsp1-alpha papain-like cysteine proteinase | By similarity |
Chain | PRO_0000036634 | ? – 380 | Nsp1-beta papain-like cysteine proteinase | By similarity |
Chain | PRO_0000036635 | 381 – 1284 | Nsp2 cysteine proteinase | By similarity | 904
Chain | PRO_0000036636 | 1285 – 1510 | Non-structural protein 3 | By similarity | 226
Chain | PRO_0000036637 | 1511 – 1712 | 3C-like serine proteinase | By similarity | 202
Chain | PRO_0000036638 | 1713 – 2181 | Non-structural protein 5-6-7 | By similarity | 469
Chain | PRO_0000423110 | 1713 – 1898 | Non-structural protein 5 | By similarity | 186
Chain | PRO_0000423111 | 1899 – 1914 | Non-structural protein 6 | By similarity | 16
Chain | PRO_0000423112 | 1915 – 2046 | Non-structural protein 7-alpha | By similarity | 132
Chain | PRO_0000423113 | 2047 – 2181 | Non-structural protein 7-beta | By similarity | 135
Chain | PRO_0000036639 | 2182 – 2864 | RNA-directed RNA polymerase | By similarity | 683
Chain | PRO_0000036640 | 2182 – 2226 | Non-structural protein 8 | By similarity | 45
Chain | PRO_0000036641 | 2865 – 3293 | Helicase | By similarity | 429
Chain | PRO_0000036642 | 3294 – 3515 | Non-structural protein 11 | By similarity | 222
Chain | PRO_0000036643 | 3516 – 3637 | Non-structural protein 12 | By similarity | 122
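
The Length column follows from the 1-based, inclusive positions: length = end - start + 1. A minimal sketch that checks this for the chains with both boundaries annotated (the `CHAINS` list and the helper name `span_length` are illustrative, and the two Nsp1 chains with a "?" boundary are left out):

```python
# (description, start, end, annotated length) copied from the chain rows above.
CHAINS = [
    ("Replicase polyprotein 1ab", 1, 3637, 3637),
    ("Nsp2 cysteine proteinase", 381, 1284, 904),
    ("Non-structural protein 3", 1285, 1510, 226),
    ("3C-like serine proteinase", 1511, 1712, 202),
    ("Non-structural protein 5-6-7", 1713, 2181, 469),
    ("Non-structural protein 5", 1713, 1898, 186),
    ("Non-structural protein 6", 1899, 1914, 16),
    ("Non-structural protein 7-alpha", 1915, 2046, 132),
    ("Non-structural protein 7-beta", 2047, 2181, 135),
    ("RNA-directed RNA polymerase", 2182, 2864, 683),
    ("Non-structural protein 8", 2182, 2226, 45),
    ("Helicase", 2865, 3293, 429),
    ("Non-structural protein 11", 3294, 3515, 222),
    ("Non-structural protein 12", 3516, 3637, 122),
]


def span_length(start, end):
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


# Collect any chain whose annotated length disagrees with its positions.
mismatches = [
    name for name, start, end, length in CHAINS
    if span_length(start, end) != length
]
print(mismatches)
```

An empty list means every annotated length agrees with its start and end positions.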

Post-translational modification

Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. There are two alternative pathways for processing: either nsp4-5 is cleaved, which represents the major pathway, or nsp5-6 and nsp6-7 are processed, which represents the minor pathway. The major pathway occurs when nsp2 acts as a cofactor for nsp4. (By similarity)

Sites

Feature key | Position(s) | Description | Evidence | Length
Site | 181 – 182 | Cleavage; by autolysis | Sequence analysis | 2
Site | 381 – 382 | Cleavage; by autolysis | Sequence analysis | 2
Site | 1284 – 1285 | Cleavage; by CP2 | By similarity | 2
Site | 1510 – 1511 | Cleavage; by 3CLSP | By similarity | 2
Site | 1712 – 1713 | Cleavage; by 3CLSP | By similarity | 2
Site | 1898 – 1899 | Cleavage; by 3CLSP | By similarity | 2
Site | 1914 – 1915 | Cleavage; by 3CLSP | By similarity | 2
Site | 2046 – 2047 | Cleavage; by 3CLSP | By similarity | 2
Site | 2181 – 2182 | Cleavage; by 3CLSP | By similarity | 2
Site | 2864 – 2865 | Cleavage; by 3CLSP | By similarity | 2
Site | 3293 – 3294 | Cleavage; by 3CLSP | By similarity | 2
Site | 3515 – 3516 | Cleavage; by 3CLSP | By similarity | 2
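
Each cleavage site is given as the pair of residues flanking the scissile bond, so the segments released by complete cleavage can be read off by walking the sites in order. The following is a rough sketch under that reading (the function name `segments_between` is hypothetical). It deliberately ignores the two alternative processing pathways, and its output is not guaranteed to agree with every boundary in the chain table, for example around the Nsp1-beta/Nsp2 junction:

```python
# (last residue before the bond, first residue after it), from the site rows above.
CLEAVAGE_SITES = [
    (181, 182), (381, 382), (1284, 1285), (1510, 1511), (1712, 1713),
    (1898, 1899), (1914, 1915), (2046, 2047), (2181, 2182), (2864, 2865),
    (3293, 3294), (3515, 3516),
]
POLYPROTEIN_LENGTH = 3637


def segments_between(sites, total_length):
    """Return (start, end) ranges, 1-based inclusive, left after cutting at every site."""
    segments = []
    start = 1
    for before, after in sorted(sites):
        segments.append((start, before))
        start = after
    segments.append((start, total_length))
    return segments


segments = segments_between(CLEAVAGE_SITES, POLYPROTEIN_LENGTH)
for start, end in segments:
    print(start, end, end - start + 1)
```

Twelve sites yield thirteen segments, and the segments tile the polyprotein without gaps or overlaps.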

Proteomic databases

PRIDE (PRoteomics IDEntifications database): Q06502

Family & Domains

Domains and Repeats

Feature key | Position(s) | Description | Evidence | Length
Domain | 69 – 181 | Peptidase C31 | PROSITE-ProRule annotation | 113
Domain | 262 – 381 | Peptidase C32 | PROSITE-ProRule annotation | 120
Domain | 381 – 486 | Peptidase C33 | PROSITE-ProRule annotation | 106
Domain | 1511 – 1712 | Peptidase S32 | PROSITE-ProRule annotation | 202
Domain | 2611 – 2745 | RdRp catalytic | PROSITE-ProRule annotation | 135
Domain | 2865 – 2928 | AV ZBD | PROSITE-ProRule annotation | 64
Domain | 2985 – 3137 | (+)RNA virus helicase ATP-binding | | 153
Domain | 3138 – 3269 | (+)RNA virus helicase C-terminal | | 132

Region

Feature key | Position(s) | Description | Length
Region | 69 – 183 | PCP1-alpha | 115
Region | 262 – 380 | PCP1-beta | 119
Region | 979 – 1103 | HD1 | 125
Region | 1287 – 1446 | HD2 | 160
Region | 1735 – 1872 | HD3 | 138

Domain

The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane. (By similarity)

Sequence similarities

Belongs to the arteriviridae polyprotein family. (Curated)

Keywords - Domain: Transmembrane, Transmembrane helix, Zinc-finger

Family and domain databases

Gene3D (Structural and Functional Annotation of Protein Families):
1.20.58.950, 1 hit
3.30.40.20, 1 hit
3.90.70.60, 1 hit
3.90.70.70, 1 hit

InterPro (Integrated resource of protein families, domains and functional sites):
IPR027351 (+)RNA_virus_helicase_core_dom
IPR031932 Arteri_nsp7a
IPR038451 Arteri_nsp7a_sf
IPR008743 Arterivirus_Nsp2_C33
IPR023338 Arterivirus_NSP4_peptidase
IPR008741 AV_PCPalpha
IPR038155 AV_PCPalpha_sf
IPR025773 AV_PCPbeta
IPR038154 AV_PCPbeta_sf
IPR027355 AV_ZBD
IPR023183 Chymotrypsin-like_C
IPR022230 DUF3756
IPR008760 EAV_peptidase_S32
IPR037227 EndoU-like
IPR027417 P-loop_NTPase
IPR009003 Peptidase_S1_PA
IPR001205 RNA-dir_pol_C
IPR007094 RNA-dir_pol_PSvirus

Pfam (protein domain database):
PF16749 Arteri_nsp7a, 1 hit
PF12581 DUF3756, 1 hit
PF05410 Peptidase_C31, 1 hit
PF05411 Peptidase_C32, 1 hit
PF05412 Peptidase_C33, 1 hit
PF05579 Peptidase_S32, 1 hit
PF00680 RdRP_1, 1 hit
PF01443 Viral_helicase1, 1 hit

SUPFAM (Superfamily database of structural and functional annotation):
SSF142877 SSF142877, 1 hit
SSF50494 SSF50494, 1 hit
SSF52540 SSF52540, 1 hit

PROSITE (a protein domain and family database):
PS51538 AV_CP, 1 hit
PS51493 AV_NSP4_PRO, 1 hit
PS51539 AV_PCP_ALPHA, 1 hit
PS51540 AV_PCP_BETA, 1 hit
PS51652 AV_ZBD, 1 hit
PS51657 PSRV_HELICASE, 1 hit
PS50507 RDRP_SSRNA_POS, 1 hit

Sequences (2)

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

This entry describes 2 isoforms produced by ribosomal frameshifting.
Isoform Replicase polyprotein 1ab (identifier: Q06502-1)
Also known as: pp1ab

This isoform has been chosen as the canonical sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MQSGFDRCLC TPNARVFWER GQVYCTRCLA ARPLLPLSQQ HPRLGALGLF
60 70 80 90 100
YRPASPLSWE APVTYPTKEC RPGGMCWLSS IYPIARMTSG NHNFQARLNF
110 120 130 140 150
IASVVYRDGK LTSKHLEEDF EVYSRGCRWY PITGPVPGIA LYANAVHVSD
160 170 180 190 200
ESFPGATHVL SNLPLPQQPL RKGLCPFADA RANVWRYKGN TVFVSPQGYL
210 220 230 240 250
WTTGSNDSVP EPWGEDRRLC EKIISSLPAD HLVKINFSNY PFDYSFTGGD
260 270 280 290 300
GAGFVVFPCK ERDTKFSKCW EKIFEDHSGW MAACEEADLA DRMGYRTPAG
310 320 330 340 350
VAGPYLARRL QVRGLRAVVK PENNDYIVWA LGVPESYIRH VSRAGEPVEE
360 370 380 390 400
FFVKVGEFSI VSNCVVTPHP KFRFQTRKYY GYSPPGDGAC GLHCISAMLN
410 420 430 440 450
DIFGDSFTTR LGKCSRDSSE WLSDQDLYQL VMTANLPATI GHCPSAIYKL
460 470 480 490 500
DCVNQHWTVT KRKGDRAVGR LAPDCLRGVC GECEMGIHIG ADTDLSPIVE
510 520 530 540 550
LQLAQDVSPR PGALLWFLEL HELCVVDDDF AHAIARAGEE YRRAMGIPRD
560 570 580 590 600
DWVILAELMT ENCRTRHQVL EKLQRGLQLQ ASSRPSSPAS VSPASSVDLS
610 620 630 640 650
AAGLLLSGTE SDKEAVVAVN DGCYTVLGFD KNEATKSEQD LATDLFCDLV
660 670 680 690 700
KPMETSTTKL ESRKILEAAA KALESCKPKR KRSRKKKTRT PSPTCSVDAA
710 720 730 740 750
VAEPTSVNSL GNQDTRETCA SEKKAEKCPT PTPPPRPKRA ALKNSNSGCV
760 770 780 790 800
LKDIIWNQTG PGVKCLTIVE DVRAFLKGIT PPGGVLSTRS RITKHIVDHF
810 820 830 840 850
HSICEQTPEL VLAHAEHQAK NLHELLASET AKLILGIGED PLKKLVGSQR
860 870 880 890 900
SLPRRLGFGA WLGGQQKTSG GCGEREFKDV GRKSGAERTP SKRDLGVSLG
910 920 930 940 950
DQLSQDGARR LSSSTACEIK ESVPPIIDSG GGLSQKFMAW LNHQVFVLSS
960 970 980 990 1000
HLLAVWSFIF GSRQVLGVFD YVYTLFCLCC VLLCFYLPAI GFMTLVGCVF
1010 1020 1030 1040 1050
GSPWRVRLSV FSVWLCVAVV VFQEVLPEPG AVCTSASAER AAALERYTSN
1060 1070 1080 1090 1100
GVHRPVNHLS VGLVGTVAGF VARSVGGPRR YWFYFLRLMV LLDLGLVFLA
1110 1120 1130 1140 1150
VALRGSCKKC FCKCVRTASH EVQLRVFPST KVARTTLEAI CDMYSAPRVD
1160 1170 1180 1190 1200
PIFIATGVRG CWTGSVSPHQ VTEKPVSYSN LDDKKISNKT VVPPPTDPQQ
1210 1220 1230 1240 1250
AVRCLKVLQC GGSIQDVSVP EVKKVTKVPF KAPFFPNVTI DPECYIVVDP
1260 1270 1280 1290 1300
VTYSAAMRGG YGVSHLIVGL GDFAEVNGLR FVSGGQIADF VCLGLYVLLN
1310 1320 1330 1340 1350
FLLSAWLSSP VSCGRGTNDP WCRNPFSYPV VGQGVMCNSH LCVAEDGLTS
1360 1370 1380 1390 1400
PMTLSYSLID WALMVAIMAT VAIFFAKISL LVDVVCVFCC LLMYAFPSLS
1410 1420 1430 1440 1450
IAAFGFPFVL CKVSLHPITL VWVQFFLLAV NVWAGVASVV VLISSWFLAR
1460 1470 1480 1490 1500
ATSSLGLITP YDVHMITATP RGASSLASAP EGTYLAAVRR SALTGRCCMF
1510 1520 1530 1540 1550
VPTNFGSVLE GSLRTRGCAK NVVSVFGSAS GSGGVFTING NPVVVTASHL
1560 1570 1580 1590 1600
LSDGKARVSC VGFSQCLDFK CAGDYAFARV ANWKGDAPKA ELSHRRGRAY
1610 1620 1630 1640 1650
CSPLVGLSLD LLGKNSAFCF TKCGDSGSPV VDEDGNLLGI HTGSNKRGSG
1660 1670 1680 1690 1700
MVTTHGGKTL GMANVKLSEM CPHYSGPGVP VSTVKLPKHL VVDVETVSSD
1710 1720 1730 1740 1750
LVAVVESLPA LEGALSSMQL LCVFFFLWRL IHVPDVPVIR IAFFFLNEIL
1760 1770 1780 1790 1800
PVMLARLMFS FALSLFFCVH WLFCSSVAVA FGDCCSKSVT GYSVQVLLLR
1810 1820 1830 1840 1850
LVIAALNRPC GPFGFSLLGQ LSQCCLMLCL LDIELQLLGC LYLGQLLMWP
1860 1870 1880 1890 1900
PKEIFFHPTG QFMFLPLFLS LFKRNALADM LVGNGCFDAA FFLKYFAEGN
1910 1920 1930 1940 1950
LRDGVSDSCN MTPEGLTAAL AITLSDDDLE FLQRHSEFKC FVSASNMRNG
1960 1970 1980 1990 2000
AKEFIESAYA RALRAQLAAT DKIKASKSIL AKLESFAGGV VTQVEPGDVV
2010 2020 2030 2040 2050
VVLGKKVIGD LVEVVINDAK HVIRVIETRT MAGTQFSVGT ICGDLENACE
2060 2070 2080 2090 2100
DPSGLVKTSK KQARRQKRTG LGTEVVGTVV IDGVSYNKVW HIATGDVTYE
2110 2120 2130 2140 2150
GCLVTENPQL RPLGMTTIGR FQEFIRKHGE KVKTSVEKYP VGKKKSVEFN
2160 2170 2180 2190 2200
ITTYLLDGEE YDVPDHEPLE WTITIGESDL EAERLTVDQA LRHMGHDSLL
2210 2220 2230 2240 2250
TAKEKEKLAR IIESLNGLQQ ASALNCLATS GLDRCTRGGL TVSGDAVKLV
2260 2270 2280 2290 2300
RYHSRTFSIG DVNLKVMGRE EYGRTVGKQG HCLVANLVDG VVVMRKHEPS
2310 2320 2330 2340 2350
LVDVLLTGED ADLISPTHGP GNTGVHGFTW DFEAPPTDLE LELSEQIITA
2360 2370 2380 2390 2400
CSIRRGDAPS LDLPYKLHPV RGNPYRDRGV LYNTRFGDIK YLTPQKTKEP
2410 2420 2430 2440 2450
LHAAACFNPK GVPVSDSETL VATTLPHGFE LYVPTIPQSV LEYLDSRPMH
2460 2470 2480 2490 2500
RKCCVRAVVR GLAECDLQKF DLSRQGFVLP GVLYMVRRYL CRLVGIRRRL
2510 2520 2530 2540 2550
FLPSTYPAKN SMAGINGNRF PTHVVQSHPD IDALCERACK EHWQTVTPCT
2560 2570 2580 2590 2600
LKKQYCSKAK TRTILGTNNF VALGLRSALS GVTQGFMRKG IGSPICLGKN
2610 2620 2630 2640 2650
KFTPLPTKVS GRCLEADLAS CDRSTPAIIR WFTTNLLFEL AGPEEWIPSY
2660 2670 2680 2690 2700
VLNCCHDAVS TMSGCFDKRG GLSSGDPVTS VSNTVYSLVI YAQHMVLSAF
2710 2720 2730 2740 2750
RCGHKVGGLF LRDSLEMEQL FELQPLLVYS DDVVLYDESS ELPNYHFFVD
2760 2770 2780 2790 2800
HLDLMLGFKT DRSKTVITSD PQFPGCRIAA GRVLVPQRDR ILAALAYHMK
2810 2820 2830 2840 2850
ASCVSDYFAS AAAILMDACA CCDYDEDWYF DLVCGIADCA RKEGFRFPGP
2860 2870 2880 2890 2900
SFYVDMWKRL SVEEKKKCRT CAHCGAPSTL VSSCGLNLCD YHGHGHPHCP
2910 2920 2930 2940 2950
VVLPCGHAVG SGVCDGCSSP VMSLNTELDK LLACVPYHPP KVELLSVNDG
2960 2970 2980 2990 3000
VSSLPPGRYQ ARGGVVSVRR DILGNVVDLP DGDYQVMKVA QTCADICMVS
3010 3020 3030 3040 3050
INSHILRSQF ITGAPGTGKT TYLLSVVRDD DVIYTPTHRT MLDVVKALGT
3060 3070 3080 3090 3100
CRFDPPKDTP LEFPVPSRTG PCVRLIRAGF IPGRVSYLDE AAYCNPLDVL
3110 3120 3130 3140 3150
KILSKTPLVC VGDLNQLPPV DFIGPCYAFA LMLGRQLIEV FRFGPSIVNP
3160 3170 3180 3190 3200
IKKFYREELV SRGPDTGVKF LKSYQPYGQV LTPYHRDRVD GAITIDSSQG
3210 3220 3230 3240 3250
CTYDVITVYL PTPKSLNSAR ALVAITRARF YVFVYDPHNQ LEQYLNMSEH
3260 3270 3280 3290 3300
EPAGAVAFWC GEQPMMISEG RVQRLSGPAQ TTDPKLQQLM GLEGTASPLP
3310 3320 3330 3340 3350
QVAHNLGFYY SPDLVQFARI PSELCKHWPV VTAQNRTDWP DRLVCSMSKI
3360 3370 3380 3390 3400
DKCSRAIFCA GYHVGPSVFL GVPGVVSYYL TKFLKGKPVP LPDSLMSTGR
3410 3420 3430 3440 3450
IALNVREYLD EKEMEFSSRC PHAFIGEVKG SNVGGCHHVT SRYLPPVLVP
3460 3470 3480 3490 3500
GSVVKIGVSC PGKAAKELCT VTDVYLPELD PYLNPPTKSM DYKLLVDFQP
3510 3520 3530 3540 3550
VKLMVWKDAT AYFHEGIRPM ESMSRFLKVP QEEGVFFDLD EFVTNAKVSK
3560 3570 3580 3590 3600
LPCKYSVSAN QFLTDVVLSM THPSLAPPDY ELLFARAYCV PGLDVGTLNA
3610 3620 3630
YIYRRGPSTY TTSNIARLVK DICCPVGCKG SGYMFPK
Note: Produced by -1 ribosomal frameshifting at the 1a-1b genes boundary.
Length: 3,637
Mass (Da): 398,663
Last modified: April 12, 2005 - v2
Checksum: 186FAFAFA5BCA311
The checksum is a form of redundancy check that is calculated from the sequence. It is useful for tracking sequence updates. In theory, two different sequences could have the same checksum value, but the likelihood that this would happen is extremely low. UniProtKB may, however, contain entries with identical sequences in the case of multiple genes (paralogs). The checksum is computed as the sequence 64-bit Cyclic Redundancy Check value (CRC64) using the generator polynomial x^64 + x^4 + x^3 + x + 1. The algorithm is described in the ISO 3309 standard (Press W.H., Flannery B.P., Teukolsky S.A. and Vetterling W.T., "Cyclic redundancy and other checksums", Numerical Recipes in C, 2nd ed., pp. 896-902, Cambridge University Press, 1993).
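
A CRC64 over the generator polynomial x^64 + x^4 + x^3 + x + 1 can be sketched as a table-driven routine. The sketch below assumes a bit-reflected implementation (the polynomial then appears as 0xD800000000000000), a zero initial value and no final XOR; these conventions are assumptions made for illustration, and the output has not been checked here against the checksum listed for this entry.

```python
# Reflected form of x^64 + x^4 + x^3 + x + 1 (the x^64 term is implicit).
POLY_REFLECTED = 0xD800000000000000


def _build_table():
    """Precompute the CRC contribution of every possible byte value."""
    table = []
    for byte in range(256):
        crc = byte
        for _ in range(8):
            # Shift right one bit; fold in the polynomial when a 1 bit falls off.
            crc = (crc >> 1) ^ POLY_REFLECTED if crc & 1 else crc >> 1
        table.append(crc)
    return table


_TABLE = _build_table()


def crc64(sequence: str) -> str:
    """Return the CRC64 of an ASCII sequence as 16 uppercase hex digits."""
    crc = 0  # assumed initial value
    for byte in sequence.encode("ascii"):
        crc = _TABLE[(crc ^ byte) & 0xFF] ^ (crc >> 8)
    return f"{crc:016X}"


print(crc64("MQSGFDRCLC"))  # first ten residues of the canonical sequence
```

To compare against a listed checksum, the full sequence would be passed as one string with the spaces, position rulers and line breaks of the display removed.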
Isoform Replicase polyprotein 1a (identifier: Q06502-2)
Also known as: pp1a, ORF1a polyprotein

The sequence of this isoform differs from the canonical sequence as follows:
     2227-3637: Missing.

Note: Produced by conventional translation. (Curated)
Length: 2,226
Mass (Da): 242,933
Checksum: 837887661FAE286C
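
The isoform description "2227-3637: Missing" uses 1-based, inclusive positions on the canonical sequence. As a small sketch of how such a deletion is applied (the function name `apply_missing` is hypothetical, and a placeholder string stands in for the real 3,637-residue sequence):

```python
def apply_missing(sequence, first, last):
    """Remove residues first..last (1-based, inclusive) from a sequence."""
    return sequence[: first - 1] + sequence[last:]


CANONICAL_LENGTH = 3637
MISSING = (2227, 3637)

# Placeholder of the right length; substitute the canonical sequence in practice.
placeholder = "X" * CANONICAL_LENGTH
isoform_1a = apply_missing(placeholder, *MISSING)
print(len(isoform_1a))
```

Removing 3637 - 2227 + 1 = 1,411 residues from 3,637 leaves 2,226, which matches the length listed for isoform 1a.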

Sequence caution

The sequence AAA74104 differs from that shown. Reason: Erroneous initiation. (Curated)

Experimental Info

Feature key | Position(s) | Description | Length
Sequence uncertainty | 1698 | S or P | 1
Sequence uncertainty | 2243 | S or F | 1

Alternative sequence

Feature key | Identifier | Position(s) | Description | Evidence | Length
Alternative sequence | VSP_032888 | 2227 – 3637 | Missing in isoform Replicase polyprotein 1a. | Curated | 1411

Sequence databases

EMBL / GenBank / DDBJ (nucleotide sequence databases):
L13298 Genomic RNA Translation: AAA74103.1
L13298 Genomic RNA Translation: AAA74104.1 Different initiation.

Keywords - Coding sequence diversity: Ribosomal frameshifting

Entry information

Entry name: RPOA_LDVC
Primary (citable) accession number: Q06502
Secondary accession number(s): Q06503
Entry history
Integrated into UniProtKB/Swiss-Prot: April 12, 2005
Last sequence update: April 12, 2005
Last modified: December 11, 2019
This is version 116 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Viral Protein Annotation Program

Miscellaneous

Documents

  1. SIMILARITY comments
    Index of protein domains and families
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health
