Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Entry version 124 (31 Jul 2019)
Sequence version 4 (11 Nov 2015)
Previous versions | rss
Help videoAdd a publicationFeedback
Protein

Replicase polyprotein 1ab

Gene

rep

Organism
Simian hemorrhagic fever virus (SHFV)
Status
Reviewed-Annotation score:

Annotation score:5 out of 5

<p>The annotation score provides a heuristic measure of the annotation content of a UniProtKB entry or proteome. This score <strong>cannot</strong> be used as a measure of the accuracy of the annotation as we cannot define the ‘correct annotation’ for any given protein.<p><a href='/help/annotation_score' target='_top'>More...</a></p>
-Experimental evidence at transcript leveli <p>This indicates the type of evidence that supports the existence of the protein. Note that the ‘protein existence’ evidence does not give information on the accuracy or correctness of the sequence(s) displayed.<p><a href='/help/protein_existence' target='_top'>More...</a></p>

<p>This section provides any useful information about the protein, mostly biological knowledge.<p><a href='/help/function_section' target='_top'>More...</a></p>Functioni

The replicase polyprotein 1ab is a multifunctional protein: it contains the activities necessary for the transcription of negative stranded RNA, leader RNA, subgenomic mRNAs and progeny virion RNA as well as proteinases responsible for the cleavage of the polyprotein into functional products.
The Nsp1 chain is essential for viral subgenomic mRNA synthesis.By similarity
The 3C-like serine proteinase chain is responsible for the majority of cleavages as it cleaves the C-terminus of the polyprotein.By similarity
The helicase chain, which contains a zinc finger structure, displays RNA and DNA duplex-unwinding activities with 5' to 3' polarity.By similarity

<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section describes the catalytic activity of an enzyme, i.e. a chemical reaction that the enzyme catalyzes.<p><a href='/help/catalytic_activity' target='_top'>More...</a></p>Catalytic activityi

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section is used for enzymes and indicates the residues directly involved in catalysis.<p><a href='/help/act_site' target='_top'>More...</a></p>Active sitei63For Nsp1-alpha papain-like cysteine proteinase activityPROSITE-ProRule annotation1
Active sitei130For Nsp1-alpha papain-like cysteine proteinase activityPROSITE-ProRule annotation1
Active sitei246For Nsp1-beta papain-like cysteine proteinase activityPROSITE-ProRule annotation1
Active sitei309For Nsp1-beta papain-like cysteine proteinase activityPROSITE-ProRule annotation1
Active sitei378For Nsp2 cysteine proteinase activityPROSITE-ProRule annotation1
Active sitei430For Nsp2 cysteine proteinase activityPROSITE-ProRule annotation1
Active sitei1502Charge relay system; for 3C-like serine proteinase activityPROSITE-ProRule annotation1
Active sitei1527Charge relay system; for 3C-like serine proteinase activityPROSITE-ProRule annotation1
Active sitei1580Charge relay system; for 3C-like serine proteinase activityPROSITE-ProRule annotation1
<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section indicates at which position the protein binds a given metal ion. The nature of the metal is indicated in the ‘Description’ field.<p><a href='/help/metal' target='_top'>More...</a></p>Metal bindingi2752Zinc 1PROSITE-ProRule annotation1
Metal bindingi2755Zinc 1PROSITE-ProRule annotation1
Metal bindingi2765Zinc 2PROSITE-ProRule annotation1
Metal bindingi2770Zinc 1PROSITE-ProRule annotation1
Metal bindingi2773Zinc 1PROSITE-ProRule annotation1
Metal bindingi2777Zinc 2PROSITE-ProRule annotation1
Metal bindingi2779Zinc 2PROSITE-ProRule annotation1
Metal bindingi2782Zinc 2PROSITE-ProRule annotation1
Metal bindingi2789Zinc 3PROSITE-ProRule annotation1
Metal bindingi2791Zinc 3PROSITE-ProRule annotation1
Metal bindingi2798Zinc 3PROSITE-ProRule annotation1
Metal bindingi2801Zinc 3PROSITE-ProRule annotation1

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section specifies the position(s) and type(s) of zinc fingers within the protein.<p><a href='/help/zn_fing' target='_top'>More...</a></p>Zinc fingeri3 – 23C4-type; atypicalAdd BLAST21
<p>This subsection of the <a href="http://www.uniprot.org/help/function_section">Function</a> section describes a region in the protein which binds nucleotide phosphates. It always involves more than one amino acid and includes all residues involved in nucleotide-binding.<p><a href='/help/np_bind' target='_top'>More...</a></p>Nucleotide bindingi2897 – 2904ATPBy similarity8

<p>The <a href="http://www.geneontology.org/">Gene Ontology (GO)</a> project provides a set of hierarchical controlled vocabulary split into 3 categories:<p><a href='/help/gene_ontology' target='_top'>More...</a></p>GO - Molecular functioni

GO - Biological processi

<p>UniProtKB Keywords constitute a <a href="http://www.uniprot.org/keywords">controlled vocabulary</a> with a hierarchical structure. Keywords summarise the content of a UniProtKB entry and facilitate the search for proteins of interest.<p><a href='/help/keywords' target='_top'>More...</a></p>Keywordsi

Molecular functionHelicase, Hydrolase, Nucleotidyltransferase, Protease, RNA-directed RNA polymerase, Serine protease, Thiol protease, Transferase
Biological processViral RNA replication
LigandATP-binding, Metal-binding, Nucleotide-binding, Zinc

Protein family/group databases

MEROPS protease database

More...
MEROPSi
C32.002

<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Names & Taxonomyi

<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section provides an exhaustive list of all names of the protein, from commonly used to obsolete, to allow unambiguous identification of a protein.<p><a href='/help/protein_names' target='_top'>More...</a></p>Protein namesi
Recommended name:
Replicase polyprotein 1ab
Alternative name(s):
ORF1ab polyprotein
Cleaved into the following 15 chains:
Nsp1-alpha papain-like cysteine proteinase (EC:3.4.22.-)
Alternative name(s):
PCP1-alpha
Nsp1-beta papain-like cysteine proteinase (EC:3.4.22.-)
Alternative name(s):
PCP1-beta
Nsp2 cysteine proteinase (EC:3.4.22.-)
Alternative name(s):
CP2
Short name:
CP
Non-structural protein 3
Short name:
Nsp3
3C-like serine proteinase (EC:3.4.21.-)
Short name:
3CLSP
Alternative name(s):
Nsp4
Non-structural protein 5-6-7
Short name:
Nsp5-6-7
Non-structural protein 5
Short name:
Nsp5
Non-structural protein 6
Short name:
Nsp6
Non-structural protein 7-alpha
Short name:
Nsp7-alpha
Non-structural protein 7-beta
Short name:
Nsp7-beta
Non-structural protein 8
Short name:
Nsp8
RNA-directed RNA polymerase (EC:2.7.7.48)
Short name:
Pol
Short name:
RdRp
Alternative name(s):
Nsp9
Helicase (EC:3.6.4.12, EC:3.6.4.13)
Short name:
Hel
Alternative name(s):
Nsp10
Non-structural protein 11
Short name:
Nsp11
Non-structural protein 12
Short name:
Nsp12
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section indicates the name(s) of the gene(s) that code for the protein sequence(s) described in the entry. Four distinct tokens exist: ‘Name’, ‘Synonyms’, ‘Ordered locus names’ and ‘ORF names’.<p><a href='/help/gene_name' target='_top'>More...</a></p>Gene namesi
Name:rep
ORF Names:1a-1b
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section provides information on the name(s) of the organism that is the source of the protein sequence.<p><a href='/help/organism-name' target='_top'>More...</a></p>OrganismiSimian hemorrhagic fever virus (SHFV)
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section shows the unique identifier assigned by the NCBI to the source organism of the protein. This is known as the ‘taxonomic identifier’ or ‘taxid’.<p><a href='/help/taxonomic_identifier' target='_top'>More...</a></p>Taxonomic identifieri38143 [NCBI]
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section contains the taxonomic hierarchical classification lineage of the source organism. It lists the nodes as they appear top-down in the taxonomic tree, with the more general grouping listed first.<p><a href='/help/taxonomic_lineage' target='_top'>More...</a></p>Taxonomic lineageiVirusesRiboviriaNidoviralesArnidovirineaeArteriviridaeSimarterivirinaeDeltaarterivirusHedartevirusDeltaarterivirus hemfev
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section only exists in viral entries and indicates the host(s) either as a specific organism or taxonomic group of organisms that are susceptible to be infected by a virus.<p><a href='/help/virus_host' target='_top'>More...</a></p>Virus hostiErythrocebus patas (Red guenon) (Cercopithecus patas) [TaxID: 9538]
Macaca (macaques) [TaxID: 9539]
<p>This subsection of the <a href="http://www.uniprot.org/help/names_and_taxonomy_section">Names and taxonomy</a> section is present for entries that are part of a <a href="http://www.uniprot.org/proteomes">proteome</a>, i.e. of a set of proteins thought to be expressed by organisms whose genomes have been completely sequenced.<p><a href='/help/proteomes_manual' target='_top'>More...</a></p>Proteomesi
  • UP000106311 <p>A UniProt <a href="http://www.uniprot.org/manual/proteomes_manual">proteome</a> can consist of several components. <br></br>The component name refers to the genomic component encoding a set of proteins.<p><a href='/help/proteome_component' target='_top'>More...</a></p> Componenti: Genome

<p>This section provides information on the location and the topology of the mature protein in the cell.<p><a href='/help/subcellular_location_section' target='_top'>More...</a></p>Subcellular locationi

Topology

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the <a href="http://www.uniprot.org/help/subcellular_location_section">'Subcellular location'</a> section describes the extent of a membrane-spanning region of the protein. It denotes the presence of both alpha-helical transmembrane regions and the membrane spanning regions of beta-barrel transmembrane proteins.<p><a href='/help/transmem' target='_top'>More...</a></p>Transmembranei922 – 942HelicalSequence analysisAdd BLAST21
Transmembranei951 – 971HelicalSequence analysisAdd BLAST21
Transmembranei1019 – 1039HelicalSequence analysisAdd BLAST21
Transmembranei1239 – 1259HelicalSequence analysisAdd BLAST21
Transmembranei1316 – 1336HelicalSequence analysisAdd BLAST21
Transmembranei1345 – 1365HelicalSequence analysisAdd BLAST21
Transmembranei1381 – 1401HelicalSequence analysisAdd BLAST21
Transmembranei1673 – 1693HelicalSequence analysisAdd BLAST21
Transmembranei1711 – 1731HelicalSequence analysisAdd BLAST21
Transmembranei1744 – 1764HelicalSequence analysisAdd BLAST21
Transmembranei1784 – 1804HelicalSequence analysisAdd BLAST21

GO - Cellular componenti

Keywords - Cellular componenti

Host cytoplasm, Host membrane, Membrane

<p>This section describes post-translational modifications (PTMs) and/or processing events.<p><a href='/help/ptm_processing_section' target='_top'>More...</a></p>PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘PTM / Processing’ section describes the extent of a polypeptide chain in the mature protein following processing.<p><a href='/help/chain' target='_top'>More...</a></p>ChainiPRO_00000367091 – 3595Replicase polyprotein 1abAdd BLAST3595
ChainiPRO_00000367111 – ?Nsp1-alpha papain-like cysteine proteinaseBy similarity
ChainiPRO_0000036712? – 350Nsp1-beta papain-like cysteine proteinaseBy similarity
ChainiPRO_0000036713351 – 1236Nsp2 cysteine proteinaseBy similarityAdd BLAST886
ChainiPRO_00000367141237 – 1463Non-structural protein 3By similarityAdd BLAST227
ChainiPRO_00000367151464 – 16643C-like serine proteinaseBy similarityAdd BLAST201
ChainiPRO_00000367161665 – 2055Non-structural protein 5-6-7By similarityAdd BLAST391
ChainiPRO_00004231381665 – 1830Non-structural protein 5By similarityAdd BLAST166
ChainiPRO_00004231391831 – 1843Non-structural protein 6By similarityAdd BLAST13
ChainiPRO_00004231401844 – 1999Non-structural protein 7-alphaBy similarityAdd BLAST156
ChainiPRO_00004231412000 – 2055Non-structural protein 7-betaBy similarityAdd BLAST56
ChainiPRO_00000367172056 – 2746RNA-directed RNA polymeraseBy similarityAdd BLAST691
ChainiPRO_00000367182056 – 2104Non-structural protein 8By similarityAdd BLAST49
ChainiPRO_00000367192746 – 3195HelicaseBy similarityAdd BLAST450
ChainiPRO_00000367203196 – 3418Non-structural protein 11By similarityAdd BLAST223
ChainiPRO_00000367213419 – 3595Non-structural protein 12By similarityAdd BLAST177

<p>This subsection of the <a href="http://www.uniprot.org/help/ptm_processing_section">PTM/processing</a> section describes post-translational modifications (PTMs). This subsection <strong>complements</strong> the information provided at the sequence level or describes modifications for which <strong>position-specific data is not yet available</strong>.<p><a href='/help/post-translational_modification' target='_top'>More...</a></p>Post-translational modificationi

Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. There are two alternative pathways for processing. Either nsp4-5 is cleaved, which represents the major pathway or the nsp5-6 and nsp6-7 are processed, which represents the minor pathway. The major pathway occurs when nsp2 acts as cofactor for nsp4 (By similarity).By similarity

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection describes interesting single amino acid sites on the sequence that are not defined in any other subsection. This subsection can be displayed in different sections (‘Function’, ‘PTM / Processing’, ‘Pathology and Biotech’) according to its content.<p><a href='/help/site' target='_top'>More...</a></p>Sitei164 – 165Cleavage; by autolysisSequence analysis2
Sitei350 – 351Cleavage; by autolysisSequence analysis2
Sitei1236 – 1237Cleavage; by CP2By similarity2
Sitei1463 – 1464Cleavage; by 3CLSPBy similarity2
Sitei1664 – 1665Cleavage; by 3CLSPBy similarity2
Sitei1830 – 1831Cleavage; by 3CLSPBy similarity2
Sitei1843 – 1844Cleavage; by 3CLSPBy similarity2
Sitei1999 – 2000Cleavage; by 3CLSPBy similarity2
Sitei2055 – 2056Cleavage; by 3CLSPBy similarity2
Sitei2745 – 2746Cleavage; by 3CLSPBy similarity2
Sitei3195 – 3196Cleavage; by 3CLSPBy similarity2
Sitei3418 – 3419Cleavage; by 3CLSPBy similarity2

Proteomic databases

PRoteomics IDEntifications database

More...
PRIDEi
Q68772

<p>This section provides information on sequence similarities with other proteins and the domain(s) present in a protein.<p><a href='/help/family_and_domains_section' target='_top'>More...</a></p>Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the <a href="http://www.uniprot.org/help/family_and_domains_section">Family and Domains</a> section describes the position and type of a domain, which is defined as a specific combination of secondary structures organized into a characteristic three-dimensional structure or fold.<p><a href='/help/domain' target='_top'>More...</a></p>Domaini56 – 164Peptidase C31PROSITE-ProRule annotationAdd BLAST109
Domaini239 – 350Peptidase C32PROSITE-ProRule annotationAdd BLAST112
Domaini633 – 736Peptidase C33PROSITE-ProRule annotationAdd BLAST104
Domaini1464 – 1664Peptidase S32PROSITE-ProRule annotationAdd BLAST201
Domaini2491 – 2625RdRp catalyticPROSITE-ProRule annotationAdd BLAST135
Domaini2746 – 2812AV ZBDPROSITE-ProRule annotationAdd BLAST67
Domaini2862 – 3022(+)RNA virus helicase ATP-bindingAdd BLAST161
Domaini3023 – 3157(+)RNA virus helicase C-terminalAdd BLAST135

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Family and Domains’ section describes a region of interest that cannot be described in other subsections.<p><a href='/help/region' target='_top'>More...</a></p>Regioni922 – 1039HD1Add BLAST118
Regioni1239 – 1399HD2Add BLAST161
Regioni1687 – 1804HD3Add BLAST118

<p>This subsection of the ‘Family and domains’ section provides general information on the biological role of a domain. The term ‘domain’ is intended here in its wide acceptation, it may be a structural domain, a transmembrane region or a functional domain. Several domains are described in this subsection.<p><a href='/help/domain_cc' target='_top'>More...</a></p>Domaini

The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane.By similarity

<p>This subsection of the ‘Family and domains’ section provides information about the sequence similarity with other proteins.<p><a href='/help/sequence_similarities' target='_top'>More...</a></p>Sequence similaritiesi

Belongs to the arteriviridae polyprotein family.Curated

Zinc finger

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Zinc fingeri3 – 23C4-type; atypicalAdd BLAST21

Keywords - Domaini

Transmembrane, Transmembrane helix, Zinc-finger

Phylogenomic databases

Database of Orthologous Groups

More...
OrthoDBi
33at10239

Family and domain databases

Gene3D Structural and Functional Annotation of Protein Families

More...
Gene3Di
1.20.58.950, 1 hit
3.90.70.70, 2 hits

Integrated resource of protein families, domains and functional sites

More...
InterProi
View protein in InterPro
IPR027351 (+)RNA_virus_helicase_core_dom
IPR031932 Arteri_nsp7a
IPR038451 Arteri_nsp7a_sf
IPR008743 Arterivirus_Nsp2_C33
IPR023338 Arterivirus_NSP4_peptidase
IPR008741 AV_PCPalpha
IPR025773 AV_PCPbeta
IPR038154 AV_PCPbeta_sf
IPR027355 AV_ZBD
IPR008760 EAV_peptidase_S32
IPR037227 EndoU-like
IPR027417 P-loop_NTPase
IPR009003 Peptidase_S1_PA
IPR001205 RNA-dir_pol_C
IPR007094 RNA-dir_pol_PSvirus

Pfam protein domain database

More...
Pfami
View protein in Pfam
PF16749 Arteri_nsp7a, 1 hit
PF05411 Peptidase_C32, 2 hits
PF05412 Peptidase_C33, 1 hit
PF05579 Peptidase_S32, 1 hit
PF00680 RdRP_1, 1 hit
PF01443 Viral_helicase1, 1 hit

Superfamily database of structural and functional annotation

More...
SUPFAMi
SSF142877 SSF142877, 1 hit
SSF50494 SSF50494, 1 hit
SSF52540 SSF52540, 1 hit

PROSITE; a protein domain and family database

More...
PROSITEi
View protein in PROSITE
PS51538 AV_CP, 1 hit
PS51493 AV_NSP4_PRO, 1 hit
PS51539 AV_PCP_ALPHA, 1 hit
PS51540 AV_PCP_BETA, 1 hit
PS51652 AV_ZBD, 1 hit
PS51657 PSRV_HELICASE, 1 hit
PS50507 RDRP_SSRNA_POS, 1 hit

<p>This section displays by default the canonical protein sequence and upon request all isoforms described in the entry. It also includes information pertinent to the sequence(s), including <a href="http://www.uniprot.org/help/sequence_length">length</a> and <a href="http://www.uniprot.org/help/sequences">molecular weight</a>. The information is filed in different subsections. The current subsections and their content are listed below:<p><a href='/help/sequences_section' target='_top'>More...</a></p>Sequences (2)i

<p>This subsection of the <a href="http://www.uniprot.org/help/sequences_section">Sequence</a> section indicates if the <a href="http://www.uniprot.org/help/canonical_and_isoforms">canonical sequence</a> displayed by default in the entry is complete or not.<p><a href='/help/sequence_status' target='_top'>More...</a></p>Sequence statusi: Complete.

<p>This subsection of the <a href="http://www.uniprot.org/help/sequences_section">Sequence</a> section indicates if the <a href="http://www.uniprot.org/help/canonical_and_isoforms">canonical sequence</a> displayed by default in the entry is in its mature form or if it represents the precursor.<p><a href='/help/sequence_processing' target='_top'>More...</a></p>Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 2 <p>This subsection of the ‘Sequence’ section lists the alternative protein sequences (isoforms) that can be generated from the same gene by a single or by the combination of up to four biological events (alternative promoter usage, alternative splicing, alternative initiation and ribosomal frameshifting). Additionally, this section gives relevant information on each alternative protein isoform.<p><a href='/help/alternative_products' target='_top'>More...</a></p> isoformsi produced by ribosomal frameshifting. AlignAdd to basket
Isoform Replicase polyprotein 1ab (identifier: Q68772-1) [UniParc]FASTAAdd to basket
Also known as: pp1ab

This isoform has been chosen as the <div> <p><b>What is the canonical sequence?</b><p><a href='/help/canonical_and_isoforms' target='_top'>More...</a></p>canonicali sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide
        10         20         30         40         50
MFCECPRSNL VVMCSGAFCC VLCGHRRRPR PASESDRAKY GPIVQYVEAR
60 70 80 90 100
VAHVYSGLEG RYCALEMIPI TYGNKFPYCK PLPVSFVIKT LAGVQGDLTR
110 120 130 140 150
LEETPLPGGY GVIPCWGPHL AAVGYLSPAH VGRDWFEGAT HAIVHIGSYG
160 170 180 190 200
GHERPTTIPF NTTGGDVYQL GTCTIVETID HVEWHAGVKP GTAICPLDRI
210 220 230 240 250
DFAQKVITAF PEGFLANKAW LGDKRGTLKV EADPETAALS FEHGRCWLKL
260 270 280 290 300
FPDPACELTT ASTFGYQLNC GVQGKYIARR LQTNGLKLVQ NQEGKFIAYT
310 320 330 340 350
FHRGSWLGHI GHADESVPPD CQIIARFDVL PYNEWSPLPL LKLPGKTYFG
360 370 380 390 400
GNASSVSWPE WKYDEQLLYA DSLTAGFCWL QLFPPLSRKS EAQRAILAQQ
410 420 430 440 450
VNNYGVTGTY LEYRLRQYGI VLAECDYGEH YIYAAASDSS IRHISPVPIH
460 470 480 490 500
DRHHVFVTRL TARFGAFDEG FDLGFGTRYG RRRGGGKKSG QSSGVRAPGR
510 520 530 540 550
TTPDLAGDWG KAVDDQEKTA SKVTTDKAMS TSEPAVVQVG CETKPVADAA
560 570 580 590 600
AVPASVNSTG CALLPVQADP CCTAGVAAKE SEPKAVAAPS IPITFGAPAG
610 620 630 640 650
ETLPVAASPL VVKKDKRCIS VKLTAKKALP KETFIPPPDG GCGVHAFAAI
660 670 680 690 700
QYHINTGHWP EQKPVVNWAY EAWTTNEDIG HMICSTETPA ALEPCLHARY
710 720 730 740 750
VVRLDSDHWV VDHYPNRPMC FVEACAHGWC SSLLSEPTGE EGEHLVDCSA
760 770 780 790 800
LYDCLGKFRN GTEFADTVLG LSKTAHCCNK RVPTPRKQAI MSLLNRPNCV
810 820 830 840 850
PCIAPPSQVR TVDPSQPAAP LPPVPRPRKR KAAAQQVSKV PSEQDPSLAH
860 870 880 890 900
DPPEKPDSVR PPKLGYLDRA WNNMLARTHK LHNLQQRVFG LYPQLLSMLL
910 920 930 940 950
PSGARPSTPR LLGCYFSMAV AMFFLFLGSP LFILCAVLAG VIAPSARYPK
960 970 980 990 1000
ILCCCLVVVY ICTLFADAIS SVCDNDDADC RAFLSDLGDR YSTNQPVYIT
1010 1020 1030 1040 1050
PGPATFFLAV SRNFFVVSVA LFPLHLLLLM VDVLLVIGVL CMDGYCFRCF
1060 1070 1080 1090 1100
SRCVRKAPEE VSLLTIPQSR VSRRFLLDIC DFYSAPPVDI IRLATGLNGC
1110 1120 1130 1140 1150
FRGDYSPIGS STSVITADKI DVKKVSCRTV CSFPSCPSEA VKVLHVLSVR
1160 1170 1180 1190 1200
GQMCAHNEQK VEKVDALPCK NPLFPYDLSS KKIVPVDSGT YEILSSIGCD
1210 1220 1230 1240 1250
MSHLVIGDGD FFKVMGVPRP SPFTVMRLRA CRVVGGGRIF RTALAAAWVL
1260 1270 1280 1290 1300
FFVCAGYWVQ MSTPCGIGTN DPFCKSSFGV PTYVNQGVCH GQYCASSKGV
1310 1320 1330 1340 1350
SRATSILTVR NPAVAPYIVL AACLVYLASV YVPGIIEVSL LVLNALLPAG
1360 1370 1380 1390 1400
PAISALRTLV MIIAAPHLSM KYIAFFCCTT AFVDFTSVVV VLTALLVGWI
1410 1420 1430 1440 1450
LARYTGIGGF VTPYDIHDVV KSQRDGVAVA NAPPNTYLGA VRRAALTGKP
1460 1470 1480 1490 1500
AFFVANNTGI VLEGLLREKT RASNSVSVYG VTCGSGGLFS DGNNTVCLTA
1510 1520 1530 1540 1550
THVCGNNKAV VDYQGTRYEA VFTTKGDYAS AVVPIPGAFP PLKFAPQSYT
1560 1570 1580 1590 1600
GRAYWYANTG VETGFVGTTG CLVFSGPGDS GSPIITPDGL IVGVHTGSDS
1610 1620 1630 1640 1650
KGSGAYTTPN GLTVSGPLSL KEMGAHYEGP IVDVPTRLPR NVHNDTKSVP
1660 1670 1680 1690 1700
QPLARLLESS INLEGGLGTI QLIIVAVVLW KYAVDPLSIP FVVAFFLLNE
1710 1720 1730 1740 1750
ILPKCLIRCF YNYSLFCLAA FSPLASRIFF IRLLTAALNR NPTALICHAC
1760 1770 1780 1790 1800
FAGIAVLNDF IILGDIRLAL RFTSFYVVGV NHDAIAIAVI GALVCVAACC
1810 1820 1830 1840 1850
LELFGLPQMA SVIGCHGSFD PTFLSRYVHE GIRQGVSSGF GTESLSTALA
1860 1870 1880 1890 1900
CALSEDELNF LAQAVDHKAI VSAIHVHKTL QDYILSKNAK ILRASLASVH
1910 1920 1930 1940 1950
ANHNASKALA SLDKFLQGTS TQLKPGDPVI LLGSTSAELV SVFSGDSEYI
1960 1970 1980 1990 2000
AEPIRSHPVA GTICTLCVVQ AKCEGGLVTQ VNGKFSPAKY LAVAGKVLAD
2010 2020 2030 2040 2050
HPDYKLENDG RFPRTREDRV KDSVQVDTVD IGSHTFKKMW NKTTGDVWYD
2060 2070 2080 2090 2100
IIMPESAANP LAVHDLDSAV AAIGMSKEIP EKDMNRLRAI ISKLQGLVSS
2110 2120 2130 2140 2150
EALNLLTAAG CTSADRSGLV ITLDYAKIIT HHARTRAFSS IDFKVVSPDE
2160 2170 2180 2190 2200
AMRTARLSPS PQPIIASFSD DKFLLLRRHP PSLLDVLTKG LDATCREPLH
2210 2220 2230 2240 2250
SPGDQGIDGY LWDFEAPHSK EAIWLSNQII SACAARRGDA PGCYPYKLHP
2260 2270 2280 2290 2300
VRGDPYRVGN VLKNTRFGDV TYTAVSDSDS PWLKVASINS GGCPVVTDRV
2310 2320 2330 2340 2350
LGSTIPVGSE IYLPTLPESV LDYLDSRPDC PTYYTQHGCE AAALQDLKKF
2360 2370 2380 2390 2400
NLSTQGFILP EVLNIVRNYL LGTIGYRPAI YKPSTVPSND SHAGINGLSF
2410 2420 2430 2440 2450
STKTLQALPD IDELCEKAIA EVWQTVTPVT LKKQFCSKAK TRTILGTNAM
2460 2470 2480 2490 2500
ASLALRALLS GVTQGFQLAG KNSPICLGKS KFDPCTFEVK GRCLETDLAS
2510 2520 2530 2540 2550
CDRSTPAIVR HFATKLLFEM ACAERALPLY VVNCCHDLIV TQTSAATKRG
2560 2570 2580 2590 2600
GLSSGDPVTS IANTIYSLVL YVQHMVLTLL ENGHPLSLKF LSGKLNFQDL
2610 2620 2630 2640 2650
YKLQAFIVYS DDLILLNESD DLPNFERWVP HLELALGFKV DPKKTVITSN
2660 2670 2680 2690 2700
PGFLGCEYRH GWLVPQKQRV LAALAYHVNA KDVHTYYINA TAILNDASAL
2710 2720 2730 2740 2750
SAFEPDWFDD LVIGLADCAR KDGYSFPGPA AFREFFSRVS GYQFEGKEVQ
2760 2770 2780 2790 2800
VCSICCSTAR TTSLCGMALC DFCAHRHYHP GCHVLSSFCK HVIGSNTCKM
2810 2820 2830 2840 2850
CSIPILKDRT KFAELLASDQ YRSVCTVEVT VVDGYTDAAP GRYSYQKKQY
2860 2870 2880 2890 2900
MLRKERRGCP LDLPDGKYSM KLLPNSCSGI CVPKAQENAT LSNFVVGPPG
2910 2920 2930 2940 2950
SGKTTFISNL LDDDAVVYCP THVSLIAYSK SLPAARFSVP RGQDPAEYGT
2960 2970 2980 2990 3000
PALSGPTLQL LSAGYVPGAK HYLDEACYAN PFDVFKLLSK TPITAIGDPA
3010 3020 3030 3040 3050
QLTPVGFDTP LYVFELMKKN ALHAIYRFGQ NICNAIQPCY STKLVSQRQG
3060 3070 3080 3090 3100
DTEVIFQTKF APRGKVLTPY HRDRVGAAVT IDSSQGSTYD VVTLYLPTKG
3110 3120 3130 3140 3150
SLTLARGLVG ITRARERLYV YDPHHQLAKY FNLQPSSTTI RPHAVVIDGK
3160 3170 3180 3190 3200
ARVMLSDKCY AAPEDFPGML CTARPATAAD RKILEETCLK LDFLESGSLS
3210 3220 3230 3240 3250
PLPRVCYNLG FYYSPDITKL LPIPSELAKH WPVATNRNNP EWPNRLVVSA
3260 3270 3280 3290 3300
TRLSPLSHPA VCAGYYVGDS LFVGTPNVTS YWLTKFLDGR AVPMEDSVYS
3310 3320 3330 3340 3350
TGRFEMDIRD YLDSAERDFA AKHPHAFIGD TKGTTVGGCH HITSQYLPHV
3360 3370 3380 3390 3400
LPADSVVKVG VSKPGVAHKA LCTVTDIYLP MLGSYTSPPT QSKVYKVNVD
3410 3420 3430 3440 3450
HKACKLMVWR DQTMYFQEGF DYHTLVDALR FVRLSSDGVY RVAPELTPMI
3460 3470 3480 3490 3500
GNRRLDLGAK PLRPVDLAIT PWDDPKCEFL VTHASPFDMS DEFLLVNAFD
3510 3520 3530 3540 3550
FIKEDLLGKS VTPVYFYKRL SEPLHFDQNL PPHVGAILSK APRFISLAKV
3560 3570 3580 3590
FNFCFTPTAC HCKVSVKTAT GDHMCKCSLS SDEFLSRFNP TVGTP
Note: Produced by -1 ribosomal frameshifting at the 1a-1b genes boundary.
Length:3,595
Mass (Da):391,443
Last modified:November 11, 2015 - v4
<p>The checksum is a form of redundancy check that is calculated from the sequence. It is useful for tracking sequence updates.</p> <p>It should be noted that while, in theory, two different sequences could have the same checksum value, the likelihood that this would happen is extremely low.</p> <p>However UniProtKB may contain entries with identical sequences in case of multiple genes (paralogs).</p> <p>The checksum is computed as the sequence 64-bit Cyclic Redundancy Check value (CRC64) using the generator polynomial: x<sup>64</sup> + x<sup>4</sup> + x<sup>3</sup> + x + 1. The algorithm is described in the ISO 3309 standard. </p> <p class="publication">Press W.H., Flannery B.P., Teukolsky S.A. and Vetterling W.T.<br /> <strong>Cyclic redundancy and other checksums</strong><br /> <a href="http://www.nrbook.com/b/bookcpdf.php">Numerical recipes in C 2nd ed., pp896-902, Cambridge University Press (1993)</a>)</p> Checksum:i75E2CB1BE0242E18
GO
Isoform Replicase polyprotein 1a (identifier: Q68772-2) [UniParc]FASTAAdd to basket
Also known as: pp1a, ORF1a polyprotein

The sequence of this isoform differs from the canonical sequence as follows:
     2105-3595: Missing.

Note: Produced by conventional translation.
Show »
Length:2,104
Mass (Da):227,223
Checksum:i6C9623FD4D88B9F2
GO

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
<p>This subsection of the ‘Sequence’ section describes the sequence of naturally occurring alternative protein isoform(s). The changes in the amino acid sequence may be due to alternative splicing, alternative promoter usage, alternative initiation, or ribosomal frameshifting.<p><a href='/help/var_seq' target='_top'>More...</a></p>Alternative sequenceiVSP_0328942105 – 3595Missing in isoform Replicase polyprotein 1a. CuratedAdd BLAST1491

Sequence databases

Select the link destinations:

EMBL nucleotide sequence database

More...
EMBLi

GenBank nucleotide sequence database

More...
GenBanki

DNA Data Bank of Japan; a nucleotide sequence database

More...
DDBJi
Links Updated
AF180391 Genomic RNA Translation: AAA68933.3
AF180391 Genomic RNA Translation: AAB63390.4
U28864 mRNA Translation: AAB40002.1

NCBI Reference Sequences

More...
RefSeqi
NP_203542.2, NC_003092.2
YP_009109556.3, NC_003092.2 [Q68772-1]

Genome annotation databases

Database of genes from NCBI RefSeq genomes

More...
GeneIDi
22220025

KEGG: Kyoto Encyclopedia of Genes and Genomes

More...
KEGGi
vg:22220025

Keywords - Coding sequence diversityi

Ribosomal frameshifting

<p>This section provides links to proteins that are similar to the protein sequence(s) described in this entry at different levels of sequence identity thresholds (100%, 90% and 50%) based on their membership in UniProt Reference Clusters (<a href="http://www.uniprot.org/help/uniref">UniRef</a>).<p><a href='/help/similar_proteins_section' target='_top'>More...</a></p>Similar proteinsi

<p>This section is used to point to information related to entries and found in data collections other than UniProtKB.<p><a href='/help/cross_references_section' target='_top'>More...</a></p>Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF180391 Genomic RNA Translation: AAA68933.3
AF180391 Genomic RNA Translation: AAB63390.4
U28864 mRNA Translation: AAB40002.1
RefSeqiNP_203542.2, NC_003092.2
YP_009109556.3, NC_003092.2 [Q68772-1]

3D structure databases

Database of comparative protein structure models

More...
ModBasei
Search...

SWISS-MODEL Interactive Workspace

More...
SWISS-MODEL-Workspacei
Submit a new modelling project...

Protein family/group databases

MEROPSiC32.002

Proteomic databases

PRIDEiQ68772

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

GeneIDi22220025
KEGGivg:22220025

Phylogenomic databases

OrthoDBi33at10239

Family and domain databases

Gene3Di1.20.58.950, 1 hit
3.90.70.70, 2 hits
InterProiView protein in InterPro
IPR027351 (+)RNA_virus_helicase_core_dom
IPR031932 Arteri_nsp7a
IPR038451 Arteri_nsp7a_sf
IPR008743 Arterivirus_Nsp2_C33
IPR023338 Arterivirus_NSP4_peptidase
IPR008741 AV_PCPalpha
IPR025773 AV_PCPbeta
IPR038154 AV_PCPbeta_sf
IPR027355 AV_ZBD
IPR008760 EAV_peptidase_S32
IPR037227 EndoU-like
IPR027417 P-loop_NTPase
IPR009003 Peptidase_S1_PA
IPR001205 RNA-dir_pol_C
IPR007094 RNA-dir_pol_PSvirus
PfamiView protein in Pfam
PF16749 Arteri_nsp7a, 1 hit
PF05411 Peptidase_C32, 2 hits
PF05412 Peptidase_C33, 1 hit
PF05579 Peptidase_S32, 1 hit
PF00680 RdRP_1, 1 hit
PF01443 Viral_helicase1, 1 hit
SUPFAMiSSF142877 SSF142877, 1 hit
SSF50494 SSF50494, 1 hit
SSF52540 SSF52540, 1 hit
PROSITEiView protein in PROSITE
PS51538 AV_CP, 1 hit
PS51493 AV_NSP4_PRO, 1 hit
PS51539 AV_PCP_ALPHA, 1 hit
PS51540 AV_PCP_BETA, 1 hit
PS51652 AV_ZBD, 1 hit
PS51657 PSRV_HELICASE, 1 hit
PS50507 RDRP_SSRNA_POS, 1 hit

ProtoNet; Automatic hierarchical classification of proteins

More...
ProtoNeti
Search...

MobiDB: a database of protein disorder and mobility annotations

More...
MobiDBi
Search...

<p>This section provides general information on the entry.<p><a href='/help/entry_information_section' target='_top'>More...</a></p>Entry informationi

<p>This subsection of the ‘Entry information’ section provides a mnemonic identifier for a UniProtKB entry, but it is not a stable identifier. Each reviewed entry is assigned a unique entry name upon integration into UniProtKB/Swiss-Prot.<p><a href='/help/entry_name' target='_top'>More...</a></p>Entry nameiRPOA_SHFV
<p>This subsection of the ‘Entry information’ section provides one or more accession number(s). These are stable identifiers and should be used to cite UniProtKB entries. Upon integration into UniProtKB, each entry is assigned a unique accession number, which is called ‘Primary (citable) accession number’.<p><a href='/help/accession_numbers' target='_top'>More...</a></p>AccessioniPrimary (citable) accession number: Q68772
Secondary accession number(s): P89132, Q87077
<p>This subsection of the ‘Entry information’ section shows the date of integration of the entry into UniProtKB, the date of the last sequence update and the date of the last annotation modification (‘Last modified’). The version number for both the entry and the <a href="http://www.uniprot.org/help/canonical_and_isoforms">canonical sequence</a> are also displayed.<p><a href='/help/entry_history' target='_top'>More...</a></p>Entry historyiIntegrated into UniProtKB/Swiss-Prot: April 12, 2005
Last sequence update: November 11, 2015
Last modified: July 31, 2019
This is version 124 of the entry and version 4 of the sequence. See complete history.
<p>This subsection of the ‘Entry information’ section indicates whether the entry has been manually annotated and reviewed by UniProtKB curators or not, in other words, if the entry belongs to the Swiss-Prot section of UniProtKB (<strong>reviewed</strong>) or to the computer-annotated TrEMBL section (<strong>unreviewed</strong>).<p><a href='/help/entry_status' target='_top'>More...</a></p>Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

<p>This section contains any relevant information that doesn’t fit in any other defined sections<p><a href='/help/miscellaneous_section' target='_top'>More...</a></p>Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again