Protein

Genome polyprotein

Gene
N/A
Organism
Classical swine fever virus (strain Alfort) (CSFV) (Hog cholera virus)
Status
Reviewed - Annotation score: - Experimental evidence at protein level

Function

N-terminal protease: Leader cysteine autoprotease that cleaves itself from the nascent polyprotein during translation of the viral mRNA. Once released, plays a role in the inhibition of host innate immune response by interacting with host IRF3 and inducing its proteasomal degradation. (3 Publications)
Capsid protein C: Packages viral RNA to form a viral nucleocapsid and thereby protects viral RNA. Also plays a role in transcription regulation. Protects the incoming virus against IFN-induced effectors. (2 Publications)
E(rns) glycoprotein: Plays a role in viral entry. Interacts with host RPSA, which acts as a cellular attachment receptor for the virus. Also possesses intrinsic ribonuclease (RNase) activity that can inhibit the production of type I interferon and assist in the development of persistent infections. (5 Publications)
Envelope glycoprotein E1: Plays a role in cell attachment and subsequent fusion of viral and cellular membranes. Together with envelope glycoprotein E2, it thus mediates viral entry. (1 Publication)
Envelope glycoprotein E2: Plays a role in cell attachment and subsequent fusion of viral and cellular membranes. Together with envelope glycoprotein E1, it thus mediates viral entry. (1 Publication)
P7: Plays an essential role in the virus replication cycle by acting as a viroporin. Forms ion-conductive pores that alter cell permeability, allowing the transport of ions and other small molecules. (2 Publications)
Non-structural protein 2-3: Autoprotease that associates with the host chaperone JIV and cleaves the NS2-3 protein between NS2 and NS3. Also plays a role in the formation of infectious particles. (1 Publication)
Cysteine protease NS2: Plays a role in the regulation of viral RNA replication. (1 Publication)
Serine protease NS3: Multifunctional protein that contains an N-terminal protease and a C-terminal helicase, playing essential roles in viral polyprotein processing and viral genome replication. The chymotrypsin-like serine protease activity utilizes NS4A as an essential cofactor and catalyzes the cleavage of the polyprotein, leading to the release of NS4A, NS4B, NS5A, and NS5B. Plays a role in the inhibition of host NF-kappa-B activation by interacting with and inhibiting host TRAF6. Interacts with NS5B to enhance RNA-dependent RNA polymerase activity. (2 Publications)
Non-structural protein 4A: Acts as a cofactor for the NS3 protease activity. (1 Publication)
RNA-directed RNA polymerase: Replicates the viral (+) and (-) genome. (PROSITE-ProRule annotation)

Catalytic activity

Leu is conserved at position P1 for all four cleavage sites. Alanine is found at position P1' of the NS4A-NS4B cleavage site, whereas serine is found at position P1' of the NS3-NS4A, NS4B-NS5A and NS5A-NS5B cleavage sites.
Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1). (PROSITE-ProRule annotation)
NTP + H2O = NDP + phosphate.
ATP + H2O = ADP + phosphate.
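The P1/P1' statement for the four NS3 cleavage sites can be checked against the displayed sequence. The following is a minimal sketch, not part of the entry: the ten-residue windows are copied by hand from the sequence shown in this record, the P1 positions come from the cleavage-site annotations (2272, 2336, 2683, 3180), and the names `WINDOWS` and `p1_and_p1_prime` are hypothetical.

```python
# Each tuple: (site name, 1-based position of the first residue in the window,
# ten residues copied from the displayed sequence, annotated P1 position).
WINDOWS = [
    ("NS3-NS4A", 2271, "GLSTAENALL", 2272),
    ("NS4A-NS4B", 2331, "TELKELAQGD", 2336),
    ("NS4B-NS5A", 2681, "RQLSSNYILE", 2683),
    ("NS5A-NS5B", 3176, "YSMKLSNWVI", 3180),
]


def p1_and_p1_prime(window_start, window, p1_position):
    """Return the residues at P1 and P1' (the residue right after P1)."""
    offset = p1_position - window_start  # convert 1-based position to index
    return window[offset], window[offset + 1]


# Map each site name to its (P1, P1') residue pair.
RESULTS = {
    name: p1_and_p1_prime(start, window, p1)
    for name, start, window, p1 in WINDOWS
}

for name, (p1, p1_prime) in RESULTS.items():
    print(f"{name}: P1={p1}, P1'={p1_prime}")
```

Under these assumptions, every site should report Leu (L) at P1, with Ala (A) at P1' only for NS4A-NS4B and Ser (S) at P1' for the other three.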

Sites

Feature key  Position(s)  Description                                            Evidence                                    Length
Active site  22           For N-terminal protease activity                       PROSITE-ProRule annotation, 1 Publication   1
Active site  49           For N-terminal protease activity                       PROSITE-ProRule annotation, 1 Publication   1
Active site  69           For N-terminal protease activity                       PROSITE-ProRule annotation, 2 Publications  1
Active site  1447         For cysteine protease NS2 activity                     PROSITE-ProRule annotation                  1
Active site  1461         For cysteine protease NS2 activity                     PROSITE-ProRule annotation                  1
Active site  1512         For cysteine protease NS2 activity                     PROSITE-ProRule annotation                  1
Active site  1658         Charge relay system; for serine protease NS3 activity  PROSITE-ProRule annotation                  1
Active site  1695         Charge relay system; for serine protease NS3 activity  PROSITE-ProRule annotation                  1
Active site  1752         Charge relay system; for serine protease NS3 activity  PROSITE-ProRule annotation                  1

Keywords

Molecular function: Helicase, Hydrolase, Ion channel, Nucleotidyltransferase, Protease, RNA-directed RNA polymerase, Serine protease, Thiol protease, Transferase, Viral ion channel
Biological process: Activation of host autophagy by virus, Fusion of virus membrane with host endosomal membrane, Fusion of virus membrane with host membrane, Host-virus interaction, Inhibition of host innate immune response by virus, Inhibition of host IRF3 by virus, Inhibition of host RLR pathway by virus, Ion transport, Transport, Viral attachment to host cell, Viral immunoevasion, Viral penetration into host cytoplasm, Viral RNA replication, Virus entry into host cell
Ligand: ATP-binding, Nucleotide-binding

Protein family/group databases

MEROPS: C53.001

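Names & Taxonomy

Protein names
Recommended name: Genome polyprotein
Cleaved into the following 13 chains (only those with a listed short or alternative name are shown here):
  • N-terminal protease (EC:3.4.22.-); short name: N-pro; alternative name: Autoprotease p20
  • Capsid protein C; alternative name: Core protein
  • E(rns) glycoprotein; alternative name: gp44/48
  • Envelope glycoprotein E1; alternative name: gp33
  • Envelope glycoprotein E2; alternative name: gp55
  • Cysteine protease NS2; alternative name: Non-structural protein 2
  • Serine protease NS3; alternative name: Non-structural protein 3
  • RNA-directed RNA polymerase; alternative name: NS5B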
Organism: Classical swine fever virus (strain Alfort) (CSFV) (Hog cholera virus)
Taxonomic identifier: 11097 [NCBI]
Taxonomic lineage: Viruses > ssRNA viruses > ssRNA positive-strand viruses, no DNA stage > Flaviviridae > Pestivirus
Virus host: Sus scrofa (Pig) [TaxID: 9823]
Proteomes:
  • UP000008568 Component: Genome

Subcellular location

N-terminal protease:
Capsid protein C:
E(rns) glycoprotein:
Envelope glycoprotein E2:
Cysteine protease NS2:
  • Host membrane (PROSITE-ProRule annotation); Multi-pass membrane protein (PROSITE-ProRule annotation)

Topology

Feature key    Position(s)  Description  Evidence                    Length
Transmembrane  1140 – 1164  Helical      PROSITE-ProRule annotation  25
Transmembrane  1189 – 1209  Helical      PROSITE-ProRule annotation  21
Transmembrane  1217 – 1237  Helical      PROSITE-ProRule annotation  21
Transmembrane  1247 – 1267  Helical      PROSITE-ProRule annotation  21
Transmembrane  1281 – 1301  Helical      PROSITE-ProRule annotation  21
Transmembrane  1360 – 1380  Helical      PROSITE-ProRule annotation  21
Transmembrane  1568 – 1588  Helical      PROSITE-ProRule annotation  21

GO - Cellular component

  • host cell cytoplasm Source: AgBase
  • host cell endoplasmic reticulum Source: AgBase
  • host cell nucleus Source: AgBase
  • host cell surface Source: UniProtKB-SubCell
  • integral component of membrane Source: UniProtKB-KW
  • integral to membrane of host cell Source: UniProtKB-KW
  • virion membrane Source: UniProtKB-SubCell

Keywords - Cellular component

Host cell membrane, Host cytoplasm, Host membrane, Membrane, Virion

Pathology & Biotech

Mutagenesis

Feature key  Position(s)  Description                                                      Evidence       Length
Mutagenesis  22           E → V: Almost complete loss of cleavage between N-pro and C.     1 Publication  1
Mutagenesis  40           H → L: No effect.                                                1 Publication  1
Mutagenesis  49           H → L: Complete loss of cleavage between N-pro and C.            1 Publication  1
Mutagenesis  69           C → A: Complete loss of cleavage between N-pro and C.            1 Publication  1
Mutagenesis  69           C → S: Complete loss of cleavage between N-pro and C.            1 Publication  1
Mutagenesis  99           H → L: No effect.                                                1 Publication  1
Mutagenesis  112          C → A: No effect.                                                1 Publication  1
Mutagenesis  112          C → S: No effect.                                                1 Publication  1
Mutagenesis  130          H → L: No effect.                                                1 Publication  1
Mutagenesis  134          C → A: No effect.                                                1 Publication  1
Mutagenesis  134          C → S: No effect.                                                1 Publication  1
Mutagenesis  138          C → A: No effect.                                                1 Publication  1
Mutagenesis  138          C → S: No effect.                                                1 Publication  1
Mutagenesis  161          C → A: No effect.                                                1 Publication  1
Mutagenesis  161          C → S: No effect.                                                1 Publication  1
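The mutagenesis rows can be cross-checked against the annotated N-terminal protease active sites (22, 49, 69). The sketch below is illustrative only: the tuples are transcribed by hand from the mutagenesis table, the effect text is shortened, and `MUTATIONS`, `N_PRO_ACTIVE_SITES`, and `positions_affecting_cleavage` are hypothetical names.

```python
# (position, wild-type residue, mutant residue, reported effect), transcribed
# from the mutagenesis table; effects are abbreviated for the sketch.
MUTATIONS = [
    (22, "E", "V", "Almost complete loss of cleavage"),
    (40, "H", "L", "No effect"),
    (49, "H", "L", "Complete loss of cleavage"),
    (69, "C", "A", "Complete loss of cleavage"),
    (69, "C", "S", "Complete loss of cleavage"),
    (99, "H", "L", "No effect"),
    (112, "C", "A", "No effect"),
    (112, "C", "S", "No effect"),
    (130, "H", "L", "No effect"),
    (134, "C", "A", "No effect"),
    (134, "C", "S", "No effect"),
    (138, "C", "A", "No effect"),
    (138, "C", "S", "No effect"),
    (161, "C", "A", "No effect"),
    (161, "C", "S", "No effect"),
]

# Active-site positions annotated for the N-terminal protease in this entry.
N_PRO_ACTIVE_SITES = {22, 49, 69}


def positions_affecting_cleavage(mutations):
    """Return sorted positions where at least one substitution had an effect."""
    return sorted({pos for pos, _, _, effect in mutations if effect != "No effect"})


AFFECTED = positions_affecting_cleavage(MUTATIONS)
print(AFFECTED)
print(set(AFFECTED) == N_PRO_ACTIVE_SITES)
```

In this transcription, the positions whose substitution impairs N-pro/C cleavage coincide with the three annotated active-site residues, while the other tested His and Cys residues are dispensable.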

PTM / Processing

Molecule processing

Feature key  Identifier      Position(s)  Description                  Evidence                    Length
Chain        PRO_0000038050  1 – 168      N-terminal protease                                      168
Chain        PRO_0000038051  169 – 267    Capsid protein C                                         99
Chain        PRO_0000038052  268 – 494    E(rns) glycoprotein                                      227
Chain        PRO_0000038053  495 – 656    Envelope glycoprotein E1                                 162
Chain        PRO_0000038054  657 – 1062   Envelope glycoprotein E2                                 406
Chain        PRO_0000038055  1063 – 1132  p7                           By similarity               70
Chain        PRO_0000038056  1133 – 2272  Non-structural protein 2-3   By similarity               1140
Chain        PRO_0000349361  1133 – 1589  Cysteine protease NS2        PROSITE-ProRule annotation  457
Chain        PRO_0000038057  1590 – 2272  Serine protease NS3          By similarity               683
Chain        PRO_0000038058  2273 – 2336  Non-structural protein 4A    By similarity               64
Chain        PRO_0000038059  2337 – 2683  Non-structural protein 4B    By similarity               347
Chain        PRO_0000038060  2684 – 3180  Non-structural protein 5A    By similarity               497
Chain        PRO_0000038061  3181 – 3898  RNA-directed RNA polymerase  By similarity               718
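Because NS2-3 overlaps NS2 and NS3, a polyprotein position can belong to more than one chain. The following sketch shows one way to look up the chain(s) for a position and to confirm that the non-overlapping chains tile the full 3,898-residue polyprotein. The coordinates are transcribed from the chain table; `CHAINS` and `chains_at` are hypothetical names, not part of any UniProt tooling.

```python
# (chain name, first residue, last residue), transcribed from the chain table.
CHAINS = [
    ("N-terminal protease", 1, 168),
    ("Capsid protein C", 169, 267),
    ("E(rns) glycoprotein", 268, 494),
    ("Envelope glycoprotein E1", 495, 656),
    ("Envelope glycoprotein E2", 657, 1062),
    ("p7", 1063, 1132),
    ("Non-structural protein 2-3", 1133, 2272),
    ("Cysteine protease NS2", 1133, 1589),
    ("Serine protease NS3", 1590, 2272),
    ("Non-structural protein 4A", 2273, 2336),
    ("Non-structural protein 4B", 2337, 2683),
    ("Non-structural protein 5A", 2684, 3180),
    ("RNA-directed RNA polymerase", 3181, 3898),
]


def chains_at(position):
    """Return the names of all chains whose range contains the position."""
    return [name for name, start, end in CHAINS if start <= position <= end]


# Sum chain lengths, leaving out the NS2-3 precursor that overlaps NS2 and NS3.
TILED_LENGTH = sum(
    end - start + 1
    for name, start, end in CHAINS
    if name != "Non-structural protein 2-3"
)

print(chains_at(69))
print(chains_at(1658))
print(TILED_LENGTH)
```

For example, position 1658 (an NS3 charge-relay residue) falls in both the NS2-3 precursor and mature NS3, which is why the lookup returns a list rather than a single name.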

Amino acid modifications

Feature key    Position(s)  Description                                Evidence           Length
Glycosylation  157          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  269          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  274          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  278          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  293          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  332          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  362          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  367          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  410          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  425          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  500          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  594          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  805          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  810          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  874          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  918          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  949          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  986          N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  1713         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  2134         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  2217         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  2494         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  2787         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  2815         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  2891         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  3211         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  3316         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  3689         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  3698         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
Glycosylation  3794         N-linked (GlcNAc...) asparagine; by host   Sequence analysis  1
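These sites carry the evidence tag "Sequence analysis", meaning they are predicted rather than observed. The entry does not state which rule was used; the sketch below assumes the common N-X-S/T sequon (Asn, then any residue except Pro, then Ser or Thr) purely as an illustration. The 20-residue segment covering positions 261-280 is copied from the displayed sequence, and `SEGMENT`, `SEGMENT_START`, and `is_sequon` are hypothetical names.

```python
# Residues 261-280 of the displayed sequence (copied by hand).
SEGMENT_START = 261
SEGMENT = "LYQPVAAENITQWNLSDNGT"


def is_sequon(segment, segment_start, position):
    """Check for an N-X-S/T motif (X != P) starting at a 1-based position.

    This is an assumed, commonly used prediction rule, not necessarily the
    one applied by the annotation pipeline behind this entry.
    """
    i = position - segment_start  # 1-based position to 0-based index
    if i < 0 or i + 2 >= len(segment):
        return False  # motif would run past the end of the segment
    return (
        segment[i] == "N"
        and segment[i + 1] != "P"
        and segment[i + 2] in "ST"
    )


for pos in (269, 274, 278):
    print(pos, is_sequon(SEGMENT, SEGMENT_START, pos))
```

Under this assumption, the three annotated E(rns) sites in the segment (269, 274, 278) match the motif N-I-T, N-L-S, and N-G-T respectively.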

Post-translational modification

The E(rns) glycoprotein is heavily glycosylated. (1 Publication)
The viral RNA of pestiviruses is expressed as a single polyprotein which undergoes post-translational proteolytic processing resulting in the production of at least eleven individual proteins. The N-terminal protease cleaves itself from the nascent polyprotein autocatalytically and thereby generates the N-terminus of the adjacent viral capsid protein C. (2 Publications)
Cleavage between E2 and p7 is partial. (By similarity)

Sites

Feature key  Position(s)  Description                                  Evidence                    Length
Site         168 – 169    Cleavage; by autolysis                                                   2
Site         267 – 268    Cleavage; by host signal peptidase                                       2
Site         494 – 495    Cleavage                                                                 2
Site         656 – 657    Cleavage; by host signal peptidase                                       2
Site         1062 – 1063  Cleavage; by host signal peptidase; partial  By similarity               2
Site         1132 – 1133  Cleavage; by host signal peptidase           By similarity               2
Site         1589 – 1590  Cleavage; partial; cysteine protease NS2     PROSITE-ProRule annotation  2
Site         2272 – 2273  Cleavage; by serine protease NS3             By similarity               2
Site         2336 – 2337  Cleavage; by serine protease NS3             By similarity               2
Site         2683 – 2684  Cleavage; by serine protease NS3             By similarity               2
Site         3180 – 3181  Cleavage; by serine protease NS3             By similarity               2

Keywords - PTM

Disulfide bond, Glycoprotein

Proteomic databases

PRIDE: P19712

Miscellaneous databases

PMAP-CutDB: P19712

Expression

Induction

Translated cap-independently from an internal ribosome entry site (IRES). (1 Publication)

Interaction

Subunit structure

N-terminal protease: Interacts with host IRF3 (PubMed:17215286, PubMed:27334592).
Capsid protein C: Interacts with host OS9 (PubMed:25010283).
E(rns) glycoprotein: Homodimer (PubMed:29235980); disulfide-linked. Interacts with host RPSA.
Envelope glycoprotein E2: Interacts with host TRX2. Forms disulfide-linked homodimers as well as heterodimers with E1.
Serine protease NS3: Interacts with host TRAF6; this interaction inhibits host NF-kappa-B pathway. Interacts with NS5B; this interaction enhances RNA-dependent RNA polymerase activity. Interacts with protein NS4A.
(6 Publications)

Protein-protein interaction databases

IntAct: P19712, 96 interactors

Structure

Secondary structure

Feature key  Position(s)  Evidence          Length
Beta strand  1598 – 1600  Combined sources  3
Helix        1610 – 1614  Combined sources  5
Beta strand  1632 – 1639  Combined sources  8
Beta strand  1642 – 1649  Combined sources  8
Beta strand  1652 – 1655  Combined sources  4
Helix        1657 – 1660  Combined sources  4
Beta strand  1665 – 1669  Combined sources  5
Beta strand  1672 – 1676  Combined sources  5
Beta strand  1680 – 1682  Combined sources  3
Turn         1683 – 1686  Combined sources  4
Beta strand  1687 – 1691  Combined sources  5
Beta strand  1703 – 1712  Combined sources  10
Beta strand  1718 – 1727  Combined sources  10
Beta strand  1730 – 1734  Combined sources  5
Helix        1745 – 1748  Combined sources  4
Beta strand  1755 – 1758  Combined sources  4
Turn         1759 – 1761  Combined sources  3
Beta strand  1764 – 1767  Combined sources  4
Beta strand  1770 – 1777  Combined sources  8
Beta strand  1780 – 1782  Combined sources  3
Helix        1792 – 1803  Combined sources  12
Beta strand  1810 – 1814  Combined sources  5
Beta strand  1817 – 1819  Combined sources  3
Beta strand  1821 – 1823  Combined sources  3
Helix        1824 – 1833  Combined sources  10
Beta strand  1839 – 1845  Combined sources  7
Helix        1846 – 1859  Combined sources  14
Beta strand  1865 – 1871  Combined sources  7
Beta strand  1880 – 1885  Combined sources  6
Helix        1886 – 1889  Combined sources  4
Helix        1894 – 1901  Combined sources  8
Beta strand  1905 – 1910  Combined sources  6
Helix        1912 – 1914  Combined sources  3
Helix        1917 – 1927  Combined sources  11
Helix        1928 – 1932  Combined sources  5
Beta strand  1935 – 1939  Combined sources  5
Beta strand  1956 – 1958  Combined sources  3
Turn         1970 – 1972  Combined sources  3
Beta strand  1973 – 1976  Combined sources  4
Beta strand  1979 – 1982  Combined sources  4
Helix        1983 – 1987  Combined sources  5
Beta strand  1990 – 1993  Combined sources  4
Helix        1997 – 2009  Combined sources  13
Beta strand  2014 – 2017  Combined sources  4
Helix        2024 – 2030  Combined sources  7
Beta strand  2032 – 2034  Combined sources  3
Beta strand  2036 – 2041  Combined sources  6
Helix        2042 – 2045  Combined sources  4
Beta strand  2054 – 2057  Combined sources  4
Beta strand  2061 – 2068  Combined sources  8
Beta strand  2070 – 2073  Combined sources  4
Beta strand  2075 – 2082  Combined sources  8
Helix        2086 – 2093  Combined sources  8
Beta strand  2096 – 2100  Combined sources  5
Beta strand  2103 – 2105  Combined sources  3
Beta strand  2112 – 2114  Combined sources  3
Helix        2118 – 2124  Combined sources  7
Helix        2125 – 2128  Combined sources  4
Turn         2129 – 2132  Combined sources  4
Helix        2135 – 2146  Combined sources  12
Helix        2153 – 2166  Combined sources  14
Helix        2173 – 2181  Combined sources  9
Helix        2188 – 2193  Combined sources  6
Turn         2194 – 2196  Combined sources  3
Beta strand  2204 – 2207  Combined sources  4
Beta strand  2210 – 2215  Combined sources  6
Beta strand  2234 – 2237  Combined sources  4
Helix        2238 – 2246  Combined sources  9
Helix        2257 – 2270  Combined sources  14
Helix        2272 – 2275  Combined sources  4
Beta strand  2276 – 2279  Combined sources  4
Beta strand  2298 – 2312  Combined sources  15
Helix        2315 – 2317  Combined sources  3
Beta strand  2336 – 2343  Combined sources  8

3D structure databases

PDB entry  Method  Resolution (Å)  Chain    Positions
4CBG       X-ray   2.82            A/B/C/D  1782-2280
4CBH       X-ray   2.51            A/B/C/D  1782-2280
4CBI       X-ray   3.00            A/B/C/D  1782-2280
4CBL       X-ray   3.05            A/B/C/D  1792-2280
4CBM       X-ray   3.27            A/B/C/D  1782-2280
5MZ4       X-ray   3.05            A/B      1590-2280

ProteinModelPortal: P19712
SMR: P19712

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description           Evidence                    Length
Domain       1 – 168      Peptidase C53                                     168
Domain       1441 – 1589  Peptidase C74         PROSITE-ProRule annotation  149
Domain       1590 – 1763  Peptidase S31         PROSITE-ProRule annotation  174
Domain       1802 – 1960  Helicase ATP-binding  PROSITE-ProRule annotation  159
Domain       1978 – 2179  Helicase C-terminal   PROSITE-ProRule annotation  202
Domain       3519 – 3642  RdRp catalytic        PROSITE-ProRule annotation  124

Sequence similarities

Belongs to the pestivirus polyprotein family. (Curated)

Keywords - Domain

Transmembrane, Transmembrane helix

Phylogenomic databases

OrthoDB: VOG09000032

Family and domain databases

CDD:
  • cd00079 HELICc, 1 hit
Gene3D:
  • 3.90.730.10, 1 hit
InterPro:
  • IPR021824 Capsid-C_pestivirus
  • IPR011492 DEAD_Flavivir
  • IPR014001 Helicase_ATP-bd
  • IPR001650 Helicase_C
  • IPR022120 NS2
  • IPR030399 NS2_C74
  • IPR027417 P-loop_NTPase
  • IPR008751 Peptidase_C53
  • IPR032521 Pestivirus_E2
  • IPR000280 Pestivirus_NS3_S31
  • IPR007094 RNA-dir_pol_PSvirus
  • IPR002166 RNA_pol_HCV
  • IPR036430 RNase_T2-like_sf
  • IPR033130 RNase_T2_His_AS_2
Pfam:
  • PF11889 DUF3409, 1 hit
  • PF07652 Flavi_DEAD, 1 hit
  • PF00271 Helicase_C, 1 hit
  • PF05550 Peptidase_C53, 1 hit
  • PF12387 Peptidase_C74, 1 hit
  • PF05578 Peptidase_S31, 1 hit
  • PF16329 Pestivirus_E2, 1 hit
  • PF00998 RdRP_3, 1 hit
PRINTS:
  • PR00729 CDVENDOPTASE
SMART:
  • SM00487 DEXDc, 1 hit
  • SM00490 HELICc, 1 hit
SUPFAM:
  • SSF52540 SSF52540, 1 hit
  • SSF55895 SSF55895, 1 hit
PROSITE:
  • PS51192 HELICASE_ATP_BIND_1, 1 hit
  • PS51194 HELICASE_CTER, 1 hit
  • PS51692 PESTIVIRUS_NS2_PRO, 1 hit
  • PS51535 PESTIVIRUS_NS3PRO, 1 hit
  • PS50507 RDRP_SSRNA_POS, 1 hit
  • PS00531 RNASE_T2_2, 1 hit

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P19712-1 [UniParc]

        10         20         30         40         50
MELNHFELLY KTSKQKPVGV EEPVYDTAGR PLFGNPSEVH PQSTLKLPHD
60 70 80 90 100
RGRGDIRTTL RDLPRKGDCR SGNHLGPVSG IYIKPGPVYY QDYTGPVYHR
110 120 130 140 150
APLEFFDEAQ FCEVTKRIGR VTGSDGKLYH IYVCVDGCIL LKLAKRGTPR
160 170 180 190 200
TLKWIRNFTN CPLWVTSCSD DGASGSKDKK PDRMNKGKLK IAPREHEKDS
210 220 230 240 250
KTKPPDATIV VEGVKYQIKK KGKVKGKNTQ DGLYHNKNKP PESRKKLEKA
260 270 280 290 300
LLAWAVITIL LYQPVAAENI TQWNLSDNGT NGIQRAMYLR GVNRSLHGIW
310 320 330 340 350
PEKICKGVPT HLATDTELKE IRGMMDASER TNYTCCRLQR HEWNKHGWCN
360 370 380 390 400
WYNIDPWIQL MNRTQTNLTE GPPDKECAVT CRYDKNTDVN VVTQARNRPT
410 420 430 440 450
TLTGCKKGKN FSFAGTVIEG PCNFNVSVED ILYGDHECGS LLQDTALYLL
460 470 480 490 500
DGMTNTIENA RQGAARVTSW LGRQLSTAGK KLERRSKTWF GAYALSPYCN
510 520 530 540 550
VTRKIGYIWY TNNCTPACLP KNTKIIGPGK FDTNAEDGKI LHEMGGHLSE
560 570 580 590 600
FLLLSLVILS DFAPETASTL YLILHYAIPQ SHEEPEGCDT NQLNLTVKLR
610 620 630 640 650
TEDVVPSSVW NIGKYVCVRP DWWPYETKVA LLFEEAGQVI KLVLRALRDL
660 670 680 690 700
TRVWNSASTT AFLICLIKVL RGQVVQGIIW LLLVTGAQGR LACKEDYRYA
710 720 730 740 750
ISSTNEIGLL GAEGLTTTWK EYSHGLQLDD GTVKAVCTAG SFKVTALNVV
760 770 780 790 800
SRRYLASLHK RALPTSVTFE LLFDGTNPAI EEMDDDFGFG LCPFDTSPVI
810 820 830 840 850
KGKYNTTLLN GSAFYLVCPI GWTGVVECTA VSPTTLRTEV VKTFRRDKPF
860 870 880 890 900
PHRVDCVTTI VEKEDLFHCK LGGNWTCVKG DPVTYKGGQV KQCRWCGFEF
910 920 930 940 950
KEPYGLPHYP IGKCILTNET GYRVVDSTDC NRDGVVISTE GEHECLIGNT
960 970 980 990 1000
TVKVHALDER LGPMPCRPKE IVSSEGPVRK TSCTFNYTKT LRNKYYEPRD
1010 1020 1030 1040 1050
SYFQQYMLKG EYQYWFNLDV TDHHTDYFAE FVVLVVVALL GGRYVLWLIV
1060 1070 1080 1090 1100
TYIILTEQLA AGLQLGQGEV VLIGNLITHT DNEVVVYFLL LYLVIRDEPI
1110 1120 1130 1140 1150
KKWILLLFHA MTNNPVKTIT VALLMISGVA KGGKIDGGWQ RQPVTSFDIQ
1160 1170 1180 1190 1200
LALAVVVVVV MLLAKRDPTT FPLVITVATL RTAKITNGFS TDLVIATVSA
1210 1220 1230 1240 1250
ALLTWTYISD YYKYKTWLQY LVSTVTGIFL IRVLKGIGEL DLHAPTLPSH
1260 1270 1280 1290 1300
RPLFYILVYL ISTAVVTRWN LDVAGLLLQC VPTLLMVFTM WADILTLILI
1310 1320 1330 1340 1350
LPTYELTKLY YLKEVKIGAE RGWLWKTNYK RVNDIYEVDQ TSEGVYLFPS
1360 1370 1380 1390 1400
KQRTSAITST MLPLIKAILI SCISNKWQLI YLLYLIFEVS YYLHKKVIDE
1410 1420 1430 1440 1450
IAGGTNFVSR LVAALIEVNW AFDNEEVKGL KKFFLLSSRV KELIIKHKVR
1460 1470 1480 1490 1500
NEVVVRWFGD EEIYGMPKLI GLVKAATLSR NKHCMLCTVC EDRDWRGETC
1510 1520 1530 1540 1550
PKCGRFGPPV VCGMTLADFE EKHYKRIFIR EDQSGGPLRE EHAGYLQYKA
1560 1570 1580 1590 1600
RGQLFLRNLP VLATKVKMLL VGNLGTEIGD LEHLGWVLRG PAVCKKVTEH
1610 1620 1630 1640 1650
ERCTTSIMDK LTAFFGVMPR GTTPRAPVRF PTSLLKIRRG LETGWAYTHQ
1660 1670 1680 1690 1700
GGISSVDHVT CGKDLLVCDT MGRTRVVCQS NNKMTDESEY GVKTDSGCPE
1710 1720 1730 1740 1750
GARCYVFNPE AVNISGTKGA MVHLQKTGGE FTCVTASGTP AFFDLKNLKG
1760 1770 1780 1790 1800
WSGLPIFEAS SGRVVGRVKV GKNEDSKPTK LMSGIQTVSK SATDLTEMVK
1810 1820 1830 1840 1850
KITTMNRGEF RQITLATGAG KTTELPRSVI EEIGRHKRVL VLIPLRAAAE
1860 1870 1880 1890 1900
SVYQYMRQKH PSIAFNLRIG EMKEGDMATG ITYASYGYFC QMSQPKLRAA
1910 1920 1930 1940 1950
MVEYSFIFLD EYHCATPEQL AIMGKIHRFS ENLRVVAMTA TPAGTVTTTG
1960 1970 1980 1990 2000
QKHPIEEFIA PEVMKGEDLG SEYLDIAGLK IPVEEMKNNM LVFVPTRNMA
2010 2020 2030 2040 2050
VEAAKKLKAK GYNSGYYYSG EDPSNLRVVT SQSPYVVVAT NAIESGVTLP
2060 2070 2080 2090 2100
DLDVVVDTGL KCEKRIRLSP KMPFIVTGLK RMAVTIGEQA QRRGRVGRVK
2110 2120 2130 2140 2150
PGRYYRSQET PVGSKDYHYD LLQAQRYGIE DGINITKSFR EMNYDWSLYE
2160 2170 2180 2190 2200
EDSLMITQLE ILNNLLISEE LPMAVKNIMA RTDHPEPIQL AYNSYETQVP
2210 2220 2230 2240 2250
VLFPKIRNGE VTDTYDNYTF LNARKLGDDV PPYVYATEDE DLAVELLGLD
2260 2270 2280 2290 2300
WPDPGNQGTV EAGRALKQVV GLSTAENALL VALFGYVGYQ ALSKRHIPVV
2310 2320 2330 2340 2350
TDIYSVEDHR LEDTTHLQYA PNAIKTEGKE TELKELAQGD VQRCVEAVTN
2360 2370 2380 2390 2400
YAREGIQFMK SQALKVRETP TYKETMNTVA DYVKKFIEAL TDSKEDIIKY
2410 2420 2430 2440 2450
GLWGAHTALY KSIGARLGHE TAFATLVVKW LAFGGESISD HIKQAATDLV
2460 2470 2480 2490 2500
VYYIINRPQF PGDTETQQEG RKFVASLLVS ALATYTYKSW NYNNLSKIVE
2510 2520 2530 2540 2550
PALATLPYAA KALKLFAPTR LESVVILSTA IYKTYLSIRR GKSDGLLGTG
2560 2570 2580 2590 2600
VSAAMEIMSQ NPVSVGIAVM LGVGAVAAHN AIEASEQKRT LLMKVFVKNF
2610 2620 2630 2640 2650
LDQAATDELV KESPEKIIMA LFEAVQTVGN PLRLVYHLYG VFYKGWEAKE
2660 2670 2680 2690 2700
LAQRTAGRNL FTLIMFEAVE LLGVDSEGKI RQLSSNYILE LLYKFRDNIK
2710 2720 2730 2740 2750
SSVREIAISW APAPFSCDWT PTDDRIGLPH ENYLRVETKC PCGYRMKAVK
2760 2770 2780 2790 2800
NCAGELRLLE EGGSFLCRNK FGRGSQNYRV TKYYDDNLSE IKPVIRMEGH
2810 2820 2830 2840 2850
VELYYKGATI KLDFNNSKTV LATDKWEVDH STLVRALKRY TGAGYRGAYL
2860 2870 2880 2890 2900
GEKPNHKHLI QRDCATITKD KVCFIKMKRG CAFTYDLSLH NLTRLIELVH
2910 2920 2930 2940 2950
KNNLEDREIP AVTVTTWLAY TFVNEDIGTI KPTFGEKVTP EKQEEVVLQP
2960 2970 2980 2990 3000
AVVVDTTDVA VTVVGETSTM TTGETPTTFT SLGSDSKVRQ VLKLGVDDGQ
3010 3020 3030 3040 3050
YPGPNQQRAS LLEAIQGVDE RPSVLILGSD KATSNRVKTA KNVKIYRSRD
3060 3070 3080 3090 3100
PLELREMMKR GKILVVALSR VDTALLKFVD YKGTFLTRET LEALSLGKPK
3110 3120 3130 3140 3150
KRDITKAEAQ WLLRLEDQIE ELPDWFAAKE PIFLEANIKR DKYHLVGDIA
3160 3170 3180 3190 3200
TIKEKAKQLG ATDSTKISKE VGAKVYSMKL SNWVIQEENK QGSLAPLFEE
3210 3220 3230 3240 3250
LLQQCPPGGQ NKTTHMVSAY QLAQGNWVPV SCHVFMGTIP ARRTKTHPYE
3260 3270 3280 3290 3300
AYVKLRELVD EHKMKALCGG SGLSKHNEWV IGKVKYQGNL RTKHMLNPGK
3310 3320 3330 3340 3350
VAEQLHREGY RHNVYNKTIG SVMTATGIRL EKLPVVRAQT DTTNFHQAIR
3360 3370 3380 3390 3400
DKIDKEENLQ TPGLHKKLME VFNALKRPEL EASYDAVDWE ELERGINRKG
3410 3420 3430 3440 3450
AAGFFERKNI GEVLDSEKNK VEEVIDSLKK GRNIRYYETA IPKNEKRDVN
3460 3470 3480 3490 3500
DDWTAGDFVD EKKPRVIQYP EAKTRLAITK VMYKWVKQKP VVIPGYEGKT
3510 3520 3530 3540 3550
PLFQIFDKVK KEWDQFQNPV AVSFDTKAWD TQVTTRDLEL IRDIQKFYFK
3560 3570 3580 3590 3600
KKWHKFIDTL TKHMSEVPVI SADGEVYIRK GQRGSGQPDT SAGNSMLNVL
3610 3620 3630 3640 3650
TMVYAFCEAT GVPYKSFDRV AKIHVCGDDG FLITERALGE KFASKGVQIL
3660 3670 3680 3690 3700
YEAGKPQKIT EGDKMKVAYQ FDDIEFCSHT PVQVRWSDNT SSYMPGRNTT
3710 3720 3730 3740 3750
TILAKMATRL DSSGERGTIA YEKAVAFSFL LMYSWNPLIR RICLLVLSTE
3760 3770 3780 3790 3800
LQVRPGKSTT YYYEGDPISA YKEVIGHNLF DLKRTSFEKL AKLNLSMSTL
3810 3820 3830 3840 3850
GVWTRHTSKR LLQDCVNVGT KEGNWLVNAD RLVSSKTGNR YIPGEGHTLQ
3860 3870 3880 3890
GKHYEELILA RKPIGNFEGT DRYNLGPIVN VVLRRLKIMM MALIGRGV
Length: 3,898
Mass (Da): 438,578
Last modified: July 11, 2001 - v2
Checksum: 2C1F17B8A359D0F6
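The sequence above is displayed in blocks of ten residues with interleaved position rulers, so it has to be flattened before residues can be looked up by position. The sketch below is one possible way to do that, shown on the first two display rows only; `parse_grouped_sequence` and `residue_at` are hypothetical helper names, not a UniProt API.

```python
def parse_grouped_sequence(text):
    """Join residue blocks into one string, skipping all-numeric ruler rows."""
    parts = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler rows such as "60 70 80 90 100".
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        parts.append("".join(tokens))
    return "".join(parts)


def residue_at(sequence, position):
    """Return the residue at a 1-based position."""
    return sequence[position - 1]


# First two rows of the display (positions 1-100), copied from the entry.
DISPLAY = """\
        10         20         30         40         50
MELNHFELLY KTSKQKPVGV EEPVYDTAGR PLFGNPSEVH PQSTLKLPHD
60 70 80 90 100
RGRGDIRTTL RDLPRKGDCR SGNHLGPVSG IYIKPGPVYY QDYTGPVYHR
"""

SEQ = parse_grouped_sequence(DISPLAY)
print(len(SEQ))
print(residue_at(SEQ, 22), residue_at(SEQ, 49), residue_at(SEQ, 69))
```

The residues recovered at positions 22, 49, and 69 are Glu, His, and Cys, which agrees with the wild-type residues named in the mutagenesis rows for the N-terminal protease active sites.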

Natural variant

Feature key      Position(s)  Description  Length
Natural variant  387          T → A.       1
Natural variant  3542         R → S.       1

Sequence databases

EMBL / GenBank / DDBJ: J04358 Genomic RNA. Translation: AAA43844.2

Entry information

Entry name: POLG_CSFVA
Accession: Primary (citable) accession number: P19712
Entry history: Integrated into UniProtKB/Swiss-Prot: February 1, 1991
Last sequence update: July 11, 2001
Last modified: May 23, 2018
This is version 153 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Viral Protein Annotation Program

Miscellaneous

Keywords - Technical term

3D-structure, Complete proteome, Direct protein sequencing


UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health