Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Genome polyprotein

Gene
N/A
Organism
Classical swine fever virus (strain Alfort) (CSFV) (Hog cholera virus)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

E(rns), E1 and E2 are responsible of cell attachment and subsequent fusion of viral and cellular membrane.By similarity
P7 forms a leader sequence to properly orient NS2 in the membrane.By similarity
Uncleaved NS2-3 is required for production of infectious virus.By similarity
NS2 protease seems to play a vital role in viral RNA replication control and in the pathogenicity of the virus.PROSITE-ProRule annotation
NS3 displays three enzymatic activities: serine protease, NTPase and RNA helicase.By similarity
NS4A is a cofactor for the NS3 protease activity.By similarity
RNA-directed RNA polymerase NS5 replicates the viral (+) and (-) genome.PROSITE-ProRule annotation

Catalytic activityi

Leu is conserved at position P1 for all four cleavage sites. Alanine is found at position P1' of the NS4A-NS4B cleavage site, whereas serine is found at position P1' of the NS3-NS4A, NS4B-NS5A and NS5A-NS5B cleavage sites.
Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1).PROSITE-ProRule annotation
NTP + H2O = NDP + phosphate.
ATP + H2O = ADP + phosphate.

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Active sitei22For N-terminal protease activityPROSITE-ProRule annotation1 Publication1
Active sitei49For N-terminal protease activityPROSITE-ProRule annotation1 Publication1
Active sitei69For N-terminal protease activityPROSITE-ProRule annotation1 Publication1
Active sitei1447For cysteine protease NS2 activityPROSITE-ProRule annotation1
Active sitei1461For cysteine protease NS2 activityPROSITE-ProRule annotation1
Active sitei1512For cysteine protease NS2 activityPROSITE-ProRule annotation1
Active sitei1658Charge relay system; for serine protease NS3 activityPROSITE-ProRule annotation1
Active sitei1695Charge relay system; for serine protease NS3 activityPROSITE-ProRule annotation1
Active sitei1752Charge relay system; for serine protease NS3 activityPROSITE-ProRule annotation1

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Helicase, Hydrolase, Ion channel, Nucleotidyltransferase, Protease, RNA-directed RNA polymerase, Serine protease, Thiol protease, Transferase, Viral ion channel

Keywords - Biological processi

Activation of host autophagy by virus, Fusion of virus membrane with host endosomal membrane, Fusion of virus membrane with host membrane, Host-virus interaction, Inhibition of host innate immune response by virus, Inhibition of host IRF3 by virus, Inhibition of host RLR pathway by virus, Ion transport, Transport, Viral attachment to host cell, Viral immunoevasion, Viral penetration into host cytoplasm, Viral RNA replication, Virus entry into host cell

Keywords - Ligandi

ATP-binding, Nucleotide-binding

Protein family/group databases

MEROPSiC53.001.

Names & Taxonomyi

Protein namesi
Recommended name:
Genome polyprotein
Cleaved into the following 13 chains:
N-terminal protease (EC:3.4.22.-)
Short name:
N-pro
Alternative name(s):
Autoprotease p20
Alternative name(s):
gp44/48
Alternative name(s):
gp33
Alternative name(s):
gp55
Alternative name(s):
Non-structural protein 2
Alternative name(s):
Non-structural protein 3
Alternative name(s):
NS5B
OrganismiClassical swine fever virus (strain Alfort) (CSFV) (Hog cholera virus)
Taxonomic identifieri11097 [NCBI]
Taxonomic lineageiVirusesssRNA virusesssRNA positive-strand viruses, no DNA stageFlaviviridaePestivirus
Virus hostiSus scrofa (Pig) [TaxID: 9823]
Proteomesi
  • UP000008568 Componenti: Genome

Subcellular locationi

E(rns) glycoprotein :
Cysteine protease NS2 :
  • Host membrane PROSITE-ProRule annotation; Multi-pass membrane protein PROSITE-ProRule annotation

Topology

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Transmembranei1140 – 1164HelicalPROSITE-ProRule annotationAdd BLAST25
Transmembranei1189 – 1209HelicalPROSITE-ProRule annotationAdd BLAST21
Transmembranei1217 – 1237HelicalPROSITE-ProRule annotationAdd BLAST21
Transmembranei1247 – 1267HelicalPROSITE-ProRule annotationAdd BLAST21
Transmembranei1281 – 1301HelicalPROSITE-ProRule annotationAdd BLAST21
Transmembranei1360 – 1380HelicalPROSITE-ProRule annotationAdd BLAST21
Transmembranei1568 – 1588HelicalPROSITE-ProRule annotationAdd BLAST21

GO - Cellular componenti

  • host cell cytoplasm Source: AgBase
  • host cell endoplasmic reticulum Source: AgBase
  • host cell nucleus Source: AgBase
  • host cell surface Source: UniProtKB-SubCell
  • integral component of membrane Source: UniProtKB-KW
  • integral to membrane of host cell Source: UniProtKB-KW
  • virion membrane Source: UniProtKB-SubCell
Complete GO annotation...

Keywords - Cellular componenti

Host membrane, Membrane, Virion

Pathology & Biotechi

Mutagenesis

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Mutagenesisi22E → V: Almost complete loss of cleavage between N-pro and C. 1 Publication1
Mutagenesisi40H → L: No effect. 1 Publication1
Mutagenesisi49H → L: Complete loss of cleavage between N-pro and C. 1 Publication1
Mutagenesisi69C → A: Complete loss of cleavage between N-pro and C. 1 Publication1
Mutagenesisi69C → S: Complete loss of cleavage between N-pro and C. 1 Publication1
Mutagenesisi99H → L: No effect. 1 Publication1
Mutagenesisi112C → A: No effect. 1 Publication1
Mutagenesisi112C → S: No effect. 1 Publication1
Mutagenesisi130H → L: No effect. 1 Publication1
Mutagenesisi134C → A: No effect. 1 Publication1
Mutagenesisi134C → S: No effect. 1 Publication1
Mutagenesisi138C → A: No effect. 1 Publication1
Mutagenesisi138C → S: No effect. 1 Publication1
Mutagenesisi161C → A: No effect. 1 Publication1
Mutagenesisi161C → S: No effect. 1 Publication1

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000380501 – 168N-terminal proteaseAdd BLAST168
ChainiPRO_0000038051169 – 267Capsid protein CBy similarityAdd BLAST99
ChainiPRO_0000038052268 – 494E(rns) glycoproteinAdd BLAST227
ChainiPRO_0000038053495 – 656Envelope glycoprotein E1Add BLAST162
ChainiPRO_0000038054657 – 1062Envelope glycoprotein E2Add BLAST406
ChainiPRO_00000380551063 – 1132p7By similarityAdd BLAST70
ChainiPRO_00000380561133 – 2272Non-structural protein 2-3By similarityAdd BLAST1140
ChainiPRO_00003493611133 – 1589Cysteine protease NS2PROSITE-ProRule annotationAdd BLAST457
ChainiPRO_00000380571590 – 2272Serine protease NS3By similarityAdd BLAST683
ChainiPRO_00000380582273 – 2336Non-structural protein 4ABy similarityAdd BLAST64
ChainiPRO_00000380592337 – 2683Non-structural protein 4BBy similarityAdd BLAST347
ChainiPRO_00000380602684 – 3180Non-structural protein 5ABy similarityAdd BLAST497
ChainiPRO_00000380613181 – 3898RNA-directed RNA polymeraseBy similarityAdd BLAST718

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Glycosylationi157N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi269N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi274N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi278N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi293N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi332N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi362N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi367N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi410N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi425N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi500N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi594N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi805N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi810N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi874N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi918N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi949N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi986N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi1713N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi2134N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi2217N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi2494N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi2787N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi2815N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi2891N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi3211N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi3316N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi3689N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi3698N-linked (GlcNAc...); by hostSequence analysis1
Glycosylationi3794N-linked (GlcNAc...); by hostSequence analysis1

Post-translational modificationi

The E(rns) glycoprotein is heavily glycosylated.1 Publication
The viral RNA of pestiviruses is expressed as a single polyprotein which undergoes post-translational proteolytic processing resulting in the production of at least eleven individual proteins. The N-terminal protease cleaves itself from the nascent polyprotein autocatalytically and thereby generates the N-terminus of the adjacent viral capsid protein C.2 Publications
Cleavage between E2 and p7 is partial.By similarity

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sitei168 – 169Cleavage; by autolysis2
Sitei267 – 268Cleavage; by host signal peptidase2
Sitei494 – 495Cleavage2
Sitei656 – 657Cleavage; by host signal peptidase2
Sitei1062 – 1063Cleavage; by host signal peptidase; partialBy similarity2
Sitei1132 – 1133Cleavage; by host signal peptidaseBy similarity2
Sitei1589 – 1590Cleavage; partial; cysteine protease NS2PROSITE-ProRule annotation2
Sitei2272 – 2273Cleavage; by serine protease NS3By similarity2
Sitei2336 – 2337Cleavage; by serine protease NS3By similarity2
Sitei2683 – 2684Cleavage; by serine protease NS3By similarity2
Sitei3180 – 3181Cleavage; by serine protease NS3By similarity2

Keywords - PTMi

Disulfide bond, Glycoprotein

Miscellaneous databases

PMAP-CutDBP19712.

Expressioni

Inductioni

Translated cap independently from an internal ribosome entry site (IRES).1 Publication

Interactioni

Subunit structurei

The E(rns) glycoprotein is found as a homodimer; disulfide-linked. The E1 and E2 envelope glycoproteins form disulfide-linked homodimers as well as heterodimers.

Protein-protein interaction databases

IntActiP19712. 96 interactors.

Structurei

Secondary structure

13898
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Helixi1792 – 1803Combined sources12
Beta strandi1810 – 1814Combined sources5
Beta strandi1817 – 1819Combined sources3
Beta strandi1821 – 1823Combined sources3
Helixi1824 – 1833Combined sources10
Beta strandi1839 – 1845Combined sources7
Helixi1846 – 1859Combined sources14
Beta strandi1865 – 1871Combined sources7
Beta strandi1880 – 1885Combined sources6
Helixi1886 – 1889Combined sources4
Helixi1894 – 1901Combined sources8
Beta strandi1905 – 1910Combined sources6
Helixi1912 – 1914Combined sources3
Helixi1917 – 1927Combined sources11
Helixi1928 – 1932Combined sources5
Beta strandi1935 – 1939Combined sources5
Beta strandi1956 – 1958Combined sources3
Turni1970 – 1972Combined sources3
Beta strandi1973 – 1976Combined sources4
Beta strandi1979 – 1982Combined sources4
Helixi1983 – 1987Combined sources5
Beta strandi1990 – 1993Combined sources4
Helixi1997 – 2009Combined sources13
Beta strandi2014 – 2017Combined sources4
Helixi2024 – 2030Combined sources7
Beta strandi2032 – 2034Combined sources3
Beta strandi2036 – 2041Combined sources6
Helixi2042 – 2045Combined sources4
Beta strandi2054 – 2057Combined sources4
Beta strandi2061 – 2068Combined sources8
Beta strandi2070 – 2073Combined sources4
Beta strandi2075 – 2082Combined sources8
Helixi2086 – 2093Combined sources8
Beta strandi2096 – 2100Combined sources5
Beta strandi2103 – 2105Combined sources3
Beta strandi2112 – 2114Combined sources3
Helixi2118 – 2124Combined sources7
Helixi2125 – 2128Combined sources4
Turni2129 – 2132Combined sources4
Helixi2135 – 2146Combined sources12
Helixi2153 – 2166Combined sources14
Helixi2173 – 2181Combined sources9
Helixi2188 – 2193Combined sources6
Turni2194 – 2196Combined sources3
Beta strandi2204 – 2207Combined sources4
Beta strandi2210 – 2215Combined sources6
Beta strandi2234 – 2237Combined sources4
Helixi2238 – 2246Combined sources9
Helixi2257 – 2270Combined sources14
Helixi2272 – 2275Combined sources4

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
4CBGX-ray2.82A/B/C/D1782-2280[»]
4CBHX-ray2.51A/B/C/D1782-2280[»]
4CBIX-ray3.00A/B/C/D1782-2280[»]
4CBLX-ray3.05A/B/C/D1792-2176[»]
A/B/C/D2178-2280[»]
4CBMX-ray3.27A/B/C/D1782-2280[»]
ProteinModelPortaliP19712.
SMRiP19712.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini1 – 168Peptidase C53Add BLAST168
Domaini1441 – 1589Peptidase C74PROSITE-ProRule annotationAdd BLAST149
Domaini1590 – 1763Peptidase S31PROSITE-ProRule annotationAdd BLAST174
Domaini1802 – 1960Helicase ATP-bindingPROSITE-ProRule annotationAdd BLAST159
Domaini1978 – 2179Helicase C-terminalPROSITE-ProRule annotationAdd BLAST202
Domaini3519 – 3642RdRp catalyticPROSITE-ProRule annotationAdd BLAST124

Sequence similaritiesi

Belongs to the pestivirus polyprotein family.Curated
Contains 1 helicase ATP-binding domain.PROSITE-ProRule annotation
Contains 1 helicase C-terminal domain.PROSITE-ProRule annotation
Contains 1 peptidase C53 domain.Curated
Contains 1 peptidase C74 domain.PROSITE-ProRule annotation
Contains 1 peptidase S31 domain.PROSITE-ProRule annotation
Contains 1 RdRp catalytic domain.PROSITE-ProRule annotation

Keywords - Domaini

Transmembrane, Transmembrane helix

Family and domain databases

Gene3Di3.40.50.300. 2 hits.
3.90.730.10. 1 hit.
InterProiIPR021824. Capsid-C_pestivirus.
IPR011492. DEAD_Flavivir.
IPR014001. Helicase_ATP-bd.
IPR001650. Helicase_C.
IPR022120. NS2.
IPR030399. NS2_C74.
IPR027417. P-loop_NTPase.
IPR008751. Peptidase_C53.
IPR032521. Pestivirus_E2.
IPR000280. Pestivirus_NS3_S31.
IPR007094. RNA-dir_pol_PSvirus.
IPR002166. RNA_pol_HCV.
IPR001568. RNase_T2-like.
IPR033130. RNase_T2_His_AS_2.
[Graphical view]
PfamiPF11889. DUF3409. 1 hit.
PF07652. Flavi_DEAD. 1 hit.
PF00271. Helicase_C. 1 hit.
PF05550. Peptidase_C53. 1 hit.
PF12387. Peptidase_C74. 1 hit.
PF05578. Peptidase_S31. 1 hit.
PF16329. Pestivirus_E2. 1 hit.
PF00998. RdRP_3. 1 hit.
[Graphical view]
PRINTSiPR00729. CDVENDOPTASE.
ProDomiPD003091. Peptidase_C53. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00487. DEXDc. 1 hit.
SM00490. HELICc. 1 hit.
[Graphical view]
SUPFAMiSSF52540. SSF52540. 1 hit.
SSF55895. SSF55895. 1 hit.
PROSITEiPS51192. HELICASE_ATP_BIND_1. 1 hit.
PS51194. HELICASE_CTER. 1 hit.
PS51692. PESTIVIRUS_NS2_PRO. 1 hit.
PS51535. PESTIVIRUS_NS3PRO. 1 hit.
PS50507. RDRP_SSRNA_POS. 1 hit.
PS00531. RNASE_T2_2. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P19712-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MELNHFELLY KTSKQKPVGV EEPVYDTAGR PLFGNPSEVH PQSTLKLPHD
60 70 80 90 100
RGRGDIRTTL RDLPRKGDCR SGNHLGPVSG IYIKPGPVYY QDYTGPVYHR
110 120 130 140 150
APLEFFDEAQ FCEVTKRIGR VTGSDGKLYH IYVCVDGCIL LKLAKRGTPR
160 170 180 190 200
TLKWIRNFTN CPLWVTSCSD DGASGSKDKK PDRMNKGKLK IAPREHEKDS
210 220 230 240 250
KTKPPDATIV VEGVKYQIKK KGKVKGKNTQ DGLYHNKNKP PESRKKLEKA
260 270 280 290 300
LLAWAVITIL LYQPVAAENI TQWNLSDNGT NGIQRAMYLR GVNRSLHGIW
310 320 330 340 350
PEKICKGVPT HLATDTELKE IRGMMDASER TNYTCCRLQR HEWNKHGWCN
360 370 380 390 400
WYNIDPWIQL MNRTQTNLTE GPPDKECAVT CRYDKNTDVN VVTQARNRPT
410 420 430 440 450
TLTGCKKGKN FSFAGTVIEG PCNFNVSVED ILYGDHECGS LLQDTALYLL
460 470 480 490 500
DGMTNTIENA RQGAARVTSW LGRQLSTAGK KLERRSKTWF GAYALSPYCN
510 520 530 540 550
VTRKIGYIWY TNNCTPACLP KNTKIIGPGK FDTNAEDGKI LHEMGGHLSE
560 570 580 590 600
FLLLSLVILS DFAPETASTL YLILHYAIPQ SHEEPEGCDT NQLNLTVKLR
610 620 630 640 650
TEDVVPSSVW NIGKYVCVRP DWWPYETKVA LLFEEAGQVI KLVLRALRDL
660 670 680 690 700
TRVWNSASTT AFLICLIKVL RGQVVQGIIW LLLVTGAQGR LACKEDYRYA
710 720 730 740 750
ISSTNEIGLL GAEGLTTTWK EYSHGLQLDD GTVKAVCTAG SFKVTALNVV
760 770 780 790 800
SRRYLASLHK RALPTSVTFE LLFDGTNPAI EEMDDDFGFG LCPFDTSPVI
810 820 830 840 850
KGKYNTTLLN GSAFYLVCPI GWTGVVECTA VSPTTLRTEV VKTFRRDKPF
860 870 880 890 900
PHRVDCVTTI VEKEDLFHCK LGGNWTCVKG DPVTYKGGQV KQCRWCGFEF
910 920 930 940 950
KEPYGLPHYP IGKCILTNET GYRVVDSTDC NRDGVVISTE GEHECLIGNT
960 970 980 990 1000
TVKVHALDER LGPMPCRPKE IVSSEGPVRK TSCTFNYTKT LRNKYYEPRD
1010 1020 1030 1040 1050
SYFQQYMLKG EYQYWFNLDV TDHHTDYFAE FVVLVVVALL GGRYVLWLIV
1060 1070 1080 1090 1100
TYIILTEQLA AGLQLGQGEV VLIGNLITHT DNEVVVYFLL LYLVIRDEPI
1110 1120 1130 1140 1150
KKWILLLFHA MTNNPVKTIT VALLMISGVA KGGKIDGGWQ RQPVTSFDIQ
1160 1170 1180 1190 1200
LALAVVVVVV MLLAKRDPTT FPLVITVATL RTAKITNGFS TDLVIATVSA
1210 1220 1230 1240 1250
ALLTWTYISD YYKYKTWLQY LVSTVTGIFL IRVLKGIGEL DLHAPTLPSH
1260 1270 1280 1290 1300
RPLFYILVYL ISTAVVTRWN LDVAGLLLQC VPTLLMVFTM WADILTLILI
1310 1320 1330 1340 1350
LPTYELTKLY YLKEVKIGAE RGWLWKTNYK RVNDIYEVDQ TSEGVYLFPS
1360 1370 1380 1390 1400
KQRTSAITST MLPLIKAILI SCISNKWQLI YLLYLIFEVS YYLHKKVIDE
1410 1420 1430 1440 1450
IAGGTNFVSR LVAALIEVNW AFDNEEVKGL KKFFLLSSRV KELIIKHKVR
1460 1470 1480 1490 1500
NEVVVRWFGD EEIYGMPKLI GLVKAATLSR NKHCMLCTVC EDRDWRGETC
1510 1520 1530 1540 1550
PKCGRFGPPV VCGMTLADFE EKHYKRIFIR EDQSGGPLRE EHAGYLQYKA
1560 1570 1580 1590 1600
RGQLFLRNLP VLATKVKMLL VGNLGTEIGD LEHLGWVLRG PAVCKKVTEH
1610 1620 1630 1640 1650
ERCTTSIMDK LTAFFGVMPR GTTPRAPVRF PTSLLKIRRG LETGWAYTHQ
1660 1670 1680 1690 1700
GGISSVDHVT CGKDLLVCDT MGRTRVVCQS NNKMTDESEY GVKTDSGCPE
1710 1720 1730 1740 1750
GARCYVFNPE AVNISGTKGA MVHLQKTGGE FTCVTASGTP AFFDLKNLKG
1760 1770 1780 1790 1800
WSGLPIFEAS SGRVVGRVKV GKNEDSKPTK LMSGIQTVSK SATDLTEMVK
1810 1820 1830 1840 1850
KITTMNRGEF RQITLATGAG KTTELPRSVI EEIGRHKRVL VLIPLRAAAE
1860 1870 1880 1890 1900
SVYQYMRQKH PSIAFNLRIG EMKEGDMATG ITYASYGYFC QMSQPKLRAA
1910 1920 1930 1940 1950
MVEYSFIFLD EYHCATPEQL AIMGKIHRFS ENLRVVAMTA TPAGTVTTTG
1960 1970 1980 1990 2000
QKHPIEEFIA PEVMKGEDLG SEYLDIAGLK IPVEEMKNNM LVFVPTRNMA
2010 2020 2030 2040 2050
VEAAKKLKAK GYNSGYYYSG EDPSNLRVVT SQSPYVVVAT NAIESGVTLP
2060 2070 2080 2090 2100
DLDVVVDTGL KCEKRIRLSP KMPFIVTGLK RMAVTIGEQA QRRGRVGRVK
2110 2120 2130 2140 2150
PGRYYRSQET PVGSKDYHYD LLQAQRYGIE DGINITKSFR EMNYDWSLYE
2160 2170 2180 2190 2200
EDSLMITQLE ILNNLLISEE LPMAVKNIMA RTDHPEPIQL AYNSYETQVP
2210 2220 2230 2240 2250
VLFPKIRNGE VTDTYDNYTF LNARKLGDDV PPYVYATEDE DLAVELLGLD
2260 2270 2280 2290 2300
WPDPGNQGTV EAGRALKQVV GLSTAENALL VALFGYVGYQ ALSKRHIPVV
2310 2320 2330 2340 2350
TDIYSVEDHR LEDTTHLQYA PNAIKTEGKE TELKELAQGD VQRCVEAVTN
2360 2370 2380 2390 2400
YAREGIQFMK SQALKVRETP TYKETMNTVA DYVKKFIEAL TDSKEDIIKY
2410 2420 2430 2440 2450
GLWGAHTALY KSIGARLGHE TAFATLVVKW LAFGGESISD HIKQAATDLV
2460 2470 2480 2490 2500
VYYIINRPQF PGDTETQQEG RKFVASLLVS ALATYTYKSW NYNNLSKIVE
2510 2520 2530 2540 2550
PALATLPYAA KALKLFAPTR LESVVILSTA IYKTYLSIRR GKSDGLLGTG
2560 2570 2580 2590 2600
VSAAMEIMSQ NPVSVGIAVM LGVGAVAAHN AIEASEQKRT LLMKVFVKNF
2610 2620 2630 2640 2650
LDQAATDELV KESPEKIIMA LFEAVQTVGN PLRLVYHLYG VFYKGWEAKE
2660 2670 2680 2690 2700
LAQRTAGRNL FTLIMFEAVE LLGVDSEGKI RQLSSNYILE LLYKFRDNIK
2710 2720 2730 2740 2750
SSVREIAISW APAPFSCDWT PTDDRIGLPH ENYLRVETKC PCGYRMKAVK
2760 2770 2780 2790 2800
NCAGELRLLE EGGSFLCRNK FGRGSQNYRV TKYYDDNLSE IKPVIRMEGH
2810 2820 2830 2840 2850
VELYYKGATI KLDFNNSKTV LATDKWEVDH STLVRALKRY TGAGYRGAYL
2860 2870 2880 2890 2900
GEKPNHKHLI QRDCATITKD KVCFIKMKRG CAFTYDLSLH NLTRLIELVH
2910 2920 2930 2940 2950
KNNLEDREIP AVTVTTWLAY TFVNEDIGTI KPTFGEKVTP EKQEEVVLQP
2960 2970 2980 2990 3000
AVVVDTTDVA VTVVGETSTM TTGETPTTFT SLGSDSKVRQ VLKLGVDDGQ
3010 3020 3030 3040 3050
YPGPNQQRAS LLEAIQGVDE RPSVLILGSD KATSNRVKTA KNVKIYRSRD
3060 3070 3080 3090 3100
PLELREMMKR GKILVVALSR VDTALLKFVD YKGTFLTRET LEALSLGKPK
3110 3120 3130 3140 3150
KRDITKAEAQ WLLRLEDQIE ELPDWFAAKE PIFLEANIKR DKYHLVGDIA
3160 3170 3180 3190 3200
TIKEKAKQLG ATDSTKISKE VGAKVYSMKL SNWVIQEENK QGSLAPLFEE
3210 3220 3230 3240 3250
LLQQCPPGGQ NKTTHMVSAY QLAQGNWVPV SCHVFMGTIP ARRTKTHPYE
3260 3270 3280 3290 3300
AYVKLRELVD EHKMKALCGG SGLSKHNEWV IGKVKYQGNL RTKHMLNPGK
3310 3320 3330 3340 3350
VAEQLHREGY RHNVYNKTIG SVMTATGIRL EKLPVVRAQT DTTNFHQAIR
3360 3370 3380 3390 3400
DKIDKEENLQ TPGLHKKLME VFNALKRPEL EASYDAVDWE ELERGINRKG
3410 3420 3430 3440 3450
AAGFFERKNI GEVLDSEKNK VEEVIDSLKK GRNIRYYETA IPKNEKRDVN
3460 3470 3480 3490 3500
DDWTAGDFVD EKKPRVIQYP EAKTRLAITK VMYKWVKQKP VVIPGYEGKT
3510 3520 3530 3540 3550
PLFQIFDKVK KEWDQFQNPV AVSFDTKAWD TQVTTRDLEL IRDIQKFYFK
3560 3570 3580 3590 3600
KKWHKFIDTL TKHMSEVPVI SADGEVYIRK GQRGSGQPDT SAGNSMLNVL
3610 3620 3630 3640 3650
TMVYAFCEAT GVPYKSFDRV AKIHVCGDDG FLITERALGE KFASKGVQIL
3660 3670 3680 3690 3700
YEAGKPQKIT EGDKMKVAYQ FDDIEFCSHT PVQVRWSDNT SSYMPGRNTT
3710 3720 3730 3740 3750
TILAKMATRL DSSGERGTIA YEKAVAFSFL LMYSWNPLIR RICLLVLSTE
3760 3770 3780 3790 3800
LQVRPGKSTT YYYEGDPISA YKEVIGHNLF DLKRTSFEKL AKLNLSMSTL
3810 3820 3830 3840 3850
GVWTRHTSKR LLQDCVNVGT KEGNWLVNAD RLVSSKTGNR YIPGEGHTLQ
3860 3870 3880 3890
GKHYEELILA RKPIGNFEGT DRYNLGPIVN VVLRRLKIMM MALIGRGV
Length:3,898
Mass (Da):438,578
Last modified:July 11, 2001 - v2
Checksum:i2C1F17B8A359D0F6
GO

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural varianti387T → A.1
Natural varianti3542R → S.1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
J04358 Genomic RNA. Translation: AAA43844.2.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
J04358 Genomic RNA. Translation: AAA43844.2.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
4CBGX-ray2.82A/B/C/D1782-2280[»]
4CBHX-ray2.51A/B/C/D1782-2280[»]
4CBIX-ray3.00A/B/C/D1782-2280[»]
4CBLX-ray3.05A/B/C/D1792-2176[»]
A/B/C/D2178-2280[»]
4CBMX-ray3.27A/B/C/D1782-2280[»]
ProteinModelPortaliP19712.
SMRiP19712.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

IntActiP19712. 96 interactors.

Protein family/group databases

MEROPSiC53.001.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Miscellaneous databases

PMAP-CutDBP19712.

Family and domain databases

Gene3Di3.40.50.300. 2 hits.
3.90.730.10. 1 hit.
InterProiIPR021824. Capsid-C_pestivirus.
IPR011492. DEAD_Flavivir.
IPR014001. Helicase_ATP-bd.
IPR001650. Helicase_C.
IPR022120. NS2.
IPR030399. NS2_C74.
IPR027417. P-loop_NTPase.
IPR008751. Peptidase_C53.
IPR032521. Pestivirus_E2.
IPR000280. Pestivirus_NS3_S31.
IPR007094. RNA-dir_pol_PSvirus.
IPR002166. RNA_pol_HCV.
IPR001568. RNase_T2-like.
IPR033130. RNase_T2_His_AS_2.
[Graphical view]
PfamiPF11889. DUF3409. 1 hit.
PF07652. Flavi_DEAD. 1 hit.
PF00271. Helicase_C. 1 hit.
PF05550. Peptidase_C53. 1 hit.
PF12387. Peptidase_C74. 1 hit.
PF05578. Peptidase_S31. 1 hit.
PF16329. Pestivirus_E2. 1 hit.
PF00998. RdRP_3. 1 hit.
[Graphical view]
PRINTSiPR00729. CDVENDOPTASE.
ProDomiPD003091. Peptidase_C53. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00487. DEXDc. 1 hit.
SM00490. HELICc. 1 hit.
[Graphical view]
SUPFAMiSSF52540. SSF52540. 1 hit.
SSF55895. SSF55895. 1 hit.
PROSITEiPS51192. HELICASE_ATP_BIND_1. 1 hit.
PS51194. HELICASE_CTER. 1 hit.
PS51692. PESTIVIRUS_NS2_PRO. 1 hit.
PS51535. PESTIVIRUS_NS3PRO. 1 hit.
PS50507. RDRP_SSRNA_POS. 1 hit.
PS00531. RNASE_T2_2. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiPOLG_CSFVA
AccessioniPrimary (citable) accession number: P19712
Entry historyi
Integrated into UniProtKB/Swiss-Prot: February 1, 1991
Last sequence update: July 11, 2001
Last modified: November 30, 2016
This is version 144 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Direct protein sequencing

Documents

  1. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  2. Peptidase families
    Classification of peptidase families and list of entries
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.