P21530 (POLG_CSFVB) Reviewed, UniProtKB/Swiss-Prot
Last modified
November 16, 2011.
Version 104.
History...
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Genome polyprotein Cleaved into the following 13 chains:
|
| Organism | Classical swine fever virus (strain Brescia) (CSFV) (Hog cholera virus) [Complete proteome] |
| Taxonomic identifier | 11098 [NCBI] |
| Taxonomic lineage | Viruses › ssRNA positive-strand viruses, no DNA stage › Flaviviridae › Pestivirus |
| Virus host | Sus scrofa (Pig) [TaxID: 9823] |
Protein attributes
| Sequence length | 3898 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Inferred from homology |
General annotation (Comments)
| Function | E(rns), E1 and E2 are responsible of cell attachment and subsequent fusion of viral and cellular membrane By similarity. P7 forms a leader sequence to properly orient NS2 in the membrane By similarity. Uncleaved NS2-3 is required for production of infectious virus By similarity. NS2 protease seems to play a vital role in viral RNA replication control and in the pathogenicity of the virus By similarity. NS3 displays three enzymatic activities: serine protease, NTPase and RNA helicase By similarity. NS4A is a cofactor for the NS3 protease activity By similarity. RNA-directed RNA polymerase NS5 replicates the viral (+) and (-) genome By similarity. |
| Catalytic activity | Leu is conserved at position P1 for all four cleavage sites. Alanine is found at position P1' of the NS4A-NS4B cleavage site, whereas serine is found at position P1' of the NS3-NS4A, NS4B-NS5A and NS5A-NS5B cleavage sites. Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1). NTP + H2O = NDP + phosphate. ATP + H2O = ADP + phosphate. |
| Subunit structure | The E(rns) glycoprotein is found as a homodimer; disulfide-linked By similarity. The E1 and E2 envelope glycoproteins form disulfide-linked homodimers as well as heterodimers By similarity. |
| Subcellular location | E(rns) glycoprotein: Host membrane; Peripheral membrane protein. Note: The C-terminus membrane anchor of Erns represents an amphipathic helix embedded in plane into the membrane By similarity. Envelope glycoprotein E2: Host cell surface By similarity. Cysteine protease NS2: Host membrane; Multi-pass membrane protein Potential. |
| Post-translational modification | The E(rns) glycoprotein is heavily glycosylated By similarity. The viral RNA of pestiviruses is expressed as a single polyprotein which undergoes post-translational proteolytic processing resulting in the production of at least eleven individual proteins. The N-terminal protease cleaves itself from the nascent polyprotein autocatalytically and thereby generates the N-terminus of the adjacent viral capsid protein C. Cleavage between E2 and p7 is partial By similarity. |
| Sequence similarities | Belongs to the pestivirus polyprotein family. Contains 1 helicase ATP-binding domain. Contains 1 helicase C-terminal domain. Contains 1 peptidase C53 domain. Contains 1 peptidase C74 domain. Contains 1 peptidase S31 domain. Contains 1 RdRp catalytic domain. |
Ontologies
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||
Molecule processing | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 168 | 168 | N-terminal protease By similarity | PRO_0000038062 | |||||
| Chain | 169 – 267 | 99 | Capsid protein C By similarity | PRO_0000038063 | |||||
| Chain | 268 – 494 | 227 | E(rns) glycoprotein By similarity | PRO_0000038064 | |||||
| Chain | 495 – 656 | 162 | Envelope glycoprotein E1 By similarity | PRO_0000038065 | |||||
| Chain | 657 – 1062 | 406 | Envelope glycoprotein E2 By similarity | PRO_0000038066 | |||||
| Chain | 1063 – 1132 | 70 | p7 By similarity | PRO_0000038067 | |||||
| Chain | 1133 – 2272 | 1140 | Non-structural protein 2-3 By similarity | PRO_0000038068 | |||||
| Chain | 1133 – 1589 | 457 | Cysteine protease NS2 By similarity | PRO_0000349362 | |||||
| Chain | 1590 – 2272 | 683 | Serine protease NS3 By similarity | PRO_0000038069 | |||||
| Chain | 2273 – 2336 | 64 | Non-structural protein 4A By similarity | PRO_0000038070 | |||||
| Chain | 2337 – 2683 | 347 | Non-structural protein 4B By similarity | PRO_0000038071 | |||||
| Chain | 2684 – 3180 | 497 | Non-structural protein 5A By similarity | PRO_0000038072 | |||||
| Chain | 3181 – 3898 | 718 | RNA-directed RNA polymerase By similarity | PRO_0000038073 | |||||
Regions | |||||||||
| Transmembrane | 1140 – 1164 | 25 | Helical; Potential | ||||||
| Transmembrane | 1189 – 1209 | 21 | Helical; Potential | ||||||
| Transmembrane | 1217 – 1237 | 21 | Helical; Potential | ||||||
| Transmembrane | 1247 – 1267 | 21 | Helical; Potential | ||||||
| Transmembrane | 1281 – 1301 | 21 | Helical; Potential | ||||||
| Transmembrane | 1360 – 1380 | 21 | Helical; Potential | ||||||
| Transmembrane | 1568 – 1588 | 21 | Helical; Potential | ||||||
| Domain | 1 – 168 | 168 | Peptidase C53 | ||||||
| Domain | 1441 – 1589 | 149 | Peptidase C74 | ||||||
| Domain | 1590 – 1763 | 174 | Peptidase S31 | ||||||
| Domain | 1802 – 1960 | 159 | Helicase ATP-binding | ||||||
| Domain | 1978 – 2179 | 202 | Helicase C-terminal | ||||||
| Domain | 3519 – 3642 | 124 | RdRp catalytic | ||||||
Sites | |||||||||
| Active site | 22 | 1 | For N-terminal protease activity By similarity | ||||||
| Active site | 49 | 1 | For N-terminal protease activity By similarity | ||||||
| Active site | 69 | 1 | For N-terminal protease activity By similarity | ||||||
| Active site | 1447 | 1 | For cysteine protease NS2 activity By similarity | ||||||
| Active site | 1461 | 1 | For cysteine protease NS2 activity By similarity | ||||||
| Active site | 1512 | 1 | For cysteine protease NS2 activity By similarity | ||||||
| Active site | 1658 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||
| Active site | 1695 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||
| Active site | 1752 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||
| Site | 168 – 169 | 2 | Cleavage; by autolysis By similarity | ||||||
| Site | 267 – 268 | 2 | Cleavage; by host signal peptidase By similarity | ||||||
| Site | 494 – 495 | 2 | Cleavage By similarity | ||||||
| Site | 656 – 657 | 2 | Cleavage; by host signal peptidase By similarity | ||||||
| Site | 1062 – 1063 | 2 | Cleavage; by host signal peptidase; partial By similarity | ||||||
| Site | 1132 – 1133 | 2 | Cleavage; by host signal peptidase By similarity | ||||||
| Site | 1589 – 1590 | 2 | Cleavage; partial; by cysetine protease NS2 By similarity | ||||||
| Site | 2272 – 2273 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||
| Site | 2336 – 2337 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||
| Site | 2683 – 2684 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||
| Site | 3180 – 3181 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||
Amino acid modifications | |||||||||
| Glycosylation | 157 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 269 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 278 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 332 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 362 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 410 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 425 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 500 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 594 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 805 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 810 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 874 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 918 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 949 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 1713 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 2134 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 2217 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 2419 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 2494 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 2787 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 2815 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 2891 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 3103 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 3211 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 3316 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 3689 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 3698 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
| Glycosylation | 3794 | 1 | N-linked (GlcNAc...); by host Potential | ||||||
Sequences
| ||||||||||||||||||
References
| [1] | "Molecular cloning and nucleotide sequence of hog cholera virus strain Brescia and mapping of the genomic region encoding envelope protein E1." Moormann R.J.M., Warmerdam P.A.M., van der Meer B., Schaaper W.M.M., Wensvoort G., Hulst M.M. Virology 177:184-198(1990) [PubMed: 2162104] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA]. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | M31768 Genomic RNA. Translation: AAA43843.1. |
| PIR | GNWVHB. A35317. |
3D structure databases | |
| ProteinModelPortal | P21530. |
| SMR | P21530. Positions 3273-3854. |
| ModBase | Search... |
Protocols and materials databases | |
| StructuralBiologyKnowledgebase | Search... |
Family and domain databases | |
| InterPro | IPR021824. Capsid-C_pestivirus. IPR014001. DEAD-like_helicase. IPR011492. DEAD_Flavivir. IPR001650. Helicase_C. IPR022120. Pept_C74_NS2-pestiV. IPR008751. Peptidase_C53. IPR000280. Peptidase_S31. IPR007094. RNA-dir_pol_PSvirus. IPR002166. RNA_pol_HCV. IPR001568. RNase_T2. IPR018188. RNase_T2_AS. [Graphical view] |
| Gene3D | G3DSA:3.90.730.10. RNase_T2. 1 hit. |
| Pfam | PF11889. DUF3409. 1 hit. PF07652. Flavi_DEAD. 1 hit. PF00271. Helicase_C. 1 hit. PF05550. Peptidase_C53. 1 hit. PF12387. Peptidase_C74. 1 hit. PF05578. Peptidase_S31. 1 hit. PF00998. RdRP_3. 1 hit. [Graphical view] |
| PRINTS | PR00729. CDVENDOPTASE. |
| ProDom | PD003091. Peptidase_C53. 1 hit. [Graphical view] [Entries sharing at least one domain] |
| SMART | SM00487. DEXDc. 1 hit. SM00490. HELICc. 1 hit. [Graphical view] |
| SUPFAM | SSF55895. RNase_T2. 1 hit. |
| PROSITE | PS51192. HELICASE_ATP_BIND_1. 1 hit. PS51194. HELICASE_CTER. 1 hit. PS51535. PESTIVIRUS_NS3PRO. 1 hit. PS50507. RDRP_SSRNA_POS. 1 hit. PS00531. RNASE_T2_2. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Entry information
| Entry name | POLG_CSFVB | ||||||||
| Accession | Primary (citable) accession number: P21530 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Viral Protein Annotation Program | ||||||||
Relevant documents
| Peptidase families Classification of peptidase families and list of entries |
| SIMILARITY comments Index of protein domains and families |

Clusters with