Q89277 (POLG_YEFVF) Reviewed, UniProtKB/Swiss-Prot
Last modified
November 16, 2011.
Version 87.
History...
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
Protein attributes
| Sequence length | 3411 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Inferred from homology |
General annotation (Comments)
| Function | Capsid protein C self-assembles to form an icosahedral capsid about 30 nm in diameter. The capsid encapsulates the genomic RNA By similarity. prM acts as a chaperone for envelope protein E during intracellular virion assembly by masking and inactivating envelope protein E fusion peptide. prM is matured in the last step of virion assembly, presumably to avoid catastrophic activation of the viral fusion peptide induced by the acidic pH of the trans-Golgi network. After cleavage by host furin, the pr peptide is released in the extracellular medium and small envelope protein M and envelope protein E homodimers are dissociated By similarity. Envelope protein E binding to host cell surface receptor is followed by virus internalization through clathrin-mediated endocytosis. Envelope protein E is subsequently involved in membrane fusion between virion and host late endosomes. Synthesized as an homodimer with prM which acts as a chaperone for envelope protein E. After cleavage of prM, envelope protein E dissociate from small envelope protein M and homodimerizes By similarity. Non-structural protein 1 is involved in virus replication and regulation of the innate immune response By similarity. Non-structural protein 2A may be involved viral RNA replication and capsid assembly Potential. Non-structural protein 2B is a required cofactor for the serine protease function of NS3 By similarity. Serine protease NS3 displays three enzymatic activities: serine protease, NTPase and RNA helicase. NS3 serine protease, in association with NS2B, performs its autocleavage and cleaves the polyprotein at dibasic sites in the cytoplasm: C-prM, NS2A-NS2B, NS2B-NS3, NS3-NS4A, NS4A-2K and NS4B-NS5. NS3 RNA helicase binds RNA and unwinds dsRNA in the 3' to 5' direction By similarity. Non-structural protein 4A induces host endoplasmic reticulum membrane rearrangements leading to the formation of virus-induced membranous vesicles hosting the dsRNA and polymerase, functionning as a replication complex. NS4A might also regulate the ATPase activity of the NS3 helicase By similarity. Peptide 2k functions as a signal peptide for NS4B and is required for the interferon antagonism activity of the latter By similarity. Non-structural protein 4B inhibits interferon (IFN)-induced host STAT1 phosphorylation and nuclear translocation, thereby preventing the establishment of cellular antiviral state by blocking the IFN-alpha/beta pathway By similarity. RNA-directed RNA polymerase NS5 replicates the viral (+) and (-) genome, and performs the capping of genomes in the cytoplasm. NS5 methylates viral RNA cap at guanine N-7 and ribose 2'-O positions. Besides its role in genome replication, also prevents the establishment of cellular antiviral state by blocking the interferon-alpha/beta (IFN-alpha/beta) signaling pathway By similarity. |
| Catalytic activity | Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds in which each of the Xaa can be either Arg or Lys and Yaa can be either Ser or Ala. Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1). NTP + H2O = NDP + phosphate. ATP + H2O = ADP + phosphate. S-adenosyl-L-methionine + G(5')pppR-RNA = S-adenosyl-L-homocysteine + m7G(5')pppR-RNA. S-adenosyl-L-methionine + m7G(5')pppR-RNA = S-adenosyl-L-homocysteine + m7G(5')pppRm-RNA. |
| Subunit structure | Capsid protein C forms homodimers. prM and envelope protein E form heterodimers in the endoplasmic reticulum and Golgi. In immature particles, there are 60 icosaedrally organized trimeric spikes on the surface. Each spike consists of three heterodimers of envelope protein M precursor (prM) and envelope protein E. NS1 forms homodimers as well as homohexamers when secreted. NS1 may interact with NS4A. NS3 and NS2B form a heterodimer. NS3 is the catalytic subunit, whereas NS2B strongly stimulates the latter, acting as a cofactor. In the absence of the NS2B, NS3 protease is unfolded and inactive. NS3 interacts with unphosphorylated NS5; this interaction stimulates NS5 guanylyltransferase activity By similarity. |
| Subcellular location | Capsid protein C: Virion Potential. Peptide pr: Secreted By similarity. Small envelope protein M: Virion membrane; Multi-pass membrane protein By similarity. Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. Envelope protein E: Virion membrane; Multi-pass membrane protein By similarity. Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. Non-structural protein 1: Secreted. Host endoplasmic reticulum membrane; Peripheral membrane protein; Lumenal side By similarity. Non-structural protein 2A-alpha: Host endoplasmic reticulum membrane; Multi-pass membrane protein Potential. Non-structural protein 2A: Host endoplasmic reticulum membrane; Multi-pass membrane protein Potential. Serine protease subunit NS2B: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Serine protease NS3: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Note: Remains non-covalently associated to NS3 protease By similarity. Non-structural protein 4A: Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. Note: Located in RE-associated vesicles hosting the replication complex. Non-structural protein 4B: Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. RNA-directed RNA polymerase NS5: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Host nucleus By similarity. Note: Located in RE-associated vesicles hosting the replication complex. |
| Domain | Transmembrane domains of the small envelope protein M and envelope protein E contains an endoplasmic reticulum retention signals By similarity. |
| Post-translational modification | Specific enzymatic cleavages in vivo yield mature proteins. The nascent protein C contains a C-terminal hydrophobic domain that act as a signal sequence for translocation of prM into the lumen of the ER. Mature protein C is cleaved at a site upstream of this hydrophobic domain by NS3. prM is cleaved in post-Golgi vesicles by a host furin, releasing the mature small envelope protein M, and peptide pr. Non-structural protein 2A-alpha, a C-terminally truncated form of non-structural protein 2A, results from partial cleavage by NS3. Specific enzymatic cleavages in vivo yield mature proteins Peptide 2K acts as a signal sequence and is removed from the N-terminus of NS4B by the host signal peptidase in the ER lumen. Signal cleavage at the 2K-4B site requires a prior NS3 protease-mediated cleavage at the 4A-2K site By similarity. RNA-directed RNA polymerase NS5 is phosphorylated on serines residues. This phosphorylation may trigger NS5 nuclear localization. Envelope protein E and non-structural protein 1 are N-glycosylated By similarity. |
| Sequence similarities | Contains 1 helicase ATP-binding domain. Contains 1 helicase C-terminal domain. Contains 1 peptidase S7 domain. Contains 1 RdRp catalytic domain. |
Ontologies
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 3411 | 3411 | Genome polyprotein | PRO_0000405159 | |||||||
| Chain | 1 – 101 | 101 | Capsid protein C By similarity | PRO_0000261515 | |||||||
| Propeptide | 102 – 121 | 20 | ER anchor for the protein C, removed in mature form by serine protease NS3 | PRO_0000261516 | |||||||
| Chain | 122 – 285 | 164 | prM By similarity | PRO_0000261517 | |||||||
| Chain | 122 – 210 | 89 | Peptide pr By similarity | PRO_0000261518 | |||||||
| Chain | 211 – 285 | 75 | Small envelope protein M By similarity | PRO_0000261519 | |||||||
| Chain | 286 – 778 | 493 | Envelope protein E By similarity | PRO_0000261520 | |||||||
| Chain | 779 – 1130 | 352 | Non-structural protein 1 By similarity | PRO_0000261521 | |||||||
| Chain | 1131 – 1354 | 224 | Non-structural protein 2A By similarity | PRO_0000261522 | |||||||
| Chain | 1131 – 1320 | 190 | Non-structural protein 2A-alpha By similarity | PRO_0000261523 | |||||||
| Chain | 1355 – 1484 | 130 | Serine protease subunit NS2B By similarity | PRO_0000261524 | |||||||
| Chain | 1485 – 2107 | 623 | Serine protease NS3 By similarity | PRO_0000261525 | |||||||
| Chain | 2108 – 2233 | 126 | Non-structural protein 4A By similarity | PRO_0000261526 | |||||||
| Peptide | 2234 – 2256 | 23 | Peptide 2k | PRO_0000261527 | |||||||
| Chain | 2257 – 2506 | 250 | Non-structural protein 4B By similarity | PRO_0000261528 | |||||||
| Chain | 2507 – 3411 | 905 | RNA-directed RNA polymerase NS5 By similarity | PRO_0000261529 | |||||||
Regions | |||||||||||
| Topological domain | 1 – 104 | 104 | Cytoplasmic Potential | ||||||||
| Transmembrane | 105 – 125 | 21 | Helical; Potential | ||||||||
| Topological domain | 126 – 244 | 119 | Extracellular Potential | ||||||||
| Transmembrane | 245 – 265 | 21 | Helical; Potential | ||||||||
| Topological domain | 266 – 270 | 5 | Cytoplasmic Potential | ||||||||
| Transmembrane | 271 – 285 | 15 | Helical; Potential | ||||||||
| Topological domain | 286 – 730 | 445 | Extracellular Potential | ||||||||
| Intramembrane | 731 – 751 | 21 | Helical; Potential | ||||||||
| Topological domain | 752 – 757 | 6 | Extracellular Potential | ||||||||
| Intramembrane | 758 – 778 | 21 | Helical; Potential | ||||||||
| Topological domain | 779 – 1130 | 352 | Extracellular Potential | ||||||||
| Transmembrane | 1131 – 1151 | 21 | Helical; Potential | ||||||||
| Topological domain | 1152 – 1160 | 9 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1161 – 1181 | 21 | Helical; Potential | ||||||||
| Topological domain | 1182 – 1201 | 20 | Lumenal Potential | ||||||||
| Transmembrane | 1202 – 1222 | 21 | Helical; Potential | ||||||||
| Topological domain | 1223 – 1231 | 9 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1232 – 1252 | 21 | Helical; Potential | ||||||||
| Topological domain | 1253 – 1262 | 10 | Lumenal Potential | ||||||||
| Transmembrane | 1263 – 1285 | 23 | Helical; Potential | ||||||||
| Topological domain | 1286 – 1355 | 70 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1356 – 1376 | 21 | Helical; Potential | ||||||||
| Topological domain | 1377 – 1378 | 2 | Lumenal Potential | ||||||||
| Transmembrane | 1379 – 1399 | 21 | Helical; Potential | ||||||||
| Topological domain | 1400 – 1456 | 57 | Cytoplasmic Potential | ||||||||
| Intramembrane | 1457 – 1477 | 21 | Helical; Potential | ||||||||
| Topological domain | 1478 – 2157 | 680 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2158 – 2178 | 21 | Helical; Potential | ||||||||
| Topological domain | 2179 – 2186 | 8 | Lumenal Potential | ||||||||
| Intramembrane | 2187 – 2207 | 21 | Helical; Potential | ||||||||
| Topological domain | 2208 – 2209 | 2 | Lumenal Potential | ||||||||
| Transmembrane | 2210 – 2230 | 21 | Helical; Potential | ||||||||
| Topological domain | 2231 – 2241 | 11 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2242 – 2262 | 21 | Helical; Note=Signal for NS4B; Potential | ||||||||
| Topological domain | 2263 – 2293 | 31 | Lumenal Potential | ||||||||
| Intramembrane | 2294 – 2314 | 21 | Helical; Potential | ||||||||
| Topological domain | 2315 – 2338 | 24 | Lumenal Potential | ||||||||
| Intramembrane | 2339 – 2359 | 21 | Helical; Potential | ||||||||
| Topological domain | 2360 | 1 | Lumenal Potential | ||||||||
| Transmembrane | 2361 – 2380 | 20 | Helical; Potential | ||||||||
| Topological domain | 2381 – 2421 | 41 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2422 – 2442 | 21 | Helical; Potential | ||||||||
| Topological domain | 2443 – 2445 | 3 | Lumenal Potential | ||||||||
| Transmembrane | 2446 – 2466 | 21 | Helical; Potential | ||||||||
| Topological domain | 2467 – 3411 | 945 | Cytoplasmic Potential | ||||||||
| Domain | 1485 – 1665 | 181 | Peptidase S7 | ||||||||
| Domain | 1669 – 1825 | 157 | Helicase ATP-binding | ||||||||
| Domain | 1820 – 1997 | 178 | Helicase C-terminal | ||||||||
| Domain | 3035 – 3187 | 153 | RdRp catalytic | ||||||||
| Nucleotide binding | 1682 – 1689 | 8 | ATP Potential | ||||||||
| Region | 30 – 74 | 45 | Hydrophobic; homodimerization of capsid protein C By similarity | ||||||||
| Region | 383 – 396 | 14 | Involved in fusion By similarity | ||||||||
| Region | 1407 – 1446 | 40 | Interacts with and activates NS3 protease By similarity | ||||||||
| Motif | 1773 – 1776 | 4 | DEAH box | ||||||||
| Motif | 2878 – 2911 | 34 | Nuclear localization signal By similarity | ||||||||
| Compositional bias | 2656 – 2660 | 5 | Poly-Ser | ||||||||
Sites | |||||||||||
| Active site | 1537 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 1561 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 1622 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 2567 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Active site | 2652 | 1 | For 2'-O-methyltransferase and N-7 methyltransferase activity By similarity | ||||||||
| Active site | 2688 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Active site | 2724 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Site | 101 – 102 | 2 | Cleavage; by viral protease NS3 Potential | ||||||||
| Site | 121 – 122 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 210 – 211 | 2 | Cleavage; by host furin Potential | ||||||||
| Site | 285 – 286 | 2 | Cleavage; by host signal peptidase Potential | ||||||||
| Site | 778 – 779 | 2 | Cleavage; by host signal peptidase Potential | ||||||||
| Site | 1130 – 1131 | 2 | Cleavage; by host Potential | ||||||||
| Site | 1354 – 1355 | 2 | Cleavage; by viral protease NS3 Potential | ||||||||
| Site | 1484 – 1485 | 2 | Cleavage; by autolysis Potential | ||||||||
| Site | 2107 – 2108 | 2 | Cleavage; by autolysis Potential | ||||||||
| Site | 2233 – 2234 | 2 | Cleavage; by viral protease NS3 Potential | ||||||||
| Site | 2256 – 2257 | 2 | Cleavage; by host signal peptidase Potential | ||||||||
| Site | 2506 – 2507 | 2 | Cleavage; by viral protease NS3 Potential | ||||||||
Amino acid modifications | |||||||||||
| Glycosylation | 134 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 150 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 908 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 986 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Disulfide bond | 288 ↔ 315 | By similarity | |||||||||
| Disulfide bond | 345 ↔ 401 | By similarity | |||||||||
| Disulfide bond | 359 ↔ 390 | By similarity | |||||||||
| Disulfide bond | 377 ↔ 406 | By similarity | |||||||||
| Disulfide bond | 467 ↔ 568 | By similarity | |||||||||
| Disulfide bond | 585 ↔ 615 | By similarity | |||||||||
Sequences
| ||||||||||||||||||
References
| [1] | "Comparison of the genomes of the wild-type French viscerotropic strain of yellow fever virus with its vaccine derivative French neurotropic vaccine." Wang E., Ryman K.D., Jennings A.D., Wood D.J., Taffs F., Minor P.D., Sanders P.G., Barrett A.D. J. Gen. Virol. 76:2749-2755(1995) [PubMed: 7595382] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA]. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | U21055 Genomic RNA. Translation: AAA99712.1. |
3D structure databases | |
| HSSP | HSSP built from PDB template 1YKS based on UniProtKB P19901. |
| ProteinModelPortal | Q89277. |
| SMR | Q89277. Positions 572-683, 1490-1667, 1671-2107, 2512-2772, 2780-3399. |
| ModBase | Search... |
Protocols and materials databases | |
| StructuralBiologyKnowledgebase | Search... |
Family and domain databases | |
| InterPro | IPR014001. DEAD-like_helicase. IPR011492. DEAD_Flavivir. IPR000069. Env_glycoprot_M_flavivir. IPR013756. Flav_glyE_cen_dom_subdom2. IPR013754. Flav_glyE_dim. IPR001122. Flavi_capsidC. IPR001157. Flavi_NS1. IPR000752. Flavi_NS2A. IPR000487. Flavi_NS2B. IPR000404. Flavi_NS4A. IPR001528. Flavi_NS4B. IPR002535. Flavi_propep. IPR000336. Flv_glyE_Ig-like. IPR014412. Gen_Poly_FLV. IPR011999. GlycoprotE_cen/dimer_Flavivir. IPR011998. GlycoprotE_cen/dimer_vir. IPR001650. Helicase_C. IPR014756. Ig_E-set. IPR009003. Pept_cys/ser_Trypsin-like. IPR001850. Peptidase_S7. IPR000208. RNA-dir_pol_flavivirus. IPR007094. RNA-dir_pol_PSvirus. IPR002877. rRNA_MeTrfase_RrmJ/FtsJ. [Graphical view] |
| Gene3D | G3DSA:3.30.67.10. Flav_glyE_cen_2. 1 hit. G3DSA:2.60.98.10. Flav_glyE_dim. 3 hits. G3DSA:2.60.40.350. Flv_glyE_Ig-like. 1 hit. |
| Pfam | PF01003. Flavi_capsid. 1 hit. PF07652. Flavi_DEAD. 1 hit. PF02832. Flavi_glycop_C. 1 hit. PF00869. Flavi_glycoprot. 1 hit. PF01004. Flavi_M. 1 hit. PF00948. Flavi_NS1. 1 hit. PF01005. Flavi_NS2A. 1 hit. PF01002. Flavi_NS2B. 1 hit. PF01350. Flavi_NS4A. 1 hit. PF01349. Flavi_NS4B. 1 hit. PF00972. Flavi_NS5. 1 hit. PF01570. Flavi_propep. 1 hit. PF01728. FtsJ. 1 hit. PF00271. Helicase_C. 1 hit. PF00949. Peptidase_S7. 1 hit. [Graphical view] |
| PIRSF | PIRSF003817. Gen_Poly_FLV. 1 hit. |
| SMART | SM00487. DEXDc. 1 hit. SM00490. HELICc. 1 hit. [Graphical view] |
| SUPFAM | SSF56983. Flavi_glycoprotE. 1 hit. SSF81296. Ig_E-set. 1 hit. SSF50494. Pept_Ser_Cys. 1 hit. |
| PROSITE | PS51527. FLAVIVIRUS_NS2B. 1 hit. PS51528. FLAVIVIRUS_NS3PRO. 1 hit. PS51192. HELICASE_ATP_BIND_1. 1 hit. PS51194. HELICASE_CTER. False negative. PS50507. RDRP_SSRNA_POS. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Entry information
| Entry name | POLG_YEFVF | ||||||||
| Accession | Primary (citable) accession number: Q89277 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Viral Protein Annotation Program | ||||||||
Relevant documents
| Peptidase families Classification of peptidase families and list of entries |
| SIMILARITY comments Index of protein domains and families |

Clusters with