Q9YRV3 (POLG_YEFVT) Reviewed, UniProtKB/Swiss-Prot
Last modified
November 16, 2011.
Version 83.
History...
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
Protein attributes
| Sequence length | 3411 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Inferred from homology |
General annotation (Comments)
| Function | Capsid protein C self-assembles to form an icosahedral capsid about 30 nm in diameter. The capsid encapsulates the genomic RNA By similarity. prM acts as a chaperone for envelope protein E during intracellular virion assembly by masking and inactivating envelope protein E fusion peptide. prM is matured in the last step of virion assembly, presumably to avoid catastrophic activation of the viral fusion peptide induced by the acidic pH of the trans-Golgi network. After cleavage by host furin, the pr peptide is released in the extracellular medium and small envelope protein M and envelope protein E homodimers are dissociated By similarity. Envelope protein E binding to host cell surface receptor is followed by virus internalization through clathrin-mediated endocytosis. Envelope protein E is subsequently involved in membrane fusion between virion and host late endosomes. Synthesized as an homodimer with prM which acts as a chaperone for envelope protein E. After cleavage of prM, envelope protein E dissociate from small envelope protein M and homodimerizes By similarity. Non-structural protein 1 is involved in virus replication and regulation of the innate immune response By similarity. Non-structural protein 2A may be involved viral RNA replication and capsid assembly Potential. Non-structural protein 2B is a required cofactor for the serine protease function of NS3 By similarity. Serine protease NS3 displays three enzymatic activities: serine protease, NTPase and RNA helicase. NS3 serine protease, in association with NS2B, performs its autocleavage and cleaves the polyprotein at dibasic sites in the cytoplasm: C-prM, NS2A-NS2B, NS2B-NS3, NS3-NS4A, NS4A-2K and NS4B-NS5. NS3 RNA helicase binds RNA and unwinds dsRNA in the 3' to 5' direction By similarity. Non-structural protein 4A induces host endoplasmic reticulum membrane rearrangements leading to the formation of virus-induced membranous vesicles hosting the dsRNA and polymerase, functionning as a replication complex. NS4A might also regulate the ATPase activity of the NS3 helicase By similarity. Peptide 2k functions as a signal peptide for NS4B and is required for the interferon antagonism activity of the latter By similarity. Non-structural protein 4B inhibits interferon (IFN)-induced host STAT1 phosphorylation and nuclear translocation, thereby preventing the establishment of cellular antiviral state by blocking the IFN-alpha/beta pathway By similarity. RNA-directed RNA polymerase NS5 replicates the viral (+) and (-) genome, and performs the capping of genomes in the cytoplasm. NS5 methylates viral RNA cap at guanine N-7 and ribose 2'-O positions. Besides its role in genome replication, also prevents the establishment of cellular antiviral state by blocking the interferon-alpha/beta (IFN-alpha/beta) signaling pathway By similarity. |
| Catalytic activity | Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds in which each of the Xaa can be either Arg or Lys and Yaa can be either Ser or Ala. Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1). NTP + H2O = NDP + phosphate. ATP + H2O = ADP + phosphate. S-adenosyl-L-methionine + G(5')pppR-RNA = S-adenosyl-L-homocysteine + m7G(5')pppR-RNA. S-adenosyl-L-methionine + m7G(5')pppR-RNA = S-adenosyl-L-homocysteine + m7G(5')pppRm-RNA. |
| Subunit structure | Capsid protein C forms homodimers. prM and envelope protein E form heterodimers in the endoplasmic reticulum and Golgi. In immature particles, there are 60 icosaedrally organized trimeric spikes on the surface. Each spike consists of three heterodimers of envelope protein M precursor (prM) and envelope protein E. NS1 forms homodimers as well as homohexamers when secreted. NS1 may interact with NS4A. NS3 and NS2B form a heterodimer. NS3 is the catalytic subunit, whereas NS2B strongly stimulates the latter, acting as a cofactor. In the absence of the NS2B, NS3 protease is unfolded and inactive. NS3 interacts with unphosphorylated NS5; this interaction stimulates NS5 guanylyltransferase activity By similarity. |
| Subcellular location | Capsid protein C: Virion Potential. Peptide pr: Secreted By similarity. Small envelope protein M: Virion membrane; Multi-pass membrane protein By similarity. Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. Envelope protein E: Virion membrane; Multi-pass membrane protein By similarity. Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. Non-structural protein 1: Secreted. Host endoplasmic reticulum membrane; Peripheral membrane protein; Lumenal side By similarity. Non-structural protein 2A-alpha: Host endoplasmic reticulum membrane; Multi-pass membrane protein Potential. Non-structural protein 2A: Host endoplasmic reticulum membrane; Multi-pass membrane protein Potential. Serine protease subunit NS2B: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Serine protease NS3: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Note: Remains non-covalently associated to NS3 protease By similarity. Non-structural protein 4A: Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. Note: Located in RE-associated vesicles hosting the replication complex. Non-structural protein 4B: Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. RNA-directed RNA polymerase NS5: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Host nucleus By similarity. Note: Located in RE-associated vesicles hosting the replication complex. |
| Domain | Transmembrane domains of the small envelope protein M and envelope protein E contains an endoplasmic reticulum retention signals By similarity. |
| Post-translational modification | Specific enzymatic cleavages in vivo yield mature proteins. The nascent protein C contains a C-terminal hydrophobic domain that act as a signal sequence for translocation of prM into the lumen of the ER. Mature protein C is cleaved at a site upstream of this hydrophobic domain by NS3. prM is cleaved in post-Golgi vesicles by a host furin, releasing the mature small envelope protein M, and peptide pr. Non-structural protein 2A-alpha, a C-terminally truncated form of non-structural protein 2A, results from partial cleavage by NS3. Specific enzymatic cleavages in vivo yield mature proteins Peptide 2K acts as a signal sequence and is removed from the N-terminus of NS4B by the host signal peptidase in the ER lumen. Signal cleavage at the 2K-4B site requires a prior NS3 protease-mediated cleavage at the 4A-2K site By similarity. RNA-directed RNA polymerase NS5 is phosphorylated on serines residues. This phosphorylation may trigger NS5 nuclear localization. Envelope protein E and non-structural protein 1 are N-glycosylated By similarity. |
| Sequence similarities | Contains 1 helicase ATP-binding domain. Contains 1 helicase C-terminal domain. Contains 1 peptidase S7 domain. Contains 1 RdRp catalytic domain. |
Ontologies
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 3411 | 3411 | Genome polyprotein | PRO_0000405157 | |||||||
| Chain | 1 – 101 | 101 | Capsid protein C By similarity | PRO_0000261485 | |||||||
| Propeptide | 102 – 121 | 20 | ER anchor for the protein C, removed in mature form by serine protease NS3 | PRO_0000261486 | |||||||
| Chain | 122 – 285 | 164 | prM By similarity | PRO_0000261487 | |||||||
| Chain | 122 – 210 | 89 | Peptide pr By similarity | PRO_0000261488 | |||||||
| Chain | 211 – 285 | 75 | Small envelope protein M By similarity | PRO_0000261489 | |||||||
| Chain | 286 – 778 | 493 | Envelope protein E By similarity | PRO_0000261490 | |||||||
| Chain | 779 – 1130 | 352 | Non-structural protein 1 By similarity | PRO_0000261491 | |||||||
| Chain | 1131 – 1354 | 224 | Non-structural protein 2A By similarity | PRO_0000261492 | |||||||
| Chain | 1131 – 1320 | 190 | Non-structural protein 2A-alpha By similarity | PRO_0000261493 | |||||||
| Chain | 1355 – 1484 | 130 | Serine protease subunit NS2B By similarity | PRO_0000261494 | |||||||
| Chain | 1485 – 2107 | 623 | Serine protease NS3 By similarity | PRO_0000261495 | |||||||
| Chain | 2108 – 2233 | 126 | Non-structural protein 4A By similarity | PRO_0000261496 | |||||||
| Peptide | 2234 – 2256 | 23 | Peptide 2k | PRO_0000261497 | |||||||
| Chain | 2257 – 2506 | 250 | Non-structural protein 4B By similarity | PRO_0000261498 | |||||||
| Chain | 2507 – 3411 | 905 | RNA-directed RNA polymerase NS5 By similarity | PRO_0000261499 | |||||||
Regions | |||||||||||
| Topological domain | 1 – 104 | 104 | Cytoplasmic Potential | ||||||||
| Transmembrane | 105 – 125 | 21 | Helical; Potential | ||||||||
| Topological domain | 126 – 244 | 119 | Extracellular Potential | ||||||||
| Transmembrane | 245 – 265 | 21 | Helical; Potential | ||||||||
| Topological domain | 266 – 270 | 5 | Cytoplasmic Potential | ||||||||
| Transmembrane | 271 – 285 | 15 | Helical; Potential | ||||||||
| Topological domain | 286 – 730 | 445 | Extracellular Potential | ||||||||
| Intramembrane | 731 – 751 | 21 | Helical; Potential | ||||||||
| Topological domain | 752 – 757 | 6 | Extracellular Potential | ||||||||
| Intramembrane | 758 – 778 | 21 | Helical; Potential | ||||||||
| Topological domain | 779 – 1130 | 352 | Extracellular Potential | ||||||||
| Transmembrane | 1131 – 1151 | 21 | Helical; Potential | ||||||||
| Topological domain | 1152 – 1160 | 9 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1161 – 1181 | 21 | Helical; Potential | ||||||||
| Topological domain | 1182 – 1201 | 20 | Lumenal Potential | ||||||||
| Transmembrane | 1202 – 1222 | 21 | Helical; Potential | ||||||||
| Topological domain | 1223 – 1231 | 9 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1232 – 1252 | 21 | Helical; Potential | ||||||||
| Topological domain | 1253 – 1262 | 10 | Lumenal Potential | ||||||||
| Transmembrane | 1263 – 1285 | 23 | Helical; Potential | ||||||||
| Topological domain | 1286 – 1355 | 70 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1356 – 1376 | 21 | Helical; Potential | ||||||||
| Topological domain | 1377 – 1378 | 2 | Lumenal Potential | ||||||||
| Transmembrane | 1379 – 1399 | 21 | Helical; Potential | ||||||||
| Topological domain | 1400 – 1456 | 57 | Cytoplasmic Potential | ||||||||
| Intramembrane | 1457 – 1477 | 21 | Helical; Potential | ||||||||
| Topological domain | 1478 – 2157 | 680 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2158 – 2178 | 21 | Helical; Potential | ||||||||
| Topological domain | 2179 – 2186 | 8 | Lumenal Potential | ||||||||
| Intramembrane | 2187 – 2207 | 21 | Helical; Potential | ||||||||
| Topological domain | 2208 – 2209 | 2 | Lumenal Potential | ||||||||
| Transmembrane | 2210 – 2230 | 21 | Helical; Potential | ||||||||
| Topological domain | 2231 – 2241 | 11 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2242 – 2262 | 21 | Helical; Note=Signal for NS4B; Potential | ||||||||
| Topological domain | 2263 – 2293 | 31 | Lumenal Potential | ||||||||
| Intramembrane | 2294 – 2314 | 21 | Helical; Potential | ||||||||
| Topological domain | 2315 – 2338 | 24 | Lumenal Potential | ||||||||
| Intramembrane | 2339 – 2359 | 21 | Helical; Potential | ||||||||
| Topological domain | 2360 | 1 | Lumenal Potential | ||||||||
| Transmembrane | 2361 – 2380 | 20 | Helical; Potential | ||||||||
| Topological domain | 2381 – 2421 | 41 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2422 – 2442 | 21 | Helical; Potential | ||||||||
| Topological domain | 2443 – 2445 | 3 | Lumenal Potential | ||||||||
| Transmembrane | 2446 – 2466 | 21 | Helical; Potential | ||||||||
| Topological domain | 2467 – 3411 | 945 | Cytoplasmic Potential | ||||||||
| Domain | 1485 – 1665 | 181 | Peptidase S7 | ||||||||
| Domain | 1669 – 1825 | 157 | Helicase ATP-binding | ||||||||
| Domain | 1820 – 1997 | 178 | Helicase C-terminal | ||||||||
| Domain | 3035 – 3187 | 153 | RdRp catalytic | ||||||||
| Nucleotide binding | 1682 – 1689 | 8 | ATP Potential | ||||||||
| Region | 30 – 74 | 45 | Hydrophobic; homodimerization of capsid protein C By similarity | ||||||||
| Region | 383 – 396 | 14 | Involved in fusion By similarity | ||||||||
| Region | 1407 – 1446 | 40 | Interacts with and activates NS3 protease By similarity | ||||||||
| Motif | 1773 – 1776 | 4 | DEAH box | ||||||||
| Motif | 2878 – 2911 | 34 | Nuclear localization signal By similarity | ||||||||
| Compositional bias | 2656 – 2660 | 5 | Poly-Ser | ||||||||
Sites | |||||||||||
| Active site | 1537 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 1561 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 1622 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 2567 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Active site | 2652 | 1 | For 2'-O-methyltransferase and N-7 methyltransferase activity By similarity | ||||||||
| Active site | 2688 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Active site | 2724 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Site | 101 – 102 | 2 | Cleavage; by viral protease NS3 Potential | ||||||||
| Site | 121 – 122 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 210 – 211 | 2 | Cleavage; by host furin Potential | ||||||||
| Site | 285 – 286 | 2 | Cleavage; by host signal peptidase Potential | ||||||||
| Site | 778 – 779 | 2 | Cleavage; by host signal peptidase Potential | ||||||||
| Site | 1130 – 1131 | 2 | Cleavage; by host Potential | ||||||||
| Site | 1354 – 1355 | 2 | Cleavage; by viral protease NS3 Potential | ||||||||
| Site | 1484 – 1485 | 2 | Cleavage; by autolysis Potential | ||||||||
| Site | 2107 – 2108 | 2 | Cleavage; by autolysis Potential | ||||||||
| Site | 2233 – 2234 | 2 | Cleavage; by viral protease NS3 Potential | ||||||||
| Site | 2256 – 2257 | 2 | Cleavage; by host signal peptidase Potential | ||||||||
| Site | 2506 – 2507 | 2 | Cleavage; by viral protease NS3 Potential | ||||||||
Amino acid modifications | |||||||||||
| Glycosylation | 134 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 150 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 908 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 986 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Disulfide bond | 288 ↔ 315 | By similarity | |||||||||
| Disulfide bond | 345 ↔ 401 | By similarity | |||||||||
| Disulfide bond | 359 ↔ 390 | By similarity | |||||||||
| Disulfide bond | 377 ↔ 406 | By similarity | |||||||||
| Disulfide bond | 467 ↔ 568 | By similarity | |||||||||
| Disulfide bond | 585 ↔ 615 | By similarity | |||||||||
Sequences
| ||||||||||||||||||
References
| [1] | "Complete nucleotide sequence and phylogeny of an American strain of yellow fever virus, TRINID79A." Pisano M.R., Mercier V., Deubel V., Tolou H. Arch. Virol. 144:1837-1843(1999) [PubMed: 10542030] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA]. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | AF094612 Genomic RNA. Translation: AAC72235.1. |
3D structure databases | |
| HSSP | HSSP built from PDB template 1YKS based on UniProtKB P19901. |
| ProteinModelPortal | Q9YRV3. |
| SMR | Q9YRV3. Positions 572-683, 1490-1667, 1671-2107, 2512-2772, 2780-3399. |
| ModBase | Search... |
Protein family/group databases | |
| MEROPS | S07.001. |
Protocols and materials databases | |
| StructuralBiologyKnowledgebase | Search... |
Family and domain databases | |
| InterPro | IPR014001. DEAD-like_helicase. IPR011492. DEAD_Flavivir. IPR000069. Env_glycoprot_M_flavivir. IPR013756. Flav_glyE_cen_dom_subdom2. IPR013754. Flav_glyE_dim. IPR001122. Flavi_capsidC. IPR001157. Flavi_NS1. IPR000752. Flavi_NS2A. IPR000487. Flavi_NS2B. IPR000404. Flavi_NS4A. IPR001528. Flavi_NS4B. IPR002535. Flavi_propep. IPR000336. Flv_glyE_Ig-like. IPR014412. Gen_Poly_FLV. IPR011999. GlycoprotE_cen/dimer_Flavivir. IPR011998. GlycoprotE_cen/dimer_vir. IPR001650. Helicase_C. IPR014756. Ig_E-set. IPR009003. Pept_cys/ser_Trypsin-like. IPR001850. Peptidase_S7. IPR000208. RNA-dir_pol_flavivirus. IPR007094. RNA-dir_pol_PSvirus. IPR002877. rRNA_MeTrfase_RrmJ/FtsJ. [Graphical view] |
| Gene3D | G3DSA:3.30.67.10. Flav_glyE_cen_2. 1 hit. G3DSA:2.60.98.10. Flav_glyE_dim. 3 hits. G3DSA:2.60.40.350. Flv_glyE_Ig-like. 1 hit. |
| Pfam | PF01003. Flavi_capsid. 1 hit. PF07652. Flavi_DEAD. 1 hit. PF02832. Flavi_glycop_C. 1 hit. PF00869. Flavi_glycoprot. 1 hit. PF01004. Flavi_M. 1 hit. PF00948. Flavi_NS1. 1 hit. PF01005. Flavi_NS2A. 1 hit. PF01002. Flavi_NS2B. 1 hit. PF01350. Flavi_NS4A. 1 hit. PF01349. Flavi_NS4B. 1 hit. PF00972. Flavi_NS5. 1 hit. PF01570. Flavi_propep. 1 hit. PF01728. FtsJ. 1 hit. PF00271. Helicase_C. 1 hit. PF00949. Peptidase_S7. 1 hit. [Graphical view] |
| PIRSF | PIRSF003817. Gen_Poly_FLV. 1 hit. |
| SMART | SM00487. DEXDc. 1 hit. SM00490. HELICc. 1 hit. [Graphical view] |
| SUPFAM | SSF56983. Flavi_glycoprotE. 1 hit. SSF81296. Ig_E-set. 1 hit. SSF50494. Pept_Ser_Cys. 1 hit. |
| PROSITE | PS51527. FLAVIVIRUS_NS2B. 1 hit. PS51528. FLAVIVIRUS_NS3PRO. 1 hit. PS51192. HELICASE_ATP_BIND_1. 1 hit. PS51194. HELICASE_CTER. False negative. PS50507. RDRP_SSRNA_POS. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Entry information
| Entry name | POLG_YEFVT | ||||||||
| Accession | Primary (citable) accession number: Q9YRV3 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Viral Protein Annotation Program | ||||||||
Relevant documents
| Peptidase families Classification of peptidase families and list of entries |
| SIMILARITY comments Index of protein domains and families |

Clusters with