Reviewed,
UniProtKB/Swiss-Prot P17763 (POLG_DEN1W)
Last modified
June 16, 2009.
Version 84.
History...
Clusters with 100%,
90%,
50% identity |
Documents (2) |
Third-party data |
Customize display | text xml rdf/xml gff fasta |
Names and origin
| Protein names | Recommended name: Genome polyprotein Cleaved into the following 14 chains: 1- Recommended name: Protein C Alternative name(s): Core protein Capsid protein 2- Recommended name: prM 3- Recommended name: Peptide pr 4- Recommended name: Small envelope protein M Alternative name(s): Matrix protein 5- Recommended name: Envelope protein E 6- Recommended name: Non-structural protein 1 Short name=NS1 7- Recommended name: Non-structural protein 2A Short name=NS2A 8- Recommended name: Non-structural protein 2A-alpha Short name=NS2A-alpha 9- Recommended name: Serine protease subunit NS2B Alternative name(s): Non-structural protein 2B 10- Recommended name: Serine protease subunit NS3 EC=3.4.21.91 Alternative name(s): Non-structural protein 3 11- Recommended name: Non-structural protein 4A Short name=NS4A 12- Recommended name: Peptide 2k 13- Recommended name: Non-structural protein 4B Short name=NS4B 14- Recommended name: RNA-directed RNA polymerase NS5 EC=2.7.7.48 EC=2.1.1.56 Alternative name(s): Non-structural protein 5 |
| Organism | Dengue virus type 1 (strain Nauru/West Pac/1974) (DENV-1) [Complete proteome] |
| Taxonomic identifier | 11059 [NCBI] |
| Taxonomic lineage | Viruses › ssRNA positive-strand viruses, no DNA stage › Flaviviridae › Flavivirus › Dengue virus group |
| Virus host | Aedes aegypti (Yellowfever mosquito) (Culex aegypti) [TaxID: 7159] Homo sapiens (Human) [TaxID: 9606] |
Protein attributes
| Sequence length | 3392 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Inferred from homology. |
General annotation (Comments)
| Function | Protein C packages viral RNA to form a viral nucleocapsid, and promotes virion budding By similarity. prM acts as a chaperone for envelope protein E during intracellular virion assembly by masking and inactivating envelope protein E fusion peptide. prM is matured in the last step of virion assembly, presumably to avoid catastrophic activation of the viral fusion peptide induced by the acidic pH of the trans-Golgi network. After cleavage by host furin, the pr peptide is released in the extracellular medium and small envelope protein M and envelope protein E homodimers are dissociated By similarity. Envelope protein E binds cell surface receptor and is involved in membrane fusion between virion and target cell. Synthesized as an homodimer with prM which acts as a chaperone for envelope protein E. After cleavage of prM, envelope protein E dissociate from small envelope protein M and homodimerizes By similarity. Non-structural protein 1 is slowly secreted from mammalian cells, but not from mosquito cells. Secreted form elicits protective immune response and plays an essential role in RNA replication. Soluble and membrane-associated NS1 may activate human complement and induce host vascular leakage. This effect might explain the clinical manifestations of dengue hemorrhagic fever and dengue shock syndrome By similarity. Non-structural protein 2B is a required cofactor for the serine protease function of NS3 By similarity. Serine protease NS3 displays three enzymatic activities: serine protease, NTPase and RNA helicase. NS3 serine protease, in association with NS2B, cleaves the polyprotein at dibasic sites in the cytoplasm: C-prM, NS2A-NS2B, NS2B-NS3, NS3-NS4A, NS4A-2K and NS4B-NS5. NS3 RNA helicase binds RNA and unwinds dsRNA in the 3' to 5' direction By similarity. Non-structural protein 4A plays a role in RNA replication. Enhances inhibition of cell antiviral response by non-structural protein 4B By similarity. Non-structural protein 4B prevent the establishment of cellular antiviral state by blocking the interferon-alpha/beta (IFN-alpha/beta) and IFN-gamma signaling pathways By similarity. RNA-directed RNA polymerase NS5 replicates the viral (+) and (-) genome, and assure the capping of genomes in the cytoplasm. May be involved in methylation of 5'RNA cap structure By similarity. |
| Catalytic activity | Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds in which each of the Xaa can be either Arg or Lys and Yaa can be either Ser or Ala. Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1). S-adenosyl-L-methionine + G(5')pppR-RNA = S-adenosyl-L-homocysteine + m7G(5')pppR-RNA. |
| Subunit structure | prM and envelope protein E form heterodimers in the endoplasmic reticulum and Golgi. Envelope protein E forms homodimers. NS1 forms homodimers as well as homohexamers when secreted. NS1 may interact with NS4A. NS3 and NS2B form an heterodimer. NS3 interacts with unphosphorylated NS5 By similarity. |
| Subcellular location | Protein C: Virion By similarity. Peptide pr: Secreted By similarity. Small envelope protein M: Virion membrane; Single-pass type I membrane protein By similarity. Envelope protein E: Virion membrane; Single-pass type I membrane protein By similarity. Non-structural protein 1: Secreted. Host endoplasmic reticulum membrane; Peripheral membrane protein; Lumenal side By similarity. Non-structural protein 2A-alpha: Host endoplasmic reticulum membrane By similarity. Non-structural protein 2A: Host endoplasmic reticulum membrane By similarity. Serine protease subunit NS2B: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Serine protease subunit NS3: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Non-structural protein 4A: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Non-structural protein 4B: Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. Note: The C-terminal transmembrane domain of non-structural protein 4B is presumably reoriented after cleavage on the lumenal side By similarity. RNA-directed RNA polymerase NS5: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side. Host nucleus By similarity. |
| Domain | Transmembrane domains of the small envelope protein M and envelope protein E contains an endoplasmic reticulum retention signals By similarity. |
| Post-translational modification | Specific enzymatic cleavages in vivo yield mature proteins. The nascent protein C contains a C-terminal hydrophobic domain that act as a signal sequence for translocation of prM into the lumen of the ER. Mature protein C is cleaved at a site upstream of this hydrophobic domain by NS3. prM is cleaved in post-Golgi vesicles by a host furin, releasing the mature small envelope protein M, and peptide pr. Non-structural protein 2A-alpha, a C-terminally truncated form of non-structural protein 2A, results from partial cleavage by NS3 By similarity. RNA-directed RNA polymerase NS5 is phosphorylated on serines residues. This phosphorylation may trigger NS5 nuclear localization By similarity. Envelope protein E and non-structural protein 1 are N-glycosylated By similarity. |
| Miscellaneous | The virion is assembled in the endoplasmic reticulum lumen, transported by vesicles to the Golgi, then transported again to the cell membrane where it is released outside the cell. |
| Sequence similarities | Contains 1 helicase ATP-binding domain. Contains 1 helicase C-terminal domain. Contains 1 peptidase S7 domain. Contains 1 RdRp catalytic domain. |
Ontologies
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 100 | 100 | Protein C | PRO_0000264658 | |||||||
| Propeptide | 101 – 114 | 14 | ER anchor for the protein C, removed in mature form by serine protease NS3 By similarity | PRO_0000264659 | |||||||
| Chain | 115 – 280 | 166 | prM | PRO_0000264660 | |||||||
| Chain | 115 – 205 | 91 | Peptide pr | PRO_0000264661 | |||||||
| Chain | 206 – 280 | 75 | Small envelope protein M | PRO_0000264662 | |||||||
| Chain | 281 – 775 | 495 | Envelope protein E | PRO_0000264663 | |||||||
| Chain | 776 – 1127 | 352 | Non-structural protein 1 | PRO_0000264664 | |||||||
| Chain | 1128 – 1345 | 218 | Non-structural protein 2A | PRO_0000264666 | |||||||
| Chain | 1128 – 1315 | 188 | Non-structural protein 2A-alpha | PRO_0000264665 | |||||||
| Chain | 1346 – 1475 | 130 | Serine protease subunit NS2B | PRO_0000264667 | |||||||
| Chain | 1476 – 2094 | 619 | Serine protease subunit NS3 | PRO_0000264668 | |||||||
| Chain | 2095 – 2221 | 127 | Non-structural protein 4A | PRO_0000264669 | |||||||
| Peptide | 2222 – 2244 | 23 | Peptide 2k By similarity | PRO_0000264670 | |||||||
| Chain | 2245 – 2493 | 249 | Non-structural protein 4B | PRO_0000264671 | |||||||
| Chain | 2494 – 3392 | 899 | RNA-directed RNA polymerase NS5 | PRO_0000264672 | |||||||
Regions | |||||||||||
| Topological domain | 1 – 101 | 101 | Cytoplasmic Potential | ||||||||
| Transmembrane | 102 – 122 | 21 | Potential | ||||||||
| Topological domain | 123 – 238 | 116 | Extracellular Potential | ||||||||
| Transmembrane | 239 – 259 | 21 | Potential | ||||||||
| Topological domain | 260 – 265 | 6 | Cytoplasmic Potential | ||||||||
| Transmembrane | 266 – 286 | 21 | Potential | ||||||||
| Topological domain | 287 – 725 | 439 | Extracellular Potential | ||||||||
| Transmembrane | 726 – 746 | 21 | Potential | ||||||||
| Topological domain | 747 – 752 | 6 | Cytoplasmic Potential | ||||||||
| Transmembrane | 753 – 773 | 21 | Potential | ||||||||
| Topological domain | 774 – 1156 | 383 | Extracellular Potential | ||||||||
| Transmembrane | 1157 – 1177 | 21 | Potential | ||||||||
| Topological domain | 1178 – 1447 | 270 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1448 – 1468 | 21 | Potential | ||||||||
| Topological domain | 1469 – 2193 | 725 | Lumenal Potential | ||||||||
| Transmembrane | 2194 – 2214 | 21 | Potential | ||||||||
| Topological domain | 2215 – 2221 | 7 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2222 – 2241 | 20 | Potential | ||||||||
| Topological domain | 2242 – 2349 | 108 | Lumenal Potential | ||||||||
| Transmembrane | 2350 – 2370 | 21 | Potential | ||||||||
| Topological domain | 2371 – 2415 | 45 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2416 – 2436 | 21 | Potential | ||||||||
| Topological domain | 2437 – 2461 | 25 | Lumenal Potential | ||||||||
| Transmembrane | 2462 – 2482 | 21 | Potential | ||||||||
| Topological domain | 2483 – 3392 | 910 | Cytoplasmic Potential | ||||||||
| Domain | 1656 – 1812 | 157 | Helicase ATP-binding | ||||||||
| Domain | 1822 – 1989 | 168 | Helicase C-terminal | ||||||||
| Domain | 3020 – 3169 | 150 | RdRp catalytic | ||||||||
| Nucleotide binding | 1669 – 1676 | 8 | ATP Potential | ||||||||
| Motif | 1760 – 1763 | 4 | DEAH box By similarity | ||||||||
Sites | |||||||||||
| Active site | 1526 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 1550 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 1610 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Site | 100 – 101 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 114 – 115 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 205 – 206 | 2 | Cleavage; by host furin By similarity | ||||||||
| Site | 280 – 281 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 775 – 776 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 1127 – 1128 | 2 | Cleavage; by host By similarity | ||||||||
| Site | 1315 – 1316 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 1345 – 1346 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 1475 – 1476 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 2094 – 2095 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 2221 – 2222 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 2244 – 2245 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 2493 – 2494 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
Amino acid modifications | |||||||||||
| Glycosylation | 183 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 347 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 433 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 982 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 2303 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 2307 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 2459 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Disulfide bond | 283 ↔ 310 | By similarity | |||||||||
| Disulfide bond | 340 ↔ 401 | By similarity | |||||||||
| Disulfide bond | 354 ↔ 385 | By similarity | |||||||||
| Disulfide bond | 372 ↔ 396 | By similarity | |||||||||
| Disulfide bond | 465 ↔ 565 | By similarity | |||||||||
| Disulfide bond | 582 ↔ 613 | By similarity | |||||||||
Natural variations | |||||||||||
| Natural variant | 125 – 126 | 2 | HM → TL in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 142 | 1 | S → P in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 171 – 173 | 3 | TET → AEA in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 186 | 1 | E → D in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 210 | 1 | A → D in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 251 | 1 | A → G in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 441 | 1 | T → I in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 475 | 1 | E → R in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 482 | 1 | E → K in strain: Isolate 45AZ5. | ||||||||
| Natural variant | 573 | 1 | T → I in strain: Isolate 45AZ5. | ||||||||
| Natural variant | 677 | 1 | S → T in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 786 | 1 | R → K in strain: Isolate Philippines/836-1/1984. | ||||||||
| Natural variant | 1689 – 1692 | 4 | RRNV → KRKL in strain: Isolate 45AZ5. | ||||||||
| Natural variant | 2242 | 1 | A → V in strain: Isolate 45AZ5. | ||||||||
| Natural variant | 2357 | 1 | F → L in strain: Isolate 45AZ5. | ||||||||
| Natural variant | 2543 | 1 | P → T in strain: Isolate 45AZ5. | ||||||||
| Natural variant | 2779 | 1 | G → E in strain: Isolate 45AZ5. | ||||||||
| Natural variant | 3337 | 1 | R → Q in strain: Isolate 45AZ5. | ||||||||
Sequences
| ||||||||||||||||||
References
| [1] | "Molecular analysis of dengue virus attenuation after serial passage in primary dog kidney cells." Puri B., Nelson W.M., Henchal E.A., Hoke C.H., Eckels K.H., Dubois D.R., Porter K.R., Hayes C.G. J. Gen. Virol. 78:2287-2291(1997) [PubMed: 9292016] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA]. Strain: Isolate 45AZ5 and Isolate WestPac. |
| [2] | "Sequence of the dengue-1 virus genome in the region encoding the three structural proteins and the major nonstructural protein NS1." Mason P.W., McAda P.C., Mason T.L., Fournier M.J. Virology 161:262-267(1987) [PubMed: 3672932] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 1-1226. |
| [3] | "Genetic relatedness among structural protein genes of dengue 1 virus strains." Chu M.C., O'Rourke E.J., Trent D.W. J. Gen. Virol. 70:1701-1712(1989) [PubMed: 2738579] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 1-792. Strain: Isolate Philippines/836-1/1984. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| U88535 Genomic RNA. Translation: AAB70694.1. U88536 Genomic RNA. Translation: AAB70695.1. M23027 Genomic RNA. Translation: AAA42940.1. D00503 Genomic RNA. Translation: BAA00395.1. | |
| PIR | GNWVWP. A27032. |
3D structure databases | |
| HSSP | HSSP built from PDB template 1OKE based on UniProtKB Q88653. |
| SMR | P17763. Positions 21-100, 281-674, 1652-2094, 2500-2760. |
| ModBase | Search... |
Family and domain databases | |
| InterPro | IPR014001. DEAD-like_N. IPR011492. DEAD_Flavivir. IPR001650. DNA/RNA_helicase_C. IPR000069. Env_glycoprot_M_flavivir. IPR013755. Flav_glyE_cen_1. IPR013756. Flav_glyE_cen_2. IPR011999. Flav_glyE_cen_dm. IPR001122. Flavi_capsidC. IPR001157. Flavi_NS1. IPR000752. Flavi_NS2A. IPR000487. Flavi_NS2B. IPR000404. Flavi_NS4A. IPR001528. Flavi_NS4B. IPR002535. Flavi_propep. IPR000336. Flv_glyE_Ig-like. IPR014412. Gen_Poly_FLV. IPR014021. Helicase_SF1/SF2_ATP-bd. IPR001850. Peptidase_S7. IPR000208. RNA_pol_flaviviral. IPR007094. RNA_pol_PSvir. IPR002877. rRNA_MeTrfase_RrmJ/FtsJ. [Graphical view] |
| Gene3D | G3DSA:3.30.387.10. Flav_glyE_cen_1. 1 hit. G3DSA:3.30.67.10. Flav_glyE_cen_2. 1 hit. G3DSA:2.60.40.350. Flv_glyE_Ig-like. 1 hit. |
| Pfam | PF01003. Flavi_capsid. 1 hit. PF07652. Flavi_DEAD. 1 hit. PF02832. Flavi_glycop_C. 1 hit. PF00869. Flavi_glycoprot. 1 hit. PF01004. Flavi_M. 1 hit. PF00948. Flavi_NS1. 1 hit. PF01005. Flavi_NS2A. 1 hit. PF01002. Flavi_NS2B. 1 hit. PF01350. Flavi_NS4A. 1 hit. PF01349. Flavi_NS4B. 1 hit. PF00972. Flavi_NS5. 1 hit. PF01570. Flavi_propep. 1 hit. PF01728. FtsJ. 1 hit. PF00949. Peptidase_S7. 1 hit. [Graphical view] |
| PIRSF | PIRSF003817. Gen_Poly_FLV. 1 hit. |
| ProDom | PD001496. Flavi_NS1. 1 hit. [Graphical view] [Entries sharing at least one domain] |
| SMART | SM00487. DEXDc. 1 hit. SM00490. HELICc. 1 hit. [Graphical view] |
| PROSITE | PS51192. HELICASE_ATP_BIND_1. 1 hit. PS51194. HELICASE_CTER. 1 hit. PS50507. RDRP_SSRNA_POS. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Entry information
| Entry name | POLG_DEN1W | ||||||||
| Accession | Primary (citable) accession number: P17763 Secondary accession number(s): P27910, P89313, P89314 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation project | Virus (Virus annotation project) | ||||||||
Relevant documents
| Peptidase families Classification of peptidase families and list of entries |
| SIMILARITY comments Index of protein domains and families |

Clusters with


