Reviewed,
UniProtKB/Swiss-Prot Q074N0 (POLG_YEFVE)
Last modified
February 9, 2010.
Version 35.
History...
Clusters with 100%,
90%,
50% identity |
Documents (3) |
Third-party data |
Customize display | text xml rdf/xml gff fasta |
Names and origin
| Protein names | Recommended name: Genome polyprotein Cleaved into the following 14 chains: 1- Recommended name: Protein C Alternative name(s): Core protein Capsid protein 2- Recommended name: prM 3- Recommended name: Peptide pr 4- Recommended name: Small envelope protein M Alternative name(s): Matrix protein 5- Recommended name: Envelope protein E 6- Recommended name: Non-structural protein 1 Short name=NS1 7- Recommended name: Non-structural protein 2A Short name=NS2A 8- Recommended name: Non-structural protein 2A-alpha Short name=NS2A-alpha 9- Recommended name: Serine protease subunit NS2B Alternative name(s): Non-structural protein 2B Flavivirin protease NS2B regulatory subunit 10- Recommended name: Serine protease/NTPase/helicase NS3 EC=3.4.21.91 EC=3.6.1.15 EC=3.6.1.- Alternative name(s): Flavivirin protease NS3 catalytic subunit Non-structural protein 3 11- Recommended name: Non-structural protein 4A Short name=NS4A 12- Recommended name: Peptide 2k 13- Recommended name: Non-structural protein 4B Short name=NS4B 14- Recommended name: Methyltransferase/RNA-directed RNA polymerase NS5 EC=2.7.7.48 EC=2.1.1.56 EC=2.1.1.57 Alternative name(s): Non-structural protein 5 |
| Organism | Yellow fever virus (isolate Ethiopia/Couma/1961) (YFV) [Complete proteome] |
| Taxonomic identifier | 407141 [NCBI] |
| Taxonomic lineage | Viruses › ssRNA positive-strand viruses, no DNA stage › Flaviviridae › Flavivirus › Yellow fever virus group |
| Virus host | Aedes aegypti (Yellowfever mosquito) (Culex aegypti) [TaxID: 7159] Aedes simpsoni [TaxID: 7161] Homo sapiens (Human) [TaxID: 9606] Aedes luteocephalus (Mosquito) [TaxID: 299629] Simiiformes [TaxID: 314293] |
Protein attributes
| Sequence length | 3412 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level. |
General annotation (Comments)
| Function | Protein C packages viral RNA to form a viral nucleocapsid, and promotes virion budding By similarity. prM acts as a chaperone for envelope protein E during intracellular virion assembly by masking and inactivating envelope protein E fusion peptide. prM is matured in the last step of virion assembly, presumably to avoid catastrophic activation of the viral fusion peptide induced by the acidic pH of the trans-Golgi network. After cleavage by host furin, the pr peptide is released in the extracellular medium and small envelope protein M and envelope protein E heterodimers are dissociated By similarity. Envelope protein E binds cell surface receptor and is involved in membrane fusion between virion and target cell. Synthesized as an homodimer with prM which acts as a chaperone for envelope protein E. After cleavage of prM, envelope protein E dissociate from small envelope protein M and homodimerizes By similarity. Non-structural protein 1 is slowly secreted from mammalian cells, but not from mosquito cells. The secreted form elicits protective immune response and plays an essential role in RNA replication By similarity. Non-structural protein 2B is a required cofactor for the serine protease function of NS3 By similarity. Serine protease NS3 displays three enzymatic activities: serine protease, NTPase and RNA helicase. NS3 serine protease, in association with NS2B, cleaves the polyprotein at dibasic sites in the cytoplasm: C-prM, NS2A-NS2B, NS2B-NS3, NS3-NS4A, NS4A-2K and NS4B-NS5. NS3 RNA helicase binds RNA and unwinds dsRNA in the 3' to 5' direction By similarity. Non-structural protein 4A plays a role in RNA replication By similarity. Non-structural protein 4B plays a role in RNA replication By similarity. RNA-directed RNA polymerase NS5 replicates the viral (+) and (-) genome, and performs the capping of genomes in the cytoplasm. NS5 methylates viral RNA cap at guanine N-7 and ribose 2'-O positions By similarity. |
| Catalytic activity | Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds in which each of the Xaa can be either Arg or Lys and Yaa can be either Ser or Ala. Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1). NTP + H2O = NDP + phosphate. S-adenosyl-L-methionine + G(5')pppR-RNA = S-adenosyl-L-homocysteine + m7G(5')pppR-RNA. S-adenosyl-L-methionine + m7G(5')pppR-RNA = S-adenosyl-L-homocysteine + m7G(5')pppRm-RNA. |
| Subunit structure | prM and envelope protein E form heterodimers in the endoplasmic reticulum and Golgi. Envelope protein E forms homodimers. NS1 forms homodimers as well as homohexamers when secreted. NS1 may interact with NS4A. NS3 and NS2B form an heterodimer. NS3 interacts with unphosphorylated NS5 By similarity. |
| Subcellular location | Protein C: Virion By similarity. Peptide pr: Secreted By similarity. Small envelope protein M: Virion membrane; Single-pass type I membrane protein By similarity. Envelope protein E: Virion membrane; Single-pass type I membrane protein By similarity. Non-structural protein 1: Secreted. Host endoplasmic reticulum membrane; Peripheral membrane protein; Lumenal side By similarity. Non-structural protein 2A-alpha: Host endoplasmic reticulum membrane By similarity. Non-structural protein 2A: Host endoplasmic reticulum membrane By similarity. Serine protease subunit NS2B: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Serine protease/NTPase/helicase NS3: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Non-structural protein 4A: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Non-structural protein 4B: Host endoplasmic reticulum membrane; Multi-pass membrane protein By similarity. Note: The C-terminal transmembrane domain of non-structural protein 4B is presumably reoriented after cleavage on the lumenal side By similarity. Methyltransferase/RNA-directed RNA polymerase NS5: Host endoplasmic reticulum membrane; Peripheral membrane protein; Cytoplasmic side. Host nucleus By similarity. |
| Domain | Transmembrane domains of the small envelope protein M and envelope protein E contains an endoplasmic reticulum retention signals By similarity. |
| Post-translational modification | Specific enzymatic cleavages in vivo yield mature proteins. The nascent protein C contains a C-terminal hydrophobic domain that act as a signal sequence for translocation of prM into the lumen of the ER. Mature protein C is cleaved at a site upstream of this hydrophobic domain by NS3. prM is cleaved in post-Golgi vesicles by a host furin, releasing the mature small envelope protein M, and peptide pr. Non-structural protein 2A-alpha, a C-terminally truncated form of non-structural protein 2A, results from partial cleavage by NS3 By similarity. RNA-directed RNA polymerase NS5 is phosphorylated on serines residues. This phosphorylation may trigger NS5 nuclear localization By similarity. Envelope protein E and non-structural protein 1 are N-glycosylated By similarity. |
| Miscellaneous | The virion is assembled in the endoplasmic reticulum lumen, transported by vesicles to the Golgi, then transported again to the cell membrane where it is released outside the cell. |
| Sequence similarities | Contains 1 helicase ATP-binding domain. Contains 1 helicase C-terminal domain. Contains 1 peptidase S7 domain. Contains 1 RdRp catalytic domain. |
Ontologies
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Initiator methionine | 1 | 1 | Removed; by host Potential | ||||||||
| Chain | 2 – 101 | 100 | Protein C | PRO_0000261560 | |||||||
| Propeptide | 102 – 121 | 20 | ER anchor for the protein C, removed in mature form by serine protease NS3 | PRO_0000261561 | |||||||
| Chain | 122 – 285 | 164 | prM | PRO_0000261562 | |||||||
| Chain | 122 – 210 | 89 | Peptide pr | PRO_0000261563 | |||||||
| Chain | 211 – 285 | 75 | Small envelope protein M | PRO_0000261564 | |||||||
| Chain | 286 – 778 | 493 | Envelope protein E | PRO_0000261565 | |||||||
| Chain | 779 – 1130 | 352 | Non-structural protein 1 | PRO_0000261566 | |||||||
| Chain | 1131 – 1354 | 224 | Non-structural protein 2A | PRO_0000261567 | |||||||
| Chain | 1131 – 1320 | 190 | Non-structural protein 2A-alpha | PRO_0000261568 | |||||||
| Chain | 1355 – 1484 | 130 | Serine protease subunit NS2B | PRO_0000261569 | |||||||
| Chain | 1485 – 2107 | 623 | Serine protease/NTPase/helicase NS3 | PRO_0000261570 | |||||||
| Chain | 2108 – 2233 | 126 | Non-structural protein 4A | PRO_0000261571 | |||||||
| Peptide | 2234 – 2256 | 23 | Peptide 2k | PRO_0000261572 | |||||||
| Chain | 2257 – 2506 | 250 | Non-structural protein 4B | PRO_0000261573 | |||||||
| Chain | 2507 – 3412 | 906 | Methyltransferase/RNA-directed RNA polymerase NS5 | PRO_0000261574 | |||||||
Regions | |||||||||||
| Topological domain | 2 – 101 | 100 | Cytoplasmic Potential | ||||||||
| Transmembrane | 102 – 121 | 20 | Potential | ||||||||
| Topological domain | 122 – 244 | 123 | Extracellular Potential | ||||||||
| Transmembrane | 245 – 265 | 21 | Potential | ||||||||
| Topological domain | 266 – 269 | 4 | Cytoplasmic Potential | ||||||||
| Transmembrane | 270 – 287 | 18 | Potential | ||||||||
| Topological domain | 288 – 730 | 443 | Extracellular Potential | ||||||||
| Transmembrane | 731 – 751 | 21 | Potential | ||||||||
| Topological domain | 752 – 757 | 6 | Cytoplasmic Potential | ||||||||
| Transmembrane | 758 – 778 | 21 | Potential | ||||||||
| Topological domain | 779 – 1130 | 352 | Extracellular Potential | ||||||||
| Transmembrane | 1131 – 1151 | 21 | Potential | ||||||||
| Topological domain | 1152 – 1160 | 9 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1161 – 1181 | 21 | Potential | ||||||||
| Topological domain | 1182 – 1201 | 20 | Lumenal Potential | ||||||||
| Transmembrane | 1202 – 1222 | 21 | Potential | ||||||||
| Topological domain | 1223 – 1231 | 9 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1232 – 1252 | 21 | Potential | ||||||||
| Topological domain | 1253 – 1262 | 10 | Lumenal Potential | ||||||||
| Transmembrane | 1263 – 1285 | 23 | Potential | ||||||||
| Topological domain | 1286 – 1287 | 2 | Cytoplasmic Potential | ||||||||
| Transmembrane | 1288 – 1308 | 21 | Potential | ||||||||
| Topological domain | 1309 – 1321 | 13 | Lumenal Potential | ||||||||
| Transmembrane | 1322 – 1342 | 21 | Potential | ||||||||
| Topological domain | 1343 – 2186 | 844 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2187 – 2207 | 21 | Potential | ||||||||
| Topological domain | 2208 – 2209 | 2 | Lumenal Potential | ||||||||
| Transmembrane | 2210 – 2230 | 21 | Potential | ||||||||
| Topological domain | 2231 – 2233 | 3 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2234 – 2256 | 23 | Potential | ||||||||
| Topological domain | 2257 – 2360 | 104 | Lumenal Potential | ||||||||
| Transmembrane | 2361 – 2381 | 21 | Potential | ||||||||
| Topological domain | 2382 – 2422 | 41 | Cytoplasmic Potential | ||||||||
| Transmembrane | 2423 – 2443 | 21 | Potential | ||||||||
| Topological domain | 2444 – 2446 | 3 | Lumenal Potential | ||||||||
| Transmembrane | 2447 – 2467 | 21 | Potential | ||||||||
| Topological domain | 2468 – 3412 | 945 | Cytoplasmic Potential | ||||||||
| Domain | 1492 – 1666 | 175 | Peptidase S7 | ||||||||
| Domain | 1669 – 1825 | 157 | Helicase ATP-binding | ||||||||
| Domain | 1820 – 1997 | 178 | Helicase C-terminal | ||||||||
| Domain | 3036 – 3188 | 153 | RdRp catalytic | ||||||||
| Nucleotide binding | 1682 – 1689 | 8 | ATP Potential | ||||||||
| Region | 383 – 396 | 14 | Involved in fusion By similarity | ||||||||
| Motif | 1773 – 1776 | 4 | DEAH box By similarity | ||||||||
| Motif | 2879 – 2912 | 34 | Nuclear localization signal By similarity | ||||||||
Sites | |||||||||||
| Active site | 1537 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 1561 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 1622 | 1 | Charge relay system; for serine protease NS3 activity By similarity | ||||||||
| Active site | 2568 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Active site | 2653 | 1 | For 2'-O-methyltransferase and N-7 methyltransferase activity By similarity | ||||||||
| Active site | 2689 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Active site | 2725 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Site | 101 – 102 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 121 – 122 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 210 – 211 | 2 | Cleavage; by host furin By similarity | ||||||||
| Site | 285 – 286 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 778 – 779 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 1130 – 1131 | 2 | Cleavage; by host By similarity | ||||||||
| Site | 1320 – 1321 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 1354 – 1355 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 1484 – 1485 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 2107 – 2108 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 2233 – 2234 | 2 | Cleavage; by host signal peptidase By similarity | ||||||||
| Site | 2256 – 2257 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
| Site | 2507 – 2508 | 2 | Cleavage; by serine protease NS3 By similarity | ||||||||
Amino acid modifications | |||||||||||
| Glycosylation | 134 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 150 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 908 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 986 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Disulfide bond | 288 ↔ 315 | By similarity | |||||||||
| Disulfide bond | 345 ↔ 401 | By similarity | |||||||||
| Disulfide bond | 359 ↔ 390 | By similarity | |||||||||
| Disulfide bond | 377 ↔ 406 | By similarity | |||||||||
| Disulfide bond | 467 ↔ 568 | By similarity | |||||||||
| Disulfide bond | 585 ↔ 615 | By similarity | |||||||||
Sequences
| ||||||||||||||||||
References
| [1] | "Genome analysis and phylogenetic relationships between east, central and west African isolates of Yellow fever virus." von Lindern J.J., Aroner S., Barrett N.D., Wicker J.A., Davis C.T., Barrett A.D. J. Gen. Virol. 87:895-907(2006) [PubMed: 16528039] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA]. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EMBL GenBank DDBJ | DQ235229 Genomic RNA. Translation: ABB69689.1. | ||||||||||||
3D structure databases | |||||||||||||
| PDBe RCSB PDB PDBj |
| ||||||||||||
| ModBase | Search... | ||||||||||||
Family and domain databases | |||||||||||||
| InterPro | IPR014001. DEAD-like_N. IPR001650. DNA/RNA_helicase_C. IPR000069. Env_glycoprot_M_flavivir. IPR013755. Flav_glyE_cen_dom_subdom1. IPR013756. Flav_glyE_cen_dom_subdom2. IPR001122. Flavi_capsidC. IPR001157. Flavi_NS1. IPR000752. Flavi_NS2A. IPR000487. Flavi_NS2B. IPR000404. Flavi_NS4A. IPR001528. Flavi_NS4B. IPR002535. Flavi_propep. IPR000336. Flv_glyE_Ig-like. IPR014412. Gen_Poly_FLV. IPR011999. GlycoprotE_cen/dimer_Flavivir. IPR011998. GlycoprotE_cen/dimer_vir. IPR014021. Helicase_SF1/SF2_ATP-bd. IPR014756. Ig_E-set. IPR001850. Peptidase_S7. IPR000208. RNA-dir_pol_flavivirus. IPR007094. RNA-dir_pol_PSvirus. IPR002877. rRNA_MeTrfase_RrmJ/FtsJ. IPR009003. Ser/Cys_Pept_Trypsin-like. [Graphical view] | ||||||||||||
| Gene3D | G3DSA:3.30.387.10. Flav_glyE_cen_1. 1 hit. G3DSA:3.30.67.10. Flav_glyE_cen_2. 1 hit. G3DSA:2.60.40.350. Flv_glyE_Ig-like. 1 hit. | ||||||||||||
| Pfam | PF01003. Flavi_capsid. 1 hit. PF02832. Flavi_glycop_C. 1 hit. PF00869. Flavi_glycoprot. 1 hit. PF01004. Flavi_M. 1 hit. PF00948. Flavi_NS1. 1 hit. PF01005. Flavi_NS2A. 1 hit. PF01002. Flavi_NS2B. 1 hit. PF01350. Flavi_NS4A. 1 hit. PF01349. Flavi_NS4B. 1 hit. PF00972. Flavi_NS5. 1 hit. PF01570. Flavi_propep. 1 hit. PF01728. FtsJ. 1 hit. PF00271. Helicase_C. 1 hit. PF00949. Peptidase_S7. 1 hit. [Graphical view] | ||||||||||||
| PIRSF | PIRSF003817. Gen_Poly_FLV. 1 hit. | ||||||||||||
| SMART | SM00487. DEXDc. 1 hit. SM00490. HELICc. 1 hit. [Graphical view] | ||||||||||||
| PROSITE | PS51192. HELICASE_ATP_BIND_1. 1 hit. PS51194. HELICASE_CTER. False negative. PS50507. RDRP_SSRNA_POS. 1 hit. [Graphical view] | ||||||||||||
| ProtoNet | Search... | ||||||||||||
Entry information
| Entry name | POLG_YEFVE | ||||||||
| Accession | Primary (citable) accession number: Q074N0 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation project | Virus (Virus annotation project) | ||||||||
Relevant documents
| PDB cross-references Index of Protein Data Bank (PDB) cross-references |
| Peptidase families Classification of peptidase families and list of entries |
| SIMILARITY comments Index of protein domains and families |

Clusters with


