Reviewed,
UniProtKB/Swiss-Prot Q01299 (POLG_TBEVH)
Last modified
February 9, 2010.
Version 79.
History...
Clusters with 100%,
90%,
50% identity |
Documents (2) |
Third-party data |
Customize display | text xml rdf/xml gff fasta |
Names and origin
| Protein names | Recommended name: Genome polyprotein Cleaved into the following 10 chains: 1- Recommended name: Protein C Alternative name(s): Core protein Capsid protein 2- Recommended name: Small envelope protein M Alternative name(s): Matrix protein 3- Recommended name: Envelope protein E 4- Recommended name: Non-structural protein 1 Short name=NS1 5- Recommended name: Non-structural protein 2A Short name=NS2A 6- Recommended name: Serine protease subunit NS2B Alternative name(s): Non-structural protein 2B Flavivirin protease NS2B regulatory subunit 7- Recommended name: Serine protease/NTPase/helicase NS3 EC=3.4.21.91 EC=3.6.1.15 EC=3.6.1.- Alternative name(s): Flavivirin protease NS3 catalytic subunit Non-structural protein 3 8- Recommended name: Non-structural protein 4A Short name=NS4A 9- Recommended name: Non-structural protein 4B Short name=NS4B 10- Recommended name: Methyltransferase/RNA-directed RNA polymerase NS5 EC=2.7.7.48 EC=2.1.1.56 EC=2.1.1.57 Alternative name(s): Non-structural protein 5 |
| Organism | Tick-borne encephalitis virus (strain Hypr) (TBEV) [Complete proteome] |
| Taxonomic identifier | 70733 [NCBI] |
| Taxonomic lineage | Viruses › ssRNA positive-strand viruses, no DNA stage › Flaviviridae › Flavivirus › tick-borne encephalitis virus group |
| Virus host | Ixodes ricinus (Sheep tick) [TaxID: 34613] Ixodes persulcatus (Taiga tick) [TaxID: 34615] |
Protein attributes
| Sequence length | 3414 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Inferred from homology. |
General annotation (Comments)
| Function | The small proteins NS2A, NS4A and NS4B are hydrophobic, suggesting a possible membrane-related function By similarity. Non-structural protein 2B is a required cofactor for the serine protease function of NS3 By similarity. Serine protease NS3 displays three enzymatic activities: serine protease, NTPase and RNA helicase. NS3 serine protease, in association with NS2B, cleaves the polyprotein By similarity. RNA-directed RNA polymerase NS5 replicates the viral (+) and (-) genome, and performs the capping of genomes in the cytoplasm. NS5 methylates viral RNA cap at guanine N-7 and ribose 2'-O positions By similarity. |
| Catalytic activity | Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds in which each of the Xaa can be either Arg or Lys and Yaa can be either Ser or Ala. Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1). NTP + H2O = NDP + phosphate. S-adenosyl-L-methionine + G(5')pppR-RNA = S-adenosyl-L-homocysteine + m7G(5')pppR-RNA. S-adenosyl-L-methionine + m7G(5')pppR-RNA = S-adenosyl-L-homocysteine + m7G(5')pppRm-RNA. |
| Subunit structure | NS3 and NS2B form a heterodimer. NS3 is the catalytic subunit, whereas NS2B strongly stimulates the latter By similarity. |
| Subcellular location | Protein C: Virion Potential. Host membrane; Single-pass membrane protein Potential. Small envelope protein M: Virion Potential. Host membrane; Single-pass membrane protein Potential. Envelope protein E: Virion Potential. Host membrane; Multi-pass membrane protein Potential. Non-structural protein 4A: Host membrane; Single-pass membrane protein Potential. Non-structural protein 4B: Host membrane; Multi-pass membrane protein Potential. |
| Post-translational modification | Specific enzymatic cleavages in vivo yield mature proteins By similarity. |
| Miscellaneous | The virion of this virus is a nucleocapsid covered by a lipoprotein envelope. The envelope contains two proteins: the protein M and glycoprotein E. The nucleocapsid is a complex of protein C and mRNA. In immature particles, there are 60 icosaedrally organized trimeric spikes on the surface. Each spike consists of three heterodimers of envelope protein M precursor (prM) and envelope protein E By similarity. The nonstructural protein NS1 presents two alternative cleavage sites for its C-terminus (either at position 1128 or at 1190), which may define a soluble or a membrane-bound form of NS1. |
| Sequence similarities | Contains 1 helicase ATP-binding domain. Contains 1 helicase C-terminal domain. Contains 1 peptidase S7 domain. Contains 1 RdRp catalytic domain. |
Ontologies
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Initiator methionine | 1 | 1 | Removed; by host Potential | ||||||||
| Chain | 2 – 112 | 111 | Protein C | PRO_0000037793 | |||||||
| Propeptide | 113 – 205 | 93 | PRO_0000037794 | ||||||||
| Chain | 206 – 280 | 75 | Small envelope protein M | PRO_0000037795 | |||||||
| Chain | 281 – 776 | 496 | Envelope protein E | PRO_0000037796 | |||||||
| Chain | 777 – ?1128 | 352 | Non-structural protein 1 | PRO_0000037797 | |||||||
| Chain | ?1129 – 1358 | 230 | Non-structural protein 2A | PRO_0000037798 | |||||||
| Chain | 1359 – 1489 | 131 | Serine protease subunit NS2B | PRO_0000037799 | |||||||
| Chain | 1490 – 2110 | 621 | Serine protease/NTPase/helicase NS3 | PRO_0000037800 | |||||||
| Chain | 2111 – 2259 | 149 | Non-structural protein 4A | PRO_0000037801 | |||||||
| Chain | 2260 – 2511 | 252 | Non-structural protein 4B | PRO_0000037802 | |||||||
| Chain | 2512 – 3414 | 903 | Methyltransferase/RNA-directed RNA polymerase NS5 | PRO_0000037803 | |||||||
Regions | |||||||||||
| Transmembrane | 33 – 53 | 21 | Potential | ||||||||
| Transmembrane | 99 – 119 | 21 | Potential | ||||||||
| Transmembrane | 254 – 274 | 21 | Potential | ||||||||
| Transmembrane | 728 – 748 | 21 | Potential | ||||||||
| Transmembrane | 760 – 780 | 21 | Potential | ||||||||
| Transmembrane | 1133 – 1153 | 21 | Potential | ||||||||
| Transmembrane | 1159 – 1179 | 21 | Potential | ||||||||
| Transmembrane | 1203 – 1225 | 23 | Potential | ||||||||
| Transmembrane | 1233 – 1253 | 21 | Potential | ||||||||
| Transmembrane | 1294 – 1314 | 21 | Potential | ||||||||
| Transmembrane | 1383 – 1403 | 21 | Potential | ||||||||
| Transmembrane | 1455 – 1475 | 21 | Potential | ||||||||
| Transmembrane | 2161 – 2181 | 21 | Potential | ||||||||
| Transmembrane | 2190 – 2210 | 21 | Potential | ||||||||
| Transmembrane | 2212 – 2232 | 21 | Potential | ||||||||
| Transmembrane | 2245 – 2265 | 21 | Potential | ||||||||
| Transmembrane | 2344 – 2364 | 21 | Potential | ||||||||
| Transmembrane | 2369 – 2389 | 21 | Potential | ||||||||
| Transmembrane | 2433 – 2453 | 21 | Potential | ||||||||
| Domain | 1496 – 1672 | 177 | Peptidase S7 | ||||||||
| Domain | 1675 – 1831 | 157 | Helicase ATP-binding | ||||||||
| Domain | 1841 – 2000 | 160 | Helicase C-terminal | ||||||||
| Domain | 3040 – 3189 | 150 | RdRp catalytic | ||||||||
| Nucleotide binding | 1688 – 1695 | 8 | ATP Potential | ||||||||
| Motif | 1779 – 1782 | 4 | DEAH box | ||||||||
Sites | |||||||||||
| Active site | 1543 | 1 | Charge relay system By similarity | ||||||||
| Active site | 1567 | 1 | Charge relay system By similarity | ||||||||
| Active site | 1627 | 1 | Charge relay system By similarity | ||||||||
| Active site | 2572 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Active site | 2657 | 1 | For 2'-O-methyltransferase and N-7 methyltransferase activity By similarity | ||||||||
| Active site | 2694 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
| Active site | 2730 | 1 | For 2'-O-methyltransferase activity By similarity | ||||||||
Amino acid modifications | |||||||||||
| Glycosylation | 144 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 434 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 861 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 983 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 999 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 1649 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 1988 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 2044 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 2529 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Glycosylation | 2726 | 1 | N-linked (GlcNAc...); by host Potential | ||||||||
| Disulfide bond | 283 ↔ 310 | By similarity | |||||||||
| Disulfide bond | 340 ↔ 396 | By similarity | |||||||||
| Disulfide bond | 354 ↔ 385 | By similarity | |||||||||
| Disulfide bond | 372 ↔ 401 | By similarity | |||||||||
| Disulfide bond | 466 ↔ 570 | By similarity | |||||||||
| Disulfide bond | 587 ↔ 618 | By similarity | |||||||||
Sequences
| ||||||||||||||||||
References
| [1] | Wallner G., Mandl C.W., Ecker M., Holzmann H., Stiasny K., Kunz C., Heinz F.X. Submitted (OCT-1995) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA]. |
| [2] | "Presence of poly(A) in a flavivirus: significant differences between the 3' noncoding regions of the genomic RNAs of tick-borne encephalitis virus strains." Mandl C.W., Kunz C., Heinz F.X. J. Virol. 65:4070-4077(1991) [PubMed: 1712858] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 3357-3414. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | U39292 Genomic RNA. Translation: AAB53095.1. M76660 Genomic RNA. Translation: AAA47904.1. |
3D structure databases | |
| ModBase | Search... |
Family and domain databases | |
| InterPro | IPR014001. DEAD-like_N. IPR001650. DNA/RNA_helicase_C. IPR000069. Env_glycoprot_M_flavivir. IPR013754. Flav_glyE_dim. IPR001122. Flavi_capsidC. IPR001157. Flavi_NS1. IPR000752. Flavi_NS2A. IPR000487. Flavi_NS2B. IPR000404. Flavi_NS4A. IPR001528. Flavi_NS4B. IPR002535. Flavi_propep. IPR000336. Flv_glyE_Ig-like. IPR014412. Gen_Poly_FLV. IPR011999. GlycoprotE_cen/dimer_Flavivir. IPR011998. GlycoprotE_cen/dimer_vir. IPR014021. Helicase_SF1/SF2_ATP-bd. IPR014756. Ig_E-set. IPR001850. Peptidase_S7. IPR000208. RNA-dir_pol_flavivirus. IPR007094. RNA-dir_pol_PSvirus. IPR002877. rRNA_MeTrfase_RrmJ/FtsJ. IPR009003. Ser/Cys_Pept_Trypsin-like. [Graphical view] |
| Gene3D | G3DSA:2.60.98.10. Flav_glyE_dim. 1 hit. G3DSA:2.60.40.350. Flv_glyE_Ig-like. 1 hit. |
| Pfam | PF01003. Flavi_capsid. 1 hit. PF02832. Flavi_glycop_C. 1 hit. PF00869. Flavi_glycoprot. 1 hit. PF01004. Flavi_M. 1 hit. PF00948. Flavi_NS1. 1 hit. PF01005. Flavi_NS2A. 1 hit. PF01002. Flavi_NS2B. 1 hit. PF01350. Flavi_NS4A. 1 hit. PF01349. Flavi_NS4B. 1 hit. PF00972. Flavi_NS5. 1 hit. PF01570. Flavi_propep. 1 hit. PF01728. FtsJ. 1 hit. PF00271. Helicase_C. 1 hit. PF00949. Peptidase_S7. 1 hit. [Graphical view] |
| PIRSF | PIRSF003817. Gen_Poly_FLV. 1 hit. |
| SMART | SM00487. DEXDc. 1 hit. SM00490. HELICc. 1 hit. [Graphical view] |
| PROSITE | PS00690. DEAH_ATP_HELICASE. False negative. PS51192. HELICASE_ATP_BIND_1. 1 hit. PS51194. HELICASE_CTER. 1 hit. PS50507. RDRP_SSRNA_POS. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Entry information
| Entry name | POLG_TBEVH | ||||||||
| Accession | Primary (citable) accession number: Q01299 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation project | Virus (Virus annotation project) | ||||||||
Relevant documents
| Peptidase families Classification of peptidase families and list of entries |
| SIMILARITY comments Index of protein domains and families |

Clusters with


