Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P09732 (POLG_STEVM)

Last modified June 16, 2009. Version 76. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Genome polyprotein
Cleaved into the following 7 chains:
    1- Recommended name:
            Protein C
        Alternative name(s):
            Core protein
            Capsid protein
    2- Recommended name:
            Small envelope protein M
        Alternative name(s):
            Matrix protein
    3- Recommended name:
            Envelope protein E
    4- Recommended name:
            Non-structural protein 1
                Short name=NS1
    5- Recommended name:
            Non-structural protein 2A
                Short name=NS2A
    6- Recommended name:
            Flavivirin protease NS2B regulatory subunit
    7- Recommended name:
            Flavivirin protease NS3 catalytic subunit
              EC=3.4.21.91
OrganismSt. louis encephalitis virus (strain MS1-7)
Taxonomic identifier11081 [NCBI]
Taxonomic lineageVirusesssRNA positive-strand viruses, no DNA stageFlaviviridaeFlavivirusJapanese encephalitis virus group
Virus hostCulex pipiens (House mosquito) [TaxID: 7175]
Culex quinquefasciatus (Southern house mosquito) (Culex pungens) [TaxID: 7176]
Culex tarsalis (Encephalitis mosquito) [TaxID: 7177]
Columba livia (Domestic pigeon) [TaxID: 8932]
Turdus migratorius (American robin) [TaxID: 9188]
Agelaius tricolor (Tricolored blackbird) [TaxID: 9191]
Homo sapiens (Human) [TaxID: 9606]
Cyanocitta cristata (Blue jay) [TaxID: 28727]
Carpodacus mexicanus (House finch) [TaxID: 30427]
Culex nigripalpus [TaxID: 42429]
Zenaida macroura (Mourning dove) [TaxID: 47245]
Passer domesticus (House sparrow) [TaxID: 48849]
Mimus polyglottos (Northern mockingbird) [TaxID: 60713]
Euphagus cyanocephalus (Brewer's blackbird) [TaxID: 84817]
Cardinalis cardinalis (Northern cardinal) [TaxID: 98964]

Protein attributes

Sequence length1525 AA.
Sequence statusFragment.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceInferred from homology.

General annotation (Comments)

Function

The small proteins NS2A, NS4A and NS4B are hydrophobic, suggesting a possible membrane-related function. NS5 may play a role in the viral RNA replication. The NS2B/NS3 protease complex processes the viral polyprotein.

Catalytic activity

Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds in which each of the Xaa can be either Arg or Lys and Yaa can be either Ser or Ala.

Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1).

Subunit structure

NS3 and NS2B form a heterodimer. NS3 is the catalytic subunit, whereas NS2B strongly stimulates the latter By similarity.

Subcellular location

Protein C: Virion Potential. Host membrane; Single-pass membrane protein Potential.

Small envelope protein M: Virion Potential. Host membrane; Single-pass membrane protein Potential.

Envelope protein E: Virion Potential. Host membrane; Multi-pass membrane protein Potential.

Post-translational modification

Specific enzymatic cleavages in vivo yield mature proteins By similarity.

Miscellaneous

The virion of this virus is a nucleocapsid covered by a lipoprotein envelope. The envelope contains two proteins: the protein M and glycoprotein E. The nucleocapsid is a complex of protein C and mRNA. In immature particles, there are 60 icosaedrally organized trimeric spikes on the surface. Each spike consists of three heterodimers of envelope protein M precursor (prM) and envelope protein E By similarity.

Sequence similarities

Contains 1 peptidase S7 domain.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Initiator methionine11Removed; by host
Chain2 – 121120Protein C
PRO_0000037735
Propeptide122 – 21392
PRO_0000037736
Chain214 – 28875Small envelope protein M
PRO_0000037737
Chain289 – 789501Envelope protein E
PRO_0000037738
Chain790 – 1203414Non-structural protein 1
PRO_0000037739
Chain1204 – 1368165Non-structural protein 2A
PRO_0000037740
Chain1369 – 1499131Flavivirin protease NS2B regulatory subunit
PRO_0000037741
Chain1500 – ›1525›26Flavivirin protease NS3 catalytic subunit
PRO_0000037742

Regions

Transmembrane108 – 11912 Potential
Transmembrane253 – 26816 Potential
Transmembrane274 – 28815 Potential
Transmembrane751 – 76212 Potential
Transmembrane768 – 78720 Potential
Transmembrane1173 – 118816 Potential

Amino acid modifications

Glycosylation1361N-linked (GlcNAc...); by host Potential
Glycosylation2691N-linked (GlcNAc...); by host Potential
Glycosylation4421N-linked (GlcNAc...); by host Potential
Glycosylation9191N-linked (GlcNAc...); by host Potential
Glycosylation9641N-linked (GlcNAc...); by host Potential
Glycosylation9961N-linked (GlcNAc...); by host Potential
Glycosylation11891N-linked (GlcNAc...); by host Potential
Disulfide bond291 ↔ 318 By similarity
Disulfide bond348 ↔ 404 By similarity
Disulfide bond362 ↔ 393 By similarity
Disulfide bond380 ↔ 409 By similarity
Disulfide bond478 ↔ 576 By similarity
Disulfide bond593 ↔ 624 By similarity

Experimental info

Non-terminal residue15251

Sequences

Sequence LengthMass (Da)Tools
P09732-1 [UniParc].

Last modified July 1, 1989. Version 1.
Checksum: E1A373F1E511159F

FASTA1,525167,892
        10         20         30         40         50         60 
MSKKPGKPGR NRVVNMLKRG VSRVNPLTGL KRILGSLLDG RGPVRFMLAI LTFFRFTALQ 

        70         80         90        100        110        120 
PTEALKRRWR AVDKRTALKH LNGFKRDLGS MLDTINRRPS KKRGGTRSLL GLAALIGLAS 

       130        140        150        160        170        180 
SLQLSTYQGK VLMSINKTDA QSAINIPSAN GANTCIVRAL DVGIMCKDDI TYLCPVLSAG 

       190        200        210        220        230        240 
NDPEDIDCWC DVEEVWVHYG RCTRMGHSRR SRRSISVQHH GDSTLATKNT PWLDTVKTTK 

       250        260        270        280        290        300 
YLTKVENWFC RNPGYALVAL AIGWMLGSNN TQRVVFVIML MLIAPAYSFN CLGTSNRDFV 

       310        320        330        340        350        360 
EGASGATWID LVLEGGSCVT VMAPEKPTLD FKVMKMEATE LATVRKYCYE ATLDTLSTVA 

       370        380        390        400        410        420 
RCPTTGEAHN TKRSDPTFVC KRDVVDRGWG NGCGLFGKGS IDTCAKFTCK NKATGKTILR 

       430        440        450        460        470        480 
ENIKYEVAIF VHGSTDSTSH GNYSEQIGKN QAARFTISPQ APSFTANMGE YGTVTIDCEA 

       490        500        510        520        530        540 
RSGINTEDYY VFTVKEKSWL VNRDWFHDLN LPWTSPATTD WRNRETLVEF EEPHATKQTV 

       550        560        570        580        590        600 
VALGSQEGAL HTALAGAIPA TVSSSTLTLQ SGHLKCRAKL DKVKIKGTTY GMCDSAFTFS 

       610        620        630        640        650        660 
KNPTDTGHGT VIVELQYTGS NGPCRVPISV TANLMDLTPV GRLVTVNPFI STGGANNKVM 

       670        680        690        700        710        720 
IEVEPPFGDS YIVVGRGTTQ INYHWHKEGS SIGKALATTW KGAQRLAVLG DTAWDFGSIG 

       730        740        750        760        770        780 
GVFNSIGKAV HQVFGGAFRT LFGGMSWITQ GLLGALLLWM GLQARDRSIS LTLLAVGGIL 

       790        800        810        820        830        840 
IFLATSVQAD SGCAISLQRR ELKCGGGIFV YNDVEKWKSD YKYFPLTPTG LAHVIQEAHA 

       850        860        870        880        890        900 
NGYCGIRSTS RLEHLMWENI QRELNAIFED NEIDLSVVVQ EDPKYYKRAP RRLKKLEDEL 

       910        920        930        940        950        960 
DYGWKKWGKT LFVEPRLGNN TFVVDGPETK ECPTANRAWN SFKVEDFGFG MVFTQLWLTI 

       970        980        990       1000       1010       1020 
REENTTECDS AIIGTAIKGD RAVHSDLSYW IESKKNETWQ LERAVMGEVK SCTWPETHTL 

      1030       1040       1050       1060       1070       1080 
WGDGVVESEM IIPVTLGGPK SHHNKRNGYH TQTKGPWSEG EITLDFDYCP GTTVTVTEHC 

      1090       1100       1110       1120       1130       1140 
GNRGASLRTT TASGKLVTDW CCRSCSLPPL RYTTKDGCWY GMEIRPVKEE EAKLVKSRVT 

      1150       1160       1170       1180       1190       1200 
AGVAGGMEPF QLGLLVAFIA TQEVLKRRWT GKLTLTSLAV CLALLIFGNL TYMDLVRYLV 

      1210       1220       1230       1240       1250       1260 
LVGTAFAEMN TGGDVIHLAL VAVFKVQPAF LAGLFLRMQW SNQENILMVI GAAFLQMAAN 

      1270       1280       1290       1300       1310       1320 
DLKLEVLPIL NAMSIAWMLI RAMKEGKVAM YALPILCALT PGMRMAGLDV IRCLLLIIGI 

      1330       1340       1350       1360       1370       1380 
VTLLNERRES VAKKKGGYLL AAALCQAGVC SPLIMMGGLI LAHPNGKRSW PASEVLTGVG 

      1390       1400       1410       1420       1430       1440 
LMCALAGGLL EFEETSMVVP FAIAGLMYIT YTVSGKAAEM WIEKAADITW EQNAEITGTS 

      1450       1460       1470       1480       1490       1500 
PRLDVDLDSH GNFKLLNDPG APVHLFALRF ILLGLSARFH WFIPFGVLGF WLLGKHSKRG 

      1510       1520 
GALWDVPSPK VYPKCETKPG IYRIM 

« Hide

References

[1]"Partial nucleotide sequence of St. Louis encephalitis virus RNA: structural proteins, NS1, NS2A, and NS2B."
Trent D.W., Kinney R.M., Johnson B.J.B., Vorndam A.V., Grant J.A., Deubel V., Rice C.M., Hahn C.
Virology 156:293-304(1987) [PubMed: 3027980] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].

Cross-references

Sequence databases

M16614 Genomic RNA. Translation: AAA47786.1.

3D structure databases

SMRP09732. Positions 289-691.
ModBaseSearch...

Family and domain databases

InterProIPR000069. Env_glycoprot_M_flavivir.
IPR013756. Flav_glyE_cen_2.
IPR011999. Flav_glyE_cen_dm.
IPR013754. Flav_glyE_dim.
IPR001122. Flavi_capsidC.
IPR001157. Flavi_NS1.
IPR000752. Flavi_NS2A.
IPR000487. Flavi_NS2B.
IPR002535. Flavi_propep.
IPR000336. Flv_glyE_Ig-like.
[Graphical view]
Gene3DG3DSA:3.30.67.10. Flav_glyE_cen_2. 1 hit.
G3DSA:2.60.98.10. Flav_glyE_dim. 1 hit.
G3DSA:2.60.40.350. Flv_glyE_Ig-like. 1 hit.
PfamPF01003. Flavi_capsid. 1 hit.
PF02832. Flavi_glycop_C. 1 hit.
PF00869. Flavi_glycoprot. 1 hit.
PF01004. Flavi_M. 1 hit.
PF00948. Flavi_NS1. 1 hit.
PF01005. Flavi_NS2A. 1 hit.
PF01002. Flavi_NS2B. 1 hit.
PF01570. Flavi_propep. 1 hit.
[Graphical view]
ProDomPD001496. Flavi_NS1. 1 hit.
[Graphical view] [Entries sharing at least one domain]
ProtoNetSearch...

Entry information

Entry namePOLG_STEVM
AccessionPrimary (citable) accession number: P09732
Secondary accession number(s): Q88781 expand/collapse secondary AC list , Q88782, Q88783, Q88784, Q88785, Q88786, Q88787, Q88788
Entry history
Integrated into UniProtKB/Swiss-Prot: July 1, 1989
Last sequence update: July 1, 1989
Last modified: June 16, 2009
This is version 76 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectVirus (Virus annotation project)

Relevant documents

Peptidase families

Classification of peptidase families and list of entries

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents