Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P14403 (POLG_JAEVN)

Last modified June 16, 2009. Version 72. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information

Names and origin

Protein namesRecommended name:
    Genome polyprotein
Cleaved into the following 7 chains:
    1- Recommended name:
            Protein C
        Alternative name(s):
            Core protein
            Capsid protein
    2- Recommended name:
            Small envelope protein M
        Alternative name(s):
            Matrix protein
    3- Recommended name:
            Envelope protein E
    4- Recommended name:
            Non-structural protein 1
                Short name=NS1
    5- Recommended name:
            Non-structural protein 2A
                Short name=NS2A
    6- Recommended name:
            Flavivirin protease NS2B regulatory subunit
    7- Recommended name:
            Flavivirin protease NS3 catalytic subunit
              EC=3.4.21.91
OrganismJapanese encephalitis virus (strain Nakayama)
Taxonomic identifier11076 [NCBI]
Taxonomic lineageVirusesssRNA positive-strand viruses, no DNA stageFlaviviridaeFlavivirusJapanese encephalitis virus group
Virus hostCulex tritaeniorhynchus (Mosquito) [TaxID: 7178]
Ardeidae (herons) [TaxID: 8899]
Homo sapiens (Human) [TaxID: 9606]
Equus caballus (Horse) [TaxID: 9796]
Sus scrofa (Pig) [TaxID: 9823]
Bos taurus (Bovine) [TaxID: 9913]
Culex gelidus [TaxID: 308713]

Protein attributes

Sequence length1440 AA.
Sequence statusFragment.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceInferred from homology.

General annotation (Comments)

Function

The small proteins NS2A, NS4A and NS4B are hydrophobic, suggesting a possible membrane-related function. NS5 may play a role in the viral RNA replication. The NS2B/NS3 protease complex processes the viral polyprotein.

Catalytic activity

Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds in which each of the Xaa can be either Arg or Lys and Yaa can be either Ser or Ala.

Subunit structure

NS3 and NS2B form a heterodimer. NS3 is the catalytic subunit, whereas NS2B strongly stimulates the latter By similarity.

Subcellular location

Protein C: Virion Potential. Host membrane; Single-pass membrane protein Potential.

Small envelope protein M: Virion Potential. Host membrane; Single-pass membrane protein Potential.

Envelope protein E: Virion Potential. Host membrane; Multi-pass membrane protein Potential.

Post-translational modification

Specific enzymatic cleavages in vivo yield mature proteins By similarity.

Miscellaneous

The virion of this virus is a nucleocapsid covered by a lipoprotein envelope. The envelope contains two proteins: the protein M and glycoprotein E. The nucleocapsid is a complex of protein C and mRNA. In immature particles, there are 60 icosaedrally organized trimeric spikes on the surface. Each spike consists of three heterodimers of envelope protein M precursor (prM) and envelope protein E By similarity.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain‹1 – 53›53Protein C
PRO_0000037869
Propeptide54 – 14693
PRO_0000037870
Chain147 – 22276Small envelope protein M
PRO_0000037871
Chain223 – 794572Envelope protein E
PRO_0000037872
Chain795 – 1136342Non-structural protein 1
PRO_0000037873
Chain1137 – 1301165Non-structural protein 2A
PRO_0000037874
Chain1302 – 1432131Flavivirin protease NS2B regulatory subunit
PRO_0000037875
Chain1433 – ›1440›8Flavivirin protease NS3 catalytic subunit
PRO_0000037876

Regions

Transmembrane36 – 5621 Potential
Transmembrane181 – 20121 Potential
Transmembrane208 – 22417 Potential
Transmembrane675 – 69521 Potential
Transmembrane702 – 72221 Potential
Transmembrane1106 – 112621 Potential
Transmembrane1148 – 116821 Potential
Transmembrane1179 – 119921 Potential
Transmembrane1201 – 122121 Potential
Transmembrane1238 – 125821 Potential
Transmembrane1270 – 129021 Potential
Transmembrane1303 – 132321 Potential
Transmembrane1327 – 134721 Potential
Transmembrane1405 – 142521 Potential

Amino acid modifications

Glycosylation681N-linked (GlcNAc...); by host Potential
Glycosylation3761N-linked (GlcNAc...); by host Potential
Glycosylation8521N-linked (GlcNAc...); by host Potential
Glycosylation9291N-linked (GlcNAc...); by host Potential
Disulfide bond225 ↔ 252 By similarity
Disulfide bond282 ↔ 338 By similarity
Disulfide bond296 ↔ 327 By similarity
Disulfide bond314 ↔ 343 By similarity
Disulfide bond412 ↔ 509 By similarity
Disulfide bond526 ↔ 557 By similarity

Experimental info

Non-terminal residue11
Non-terminal residue14401

Sequences

Sequence LengthMass (Da)Tools
P14403-1 [UniParc].

Last modified January 1, 1990. Version 1.
Checksum: 4D489A365A3C2E6E

FASTA1,440158,185
        10         20         30         40         50         60 
SVAMKHLTSF KRELGTLIDA VNKRGRKQNK RGGNEGSIMW LASLAVVIAC AGAMKLSNFQ 

        70         80         90        100        110        120 
GKLLMTVNNT DIADVIVIPN PSKGENRCWV RAIDVGYMCE DTITYECPKL TMGNDPEDVD 

       130        140        150        160        170        180 
CWCDNQEVYV QYGRCTRTRH SKRSRRSVSV QTHGESSLVN KKEAWLDSTK ATRYLMKTEN 

       190        200        210        220        230        240 
WIVRNPGYAF LAAILGWMLG SNNGQRRWYF TILLLLVAPA YSFNCLGMGN RDFIEGASGA 

       250        260        270        280        290        300 
TWVDLVLEGD SCLTIMANDK PTLDVRMINI EAVQLAEVRS YCYHASVTDI STVARCPTTG 

       310        320        330        340        350        360 
EAHNEKRADS SYVCKQGFTD RGWGNGCGLF GKGSIDTCAK FSCTSKAIGR TIQPENIKYE 

       370        380        390        400        410        420 
VGIFVHGTTT SENHGNYSAQ VGASQAAKFT VTPNAPSITL KLGDYGEVTL DCEPRSGLNT 

       430        440        450        460        470        480 
EAFYVMTVGS KSFLVHREWF HDLALPWTPP SSTAWRNREL LMEFEEAHAT KQSVVALGSQ 

       490        500        510        520        530        540 
EGGLHQALAG AIVVEYSSSV KLTSGHLKCR LKMDKLALKG TTYGMCTEKF SFAKNPADTG 

       550        560        570        580        590        600 
HGTVVIELSY SGSDGPCKIP IVSVASLNDM TPVGRLVTVN PFVATSSANS KVLVEMEPPF 

       610        620        630        640        650        660 
GDSYIVVGRG DKQINHHWHK AGSTLGKAFS TTLKGAQRLA ALGDTAWDFG SIGGVFNSIG 

       670        680        690        700        710        720 
KAVHQVFGGA FRTLFGGMSW ITQGLMGALL LWMGVNARDR SIALAFLATG GVLVFLATNV 

       730        740        750        760        770        780 
HADTGCAIDI TRKEMRCGSG IFVHNDVEAW VDRYKYLPET PRSLAKIVHK AHKEGVCGVR 

       790        800        810        820        830        840 
SVTRLEHQMW EAVRDELNVL LKENAVDLSV VVNKPVGRYR SAPKRLSMTQ EKFEMGWKAW 

       850        860        870        880        890        900 
GKSILFAPEL ANSTFVVDGP ETKECPDEHR AWNSIEIEDF GFGITSTRVW LKIREESTDE 

       910        920        930        940        950        960 
CDGAIIGTAV KGHVAVHSDL SYWIESRYND TWKLERAVFG EVKSCTWPET HTLWGDGVEE 

       970        980        990       1000       1010       1020 
SELIIPHTIA GPKSKHNRRE GYKTQNQGPW DENGIVLDFD YCPGTKVTIT EDCGKRGPSV 

      1030       1040       1050       1060       1070       1080 
RTTTDSGKLI TDWCCRSCSL PPLRFRTENG CWYGMEIRPV RHDETTLVRS QVDAFNGEMV 

      1090       1100       1110       1120       1130       1140 
DPFQLGLLVM FLATQEVLRK RWTARLTIPA VLGALLVLML GGITYTDLAR YVVLVAAAFA 

      1150       1160       1170       1180       1190       1200 
EANSGGDVLH LALIAVFKIQ PAFLVMNMLS TRWTNQENVV LVLGAAFFHL ASVDLQIGVH 

      1210       1220       1230       1240       1250       1260 
GILNAAAIAW MIVRAITFPT TSSVTMPVLA LLTPGMRALY LDTYRIILLV IGICSLLQER 

      1270       1280       1290       1300       1310       1320 
KKTMAKKKGA VLLGLALTST GWFSPTTIAA GLMVCNPNKK RGWPATEFLS AVGLMFAIVG 

      1330       1340       1350       1360       1370       1380 
GLAELDIESM SIPFMLAGLM AVSYVVSGKA TDMWLERAAD ISWEMDAAIT GSSRRLDVKL 

      1390       1400       1410       1420       1430       1440 
DDDGDFHLID DPGVPWKVWV LRMSCIGLAA LTPWAIVPAA FGYWLTLKTT KRGGVFWDTP 

« Hide

References

[1]"Partial nucleotide sequence of the Japanese encephalitis virus genome."
McAda P.C., Mason P.W., Schmaljohn C.S., Dalrymple J.M., Mason T.L., Fournier M.J.
Virology 158:348-360(1987) [PubMed: 3035787] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].

Cross-references

Sequence databases

M16574 Genomic RNA. Translation: AAA46251.1.
PIRGNWVJF. A27844.

3D structure databases

HSSPHSSP built from PDB template 1OKE based on UniProtKB Q88653.
SMRP14403. Positions 223-621.
ModBaseSearch...

Family and domain databases

InterProIPR000069. Env_glycoprot_M_flavivir.
IPR013756. Flav_glyE_cen_2.
IPR011999. Flav_glyE_cen_dm.
IPR013754. Flav_glyE_dim.
IPR001122. Flavi_capsidC.
IPR001157. Flavi_NS1.
IPR000752. Flavi_NS2A.
IPR000487. Flavi_NS2B.
IPR002535. Flavi_propep.
IPR000336. Flv_glyE_Ig-like.
[Graphical view]
Gene3DG3DSA:3.30.67.10. Flav_glyE_cen_2. 1 hit.
G3DSA:2.60.98.10. Flav_glyE_dim. 1 hit.
G3DSA:2.60.40.350. Flv_glyE_Ig-like. 1 hit.
PfamPF01003. Flavi_capsid. 1 hit.
PF02832. Flavi_glycop_C. 1 hit.
PF00869. Flavi_glycoprot. 1 hit.
PF01004. Flavi_M. 1 hit.
PF00948. Flavi_NS1. 1 hit.
PF01005. Flavi_NS2A. 1 hit.
PF01002. Flavi_NS2B. 1 hit.
PF01570. Flavi_propep. 1 hit.
[Graphical view]
ProDomPD001496. Flavi_NS1. 1 hit.
[Graphical view] [Entries sharing at least one domain]
ProtoNetSearch...

Entry information

Entry namePOLG_JAEVN
AccessionPrimary (citable) accession number: P14403
Secondary accession number(s): P08769
Entry history
Integrated into UniProtKB/Swiss-Prot: January 1, 1990
Last sequence update: January 1, 1990
Last modified: June 16, 2009
This is version 72 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectVirus (Virus annotation project)
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information