Reviewed, UniProtKB/Swiss-Prot P27914 (POLG_DEN2T)

Last modified November 3, 2009. Version 82.

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein names
Recommended name:
    Genome polyprotein
Cleaved into the following 5 chains:
    1- Recommended name:
            Envelope protein E
    2- Recommended name:
            Non-structural protein 1
                Short name=NS1
    3- Recommended name:
            Non-structural protein 2A
                Short name=NS2A
    4- Recommended name:
            Non-structural protein 2A-alpha
                Short name=NS2A-alpha
    5- Recommended name:
            Serine protease subunit NS3
              EC=3.4.21.91
        Alternative name(s):
            Non-structural protein 3
Organism: Dengue virus type 2 (strain Tonga/EKB194/1974) (DENV-2)
Taxonomic identifier: 11067 [NCBI]
Taxonomic lineage: Viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; Flavivirus; Dengue virus group
Virus host:
    Erythrocebus patas (Red guenon) (Cercopithecus patas) [TaxID: 9538]
    Homo sapiens (Human) [TaxID: 9606]
    Diceromyia [TaxID: 53539]
    Aedimorphus [TaxID: 53540]
    Stegomyia [TaxID: 53541]

Protein attributes

Sequence length: 1683 AA.
Sequence status: Fragments.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level.

General annotation (Comments)

Catalytic activity

Selective hydrolysis of -Xaa-Xaa-|-Yaa- bonds in which each of the Xaa can be either Arg or Lys and Yaa can be either Ser or Ala.
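
The cleavage rule above can be read as a simple text pattern: two residues that are each Arg (R) or Lys (K), followed by Ser (S) or Ala (A). The following minimal sketch scans a sequence for positions that fit that pattern. The function name `candidate_cleavage_sites` and its `offset` parameter are hypothetical, and a pattern match is only a candidate; it says nothing about whether the protease actually cleaves there. The test fragment is residues 1058-1071 copied from the sequence of this entry.

```python
import re

# Zero-width pattern: two residues from {R, K} immediately before the
# scissile bond, and S or A immediately after it.
CLEAVAGE = re.compile(r"(?<=[RK][RK])(?=[SA])")


def candidate_cleavage_sites(seq, offset=1):
    """Return (P1, P1') position pairs that fit the Xaa-Xaa-|-Yaa rule.

    `offset` is the 1-based position of the first residue of `seq`
    in the full-length numbering (an illustrative convention).
    """
    sites = []
    for match in CLEAVAGE.finditer(seq):
        p1_prime = match.start() + offset  # residue right after the bond
        sites.append((p1_prime - 1, p1_prime))
    return sites


if __name__ == "__main__":
    fragment = "LSRTSKKRAGVLWD"  # residues 1058-1071 of this entry
    print(candidate_cleavage_sites(fragment, offset=1058))  # [(1065, 1066)]
```

The single hit, 1065-1066, coincides with one of the positions that the Sites section of this entry lists as cleaved by serine protease NS3.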

Subcellular location

Envelope protein E: Virion (Potential). Host membrane; Multi-pass membrane protein (Potential).

Post-translational modification

Specific enzymatic cleavages in vivo yield mature proteins (By similarity).

Miscellaneous

The virion of this virus is a nucleocapsid covered by a lipoprotein envelope. The envelope contains two proteins: the protein M and glycoprotein E. The nucleocapsid is a complex of protein C and mRNA. In immature particles, there are 60 icosahedrally organized trimeric spikes on the surface. Each spike consists of three heterodimers of envelope protein M precursor (prM) and envelope protein E (By similarity).

Sequence similarities

Contains 1 helicase ATP-binding domain.

Contains 1 helicase C-terminal domain.

Sequence annotation (Features)

Feature key            Position(s)    Length  Description                                       Feature identifier

Molecule processing

Chain                  1 – 495        495     Envelope protein E (By similarity)                PRO_0000037980
Chain                  496 – 847      352     Non-structural protein 1 (By similarity)          PRO_0000037981
Chain                  848 – ›1064    ›217    Non-structural protein 2A (By similarity)         PRO_0000308290
Chain                  848 – 1035     188     Non-structural protein 2A-alpha (By similarity)   PRO_0000037982
Chain                  1066 – 1683    618     Serine protease subunit NS3 (By similarity)       PRO_0000037983

Regions

Topological domain     7 – 445        439     Lumenal (Potential)
Transmembrane          446 – 466      21      (Potential)
Topological domain     467 – 472      6       Cytoplasmic (Potential)
Transmembrane          473 – 493      21      (Potential)
Topological domain     494 – 876      383     Lumenal (Potential)
Transmembrane          877 – 897      21      (Potential)
Topological domain     898 – ›1065    ›168    Cytoplasmic (Potential)
Domain                 1245 – 1401    157     Helicase ATP-binding
Domain                 1411 – 1582    172     Helicase C-terminal
Nucleotide binding     1258 – 1265    8       ATP (Potential)
Motif                  1349 – 1352    4       DEAH box

Sites

Active site            1116           1       Charge relay system; for serine protease NS3 activity (By similarity)
Active site            1140           1       Charge relay system; for serine protease NS3 activity (By similarity)
Active site            1200           1       Charge relay system; for serine protease NS3 activity (By similarity)
Site                   495 – 496      2       Cleavage; by host signal peptidase (By similarity)
Site                   847 – 848      2       Cleavage; by host (By similarity)
Site                   1035 – 1036    2       Cleavage; by serine protease NS3 (By similarity)
Site                   1065 – 1066    2       Cleavage; by serine protease NS3 (By similarity)

Amino acid modifications

Glycosylation          67             1       N-linked (GlcNAc...); by host (Potential)
Glycosylation          153            1       N-linked (GlcNAc...); by host (Potential)
Disulfide bond         3 ↔ 30                 (By similarity)
Disulfide bond         60 ↔ 121               (By similarity)
Disulfide bond         74 ↔ 105               (By similarity)
Disulfide bond         92 ↔ 116               (By similarity)
Disulfide bond         185 ↔ 285              (By similarity)
Disulfide bond         302 ↔ 333              (By similarity)

Experimental info

Non-adjacent residues  1065 – 1066    2
Non-terminal residue   1              1
Non-terminal residue   1683           1
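
Feature positions in this entry are 1-based and inclusive, so a feature's length is end - start + 1, and slicing a Python string needs a shift to 0-based indexing. The sketch below illustrates that arithmetic. The `Feature` class and its `extract` method are hypothetical helpers, not part of any UniProt tooling, and the test segment is residues 1341-1360 copied from the sequence of this entry.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    key: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    description: str = ""

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count.
        return self.end - self.start + 1

    def extract(self, sequence: str, offset: int = 1) -> str:
        """Return the feature's residues from `sequence`.

        `offset` is the 1-based position of sequence[0] in the
        full-length numbering (illustrative convention).
        """
        return sequence[self.start - offset : self.end - offset + 1]


if __name__ == "__main__":
    segment_1341_1360 = "PNYNLIIMDEAHFTDPASIA"
    deah = Feature("Motif", 1349, 1352, "DEAH box")
    print(deah.length)                                   # 4
    print(deah.extract(segment_1341_1360, offset=1341))  # DEAH
    print(Feature("Chain", 496, 847).length)             # 352
```

The same arithmetic reproduces the Length column for the fully specified ranges, for example 1245 to 1401 gives 157.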

Sequences

Sequence: P27914-1 [UniParc].
Length: 1,683 AA
Mass: 187,440 Da

Last modified August 1, 1992. Version 1.
Checksum: 3B0438D96196BFC8

        10         20         30         40         50         60 
MRCIGISNRD FVEGVSGGSW VDIVLEHGSC VTTMAKNKPT LDFELIKTEA KQPATLRKYC 

        70         80         90        100        110        120 
IEAKLTNTTT DSRCPTQGEP TLNEEQDKRF VCKHSMVDRG WGNGCGLFGK GGIVTCAMFT 

       130        140        150        160        170        180 
CKKNMEGKIV QPENLEYTVV ITPHSGEEHA VGNDTGKHGK EVKITPQSSI TEAELTGYGT 

       190        200        210        220        230        240 
VTMECSPRTG LDFNEMVLLQ MEDKAWLVHR QWFLDLPLPW LPGADTQGSN WIQKETLVTF 

       250        260        270        280        290        300 
KNPHAKKQDV VVLGSQEGAM HTALTGATEI QMSSGNLLFT GHLKCRLRMD KLQLKGMSYS 

       310        320        330        340        350        360 
MCTGKFKIVK EIAETQHGTI VIRVQYEGDG SPCKIPFEIM DLEKRHVLGR LITVNPIVTE 

       370        380        390        400        410        420 
KDSPVNIEAE PPFGDSYIII GVEPGQLKLD WFKKGSSIGQ MFETTMRGAK RMAILGDTAW 

       430        440        450        460        470        480 
DFGSLGGVFT SIGKALHQVF GAIYGAAFSG VSWTMKILIG VIITWIGMNS RSTSLSVSLV 

       490        500        510        520        530        540 
LVGIVTLYLG VMVQADSGCV VSWKNKELKC GSGIFVTDNV HTWTEQYKFQ PESPSKLASA 

       550        560        570        580        590        600 
IQKAHEEGIC GIRSVTRLEN LMWKQITSEL NHILSENEVK LTIMTGDIKG IMQVGKRSLR 

       610        620        630        640        650        660 
PQPTELRYSW KTWGKAKMLS TELHNQTFLI DGPETAECPN TNRAWNSLEV EDYGFGVFTT 

       670        680        690        700        710        720 
NIWLRLREKQ DVFCDSKLMS AAIKDNRAVH ADMGYWIESA LNDTWKIEKA SFIEVKSCHW 

       730        740        750        760        770        780 
PKSHTLWSNG VLESEMVIPK NFAGPVSQHN NRPGYYTQTA GPWHLGKLEM DFDFCEGTTV 

       790        800        810        820        830        840 
VVTEDCGNRG PSLRTTTASG KLITEWCCRS CTLPPLRYRG EDGCWYGMEI RPLKEKEENL 

       850        860        870        880        890        900 
VSSLVTAGHG QIDNFSLGIL GMALFLEEML RTRVGTKHAI LLVAVSFVTL ITGNMSFRDL 

       910        920        930        940        950        960 
GRVMVMVGAT MTDDIGMGVT YLALLAAFRV RPTFAAGLLL RKLTSKELMM TTIGIVLLSQ 

       970        980        990       1000       1010       1020 
SSIPETILEL TDALALGMMV LKMVRNMEKY QLAVTIMAIL CVPNAVILQN AWKVSCTILA 

      1030       1040       1050       1060       1070       1080 
VVSVSPLLLT SSQQKADWIP LALTIKGLNP TAIFLTTLSR TSKKRAGVLW DVPSPPPVGK 

      1090       1100       1110       1120       1130       1140 
AELEDGAYRI KQKGILGYSQ IGAGVYKEGT FHTMWHVTRG AVLMHKGKRI EPSWADVKKD 

      1150       1160       1170       1180       1190       1200 
LISYGGGWKL EGEWKEGEEV QVLALEPGKN PRAVQTKPGL FRTNTGTIGA VSLDFSPGTS 

      1210       1220       1230       1240       1250       1260 
GSPIVDKKGK VVGLYGNGVV TRSGAYVSAI AQTEKSIEDN PEIEDDIFRK RRLTIMDLHP 

      1270       1280       1290       1300       1310       1320 
GAGKTKRYLP AIVREAIKRG LRTLILAPTR VVAAEMEEAL RGLPIRYQTP AIRAEHTGRE 

      1330       1340       1350       1360       1370       1380 
IVDLMCHATF TMRLLSPIRV PNYNLIIMDE AHFTDPASIA ARGYISTRVE MGEAAGIFMT 

      1390       1400       1410       1420       1430       1440 
ATPPGSRDPF PQSNAPIMDE EREIPERSWN SGHEWVTDFK GKTVWFVPSI KTGNDIAACL 

      1450       1460       1470       1480       1490       1500 
RKNGKRVIQL SRKTFDSEYV KTRTNDWDFV VTTDISEMGA NFKAERVIDP RRCMKPVILT 

      1510       1520       1530       1540       1550       1560 
DGEERVILAG PMPVTHSSAA QRRGRIGRNP RNENDQYIYM GEPLENDEDC AHWKEAKMLL 

      1570       1580       1590       1600       1610       1620 
DNINTPEGII PSIFEPEREK VDAIDGEYRL RGEARKTFVD LMRRGDLPVW LAYKVAAEGI 

      1630       1640       1650       1660       1670       1680 
NYADRRWCFD GTRNNQILEE NVEVEIWTKE GERKKLKPRW LDARIYSDPL ALKEFKEFAA 


GRK 
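
The sequence above is displayed in blocks of ten residues under ruler lines of position numbers. To work with it programmatically, the rulers and spaces have to be stripped. The sketch below is one way to do that; the function name `parse_numbered_block` is hypothetical, and only the first two rows of the display (residues 1-120) are embedded as sample input.

```python
def parse_numbered_block(text: str) -> str:
    """Join residue groups into one string, skipping numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines made only of position numbers.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


SAMPLE = """\
        10         20         30         40         50         60
MRCIGISNRD FVEGVSGGSW VDIVLEHGSC VTTMAKNKPT LDFELIKTEA KQPATLRKYC

        70         80         90        100        110        120
IEAKLTNTTT DSRCPTQGEP TLNEEQDKRF VCKHSMVDRG WGNGCGLFGK GGIVTCAMFT
"""

if __name__ == "__main__":
    seq = parse_numbered_block(SAMPLE)
    print(len(seq))                 # 120
    # Positions are 1-based in the entry, so subtract 1 when indexing.
    print(seq[3 - 1], seq[30 - 1])  # C C  (disulfide bond 3 <-> 30)
    print(seq[67 - 1])              # N    (glycosylation site 67)
```

Applied to the full display, the same function should yield a string whose length can be checked against the stated sequence length of 1,683.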


References

[1] "Nucleotide sequence of the envelope glycoprotein gene of a dengue-2 virus isolated during an epidemic of benign dengue fever in Tonga in 1974."
Chen W., Maguire T.
Nucleic Acids Res. 18:5889-5889(1990) [PubMed: 2216784]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 1-495.
[2] Qu X., Chen W., Maguire T.
Submitted (MAR-1991) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 496-1683.

Cross-references

Sequence databases

X54319 Genomic RNA. Translation: CAA38217.1.
X57469 Genomic RNA. Translation: CAA40705.1.
X57468 Genomic RNA. Translation: CAA40704.1.
PIR: PQ0507.
     S11482.

3D structure databases

Entry   Method                Resolution (Å)   Chain   Positions
1TG8    X-ray                 2.61             A       1-395
1TGE    electron microscopy   12.50            A/B/C   1-395
SMR: P27914. Positions 1070-1246, 1236-1683.

Family and domain databases

InterPro
    IPR014001. DEAD-like_N.
    IPR011492. DEAD_Flavivir.
    IPR001650. DNA/RNA_helicase_C.
    IPR002464. DNA/RNA_helicase_DEAH_CS.
    IPR013754. Flav_glyE_dim.
    IPR001157. Flavi_NS1.
    IPR000752. Flavi_NS2A.
    IPR000336. Flv_glyE_Ig-like.
    IPR011999. GlycoprotE_cen/dimer_Flavivir.
    IPR014021. Helicase_SF1/SF2_ATP-bd.
    IPR001850. Peptidase_S7.
Gene3D
    G3DSA:2.60.98.10. Flav_glyE_dim. 1 hit.
    G3DSA:2.60.40.350. Flv_glyE_Ig-like. 1 hit.
Pfam
    PF07652. Flavi_DEAD. 1 hit.
    PF02832. Flavi_glycop_C. 1 hit.
    PF00869. Flavi_glycoprot. 1 hit.
    PF00948. Flavi_NS1. 1 hit.
    PF01005. Flavi_NS2A. 1 hit.
    PF00949. Peptidase_S7. 1 hit.
ProDom
    PD001496. Flavi_NS1. 1 hit.
SMART
    SM00487. DEXDc. 1 hit.
    SM00490. HELICc. 1 hit.
PROSITE
    PS00690. DEAH_ATP_HELICASE. False negative.
    PS51192. HELICASE_ATP_BIND_1. 1 hit.
    PS51194. HELICASE_CTER. 1 hit.

Entry information

Entry name: POLG_DEN2T
Accession: Primary (citable) accession number: P27914
Entry history:
    Integrated into UniProtKB/Swiss-Prot: August 1, 1992
    Last sequence update: August 1, 1992
    Last modified: November 3, 2009
    This is version 82 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: Virus (Virus annotation project)

Relevant documents

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families
