Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Envelopment polyprotein

Gene

GP

Organism
Rift valley fever virus (RVFV)
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Experimental evidence at protein leveli

Functioni

Glycoprotein N: interact with each other and are present at the surface of the virion. They are able to attach the virion to a cell receptor and to promote fusion of membranes after endocytosis of the virion (By similarity).By similarity
Glycoprotein C: interact with each other and are present at the surface of the virion. Together they play a role in attachment of the virion to a cell receptor. Promotes fusion of membranes after endocytosis of the virion (By similarity).By similarity
NSm protein: Plays a role for virus dissemination in mouse.By similarity
NSm-Gn protein: Plays a role for virus dissemination in mosquitoes.By similarity

GO - Biological processi

Complete GO annotation...

Keywords - Biological processi

Fusion of virus membrane with host endosomal membrane, Fusion of virus membrane with host membrane, Host-virus interaction, Viral attachment to host cell, Viral penetration into host cytoplasm, Virus entry into host cell

Names & Taxonomyi

Protein namesi
Recommended name:
Envelopment polyprotein
Alternative name(s):
M polyprotein
Cleaved into the following 3 chains:
NSm-Gn proteinBy similarity
Glycoprotein NBy similarity
Short name:
Gn
Alternative name(s):
Glycoprotein G1
Glycoprotein CBy similarity
Short name:
Gc
Alternative name(s):
Glycoprotein G2
Gene namesi
Name:GP
OrganismiRift valley fever virus (RVFV)
Taxonomic identifieri11588 [NCBI]
Taxonomic lineageiVirusesssRNA virusesssRNA negative-strand virusesBunyaviridaePhlebovirus
Virus hostiAedes [TaxID: 7158]
Bos taurus (Bovine) [TaxID: 9913]
Bos taurus x Bison bison (beefalo) [TaxID: 297284]
Camelus bactrianus (Bactrian camel) [TaxID: 9837]
Capra hircus (Goat) [TaxID: 9925]
Homo sapiens (Human) [TaxID: 9606]
Ovis aries (Sheep) [TaxID: 9940]
Phlebotomus papatasi (Sandfly) [TaxID: 29031]

Subcellular locationi

Glycoprotein N :
  • Virion membrane By similarity; Single-pass type I membrane protein By similarity
  • Host Golgi apparatus membrane By similarity; Single-pass type I membrane protein By similarity
  • Host endoplasmic reticulum membrane By similarity; Single-pass type I membrane protein By similarity

  • Note: Interaction between G1 and G2 is essential for proper targeting of G1 to the Golgi complex, where virion budding occurs.By similarity
Glycoprotein C :
  • Virion membrane By similarity; Single-pass type I membrane protein By similarity
  • Host Golgi apparatus membrane By similarity; Single-pass type I membrane protein By similarity

Topology

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Topological domaini17 – 582566LumenalSequence analysisAdd
BLAST
Transmembranei583 – 60321HelicalSequence analysisAdd
BLAST
Topological domaini604 – 69087CytoplasmicSequence analysisAdd
BLAST
Topological domaini691 – 1136446LumenalSequence analysisAdd
BLAST
Transmembranei1137 – 115721HelicalSequence analysisAdd
BLAST
Topological domaini1158 – 120649CytoplasmicSequence analysisAdd
BLAST

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Host endoplasmic reticulum, Host Golgi apparatus, Host membrane, Membrane, Virion

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Signal peptidei1 – 1616Sequence analysisAdd
BLAST
Chaini17 – 12061190Envelopment polyproteinPRO_0000247009Add
BLAST
Chaini17 – 153137NSm-Gn proteinPRO_0000036847Add
BLAST
Chaini154 – 690537Glycoprotein NSequence analysisPRO_0000036848Add
BLAST
Chaini691 – 1206516Glycoprotein CSequence analysisPRO_0000036849Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Glycosylationi88 – 881N-linked (GlcNAc...); by hostSequence analysis
Glycosylationi438 – 4381N-linked (GlcNAc...); by hostSequence analysis
Glycosylationi794 – 7941N-linked (GlcNAc...); by hostSequence analysis
Glycosylationi1035 – 10351N-linked (GlcNAc...); by hostSequence analysis
Glycosylationi1077 – 10771N-linked (GlcNAc...); by hostSequence analysis

Post-translational modificationi

Specific enzymatic cleavages in vivo yield mature proteins including NSm protein, Glycoprotein C, and Glycoprotein N.By similarity

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sitei690 – 6912Cleavage; by host signal peptidaseBy similarity

Keywords - PTMi

Glycoprotein

Proteomic databases

PRIDEiP03518.

Interactioni

Subunit structurei

Glycoprotein C and Glycoprotein N interacts with each other.By similarity

Structurei

Secondary structure

1
1206
Legend: HelixTurnBeta strand
Show more details
Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Beta strandi691 – 6966Combined sources
Helixi699 – 7013Combined sources
Beta strandi702 – 7098Combined sources
Beta strandi711 – 72212Combined sources
Beta strandi728 – 7358Combined sources
Beta strandi742 – 76423Combined sources
Beta strandi767 – 77610Combined sources
Turni785 – 7906Combined sources
Helixi798 – 8036Combined sources
Beta strandi810 – 8178Combined sources
Helixi821 – 8233Combined sources
Beta strandi826 – 8294Combined sources
Beta strandi831 – 84414Combined sources
Beta strandi847 – 86418Combined sources
Beta strandi870 – 8756Combined sources
Beta strandi880 – 8834Combined sources
Beta strandi886 – 8938Combined sources
Helixi900 – 9023Combined sources
Beta strandi904 – 9085Combined sources
Turni909 – 9113Combined sources
Beta strandi912 – 9187Combined sources
Beta strandi931 – 9366Combined sources
Helixi937 – 9415Combined sources
Beta strandi947 – 9493Combined sources
Beta strandi954 – 9596Combined sources
Beta strandi962 – 9676Combined sources
Helixi972 – 9787Combined sources
Beta strandi980 – 9856Combined sources
Beta strandi988 – 9925Combined sources
Beta strandi994 – 9963Combined sources
Beta strandi999 – 10035Combined sources
Beta strandi1009 – 102113Combined sources
Beta strandi1030 – 104415Combined sources
Beta strandi1046 – 105611Combined sources
Beta strandi1058 – 10647Combined sources
Beta strandi1070 – 10756Combined sources
Beta strandi1077 – 108610Combined sources
Beta strandi1089 – 110315Combined sources
Beta strandi1106 – 11138Combined sources

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
4HJ1X-ray1.90A/B/C/D691-1119[»]
4HJCX-ray4.15A691-1118[»]
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

Keywords - Domaini

Signal, Transmembrane, Transmembrane helix

Family and domain databases

InterProiIPR016404. M_polyprot_prcur_phlebovir.
IPR010826. Phlebovirus_G1.
IPR009878. Phlebovirus_G2.
IPR009879. Phlebovirus_NSM.
[Graphical view]
PfamiPF07243. Phlebovirus_G1. 1 hit.
PF07245. Phlebovirus_G2. 1 hit.
PF07246. Phlebovirus_NSM. 2 hits.
[Graphical view]
PIRSFiPIRSF003961. M_poly_PhleboV. 1 hit.

Sequences (3)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 3 isoformsi produced by alternative initiation. AlignAdd to basket

Isoform 1 (identifier: P03518-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MYVLLTILIS VLVCEAVIRV SLSSTREETC FGDSTNPEMI EGAWDSLREE
60 70 80 90 100
EMPEELSCSI SGIREVKTSS QELYRALKAI IAADGLNNIT CHGKDPEDKI
110 120 130 140 150
SLIKGPPHKK RVGIVRCERR RDAKQIGRET MAGIAMTVLP ALAVFALAPV
160 170 180 190 200
VFAEDPHLRN RPGKGHNYID GMTQEDATCK PVTYAGACSS FDVLLEKGKF
210 220 230 240 250
PLFQSYAHHR TLLEAVHDTI IAKADPPSCD LQSAHGNPCM KEKLVMKTHC
260 270 280 290 300
PNDYQSAHYL NNDGKMASVK CPPKYGLTED CNFCRQMTGA SLKKGSYPLQ
310 320 330 340 350
DLFCQSSEDD GSKLKTKMKG VCEVGVQAHK KCDGQLSTAH EVVPFAVFKN
360 370 380 390 400
SKKVYLDKLD LKTEENLLPD SFVCFEHKGQ YKGTMDSGQT KRELKSFDIS
410 420 430 440 450
QCPKIGGHGS KKCTGDAAFC SAYECTAQYA NAYCSHANGS GIVQIQVSGV
460 470 480 490 500
WKKPLCVGYE RVVVKRELSA KPIQRVEPCT TCITKCEPHG LVVRSTGFKI
510 520 530 540 550
SSAVACASGV CVTGSQSPST EITLKYPGIS QSSGGDIGVH MAHDDQSVSS
560 570 580 590 600
KIVAHCPPQD PCLVHGCIVC AHGLINYQCH TALSAFVVVF VFSSIAIICL
610 620 630 640 650
AVLYRVLKCL KIAPRKVLNP LMWITAFIRW IYKKMVARVA HNINQVNREI
660 670 680 690 700
GWMEGGQLVL GNPAPIPRHA PIPRYSTYLM LLLIVSYASA CSELIQASSR
710 720 730 740 750
ITTCSTEGVN TKCRLSGTAL IRAGSVGAEA CLMLKGVKED QTKFLKIKTV
760 770 780 790 800
SSELSCREGQ SYWTGSISPK CLSSRRCHLV GECHVNRCLS WRDNETSAEF
810 820 830 840 850
SFVGESTTMR ENKCFEQCGG WGCGCFNVNP SCLFVHTYLQ SVRKEALRVF
860 870 880 890 900
NCIDWVHKLT LEITDFDGSV STIDLGASSS RFTNWGSVSL SLDAEGISGS
910 920 930 940 950
NSFSFIESPS KGYAIVDEPF SEIPRQGFLG EIRCNSESSV LSAHESCLRA
960 970 980 990 1000
PNLISYKPMI DQLECTTNLI DPFVVFERGS LPQTRNDKTF AASKGNRGVQ
1010 1020 1030 1040 1050
AFSKGSVQAD LTLMFDNFEV DFVGAAVSCD AAFLNLTGCY SCNAGARVCL
1060 1070 1080 1090 1100
SITSTGTGSL SAHNKDGSLH IVLPSENGTK DQCQILHFTV PEVEEEFMYS
1110 1120 1130 1140 1150
CDGDERPLLV KGTLIAIDPF DDRREAGGES TVVNPKSGSW NFFDWFSGLM
1160 1170 1180 1190 1200
SWFGGPLKLY SSFACMLHYQ LGSFSSLYIL EEQASLKCGL LPLRRPHRSV

RVKVIC
Length:1,206
Mass (Da):132,053
Last modified:November 1, 1995 - v2
Checksum:iD2E8017179285924
GO
Isoform NSm proteinBy similarity (identifier: P03518-2) [UniParc]FASTAAdd to basket

Also known as: P14

The sequence of this isoform differs from the canonical sequence as follows:
     1-38: Missing.
     154-1197: Missing.

Show »
Length:124
Mass (Da):13,657
Checksum:iE20CCDB67F1AD7DC
GO
Isoform NSm' proteinBy similarity (identifier: P03518-3) [UniParc]FASTAAdd to basket

Also known as: P13

The sequence of this isoform differs from the canonical sequence as follows:
     1-51: Missing.
     154-1197: Missing.

Show »
Length:111
Mass (Da):12,110
Checksum:iF6418D10C3EC8F6D
GO

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei1 – 5151Missing in isoform NSm' protein. VSP_057985Add
BLAST
Alternative sequencei1 – 3838Missing in isoform NSm protein. VSP_057986Add
BLAST
Alternative sequencei154 – 11971044Missing in isoform NSm protein and isoform NSm' protein. VSP_057987Add
BLAST

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M11157 Genomic RNA. Translation: AAA47450.1.
PIRiA04110. VGVURV.

Keywords - Coding sequence diversityi

Alternative initiation

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M11157 Genomic RNA. Translation: AAA47450.1.
PIRiA04110. VGVURV.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
4HJ1X-ray1.90A/B/C/D691-1119[»]
4HJCX-ray4.15A691-1118[»]
ModBaseiSearch...
MobiDBiSearch...

Proteomic databases

PRIDEiP03518.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Family and domain databases

InterProiIPR016404. M_polyprot_prcur_phlebovir.
IPR010826. Phlebovirus_G1.
IPR009878. Phlebovirus_G2.
IPR009879. Phlebovirus_NSM.
[Graphical view]
PfamiPF07243. Phlebovirus_G1. 1 hit.
PF07245. Phlebovirus_G2. 1 hit.
PF07246. Phlebovirus_NSM. 2 hits.
[Graphical view]
PIRSFiPIRSF003961. M_poly_PhleboV. 1 hit.
ProtoNetiSearch...

Publicationsi

  1. Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].

Entry informationi

Entry nameiGP_RVFV
AccessioniPrimary (citable) accession number: P03518
Secondary accession number(s): Q86494, Q86495
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 21, 1986
Last sequence update: November 1, 1995
Last modified: January 20, 2016
This is version 81 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

3D-structure

Documents

  1. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.