Protein

Envelopment polyprotein

Gene

GP

Organism
Hantaan virus (strain 76-118) (Korean hemorrhagic fever virus)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Glycoprotein N and Glycoprotein C interact with each other and are present at the surface of the virion. They attach the virion to host cell receptors. This attachment induces virion internalization, predominantly through clathrin-dependent endocytosis. They also promote fusion of the viral membrane with the host endosomal membrane after endocytosis of the virion. Glycoprotein N contains an ITAM motif, which is likely to dysregulate normal immune and endothelial cell responses and contribute to virus pathogenesis (by similarity). Evidence: by similarity; 1 publication.

Keywords - Biological process

Fusion of virus membrane with host endosomal membrane, Fusion of virus membrane with host membrane, Host-virus interaction, Inhibition of host innate immune response by virus, Inhibition of host RLR pathway by virus, Inhibition of host TRAFs by virus, Viral attachment to host cell, Viral immunoevasion, Viral penetration into host cytoplasm, Virus endocytosis by host, Virus entry into host cell

Protein family/group databases

TCDB: 1.G.20.1.1. The hantavirus Gc envelope fusion glycoprotein (Gc-EFG) family.

Names & Taxonomy

Protein names
Recommended name: Envelopment polyprotein
Alternative name(s): Glycoprotein precursor (1 publication); M polyprotein
Cleaved into the following 2 chains:
  • Glycoprotein N (1 publication)
    Short name: Gn
    Alternative name(s): Glycoprotein G2
  • Glycoprotein C (1 publication)
    Short name: Gc
    Alternative name(s): Glycoprotein G1
Gene names
Name: GP
Organism: Hantaan virus (strain 76-118) (Korean hemorrhagic fever virus)
Taxonomic identifier: 11602 [NCBI]
Taxonomic lineage: Viruses › ssRNA viruses › ssRNA negative-strand viruses › Bunyaviridae › Hantavirus
Virus host: Apodemus agrarius (Eurasian field mouse) [TaxID: 39030]; Homo sapiens (Human) [TaxID: 9606]
Proteomes
  • UP000008627 Component: Genome

Subcellular location

Glycoprotein N :
Glycoprotein C :

Topology

Feature key          Position(s)    Description   Evidence            Length
Topological domain   19 – 485       Lumenal       Sequence analysis   467
Transmembrane        486 – 506      Helical       Sequence analysis   21
Topological domain   507 – 648      Cytoplasmic   Sequence analysis   142
Topological domain   649 – 1105     Lumenal       Sequence analysis   457
Transmembrane        1106 – 1126    Helical       Sequence analysis   21
Topological domain   1127 – 1135    Cytoplasmic   Sequence analysis   9
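
The topology rows can be checked for internal consistency: each length should equal end minus start plus one, and the segments should tile residues 19 to 1135 without gaps or overlaps. The following is a minimal Python sketch of that check; the `Segment` type and the helper names are illustrative and not part of the entry.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    kind: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    description: str


# Rows copied from the Topology table of this entry.
TOPOLOGY = [
    Segment("Topological domain", 19, 485, "Lumenal"),
    Segment("Transmembrane", 486, 506, "Helical"),
    Segment("Topological domain", 507, 648, "Cytoplasmic"),
    Segment("Topological domain", 649, 1105, "Lumenal"),
    Segment("Transmembrane", 1106, 1126, "Helical"),
    Segment("Topological domain", 1127, 1135, "Cytoplasmic"),
]


def segment_length(segment: Segment) -> int:
    """Length of a segment given 1-based inclusive coordinates."""
    return segment.end - segment.start + 1


def is_contiguous(segments: list) -> bool:
    """True if every segment starts right after the previous one ends."""
    return all(b.start == a.end + 1 for a, b in zip(segments, segments[1:]))


lengths = [segment_length(s) for s in TOPOLOGY]
total = sum(lengths)
```

Here `total` covers the polyprotein without its 18-residue signal peptide, so it should match the length of the 19 – 1135 chain.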

Keywords - Cellular component

Host endoplasmic reticulum, Host Golgi apparatus, Host membrane, Membrane, Viral envelope protein, Virion

PTM / Processing

Molecule processing

Feature key      Position(s)    Description                                Length
Signal peptide   1 – 18                                                    18
Chain            19 – 1135      Envelopment polyprotein (PRO_0000036815)   1117
Chain            19 – ?648      Glycoprotein N (PRO_0000036816)            630
Chain            649 – ?1126    Glycoprotein C (PRO_0000036817)            478
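
Positions in these tables are 1-based and inclusive, which differs from Python's 0-based, end-exclusive slicing. The sketch below shows one way to extract a feature by its table coordinates, using only the first 30 residues of the displayed sequence; the function name `subsequence` is an illustrative choice. Chain ends marked with `?` in the table are uncertain, so slices based on them should be treated as approximate.

```python
def subsequence(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"range {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


# Residues 1-30 of the displayed sequence, enough to show the
# boundary between the signal peptide (1-18) and the mature chain (19 onward).
N_TERMINUS = "".join("MGIWKWLVMA SLVWPVLTLR NVYDMKIECP".split())

signal_peptide = subsequence(N_TERMINUS, 1, 18)
mature_start = subsequence(N_TERMINUS, 19, 30)
```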

Amino acid modifications

Feature key     Position(s)   Description                     Evidence            Length
Glycosylation   134           N-linked (GlcNAc...); by host   Sequence analysis   1
Glycosylation   235           N-linked (GlcNAc...); by host   Sequence analysis   1
Glycosylation   347           N-linked (GlcNAc...); by host   Sequence analysis   1
Glycosylation   399           N-linked (GlcNAc...); by host   Sequence analysis   1
Glycosylation   928           N-linked (GlcNAc...); by host   Sequence analysis   1
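
These sites are predictions from sequence analysis. N-linked glycosylation generally occurs at the sequon N-X-S/T, where X is any residue except proline, so candidate sites can be found with a simple pattern scan. The sketch below runs such a scan on residues 121 to 140 of the displayed sequence; it is not necessarily the method used to annotate this entry, and `find_sequons` is an illustrative name. A sequon match is necessary but not sufficient for glycosylation, since the asparagine must also be exposed to the glycosylation machinery in the ER lumen.

```python
import re

# N followed by any residue except P, then S or T. The lookahead keeps
# overlapping sequons from hiding each other.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(sequence: str, offset: int = 1) -> list:
    """Return 1-based positions of asparagines in N-X-S/T sequons.

    `offset` is the position of the first residue of `sequence`
    within the full-length protein.
    """
    return [match.start() + offset for match in SEQUON.finditer(sequence)]


# Residues 121-140 of the displayed sequence.
FRAGMENT = "".join("RSRKSVTCYD LSCNSTYCKP".split())

sites = find_sequons(FRAGMENT, offset=121)
```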

Post-translational modification

Specific enzymatic cleavages in vivo yield mature proteins including Glycoprotein N and Glycoprotein C (1 publication).

Sites

Feature key   Position(s)   Description                          Length
Site          645 – 646     Cleavage; by host signal peptidase   2

Keywords - PTM

Glycoprotein

Interaction

Subunit structure

Glycoprotein N and Glycoprotein C interact with each other (by similarity).

Structure

Secondary structure

Feature key   Position(s)    Evidence           Length
Beta strand   663 – 665      Combined sources   3
Helix         668 – 670      Combined sources   3
Beta strand   673 – 680      Combined sources   8
Beta strand   685 – 692      Combined sources   8
Turn          694 – 696      Combined sources   3
Beta strand   700 – 707      Combined sources   8
Beta strand   711 – 737      Combined sources   27
Helix         739 – 741      Combined sources   3
Helix         745 – 748      Combined sources   4
Beta strand   750 – 759      Combined sources   10
Helix         763 – 765      Combined sources   3
Beta strand   776 – 809      Combined sources   34
Beta strand   812 – 819      Combined sources   8
Beta strand   823 – 825      Combined sources   3
Beta strand   827 – 834      Combined sources   8
Beta strand   845 – 851      Combined sources   7
Beta strand   853 – 860      Combined sources   8
Beta strand   875 – 881      Combined sources   7
Beta strand   892 – 895      Combined sources   4
Beta strand   903 – 906      Combined sources   4
Helix         913 – 918      Combined sources   6
Helix         919 – 923      Combined sources   5
Beta strand   924 – 928      Combined sources   5
Beta strand   933 – 935      Combined sources   3
Beta strand   938 – 941      Combined sources   4
Beta strand   949 – 956      Combined sources   8
Beta strand   958 – 960      Combined sources   3
Beta strand   971 – 984      Combined sources   14
Beta strand   989 – 1011     Combined sources   23
Beta strand   1014 – 1034    Combined sources   21
Beta strand   1043 – 1047    Combined sources   5

3D structure databases

PDB entry   Method   Resolution (Å)   Chain   Positions
5LJX        X-ray    1.40             A       649-1105
5LJY        X-ray    3.00             A       649-1105
5LJZ        X-ray    1.60             A       649-1105
5LK0        X-ray    1.80             A       649-1105
5LK1        X-ray    1.70             A       649-1105
5LK2        X-ray    1.60             A       649-1105
5LK3        X-ray    1.50             A       649-1105
ProteinModelPortal: P08668.
SMR: P08668.
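
When several structures cover the same region, a common first filter is resolution, where a smaller value in Å means finer detail. The sketch below picks the best-resolved entry from the PDB table above; the variable and function names are illustrative, and resolution alone does not say which crystal form or ligand state is most relevant for a given question.

```python
# (PDB ID, resolution in Å) pairs copied from the PDB table above.
PDB_ENTRIES = [
    ("5LJX", 1.40),
    ("5LJY", 3.00),
    ("5LJZ", 1.60),
    ("5LK0", 1.80),
    ("5LK1", 1.70),
    ("5LK2", 1.60),
    ("5LK3", 1.50),
]


def best_resolved(entries: list) -> tuple:
    """Return the entry with the smallest resolution value."""
    return min(entries, key=lambda entry: entry[1])


best_id, best_resolution = best_resolved(PDB_ENTRIES)
```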

Family & Domains

Domains and Repeats

Feature key   Position(s)   Description   Evidence                     Length
Domain        611 – 634     ITAM          PROSITE-ProRule annotation   24

Sequence similarities

Contains 1 ITAM domain (PROSITE-ProRule annotation).

Keywords - Domain

Signal, Transmembrane, Transmembrane helix

Family and domain databases

InterPro: IPR016402. Envelope_glycoprot_Hantavirus.
          IPR002534. Hanta_G1.
          IPR002532. Hanta_G2.
          IPR012316. ITAM_motif_hantavir-typ.
Pfam: PF01567. Hanta_G1. 1 hit.
      PF01561. Hanta_G2. 1 hit.
      PF10538. ITAM_Cys-rich. 1 hit.
PIRSF: PIRSF003945. M_poly_HantaV. 1 hit.
ProDom: PD001813. Hanta_G2. 1 hit.
PROSITE: PS51056. ITAM_2. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P08668-1 [UniParc]

        10         20         30         40         50
MGIWKWLVMA SLVWPVLTLR NVYDMKIECP HTVSFGENSV IGYVELPPVP
        60         70         80         90        100
LADTAQMVPE SSCNMDNHQS LNTITKYTQV SWRGKADQSQ SSQNSFETVS
       110        120        130        140        150
TEVDLKGTCV LKHKMVEESY RSRKSVTCYD LSCNSTYCKP TLYMIVPIHA
       160        170        180        190        200
CNMMKSCLIA LGPYRVQVVY ERSYCMTGVL IEGKCFVPDQ SVVSIIKHGI
       210        220        230        240        250
FDIASVHIVC FFVAVKGNTY KIFEQVKKSF ESTCNDTENK VQGYYICIVG
       260        270        280        290        300
GNSAPIYVPT LDDFRSMEAF TGIFRSPHGE DHDLAGEEIA SYSIVGPANA
       310        320        330        340        350
KVPHSASSDT LSLIAYSGIP SYSSLSILTS STEAKHVFSP GLFPKLNHTN
       360        370        380        390        400
CDKSAIPLIW TGMIDLPGYY EAVHPCTVFC VLSGPGASCE AFSEGGIFNI
       410        420        430        440        450
TSPMCLVSKQ NRFRLTEQQV NFVCQRVDMD IVVYCNGQRK VILTKTLVIG
       460        470        480        490        500
QCIYTITSLF SLLPGVAHSI AVELCVPGFH GWATAALLVT FCFGWVLIPA
       510        520        530        540        550
ITFIILTVLK FIANIFHTSN QENRLKSVLR KIKEEFEKTK GSMVCDVCKY
       560        570        580        590        600
ECETYKELKA HGVSCPQSQC PYCFTHCEPT EAAFQAHYKV CQVTHRFRDD
       610        620        630        640        650
LKKTVTPQNF TPGCYRTLNL FRYKSRCYIF TMWIFLLVLE SILWAASASE
       660        670        680        690        700
TPLTPVWNDN AHGVGSVPMH TDLELDFSLT SSSKYTYRRK LTNPLEEAQS
       710        720        730        740        750
IDLHIEIEEQ TIGVDVHALG HWFDGRLNLK TSFHCYGACT KYEYPWHTAK
       760        770        780        790        800
CHYERDYQYE TSWGCNPSDC PGVGTGCTAC GLYLDQLKPV GSAYKIITIR
       810        820        830        840        850
YSRRVCVQFG EENLCKIIDM NDCFVSRHVK VCIIGTVSKF SQGDTLLFFG
       860        870        880        890        900
PLEGGGLIFK HWCTSTCQFG DPGDIMSPRD KGFLCPEFPG SFRKKCNFAT
       910        920        930        940        950
TPICEYDGNM VSGYKKVMAT IDSFQSFNTS TMHFTDERIE WKDPDGMLRD
       960        970        980        990       1000
HINILVTKDI DFDNLGENPC KIGLQTSSIE GAWGSGVGFT LTCLVSLTEC
      1010       1020       1030       1040       1050
PTFLTSIKAC DKAICYGAES VTLTRGQNTV KVSGKGGHSG STFRCCHGED
      1060       1070       1080       1090       1100
CSQIGLHAAA PHLDKVNGIS EIENSKVYDD GAPQCGIKCW FVKSGEWISG
      1110       1120       1130
IFSGNWIVLI VLCVFLLFSL VLLSILCPVR KHKKS
Length: 1,135
Mass (Da): 126,421
Last modified: January 1, 1988 - v1
Checksum: 8E40B8EA68EA62FA
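
The displayed sequence is broken into blocks of ten residues with a position ruler above each row, so it has to be flattened before any analysis. The sketch below shows one way to do this, demonstrated on the first two rows only; `parse_sequence_block` is an illustrative name. Applied to the full block, the result should have the 1,135 residues stated under Length.

```python
def parse_sequence_block(text: str) -> str:
    """Flatten a ruler-annotated sequence display into one string.

    Lines made up only of numbers are treated as position rulers and
    skipped; spaces inside residue lines are removed.
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue  # blank line or position ruler
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the displayed sequence (residues 1-100).
SAMPLE = """\
        10         20         30         40         50
MGIWKWLVMA SLVWPVLTLR NVYDMKIECP HTVSFGENSV IGYVELPPVP
        60         70         80         90        100
LADTAQMVPE SSCNMDNHQS LNTITKYTQV SWRGKADQSQ SSQNSFETVS
"""

sequence = parse_sequence_block(SAMPLE)
```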

Experimental Info

Feature key         Position(s)   Description                             Evidence   Length
Sequence conflict   37            E → G in CAA68456 (PubMed:3114716).     Curated    1
Sequence conflict   64            N → S in CAA68456 (PubMed:3114716).     Curated    1
Sequence conflict   173           S → T in CAA68456 (PubMed:3114716).     Curated    1
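
Each conflict is a single-residue difference between the displayed sequence and the CAA68456 translation. The sketch below shows how such substitutions could be applied to a reference while checking that the expected residue is present at each position. It uses only residues 1 to 70, so the conflict at position 173 is skipped; the function and variable names are illustrative.

```python
# (position, residue in this entry, residue in CAA68456), from the table above.
CONFLICTS = [(37, "E", "G"), (64, "N", "S"), (173, "S", "T")]


def apply_substitution(sequence: str, position: int, ref: str, alt: str) -> str:
    """Replace the residue at a 1-based position after checking the reference."""
    found = sequence[position - 1]
    if found != ref:
        raise ValueError(f"expected {ref} at {position}, found {found}")
    return sequence[:position - 1] + alt + sequence[position:]


# Residues 1-70 of the displayed sequence.
REFERENCE = "".join(
    "MGIWKWLVMA SLVWPVLTLR NVYDMKIECP HTVSFGENSV IGYVELPPVP "
    "LADTAQMVPE SSCNMDNHQS".split()
)

variant = REFERENCE
for position, ref, alt in CONFLICTS:
    if position <= len(variant):  # skip conflicts beyond this fragment
        variant = apply_substitution(variant, position, ref, alt)

differences = [i + 1 for i, (a, b) in enumerate(zip(REFERENCE, variant)) if a != b]
```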

Sequence databases

EMBL / GenBank / DDBJ:
M14627 Genomic RNA. Translation: AAA43836.1.
Y00386 mRNA. Translation: CAA68456.1.
PIR: A26348. GNVUHV.
     A29382. GNVUH7.
RefSeq: NP_941978.1. NC_005219.1.

Genome annotation databases

GeneID: 2943079.
KEGG: vg:2943079.

Entry information

Entry name: GP_HANTV
Accession: Primary (citable) accession number: P08668
Entry history:
Integrated into UniProtKB/Swiss-Prot: January 1, 1988
Last sequence update: January 1, 1988
Last modified: November 30, 2016
This is version 100 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Viral Protein Annotation Program

Miscellaneous

It was not possible to precisely define the C-termini of the mature Glycoprotein N and Glycoprotein C.

Keywords - Technical term

3D-structure, Complete proteome, Reference proteome

Documents

  1. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
  • 100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
  • 90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
  • 50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
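
The UniRef90 and UniRef50 membership rules stated above reduce to two thresholds: a minimum sequence identity to the seed and a minimum overlap with it. The sketch below encodes only those stated thresholds as a predicate. It is not the UniRef clustering procedure itself, and the function name and the use of fractions between 0 and 1 are assumptions made for illustration.

```python
# Minimum identity to the seed sequence, per cluster level, as stated above.
MIN_IDENTITY = {"UniRef90": 0.90, "UniRef50": 0.50}

# Minimum overlap with the seed (longest) sequence for both levels.
MIN_OVERLAP = 0.80


def meets_cluster_criteria(level: str, identity: float, overlap: float) -> bool:
    """True if a sequence satisfies the stated thresholds for `level`.

    `identity` and `overlap` are fractions between 0 and 1, measured
    against the cluster's seed sequence.
    """
    return identity >= MIN_IDENTITY[level] and overlap >= MIN_OVERLAP


examples = {
    "close homolog": meets_cluster_criteria("UniRef90", 0.93, 0.85),
    "short fragment": meets_cluster_criteria("UniRef90", 0.93, 0.70),
    "distant homolog": meets_cluster_criteria("UniRef50", 0.60, 0.80),
}
```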