Protein

Envelopment polyprotein

Gene

GP

Organism
Crimean-Congo hemorrhagic fever virus (strain Nigeria/IbAr10200/1970) (CCHFV)
Status
Reviewed - Annotation score: - Protein inferred from homology

Function

Glycoprotein C and glycoprotein N interact with each other and are present at the surface of the virion. They are able to attach the virion to host cell receptors. This attachment induces virion internalization, predominantly through clathrin-dependent endocytosis. They also promote fusion of the viral membrane with the host endosomal membrane after endocytosis of the virion (By similarity). (1 Publication)

GO - Biological process

Keywords

Biological process: Clathrin-mediated endocytosis of virus by host, Fusion of virus membrane with host endosomal membrane, Fusion of virus membrane with host membrane, Host-virus interaction, Viral attachment to host cell, Viral penetration into host cytoplasm, Virus endocytosis by host, Virus entry into host cell

Names & Taxonomy

Protein names
Recommended name:
Envelopment polyprotein
Alternative name(s):
M polyprotein
Cleaved into the following 5 chains:
Mucin-like variable region (By similarity)
GP38 (1 Publication)
Glycoprotein N (1 Publication)
Short name:
Gn
Alternative name(s):
Glycoprotein G2
Non-Structural protein M (1 Publication)
Short name:
NSm
Glycoprotein C (1 Publication)
Short name:
Gc
Alternative name(s):
Glycoprotein G1
Gene names
Name: GP
Organism: Crimean-Congo hemorrhagic fever virus (strain Nigeria/IbAr10200/1970) (CCHFV)
Taxonomic identifier: 652961 [NCBI]
Taxonomic lineage: Viruses > ssRNA viruses > ssRNA negative-strand viruses > Bunyavirales > Nairoviridae > Orthonairovirus
Virus host: Bos taurus (Bovine) [TaxID: 9913]
Capra hircus (Goat) [TaxID: 9925]
Homo sapiens (Human) [TaxID: 9606]
Hyalomma [TaxID: 34625]
Ovis aries (Sheep) [TaxID: 9940]
Rhipicephalus microplus (Cattle tick) (Boophilus microplus) [TaxID: 6941]
Proteomes

Subcellular location

Glycoprotein C:

Topology

Feature key     Position(s)    Description   Evidence            Length
Transmembrane   699 – 719      Helical       Sequence analysis   21
Transmembrane   822 – 842      Helical       Sequence analysis   21
Transmembrane   860 – 880      Helical       Sequence analysis   21
Transmembrane   973 – 993      Helical       Sequence analysis   21
Transmembrane   1595 – 1615    Helical       Sequence analysis   21
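
To relate the predicted helices to the mature proteins, each transmembrane segment can be assigned to the chain whose range contains it. The following is a minimal sketch, assuming the chain ranges listed under "Molecule processing" in this entry; the names `CHAINS`, `TM_HELICES` and `chain_containing` are illustrative and not part of any UniProt tool.

```python
from typing import Optional

# Cleaved chains (1-based, inclusive positions) as listed under
# "Molecule processing" in this entry.
CHAINS = {
    "Mucin-like variable region": (19, 247),
    "GP38": (248, 519),
    "Glycoprotein N": (520, 842),
    "Non-Structural protein M": (843, 1040),
    "Glycoprotein C": (1041, 1684),
}

# Predicted transmembrane helices from the Topology table.
TM_HELICES = [(699, 719), (822, 842), (860, 880), (973, 993), (1595, 1615)]


def chain_containing(start: int, end: int) -> Optional[str]:
    """Return the name of the chain that fully contains [start, end], or None."""
    for name, (chain_start, chain_end) in CHAINS.items():
        if chain_start <= start and end <= chain_end:
            return name
    return None


# Map every helix to its enclosing chain and print one line per helix.
tm_by_chain = {tm: chain_containing(*tm) for tm in TM_HELICES}
for (start, end), name in tm_by_chain.items():
    print(f"{start}-{end}: {name}")
```

Under these ranges, two helices fall in Glycoprotein N, two in Non-Structural protein M and one in Glycoprotein C.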

GO - Cellular component

Keywords - Cellular component

Host endoplasmic reticulum, Host Golgi apparatus, Host membrane, Membrane, Viral envelope protein, Virion

PTM / Processing

Molecule processing

Feature key      Feature ID       Position(s)    Description                  Evidence            Length
Signal peptide   -                1 – 18         -                            Sequence analysis   18
Chain            PRO_0000406564   19 – 1684      Envelopment polyprotein      By similarity       1666
Chain            PRO_0000406565   19 – 247       Mucin-like variable region   By similarity       229
Chain            PRO_0000434910   248 – 519      GP38                         1 Publication       272
Chain            PRO_0000406566   520 – 842      Glycoprotein N               1 Publication       323
Chain            PRO_0000434911   843 – 1040     Non-Structural protein M     1 Publication       198
Chain            PRO_0000406567   1041 – 1684    Glycoprotein C               1 Publication       644
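
The Length column follows from the inclusive positions (end - start + 1), and the five cleaved chains tile the polyprotein without gaps. A short sketch that checks both properties against the table values; the `Chain` tuple and the variable names are illustrative only.

```python
from typing import NamedTuple


class Chain(NamedTuple):
    feature_id: str
    start: int          # 1-based, inclusive
    end: int            # 1-based, inclusive
    name: str
    listed_length: int  # value of the Length column in the table


CHAIN_ROWS = [
    Chain("PRO_0000406564", 19, 1684, "Envelopment polyprotein", 1666),
    Chain("PRO_0000406565", 19, 247, "Mucin-like variable region", 229),
    Chain("PRO_0000434910", 248, 519, "GP38", 272),
    Chain("PRO_0000406566", 520, 842, "Glycoprotein N", 323),
    Chain("PRO_0000434911", 843, 1040, "Non-Structural protein M", 198),
    Chain("PRO_0000406567", 1041, 1684, "Glycoprotein C", 644),
]

# Rows whose listed length disagrees with the inclusive span.
mismatches = [c for c in CHAIN_ROWS if c.end - c.start + 1 != c.listed_length]

# The cleaved chains (all rows except the full polyprotein), sorted by start,
# should each begin one residue after the previous chain ends.
pieces = sorted(CHAIN_ROWS[1:], key=lambda c: c.start)
contiguous = all(a.end + 1 == b.start for a, b in zip(pieces, pieces[1:]))

print("length mismatches:", mismatches)
print("chains contiguous:", contiguous)
```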

Amino acid modifications

Feature key     Position   Description                                Evidence            Length
Glycosylation   25         N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   30         N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   196        N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   200        N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   243        N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   376        N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   426        N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   557        N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   755        N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   1054       N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   1345       N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
Glycosylation   1563       N-linked (GlcNAc...) asparagine; by host   Sequence analysis   1
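
The entry attributes these sites to "Sequence analysis" but does not state the prediction method. As an assumption, candidate N-linked sites are commonly located by scanning for the N-X-S/T sequon, where X is any residue except proline. The sketch below applies that heuristic to the first 50 residues of the displayed sequence; `find_sequons` is a hypothetical helper, and a motif match on its own does not establish that a site is glycosylated or annotated.

```python
import re

# First 50 residues of the displayed sequence (positions 1-50).
N_TERMINUS = "MHISLMYAILCLQLCGLGETHGSHNETRHNKTDTMTTPGDNPSSEPPVST"

# N, then any residue except P, then S or T. The lookahead keeps matches
# zero-width after the N so that overlapping sequons are all reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(sequence: str, offset: int = 1) -> list:
    """Return 1-based positions of the asparagine in each N-X-S/T sequon."""
    return [match.start() + offset for match in SEQUON.finditer(sequence)]


sequon_positions = find_sequons(N_TERMINUS)
print(sequon_positions)  # [25, 30]
```

The two hits agree with the first two rows of the table; the asparagine at position 41 is skipped because it is followed by proline.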

Post-translational modification

Specific enzymatic cleavages in vivo yield mature proteins, including Glycoprotein C and Glycoprotein N. (1 Publication)

Sites

Feature key   Position(s)    Description                             Evidence        Length
Site          247 – 248      Cleavage; by host furin-like protease   1 Publication   2
Site          516 – 517      Cleavage; by host                       By similarity   2
Site          519 – 520      Cleavage; by host HHAT protease         1 Publication   2
Site          842 – 843      Cleavage; by host protease              1 Publication   2
Site          1037 – 1038    Cleavage; by host signal peptidase      By similarity   2
Site          1040 – 1041    Cleavage; by host protease              1 Publication   2
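
Each site spans the two residues flanking the cut, so a site at positions p – (p+1) matches a chain that starts at p+1. The sketch below, with illustrative variable names, checks which annotated sites coincide with the start of a cleaved chain and which lie elsewhere.

```python
# Cleavage sites from the Sites table, as (residue before cut, residue after cut).
CLEAVAGE_SITES = [
    (247, 248), (516, 517), (519, 520),
    (842, 843), (1037, 1038), (1040, 1041),
]

# First residue of each chain released by cleavage, from "Molecule processing".
CHAIN_STARTS = {
    "GP38": 248,
    "Glycoprotein N": 520,
    "Non-Structural protein M": 843,
    "Glycoprotein C": 1041,
}

site_set = set(CLEAVAGE_SITES)

# For every chain, is there a site cutting immediately before its first residue?
boundary_has_site = {
    name: (start - 1, start) in site_set for name, start in CHAIN_STARTS.items()
}

# Sites that do not coincide with the start of any listed chain.
chain_starts = set(CHAIN_STARTS.values())
non_boundary_sites = [site for site in CLEAVAGE_SITES if site[1] not in chain_starts]

print(boundary_has_site)
print(non_boundary_sites)
```

With the values in this entry, every chain start has a matching site, and the sites at 516 – 517 and 1037 – 1038 each sit three residues upstream of a chain boundary.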

Keywords - PTM

Glycoprotein

Proteomic databases

PRIDE: Q8JSZ3

Interaction

Subunit structure

Glycoprotein C and Glycoprotein N interact with each other. (By similarity)

Structure

3D structure databases

SMR: Q8JSZ3

Family & Domains

Compositional bias

Feature key          Position(s)    Description   Length
Compositional bias   27 – 242       Thr-rich      216
Compositional bias   1165 – 1208    Cys-rich      44

Sequence similarities

Keywords - Domain

Signal, Transmembrane, Transmembrane helix

Phylogenomic databases

OrthoDB: VOG090002DI

Family and domain databases

InterPro: IPR012487 Nairovirus_M
Pfam: PF07948 Nairovirus_M, 1 hit
PIRSF: PIRSF003962 M_poly_NairoV, 1 hit

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q8JSZ3-1 [UniParc]

        10         20         30         40         50
MHISLMYAIL CLQLCGLGET HGSHNETRHN KTDTMTTPGD NPSSEPPVST
60 70 80 90 100
ALSITLDPST VTPTTPASGL EGSGEVYTSP PITTGSLPLS ETTPELPVTT
110 120 130 140 150
GTDTLSAGDV DPSTQTAGGT SAPTVRTSLP NSPSTPSTPQ DTHHPVRNLL
160 170 180 190 200
SVTSPGPDET STPSGTGKES SATSSPHPVS NRPPTPPATA QGPTENDSHN
210 220 230 240 250
ATEHPESLTQ SATPGLMTSP TQIVHPQSAT PITVQDTHPS PTNRSKRNLK
260 270 280 290 300
MEIILTLSQG LKKYYGKILR LLQLTLEEDT EGLLEWCKRN LGLDCDDTFF
310 320 330 340 350
QKRIEEFFIT GEGHFNEVLQ FRTPGTLSTT ESTPAGLPTA EPFKSYFAKG
360 370 380 390 400
FLSIDSGYYS AKCYSGTSNS GLQLINITRH STRIVDTPGP KITNLKTINC
410 420 430 440 450
INLKASIFKE HREVEINVLL PQVAVNLSNC HVVIKSHVCD YSLDIDGAVR
460 470 480 490 500
LPHIYHEGVF IPGTYKIVID KKNKLNDRCT LFTDCVIKGR EVRKGQSVLR
510 520 530 540 550
QYKTEIRIGK ASTGSRRLLS EEPSDDCISR TQLLRTETAE IHGDNYGGPG
560 570 580 590 600
DKITICNGST IVDQRLGSEL GCYTINRVRS FKLCENSATG KNCEIDSVPV
610 620 630 640 650
KCRQGYCLRI TQEGRGHVKL SRGSEVVLDA CDTSCEIMIP KGTGDILVDC
660 670 680 690 700
SGGQQHFLKD NLIDLGCPKI PLLGKMAIYI CRMSNHPKTT MAFLFWFSFG
710 720 730 740 750
YVITCILCKA IFYLLIIVGT LGKRLKQYRE LKPQTCTICE TTPVNAIDAE
760 770 780 790 800
MHDLNCSYNI CPYCASRLTS DGLARHVIQC PKRKEKVEET ELYLNLERIP
810 820 830 840 850
WVVRKLLQVS ESTGVALKRS SWLIVLLVLF TVSLSPVQSA PIGQGKTIEA
860 870 880 890 900
YRAREGYTSI CLFVLGSILF IVSCLMKGLV DSVGNSFFPG LSICKTCSIS
910 920 930 940 950
SINGFEIESH KCYCSLFCCP YCRHCSTDKE IHKLHLSICK KRKKGSNVML
960 970 980 990 1000
AVCKLMCFRA TMEVSNRALF IRSIINTTFV LCILILAVCV VSTSAVEMEN
1010 1020 1030 1040 1050
LPAGTWEREE DLTNFCHQEC QVTETECLCP YEALVLRKPL FLDSTAKGMK
1060 1070 1080 1090 1100
NLLNSTSLET SLSIEAPWGA INVQSTYKPT VSTANIALSW SSVEHRGNKI
1110 1120 1130 1140 1150
LVSGRSESIM KLEERTGISW DLGVEDASES KLLTVSVMDL SQMYSPVFEY
1160 1170 1180 1190 1200
LSGDRQVGEW PKATCTGDCP ERCGCTSSTC LHKEWPHSRN WRCNPTWCWG
1210 1220 1230 1240 1250
VGTGCTCCGL DVKDLFTDYM FVKWKVEYIK TEAIVCVELT SQERQCSLIE
1260 1270 1280 1290 1300
AGTRFNLGPV TITLSEPRNI QQKLPPEIIT LHPRIEEGFF DLMHVQKVLS
1310 1320 1330 1340 1350
ASTVCKLQSC THGVPGDLQV YHIGNLLKGD KVNGHLIHKI EPHFNTSWMS
1360 1370 1380 1390 1400
WDGCDLDYYC NMGDWPSCTY TGVTQHNHAS FVNLLNIETD YTKNFHFHSK
1410 1420 1430 1440 1450
RVTAHGDTPQ LDLKARPTYG AGEITVLVEV ADMELHTKKI EISGLKFASL
1460 1470 1480 1490 1500
ACTGCYACSS GISCKVRIHV DEPDELTVHV KSDDPDVVAA SSSLMARKLE
1510 1520 1530 1540 1550
FGTDSTFKAF SAMPKTSLCF YIVEREHCKS CSEEDTKKCV NTKLEQPQSI
1560 1570 1580 1590 1600
LIEHKGTIIG KQNSTCTAKA SCWLESVKSF FYGLKNMLSG IFGNVFMGIF
1610 1620 1630 1640 1650
LFLAPFILLI LFFMFGWRIL FCFKCCRRTR GLFKYRHLKD DEETGYRRII
1660 1670 1680
EKLNNKKGKN KLLDGERLAD RRIAELFSTK THIG
Length: 1,684
Mass (Da): 186,589
Last modified: October 1, 2002 - v1
Checksum: 7CEAB4B46F74F578
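
The displayed sequence interleaves position rulers with blocks of ten residues, so it has to be cleaned before its length can be compared with the value above. A minimal sketch follows, using only the first 100 residues as sample input; `parse_sequence_block` is a hypothetical helper, not a UniProt utility.

```python
def parse_sequence_block(block: str) -> str:
    """Join residue groups from a ruler-annotated sequence display.

    Lines made up only of numbers are treated as position rulers and skipped;
    the whitespace between ten-residue groups is removed.
    """
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue  # blank line or ruler line
        residues.extend(tokens)
    return "".join(residues)


# The first two sequence rows of this entry, with their rulers.
SAMPLE_BLOCK = """\
        10         20         30         40         50
MHISLMYAIL CLQLCGLGET HGSHNETRHN KTDTMTTPGD NPSSEPPVST
60 70 80 90 100
ALSITLDPST VTPTTPASGL EGSGEVYTSP PITTGSLPLS ETTPELPVTT
"""

sample = parse_sequence_block(SAMPLE_BLOCK)
print(len(sample))  # 100
```

Applied to the full display, the same function should return a string whose length matches the stated 1,684 residues.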

Sequence databases

EMBL/GenBank/DDBJ: AF467768 (Genomic RNA). Translation: AAM48106.1

Entry information

Entry name: GP_CCHFI
Accession: Primary (citable) accession number: Q8JSZ3
Entry history: Integrated into UniProtKB/Swiss-Prot: April 5, 2011
Last sequence update: October 1, 2002
Last modified: May 23, 2018
This is version 49 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Viral Protein Annotation Program
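
Several of these fields (accession, entry name, taxonomic identifier, gene name, protein existence level, sequence version) also appear in the header line of a UniProtKB FASTA record. The sketch below parses such a header; the example header is assembled by hand from the fields in this entry for illustration and is not copied from a download, and `parse_header` is a hypothetical helper.

```python
import re

# UniProtKB FASTA header layout:
# >db|Accession|EntryName ProteinName OS=Organism OX=TaxID [GN=Gene ]PE=n SV=n
HEADER_RE = re.compile(
    r"^>(?P<db>sp|tr)\|(?P<accession>[^|]+)\|(?P<entry_name>\S+)\s+"
    r"(?P<protein_name>.+?)\s+OS=(?P<organism>.+?)\s+OX=(?P<taxid>\d+)"
    r"(?:\s+GN=(?P<gene>\S+))?\s+PE=(?P<pe>\d)\s+SV=(?P<sv>\d+)$"
)

# Illustrative header built from this entry's fields (an assumption, see above).
EXAMPLE_HEADER = (
    ">sp|Q8JSZ3|GP_CCHFI Envelopment polyprotein "
    "OS=Crimean-Congo hemorrhagic fever virus (strain Nigeria/IbAr10200/1970) "
    "OX=652961 GN=GP PE=3 SV=1"
)


def parse_header(header: str) -> dict:
    """Split a UniProtKB FASTA header into named fields."""
    match = HEADER_RE.match(header.strip())
    if match is None:
        raise ValueError("not a UniProtKB FASTA header: " + header)
    return match.groupdict()


fields = parse_header(EXAMPLE_HEADER)
print(fields["accession"], fields["entry_name"], fields["pe"], fields["sv"])
```

Here `sp` marks a reviewed (Swiss-Prot) entry, PE=3 corresponds to "inferred from homology", and SV=1 to version 1 of the sequence.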

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families


UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health