Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Gag polyprotein

Gene

gag

Organism
Walleye dermal sarcoma virus (WDSV)
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Experimental evidence at protein leveli

Functioni

Matrix protein p10: targets Gag and gag-pol polyproteins to the plasma membrane via a multipartite membrane binding signal, that includes its myristoylated N-terminus. Also mediates nuclear localization of the preintegration complex (By similarity).By similarity
Capsid protein p25 forms the spherical core of the virion that encapsulates the genomic RNA-nucleocapsid complex.By similarity
Nucleocapsid protein p14: involved in the packaging and encapsidation of two copies of the genome. Binds with high affinity to conserved UCUG elements within the packaging signal, located near the 5'-end of the genome. This binding is dependent on genome dimerization (By similarity).By similarity
Gag polyprotein: plays a role in budding and is processed by the viral protease during virion maturation outside the cell.By similarity

Regions

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Zinc fingeri501 – 51818CCHC-typePROSITE-ProRule annotationAdd
BLAST

GO - Molecular functioni

Complete GO annotation...

Keywords - Ligandi

Metal-binding, Viral nucleoprotein, Zinc

Names & Taxonomyi

Protein namesi
Recommended name:
Gag polyprotein
Cleaved into the following 4 chains:
Matrix protein p10
Short name:
MA
Alternative name(s):
pp12
Capsid protein p25
Short name:
CA
Nucleocapsid protein p14
Short name:
NC-gag
Gene namesi
Name:gag
OrganismiWalleye dermal sarcoma virus (WDSV)
Taxonomic identifieri39720 [NCBI]
Taxonomic lineageiVirusesRetro-transcribing virusesRetroviridaeOrthoretrovirinaeEpsilonretrovirus
Virus hostiSander vitreus (Walleye) (Perca vitrea) [TaxID: 283036]
Proteomesi
  • UP000008337 Componenti: Genome
  • UP000007081 Componenti: Genome

Subcellular locationi

Gag polyprotein :

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Capsid protein, Host cell membrane, Host membrane, Membrane, Virion

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Initiator methionineiRemovedSequence analysis
Chaini2 – 582581Gag polyproteinPRO_0000410603Add
BLAST
Chaini2 – 9594Matrix protein p10PRO_0000410604Add
BLAST
Chaini96 – 251156RNA-binding phosphoprotein p20PRO_0000410605Add
BLAST
Chaini252 – 457206Capsid protein p25PRO_0000410606Add
BLAST
Chaini458 – 582125Nucleocapsid protein p14PRO_0000410607Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Lipidationi2 – 21N-myristoyl glycine; by hostBy similarity

Post-translational modificationi

Specific enzymatic cleavages by the viral protease yield mature proteins. The protease is released by autocatalytic cleavage. The polyprotein is cleaved during and after budding, this process is termed maturation (By similarity).By similarity

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sitei95 – 962Cleavage; by viral proteaseBy similarity
Sitei251 – 2522Cleavage; by viral proteaseBy similarity
Sitei457 – 4582Cleavage; by viral proteaseBy similarity

Keywords - PTMi

Lipoprotein, Myristate

Miscellaneous databases

PMAP-CutDBQ88937.

Family & Domainsi

Coiled coil

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Coiled coili154 – 18532Sequence analysisAdd
BLAST

Compositional bias

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Compositional biasi147 – 17024Gln-richAdd
BLAST
Compositional biasi181 – 1844Poly-Lys
Compositional biasi484 – 4874Poly-Gln

Sequence similaritiesi

Contains 1 CCHC-type zinc finger.PROSITE-ProRule annotation

Zinc finger

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Zinc fingeri501 – 51818CCHC-typePROSITE-ProRule annotationAdd
BLAST

Keywords - Domaini

Coiled coil, Zinc-finger

Family and domain databases

Gene3Di4.10.60.10. 1 hit.
InterProiIPR001878. Znf_CCHC.
[Graphical view]
PfamiPF00098. zf-CCHC. 1 hit.
[Graphical view]
SMARTiSM00343. ZnF_C2HC. 1 hit.
[Graphical view]
SUPFAMiSSF57756. SSF57756. 1 hit.
PROSITEiPS50158. ZF_CCHC. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

Q88937-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MGNSSSTPPP SALKNSDLFK TMLRTQYSGS VKTRRINQDI KKQYPLWPDQ
60 70 80 90 100
GTCATKHWEQ AVLIPLDSVS EETAKVLNFL RVKIQARKGE TARQMTAHTI
110 120 130 140 150
KKLIVGTIDK NKQQTEILQK TDESDEEMDT TNTMLFIARN KRERIAQQQQ
160 170 180 190 200
ADLAAQQQVL LLQREQQREQ REKDIKKRDE KKKKLLPDTT QKVEQTDIGE
210 220 230 240 250
ASSSDASAQK PISTDNNPDL KVDGVLTRSQ HTTVPSNITI KKDGTSVQYQ
260 270 280 290 300
HPIRNYPTGE GNLTAQVRNP FRPLELQQLR KDCPALPEGI PQLAEWLTQT
310 320 330 340 350
MAIYNCDEAD VEQLARVIFP TPVRQIAGVI NGHAAANTAA KIQNYVTACR
360 370 380 390 400
QHYPAVCDWG TIQAFTYKPP QTAHEYVKHA EIIFKNNSGL EWQHATVPFI
410 420 430 440 450
NMVVQGLPPK VTRSLMSGNP DWSTKTIPQI IPLMQHYLNL QSRQDAKIKQ
460 470 480 490 500
TPLVLQLAMP AQTMNGNKGY VGSYPTNEPY YSFQQQQRPA PRAPPGNVPS
510 520 530 540 550
NTCFFCKQPG HWKADCPNKT RNLRNMGNMG RGGRMGGPPY RSQPYPAFIQ
560 570 580
PPQNHQNQYN GRMDRSQLQA SAQEWLPGTY PA
Length:582
Mass (Da):65,697
Last modified:November 1, 1996 - v1
Checksum:iCA5EF28EE38A3434
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L41838 Genomic RNA. Translation: AAA99526.1.
AF033822 Genomic RNA. Translation: AAC82607.1.
RefSeqiNP_045938.1. NC_001867.1.

Genome annotation databases

GeneIDi1403496.
KEGGivg:1403496.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L41838 Genomic RNA. Translation: AAA99526.1.
AF033822 Genomic RNA. Translation: AAC82607.1.
RefSeqiNP_045938.1. NC_001867.1.

3D structure databases

ModBaseiSearch...
MobiDBiSearch...

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

GeneIDi1403496.
KEGGivg:1403496.

Miscellaneous databases

PMAP-CutDBQ88937.

Family and domain databases

Gene3Di4.10.60.10. 1 hit.
InterProiIPR001878. Znf_CCHC.
[Graphical view]
PfamiPF00098. zf-CCHC. 1 hit.
[Graphical view]
SMARTiSM00343. ZnF_C2HC. 1 hit.
[Graphical view]
SUPFAMiSSF57756. SSF57756. 1 hit.
PROSITEiPS50158. ZF_CCHC. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiGAG_WDSV
AccessioniPrimary (citable) accession number: Q88937
Entry historyi
Integrated into UniProtKB/Swiss-Prot: June 28, 2011
Last sequence update: November 1, 1996
Last modified: April 1, 2015
This is version 62 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.