P17282 (GAG_SIVCZ) Reviewed, UniProtKB/Swiss-Prot

Last modified April 3, 2013. Version 88.

Names and origin

Protein names
Recommended name: Gag polyprotein
Alternative name(s): Pr55Gag

Cleaved into the following 6 chains:

  1. Matrix protein p17
    Short name=MA
  2. Capsid protein p24
    Short name=CA
  3. Spacer peptide p2
  4. Nucleocapsid protein p7
    Short name=NC
  5. Spacer peptide p1
  6. p6-gag
Gene names
Name: gag
Organism: Simian immunodeficiency virus (isolate CPZ GAB1) (SIV-cpz) (Chimpanzee immunodeficiency virus) [Complete proteome]
Taxonomic identifier: 402771 [NCBI]
Taxonomic lineage: Viruses > Retro-transcribing viruses > Retroviridae > Orthoretrovirinae > Lentivirus > Primate lentivirus group
Virus host: Pan (chimpanzees) [TaxID: 9596]

Protein attributes

Sequence length: 508 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Inferred from homology

General annotation (Comments)

Function

Matrix protein p17 targets Gag and Gag-Pol polyproteins to the plasma membrane via a multipartite membrane binding signal that includes its myristoylated N-terminus. It also mediates nuclear localization of the preintegration complex and is implicated in the Vpu-mediated release from the host cell (By similarity).

Capsid protein p24 forms the conical core of the virus that encapsulates the genomic RNA-nucleocapsid complex (By similarity).

Nucleocapsid protein p7 encapsulates and protects viral dimeric unspliced (genomic) RNA. It binds these RNAs through its zinc fingers (By similarity).

p6-gag plays a role in budding of the assembled particle by interacting with the host class E VPS proteins TSG101 and PDCD6IP/AIP1 (By similarity).

Subunit structure

Matrix protein p17 is a trimer and interacts with gp120. p6-gag interacts with host TSG101 (By similarity).

Subcellular location

Matrix protein p17: Virion (Potential). Host nucleus (By similarity). Host cytoplasm (By similarity). Host cell membrane; Lipid-anchor (Potential). Note: Following virus entry, the nuclear localization signal (NLS) of the matrix protein participates with Vpr in the nuclear localization of the viral genome. During virus production, the nuclear export activity of the matrix protein counteracts the NLS to maintain the Gag and Gag-Pol polyproteins in the cytoplasm, thereby directing unspliced RNA to the plasma membrane (By similarity).

Capsid protein p24: Virion (Potential).

Nucleocapsid protein p7: Virion (Potential).

Domain

Late-budding domains (L domains) are short sequence motifs essential for viral particle budding. They recruit proteins of the host ESCRT machinery (Endosomal Sorting Complex Required for Transport) or ESCRT-associated proteins. p6-gag contains two L domains: a PTAP/PSAP motif, which interacts with the UEV domain of TSG101, and a LYPX(n)L motif, which interacts with PDCD6IP/AIP1 (By similarity).

Post-translational modification

Capsid protein p24 is phosphorylated (By similarity).

Specific enzymatic cleavages by the viral protease yield mature proteins. The polyprotein is cleaved during and after budding; this process is termed maturation (By similarity).

Sequence similarities

Belongs to the primate lentivirus group gag polyprotein family.

Contains 2 CCHC-type zinc fingers.
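
The two CCHC-type zinc fingers can be located with a simple pattern scan. The sketch below is illustrative only: it assumes the commonly used retroviral spacing C-X2-C-X4-H-X4-C, which this entry does not state, and it works on the nucleocapsid protein p7 region (residues 388 to 442) copied from the sequence shown in the Sequences section. The names `NC_P7`, `NC_START`, and `find_cchc` are made up for this example.

```python
import re

# Nucleocapsid protein p7, residues 388-442 of the displayed sequence.
NC_START = 388
NC_P7 = "FQKGQGAGPKRKIKCFNCGKEGHLARNCKAPRRKGCWRCGQEGHQMKDCTGRQVN"

# Assumed consensus spacing for a CCHC zinc knuckle: C-X2-C-X4-H-X4-C.
CCHC = re.compile(r"C.{2}C.{4}H.{4}C")


def find_cchc(seq, offset):
    """Return (start, end, match) tuples in 1-based inclusive coordinates.

    `offset` is the 1-based position of seq[0] in the full polyprotein.
    """
    return [
        (m.start() + offset, m.end() + offset - 1, m.group())
        for m in CCHC.finditer(seq)
    ]


if __name__ == "__main__":
    for start, end, motif in find_cchc(NC_P7, NC_START):
        print(f"{start}-{end}: {motif}")
```

Each match covers only the Cys-to-Cys core, so it falls inside the slightly wider zinc finger ranges annotated in the feature table (400 – 417 and 421 – 438).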

Alternative products

This entry describes 2 isoforms produced by ribosomal frameshifting.

Note: Translation produces the Gag polyprotein most of the time. Ribosomal frameshifting at the gag-pol gene boundary occurs at low frequency and produces the Gag-Pol polyprotein. This translation strategy probably allows the virus to modulate the quantity of each viral protein. Maintenance of a correct Gag to Gag-Pol ratio is essential for RNA dimerization and viral infectivity.
Isoform Gag polyprotein (identifier: P17282-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: Produced by conventional translation.
Isoform Gag-Pol polyprotein (identifier: P17283-1)

The sequence of this isoform can be found in the external entry P17283.
Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.
Note: Produced by -1 ribosomal frameshifting.

Sequence annotation (Features)

Feature key   Position(s)  Length  Description                                 Feature identifier

Molecule processing

Chain         2 – 508      507     Gag polyprotein (By similarity)             PRO_0000316150
Chain         2 – 140      139     Matrix protein p17 (By similarity)          PRO_0000038644
Chain         141 – 371    231     Capsid protein p24 (By similarity)          PRO_0000038645
Peptide       372 – 387    16      Spacer peptide p2 (By similarity)           PRO_0000316151
Chain         388 – 442    55      Nucleocapsid protein p7 (By similarity)     PRO_0000038646
Peptide       443 – 458    16      Spacer peptide p1 (By similarity)           PRO_0000316152
Chain         459 – 508    50      p6-gag (By similarity)                      PRO_0000316153

Regions

Zinc finger   400 – 417    18      CCHC-type 1
Zinc finger   421 – 438    18      CCHC-type 2
Motif         16 – 22      7       Nuclear export signal (By similarity)
Motif         26 – 32      7       Nuclear localization signal (By similarity)
Motif         465 – 468    4       PTAP/PSAP motif
Motif         491 – 500    10      LYPX(n)L motif

Sites

Site          140 – 141    2       Cleavage; by viral protease (By similarity)
Site          371 – 372    2       Cleavage; by viral protease (By similarity)
Site          387 – 388    2       Cleavage; by viral protease (By similarity)
Site          442 – 443    2       Cleavage; by viral protease (By similarity)
Site          458 – 459    2       Cleavage; by viral protease (By similarity)

Amino acid modifications

Lipidation    2            1       N-myristoyl glycine; by host (By similarity)
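
The coordinates in the feature table can be cross-checked mechanically. The following sketch is a hypothetical consistency check, not part of the UniProt entry. It transcribes the six mature products and five cleavage sites from the table above, then checks that each listed length equals end - start + 1, that the products tile residues 2 to 508 without gaps or overlaps, and that every boundary between products coincides with an annotated cleavage site. The names `Feature`, `PRODUCTS`, and the helper functions are assumptions made for illustration.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    name: str
    start: int   # 1-based, inclusive
    end: int     # 1-based, inclusive
    length: int  # length as listed in the feature table


# Mature cleavage products, transcribed from the feature table.
PRODUCTS = [
    Feature("Matrix protein p17", 2, 140, 139),
    Feature("Capsid protein p24", 141, 371, 231),
    Feature("Spacer peptide p2", 372, 387, 16),
    Feature("Nucleocapsid protein p7", 388, 442, 55),
    Feature("Spacer peptide p1", 443, 458, 16),
    Feature("p6-gag", 459, 508, 50),
]

# Annotated protease cleavage sites (residue before, residue after).
CLEAVAGE_SITES = [(140, 141), (371, 372), (387, 388), (442, 443), (458, 459)]


def lengths_consistent(features):
    """True if every listed length equals end - start + 1."""
    return all(f.end - f.start + 1 == f.length for f in features)


def boundaries(features):
    """Return (last residue, first residue) pairs between consecutive features."""
    return [(a.end, b.start) for a, b in zip(features, features[1:])]


def tiles(features, start, end):
    """True if the features cover start..end with no gaps or overlaps."""
    contiguous = all(b == a + 1 for a, b in boundaries(features))
    return contiguous and features[0].start == start and features[-1].end == end


if __name__ == "__main__":
    print("lengths consistent:", lengths_consistent(PRODUCTS))
    print("tiles 2-508:", tiles(PRODUCTS, 2, 508))
    print("boundaries match cleavage sites:", boundaries(PRODUCTS) == CLEAVAGE_SITES)
```

The tiling starts at residue 2 because the annotated Gag polyprotein chain spans 2 – 508, which leaves out the initiator methionine at position 1.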

Sequences

Isoform Gag polyprotein

Last modified August 1, 1990. Version 1.
Checksum: 6FC992B9EB4CBB5D

Length: 508 AA. Mass: 55,963 Da.
        10         20         30         40         50         60 
MGARASVLTG GKLDRWEKVR LRPGGRKRYM MKHLVWASRE LERFACDPGL MESKEGCTKL 

        70         80         90        100        110        120 
LQQLEPALKT GSEGLRSLFN TLAVLWCIHS DITVEDTQKA LEQLKRHHGE QQSKTESNSG 

       130        140        150        160        170        180 
SREGGASQGA SASAGISGNY PLVQNAQGQM VHQAISPRTL NAWVKVVEEK AFSPEVIPMF 

       190        200        210        220        230        240 
SALSEGALPQ DVNTMLNAVG GHQGAMQVLK EVINEEAAEW DRLHPTHAGP IAPGQLREPR 

       250        260        270        280        290        300 
GSDIAGTTST LQEQIGWTTA NPPIPVGDVY RRWVILGLNK VVRMYCPVSI LDIRQGPKEP 

       310        320        330        340        350        360 
FRDYVDRFYK TLRAEQASQE VKNWMTDTLL VQNANPDCKQ ILKALGPGAT LEEMMTACQG 

       370        380        390        400        410        420 
VGGPSHKARV LAEAMSMVQN QGRADVFFQK GQGAGPKRKI KCFNCGKEGH LARNCKAPRR 

       430        440        450        460        470        480 
KGCWRCGQEG HQMKDCTGRQ VNFLGKGWPS RSGRPGNFVQ NRTEPTAPPI ESYGYQEEEK 

       490        500 
SQEKKEGESS LYPPTSLKSL FGSDPSSQ 
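
The display above interleaves position rulers with blocks of ten residues, so it has to be flattened before any coordinate from the feature table can be applied. A minimal sketch follows, assuming the block is available as plain text. `RAW`, `parse_sequence_block`, and `region` are names invented for this example. Feature coordinates are 1-based and inclusive, so they are converted to a Python slice.

```python
import re

# The sequence display copied verbatim, rulers included.
RAW = """
        10         20         30         40         50         60
MGARASVLTG GKLDRWEKVR LRPGGRKRYM MKHLVWASRE LERFACDPGL MESKEGCTKL

        70         80         90        100        110        120
LQQLEPALKT GSEGLRSLFN TLAVLWCIHS DITVEDTQKA LEQLKRHHGE QQSKTESNSG

       130        140        150        160        170        180
SREGGASQGA SASAGISGNY PLVQNAQGQM VHQAISPRTL NAWVKVVEEK AFSPEVIPMF

       190        200        210        220        230        240
SALSEGALPQ DVNTMLNAVG GHQGAMQVLK EVINEEAAEW DRLHPTHAGP IAPGQLREPR

       250        260        270        280        290        300
GSDIAGTTST LQEQIGWTTA NPPIPVGDVY RRWVILGLNK VVRMYCPVSI LDIRQGPKEP

       310        320        330        340        350        360
FRDYVDRFYK TLRAEQASQE VKNWMTDTLL VQNANPDCKQ ILKALGPGAT LEEMMTACQG

       370        380        390        400        410        420
VGGPSHKARV LAEAMSMVQN QGRADVFFQK GQGAGPKRKI KCFNCGKEGH LARNCKAPRR

       430        440        450        460        470        480
KGCWRCGQEG HQMKDCTGRQ VNFLGKGWPS RSGRPGNFVQ NRTEPTAPPI ESYGYQEEEK

       490        500
SQEKKEGESS LYPPTSLKSL FGSDPSSQ
"""


def parse_sequence_block(text):
    """Keep only runs of uppercase letters; rulers and spaces are dropped."""
    return "".join(re.findall(r"[A-Z]+", text))


def region(seq, start, end):
    """Return residues start..end using 1-based inclusive coordinates."""
    return seq[start - 1:end]


SEQ = parse_sequence_block(RAW)

if __name__ == "__main__":
    print("length:", len(SEQ))
    print("PTAP/PSAP motif (465-468):", region(SEQ, 465, 468))
    print("LYPX(n)L motif (491-500):", region(SEQ, 491, 500))
```

Checking the flattened length against the stated 508 AA is a cheap way to catch a truncated or mis-copied block before slicing out features.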

Isoform Gag-Pol polyprotein

See P17283.

References

[1] "Genetic organization of a chimpanzee lentivirus related to HIV-1."
Huet T., Cheynier R., Meyerhans A., Roelants G., Wain-Hobson S.
Nature 345:356-359 (1990)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: X52154 Genomic RNA. Translation: CAA36401.1.
PIR: FOLJSI. S09983.

3D structure databases

ProteinModelPortal: P17282.
SMR: P17282. Positions 2-442, 459-508.

Family and domain databases

Gene3D
  1.10.1200.30. 1 hit.
  1.10.150.90. 1 hit.
  1.10.375.10. 1 hit.
  4.10.60.10. 1 hit.
InterPro
  IPR000721. Gag_p24.
  IPR014817. Gag_p6.
  IPR000071. Lentvrl_matrix_N.
  IPR012344. Matrix_N_HIV/RSV.
  IPR008916. Retrov_capsid_C.
  IPR008919. Retrov_capsid_N.
  IPR010999. Retrovr_matrix_N.
  IPR001878. Znf_CCHC.
Pfam
  PF00540. Gag_p17. 1 hit.
  PF00607. Gag_p24. 1 hit.
  PF08705. Gag_p6. 1 hit.
  PF00098. zf-CCHC. 2 hits.
PRINTS
  PR00234. HIV1MATRIX.
SMART
  SM00343. ZnF_C2HC. 2 hits.
SUPFAM
  SSF47353. Retrov_capsid_C. 1 hit.
  SSF47943. Retrov_capsid_N. 1 hit.
  SSF47836. Retrovir_matrix. 1 hit.
  SSF57756. SSF57756. 1 hit.
PROSITE
  PS50158. ZF_CCHC. 2 hits.

Entry information

Entry name: GAG_SIVCZ
Accession: Primary (citable) accession number: P17282
Entry history
Integrated into UniProtKB/Swiss-Prot: August 1, 1990
Last sequence update: August 1, 1990
Last modified: April 3, 2013
This is version 88 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Viral Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families