Skip Header

Contribute Send feedback
Read comments (?) or add your own

P07567 (GAG_MPMV) Reviewed, UniProtKB/Swiss-Prot

Last modified May 29, 2013. Version 106. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Gag polyprotein
Alternative name(s):
Core polyprotein
Gene names
Name:gag
OrganismMason-Pfizer monkey virus (MPMV) (Simian Mason-Pfizer virus) [Complete proteome]
Taxonomic identifier11855 [NCBI]
Taxonomic lineageVirusesRetro-transcribing virusesRetroviridaeOrthoretrovirinaeBetaretrovirus
Virus hostMacaca mulatta (Rhesus macaque) [TaxID: 9544]

Protein attributes

Sequence length657 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

p10 is the matrix protein.

P14 is the nucleocapsid protein.

p27 is the capsid protein.

Subcellular location

Matrix protein p10: Virion Potential.

Capsid protein p27: Virion Potential.

Nucleocapsid protein p14: Virion Potential.

Domain

Late-budding domains (L domains) are short sequence motifs essential for viral particle budding. They recruit proteins of the host ESCRT machinery (Endosomal Sorting Complex Required for Transport) or ESCRT-associated proteins. Core phosphoprotein pp24 contains two L domains: a PTAP/PSAP motif which interacts with the UEV domain of TSG101, and a PPXY motif which probably binds to the WW domains of HECT (homologous to E6-AP C-terminus) E3 ubiquitin ligases By similarity.

Post-translational modification

p10 is myristoylated By similarity.

Sequence similarities

Contains 2 CCHC-type zinc fingers.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Initiator methionine11Removed; by host By similarity
Chain2 – 10099Matrix protein p10
PRO_0000040940
Chain101 – 216116Core phosphoprotein pp24
PRO_0000040941
Chain217 – 29882p12
PRO_0000040942
Chain299 – 526228Capsid protein p27
PRO_0000040943
Chain527 – 62195Nucleocapsid protein p14
PRO_0000040944
Chain622 – 65736p4
PRO_0000040945

Regions

Zinc finger547 – 56418CCHC-type 1
Zinc finger576 – 59318CCHC-type 2
Motif202 – 2054PPXY motif
Motif210 – 2134PTAP/PSAP motif

Amino acid modifications

Lipidation21N-myristoyl glycine; by host By similarity

Secondary structure

......................................... 657
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P07567 [UniParc].

Last modified January 23, 2007. Version 3.
Checksum: 301888B033BC3061

FASTA65773,110
        10         20         30         40         50         60 
MGQELSQHER YVEQLKQALK TRGVKVKYAD LLKFFDFVKD TCPWFPQEGT IDIKRWRRVG 

        70         80         90        100        110        120 
DCFQDYYNTF GPEKVPVTAF SYWNLIKELI DKKEVNPQVM AAVAQTEEIL KSNSQTDLTK 

       130        140        150        160        170        180 
TSQNPDLDLI SLDSDDEGAK SSSLQDKGLS STKKPKRFPV LLTAQTSKDP EDPNPSEVDW 

       190        200        210        220        230        240 
DGLEDEAAKY HNPDWPPFLT RPPPYNKATP SAPTVMAVVN PKEELKEKIA QLEEQIKLEE 

       250        260        270        280        290        300 
LHQALISKLQ KLKTGNETVT HPDTAGGLSR TPHWPGQHIP KGKCCASREK EEQIPKDIFP 

       310        320        330        340        350        360 
VTETVDGQGQ AWRHHNGFDF AVIKELKTAA SQYGATAPYT LAIVESVADN WLTPTDWNTL 

       370        380        390        400        410        420 
VRAVLSGGDH LLWKSEFFEN CRDTAKRNQQ AGNGWDFDML TGSGNYSSTD AQMQYDPGLF 

       430        440        450        460        470        480 
AQIQAAATKA WRKLPVKGDP GASLTGVKQG PDEPFADFVH RLITTAGRIF GSAEAGVDYV 

       490        500        510        520        530        540 
KQLAYENANP ACQAAIRPYR KKTDLTGYIR LCSDIGPSYQ QGLAMAAAFS GQTVKDFLNN 

       550        560        570        580        590        600 
KNKEKGGCCF KCGKKGHFAK NCHEHAHNNA EPKVPGLCPR CKRGKHWANE CKSKTDNQGN 

       610        620        630        640        650 
PIPPHQGNGW RGQPQAPKQA YGAVSFVPAN KNNPFQSLPE PPQEVQDWTS VPPPTQY 

« Hide

References

[1]"Nucleotide sequence of Mason-Pfizer monkey virus: an immunosuppressive D-type retrovirus."
Sonigo P., Barker C., Hunter E., Wain-Hobson S.
Cell 45:375-385(1986) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA] (CLONE 6A).
[2]"Purification and N-terminal amino acid sequence comparisons of structural proteins from retrovirus-D/Washington and Mason-Pfizer monkey virus."
Henderson L.E., Sowder R., Smythers G., Benveniste R.E., Oroszlan S.
J. Virol. 55:778-787(1985) [PubMed] [Europe PMC] [Abstract]
Cited for: PARTIAL PROTEIN SEQUENCE, PROTEOLYTIC PROCESSING OF POLYPROTEIN.
[3]"The three-dimensional solution structure of the matrix protein from the type D retrovirus, the Mason-Pfizer monkey virus, and implications for the morphology of retroviral assembly."
Conte M.R., Klikova M., Hunter E., Ruml T., Matthews S.
EMBO J. 16:5819-5826(1997) [PubMed] [Europe PMC] [Abstract]
Cited for: STRUCTURE BY NMR OF 1-94.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
M12349 Genomic RNA. Translation: AAA47710.1.
PIRFOLJMP. A25839.
RefSeqNP_056893.1. NC_001550.1.

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
1BAXNMR-A1-94[»]
1CL4NMR-A546-605[»]
2KGFNMR-A300-439[»]
2LPYNMR-A2-118[»]
4ARDelectron microscopy7.00A/B318-433[»]
ProteinModelPortalP07567.
SMRP07567. Positions 1-100, 574-605.
ModBaseSearch...

Protein-protein interaction databases

DIPDIP-45130N.
MINTMINT-188975.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID2746974.

Family and domain databases

Gene3D1.10.1200.30. 1 hit.
1.10.375.10. 1 hit.
4.10.60.10. 1 hit.
InterProIPR003322. B_retro_matrix_N.
IPR000721. Gag_p24.
IPR008916. Retrov_capsid_C.
IPR008919. Retrov_capsid_N.
IPR010999. Retrovr_matrix_N.
IPR001878. Znf_CCHC.
[Graphical view]
PfamPF02337. Gag_p10. 1 hit.
PF00607. Gag_p24. 1 hit.
PF00098. zf-CCHC. 1 hit.
[Graphical view]
ProDomPD004265. B_retro_matrix_N. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTSM00343. ZnF_C2HC. 2 hits.
[Graphical view]
SUPFAMSSF47353. Retrov_capsid_C. 1 hit.
SSF47943. Retrov_capsid_N. 1 hit.
SSF47836. Retrovir_matrix. 1 hit.
SSF57756. SSF57756. 1 hit.
PROSITEPS50158. ZF_CCHC. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

EvolutionaryTraceP07567.
PMAP-CutDBP07567.

Entry information

Entry nameGAG_MPMV
AccessionPrimary (citable) accession number: P07567
Entry history
Integrated into UniProtKB/Swiss-Prot: August 1, 1988
Last sequence update: January 23, 2007
Last modified: May 29, 2013
This is version 106 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

Relevant documents

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families