Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Gag polyprotein

Gene

gag

Organism
Mouse mammary tumor virus (strain BR6) (MMTV)
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Experimental evidence at protein leveli

Functioni

Matrix protein p10: Matrix protein.
Nucleocapsid protein p14: Nucleocapsid protein. Binds to single-stranded DNA.
Capsid protein p27: capsid protein.

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Zinc fingeri525 – 542CCHC-type 1PROSITE-ProRule annotationAdd BLAST18
Zinc fingeri552 – 569CCHC-type 2PROSITE-ProRule annotationAdd BLAST18

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Biological processi

Host-virus interaction, Viral budding, Viral budding via the host ESCRT complexes, Virus exit from host cell

Keywords - Ligandi

DNA-binding, Metal-binding, Nucleotide-binding, Viral nucleoprotein, Zinc

Names & Taxonomyi

Protein namesi
Gene namesi
Name:gag
OrganismiMouse mammary tumor virus (strain BR6) (MMTV)
Taxonomic identifieri11758 [NCBI]
Taxonomic lineageiVirusesRetro-transcribing virusesRetroviridaeOrthoretrovirinaeBetaretrovirus
Virus hostiMus musculus (Mouse) [TaxID: 10090]

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Capsid protein, Viral matrix protein, Virion

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Initiator methionineiRemoved; by hostBy similarity
ChainiPRO_00000409202 – 99Matrix protein p10By similarityAdd BLAST98
ChainiPRO_0000040921100 – 195Phosphorylated protein pp21By similarityAdd BLAST96
ChainiPRO_0000040922196 – 228Protein p3By similarityAdd BLAST33
ChainiPRO_0000040923229 – 252Protein p8By similarityAdd BLAST24
ChainiPRO_0000040924253 – 269Protein nBy similarityAdd BLAST17
ChainiPRO_0000040925270 – 496Capsid protein p27By similarityAdd BLAST227
ChainiPRO_0000040926497 – 591Nucleocapsid protein p14By similarityAdd BLAST95

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Lipidationi2N-myristoyl glycine; by hostBy similarity1

Post-translational modificationi

p10 is myristoylated.By similarity

Keywords - PTMi

Lipoprotein, Myristate, Phosphoprotein

Miscellaneous databases

PMAP-CutDBP10258.

Structurei

Secondary structure

1591
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Turni5 – 8Combined sources4
Helixi9 – 21Combined sources13
Turni22 – 24Combined sources3
Helixi29 – 42Combined sources14
Helixi45 – 49Combined sources5
Helixi54 – 71Combined sources18
Helixi73 – 75Combined sources3
Helixi80 – 90Combined sources11

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
4ZV5X-ray1.57A/B2-92[»]
ProteinModelPortaliP10258.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Motif

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Motifi305 – 308PTAP/PSAP motifSequence analysis4

Domaini

Late-budding domains (L domains) are short sequence motifs essential for viral particle budding. They recruit proteins of the host ESCRT machinery (Endosomal Sorting Complex Required for Transport) or ESCRT-associated proteins. Gag-p27 contains one L domain: a PTAP/PSAP motif, which interacts with the UEV domain of TSG101 (Potential).Curated

Sequence similaritiesi

Contains 2 CCHC-type zinc fingers.PROSITE-ProRule annotation

Zinc finger

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Zinc fingeri525 – 542CCHC-type 1PROSITE-ProRule annotationAdd BLAST18
Zinc fingeri552 – 569CCHC-type 2PROSITE-ProRule annotationAdd BLAST18

Keywords - Domaini

Repeat, Zinc-finger

Family and domain databases

Gene3Di1.10.1200.30. 1 hit.
1.10.375.10. 1 hit.
4.10.60.10. 1 hit.
InterProiIPR003322. B_retro_matrix.
IPR000721. Gag_p24.
IPR008916. Retrov_capsid_C.
IPR008919. Retrov_capsid_N.
IPR010999. Retrovr_matrix.
IPR001878. Znf_CCHC.
[Graphical view]
PfamiPF02337. Gag_p10. 1 hit.
PF00607. Gag_p24. 1 hit.
PF00098. zf-CCHC. 1 hit.
[Graphical view]
ProDomiPD004265. B_retro_matrix_N. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00343. ZnF_C2HC. 2 hits.
[Graphical view]
SUPFAMiSSF47353. SSF47353. 1 hit.
SSF47836. SSF47836. 1 hit.
SSF47943. SSF47943. 1 hit.
SSF57756. SSF57756. 2 hits.
PROSITEiPS50158. ZF_CCHC. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P10258-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MGVSGSKGQK LFVSVLQRLL SERGLHVKES SAIEFYQFLI KVSPWFPEEG
60 70 80 90 100
GLNLQDWKRV GREMKRYAAE HGTDSIPKQA YPIWLQLREI LTEQSDLVLL
110 120 130 140 150
SAEAKSVTEE ELEEGLTGLL STSSQEKTYG TRGTAYAEID TEVDKLSEHI
160 170 180 190 200
YDEPYEEKEK ADKNEEKDHV RKIKKVVQRK ENSEGKRKEK DSKAFLATDW
210 220 230 240 250
NDDDLSPEDW DDLEEQAAHY HDDDELILPV KRKVVKKKPQ ALRRKPLPPV
260 270 280 290 300
GFAGAMAEAR EKGDLTFTFP VVFMGESDED DTPVWEPLPL KTLKELQSAV
310 320 330 340 350
RTMGPSAPYT LQVVDMVASQ WLTPSDWHQT ARATLSPGDY VLWRTEYEEK
360 370 380 390 400
SKEMVQKAAG KRKGKVSLDM LLGTGQFLSP SSQIKLSKDV LKDVTTNAVL
410 420 430 440 450
AWRAIPPPGV KKTVLAGLKQ GNEESYETFI SRLEEAVYRM MPRGEGSDIL
460 470 480 490 500
IKQLAWENAN SLCQDLIRPI RKTGTIQDYI RACLDASPAV VQGMAYAAAM
510 520 530 540 550
RGQKYSTFVK QTYGGGKGGQ GAEGPVCFSC GKTGHIRKDC KDEKGSKRAP
560 570 580 590
PGLCPRCKKG YHWKSECKSK FDKDGNPLPP LETNAENSKN L
Length:591
Mass (Da):66,269
Last modified:January 23, 2007 - v3
Checksum:i8A5C2212460864A5
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M15122 Genomic RNA. Translation: AAA46543.1.
PIRiA26795. FOMVMM.
RefSeqiNP_056882.1. NC_001503.1.

Genome annotation databases

GeneIDi1491864.
KEGGivg:1491864.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M15122 Genomic RNA. Translation: AAA46543.1.
PIRiA26795. FOMVMM.
RefSeqiNP_056882.1. NC_001503.1.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
4ZV5X-ray1.57A/B2-92[»]
ProteinModelPortaliP10258.
ModBaseiSearch...
MobiDBiSearch...

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

GeneIDi1491864.
KEGGivg:1491864.

Miscellaneous databases

PMAP-CutDBP10258.

Family and domain databases

Gene3Di1.10.1200.30. 1 hit.
1.10.375.10. 1 hit.
4.10.60.10. 1 hit.
InterProiIPR003322. B_retro_matrix.
IPR000721. Gag_p24.
IPR008916. Retrov_capsid_C.
IPR008919. Retrov_capsid_N.
IPR010999. Retrovr_matrix.
IPR001878. Znf_CCHC.
[Graphical view]
PfamiPF02337. Gag_p10. 1 hit.
PF00607. Gag_p24. 1 hit.
PF00098. zf-CCHC. 1 hit.
[Graphical view]
ProDomiPD004265. B_retro_matrix_N. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTiSM00343. ZnF_C2HC. 2 hits.
[Graphical view]
SUPFAMiSSF47353. SSF47353. 1 hit.
SSF47836. SSF47836. 1 hit.
SSF47943. SSF47943. 1 hit.
SSF57756. SSF57756. 2 hits.
PROSITEiPS50158. ZF_CCHC. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiGAG_MMTVB
AccessioniPrimary (citable) accession number: P10258
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 1, 1989
Last sequence update: January 23, 2007
Last modified: October 5, 2016
This is version 103 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

3D-structure

Documents

  1. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.