Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

P03391 (ENV_FSVGA) Reviewed, UniProtKB/Swiss-Prot

Last modified February 19, 2014. Version 94. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein namesRecommended name:
Envelope glycoprotein
Alternative name(s):
Env polyprotein

Cleaved into the following 3 chains:

  1. Surface protein
    Short name=SU
    Alternative name(s):
    Glycoprotein 70
    Short name=gp70
  2. Transmembrane protein
    Short name=TM
    Alternative name(s):
    Envelope protein p15E
  3. R-peptide
    Alternative name(s):
    p2E
Gene names
Name:env
OrganismFeline sarcoma virus (strain Gardner-Arnstein) (Ga-FeSV) (Gardner-Arnstein feline leukemia oncovirus B)
Taxonomic identifier11774 [NCBI]
Taxonomic lineageVirusesRetro-transcribing virusesRetroviridaeOrthoretrovirinaeGammaretrovirus
Virus hostFelidae (cat family) [TaxID: 9681]

Protein attributes

Sequence length662 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceInferred from homology

General annotation (Comments)

Function

The surface protein (SU) attaches the virus to the host cell by binding to its receptor. This interaction triggers the refolding of the transmembrane protein (TM) and is thought to activate its fusogenic potential by unmasking its fusion peptide. Fusion occurs at the host cell plasma membrane By similarity.

The transmembrane protein (TM) acts as a class I viral fusion protein. Under the current model, the protein has at least 3 conformational states: pre-fusion native state, pre-hairpin intermediate state, and post-fusion hairpin state. During viral and target cell membrane fusion, the coiled coil regions (heptad repeats) assume a trimer-of-hairpins structure, positioning the fusion peptide in close proximity to the C-terminal region of the ectodomain. The formation of this structure appears to drive apposition and subsequent fusion of viral and target cell membranes. Membranes fusion leads to delivery of the nucleocapsid into the cytoplasm By similarity.

Subunit structure

The mature envelope protein (Env) consists of a trimer of SU-TM heterodimers attached by a labile interchain disulfide bond By similarity.

Subcellular location

Transmembrane protein: Virion membrane; Single-pass type I membrane protein By similarity. Host cell membrane; Single-pass type I membrane protein By similarity.

Surface protein: Virion membrane; Peripheral membrane protein. Host cell membrane; Peripheral membrane protein By similarity. Note: The surface protein is not anchored to the viral envelope, but associates with the extravirion surface through its binding to TM. Both proteins are thought to be concentrated at the site of budding and incorporated into the virions possibly by contacts between the cytoplasmic tail of Env and the N-terminus of Gag By similarity.

R-peptide: Host cell membrane; Peripheral membrane protein By similarity. Note: The R-peptide is membrane-associated through its palmitate By similarity.

Domain

The 17 amino acids long immunosuppressive region is present in many retroviral envelope proteins. Synthetic peptides derived from this relatively conserved sequence inhibit immune function in vitro and in vivo By similarity.

Post-translational modification

Specific enzymatic cleavages in vivo yield mature proteins. Envelope glycoproteins are synthesized as a inactive precursor that is N-glycosylated and processed likely by host cell furin or by a furin-like protease in the Golgi to yield the mature SU and TM proteins. The cleavage site between SU and TM requires the minimal sequence [KR]-X-[KR]-R. The R-peptide is released from the C-terminus of the cytoplasmic tail of the TM protein upon particle formation as a result of proteolytic cleavage by the viral protease. Cleavage of this peptide is required for TM to become fusogenic By similarity.

The CXXC motif is highly conserved across a broad range of retroviral envelope proteins. It is thought to participate in the formation of a labile disulfide bond possibly with the CX6CC motif present in the transmembrane protein. Isomerization of the intersubunit disulfide bond to an SU intrachain disulfide bond is thought to occur upon receptor recognition in order to allow membrane fusion By similarity.

The transmembrane protein is palmitoylated By similarity.

The R-peptide is palmitoylated By similarity.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 3434 Potential
Chain35 – 662628Envelope glycoprotein
PRO_0000239570
Chain35 – 465431Surface protein By similarity
PRO_0000040724
Chain466 – 645180Transmembrane protein By similarity
PRO_0000040725
Peptide646 – 66217R-peptide By similarity
PRO_0000239571

Regions

Topological domain35 – 606572Extracellular Potential
Transmembrane607 – 62721Helical; Potential
Topological domain628 – 66235Cytoplasmic Potential
Region468 – 48821Fusion peptide Potential
Region534 – 55017Immunosuppression By similarity
Coiled coil496 – 54550 Potential
Coiled coil555 – 59137 Potential
Motif332 – 3354CXXC
Motif551 – 5599CX6CC

Sites

Site465 – 4662Cleavage; by host By similarity
Site645 – 6462Cleavage; by viral protease By similarity

Amino acid modifications

Lipidation6261S-palmitoyl cysteine; by host By similarity
Glycosylation431N-linked (GlcNAc...); by host Potential
Glycosylation581N-linked (GlcNAc...); by host Potential
Glycosylation2861N-linked (GlcNAc...); by host Potential
Glycosylation3221N-linked (GlcNAc...); by host Potential
Glycosylation3271N-linked (GlcNAc...); by host Potential
Glycosylation3511N-linked (GlcNAc...); by host Potential
Glycosylation3541N-linked (GlcNAc...); by host Potential
Glycosylation3941N-linked (GlcNAc...); by host Potential
Glycosylation4101N-linked (GlcNAc...); by host Potential
Glycosylation4301N-linked (GlcNAc...); by host Potential
Disulfide bond115 ↔ 132 By similarity
Disulfide bond124 ↔ 137 By similarity
Disulfide bond332 ↔ 559Interchain (between SU and TM chains, or C-335 with C-559); alternate By similarity
Disulfide bond332 ↔ 335Alternate By similarity
Disulfide bond551 ↔ 558 By similarity

Experimental info

Sequence conflict151Missing Ref.4
Sequence conflict411V → I Ref.4
Sequence conflict471T → V Ref.4
Sequence conflict51 – 566LVTGTK → VQTNTQ Ref.4
Sequence conflict70 – 756FPTMYF → YPTLHV Ref.4
Sequence conflict80 – 9516IIGNT…EPFPG → LVGDSWEPIVLDPNNVKHGA RYSSSK Ref.4
Sequence conflict99 – 11012DQPMR…QQRNT → KTTDRKKQQQTY Ref.4
Sequence conflict120 – 1234NRKQ → PSLGPKGTH Ref.4
Sequence conflict1271P → A Ref.4
Sequence conflict1341V → A Ref.4
Sequence conflict143 – 1486TYWRPT → AWWKPS Ref.4
Sequence conflict158 – 19336KGVTQ…SEGGR → RGSSQDTNSCEGK Ref.4
Sequence conflict2081T → A Ref.4
Sequence conflict2151S → M Ref.4
Sequence conflict2231S → T Ref.4
Sequence conflict2321S → T Ref.4
Sequence conflict2381M → S Ref.4
Sequence conflict264 – 30037IESRV…VTPAS → TGSKVATQRPQTNESAPRSV APTTMG Ref.4

Sequences

Sequence LengthMass (Da)Tools
P03391 [UniParc].

Last modified July 21, 1986. Version 1.
Checksum: 1482088D547CFF47

FASTA66273,150
        10         20         30         40         50         60 
MESPTHPKPS KDKTLSWNLV FLVGILFTID IGMANPSPHQ VYNVTWTITN LVTGTKANAT 

        70         80         90        100        110        120 
SMLGTLTDAF PTMYFDLCDI IGNTWNPSDQ EPFPGYGCDQ PMRRWQQRNT PFYVCPGHAN 

       130        140        150        160        170        180 
RKQCGGPQDG FCAVWGCETT GETYWRPTSS WDYITVKKGV TQGIYQCSGG GWCGPCYDKA 

       190        200        210        220        230        240 
VHSSTTGASE GGRCNPLILQ FTQKGRQTSW DGPKSWGLRL YRSGYDPIAL FSVSRQVMTI 

       250        260        270        280        290        300 
TPPQAMGPNL VLPDQKPPSR QSQIESRVTP HHSQGNGGTP GITLVNASIA PLSTPVTPAS 

       310        320        330        340        350        360 
PKRIGTGDRL INLVQGTYLA LNATDPNRTK DCWLCLVSRP PYYEGIAILG NYSNQTNPPP 

       370        380        390        400        410        420 
SCLSIPQHKL TISEVSGQGL CIGTVPKTHQ ALCNETQQGH TGAHYLAAPN GTYWACNTGL 

       430        440        450        460        470        480 
TPCISMAVLN WTSDFCVLIE LWPRVTYHQP EYVYTHFAKA ARFRREPISL TVALMLGGLT 

       490        500        510        520        530        540 
VGGIAAGVGT GTKALIETAQ FRQLQMAMHT DIQALEESIS ALEKSLTSLS EVVLQNRRGL 

       550        560        570        580        590        600 
DILFLQEGGL CAALKEECCF YADHTGLVRD NMAKLRERLK QRQQLFDSQQ GWFEGWFNKS 

       610        620        630        640        650        660 
PWFTTLISSI MGPLLILLLI LLFGPCILNR LVQFVKDRIS VVQALILTQQ YQQIKQYDPD 


RP 

« Hide

References

[1]"Nucleotide sequences of the envelope genes of two isolates of feline leukemia virus subgroup B."
Nunberg J.H., Williams M.E., Innis M.A.
J. Virol. 49:629-632(1984) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
[2]"Nucleotide sequence of the envelope gene of Gardner-Arnstein feline leukemia virus B reveals unique sequence homologies with a murine mink cell focus-forming virus."
Elder J.H., Mullins J.I.
J. Virol. 46:871-880(1983) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[3]"Sequence analysis of Gardner-Arnstein feline leukaemia virus envelope gene reveals common structural properties of mammalian retroviral envelope genes."
Wunsch M., Schulz A.S., Koch W., Friedrich R., Hunsmann G.
EMBO J. 2:2239-2246(1983) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[4]"Nucleotide sequence analysis of the LTRs and env genes of SM-FeSV and GA-FeSV."
Guilhot S., Hampe A., D'Auriol L., Galibert F.
Virology 161:252-258(1987) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
K01209 Genomic RNA. Translation: AAA43052.1.
V01172 Genomic DNA. Translation: CAA24497.1.
X00188 Genomic DNA. Translation: CAA25008.1.
M23026 Genomic DNA. No translation available.
PIRVCVWGF. A03991.

3D structure databases

ProteinModelPortalP03391.
SMRP03391. Positions 38-242, 511-563.
ModBaseSearch...
MobiDBSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Family and domain databases

Gene3D3.90.310.10. 1 hit.
InterProIPR008981. FMuLV_rcpt-bd.
IPR018154. TLV/ENV_coat_polyprotein.
[Graphical view]
PANTHERPTHR10424. PTHR10424. 1 hit.
PfamPF00429. TLV_coat. 1 hit.
[Graphical view]
SUPFAMSSF49830. SSF49830. 1 hit.
ProtoNetSearch...

Entry information

Entry nameENV_FSVGA
AccessionPrimary (citable) accession number: P03391
Secondary accession number(s): P21446
Entry history
Integrated into UniProtKB/Swiss-Prot: July 21, 1986
Last sequence update: July 21, 1986
Last modified: February 19, 2014
This is version 94 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program