Skip Header

Contribute Send feedback
Read comments (?) or add your own

P03144 (HBSAG_GSHV) Reviewed, UniProtKB/Swiss-Prot

Last modified May 31, 2011. Version 53. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (1) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Large envelope protein
Alternative name(s):
L glycoprotein
L-HBsAg
Short name=LHB
Large S protein
Large surface protein
Major surface antigen
Gene names
Name:S
OrganismGround squirrel hepatitis virus (strain 27) (GSHV) [Complete proteome]
Taxonomic identifier10406 [NCBI]
Taxonomic lineageVirusesRetro-transcribing virusesHepadnaviridaeOrthohepadnavirus
Virus hostSpermophilus [TaxID: 9996]

Protein attributes

Sequence length428 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceInferred from homology

General annotation (Comments)

Function

The large envelope protein exists in two topological conformations, one which is termed 'external' or Le-HBsAg and the other 'internal' or Li-HBsAg. In its external conformation the protein attaches the virus to cell receptors and thereby initiating infection. This interaction determines the species specificity and liver tropism. The large envelope protein probably also assumes fusion between virion and host membranes. In its internal conformation the protein plays a role in virion morphogenesis and mediates the contact with the nucleocapsid like a matrix protein By similarity.

The middle envelope protein plays an important role in the budding of the virion. It is involved in the induction of budding in a nucleocapsid independent way. In this process the majority of envelope proteins bud to form subviral lipoprotein particles of 22 nm of diameter that do not contain a nucleocapsid By similarity.

Subunit structure

Li-HBsAg interacts with capsid protein and with HDV Large delta antigen. Isoform M associates with host chaperone CANX through its pre-S2 N glycan. This association may be essential for M proper secretion By similarity.

Subcellular location

Virion membrane By similarity.

Domain

The large envelope protein is synthesized with the pre-S region at the cytosolic side of the endoplasmic reticulum and, hence will be within the virion after budding. Therefore the pre-S region is not N-glycosylated. Later a post-translational translocation of N-terminal pre-S and TM1 domains occur in about 50% of proteins at the virion surface. These molecules change their topology by an unknown mechanism, resulting in exposure of pre-S region at virion surface. For isoform M in contrast, the pre-S2 region is translocated cotranslationally to the endoplasmic reticulum lumen and is N-glycosylated.

Post-translational modification

Isoform M is N-terminaly acetylated at a ratio of 90%, and N-glycosylated at the pre-S2 region By similarity.

Myristoylated By similarity.

Sequence similarities

Belongs to the orthohepadnavirus major surface antigen family.

Sequence caution

The sequence AAA46757.1 differs from that shown. Reason: Erroneous initiation.

Alternative products

This entry describes 3 isoforms produced by alternative splicing and alternative initiation. [Align] [Select]
Isoform L (identifier: P03144-1)

Also known as: Large envelope protein; LHB; L-HBsAg;

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform M (identifier: P03144-2)

Also known as: Middle envelope protein; MHB; M-HBsAg;

The sequence of this isoform differs from the canonical sequence as follows:
     1-146: Missing.
Isoform S (identifier: P03144-3)

Also known as: Small envelope protein; SHB; S-HBsAg;

The sequence of this isoform differs from the canonical sequence as follows:
     1-206: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Initiator methionine11Removed; by host By similarity
Chain2 – 428427Large envelope protein
PRO_0000038083

Regions

Topological domain4 – 283280Cytoplasmic; in internal conformation Potential
Topological domain4 – 211208Extracellular; in external conformation Potential
Transmembrane212 – 23221Helical; Name=TM1; Note=In external conformation; Potential
Topological domain233 – 28351Cytoplasmic; in external conformation Potential
Transmembrane284 – 30421Helical; Name=TM2; Potential
Topological domain305 – 37672Extracellular Potential
Transmembrane377 – 39721Helical; Name=TM3; Potential
Topological domain398 – 4036Cytoplasmic Potential
Transmembrane404 – 42623Helical; Potential
Topological domain427 – 4282Extracellular Potential
Region2 – 206205Pre-S By similarity
Region2 – 146145Pre-S1 By similarity
Region147 – 20660Pre-S2 By similarity

Amino acid modifications

Lipidation21N-myristoyl glycine; by host By similarity
Glycosylation1491N-linked (GlcNAc...); by host; in isoform M By similarity
Glycosylation3481N-linked (GlcNAc...); by host By similarity

Natural variations

Alternative sequence1 – 206206Missing in isoform S.
VSP_031448
Alternative sequence1 – 146146Missing in isoform M.
VSP_031449

Sequences

Sequence LengthMass (Da)Tools
Isoform L (Large envelope protein) (LHB) (L-HBsAg) [UniParc].

Last modified July 21, 1986. Version 1.
Checksum: 5E84114BC33862D7

FASTA42848,383
        10         20         30         40         50         60 
MGNNIKVTFD PNKLAAWWPT VGTYYTPTTT VTNPAIFKPG IYQTTSLKNP KNQQELDAIL 

        70         80         90        100        110        120 
MTRYKEIDWD NWQGFPVNQR LPVSNNNPPS GQRAETFEIK SRPIIVPGIR DIPRGIVPPQ 

       130        140        150        160        170        180 
TPSNRDQRRK PTPLTPPLRD THPHLTMKNQ TGHLQGFAEG LRALTTSDHH NSAYGDPFTT 

       190        200        210        220        230        240 
LSPVVPTVST TLSPPLTIGD PVLSTEMSPS GLLGLLAGLQ VVYFLWTKIL TIAQSLDWWW 

       250        260        270        280        290        300 
TSLSFPGGIP ECTGQNLQFQ TCKHLPTSCP PTCNGFRWMY LRRFIIYLLV LLLFLTFLLV 

       310        320        330        340        350        360 
LLDWKGLLPV CPMMPATETT VNCRQCTISA QDTFTTPYCC CLKPTAGNCT CWPIPSSWAL 

       370        380        390        400        410        420 
GSYLWEWALA RFSWLSLLVP LLQWLGGISL TVWLLLIWMI WFWGPVLMSI LPPFIPIFAL 


FFLIWAYI 

« Hide

Isoform M (Middle envelope protein) (MHB) (M-HBsAg) [UniParc].

Checksum: A82E58360A5DDFDF
Show »

FASTA28231,764
Isoform S (Small envelope protein) (SHB) (S-HBsAg) [UniParc].

Checksum: B44662CA25AEF543
Show »

FASTA22225,475

References

[1]"Nucleotide sequence of an infectious molecularly cloned genome of ground squirrel hepatitis virus."
Seeger C., Ganem D., Varmus H.E.
J. Virol. 51:367-375(1984) [PubMed: 6086950] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[2]"Functions of the large hepatitis B virus surface protein in viral particle morphogenesis."
Bruss V., Gerhardt E., Vieluf K., Wunderlich G.
Intervirology 39:23-31(1996) [PubMed: 8957666] [Abstract]
Cited for: REVIEW.
[3]"Role of glycan processing in hepatitis B virus envelope protein trafficking."
Block T.M., Lu X., Mehta A., Park J., Blumberg B.S., Dwek R.
Adv. Exp. Med. Biol. 435:207-216(1998) [PubMed: 9498079] [Abstract]
Cited for: REVIEW.
[4]"Envelopment of the hepatitis B virus nucleocapsid."
Bruss V.
Virus Res. 106:199-209(2004) [PubMed: 15567498] [Abstract]
Cited for: REVIEW.
[5]"Hepatitis B virus pre-S mutants, endoplasmic reticulum stress and hepatocarcinogenesis."
Wang H.C., Huang W., Lai M.D., Su I.J.
Cancer Sci. 97:683-688(2006) [PubMed: 16863502] [Abstract]
Cited for: REVIEW.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
K02715 Genomic DNA. Translation: AAA46757.1. Different initiation.
PIRSAVLS. A03709.
RefSeqNP_040995.1. NC_001484.1.
NP_955537.1. NC_001484.1.

3D structure databases

ModBaseSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID1488458.

Family and domain databases

InterProIPR000349. Hepvir_surfAg.
[Graphical view]
PfamPF00695. vMSA. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameHBSAG_GSHV
AccessionPrimary (citable) accession number: P03144
Entry history
Integrated into UniProtKB/Swiss-Prot: July 21, 1986
Last sequence update: July 21, 1986
Last modified: May 31, 2011
This is version 53 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families