Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

P30215 (HEMA_CVHOC) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 89. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (1) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Hemagglutinin-esterase

Short name=HE protein
EC=3.1.1.53
Alternative name(s):
E3 glycoprotein
Gene names
Name:HE
ORF Names:2b
OrganismHuman coronavirus OC43 (HCoV-OC43) [Complete proteome]
Taxonomic identifier31631 [NCBI]
Taxonomic lineageVirusesssRNA positive-strand viruses, no DNA stageNidoviralesCoronaviridaeCoronavirinaeBetacoronavirus
Virus hostHomo sapiens (Human) [TaxID: 9606]

Protein attributes

Sequence length424 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceInferred from homology

General annotation (Comments)

Function

Structural protein that makes short spikes at the surface of the virus. Contains receptor binding and receptor-destroying activities. Mediates de-O-acetylation of N-acetyl-9-O-acetylneuraminic acid, which is probably the receptor determinant recognized by the virus on the surface of erythrocytes and susceptible cells. This receptor-destroying activity is important for virus release as it probably helps preventing self-aggregation and ensures the efficient spread of the progeny virus from cell to cell. May serve as a secondary viral attachment protein for initiating infection, the spike protein being the major one. Seems to be a 'luxury' protein that is not absolutely necessary for virus infection in culture. However, its presence in the virus may alter its pathogenicity. May become a target for both the humoral and the cellular branches of the immune system By similarity.

Catalytic activity

N-acetyl-O-acetylneuraminate + H2O = N-acetylneuraminate + acetate.

Subunit structure

Homodimer; disulfide-linked. Forms a complex with the M protein in the pre-Golgi. Associates then with S-M complex to form a ternary complex S-M-HE.

Subcellular location

Virion membrane; Single-pass type I membrane protein Potential. Host cell membrane; Single-pass type I membrane protein Potential. Note: In infected cells becomes incorporated into the envelope of virions during virus assembly at the endoplasmic reticulum and cis Golgi. However, some may escape incorporation into virions and subsequently migrate to the cell surface By similarity.

Post-translational modification

N-glycosylated in the RER By similarity.

Sequence similarities

Belongs to the influenza type C/coronaviruses hemagglutinin-esterase family.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1818 By similarity
Chain19 – 424406Hemagglutinin-esterase
PRO_0000037144

Regions

Topological domain19 – 392374Virion surface Potential
Transmembrane393 – 41321Helical; Potential
Topological domain414 – 42411Intravirion Potential
Region7 – 127121Esterase domain first part By similarity
Region128 – 266139Receptor binding By similarity
Region267 – 379113Esterase domain second part By similarity

Sites

Active site401Nucleophile By similarity
Active site2281Charge relay system By similarity
Active site3291Charge relay system By similarity

Amino acid modifications

Glycosylation541N-linked (GlcNAc...); by host Potential
Glycosylation891N-linked (GlcNAc...); by host Potential
Glycosylation1141N-linked (GlcNAc...); by host Potential
Glycosylation1531N-linked (GlcNAc...); by host Potential
Glycosylation2361N-linked (GlcNAc...); by host Potential
Glycosylation3011N-linked (GlcNAc...); by host Potential
Glycosylation3161N-linked (GlcNAc...); by host Potential
Glycosylation3581N-linked (GlcNAc...); by host Potential
Disulfide bond44 ↔ 65 By similarity
Disulfide bond113 ↔ 162 By similarity
Disulfide bond197 ↔ 276 By similarity
Disulfide bond205 ↔ 249 By similarity
Disulfide bond307 ↔ 312 By similarity
Disulfide bond347 ↔ 371 By similarity

Natural variations

Natural variant1581S → A in strain: Isolate ATCC VR-759 and Isolate clinical OC43-Paris.
Natural variant3791N → I in strain: Isolate ATCC VR-759 and Isolate clinical OC43-Paris.
Natural variant403 – 4042VI → IV in strain: Isolate ATCC VR-759 and Isolate clinical OC43-Paris.
Natural variant4181G → V in strain: Isolate ATCC VR-759 and Isolate clinical OC43-Paris.

Sequences

Sequence LengthMass (Da)Tools
P30215 [UniParc].

Last modified April 1, 1993. Version 1.
Checksum: EA43A1CCDF8598D7

FASTA42447,724
        10         20         30         40         50         60 
MFLLPRFILV SCIIGSLGFY NPPTNVVSHV NGDWFLFGDS RSDCNHIVNI NPHNYSYMDL 

        70         80         90        100        110        120 
NPVLCDSGKI SSKAGNSIFR SFHFTDFYNY TGEGQQIIFY EGVNFTPYHA FKCNRSGSND 

       130        140        150        160        170        180 
IWMQNKGLFY TQVYKNMAVY RSLTFVNVPY VYNGSAQSTA LCKSGSLVLN NPAYIAPQAN 

       190        200        210        220        230        240 
SGDYYYKVEA DFYLSGCDEY IVPLCIFNGK FLSNTKYYDD SQYYFNKDTG VIYGLNSTET 

       250        260        270        280        290        300 
ITTGFDLNCY YLVLPSGNYL AISNELLLTV PTKAICLNKR KDFTPVQVVD SRWNNARQSD 

       310        320        330        340        350        360 
NMTAVACQPP YCYFRNSTTN YVGVYDINHG DAGFTSILSG LLYNSPCFSQ QGVFRYDNVS 

       370        380        390        400        410        420 
SVWPLYPYGR CPTAADINNP DLPICVYDPL PVILLGILLG VAVIIIVVLL LYFMVDNGTR 


LHDA 

« Hide

References

[1]"The hemagglutinin/esterase gene of human coronavirus strain OC43: phylogenetic relationships to bovine and murine coronaviruses and influenza C virus."
Zhang X.M., Kousoulas K.G., Storz J.
Virology 186:318-323(1992) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
[2]"Human respiratory coronavirus OC43: genetic stability and neuroinvasion."
St Jean J.R., Jacomy H., Desforges M., Vabret A., Freymuth F., Talbot P.J.
J. Virol. 78:8824-8834(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
Strain: Isolate ATCC VR-759 and Isolate clinical OC43-Paris.
[3]"Complete genomic sequence of human coronavirus OC43: molecular clock analysis suggests a relatively recent zoonotic coronavirus transmission event."
Vijgen L., Keyaerts E., Moes E., Thoelen I., Wollants E., Lemey P., Vandamme A.M., Van Ranst M.
J. Virol. 79:1595-1604(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
Strain: Isolate ATCC VR-759.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
M76373 Genomic RNA. Translation: AAA45460.1.
AY391777 Genomic RNA. Translation: AAR01014.1.
AY585228 Genomic RNA. Translation: AAT84353.1.
AY585229 Genomic RNA. Translation: AAT84361.1.
PIRHMIHCC. A39450.
RefSeqNP_937949.1. NC_005147.1.

3D structure databases

ProteinModelPortalP30215.
SMRP30215. Positions 19-376.
ModBaseSearch...
MobiDBSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID2648209.

Family and domain databases

InterProIPR008980. Capsid_hemagglutn.
IPR007142. Hemagglutn-estrase_core.
IPR003860. Hemagglutn-estrase_hemagglutn.
[Graphical view]
PfamPF03996. Hema_esterase. 1 hit.
PF02710. Hema_HEFG. 1 hit.
[Graphical view]
SUPFAMSSF49818. SSF49818. 1 hit.
ProtoNetSearch...

Entry information

Entry nameHEMA_CVHOC
AccessionPrimary (citable) accession number: P30215
Secondary accession number(s): Q6TNG0
Entry history
Integrated into UniProtKB/Swiss-Prot: April 1, 1993
Last sequence update: April 1, 1993
Last modified: April 16, 2014
This is version 89 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families