Protein

Hemagglutinin-esterase

Gene

HE

Organism
Bovine coronavirus (strain Quebec) (BCoV) (BCV)
Status
Reviewed - Annotation score: 5 out of 5 - Protein inferred from homology

Function

Structural protein that forms short spikes on the surface of the virus. Contains receptor-binding and receptor-destroying activities. Mediates de-O-acetylation of N-acetyl-9-O-acetylneuraminic acid, which is probably the receptor determinant recognized by the virus on the surface of erythrocytes and susceptible cells. This receptor-destroying activity is important for virus release, as it probably helps prevent self-aggregation and ensures the efficient spread of the progeny virus from cell to cell. May serve as a secondary viral attachment protein for initiating infection, the spike protein being the major one. Seems to be a 'luxury' protein that is not absolutely necessary for virus infection in culture. However, its presence in the virus may alter its pathogenicity. May become a target for both the humoral and the cellular branches of the immune system (By similarity).

Catalytic activity

N-acetyl-O-acetylneuraminate + H2O = N-acetylneuraminate + acetate.

Sites

Feature key    Position(s)    Description                           Length
Active site    40             Nucleophile (By similarity)           1
Active site    326            Charge relay system (By similarity)   1
Active site    329            Charge relay system (By similarity)   1
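
The active-site positions can be cross-checked against the sequence displayed in this entry. The following is a minimal sketch, not part of the UniProt record: the helper name `residue_at` is hypothetical, and positions are taken to be 1-based as in the table. A serine nucleophile with an aspartate and a histidine in the charge relay system is what one would expect for an SGNH-type hydrolase fold, which the entry's domain cross-references point to.

```python
# Sequence copied from this entry in its 10-residue display blocks.
SEQ_BLOCKS = """
MFLLLRFVLV SCIIGSLGFD NPPTNVVSHL NGDWFLFGDS RSDCNHVVNT
NPRNYSYMDL NPALCDSGKI SSKAGNSIFR SFHFTDFYNY TGEGQQIIFY
EGLNFTPYHA FKCTTSGSND IWMQNKGLFY TQVYKNMAVY RSLTFVNVPY
VYNGSAQSTA LCKSGSLVLN NPAYIAREAN FGDYYYKVEA DFYLSGCDEY
IVPLCIFNGK FLSNTKYYDD SQYYFNKDTG VIYGLNSTET ITTGFDFNCH
YLVLPSGNYL AISNELLLTV PTKAICLNKR KDFTPVQVVD SRWNNARQSD
NMTAVACQPP YCYFRNSTTN YVGVYDINHG DAGFTSILSG LLYDSPCFSQ
QGVFRYDNVS SVWPLYSYGR CPTAADINTP DVPICVYDPL PLILLGILLG
VAVIIIVVLL LYFMVDNGTR LHDA
"""
SEQ = "".join(SEQ_BLOCKS.split())  # drop spaces and newlines


def residue_at(seq, pos):
    """Return the residue at a 1-based position (hypothetical helper)."""
    return seq[pos - 1]


# Positions and roles as listed in the Sites table.
ACTIVE_SITES = {
    40: "Nucleophile",
    326: "Charge relay system",
    329: "Charge relay system",
}

active_site_residues = {pos: residue_at(SEQ, pos) for pos in ACTIVE_SITES}

for pos, role in ACTIVE_SITES.items():
    print(f"{pos}: {active_site_residues[pos]} ({role})")
```

The lookup only confirms that the annotated positions carry residues compatible with the stated roles; it says nothing about catalytic activity itself, which the entry infers by similarity.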

GO - Molecular function

GO - Biological process

Keywords - Molecular function

Hemagglutinin, Hydrolase

Names & Taxonomy

Protein names
Recommended name: Hemagglutinin-esterase (EC:3.1.1.53)
Short name: HE protein
Alternative name(s): E3 glycoprotein
Gene names
Name: HE
ORF names: 2b
Organism: Bovine coronavirus (strain Quebec) (BCoV) (BCV)
Taxonomic identifier: 11133 [NCBI]
Taxonomic lineage: Viruses > ssRNA viruses > ssRNA positive-strand viruses, no DNA stage > Nidovirales > Coronaviridae > Coronavirinae > Betacoronavirus
Virus host: Bos taurus (Bovine) [TaxID: 9913]
Proteomes
  • UP000008572 Component: Genome

Subcellular location

Topology

Feature key          Position(s)    Description                          Length
Topological domain   19 – 392       Virion surface (Sequence analysis)   374
Transmembrane        393 – 413      Helical (Sequence analysis)          21
Topological domain   414 – 424      Intravirion (Sequence analysis)      11
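
The Length column follows from the inclusive position ranges (end - start + 1). The sketch below recomputes those lengths and checks that the signal peptide (positions 1 – 18, listed under Molecule processing in this entry) plus the three topology segments cover the 424-residue sequence without gaps or overlaps. The function names `segment_length`, `tiles_exactly` and `locate` are illustrative, not part of any UniProt tooling.

```python
# Segment boundaries as listed in this entry (1-based, inclusive).
SEGMENTS = [
    ("Signal peptide", 1, 18),
    ("Virion surface", 19, 392),
    ("Transmembrane helix", 393, 413),
    ("Intravirion", 414, 424),
]
SEQUENCE_LENGTH = 424


def segment_length(start, end):
    """Length of an inclusive 1-based range."""
    return end - start + 1


def tiles_exactly(segments, total_length):
    """True if the segments cover 1..total_length with no gap or overlap."""
    expected_start = 1
    for _name, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


def locate(position, segments=SEGMENTS):
    """Return the name of the segment containing a 1-based position."""
    for name, start, end in segments:
        if start <= position <= end:
            return name
    raise ValueError(f"position {position} is outside the sequence")


for name, start, end in SEGMENTS:
    print(f"{name}: {start}-{end}, length {segment_length(start, end)}")
print("Segments tile the sequence:", tiles_exactly(SEGMENTS, SEQUENCE_LENGTH))
```

A lookup such as `locate(40)` places the annotated nucleophile on the virion-surface side of the membrane, which is consistent with the extracellular role described under Function.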

GO - Cellular component

Keywords - Cellular component

Host cell membrane, Host membrane, Membrane, Viral envelope protein, Virion

PTM / Processing

Molecule processing

Feature key      Position(s)    Description                               Length
Signal peptide   1 – 18         By similarity                             18
Chain            19 – 424       Hemagglutinin-esterase (PRO_0000037143)   406

Amino acid modifications

Feature key      Position(s)    Description                                         Length
Disulfide bond   44 ↔ 65        By similarity
Glycosylation    54             N-linked (GlcNAc...); by host (Sequence analysis)   1
Glycosylation    89             N-linked (GlcNAc...); by host (Sequence analysis)   1
Disulfide bond   113 ↔ 162      By similarity
Glycosylation    153            N-linked (GlcNAc...); by host (Sequence analysis)   1
Disulfide bond   197 ↔ 276      By similarity
Disulfide bond   205 ↔ 249      By similarity
Glycosylation    236            N-linked (GlcNAc...); by host (Sequence analysis)   1
Glycosylation    301            N-linked (GlcNAc...); by host (Sequence analysis)   1
Disulfide bond   307 ↔ 312      By similarity
Glycosylation    316            N-linked (GlcNAc...); by host (Sequence analysis)   1
Disulfide bond   347 ↔ 371      By similarity
Glycosylation    358            N-linked (GlcNAc...); by host (Sequence analysis)   1
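
Two of these annotation types lend themselves to a quick consistency check against the sequence: every disulfide partner should be a cysteine, and every N-linked glycosylation site should sit in an N-X-S/T sequon (X being any residue except proline). The sketch below is a naive motif scan written for illustration; it is not the prediction method behind the "Sequence analysis" evidence tag, and `find_sequons` is a hypothetical helper name.

```python
import re

# Sequence copied from this entry in its 10-residue display blocks.
SEQ_BLOCKS = """
MFLLLRFVLV SCIIGSLGFD NPPTNVVSHL NGDWFLFGDS RSDCNHVVNT
NPRNYSYMDL NPALCDSGKI SSKAGNSIFR SFHFTDFYNY TGEGQQIIFY
EGLNFTPYHA FKCTTSGSND IWMQNKGLFY TQVYKNMAVY RSLTFVNVPY
VYNGSAQSTA LCKSGSLVLN NPAYIAREAN FGDYYYKVEA DFYLSGCDEY
IVPLCIFNGK FLSNTKYYDD SQYYFNKDTG VIYGLNSTET ITTGFDFNCH
YLVLPSGNYL AISNELLLTV PTKAICLNKR KDFTPVQVVD SRWNNARQSD
NMTAVACQPP YCYFRNSTTN YVGVYDINHG DAGFTSILSG LLYDSPCFSQ
QGVFRYDNVS SVWPLYSYGR CPTAADINTP DVPICVYDPL PLILLGILLG
VAVIIIVVLL LYFMVDNGTR LHDA
"""
SEQ = "".join(SEQ_BLOCKS.split())

# Annotations as listed in the Amino acid modifications table (1-based).
ANNOTATED_GLYCOSYLATION = [54, 89, 153, 236, 301, 316, 358]
DISULFIDE_BONDS = [(44, 65), (113, 162), (197, 276),
                   (205, 249), (307, 312), (347, 371)]

# N, then any residue except P, then S or T. The lookahead keeps
# overlapping candidates from hiding one another.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(seq):
    """Return 1-based positions of the Asn in each N-X-S/T motif."""
    return [m.start() + 1 for m in SEQUON.finditer(seq)]


sequon_positions = find_sequons(SEQ)
unannotated = sorted(set(sequon_positions) - set(ANNOTATED_GLYCOSYLATION))
non_cys = [(a, b) for a, b in DISULFIDE_BONDS
           if SEQ[a - 1] != "C" or SEQ[b - 1] != "C"]

print("Sequon positions:", sequon_positions)
print("Motif hits without an annotation:", unannotated)
print("Disulfide pairs not on Cys:", non_cys)
```

On this sequence the scan also flags positions 104 and 417, which the entry does not list. Position 417 lies in the intravirion tail (414 – 424), so a motif there would not be expected to meet the glycosylation machinery; a bare motif match is only a candidate, not evidence of modification.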

Post-translational modification

N-glycosylated in the RER.

Keywords - PTM

Disulfide bond, Glycoprotein

Interaction

Subunit structure

Homodimer; disulfide-linked. Forms a complex with the M protein in the pre-Golgi. It then associates with the S-M complex to form the ternary complex S-M-HE.

Structure

3D structure databases

ProteinModelPortal: P59709.

Family & Domains

Region

Feature key    Position(s)    Description                                  Length
Region         7 – 127        Esterase domain first part (By similarity)    121
Region         128 – 266      Receptor binding (By similarity)              139
Region         267 – 379      Esterase domain second part (By similarity)   113

Sequence similarities

Keywords - Domain

Signal, Transmembrane, Transmembrane helix

Phylogenomic databases

KO: K19253.

Family and domain databases

InterPro:
IPR008980. Capsid_hemagglutn.
IPR007142. Hemagglutn-estrase_core.
IPR003860. Hemagglutn-estrase_hemagglutn.
IPR013830. SGNH_hydro.
Pfam:
PF03996. Hema_esterase. 1 hit.
PF02710. Hema_HEFG. 1 hit.
SUPFAM:
SSF49818. SSF49818. 1 hit.
SSF52266. SSF52266. 2 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P59709-1

        10         20         30         40         50
MFLLLRFVLV SCIIGSLGFD NPPTNVVSHL NGDWFLFGDS RSDCNHVVNT
        60         70         80         90        100
NPRNYSYMDL NPALCDSGKI SSKAGNSIFR SFHFTDFYNY TGEGQQIIFY
       110        120        130        140        150
EGLNFTPYHA FKCTTSGSND IWMQNKGLFY TQVYKNMAVY RSLTFVNVPY
       160        170        180        190        200
VYNGSAQSTA LCKSGSLVLN NPAYIAREAN FGDYYYKVEA DFYLSGCDEY
       210        220        230        240        250
IVPLCIFNGK FLSNTKYYDD SQYYFNKDTG VIYGLNSTET ITTGFDFNCH
       260        270        280        290        300
YLVLPSGNYL AISNELLLTV PTKAICLNKR KDFTPVQVVD SRWNNARQSD
       310        320        330        340        350
NMTAVACQPP YCYFRNSTTN YVGVYDINHG DAGFTSILSG LLYDSPCFSQ
       360        370        380        390        400
QGVFRYDNVS SVWPLYSYGR CPTAADINTP DVPICVYDPL PLILLGILLG
       410        420
VAVIIIVVLL LYFMVDNGTR LHDA
Length: 424
Mass (Da): 47,709
Last modified: June 16, 2003 - v1
Checksum: F1CE076E4AA7B277
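
The display format above interleaves position rulers with 10-residue blocks, so it has to be flattened before any downstream use. A minimal sketch of that step follows; the function name `parse_display_sequence` is hypothetical, and the code assumes ruler lines contain only numbers while sequence lines contain only residue letters.

```python
DISPLAY = """
        10         20         30         40         50
MFLLLRFVLV SCIIGSLGFD NPPTNVVSHL NGDWFLFGDS RSDCNHVVNT
        60         70         80         90        100
NPRNYSYMDL NPALCDSGKI SSKAGNSIFR SFHFTDFYNY TGEGQQIIFY
       110        120        130        140        150
EGLNFTPYHA FKCTTSGSND IWMQNKGLFY TQVYKNMAVY RSLTFVNVPY
       160        170        180        190        200
VYNGSAQSTA LCKSGSLVLN NPAYIAREAN FGDYYYKVEA DFYLSGCDEY
       210        220        230        240        250
IVPLCIFNGK FLSNTKYYDD SQYYFNKDTG VIYGLNSTET ITTGFDFNCH
       260        270        280        290        300
YLVLPSGNYL AISNELLLTV PTKAICLNKR KDFTPVQVVD SRWNNARQSD
       310        320        330        340        350
NMTAVACQPP YCYFRNSTTN YVGVYDINHG DAGFTSILSG LLYDSPCFSQ
       360        370        380        390        400
QGVFRYDNVS SVWPLYSYGR CPTAADINTP DVPICVYDPL PLILLGILLG
       410        420
VAVIIIVVLL LYFMVDNGTR LHDA
"""


def parse_display_sequence(text):
    """Flatten a ruler-annotated sequence display into one string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines (tokens that are all digits).
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


sequence = parse_display_sequence(DISPLAY)

# Boundaries from the Molecule processing table (1-based, inclusive).
signal_peptide = sequence[0:18]   # positions 1-18
mature_chain = sequence[18:]      # positions 19-424

print("Total length:", len(sequence))
print("Signal peptide:", signal_peptide)
print("Mature chain length:", len(mature_chain))
```

The recovered length can be compared with the Length field of the entry, and the mature chain length with the Chain feature, as a guard against copy or extraction errors.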

Natural variant

Feature key       Position(s)    Description                                                   Length
Natural variant   5              L → P in strain: Isolate BCQ.3, Isolate BCQ.1523, Isolate BCQ.2439, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.2590, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.    1
Natural variant   8              V → A in strain: Isolate BCQ.2590.    1
Natural variant   11             S → C in strain: Isolate BCQ.1523, Isolate BCQ.2442 and Isolate BCQ.2508.    1
Natural variant   49             N → T in strain: Isolate BCQ.376, Isolate BCQ.701, Isolate BCQ.1523, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.    1
Natural variant   53             R → P in strain: Isolate BCQ.2590.    1
Natural variant   66             D → G in strain: Isolate BCQ.376, Isolate BCQ.701, Isolate BCQ.3708 and Isolate BCQ.3994.    1
Natural variant   103            L → I in strain: Isolate BCQ.2439.    1
Natural variant   103            L → V in strain: Isolate BCQ.3, Isolate BCQ.376, Isolate BCQ.701, Isolate BCQ.1523, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.2590, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.    1
Natural variant   114            T → I in strain: Isolate BCQ.3708.    1
Natural variant   182            G → R in strain: Isolate BCQ.7373.    1
Natural variant   245            F → L in strain: Isolate BCQ.7373.    1
Natural variant   282            D → G in strain: Isolate BCQ.7373.    1
Natural variant   344            D → A in strain: Isolate BCQ.2590.    1
Natural variant   350            Q → R in strain: Isolate BCQ.2590.    1
Natural variant   367            S → P in strain: Isolate BCQ.376, Isolate BCQ.701, Isolate BCQ.1523, Isolate BCQ.2439, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.2590, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.    1
Natural variant   392            L → I in strain: Isolate BCQ.3, Isolate BCQ.376, Isolate BCQ.571, Isolate BCQ.701, Isolate BCQ.1523, Isolate BCQ.2439, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.    1
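
Each variant row names a position, the reference residue and its replacement, so the table can be validated against the reference sequence and used to derive an isolate's sequence. The sketch below does both. It assumes, as a simplification, that an isolate differs from the Quebec reference only at the positions listed for it in this table; `apply_variant` and `build_isolate` are hypothetical helper names.

```python
# Reference sequence copied from this entry in its display blocks.
SEQ_BLOCKS = """
MFLLLRFVLV SCIIGSLGFD NPPTNVVSHL NGDWFLFGDS RSDCNHVVNT
NPRNYSYMDL NPALCDSGKI SSKAGNSIFR SFHFTDFYNY TGEGQQIIFY
EGLNFTPYHA FKCTTSGSND IWMQNKGLFY TQVYKNMAVY RSLTFVNVPY
VYNGSAQSTA LCKSGSLVLN NPAYIAREAN FGDYYYKVEA DFYLSGCDEY
IVPLCIFNGK FLSNTKYYDD SQYYFNKDTG VIYGLNSTET ITTGFDFNCH
YLVLPSGNYL AISNELLLTV PTKAICLNKR KDFTPVQVVD SRWNNARQSD
NMTAVACQPP YCYFRNSTTN YVGVYDINHG DAGFTSILSG LLYDSPCFSQ
QGVFRYDNVS SVWPLYSYGR CPTAADINTP DVPICVYDPL PLILLGILLG
VAVIIIVVLL LYFMVDNGTR LHDA
"""
SEQ = "".join(SEQ_BLOCKS.split())

# (position, reference residue, variant residue), from the table above.
VARIANTS = [
    (5, "L", "P"), (8, "V", "A"), (11, "S", "C"), (49, "N", "T"),
    (53, "R", "P"), (66, "D", "G"), (103, "L", "I"), (103, "L", "V"),
    (114, "T", "I"), (182, "G", "R"), (245, "F", "L"), (282, "D", "G"),
    (344, "D", "A"), (350, "Q", "R"), (367, "S", "P"), (392, "L", "I"),
]

# Rows of the table that name Isolate BCQ.7373.
BCQ_7373 = [
    (5, "L", "P"), (49, "N", "T"), (103, "L", "V"), (182, "G", "R"),
    (245, "F", "L"), (282, "D", "G"), (367, "S", "P"), (392, "L", "I"),
]


def apply_variant(seq, pos, ref, alt):
    """Substitute one residue at a 1-based position, checking the reference."""
    if seq[pos - 1] != ref:
        raise ValueError(f"expected {ref} at {pos}, found {seq[pos - 1]}")
    return seq[:pos - 1] + alt + seq[pos:]


def build_isolate(seq, variants):
    """Apply a list of substitutions at distinct positions."""
    for pos, ref, alt in variants:
        seq = apply_variant(seq, pos, ref, alt)
    return seq


mismatches = [(p, r) for p, r, _a in VARIANTS if SEQ[p - 1] != r]
isolate = build_isolate(SEQ, BCQ_7373)
differences = sum(a != b for a, b in zip(SEQ, isolate))

print("Rows whose reference residue disagrees with the sequence:", mismatches)
print("Substitutions in the derived BCQ.7373 sequence:", differences)
```

Because `apply_variant` checks the reference residue first, passing both position-103 rows to `build_isolate` would raise an error on the second one; the two alternatives belong to different isolates and should not be combined.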

Sequence databases

EMBL / GenBank / DDBJ:
L38962 Genomic RNA. Translation: AAA92989.1.
L38963 Genomic RNA. Translation: AAA92990.1.
U06093 Genomic RNA. Translation: AAA92991.1.
AF239306 Genomic RNA. Translation: AAG40594.1.
AF239307 Genomic RNA. Translation: AAG40600.1.
AF230523 Genomic RNA. Translation: AAG40588.1.
AF230524 Genomic RNA. Translation: AAG40589.1.
AF230525 Genomic RNA. Translation: AAG40590.1.
AF230526 Genomic RNA. Translation: AAG40591.1.
AF230527 Genomic RNA. Translation: AAG40592.1.
AF230528 Genomic RNA. Translation: AAG40593.1.
AF339836 Genomic RNA. Translation: AAK14397.1.
AF220295 Genomic RNA. Translation: AAL40399.1.
PIR: A31684. HMIHBQ.
RefSeq: NP_150076.1. NC_003045.1.

Genome annotation databases

GeneID: 921684.
KEGG: vg:921684.

Entry information

Entry name: HEMA_CVBQ
Accession:
Primary (citable) accession number: P59709
Secondary accession number(s): P24351, Q66166, Q66167, Q66168, Q77NC5, Q98VL2, Q9DGT1, Q9DGT9, Q9DR84, Q9DRF4, Q9DRF5
Entry history:
Integrated into UniProtKB/Swiss-Prot: June 16, 2003
Last sequence update: June 16, 2003
Last modified: October 5, 2016
This is version 80 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Viral Protein Annotation Program

Miscellaneous

The sequence shown is that of the Quebec reference strain.

Keywords - Technical term

Complete proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.