
P59709 (HEMA_CVBQ) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 71.


Names and origin

Protein names
  Recommended name: Hemagglutinin-esterase
  Short name: HE protein
  EC: 3.1.1.53
  Alternative name(s): E3 glycoprotein
Gene names
  Name: HE
  ORF names: 2b
Organism: Bovine coronavirus (strain Quebec) (BCoV) (BCV) [Complete proteome]
Taxonomic identifier: 11133 [NCBI]
Taxonomic lineage: Viruses › ssRNA positive-strand viruses, no DNA stage › Nidovirales › Coronaviridae › Coronavirinae › Betacoronavirus
Virus host: Bos taurus (Bovine) [TaxID: 9913]

Protein attributes

Sequence length: 424 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Inferred from homology

General annotation (Comments)

Function

Structural protein that forms short spikes at the surface of the virus. Contains receptor-binding and receptor-destroying activities. Mediates de-O-acetylation of N-acetyl-9-O-acetylneuraminic acid, which is probably the receptor determinant recognized by the virus on the surface of erythrocytes and susceptible cells. This receptor-destroying activity is important for virus release, as it probably helps prevent self-aggregation and ensures the efficient spread of progeny virus from cell to cell. May serve as a secondary viral attachment protein for initiating infection, the spike protein being the major one. Seems to be a 'luxury' protein that is not absolutely necessary for virus infection in culture; however, its presence in the virus may alter its pathogenicity. May become a target for both the humoral and the cellular branches of the immune system (By similarity).

Catalytic activity

N-acetyl-O-acetylneuraminate + H2O = N-acetylneuraminate + acetate.

Subunit structure

Homodimer; disulfide-linked. Forms a complex with the M protein in the pre-Golgi. Then associates with the S-M complex to form the ternary complex S-M-HE.

Subcellular location

Virion membrane; single-pass type I membrane protein (Potential). Host cell membrane; single-pass type I membrane protein (Potential). Note: In infected cells, becomes incorporated into the envelope of virions during virus assembly at the endoplasmic reticulum and cis Golgi. However, some may escape incorporation into virions and subsequently migrate to the cell surface (By similarity).

Post-translational modification

N-glycosylated in the rough endoplasmic reticulum (RER).

Miscellaneous

The sequence shown is that of the Quebec reference strain.

Sequence similarities

Belongs to the influenza type C/coronaviruses hemagglutinin-esterase family.

Sequence annotation (Features)

Feature key | Position(s) | Length | Description | Feature identifier

Molecule processing

Signal peptide | 1 – 18 | 18 | (By similarity)
Chain | 19 – 424 | 406 | Hemagglutinin-esterase | PRO_0000037143

Regions

Topological domain | 19 – 392 | 374 | Virion surface (Potential)
Transmembrane | 393 – 413 | 21 | Helical (Potential)
Topological domain | 414 – 424 | 11 | Intravirion (Potential)
Region | 7 – 127 | 121 | Esterase domain first part (By similarity)
Region | 128 – 266 | 139 | Receptor binding (By similarity)
Region | 267 – 379 | 113 | Esterase domain second part (By similarity)
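
The positions in the feature table are 1-based and inclusive, whereas Python slices are 0-based and half-open. The following is a minimal sketch of that conversion, using the sequence from the Sequences section of this entry. The helper name `feature` is a hypothetical choice for illustration and is not part of any UniProt tooling.

```python
# Sequence of P59709 as displayed in the Sequences section (424 residues).
SEQ = (
    "MFLLLRFVLVSCIIGSLGFDNPPTNVVSHLNGDWFLFGDSRSDCNHVVNTNPRNYSYMDL"
    "NPALCDSGKISSKAGNSIFRSFHFTDFYNYTGEGQQIIFYEGLNFTPYHAFKCTTSGSND"
    "IWMQNKGLFYTQVYKNMAVYRSLTFVNVPYVYNGSAQSTALCKSGSLVLNNPAYIAREAN"
    "FGDYYYKVEADFYLSGCDEYIVPLCIFNGKFLSNTKYYDDSQYYFNKDTGVIYGLNSTET"
    "ITTGFDFNCHYLVLPSGNYLAISNELLLTVPTKAICLNKRKDFTPVQVVDSRWNNARQSD"
    "NMTAVACQPPYCYFRNSTTNYVGVYDINHGDAGFTSILSGLLYDSPCFSQQGVFRYDNVS"
    "SVWPLYSYGRCPTAADINTPDVPICVYDPLPLILLGILLGVAVIIIVVLLLYFMVDNGTR"
    "LHDA"
)


def feature(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive (hypothetical helper)."""
    return seq[start - 1:end]


signal_peptide = feature(SEQ, 1, 18)
mature_chain = feature(SEQ, 19, 424)
transmembrane = feature(SEQ, 393, 413)

# The lengths should agree with the Length column of the feature table.
print(len(signal_peptide), len(mature_chain), len(transmembrane))
print(transmembrane)
```

Because the signal peptide and the chain are adjacent, concatenating the two slices gives back the full displayed sequence.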

Sites

Active site | 40 | 1 | Nucleophile (By similarity)
Active site | 228 | 1 | Charge relay system (By similarity)
Active site | 329 | 1 | Charge relay system (By similarity)

Amino acid modifications

Glycosylation | 54 | 1 | N-linked (GlcNAc...); by host (Potential)
Glycosylation | 89 | 1 | N-linked (GlcNAc...); by host (Potential)
Glycosylation | 153 | 1 | N-linked (GlcNAc...); by host (Potential)
Glycosylation | 236 | 1 | N-linked (GlcNAc...); by host (Potential)
Glycosylation | 301 | 1 | N-linked (GlcNAc...); by host (Potential)
Glycosylation | 316 | 1 | N-linked (GlcNAc...); by host (Potential)
Glycosylation | 358 | 1 | N-linked (GlcNAc...); by host (Potential)
Disulfide bond | 44 ↔ 65 |  | (By similarity)
Disulfide bond | 113 ↔ 162 |  | (By similarity)
Disulfide bond | 197 ↔ 276 |  | (By similarity)
Disulfide bond | 205 ↔ 249 |  | (By similarity)
Disulfide bond | 307 ↔ 312 |  | (By similarity)
Disulfide bond | 347 ↔ 371 |  | (By similarity)
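
The annotated modification sites can be sanity-checked against the displayed sequence. The sketch below assumes the commonly used N-linked glycosylation sequon N-X-S/T (X is any residue except proline) and simply confirms that each listed glycosylation position fits that motif and that each disulfide partner is a cysteine. The function name `is_sequon` is hypothetical, and the check says nothing about whether a site is actually modified.

```python
# Sequence of P59709 as displayed in the Sequences section (424 residues).
SEQ = (
    "MFLLLRFVLVSCIIGSLGFDNPPTNVVSHLNGDWFLFGDSRSDCNHVVNTNPRNYSYMDL"
    "NPALCDSGKISSKAGNSIFRSFHFTDFYNYTGEGQQIIFYEGLNFTPYHAFKCTTSGSND"
    "IWMQNKGLFYTQVYKNMAVYRSLTFVNVPYVYNGSAQSTALCKSGSLVLNNPAYIAREAN"
    "FGDYYYKVEADFYLSGCDEYIVPLCIFNGKFLSNTKYYDDSQYYFNKDTGVIYGLNSTET"
    "ITTGFDFNCHYLVLPSGNYLAISNELLLTVPTKAICLNKRKDFTPVQVVDSRWNNARQSD"
    "NMTAVACQPPYCYFRNSTTNYVGVYDINHGDAGFTSILSGLLYDSPCFSQQGVFRYDNVS"
    "SVWPLYSYGRCPTAADINTPDVPICVYDPLPLILLGILLGVAVIIIVVLLLYFMVDNGTR"
    "LHDA"
)

# 1-based positions copied from the feature table of this entry.
GLYCOSYLATION_SITES = [54, 89, 153, 236, 301, 316, 358]
DISULFIDE_BONDS = [(44, 65), (113, 162), (197, 276),
                   (205, 249), (307, 312), (347, 371)]


def is_sequon(seq: str, pos: int) -> bool:
    """True if 1-based position pos starts an N-X-[S/T] motif with X != P."""
    i = pos - 1
    return (
        i + 2 < len(seq)
        and seq[i] == "N"
        and seq[i + 1] != "P"
        and seq[i + 2] in "ST"
    )


# Map each annotated site to whether the sequence supports it.
sequon_ok = {pos: is_sequon(SEQ, pos) for pos in GLYCOSYLATION_SITES}
cysteine_ok = {
    (a, b): SEQ[a - 1] == "C" and SEQ[b - 1] == "C"
    for a, b in DISULFIDE_BONDS
}

print(sequon_ok)
print(cysteine_ok)
```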

Natural variations

Natural variant | 5 | 1 | L → P in strain: Isolate BCQ.3, Isolate BCQ.1523, Isolate BCQ.2439, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.2590, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.
Natural variant | 8 | 1 | V → A in strain: Isolate BCQ.2590.
Natural variant | 11 | 1 | S → C in strain: Isolate BCQ.1523, Isolate BCQ.2442 and Isolate BCQ.2508.
Natural variant | 49 | 1 | N → T in strain: Isolate BCQ.376, Isolate BCQ.701, Isolate BCQ.1523, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.
Natural variant | 53 | 1 | R → P in strain: Isolate BCQ.2590.
Natural variant | 66 | 1 | D → G in strain: Isolate BCQ.376, Isolate BCQ.701, Isolate BCQ.3708 and Isolate BCQ.3994.
Natural variant | 103 | 1 | L → I in strain: Isolate BCQ.2439.
Natural variant | 103 | 1 | L → V in strain: Isolate BCQ.3, Isolate BCQ.376, Isolate BCQ.701, Isolate BCQ.1523, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.2590, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.
Natural variant | 114 | 1 | T → I in strain: Isolate BCQ.3708.
Natural variant | 182 | 1 | G → R in strain: Isolate BCQ.7373.
Natural variant | 245 | 1 | F → L in strain: Isolate BCQ.7373.
Natural variant | 282 | 1 | D → G in strain: Isolate BCQ.7373.
Natural variant | 344 | 1 | D → A in strain: Isolate BCQ.2590.
Natural variant | 350 | 1 | Q → R in strain: Isolate BCQ.2590.
Natural variant | 367 | 1 | S → P in strain: Isolate BCQ.376, Isolate BCQ.701, Isolate BCQ.1523, Isolate BCQ.2439, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.2590, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.
Natural variant | 392 | 1 | L → I in strain: Isolate BCQ.3, Isolate BCQ.376, Isolate BCQ.571, Isolate BCQ.701, Isolate BCQ.1523, Isolate BCQ.2439, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.3708, Isolate BCQ.3994 and Isolate BCQ.7373.
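
Each natural variant is a single-residue substitution relative to the displayed reference sequence, so an isolate's sequence can be approximated by applying its listed substitutions. The sketch below does this for Isolate BCQ.7373 and checks the reference residue at each position first. The function name `apply_variants` is hypothetical, and the result matches the real isolate only to the extent that the variant list in this entry is complete.

```python
# Sequence of P59709 as displayed in the Sequences section (424 residues).
SEQ = (
    "MFLLLRFVLVSCIIGSLGFDNPPTNVVSHLNGDWFLFGDSRSDCNHVVNTNPRNYSYMDL"
    "NPALCDSGKISSKAGNSIFRSFHFTDFYNYTGEGQQIIFYEGLNFTPYHAFKCTTSGSND"
    "IWMQNKGLFYTQVYKNMAVYRSLTFVNVPYVYNGSAQSTALCKSGSLVLNNPAYIAREAN"
    "FGDYYYKVEADFYLSGCDEYIVPLCIFNGKFLSNTKYYDDSQYYFNKDTGVIYGLNSTET"
    "ITTGFDFNCHYLVLPSGNYLAISNELLLTVPTKAICLNKRKDFTPVQVVDSRWNNARQSD"
    "NMTAVACQPPYCYFRNSTTNYVGVYDINHGDAGFTSILSGLLYDSPCFSQQGVFRYDNVS"
    "SVWPLYSYGRCPTAADINTPDVPICVYDPLPLILLGILLGVAVIIIVVLLLYFMVDNGTR"
    "LHDA"
)

# Substitutions the variant table lists for Isolate BCQ.7373:
# (1-based position, reference residue, variant residue).
BCQ_7373 = [
    (5, "L", "P"), (49, "N", "T"), (103, "L", "V"), (182, "G", "R"),
    (245, "F", "L"), (282, "D", "G"), (367, "S", "P"), (392, "L", "I"),
]


def apply_variants(seq, variants):
    """Apply substitutions, verifying the reference residue at each position."""
    residues = list(seq)
    for pos, ref, alt in variants:
        found = residues[pos - 1]
        if found != ref:
            raise ValueError(f"expected {ref} at position {pos}, found {found}")
        residues[pos - 1] = alt
    return "".join(residues)


isolate_seq = apply_variants(SEQ, BCQ_7373)

# Positions (1-based) where the isolate differs from the reference strain.
differences = [i + 1 for i, (a, b) in enumerate(zip(SEQ, isolate_seq)) if a != b]
print(differences)
```

Raising an error on a reference mismatch guards against off-by-one mistakes when positions are copied from the table by hand.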

Sequences

P59709 [UniParc].

Length: 424
Mass (Da): 47,709
Last modified June 16, 2003. Version 1.
Checksum: F1CE076E4AA7B277
        10         20         30         40         50         60 
MFLLLRFVLV SCIIGSLGFD NPPTNVVSHL NGDWFLFGDS RSDCNHVVNT NPRNYSYMDL 

        70         80         90        100        110        120 
NPALCDSGKI SSKAGNSIFR SFHFTDFYNY TGEGQQIIFY EGLNFTPYHA FKCTTSGSND 

       130        140        150        160        170        180 
IWMQNKGLFY TQVYKNMAVY RSLTFVNVPY VYNGSAQSTA LCKSGSLVLN NPAYIAREAN 

       190        200        210        220        230        240 
FGDYYYKVEA DFYLSGCDEY IVPLCIFNGK FLSNTKYYDD SQYYFNKDTG VIYGLNSTET 

       250        260        270        280        290        300 
ITTGFDFNCH YLVLPSGNYL AISNELLLTV PTKAICLNKR KDFTPVQVVD SRWNNARQSD 

       310        320        330        340        350        360 
NMTAVACQPP YCYFRNSTTN YVGVYDINHG DAGFTSILSG LLYDSPCFSQ QGVFRYDNVS 

       370        380        390        400        410        420 
SVWPLYSYGR CPTAADINTP DVPICVYDPL PLILLGILLG VAVIIIVVLL LYFMVDNGTR 


LHDA 
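
The display above interleaves position rulers with 10-residue groups, which most tools will not accept directly. The following is a minimal sketch of collapsing it into a plain string, assuming the block is pasted verbatim. `parse_display_block` is a hypothetical helper name, not a UniProt or library function.

```python
import re

# Sequence block as displayed in this entry (ruler lines plus 10-residue groups).
DISPLAY_BLOCK = """
        10         20         30         40         50         60
MFLLLRFVLV SCIIGSLGFD NPPTNVVSHL NGDWFLFGDS RSDCNHVVNT NPRNYSYMDL

        70         80         90        100        110        120
NPALCDSGKI SSKAGNSIFR SFHFTDFYNY TGEGQQIIFY EGLNFTPYHA FKCTTSGSND

       130        140        150        160        170        180
IWMQNKGLFY TQVYKNMAVY RSLTFVNVPY VYNGSAQSTA LCKSGSLVLN NPAYIAREAN

       190        200        210        220        230        240
FGDYYYKVEA DFYLSGCDEY IVPLCIFNGK FLSNTKYYDD SQYYFNKDTG VIYGLNSTET

       250        260        270        280        290        300
ITTGFDFNCH YLVLPSGNYL AISNELLLTV PTKAICLNKR KDFTPVQVVD SRWNNARQSD

       310        320        330        340        350        360
NMTAVACQPP YCYFRNSTTN YVGVYDINHG DAGFTSILSG LLYDSPCFSQ QGVFRYDNVS

       370        380        390        400        410        420
SVWPLYSYGR CPTAADINTP DVPICVYDPL PLILLGILLG VAVIIIVVLL LYFMVDNGTR

LHDA
"""


def parse_display_block(text: str) -> str:
    """Drop ruler lines and whitespace, keeping only residue letters."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Ruler lines contain only digits and spaces; skip them and blank lines.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        residues.append(re.sub(r"\s+", "", stripped))
    return "".join(residues)


sequence = parse_display_block(DISPLAY_BLOCK)
print(len(sequence))  # should agree with the 424 AA reported by the entry
```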


References

[1] "Cloning and in vitro expression of the gene for the E3 haemagglutinin glycoprotein of bovine coronavirus."
Parker M.D., Cox G.J., Deregt D., Fitzpatrick D.R., Babiuk L.A.
J. Gen. Virol. 70:155-164(1989)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
[2] "Comparison of bovine coronavirus isolates associated with neonatal calf diarrhoea and winter dysentery in adult dairy cattle in Quebec."
Dea S., Michaud L., Milane G.
J. Gen. Virol. 76:1263-1270(1995)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
Strain: Isolate BCQ.2590, Isolate BCQ.3 and Isolate BCQ.571.
[3] "Identification of specific variations within the HE, S1, and ORF4 genes of bovine coronaviruses associated with enteric and respiratory diseases in dairy cattle."
Gelinas A.-M., Sasseville A.M.-J., Dea S.
Adv. Exp. Med. Biol. 494:63-67(2001)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
Strain: Isolate BCQ.1523, Isolate BCQ.2590, Isolate BCQ.3994, Isolate BCQ.571 and Isolate BCQ.7373.
[4] "Genomic and antigenic variations of the HE glycoprotein of bovine coronaviruses associated with neonatal calf diarrhea and winter dysentery."
Kourtesis A.B., Gelinas A.-M., Dea S.
Arch. Virol. 146:1219-1230(2001)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
Strain: Isolate BCQ.1523, Isolate BCQ.2439, Isolate BCQ.2442, Isolate BCQ.2508, Isolate BCQ.3708, Isolate BCQ.376, Isolate BCQ.701 and Isolate BCQ.7373.
[5] "Bovine coronaviruses associated with enteric and respiratory diseases in Canadian dairy cattle display different reactivities to anti-HE monoclonal antibodies and distinct amino acid changes in their HE, S and ns4.9 protein."
Gelinas A.-M., Boutin M., Sasseville A.M.-J., Dea S.
Virus Res. 76:43-57(2001)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
Strain: Isolate BCQ.3994.
[6] "Full-length genomic sequence of bovine coronavirus (31 kb). Completion of the open reading frame 1a/1b sequences."
Yoo D., Pei Y.
Adv. Exp. Med. Biol. 494:73-76(2001)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  L38962 Genomic RNA. Translation: AAA92989.1.
  L38963 Genomic RNA. Translation: AAA92990.1.
  U06093 Genomic RNA. Translation: AAA92991.1.
  AF239306 Genomic RNA. Translation: AAG40594.1.
  AF239307 Genomic RNA. Translation: AAG40600.1.
  AF230523 Genomic RNA. Translation: AAG40588.1.
  AF230524 Genomic RNA. Translation: AAG40589.1.
  AF230525 Genomic RNA. Translation: AAG40590.1.
  AF230526 Genomic RNA. Translation: AAG40591.1.
  AF230527 Genomic RNA. Translation: AAG40592.1.
  AF230528 Genomic RNA. Translation: AAG40593.1.
  AF339836 Genomic RNA. Translation: AAK14397.1.
  AF220295 Genomic RNA. Translation: AAL40399.1.
PIR: HMIHBQ. A31684.
RefSeq: NP_150076.1. NC_003045.1.

3D structure databases

ProteinModelPortal: P59709.
SMR: P59709. Positions 19-376.

Genome annotation databases

GeneID: 921684.

Family and domain databases

InterPro:
  IPR008980. Capsid_hemagglutn.
  IPR007142. Hemagglutn-estrase_core.
  IPR003860. Hemagglutn-estrase_hemagglutn.
Pfam:
  PF03996. Hema_esterase. 1 hit.
  PF02710. Hema_HEFG. 1 hit.
SUPFAM: SSF49818. SSF49818. 1 hit.

Entry information

Entry name: HEMA_CVBQ
Accession
  Primary (citable) accession number: P59709
  Secondary accession number(s): P24351, Q66166, Q66167, Q66168, Q77NC5, Q98VL2, Q9DGT1, Q9DGT9, Q9DR84, Q9DRF4, Q9DRF5
Entry history
  Integrated into UniProtKB/Swiss-Prot: June 16, 2003
  Last sequence update: June 16, 2003
  Last modified: April 16, 2014
  This is version 71 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Viral Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families