Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Major allergen Equ c 1

Gene
N/A
Organism
Equus caballus (Horse)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

GO - Molecular functioni

Complete GO annotation...

Keywords - Biological processi

Transport

Names & Taxonomyi

Protein namesi
Recommended name:
Major allergen Equ c 1
Alternative name(s):
Allergen: Equ c 1
OrganismiEquus caballus (Horse)
Taxonomic identifieri9796 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaLaurasiatheriaPerissodactylaEquidaeEquus
Proteomesi
  • UP000002281 Componenti: Unplaced

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Secreted

Pathology & Biotechi

Allergenic propertiesi

Causes an allergic reaction in human. Potent allergen responsible for about 80% of anti-horse IgE antibody response in patients who are chronically exposed to horse allergens.1 Publication

Keywords - Diseasei

Allergen

Protein family/group databases

Allergomei3306. Equ c 1.0101.
331. Equ c 1.

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 15Or 16, or 21Add BLAST15
ChainiPRO_000001798216 – 187Major allergen Equ c 1Add BLAST172

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Glycosylationi53N-linked (GlcNAc...)Sequence analysis1
Glycosylationi68N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi83 ↔ 176

Post-translational modificationi

Several N-terminal ends may be due to cleavage by signal peptidase at different sites or may be generated by proteolytic processing of the secreted protein.
Analysis of the sugar composition shows the presence of GalNAc, Gal, NeuAc, GlcNAc, and Man. May be also O-glycosylated.

Keywords - PTMi

Disulfide bond, Glycoprotein

Proteomic databases

PaxDbiQ95182.

Expressioni

Tissue specificityi

Expressed in liver and in sublingual and submaxillary salivary glands. Highly concentrated in secretory fluid such as saliva and urine as well as in hair dandruff extract.

Interactioni

Subunit structurei

Homodimer.

Protein-protein interaction databases

STRINGi9796.ENSECAP00000000397.

Structurei

Secondary structure

1187
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Beta strandi25 – 27Combined sources3
Helixi30 – 32Combined sources3
Beta strandi38 – 46Combined sources9
Helixi47 – 49Combined sources3
Beta strandi59 – 65Combined sources7
Beta strandi71 – 79Combined sources9
Beta strandi82 – 92Combined sources11
Beta strandi98 – 116Combined sources19
Beta strandi119 – 126Combined sources8
Beta strandi133 – 144Combined sources12
Helixi147 – 159Combined sources13
Helixi164 – 166Combined sources3
Beta strandi167 – 169Combined sources3
Helixi170 – 172Combined sources3
Helixi177 – 179Combined sources3

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
1EW3X-ray2.30A23-181[»]
ProteinModelPortaliQ95182.
SMRiQ95182.
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiQ95182.

Family & Domainsi

Sequence similaritiesi

Belongs to the calycin superfamily. Lipocalin family.Curated

Keywords - Domaini

Signal

Phylogenomic databases

eggNOGiENOG410J5XW. Eukaryota.
ENOG411154J. LUCA.
HOGENOMiHOG000231458.
HOVERGENiHBG000215.
OMAiFHTKVNG.

Family and domain databases

Gene3Di2.40.128.20. 1 hit.
InterProiIPR012674. Calycin.
IPR011038. Calycin-like.
IPR002345. Lipocalin.
IPR022272. Lipocalin_CS.
IPR000566. Lipocln_cytosolic_FA-bd_dom.
IPR002971. Maj_urinary.
[Graphical view]
PfamiPF00061. Lipocalin. 1 hit.
[Graphical view]
PRINTSiPR00179. LIPOCALIN.
PR01221. MAJORURINARY.
SUPFAMiSSF50814. SSF50814. 1 hit.
PROSITEiPS00213. LIPOCALIN. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

Q95182-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MKLLLLCLGL ILVCAQQEEN SDVAIRNFDI SKISGEWYSI FLASDVKEKI
60 70 80 90 100
EENGSMRVFV DVIRALDNSS LYAEYQTKVN GECTEFPMVF DKTEEDGVYS
110 120 130 140 150
LNYDGYNVFR ISEFENDEHI ILYLVNFDKD RPFQLFEFYA REPDVSPEIK
160 170 180
EEFVKIVQKR GIVKENIIDL TKIDRCFQLR GNGVAQA
Length:187
Mass (Da):21,696
Last modified:February 1, 1997 - v1
Checksum:i28898F744732FF71
GO

Mass spectrometryi

Molecular mass is 22000 Da from positions 16 - 187. Determined by ESI. 1 Publication

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural varianti62V → L.1
Natural varianti90F → A.1
Natural varianti136F → L.1
Natural varianti146S → D.1
Natural varianti172 – 173KI → QT.2

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U70823 mRNA. Translation: AAC48691.1.
RefSeqiNP_001075966.1. NM_001082497.2.
UniGeneiEca.13041.

Genome annotation databases

GeneIDi100034197.
KEGGiecb:100034197.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U70823 mRNA. Translation: AAC48691.1.
RefSeqiNP_001075966.1. NM_001082497.2.
UniGeneiEca.13041.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
1EW3X-ray2.30A23-181[»]
ProteinModelPortaliQ95182.
SMRiQ95182.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi9796.ENSECAP00000000397.

Protein family/group databases

Allergomei3306. Equ c 1.0101.
331. Equ c 1.

Proteomic databases

PaxDbiQ95182.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

GeneIDi100034197.
KEGGiecb:100034197.

Phylogenomic databases

eggNOGiENOG410J5XW. Eukaryota.
ENOG411154J. LUCA.
HOGENOMiHOG000231458.
HOVERGENiHBG000215.
OMAiFHTKVNG.

Miscellaneous databases

EvolutionaryTraceiQ95182.

Family and domain databases

Gene3Di2.40.128.20. 1 hit.
InterProiIPR012674. Calycin.
IPR011038. Calycin-like.
IPR002345. Lipocalin.
IPR022272. Lipocalin_CS.
IPR000566. Lipocln_cytosolic_FA-bd_dom.
IPR002971. Maj_urinary.
[Graphical view]
PfamiPF00061. Lipocalin. 1 hit.
[Graphical view]
PRINTSiPR00179. LIPOCALIN.
PR01221. MAJORURINARY.
SUPFAMiSSF50814. SSF50814. 1 hit.
PROSITEiPS00213. LIPOCALIN. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiALL1_HORSE
AccessioniPrimary (citable) accession number: Q95182
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 15, 1998
Last sequence update: February 1, 1997
Last modified: November 2, 2016
This is version 111 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Allergens
    Nomenclature of allergens and list of entries
  2. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.