UniProt

Q98UI9 - MUC5B_CHICK

Protein: Mucin-5B
Gene: MUC5B
Organism: Gallus gallus (Chicken)
Status: Reviewed - Annotation score: 4 out of 5 - Experimental evidence at protein level

Function

Ovomucin, the glycoprotein responsible for the gel properties of egg white, is composed of 2 subunits, alpha-ovomucin/MUC5B and beta-ovomucin/MUC6. (1 publication)

Sites

Feature key | Position(s) | Length | Description
Site | 69 – 69 | 1 | Not glycosylated
Site | 673 – 673 | 1 | Not glycosylated

Names & Taxonomy

Protein names
Recommended name: Mucin-5B
Alternative name(s): Ovomucin, alpha-subunit
Gene names
Name: MUC5B
Organism: Gallus gallus (Chicken)
Taxonomic identifier: 9031 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Testudines + Archosauria group > Archosauria > Dinosauria > Saurischia > Theropoda > Coelurosauria > Aves > Neognathae > Galliformes > Phasianidae > Phasianinae > Gallus
Proteomes: UP000000539: Unplaced

Subcellular location

GO - Cellular component

  1. extracellular region Source: UniProtKB-SubCell

Keywords - Cellular component

Secreted

Pathology & Biotech

Protein family/group databases

Allergome: 2741. Gal d Ovomucin.

PTM / Processing

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Signal peptide | 1 – 21 | 21 | Reviewed prediction | -
Chain | 22 – 2108 | 2087 | Mucin-5B | PRO_5000049585

Amino acid modifications

Feature key | Position(s) | Length | Description | Evidence
Disulfide bond | 60 ↔ 68 | - | - | By similarity
Glycosylation | 381 – 381 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 528 – 528 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 599 – 599 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 680 – 680 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 772 – 772 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 855 – 855 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1036 – 1036 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1219 – 1219 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1371 – 1371 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1452 – 1452 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1567 – 1567 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1639 – 1639 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1792 – 1792 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1807 – 1807 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1841 – 1841 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Glycosylation | 1964 – 1964 | 1 | N-linked (GlcNAc...) (complex) | 1 publication
Disulfide bond | 2010 ↔ 2066 | - | - | By similarity
Disulfide bond | 2031 ↔ 2080 | - | - | By similarity
Disulfide bond | 2042 ↔ 2096 | - | - | By similarity
Disulfide bond | 2046 ↔ 2098 | - | - | By similarity

Post-translational modification

N-glycosylated. Complex glycosylation with bisecting N-acetylglucosamine. Contains mainly N-acetylglucosamine (3.1-8.5%) and mannose (2.9-4.6%), with small amounts of galactose (1.1-4.3%) and sialic acid (0.3-1.3%). The most abundant glycan is composed of a GlcNAc2Man3 core, a bisecting GlcNAc and another 3 GlcNAc antennae located on the mannoses of the core. Site Asn-1639 exists in both glycosylated and non-glycosylated forms. (2 publications)

Keywords - PTM

Disulfide bond, Glycoprotein

Proteomic databases

PaxDb: Q98UI9.
PRIDE: Q98UI9.

Interaction

Subunit structure

Multimer; disulfide-linked. (1 publication)

Protein-protein interaction databases

STRING: 9031.ENSGALP00000010852.

Structure

3D structure databases

ProteinModelPortal: Q98UI9.

Family & Domains

Domains and Repeats

Feature key | Position(s) | Length | Description
Domain | 37 – 242 | 206 | VWFD 1
Domain | 304 – 360 | 57 | TIL 1
Domain | 399 – 610 | 212 | VWFD 2
Domain | 666 – 723 | 58 | TIL 2
Domain | 782 – 825 | 44 | TIL 3
Domain | 825 – 897 | 73 | VWFC 1
Domain | 864 – 1069 | 206 | VWFD 3
Domain | 1430 – 1646 | 217 | VWFD 4
Domain | 1761 – 1832 | 72 | VWFC 2
Domain | 1870 – 1937 | 68 | VWFC 3
Domain | 2010 – 2104 | 95 | CTCK
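
The Length column in the feature tables follows from the inclusive position range (length = end - start + 1). The following is a minimal Python sketch of that arithmetic, assuming the domain ranges are transcribed by hand from the table above. The names `DOMAINS`, `feature_length` and `overlapping_pairs` are hypothetical helpers introduced for illustration, not part of any UniProt tool.

```python
# Domain ranges transcribed by hand from the table above (1-based, inclusive).
DOMAINS = [
    ("VWFD 1", 37, 242),
    ("TIL 1", 304, 360),
    ("VWFD 2", 399, 610),
    ("TIL 2", 666, 723),
    ("TIL 3", 782, 825),
    ("VWFC 1", 825, 897),
    ("VWFD 3", 864, 1069),
    ("VWFD 4", 1430, 1646),
    ("VWFC 2", 1761, 1832),
    ("VWFC 3", 1870, 1937),
    ("CTCK", 2010, 2104),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive start and end positions."""
    return end - start + 1


def overlapping_pairs(domains):
    """Return the names of domain pairs whose inclusive ranges share a residue."""
    pairs = []
    for i, (name_a, start_a, end_a) in enumerate(domains):
        for name_b, start_b, end_b in domains[i + 1:]:
            if start_a <= end_b and start_b <= end_a:
                pairs.append((name_a, name_b))
    return pairs


if __name__ == "__main__":
    for name, start, end in DOMAINS:
        print(f"{name}: {feature_length(start, end)} residues")
    print(overlapping_pairs(DOMAINS))
```

As transcribed, TIL 3 and VWFC 1 both include residue 825, and VWFC 1 (825 – 897) overlaps VWFD 3 (864 – 1069), so the overlap check is a quick way to flag ranges worth re-verifying against the source entry.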

Sequence similarities

Contains 3 VWFC domains.
Contains 4 VWFD domains.

Keywords - Domain

Repeat, Signal

Phylogenomic databases

eggNOG: NOG12793.
HOGENOM: HOG000168234.
HOVERGEN: HBG004380.
InParanoid: Q98UI9.
PhylomeDB: Q98UI9.

Family and domain databases

InterPro: IPR006207. Cys_knot_C.
    IPR002919. TIL_dom.
    IPR014853. Unchr_dom_Cys-rich.
    IPR001007. VWF_C.
    IPR001846. VWF_type-D.
Pfam: PF08742. C8. 4 hits.
    PF01826. TIL. 3 hits.
    PF00094. VWD. 4 hits.
SMART: SM00832. C8. 4 hits.
    SM00041. CT. 1 hit.
    SM00214. VWC. 5 hits.
    SM00216. VWD. 4 hits.
SUPFAM: SSF57567. SSF57567. 5 hits.
PROSITE: PS01225. CTCK_2. 1 hit.
    PS01208. VWFC_1. 2 hits.
    PS50184. VWFC_2. 2 hits.
    PS51233. VWFD. 4 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q98UI9-1 [UniParc]

MEIKKERSFW IFCLIWSFCK GKEPVQIVQV STVGRSECTT WGNFHFHTFD 50
HVKFTFPGTC TYVFASHCND SYQDFNIKIR RSDKNSHLIY FTVTTDGVIL 100
EVKETGITVN GNQIPLPFSL KSILIEDTCA YFQVTSKLGL TLKWNWADTL 150
LLDLEETYKE KICGLCGNYD GNKKNDLILD GYKMHPRQFG NFHKVEDPSE 200
KCPDVRPDDH TGRHPTEDDN RCSKYKKMCK KLLSRFGNCP KVVAFDDYVA 250
TCTEDMCNCV VNSSQSDLVS SCICSTLNQY SRDCVLSKGD PGEWRTKELC 300
YQECPSNMEY MECGNSCADT CADPERSKIC KAPCTDGCFC PPGTILDDLG 350
GKKCVPRDSC PCMFQGKVYS SGGTYSTPCQ NCTCKGGHWS CISLPCSGSC 400
SIDGGFHIKT FDNKKFNFHG NCHYVLAKNT DDTFVVIGEI IQCGTSKTMT 450
CLKNVLVTLG RTTIKICSCG SIYMNNFIVK LPVSKDGITI FRPSTFFIKI 500
LSSAGVQIRV QMKPVMQLSI TVDHSYQNRT SGLCGNFNNI QTDDFRTATG 550
AVEDSAAAFG NSWKTRASCF DVEDSFEDPC SNSVDKEKFA QHWCALLSNT 600
SSTFAACHSV VDPSVYIKRC MYDTCNAEKS EVALCSVLST YSRDCAAAGM 650
TLKGWRQGIC DPSEECPETM VYNYSVKYCN QSCRSLDEPD PLCKVQIAPM 700
EGCGCPEGTY LNDEEECVTP DDCPCYYKGK IVQPGNSFQE DKLLCKCIQG 750
RLDCIGETVL VKDCPAPMYY FNCSSAGPGA IGSECQKSCK TQDMHCYVTE 800
CVSGCMCPDG LVLDGSGGCI PKDQCPCVHG GHFYKPGETI RVDCNTCTCN 850
KRQWNCTDNP CKGTCTVYGN GHYMSFDGEK FDFLGDCDYI LAQDFCPNNM 900
DAGTFRIVIQ NNACGKSLSI CSLKITLIFE SSEIRLLEGR IQEIATDPGA 950
EKNYKVDLRG GYIVIETTQG MSFMWDQKTT VVVHVTPSFQ GKVCGLCGDF 1000
DGRSRNDFTT RGQSVEMSIQ EFGNSWKITS TCSNINMTDL CADQPFKSAL 1050
GQKHCSIIKS SVFEACHSKV NPIPYYESCV SDFCGCDSVG DCECFCTSVA 1100
AYARSCSTAG VCINWRTPAI CPVFCDYYNP PDKHEWFYKP CGAPCLKTCR 1150
NPQGKCGNIL YSLEGCYPEC SPDKPYFDEE RRECVSLPDC TSCNPEEKLC 1200
TEDSKDCLCC YNGKTYPLNE TIYSQTEGTK CGNAFCGPNG MIIETFIPCS 1250
TLSVPAQEQL MQPVTSAPLL STEATPCFCT DNGQLIQMGE NVSLPMNISG 1300
HCAYSICNAS CQIELIWAEC KVVQTEALET CEPNSEACPP TAAPNATSLV 1350
PATALAPMSD CLGLIPPRKF NESWDFGNCQ IATCLGEENN IKLSSITCPP 1400
QQLKLCVNGF PFMKHHDETG CCEVFECQCI CSGWGNEHYV TFDGTYYHFK 1450
ENCTYVLVEL IQPSSEKFWI HIDNYYCGAA DGAICSMSLL IFHSNSLVIL 1500
TQAKEHGKGT NLVLFNDKKV VPDISKNGIR ITSSGLYIIV EIPELEVYVS 1550
YSRLAFYIKL PFGKYYNNTM GLCGTCTNQK SDDARKRNGE VTDSFKEMAL 1600
DWKAPVSTNR YCNPGISEPV KIENYQHCEP SELCKIIWNL TECHRVVPPQ 1650
PYYEACVASR CSQQHPSTEC QSMQTYAALC GLHGICVDWR GQTNGQCEAT 1700
CARDQVYKPC GEAKRNTCFS REVIVDTLLS RNNTPVFVEG CYCPDGNILL 1750
NEHDGICVSV CGCTAQDGSV KKPREAWEHD CQYCTCDEET LNISCFPRPC 1800
AKSPPINCTK EGFVRKIKPR LDDPCCTETV CECDIKTCII NKTACDLGFQ 1850
PVVAISEDGC CPIFSCIPKG VCVSEGVEFK PGAVVPKSSC EDCVCTDEQD 1900
AVTGTNRIQC VPVKCQTTCQ QGFRYVEKEG QCCSQCQQVA CVANFPFGSV 1950
TIEVGKSYKA PYDNCTQYTC TESGGQFSLT STVKVCLPFE ESNCVPGTVD 2000
VTSDGCCKTC IDLPHKCKRS MKEQYIVHKH CKSAAPVPVP FCEGTCSTYS 2050
VYSFENNEME HKCICCHEKK SHVEKVELVC SEHKTLKFSY VHVDECGCVE 2100
TKCPMRRT 2108
Length: 2,108
Mass (Da): 233,553
Last modified: June 1, 2001 - v1
Checksum: 68B887CB781E6539
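
The sequence is displayed in blocks of 10 residues with a running position counter at the end of each line, while the feature tables use 1-based residue positions. The following is a minimal Python sketch of how such display text could be turned into a plain sequence and checked against listed positions. It uses only the first two display lines (residues 1-100), and the helper names are hypothetical. The motif test assumes the commonly used N-X-S/T sequon definition with X not proline, which this entry does not state.

```python
import re

# First two display lines of the sequence above (residues 1-100), copied verbatim.
DISPLAY_LINES = """\
MEIKKERSFW IFCLIWSFCK GKEPVQIVQV STVGRSECTT WGNFHFHTFD     50
HVKFTFPGTC TYVFASHCND SYQDFNIKIR RSDKNSHLIY FTVTTDGVIL 100
"""


def plain_sequence(display_text: str) -> str:
    """Drop spacing and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", display_text).upper()


def residue(seq: str, position: int) -> str:
    """Residue at a 1-based position, matching the feature-table numbering."""
    return seq[position - 1]


def is_sequon(seq: str, position: int) -> bool:
    """True if `position` starts an N-X-S/T motif with X not proline (assumed definition)."""
    motif = seq[position - 1:position + 2]
    return (
        len(motif) == 3
        and motif[0] == "N"
        and motif[1] != "P"
        and motif[2] in "ST"
    )


if __name__ == "__main__":
    seq = plain_sequence(DISPLAY_LINES)
    print(len(seq), residue(seq, 60), residue(seq, 68), residue(seq, 69))
    print(is_sequon(seq, 69))
```

In this excerpt, Asn-69 sits in an N-D-S motif even though the Sites table lists it as not glycosylated, so a motif match alone should not be read as evidence of glycosylation. With the full 2,108-residue text pasted in, the same helpers could be applied to the other listed positions.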

Sequence databases

EMBL / GenBank / DDBJ: AB046524 mRNA. Translation: BAB21488.1.
UniGene: Gga.3337.

Cross-references

Miscellaneous databases

NextBio: 20815465.
PRO: Q98UI9.

Publications

  1. "Amino acid sequence of alpha-subunit in hen egg white ovomucin deduced from cloned cDNA."
    Watanabe K., Shimoyamada M., Onizuka T., Akiyama H., Niwa M., Ido T., Tsuge Y.
    DNA Seq. 15:251-261(2004)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA].
    Tissue: Oviduct.
  2. "Studies on the composition of egg-white ovomucin."
    Robinson D.S., Monsey J.B.
    Biochem. J. 121:537-547(1971)
    Cited for: STRUCTURE OF CARBOHYDRATES, FUNCTION, SUBUNIT.
  3. "N-glycosylation of ovomucin from hen egg white."
    Offengenden M., Fentabil M.A., Wu J.
    Glycoconj. J. 28:113-123(2011)
    Cited for: GLYCOSYLATION AT ASN-381; ASN-528; ASN-599; ASN-680; ASN-772; ASN-855; ASN-1036; ASN-1219; ASN-1371; ASN-1452; ASN-1567; ASN-1639; ASN-1792; ASN-1807; ASN-1841 AND ASN-1964, LACK OF GLYCOSYLATION AT ASN-69 AND ASN-673, IDENTIFICATION BY MASS SPECTROMETRY, STRUCTURE OF CARBOHYDRATES.

Entry information

Entry name: MUC5B_CHICK
Accession: Primary (citable) accession number: Q98UI9
Entry history:
Integrated into UniProtKB/Swiss-Prot: September 21, 2011
Last sequence update: June 1, 2001
Last modified: April 16, 2014
This is version 72 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families
