Q98UI9 - MUC5B_CHICK


Protein

Mucin-5B

Gene

MUC5B

Organism
Gallus gallus (Chicken)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at protein level

Function

Ovomucin, the glycoprotein responsible for the gel properties of egg white, is composed of 2 subunits, alpha-ovomucin/MUC5B and beta-ovomucin/MUC6. (1 publication)

Sites

Feature key   Position(s)   Length   Description
Site          69 – 69       1        Not glycosylated
Site          673 – 673     1        Not glycosylated

Names & Taxonomy

Protein names
Recommended name:
Mucin-5B
Alternative name(s):
Ovomucin, alpha-subunit
Gene names
Name: MUC5B
Organism: Gallus gallus (Chicken)
Taxonomic identifier: 9031 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Testudines + Archosauria group > Archosauria > Dinosauria > Saurischia > Theropoda > Coelurosauria > Aves > Neognathae > Galliformes > Phasianidae > Phasianinae > Gallus
Proteomes: UP000000539: Unplaced

Subcellular location

GO - Cellular component

  1. extracellular region Source: UniProtKB-KW

Keywords - Cellular component

Secreted

Pathology & Biotech

Protein family/group databases

Allergome: 2741. Gal d Ovomucin.

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description         Feature identifier
Signal peptide   1 – 21        21       Sequence Analysis
Chain            22 – 2108     2087     Mucin-5B            PRO_5000049585

Amino acid modifications

Feature key      Position(s)    Length   Description                      Evidence
Disulfide bond   60 ↔ 68                                                  By similarity
Glycosylation    381 – 381      1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    528 – 528      1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    599 – 599      1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    680 – 680      1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    772 – 772      1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    855 – 855      1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1036 – 1036    1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1219 – 1219    1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1371 – 1371    1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1452 – 1452    1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1567 – 1567    1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1639 – 1639    1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1792 – 1792    1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1807 – 1807    1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1841 – 1841    1        N-linked (GlcNAc...) (complex)   1 publication
Glycosylation    1964 – 1964    1        N-linked (GlcNAc...) (complex)   1 publication
Disulfide bond   2010 ↔ 2066                                              By similarity
Disulfide bond   2031 ↔ 2080                                              By similarity
Disulfide bond   2042 ↔ 2096                                              By similarity
Disulfide bond   2046 ↔ 2098                                              By similarity

Post-translational modification

N-glycosylated. Complex glycosylation with bisecting N-acetylglucosamine. Contains mainly N-acetylglucosamine (3.1-8.5%), mannose (2.9-4.6%), a small amount of galactose (1.1-4.35) and sialic acid (0.3-1.3%). The most abundant glycan is composed of a GlcNAc2Man3 core, a bisecting GlcNAc and another 3 GlcNAc antennae located on the mannoses of the core. Site Asn-1639 exists in both glycosylated and non-glycosylated forms. (1 publication)
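N-linked glycans attach to asparagine within the consensus sequon N-X-S/T, where X is any residue except proline. The sketch below scans the first 100 residues of the displayed sequence for that motif. It is a generic sequence heuristic, not the mass-spectrometry approach used in the cited publication, and the names `N_TERMINAL_100`, `SEQUON` and `find_sequons` are introduced here purely for illustration.

```python
import re

# First 100 residues of the displayed Q98UI9 sequence, spaces removed.
N_TERMINAL_100 = (
    "MEIKKERSFWIFCLIWSFCKGKEPVQIVQVSTVGRSECTTWGNFHFHTFD"
    "HVKFTFPGTCTYVFASHCNDSYQDFNIKIRRSDKNSHLIYFTVTTDGVIL"
)

# N-X-S/T sequon, X != proline. The lookahead keeps matches zero-width
# after the Asn, so overlapping sequons would all be reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(sequence: str) -> list[int]:
    """Return 1-based positions of Asn residues that begin an N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(sequence)]


print(find_sequons(N_TERMINAL_100))  # prints [69]
```

The only hit in this excerpt is Asn-69, which the entry annotates as not glycosylated: a sequon marks a candidate site, and occupancy still has to be established experimentally.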

Keywords - PTM

Disulfide bond, Glycoprotein

Proteomic databases

PaxDb: Q98UI9.
PRIDE: Q98UI9.

Interaction

Subunit structure

Multimer; disulfide-linked. (1 publication)

Protein-protein interaction databases

STRING: 9031.ENSGALP00000010852.

Structure

3D structure databases

ProteinModelPortal: Q98UI9.

Family & Domains

Domains and Repeats

Feature key   Position(s)    Length   Description   Evidence
Domain        37 – 242       206      VWFD 1        PROSITE-ProRule annotation
Domain        304 – 360      57       TIL 1
Domain        399 – 610      212      VWFD 2        PROSITE-ProRule annotation
Domain        666 – 723      58       TIL 2
Domain        782 – 825      44       TIL 3
Domain        825 – 897      73       VWFC 1        PROSITE-ProRule annotation
Domain        864 – 1069     206      VWFD 3        PROSITE-ProRule annotation
Domain        1430 – 1646    217      VWFD 4        PROSITE-ProRule annotation
Domain        1761 – 1832    72       VWFC 2        PROSITE-ProRule annotation
Domain        1870 – 1937    68       VWFC 3        PROSITE-ProRule annotation
Domain        2010 – 2104    95       CTCK          PROSITE-ProRule annotation
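Feature lengths in this table follow from the inclusive coordinates (end - start + 1), and a few annotated ranges share residues. A minimal sketch, assuming the coordinates are transcribed as listed above, recomputes the lengths and reports overlapping pairs. The `Domain` class and `overlapping_pairs` helper are hypothetical names used only for this illustration.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Domain:
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


# Coordinates copied from the Domains and Repeats table of this entry.
DOMAINS = [
    Domain("VWFD 1", 37, 242),
    Domain("TIL 1", 304, 360),
    Domain("VWFD 2", 399, 610),
    Domain("TIL 2", 666, 723),
    Domain("TIL 3", 782, 825),
    Domain("VWFC 1", 825, 897),
    Domain("VWFD 3", 864, 1069),
    Domain("VWFD 4", 1430, 1646),
    Domain("VWFC 2", 1761, 1832),
    Domain("VWFC 3", 1870, 1937),
    Domain("CTCK", 2010, 2104),
]


def overlapping_pairs(domains: list[Domain]) -> list[tuple[str, str]]:
    """Return name pairs whose inclusive ranges share at least one residue."""
    return [
        (a.name, b.name)
        for a, b in combinations(domains, 2)
        if a.start <= b.end and b.start <= a.end
    ]


for domain in DOMAINS:
    print(f"{domain.name:8s} {domain.start:5d} {domain.end:5d} {domain.length:4d}")
print(overlapping_pairs(DOMAINS))
```

With these coordinates the reported overlaps are TIL 3 with VWFC 1 (they share residue 825) and VWFC 1 with VWFD 3 (residues 864 to 897).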

Sequence similarities

Contains 1 CTCK (C-terminal cystine knot-like) domain. (PROSITE-ProRule annotation)
Contains 3 VWFC domains. (PROSITE-ProRule annotation)
Contains 4 VWFD domains. (PROSITE-ProRule annotation)

Keywords - Domain

Repeat, Signal

Phylogenomic databases

eggNOG: NOG12793.
HOGENOM: HOG000168234.
HOVERGEN: HBG004380.
InParanoid: Q98UI9.
PhylomeDB: Q98UI9.

Family and domain databases

InterPro:
  IPR006207. Cys_knot_C.
  IPR002919. TIL_dom.
  IPR014853. Unchr_dom_Cys-rich.
  IPR001007. VWF_C.
  IPR001846. VWF_type-D.
Pfam:
  PF08742. C8. 4 hits.
  PF01826. TIL. 3 hits.
  PF00094. VWD. 4 hits.
SMART:
  SM00832. C8. 4 hits.
  SM00041. CT. 1 hit.
  SM00214. VWC. 5 hits.
  SM00216. VWD. 4 hits.
SUPFAM:
  SSF57567. SSF57567. 5 hits.
PROSITE:
  PS01225. CTCK_2. 1 hit.
  PS01208. VWFC_1. 2 hits.
  PS50184. VWFC_2. 2 hits.
  PS51233. VWFD. 4 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

        10         20         30         40         50
MEIKKERSFW IFCLIWSFCK GKEPVQIVQV STVGRSECTT WGNFHFHTFD
60 70 80 90 100
HVKFTFPGTC TYVFASHCND SYQDFNIKIR RSDKNSHLIY FTVTTDGVIL
110 120 130 140 150
EVKETGITVN GNQIPLPFSL KSILIEDTCA YFQVTSKLGL TLKWNWADTL
160 170 180 190 200
LLDLEETYKE KICGLCGNYD GNKKNDLILD GYKMHPRQFG NFHKVEDPSE
210 220 230 240 250
KCPDVRPDDH TGRHPTEDDN RCSKYKKMCK KLLSRFGNCP KVVAFDDYVA
260 270 280 290 300
TCTEDMCNCV VNSSQSDLVS SCICSTLNQY SRDCVLSKGD PGEWRTKELC
310 320 330 340 350
YQECPSNMEY MECGNSCADT CADPERSKIC KAPCTDGCFC PPGTILDDLG
360 370 380 390 400
GKKCVPRDSC PCMFQGKVYS SGGTYSTPCQ NCTCKGGHWS CISLPCSGSC
410 420 430 440 450
SIDGGFHIKT FDNKKFNFHG NCHYVLAKNT DDTFVVIGEI IQCGTSKTMT
460 470 480 490 500
CLKNVLVTLG RTTIKICSCG SIYMNNFIVK LPVSKDGITI FRPSTFFIKI
510 520 530 540 550
LSSAGVQIRV QMKPVMQLSI TVDHSYQNRT SGLCGNFNNI QTDDFRTATG
560 570 580 590 600
AVEDSAAAFG NSWKTRASCF DVEDSFEDPC SNSVDKEKFA QHWCALLSNT
610 620 630 640 650
SSTFAACHSV VDPSVYIKRC MYDTCNAEKS EVALCSVLST YSRDCAAAGM
660 670 680 690 700
TLKGWRQGIC DPSEECPETM VYNYSVKYCN QSCRSLDEPD PLCKVQIAPM
710 720 730 740 750
EGCGCPEGTY LNDEEECVTP DDCPCYYKGK IVQPGNSFQE DKLLCKCIQG
760 770 780 790 800
RLDCIGETVL VKDCPAPMYY FNCSSAGPGA IGSECQKSCK TQDMHCYVTE
810 820 830 840 850
CVSGCMCPDG LVLDGSGGCI PKDQCPCVHG GHFYKPGETI RVDCNTCTCN
860 870 880 890 900
KRQWNCTDNP CKGTCTVYGN GHYMSFDGEK FDFLGDCDYI LAQDFCPNNM
910 920 930 940 950
DAGTFRIVIQ NNACGKSLSI CSLKITLIFE SSEIRLLEGR IQEIATDPGA
960 970 980 990 1000
EKNYKVDLRG GYIVIETTQG MSFMWDQKTT VVVHVTPSFQ GKVCGLCGDF
1010 1020 1030 1040 1050
DGRSRNDFTT RGQSVEMSIQ EFGNSWKITS TCSNINMTDL CADQPFKSAL
1060 1070 1080 1090 1100
GQKHCSIIKS SVFEACHSKV NPIPYYESCV SDFCGCDSVG DCECFCTSVA
1110 1120 1130 1140 1150
AYARSCSTAG VCINWRTPAI CPVFCDYYNP PDKHEWFYKP CGAPCLKTCR
1160 1170 1180 1190 1200
NPQGKCGNIL YSLEGCYPEC SPDKPYFDEE RRECVSLPDC TSCNPEEKLC
1210 1220 1230 1240 1250
TEDSKDCLCC YNGKTYPLNE TIYSQTEGTK CGNAFCGPNG MIIETFIPCS
1260 1270 1280 1290 1300
TLSVPAQEQL MQPVTSAPLL STEATPCFCT DNGQLIQMGE NVSLPMNISG
1310 1320 1330 1340 1350
HCAYSICNAS CQIELIWAEC KVVQTEALET CEPNSEACPP TAAPNATSLV
1360 1370 1380 1390 1400
PATALAPMSD CLGLIPPRKF NESWDFGNCQ IATCLGEENN IKLSSITCPP
1410 1420 1430 1440 1450
QQLKLCVNGF PFMKHHDETG CCEVFECQCI CSGWGNEHYV TFDGTYYHFK
1460 1470 1480 1490 1500
ENCTYVLVEL IQPSSEKFWI HIDNYYCGAA DGAICSMSLL IFHSNSLVIL
1510 1520 1530 1540 1550
TQAKEHGKGT NLVLFNDKKV VPDISKNGIR ITSSGLYIIV EIPELEVYVS
1560 1570 1580 1590 1600
YSRLAFYIKL PFGKYYNNTM GLCGTCTNQK SDDARKRNGE VTDSFKEMAL
1610 1620 1630 1640 1650
DWKAPVSTNR YCNPGISEPV KIENYQHCEP SELCKIIWNL TECHRVVPPQ
1660 1670 1680 1690 1700
PYYEACVASR CSQQHPSTEC QSMQTYAALC GLHGICVDWR GQTNGQCEAT
1710 1720 1730 1740 1750
CARDQVYKPC GEAKRNTCFS REVIVDTLLS RNNTPVFVEG CYCPDGNILL
1760 1770 1780 1790 1800
NEHDGICVSV CGCTAQDGSV KKPREAWEHD CQYCTCDEET LNISCFPRPC
1810 1820 1830 1840 1850
AKSPPINCTK EGFVRKIKPR LDDPCCTETV CECDIKTCII NKTACDLGFQ
1860 1870 1880 1890 1900
PVVAISEDGC CPIFSCIPKG VCVSEGVEFK PGAVVPKSSC EDCVCTDEQD
1910 1920 1930 1940 1950
AVTGTNRIQC VPVKCQTTCQ QGFRYVEKEG QCCSQCQQVA CVANFPFGSV
1960 1970 1980 1990 2000
TIEVGKSYKA PYDNCTQYTC TESGGQFSLT STVKVCLPFE ESNCVPGTVD
2010 2020 2030 2040 2050
VTSDGCCKTC IDLPHKCKRS MKEQYIVHKH CKSAAPVPVP FCEGTCSTYS
2060 2070 2080 2090 2100
VYSFENNEME HKCICCHEKK SHVEKVELVC SEHKTLKFSY VHVDECGCVE

TKCPMRRT
Length: 2,108
Mass (Da): 233,553
Last modified: June 1, 2001 - v1
Checksum: 68B887CB781E6539
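
The sequence above is displayed in groups of 10 residues with position rulers between the rows. A minimal sketch of turning that layout back into a contiguous string, shown on the first two rows only: the function name `parse_sequence_block` is an assumption made for this example, not part of any UniProt tool.

```python
def parse_sequence_block(text: str) -> str:
    """Join residue groups, skipping blank lines and digit-only ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines consist solely of position numbers (10, 20, ...).
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


# First two rows of the displayed sequence, including their rulers.
EXCERPT = """\
        10         20         30         40         50
MEIKKERSFW IFCLIWSFCK GKEPVQIVQV STVGRSECTT WGNFHFHTFD
60 70 80 90 100
HVKFTFPGTC TYVFASHCND SYQDFNIKIR RSDKNSHLIY FTVTTDGVIL
"""

sequence = parse_sequence_block(EXCERPT)
print(len(sequence))   # prints 100
print(sequence[:21])   # residues 1-21, the annotated signal peptide
```

Applied to the full block, the resulting string length can be checked against the stated length of 2,108 residues.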

Sequence databases

EMBL/GenBank/DDBJ: AB046524 mRNA. Translation: BAB21488.1.
UniGene: Gga.3337.

Cross-references

Miscellaneous databases

NextBio: 20815465.
PRO: Q98UI9.

Publications

  1. "Amino acid sequence of alpha-subunit in hen egg white ovomucin deduced from cloned cDNA."
    Watanabe K., Shimoyamada M., Onizuka T., Akiyama H., Niwa M., Ido T., Tsuge Y.
    DNA Seq. 15:251-261(2004)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA].
    Tissue: Oviduct.
  2. "Studies on the composition of egg-white ovomucin."
    Robinson D.S., Monsey J.B.
    Biochem. J. 121:537-547(1971)
    Cited for: STRUCTURE OF CARBOHYDRATES, FUNCTION, SUBUNIT.
  3. "N-glycosylation of ovomucin from hen egg white."
    Offengenden M., Fentabil M.A., Wu J.
    Glycoconj. J. 28:113-123(2011)
    Cited for: GLYCOSYLATION AT ASN-381; ASN-528; ASN-599; ASN-680; ASN-772; ASN-855; ASN-1036; ASN-1219; ASN-1371; ASN-1452; ASN-1567; ASN-1639; ASN-1792; ASN-1807; ASN-1841 AND ASN-1964, LACK OF GLYCOSYLATION AT ASN-69 AND ASN-673, IDENTIFICATION BY MASS SPECTROMETRY, STRUCTURE OF CARBOHYDRATES.

Entry information

Entry name: MUC5B_CHICK
Accession: Primary (citable) accession number: Q98UI9
Entry history:
Integrated into UniProtKB/Swiss-Prot: September 21, 2011
Last sequence update: June 1, 2001
Last modified: October 29, 2014
This is version 74 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome
