UniProtKB - Q98UI9 (MUC5B_CHICK)
Protein
Mucin-5B
Gene
MUC5B
Organism
Gallus gallus (Chicken)
Function
Ovomucin, the glycoprotein responsible for the gel properties of egg white, is composed of 2 subunits, alpha-ovomucin/MUC5B and beta-ovomucin/MUC6. (1 publication)
GO - Molecular function
- virion binding Source: AgBase
GO - Biological process
- cholesterol homeostasis Source: AgBase
- intestinal cholesterol absorption Source: AgBase
- macrophage activation involved in immune response Source: AgBase
Names & Taxonomy
Protein names | Recommended name: Mucin-5B; Alternative name(s): Ovomucin, alpha-subunit |
Gene names | Name: MUC5B |
Organism | Gallus gallus (Chicken) |
Taxonomic identifier | 9031 [NCBI] |
Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Archelosauria › Archosauria › Dinosauria › Saurischia › Theropoda › Coelurosauria › Aves › Neognathae › Galloanserae › Galliformes › Phasianidae › Phasianinae › Gallus |
Subcellular location
Extracellular region or secreted
- extracellular space Source: AgBase
Other locations
- extracellular matrix Source: GO_Central
- intracellular membrane-bounded organelle Source: AgBase
Keywords - Cellular component
Secreted
PTM / Processing
Molecule processing
Feature key | Position(s) | Description | Length |
---|---|---|---|
Signal peptide | 1 – 21 | Sequence analysis | 21 |
Chain (PRO_5000049585) | 22 – 2108 | Mucin-5B | 2087 |
Amino acid modifications
Feature key | Position(s) | Description | Length |
---|---|---|---|
Disulfide bond | 60 ↔ 68 | By similarity | |
Glycosylation | 381 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 528 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 599 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 680 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 772 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 855 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1036 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1219 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1371 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1452 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1567 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1639 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1792 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1807 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1841 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Glycosylation | 1964 | N-linked (GlcNAc...) (complex) asparagine (1 publication) | 1 |
Disulfide bond | 2010 ↔ 2066 | By similarity | |
Disulfide bond | 2031 ↔ 2080 | By similarity | |
Disulfide bond | 2042 ↔ 2096 | By similarity | |
Disulfide bond | 2046 ↔ 2098 | By similarity | |
Post-translational modification
N-glycosylated. Complex glycosylation with bisecting N-acetylglucosamine. Contains mainly N-acetylglucosamine (3.1-8.5%), mannose (2.9-4.6%), a small amount of galactose (1.1-4.35) and sialic acid (0.3-1.3%). The most abundant glycan is composed of a GlcNAc2Man3 core, a bisecting GlcNAc and another 3 GlcNAc antennae located on the mannoses of the core. Site Asn-1639 exists in both glycosylated and non-glycosylated forms. (1 publication)
Sites
Feature key | Position(s) | Description | Length |
---|---|---|---|
Site | 69 | Not glycosylated (1 publication) | 1 |
Site | 673 | Not glycosylated (1 publication) | 1 |
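The glycosylation and site annotations can be cross-checked against the displayed sequence. The following is a minimal sketch, not part of the UniProt record: it scans a fragment for the canonical N-glycosylation sequon (Asn, any residue except Pro, then Ser or Thr). The function name `find_sequons` is hypothetical, and the fragment is residues 661-690 copied from the sequence shown in this entry.

```python
import re

# N-glycosylation sequon: Asn, any residue except Pro, then Ser or Thr.
# The zero-width lookahead allows overlapping motifs to be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(fragment: str, first_position: int) -> list[int]:
    """Return 1-based positions of Asn residues that start a sequon.

    first_position is the 1-based position of fragment[0] in the full protein.
    """
    return [first_position + m.start() for m in SEQUON.finditer(fragment)]


# Residues 661-690 of the sequence displayed in this entry.
fragment = "DPSEECPETMVYNYSVKYCNQSCRSLDEPD"
print(find_sequons(fragment, 661))  # [673, 680]
```

Both Asn-673 and Asn-680 carry the motif, yet the entry annotates 680 as glycosylated and 673 as not glycosylated, so a motif match alone does not establish site occupancy.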
Keywords - PTM
Disulfide bond, Glycoprotein
Proteomic databases
PaxDb | Q98UI9 |
PRIDE | Q98UI9 |
PTM databases
iPTMnet | Q98UI9 |
Interaction
Subunit structure
Multimer; disulfide-linked. (1 publication)
Protein-protein interaction databases
STRING | 9031.ENSGALP00000010852 |
Family & Domains
Domains and Repeats
Feature key | Position(s) | Description | Length |
---|---|---|---|
Domain | 37 – 242 | VWFD 1 (PROSITE-ProRule annotation) | 206 |
Domain | 304 – 360 | TIL 1 | 57 |
Domain | 399 – 610 | VWFD 2 (PROSITE-ProRule annotation) | 212 |
Domain | 666 – 723 | TIL 2 | 58 |
Domain | 782 – 825 | TIL 3 | 44 |
Domain | 825 – 897 | VWFC 1 (PROSITE-ProRule annotation) | 73 |
Domain | 864 – 1069 | VWFD 3 (PROSITE-ProRule annotation) | 206 |
Domain | 1430 – 1646 | VWFD 4 (PROSITE-ProRule annotation) | 217 |
Domain | 1761 – 1832 | VWFC 2 (PROSITE-ProRule annotation) | 72 |
Domain | 1870 – 1937 | VWFC 3 (PROSITE-ProRule annotation) | 68 |
Domain | 2010 – 2104 | CTCK (PROSITE-ProRule annotation) | 95 |
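The Length column follows from the inclusive, 1-based positions (end minus start plus one), and some annotated domain ranges share residues. As an illustrative sketch only, with names such as `DOMAINS` and `span_length` chosen here for the example, the table can be checked like this:

```python
from itertools import combinations

# (name, start, end, listed_length) copied from the Domains and Repeats table.
# Positions are 1-based and inclusive.
DOMAINS = [
    ("VWFD 1", 37, 242, 206),
    ("TIL 1", 304, 360, 57),
    ("VWFD 2", 399, 610, 212),
    ("TIL 2", 666, 723, 58),
    ("TIL 3", 782, 825, 44),
    ("VWFC 1", 825, 897, 73),
    ("VWFD 3", 864, 1069, 206),
    ("VWFD 4", 1430, 1646, 217),
    ("VWFC 2", 1761, 1832, 72),
    ("VWFC 3", 1870, 1937, 68),
    ("CTCK", 2010, 2104, 95),
]


def span_length(start: int, end: int) -> int:
    """Length of an inclusive 1-based range."""
    return end - start + 1


# Rows whose listed length disagrees with the positions.
mismatches = [name for name, s, e, listed in DOMAINS if span_length(s, e) != listed]

# Pairs of domains whose inclusive ranges share at least one residue.
overlapping = [
    (a[0], b[0])
    for a, b in combinations(DOMAINS, 2)
    if a[1] <= b[2] and b[1] <= a[2]
]

print(mismatches)   # []
print(overlapping)  # [('TIL 3', 'VWFC 1'), ('VWFC 1', 'VWFD 3')]
```

All listed lengths agree with the positions. TIL 3 and VWFC 1 share residue 825, and VWFC 1 and VWFD 3 share residues 864 to 897, as annotated in the table.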
Keywords - Domain
Repeat, Signal
Phylogenomic databases
eggNOG | KOG1216, Eukaryota |
InParanoid | Q98UI9 |
PhylomeDB | Q98UI9 |
Family and domain databases
InterPro | IPR006207, Cys_knot_C; IPR036084, Ser_inhib-like_sf; IPR002919, TIL_dom; IPR014853, Unchr_dom_Cys-rich; IPR001007, VWF_dom; IPR001846, VWF_type-D |
Pfam | PF08742, C8, 4 hits; PF01826, TIL, 3 hits; PF00094, VWD, 4 hits |
SMART | SM00832, C8, 4 hits; SM00041, CT, 1 hit; SM00214, VWC, 6 hits; SM00215, VWC_out, 2 hits; SM00216, VWD, 4 hits |
SUPFAM | SSF57567, SSF57567, 5 hits |
PROSITE | PS01225, CTCK_2, 1 hit; PS01208, VWFC_1, 2 hits; PS50184, VWFC_2, 2 hits; PS51233, VWFD, 4 hits |
Sequence
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Q98UI9-1 [UniParc]
10 20 30 40 50
MEIKKERSFW IFCLIWSFCK GKEPVQIVQV STVGRSECTT WGNFHFHTFD
60 70 80 90 100
HVKFTFPGTC TYVFASHCND SYQDFNIKIR RSDKNSHLIY FTVTTDGVIL
110 120 130 140 150
EVKETGITVN GNQIPLPFSL KSILIEDTCA YFQVTSKLGL TLKWNWADTL
160 170 180 190 200
LLDLEETYKE KICGLCGNYD GNKKNDLILD GYKMHPRQFG NFHKVEDPSE
210 220 230 240 250
KCPDVRPDDH TGRHPTEDDN RCSKYKKMCK KLLSRFGNCP KVVAFDDYVA
260 270 280 290 300
TCTEDMCNCV VNSSQSDLVS SCICSTLNQY SRDCVLSKGD PGEWRTKELC
310 320 330 340 350
YQECPSNMEY MECGNSCADT CADPERSKIC KAPCTDGCFC PPGTILDDLG
360 370 380 390 400
GKKCVPRDSC PCMFQGKVYS SGGTYSTPCQ NCTCKGGHWS CISLPCSGSC
410 420 430 440 450
SIDGGFHIKT FDNKKFNFHG NCHYVLAKNT DDTFVVIGEI IQCGTSKTMT
460 470 480 490 500
CLKNVLVTLG RTTIKICSCG SIYMNNFIVK LPVSKDGITI FRPSTFFIKI
510 520 530 540 550
LSSAGVQIRV QMKPVMQLSI TVDHSYQNRT SGLCGNFNNI QTDDFRTATG
560 570 580 590 600
AVEDSAAAFG NSWKTRASCF DVEDSFEDPC SNSVDKEKFA QHWCALLSNT
610 620 630 640 650
SSTFAACHSV VDPSVYIKRC MYDTCNAEKS EVALCSVLST YSRDCAAAGM
660 670 680 690 700
TLKGWRQGIC DPSEECPETM VYNYSVKYCN QSCRSLDEPD PLCKVQIAPM
710 720 730 740 750
EGCGCPEGTY LNDEEECVTP DDCPCYYKGK IVQPGNSFQE DKLLCKCIQG
760 770 780 790 800
RLDCIGETVL VKDCPAPMYY FNCSSAGPGA IGSECQKSCK TQDMHCYVTE
810 820 830 840 850
CVSGCMCPDG LVLDGSGGCI PKDQCPCVHG GHFYKPGETI RVDCNTCTCN
860 870 880 890 900
KRQWNCTDNP CKGTCTVYGN GHYMSFDGEK FDFLGDCDYI LAQDFCPNNM
910 920 930 940 950
DAGTFRIVIQ NNACGKSLSI CSLKITLIFE SSEIRLLEGR IQEIATDPGA
960 970 980 990 1000
EKNYKVDLRG GYIVIETTQG MSFMWDQKTT VVVHVTPSFQ GKVCGLCGDF
1010 1020 1030 1040 1050
DGRSRNDFTT RGQSVEMSIQ EFGNSWKITS TCSNINMTDL CADQPFKSAL
1060 1070 1080 1090 1100
GQKHCSIIKS SVFEACHSKV NPIPYYESCV SDFCGCDSVG DCECFCTSVA
1110 1120 1130 1140 1150
AYARSCSTAG VCINWRTPAI CPVFCDYYNP PDKHEWFYKP CGAPCLKTCR
1160 1170 1180 1190 1200
NPQGKCGNIL YSLEGCYPEC SPDKPYFDEE RRECVSLPDC TSCNPEEKLC
1210 1220 1230 1240 1250
TEDSKDCLCC YNGKTYPLNE TIYSQTEGTK CGNAFCGPNG MIIETFIPCS
1260 1270 1280 1290 1300
TLSVPAQEQL MQPVTSAPLL STEATPCFCT DNGQLIQMGE NVSLPMNISG
1310 1320 1330 1340 1350
HCAYSICNAS CQIELIWAEC KVVQTEALET CEPNSEACPP TAAPNATSLV
1360 1370 1380 1390 1400
PATALAPMSD CLGLIPPRKF NESWDFGNCQ IATCLGEENN IKLSSITCPP
1410 1420 1430 1440 1450
QQLKLCVNGF PFMKHHDETG CCEVFECQCI CSGWGNEHYV TFDGTYYHFK
1460 1470 1480 1490 1500
ENCTYVLVEL IQPSSEKFWI HIDNYYCGAA DGAICSMSLL IFHSNSLVIL
1510 1520 1530 1540 1550
TQAKEHGKGT NLVLFNDKKV VPDISKNGIR ITSSGLYIIV EIPELEVYVS
1560 1570 1580 1590 1600
YSRLAFYIKL PFGKYYNNTM GLCGTCTNQK SDDARKRNGE VTDSFKEMAL
1610 1620 1630 1640 1650
DWKAPVSTNR YCNPGISEPV KIENYQHCEP SELCKIIWNL TECHRVVPPQ
1660 1670 1680 1690 1700
PYYEACVASR CSQQHPSTEC QSMQTYAALC GLHGICVDWR GQTNGQCEAT
1710 1720 1730 1740 1750
CARDQVYKPC GEAKRNTCFS REVIVDTLLS RNNTPVFVEG CYCPDGNILL
1760 1770 1780 1790 1800
NEHDGICVSV CGCTAQDGSV KKPREAWEHD CQYCTCDEET LNISCFPRPC
1810 1820 1830 1840 1850
AKSPPINCTK EGFVRKIKPR LDDPCCTETV CECDIKTCII NKTACDLGFQ
1860 1870 1880 1890 1900
PVVAISEDGC CPIFSCIPKG VCVSEGVEFK PGAVVPKSSC EDCVCTDEQD
1910 1920 1930 1940 1950
AVTGTNRIQC VPVKCQTTCQ QGFRYVEKEG QCCSQCQQVA CVANFPFGSV
1960 1970 1980 1990 2000
TIEVGKSYKA PYDNCTQYTC TESGGQFSLT STVKVCLPFE ESNCVPGTVD
2010 2020 2030 2040 2050
VTSDGCCKTC IDLPHKCKRS MKEQYIVHKH CKSAAPVPVP FCEGTCSTYS
2060 2070 2080 2090 2100
VYSFENNEME HKCICCHEKK SHVEKVELVC SEHKTLKFSY VHVDECGCVE
TKCPMRRT
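The sequence above is displayed in groups of 10 residues with ruler lines giving positions. The following sketch, an illustration rather than anything supplied by UniProt, shows one way to turn that display into a contiguous string so that positional annotations can be looked up. The function name `parse_sequence_block` is hypothetical, and only the first 100 residues are included to keep the example short.

```python
def parse_sequence_block(text: str) -> str:
    """Join residue lines into one string, skipping numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines contain only position numbers; blank lines have no tokens.
        if not tokens or all(t.isdigit() for t in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows (residues 1-100) of the sequence displayed in this entry.
block = """\
10 20 30 40 50
MEIKKERSFW IFCLIWSFCK GKEPVQIVQV STVGRSECTT WGNFHFHTFD
60 70 80 90 100
HVKFTFPGTC TYVFASHCND SYQDFNIKIR RSDKNSHLIY FTVTTDGVIL
"""

seq = parse_sequence_block(block)
print(len(seq))                   # 100
print(seq[60 - 1], seq[68 - 1])   # C C  (annotated disulfide bond 60 ↔ 68)
print(seq[:21])                   # residues 1-21, annotated as the signal peptide
```

Annotated positions are 1-based, so position p corresponds to index p - 1 in the Python string.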
Sequence databases
EMBL / GenBank / DDBJ | AB046524 mRNA Translation: BAB21488.1 |
RefSeq | NP_989992.1, NM_204661.1 |
Genome annotation databases
GeneID | 395381 |
KEGG | gga:395381 |
Cross-references
3D structure databases
SMR | Q98UI9 |
Protein family/group databases
Allergome | 2741, Gal d Ovomucin |
Organism-specific databases
CTD | 395381 |
Miscellaneous databases
PRO | PR:Q98UI9 |
Entry information
Entry name | MUC5B_CHICK |
Accession | Primary (citable) accession number: Q98UI9 |
Entry history | Integrated into UniProtKB/Swiss-Prot: September 21, 2011 |
 | Last sequence update: June 1, 2001 |
 | Last modified: December 2, 2020 |
This is version 96 of the entry and version 1 of the sequence.
Entry status | Reviewed (UniProtKB/Swiss-Prot) |
Annotation program | Chordata Protein Annotation Program |