UniProt

Q98UI9 - MUC5B_CHICK

Protein

Mucin-5B

Gene

MUC5B

Organism
Gallus gallus (Chicken)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at protein level
      Entry version 73 (01 Oct 2014)
      Sequence version 1 (01 Jun 2001)

    Function

    Ovomucin, the glycoprotein responsible for the gel properties of egg white, is composed of 2 subunits, alpha-ovomucin/MUC5B and beta-ovomucin/MUC6. (1 publication)

    Sites

    Feature key   Position(s)   Length   Description
    Site          69 – 69       1        Not glycosylated
    Site          673 – 673     1        Not glycosylated

    Names & Taxonomy

    Protein names
    Recommended name:
    Mucin-5B
    Alternative name(s):
    Ovomucin, alpha-subunit
    Gene names
    Name: MUC5B
    Organism: Gallus gallus (Chicken)
    Taxonomic identifier: 9031 [NCBI]
    Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Testudines + Archosauria group > Archosauria > Dinosauria > Saurischia > Theropoda > Coelurosauria > Aves > Neognathae > Galliformes > Phasianidae > Phasianinae > Gallus
    Proteomes: UP000000539: Unplaced

    Subcellular location

    GO - Cellular component

    1. extracellular region Source: UniProtKB-SubCell

    Keywords - Cellular component

    Secreted

    Pathology & Biotech

    Protein family/group databases

    Allergome: 2741. Gal d Ovomucin.

    PTM / Processing

    Molecule processing

    Feature key      Position(s)   Length   Description   Feature identifier   Evidence
    Signal peptide   1 – 21        21       -             -                    Sequence Analysis
    Chain            22 – 2108     2087     Mucin-5B      PRO_5000049585       -
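
The molecule-processing coordinates above are 1-based and inclusive, which is easy to get wrong when slicing a 0-based string. The following is a minimal sketch of that conversion; the helper names `feature_length` and `slice_feature` are hypothetical and are not part of any UniProt tooling.

```python
# Feature coordinates copied from the molecule-processing table
# (1-based, inclusive on both ends).
SIGNAL_PEPTIDE = (1, 21)
CHAIN = (22, 2108)
PRECURSOR_LENGTH = 2108


def feature_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive range."""
    return end - start + 1


def slice_feature(sequence: str, start: int, end: int) -> str:
    """Cut a 1-based inclusive feature out of a 0-based Python string."""
    return sequence[start - 1:end]


signal_len = feature_length(*SIGNAL_PEPTIDE)
chain_len = feature_length(*CHAIN)

# The signal peptide and the mature chain together cover the precursor.
covers_precursor = signal_len + chain_len == PRECURSOR_LENGTH
```

Under these coordinates the signal peptide and the chain partition the 2,108-residue precursor with no gap or overlap.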

    Amino acid modifications

    Feature key      Position(s)    Length   Description                      Evidence
    Disulfide bond   60 ↔ 68        -        -                                By similarity
    Glycosylation    381 – 381      1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    528 – 528      1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    599 – 599      1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    680 – 680      1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    772 – 772      1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    855 – 855      1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1036 – 1036    1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1219 – 1219    1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1371 – 1371    1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1452 – 1452    1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1567 – 1567    1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1639 – 1639    1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1792 – 1792    1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1807 – 1807    1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1841 – 1841    1        N-linked (GlcNAc...) (complex)   1 publication
    Glycosylation    1964 – 1964    1        N-linked (GlcNAc...) (complex)   1 publication
    Disulfide bond   2010 ↔ 2066    -        -                                By similarity
    Disulfide bond   2031 ↔ 2080    -        -                                By similarity
    Disulfide bond   2042 ↔ 2096    -        -                                By similarity
    Disulfide bond   2046 ↔ 2098    -        -                                By similarity

    Post-translational modification

    N-glycosylated. Complex glycosylation with bisecting N-acetylglucosamine. Contains mainly N-acetylglucosamine (3.1-8.5%), mannose (2.9-4.6%), a small amount of galactose (1.1-4.35) and sialic acid (0.3-1.3%). The most abundant glycan is composed of a GlcNAc2Man3 core, a bisecting GlcNAc and another 3 GlcNAc antennae located on the mannoses of the core. Site Asn-1639 exists in both glycosylated and non-glycosylated forms. (1 publication)
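
N-linked glycans are generally attached at the consensus motif Asn-X-Ser/Thr, where X is any residue except Pro. The entry does not state this rule, so treat it as background assumed here. The sketch below scans the first 100 residues of the sequence shown in this entry for that motif; `find_sequons` is a hypothetical helper name.

```python
import re

# Residues 1-100 of the sequence displayed in this entry.
N_TERMINAL_100 = (
    "MEIKKERSFWIFCLIWSFCKGKEPVQIVQVSTVGRSECTTWGNFHFHTFD"
    "HVKFTFPGTCTYVFASHCNDSYQDFNIKIRRSDKNSHLIYFTVTTDGVIL"
)

# Asn, then any residue except Pro, then Ser or Thr.
# The lookahead keeps overlapping motifs from hiding each other.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(sequence: str, offset: int = 1) -> list[int]:
    """Return 1-based positions of Asn residues that start a sequon."""
    return [match.start() + offset for match in SEQUON.finditer(sequence)]


candidates = find_sequons(N_TERMINAL_100)
print(candidates)  # [69]
```

The only hit in this window is Asn-69, which the entry annotates as not glycosylated. A motif match therefore marks a candidate site only; occupancy has to come from experimental evidence such as the mass spectrometry data cited in the entry.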

    Keywords - PTM

    Disulfide bond, Glycoprotein

    Proteomic databases

    PaxDb: Q98UI9.
    PRIDE: Q98UI9.

    Interaction

    Subunit structure

    Multimer; disulfide-linked. (1 publication)

    Protein-protein interaction databases

    STRING: 9031.ENSGALP00000010852.

    Structure

    3D structure databases

    ProteinModelPortal: Q98UI9.

    Family & Domains

    Domains and Repeats

    Feature key   Position(s)    Length   Description   Evidence
    Domain        37 – 242       206      VWFD 1        PROSITE-ProRule annotation
    Domain        304 – 360      57       TIL 1         -
    Domain        399 – 610      212      VWFD 2        PROSITE-ProRule annotation
    Domain        666 – 723      58       TIL 2         -
    Domain        782 – 825      44       TIL 3         -
    Domain        825 – 897      73       VWFC 1        PROSITE-ProRule annotation
    Domain        864 – 1069     206      VWFD 3        PROSITE-ProRule annotation
    Domain        1430 – 1646    217      VWFD 4        PROSITE-ProRule annotation
    Domain        1761 – 1832    72       VWFC 2        PROSITE-ProRule annotation
    Domain        1870 – 1937    68       VWFC 3        PROSITE-ProRule annotation
    Domain        2010 – 2104    95       CTCK          PROSITE-ProRule annotation
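
The domain coordinates above can be sanity-checked mechanically: each listed length should equal end minus start plus one, and any overlapping annotations are worth flagging. The following is a small sketch under that assumption; the tuple layout and function names are illustrative only.

```python
from itertools import combinations

# (name, start, end, listed length) copied from the domain table;
# positions are 1-based and inclusive.
DOMAINS = [
    ("VWFD 1", 37, 242, 206),
    ("TIL 1", 304, 360, 57),
    ("VWFD 2", 399, 610, 212),
    ("TIL 2", 666, 723, 58),
    ("TIL 3", 782, 825, 44),
    ("VWFC 1", 825, 897, 73),
    ("VWFD 3", 864, 1069, 206),
    ("VWFD 4", 1430, 1646, 217),
    ("VWFC 2", 1761, 1832, 72),
    ("VWFC 3", 1870, 1937, 68),
    ("CTCK", 2010, 2104, 95),
]


def length_mismatches(domains):
    """Names of domains whose listed length disagrees with their coordinates."""
    return [name for name, start, end, length in domains
            if end - start + 1 != length]


def overlapping_pairs(domains):
    """Pairs of domain names whose inclusive ranges share at least one residue."""
    pairs = []
    for (name_a, start_a, end_a, _), (name_b, start_b, end_b, _) in combinations(domains, 2):
        if start_a <= end_b and start_b <= end_a:
            pairs.append((name_a, name_b))
    return pairs


mismatches = length_mismatches(DOMAINS)
overlaps = overlapping_pairs(DOMAINS)
print(mismatches)  # []
print(overlaps)    # [('TIL 3', 'VWFC 1'), ('VWFC 1', 'VWFD 3')]
```

All listed lengths are consistent. The two reported overlaps come from the annotated boundaries themselves (TIL 3 and VWFC 1 share residue 825, and VWFC 1 extends into VWFD 3), so they reflect the source data rather than a transcription error.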

    Sequence similarities

    Contains 1 CTCK (C-terminal cystine knot-like) domain. (PROSITE-ProRule annotation)
    Contains 3 VWFC domains. (PROSITE-ProRule annotation)
    Contains 4 VWFD domains. (PROSITE-ProRule annotation)

    Keywords - Domain

    Repeat, Signal

    Phylogenomic databases

    eggNOG: NOG12793.
    HOGENOM: HOG000168234.
    HOVERGEN: HBG004380.
    InParanoid: Q98UI9.
    PhylomeDB: Q98UI9.

    Family and domain databases

    InterPro:
      IPR006207. Cys_knot_C.
      IPR002919. TIL_dom.
      IPR014853. Unchr_dom_Cys-rich.
      IPR001007. VWF_C.
      IPR001846. VWF_type-D.
    Pfam:
      PF08742. C8. 4 hits.
      PF01826. TIL. 3 hits.
      PF00094. VWD. 4 hits.
    SMART:
      SM00832. C8. 4 hits.
      SM00041. CT. 1 hit.
      SM00214. VWC. 5 hits.
      SM00216. VWD. 4 hits.
    SUPFAM:
      SSF57567. SSF57567. 5 hits.
    PROSITE:
      PS01225. CTCK_2. 1 hit.
      PS01208. VWFC_1. 2 hits.
      PS50184. VWFC_2. 2 hits.
      PS51233. VWFD. 4 hits.

    Sequence

    Sequence status: Complete.

    Sequence processing: The displayed sequence is further processed into a mature form.

    Q98UI9-1 [UniParc]

    MEIKKERSFW IFCLIWSFCK GKEPVQIVQV STVGRSECTT WGNFHFHTFD     50
    HVKFTFPGTC TYVFASHCND SYQDFNIKIR RSDKNSHLIY FTVTTDGVIL 100
    EVKETGITVN GNQIPLPFSL KSILIEDTCA YFQVTSKLGL TLKWNWADTL 150
    LLDLEETYKE KICGLCGNYD GNKKNDLILD GYKMHPRQFG NFHKVEDPSE 200
    KCPDVRPDDH TGRHPTEDDN RCSKYKKMCK KLLSRFGNCP KVVAFDDYVA 250
    TCTEDMCNCV VNSSQSDLVS SCICSTLNQY SRDCVLSKGD PGEWRTKELC 300
    YQECPSNMEY MECGNSCADT CADPERSKIC KAPCTDGCFC PPGTILDDLG 350
    GKKCVPRDSC PCMFQGKVYS SGGTYSTPCQ NCTCKGGHWS CISLPCSGSC 400
    SIDGGFHIKT FDNKKFNFHG NCHYVLAKNT DDTFVVIGEI IQCGTSKTMT 450
    CLKNVLVTLG RTTIKICSCG SIYMNNFIVK LPVSKDGITI FRPSTFFIKI 500
    LSSAGVQIRV QMKPVMQLSI TVDHSYQNRT SGLCGNFNNI QTDDFRTATG 550
    AVEDSAAAFG NSWKTRASCF DVEDSFEDPC SNSVDKEKFA QHWCALLSNT 600
    SSTFAACHSV VDPSVYIKRC MYDTCNAEKS EVALCSVLST YSRDCAAAGM 650
    TLKGWRQGIC DPSEECPETM VYNYSVKYCN QSCRSLDEPD PLCKVQIAPM 700
    EGCGCPEGTY LNDEEECVTP DDCPCYYKGK IVQPGNSFQE DKLLCKCIQG 750
    RLDCIGETVL VKDCPAPMYY FNCSSAGPGA IGSECQKSCK TQDMHCYVTE 800
    CVSGCMCPDG LVLDGSGGCI PKDQCPCVHG GHFYKPGETI RVDCNTCTCN 850
    KRQWNCTDNP CKGTCTVYGN GHYMSFDGEK FDFLGDCDYI LAQDFCPNNM 900
    DAGTFRIVIQ NNACGKSLSI CSLKITLIFE SSEIRLLEGR IQEIATDPGA 950
    EKNYKVDLRG GYIVIETTQG MSFMWDQKTT VVVHVTPSFQ GKVCGLCGDF 1000
    DGRSRNDFTT RGQSVEMSIQ EFGNSWKITS TCSNINMTDL CADQPFKSAL 1050
    GQKHCSIIKS SVFEACHSKV NPIPYYESCV SDFCGCDSVG DCECFCTSVA 1100
    AYARSCSTAG VCINWRTPAI CPVFCDYYNP PDKHEWFYKP CGAPCLKTCR 1150
    NPQGKCGNIL YSLEGCYPEC SPDKPYFDEE RRECVSLPDC TSCNPEEKLC 1200
    TEDSKDCLCC YNGKTYPLNE TIYSQTEGTK CGNAFCGPNG MIIETFIPCS 1250
    TLSVPAQEQL MQPVTSAPLL STEATPCFCT DNGQLIQMGE NVSLPMNISG 1300
    HCAYSICNAS CQIELIWAEC KVVQTEALET CEPNSEACPP TAAPNATSLV 1350
    PATALAPMSD CLGLIPPRKF NESWDFGNCQ IATCLGEENN IKLSSITCPP 1400
    QQLKLCVNGF PFMKHHDETG CCEVFECQCI CSGWGNEHYV TFDGTYYHFK 1450
    ENCTYVLVEL IQPSSEKFWI HIDNYYCGAA DGAICSMSLL IFHSNSLVIL 1500
    TQAKEHGKGT NLVLFNDKKV VPDISKNGIR ITSSGLYIIV EIPELEVYVS 1550
    YSRLAFYIKL PFGKYYNNTM GLCGTCTNQK SDDARKRNGE VTDSFKEMAL 1600
    DWKAPVSTNR YCNPGISEPV KIENYQHCEP SELCKIIWNL TECHRVVPPQ 1650
    PYYEACVASR CSQQHPSTEC QSMQTYAALC GLHGICVDWR GQTNGQCEAT 1700
    CARDQVYKPC GEAKRNTCFS REVIVDTLLS RNNTPVFVEG CYCPDGNILL 1750
    NEHDGICVSV CGCTAQDGSV KKPREAWEHD CQYCTCDEET LNISCFPRPC 1800
    AKSPPINCTK EGFVRKIKPR LDDPCCTETV CECDIKTCII NKTACDLGFQ 1850
    PVVAISEDGC CPIFSCIPKG VCVSEGVEFK PGAVVPKSSC EDCVCTDEQD 1900
    AVTGTNRIQC VPVKCQTTCQ QGFRYVEKEG QCCSQCQQVA CVANFPFGSV 1950
    TIEVGKSYKA PYDNCTQYTC TESGGQFSLT STVKVCLPFE ESNCVPGTVD 2000
    VTSDGCCKTC IDLPHKCKRS MKEQYIVHKH CKSAAPVPVP FCEGTCSTYS 2050
    VYSFENNEME HKCICCHEKK SHVEKVELVC SEHKTLKFSY VHVDECGCVE 2100
    TKCPMRRT 2108
    Length: 2,108
    Mass (Da): 233,553
    Last modified: June 1, 2001 - v1
    Checksum: 68B887CB781E6539
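
The sequence above is displayed in blocks of ten residues with a running position counter at the end of each line. The sketch below shows one way to turn that display format back into a plain string; `parse_sequence_block` is a hypothetical helper, and the sample holds only the first two lines of the sequence.

```python
def parse_sequence_block(text: str) -> str:
    """Join 10-residue blocks into one string, dropping position counters."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                # Running position counter at the end of a line, not residues.
                continue
            residues.append(token)
    return "".join(residues)


def parse_length(field: str) -> int:
    """Convert a displayed length such as '2,108' to an integer."""
    return int(field.replace(",", ""))


SAMPLE = """\
MEIKKERSFW IFCLIWSFCK GKEPVQIVQV STVGRSECTT WGNFHFHTFD     50
HVKFTFPGTC TYVFASHCND SYQDFNIKIR RSDKNSHLIY FTVTTDGVIL 100
"""

sequence = parse_sequence_block(SAMPLE)
print(len(sequence))          # 100
print(parse_length("2,108"))  # 2108
```

Applied to the full block, the parsed length can be compared with the Length field as a quick check that no line was lost in copying.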

    Sequence databases

    EMBL/GenBank/DDBJ: AB046524 mRNA. Translation: BAB21488.1.
    UniGene: Gga.3337.

    Cross-references


    Miscellaneous databases

    NextBio: 20815465.
    PRO: Q98UI9.


    Publications

    1. "Amino acid sequence of alpha-subunit in hen egg white ovomucin deduced from cloned cDNA."
      Watanabe K., Shimoyamada M., Onizuka T., Akiyama H., Niwa M., Ido T., Tsuge Y.
      DNA Seq. 15:251-261(2004)
      Cited for: NUCLEOTIDE SEQUENCE [MRNA].
      Tissue: Oviduct.
    2. "Studies on the composition of egg-white ovomucin."
      Robinson D.S., Monsey J.B.
      Biochem. J. 121:537-547(1971)
      Cited for: STRUCTURE OF CARBOHYDRATES, FUNCTION, SUBUNIT.
    3. "N-glycosylation of ovomucin from hen egg white."
      Offengenden M., Fentabil M.A., Wu J.
      Glycoconj. J. 28:113-123(2011)
      Cited for: GLYCOSYLATION AT ASN-381; ASN-528; ASN-599; ASN-680; ASN-772; ASN-855; ASN-1036; ASN-1219; ASN-1371; ASN-1452; ASN-1567; ASN-1639; ASN-1792; ASN-1807; ASN-1841 AND ASN-1964, LACK OF GLYCOSYLATION AT ASN-69 AND ASN-673, IDENTIFICATION BY MASS SPECTROMETRY, STRUCTURE OF CARBOHYDRATES.
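
The "Cited for" line of the third publication packs the glycosylated and non-glycosylated asparagines into a single string. The sketch below shows one way to split it into two position lists that can be compared with the feature tables; the parsing approach and the `asn_positions` helper are assumptions for illustration.

```python
import re

# "Cited for" text of publication 3, copied from this entry.
CITED_FOR = (
    "GLYCOSYLATION AT ASN-381; ASN-528; ASN-599; ASN-680; ASN-772; ASN-855; "
    "ASN-1036; ASN-1219; ASN-1371; ASN-1452; ASN-1567; ASN-1639; ASN-1792; "
    "ASN-1807; ASN-1841 AND ASN-1964, LACK OF GLYCOSYLATION AT ASN-69 AND "
    "ASN-673, IDENTIFICATION BY MASS SPECTROMETRY, STRUCTURE OF CARBOHYDRATES."
)


def asn_positions(clause: str) -> list[int]:
    """Extract the residue numbers of every ASN-<n> token in a clause."""
    return [int(number) for number in re.findall(r"ASN-(\d+)", clause)]


# Top-level clauses are separated by ", "; items inside a clause use "; ".
clauses = CITED_FOR.split(", ")
glycosylated = asn_positions(
    next(c for c in clauses if c.startswith("GLYCOSYLATION AT")))
not_glycosylated = asn_positions(
    next(c for c in clauses if c.startswith("LACK OF GLYCOSYLATION AT")))

print(len(glycosylated))  # 16
print(not_glycosylated)   # [69, 673]
```

The 16 glycosylated positions match the Glycosylation rows listed under amino acid modifications, and the two non-glycosylated positions match the Site rows.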

    Entry information

    Entry name: MUC5B_CHICK
    Accession: Primary (citable) accession number: Q98UI9
    Entry history
    Integrated into UniProtKB/Swiss-Prot: September 21, 2011
    Last sequence update: June 1, 2001
    Last modified: October 1, 2014
    This is version 73 of the entry and version 1 of the sequence.
    Entry status: Reviewed (UniProtKB/Swiss-Prot)
    Annotation program: Chordata Protein Annotation Program

    Miscellaneous

    Keywords - Technical term

    Complete proteome, Reference proteome
