Q80Z19 (MUC2_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 77.

Names and origin

Protein names
  Recommended name: Mucin-2
    Short name: MUC-2
  Alternative name(s):
    Colonic mucin
      Short name: MCM
    Secreted gel-forming mucin
Gene names
  Name: Muc2
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 2680 AA.
Sequence status: Fragments.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level.

General annotation (Comments)

Function

Coats the epithelia of the intestines, airways, and other mucous membrane-containing organs. Thought to provide a protective, lubricating barrier against particles and infectious agents at mucosal surfaces. Major constituent of both the inner and outer mucus layers of the colon and may play a role in excluding bacteria from the inner mucus layer. Ref.6, Ref.7, UniProtKB Q02817

Subunit structure

Homotrimer; disulfide-linked. Dimerizes in the endoplasmic reticulum via its C-terminal region and polymerizes via its N-terminal region by disulfide-linked trimerization (by similarity). Interacts with FCGBP. Interacts with AGR2; disulfide-linked (by similarity). Ref.7, UniProtKB Q02817

Subcellular location

Secreted. Note: In the intestine, secreted into the inner and outer mucus layers. Ref.6 Ref.7

Tissue specificity

Highly expressed in goblet cells of the colon with lower levels in the small intestine and no expression in the stomach (at protein level). Ref.1 Ref.2

Post-translational modification

O-glycosylated. Ref.2

May undergo proteolytic cleavage in the outer mucus layer of the colon. This contributes to the expanded volume and loose nature of the outer layer, which allows for bacterial colonization, in contrast to the inner mucus layer, which is dense and devoid of bacteria. Ref.6

May undergo autocatalytic cleavage in vivo triggered by the low pH of the late secretory pathway (by similarity). UniProtKB Q02817

Disruption phenotype

Aberrant intestinal crypt morphology and altered cell maturation and migration. Frequent development of adenomas in the small intestine, which progress to invasive adenocarcinomas, as well as rectal tumors. Absence of inner and outer mucus layers in the colon, so that bacteria are in direct contact with the colon epithelium and enter the cells and crypts; in wild-type animals, by contrast, the inner mucus layer is devoid of bacteria. Ref.5, Ref.6

Sequence similarities

Contains 1 CTCK (C-terminal cystine knot-like) domain.

Contains 1 TIL (trypsin inhibitory-like) domain.

Contains 2 VWFC domains.

Contains 4 VWFD domains.

Sequence caution

The sequence AAH30862.1 differs from that shown. Reason: Erroneous initiation.

Sequence annotation (Features)

Feature key             Position(s)     Length  Description                                       Feature identifier

Molecule processing

Signal peptide          1 – 20          20      Potential
Chain                   21 – 2680       2660    Mucin-2                                           PRO_0000378062

Regions

Domain                  33 – 238        206     VWFD 1
Domain                  295 – 351       57      TIL
Domain                  388 – 601       214     VWFD 2
Domain                  857 – 1062      206     VWFD 3
Domain                  1985 – 2197     213     VWFD 4
Domain                  2315 – 2386     72      VWFC 1
Domain                  2424 – 2491     68      VWFC 2
Domain                  2575 – 2660     86      CTCK
Compositional bias      1257 – 1913     657     Thr-rich

Amino acid modifications

Modified residue        21              1       Phosphoserine. By similarity (UniProtKB Q02817)
Glycosylation           667             1       N-linked (GlcNAc...). Potential
Glycosylation           1227            1       N-linked (GlcNAc...). Potential
Glycosylation           1849            1       N-linked (GlcNAc...). Potential
Disulfide bond          56 ↔ 64                 By similarity
Disulfide bond          2575 ↔ 2622             By similarity
Disulfide bond          2589 ↔ 2636             By similarity
Disulfide bond          2598 ↔ 2652             By similarity
Disulfide bond          2602 ↔ 2654             By similarity
Disulfide bond          ? ↔ 2659                By similarity

Experimental info

Sequence conflict       301             1       E → G in AAD01593. Ref.2
Sequence conflict       2120            1       V → G in AAH30862. Ref.3
Sequence conflict       2120            1       V → G in AAH36168. Ref.3
Sequence conflict       2398            1       T → P in AAH36168. Ref.3
Non-adjacent residues   1442 – 1443     2
Non-adjacent residues   1758 – 1759     2
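
The Length column in the feature table follows from the position range when positions are read as 1-based and inclusive, so length = end - start + 1. The following is a minimal Python sketch of that check, using the ranges listed in the table. The tuple layout and the helper names `feature_length` and `mismatches` are illustrative choices, not part of the entry.

```python
# Feature ranges copied from the table: (description, start, end, listed length).
# Positions are 1-based and inclusive.
FEATURES = [
    ("Signal peptide", 1, 20, 20),
    ("Chain", 21, 2680, 2660),
    ("VWFD 1", 33, 238, 206),
    ("TIL", 295, 351, 57),
    ("VWFD 2", 388, 601, 214),
    ("VWFD 3", 857, 1062, 206),
    ("VWFD 4", 1985, 2197, 213),
    ("VWFC 1", 2315, 2386, 72),
    ("VWFC 2", 2424, 2491, 68),
    ("CTCK", 2575, 2660, 86),
    ("Thr-rich", 1257, 1913, 657),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive start and end positions."""
    return end - start + 1


def mismatches(features):
    """Names of features whose listed length disagrees with their range."""
    return [
        name
        for name, start, end, listed in features
        if feature_length(start, end) != listed
    ]


if __name__ == "__main__":
    # An empty list means every listed length agrees with its range.
    print(mismatches(FEATURES))
```

The same convention applies when slicing the sequence: a feature spanning start to end corresponds to the zero-based Python slice `seq[start - 1:end]`.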

Sequences

Q80Z19 [UniParc].

Last modified June 16, 2009. Version 2.
Checksum: D67D8664AAB15933
Length: 2,680
Mass (Da): 293,436
        10         20         30         40         50         60 
MGLPLARLVA ACLVLALAKG SELQKEARSR NHVCSTWGDF HYKTFDGDVY RFPGLCDYNF 

        70         80         90        100        110        120 
ASDCRDSYKE FAVHLKRGLG EAGGHSQIES ILITIKDDTI YLTHKLAVVN GAMVSTPHYS 

       130        140        150        160        170        180 
SGLLIEKNDA YTKVYSRAGL SLMWNREDAL MVELDSRFQN HTCGLCGDFN GMQTNYEFLS 

       190        200        210        220        230        240 
EEGIQFSAIE FGNMQKINKP EVQCEDPEAV QEPESCSEHR AECERLLTSA AFEDCQTRVP 

       250        260        270        280        290        300 
VESYVRACMH DRCQCPKGGA CECSTLAEFS RQCSHAGGRP ENWRTASLCP KKCPNNMVYL 

       310        320        330        340        350        360 
ESSSPCVDTC SHLEVSSLCE EHYMDGCFCP EGTVYDDITG SGCIPVSQCH CKLHGHLYMP 

       370        380        390        400        410        420 
GQEFTNDCEQ CVCNAGRWVC KDLPCPETCA LEGGSHITTF DGKKFTFHGD CYYVLTKSEH 

       430        440        450        460        470        480 
NDSYALLGEL ASCGSTDKQT CLKTVVLLTD DKKNVVAFKS GGSVLLNEME VTLPHVAASF 

       490        500        510        520        530        540 
SIFQPSSYHI VVNTKFGLRL QIQLLPVMQL FVTLDQAAQG QVQGLCGNFN GLESDDFMTS 

       550        560        570        580        590        600 
GGMVEATGAG FANTWKAQSS CHDKLDWLDD PCSLNIETNY AEHWCSLLKR SETPFARCHL 

       610        620        630        640        650        660 
AVDPTEYYKR CKYDTCNCQN NEDCMCAALS SYARACAAKG VMLWGWRERV CNKDVHACPS 

       670        680        690        700        710        720 
SQIFMYNLTT CQQTCRSLSE GDSHCLKGFA PVEGCGCPDH TFMDEKGRCV PLAKCSCYHH 

       730        740        750        760        770        780 
GLYLEAGDVI LRQEERCICR NGRLQCTQVK LIGHTCQYPK ILVDCNNLTA LAVRKPRPTS 

       790        800        810        820        830        840 
CQTLVAGYYH TECISGCVCP DGLLDDGRGG CVEEDKCPCI HNKDLYSSGE SIKLDCNNTC 

       850        860        870        880        890        900 
TCQKGRWECT RYACHSTCSI YGSGHYITFD GKHYDFDGHC SYVAVQDYCG QNSTGSFSII 

       910        920        930        940        950        960 
TENVPCGTTG VTCSKAIKIF IGGTELKLVD KHRVVKQLEE GHHVPYITRE VGQYLVVEAS 

       970        980        990       1000       1010       1020 
SGIIVIWDKK TTIFIKLDPS YKGTVCGLCG NFDDQTKNDF TTRDHMVVTS ELDFGNSWKE 

      1030       1040       1050       1060       1070       1080 
ASTCPDVSHN PDPCSLNPHR RSWAEKQCSI IKSRVFKVCH SKVDPTVFYE ACVHDSCSCD 

      1090       1100       1110       1120       1130       1140 
TGGDCDCFCS AVASYAQECT KAEACVFWRT PDLCPIFCDY YNPPDECEWH YEPCGNRSFE 

      1150       1160       1170       1180       1190       1200 
TCRTLNGIHS NISVSYLEGC YPRCPEDRPI YDEDLKKCVT GDKCGCYIED TRYPPGGSVP 

      1210       1220       1230       1240       1250       1260 
TDEICKSCTC TNTSKIECHP DEGKILNMTQ DGIFCYWEFC GPNGTVGQHF NICGSSTAIP 

      1270       1280       1290       1300       1310       1320 
STTTSFTTIS TPISTTPIST TITTTTVTMT TEQVPCCFWS DWINKYHPTK ENGGDRETFT 

      1330       1340       1350       1360       1370       1380 
HVCSAPEDIE CRAATDPKLS WEELGQKVQC NVSTGLICNN EDQYGIGEFE LCYDYEIRVN 

      1390       1400       1410       1420       1430       1440 
CCYPMEYCTP STISPTTSTT TLSTTPPTSS PTTLPTSSPV TSSATLPTTS SITSTISPTT 

      1450       1460       1470       1480       1490       1500 
SPSTATQTIS VTTSQTSSSA TPPNSSPTSS ATTSPTTSSG TSTATSPSTS PTTSSTFTTP 

      1510       1520       1530       1540       1550       1560 
PSTTCIDDCK WTGWLDSGKP TYDIKSGDFE LIKGVCEPHW EVQNISCRAV MHSNIPLDQL 

      1570       1580       1590       1600       1610       1620 
GQIVVCNKEV GLVCKNEDQE IGGIIPMRMC LNYEINVYCC NPICFTSTPS STTTETPTTT 

      1630       1640       1650       1660       1670       1680 
STTKTSILTS TTTQTPSPSP TTTVTPTPAP TTTQIPTSTS TTTQTTTPTP ITETSTPTST 

      1690       1700       1710       1720       1730       1740 
ISQTPSPAST TTVTPATTST TTETSTSTST TTQTTSPTPT VTETSTPRST TTQTPSPVPT 

      1750       1760       1770       1780       1790       1800 
TTVTSTPTPT IGETTTPKRP PSTSTPTSFT VPTETTTQTR PLSTTPTTLE TTRTSSWGTF 

      1810       1820       1830       1840       1850       1860 
SSTSPITSPS TVWTHTETQV TCCVLNEMFY GPGELVYNST HGGTCFYVNC SLDCHLQFFN 

      1870       1880       1890       1900       1910       1920 
WSCPSTPSTP TPSTPTPTPS QTTTPSTTSS KSTPSTPQST SPKSTLSTPT KTTPYGCPDF 

      1930       1940       1950       1960       1970       1980 
DPPRQVNETW WLCNCTMAIC NHDNVVEIVP LKCDPPPMPT CANGLKPVRV PDADNCCWHW 

      1990       2000       2010       2020       2030       2040 
ECDCYCTGWG DPHFVTFDGL YYSYQGNCTY VLVEEITPTV DNFGVYIDNY HCDANDKVSC 

      2050       2060       2070       2080       2090       2100 
PRTLIVRHET QEVQIKTVRM MPIEVEVQVN KQLVALPYKK YGLEVYESGI NIVVNISRLE 

      2110       2120       2130       2140       2150       2160 
AKISYNGLSF SIRLPYKLFV NNTKGQCGTC TNNTADDCIL PSGKIISDCE IAADEWLVND 

      2170       2180       2190       2200       2210       2220 
PSKPHCPHKG LTTKRPATTT PGLSLNNCTV SPVCHLIMDS LFSQCHAFVP PKHYYEACLF 

      2230       2240       2250       2260       2270       2280 
DSCYVPGSNM ECASVQAYAT LCAKEGVCID WRNHTQGVCS VKCPPHKQYQ ACGPEEEPTC 

      2290       2300       2310       2320       2330       2340 
QPSSSQNSTL LVEGCFCPEG TTKFAPGYDV CVKTCGCVGP DNVPREFGEH FEFDCKDCVC 

      2350       2360       2370       2380       2390       2400 
REGGSGIVCQ PKKCSGGNQT TCEEDGTYLV VETNPDDKCC NITSCKCDTK RCKAERPTCL 

      2410       2420       2430       2440       2450       2460 
LGFEVKTEIV PGKCCPVYSC VPKGVCVHQN AEYQPGSPVY SNKCQDCVCT NILDNSTQLN 

      2470       2480       2490       2500       2510       2520 
VISCTHVPCN ISCSSGFELV DVPGECCKKC QQTHCIIEGP KQQYIILKPG EIHKNPSNKC 

      2530       2540       2550       2560       2570       2580 
TFFSCMKINN QLISSVSNIT CPDFNPSDCV SGSITYMPNG CCKTCIPQNQ TRVPCSAVSV 

      2590       2600       2610       2620       2630       2640 
MKEISYNGCT KNISMNYCFG SCGTFAMYSA QVQGLDHRCS CCKEEKTSVR SVTLECPDGS 

      2650       2660       2670       2680 
ELSHTYTHIE SCLCQDTVCG LPQAQQVRTR RSSPRFLGRK 
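
The display format interleaves ruler lines of position numbers with groups of ten residues. The sketch below, in Python, shows one way to turn that layout into a plain sequence string and to look up annotated positions with 1-based coordinates. It uses only the first two display blocks (residues 1 to 120). The function names `parse_display`, `residue_at` and `region` are hypothetical helpers introduced for illustration.

```python
# First two display blocks of the sequence (residues 1-120), as shown on the
# page: a ruler line of position numbers, then six groups of ten residues.
DISPLAY = """
        10         20         30         40         50         60
MGLPLARLVA ACLVLALAKG SELQKEARSR NHVCSTWGDF HYKTFDGDVY RFPGLCDYNF

        70         80         90        100        110        120
ASDCRDSYKE FAVHLKRGLG EAGGHSQIES ILITIKDDTI YLTHKLAVVN GAMVSTPHYS
"""


def parse_display(text: str) -> str:
    """Drop ruler lines (digits only) and join residue groups into one string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue  # blank line or position ruler
        residues.extend(tokens)
    return "".join(residues)


def residue_at(seq: str, pos: int) -> str:
    """Residue at a 1-based position."""
    return seq[pos - 1]


def region(seq: str, start: int, end: int) -> str:
    """Subsequence for a 1-based inclusive range."""
    return seq[start - 1:end]


if __name__ == "__main__":
    seq = parse_display(DISPLAY)
    print(len(seq))                                   # residues parsed
    print(region(seq, 1, 20))                         # annotated signal peptide
    print(residue_at(seq, 56), residue_at(seq, 64))   # annotated disulfide 56 <-> 64
```

In this excerpt, positions 56 and 64 are both cysteines, which is consistent with the disulfide bond annotated between them.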

References

[1] "The mouse secreted gel-forming mucin gene cluster."
Escande F., Porchet N., Bernigaud A., Petitprez D., Aubert J.-P., Buisine M.-P.
Biochim. Biophys. Acta 1676:240-250(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-1442 AND 1759-2680, TISSUE SPECIFICITY.
Strain: C57BL/6J.
[2] "Gastrointestinal expression and partial cDNA cloning of murine Muc2."
van Klinken B.J.-W., Einerhand A.W.C., Duits L.A., Makkink M.K., Tytgat K.M.A.J., Renes I.B., Verburg M., Bueller H.A., Dekker J.
Am. J. Physiol. 276:G115-G124(1999) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1-301, TISSUE SPECIFICITY, GLYCOSYLATION.
Strain: 129.
Tissue: Colon.
[3] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1443-1758 AND 1795-2680.
Strain: FVB/N.
Tissue: Colon.
[4] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 2199-2680.
Strain: C57BL/6J.
Tissue: Small intestine.
[5] "Colorectal cancer in mice genetically deficient in the mucin Muc2."
Velcich A., Yang W., Heyer J., Fragale A., Nicholas C., Viani S., Kucherlapati R., Lipkin M., Yang K., Augenlicht L.
Science 295:1726-1729(2002) [PubMed] [Europe PMC] [Abstract]
Cited for: DISRUPTION PHENOTYPE.
[6] "The inner of the two Muc2 mucin-dependent mucus layers in colon is devoid of bacteria."
Johansson M.E.V., Phillipson M., Petersson J., Velcich A., Holm L., Hansson G.C.
Proc. Natl. Acad. Sci. U.S.A. 105:15064-15069(2008) [PubMed] [Europe PMC] [Abstract]
Cited for: FUNCTION, SUBCELLULAR LOCATION, DISRUPTION PHENOTYPE.
[7] "Proteomic analyses of the two mucus layers of the colon barrier reveal that their main component, the Muc2 mucin, is strongly bound to the Fcgbp protein."
Johansson M.E.V., Thomsson K.A., Hansson G.C.
J. Proteome Res. 8:3549-3557(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: IDENTIFICATION BY MASS SPECTROMETRY, FUNCTION, INTERACTION WITH FCGBP, SUBCELLULAR LOCATION.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  AJ511872 Genomic DNA. Translation: CAD54414.1.
  AJ511873, AJ511874 Genomic DNA. Translation: CAD54416.1.
  AF016695 mRNA. Translation: AAD01593.1.
  BC024540 mRNA. No translation available.
  BC030862 mRNA. Translation: AAH30862.1. Different initiation.
  BC036168 mRNA. Translation: AAH36168.1.
  AK008250 mRNA. Translation: BAB25557.1.
UniGene: Mm.2041.
  Mm.461296.

3D structure databases

ProteinModelPortal: Q80Z19.
SMR: Q80Z19. Positions 2260-2315.
ModBase: Search...
MobiDB: Search...

Proteomic databases

PaxDb: Q80Z19.
PRIDE: Q80Z19.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

Ensembl: ENSMUST00000026590; ENSMUSP00000026590; ENSMUSG00000025515.

Organism-specific databases

MGI: MGI:1339364. Muc2.

Phylogenomic databases

eggNOG: NOG12793.
GeneTree: ENSGT00730000110990.
HOGENOM: HOG000203111.
InParanoid: Q8K0Q1.
OrthoDB: EOG73V6J9.
TreeFam: TF337106.

Gene expression databases

Genevestigator: Q80Z19.

Family and domain databases

InterPro: IPR006207. Cys_knot_C.
  IPR002919. TIL_dom.
  IPR014853. Unchr_dom_Cys-rich.
  IPR001007. VWF_C.
  IPR001846. VWF_type-D.
  IPR025155. WxxW_domain.
Pfam: PF08742. C8. 4 hits.
  PF13330. Mucin2_WxxW. 2 hits.
  PF01826. TIL. 1 hit.
  PF00094. VWD. 4 hits.
SMART: SM00832. C8. 4 hits.
  SM00041. CT. 1 hit.
  SM00214. VWC. 3 hits.
  SM00216. VWD. 4 hits.
SUPFAM: SSF57567. SSF57567. 4 hits.
PROSITE: PS01185. CTCK_1. 1 hit.
  PS01225. CTCK_2. 1 hit.
  PS01208. VWFC_1. 2 hits.
  PS50184. VWFC_2. 2 hits.
  PS51233. VWFD. 4 hits.
ProtoNet: Search...

Other

ChiTaRS: MUC2. mouse.
PRO: Q80Z19.
SOURCE: Search...

Entry information

Entry name: MUC2_MOUSE
Accession
  Primary (citable) accession number: Q80Z19
  Secondary accession number(s): Q0P637, Q80Z17, Q8K0Q1, Q9CVG8, Q9Z2U5
Entry history
  Integrated into UniProtKB/Swiss-Prot: June 16, 2009
  Last sequence update: June 16, 2009
  Last modified: April 16, 2014
  This is version 77 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
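
The primary and secondary accession numbers listed for this entry all share the six-character UniProtKB accession layout. As a rough illustration, the Python sketch below checks them against the accession pattern that UniProt publishes in its help pages. The pattern is reproduced here as an assumption and should be checked against the current documentation before being relied on. The names `ACCESSION_RE` and `is_accession` are hypothetical.

```python
import re

# Accession pattern as published in the UniProt help on accession numbers
# (an assumption in this sketch; verify against the current documentation).
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)

# Primary accession followed by the secondary accessions listed in the entry.
ACCESSIONS = ["Q80Z19", "Q0P637", "Q80Z17", "Q8K0Q1", "Q9CVG8", "Q9Z2U5"]


def is_accession(text: str) -> bool:
    """True if the whole string has the shape of a UniProtKB accession."""
    return ACCESSION_RE.fullmatch(text) is not None


if __name__ == "__main__":
    for acc in ACCESSIONS:
        print(acc, is_accession(acc))
    # The entry name is a different kind of identifier and does not match.
    print("MUC2_MOUSE", is_accession("MUC2_MOUSE"))
```

A pattern match only confirms the shape of an identifier. It does not show that the accession exists or that it still points to this entry.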

Relevant documents

SIMILARITY comments: Index of protein domains and families

MGD cross-references: Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot