Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Regenerating islet-derived protein 4

Gene

Reg4

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: Annotation score: 2 out of 5-Experimental evidence at transcript leveli

Functioni

Calcium-independent lectin displaying mannose-binding specificity and able to maintain carbohydrate recognition activity in an acidic environment. May be involved in inflammatory and metaplastic responses of the gastrointestinal epithelium (By similarity).By similarity

GO - Molecular functioni

Complete GO annotation...

Keywords - Ligandi

Lectin

Names & Taxonomyi

Protein namesi
Recommended name:
Regenerating islet-derived protein 4
Short name:
REG-4
Gene namesi
Name:Reg4
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 3

Organism-specific databases

MGIiMGI:1914959. Reg4.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Secreted

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Signal peptidei1 – 2222By similarityAdd
BLAST
Chaini23 – 157135Regenerating islet-derived protein 4PRO_0000017438Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Disulfide bondi29 ↔ 40PROSITE-ProRule annotation
Glycosylationi49 – 491N-linked (GlcNAc...)Sequence analysis
Disulfide bondi57 ↔ 153PROSITE-ProRule annotation
Glycosylationi62 – 621N-linked (GlcNAc...)Sequence analysis
Disulfide bondi128 ↔ 145PROSITE-ProRule annotation

Keywords - PTMi

Disulfide bond, Glycoprotein

Proteomic databases

MaxQBiQ9D8G5.
PaxDbiQ9D8G5.
PRIDEiQ9D8G5.

PTM databases

PhosphoSiteiQ9D8G5.

Expressioni

Gene expression databases

BgeeiQ9D8G5.
CleanExiMM_REG4.
GenevisibleiQ9D8G5. MM.

Interactioni

Protein-protein interaction databases

STRINGi10090.ENSMUSP00000029469.

Structurei

3D structure databases

ProteinModelPortaliQ9D8G5.
SMRiQ9D8G5. Positions 28-156.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini36 – 154119C-type lectinPROSITE-ProRule annotationAdd
BLAST

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni97 – 1026Carbohydrate-bindingBy similarity
Regioni134 – 1363Carbohydrate-bindingBy similarity

Sequence similaritiesi

Contains 1 C-type lectin domain.PROSITE-ProRule annotation

Keywords - Domaini

Signal

Phylogenomic databases

eggNOGiKOG4297. Eukaryota.
ENOG410XPJ1. LUCA.
GeneTreeiENSGT00700000104249.
HOGENOMiHOG000010281.
HOVERGENiHBG004151.
InParanoidiQ9D8G5.
OMAiCNKRQHF.
OrthoDBiEOG7FFMQR.
PhylomeDBiQ9D8G5.

Family and domain databases

Gene3Di3.10.100.10. 1 hit.
InterProiIPR001304. C-type_lectin.
IPR016186. C-type_lectin-like.
IPR016187. C-type_lectin_fold.
[Graphical view]
PfamiPF00059. Lectin_C. 1 hit.
[Graphical view]
SMARTiSM00034. CLECT. 1 hit.
[Graphical view]
SUPFAMiSSF56436. SSF56436. 1 hit.
PROSITEiPS50041. C_TYPE_LECTIN_2. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

Q9D8G5-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MASKGVRLLL LLSWVAGPEV LSDILRPSCA PGWFYYRSHC YGYFRKLRNW
60 70 80 90 100
SHAELECQSY GNGSHLASVL NQKEASVISK YITGYQRNLP VWIGLHDPQK
110 120 130 140 150
KQLWQWTDGS TNLYRRWNPR TKSEARHCAE MNPKDKFLTW NKNGCANRQH

FLCKYKT
Length:157
Mass (Da):18,398
Last modified:June 1, 2001 - v1
Checksum:iF3981722BBD83968
GO

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti3 – 31S → Y in BAB25669 (PubMed:16141072).Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK008049 mRNA. Translation: BAB25429.1.
AK008438 mRNA. Translation: BAB25669.1.
AK162316 mRNA. Translation: BAE36851.1.
BC019465 mRNA. Translation: AAH19465.1.
CCDSiCCDS17661.1.
RefSeqiNP_080604.2. NM_026328.2.
UniGeneiMm.46306.

Genome annotation databases

EnsembliENSMUST00000029469; ENSMUSP00000029469; ENSMUSG00000027876.
GeneIDi67709.
KEGGimmu:67709.
UCSCiuc008qpq.1. mouse.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK008049 mRNA. Translation: BAB25429.1.
AK008438 mRNA. Translation: BAB25669.1.
AK162316 mRNA. Translation: BAE36851.1.
BC019465 mRNA. Translation: AAH19465.1.
CCDSiCCDS17661.1.
RefSeqiNP_080604.2. NM_026328.2.
UniGeneiMm.46306.

3D structure databases

ProteinModelPortaliQ9D8G5.
SMRiQ9D8G5. Positions 28-156.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi10090.ENSMUSP00000029469.

PTM databases

PhosphoSiteiQ9D8G5.

Proteomic databases

MaxQBiQ9D8G5.
PaxDbiQ9D8G5.
PRIDEiQ9D8G5.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000029469; ENSMUSP00000029469; ENSMUSG00000027876.
GeneIDi67709.
KEGGimmu:67709.
UCSCiuc008qpq.1. mouse.

Organism-specific databases

CTDi83998.
MGIiMGI:1914959. Reg4.

Phylogenomic databases

eggNOGiKOG4297. Eukaryota.
ENOG410XPJ1. LUCA.
GeneTreeiENSGT00700000104249.
HOGENOMiHOG000010281.
HOVERGENiHBG004151.
InParanoidiQ9D8G5.
OMAiCNKRQHF.
OrthoDBiEOG7FFMQR.
PhylomeDBiQ9D8G5.

Miscellaneous databases

NextBioi325329.
PROiQ9D8G5.
SOURCEiSearch...

Gene expression databases

BgeeiQ9D8G5.
CleanExiMM_REG4.
GenevisibleiQ9D8G5. MM.

Family and domain databases

Gene3Di3.10.100.10. 1 hit.
InterProiIPR001304. C-type_lectin.
IPR016186. C-type_lectin-like.
IPR016187. C-type_lectin_fold.
[Graphical view]
PfamiPF00059. Lectin_C. 1 hit.
[Graphical view]
SMARTiSM00034. CLECT. 1 hit.
[Graphical view]
SUPFAMiSSF56436. SSF56436. 1 hit.
PROSITEiPS50041. C_TYPE_LECTIN_2. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

  1. "The transcriptional landscape of the mammalian genome."
    Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.
    , Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
    Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Strain: C57BL/6J.
    Tissue: Colon and Small intestine.
  2. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Strain: FVB/N.
    Tissue: Colon.

Entry informationi

Entry nameiREG4_MOUSE
AccessioniPrimary (citable) accession number: Q9D8G5
Secondary accession number(s): Q3TS25, Q9D858
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 5, 2005
Last sequence update: June 1, 2001
Last modified: November 11, 2015
This is version 101 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.