Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P31809 (CEAM1_MOUSE)

Last modified June 16, 2009. Version 86. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Carcinoembryonic antigen-related cell adhesion molecule 1
Alternative name(s):
    Biliary glycoprotein 1
      Short name=BGP-1
    Murine hepatitis virus receptor
      Short name=MHV-R
    MHVR1
    Biliary glycoprotein D
    CD_antigen=CD66a
Gene names
Name: Ceacam1
Synonyms: Bgp, Bgp1, Bgpd
OrganismMus musculus (Mouse)
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMus

Protein attributes

Sequence length521 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Unknown. In case of murine coronavirus (MHV) infection, serves as receptor for MHV S1 spike glycoprotein. Ref.2 Ref.7

Subunit structure

Interacts with MHV S1 spike glycoprotein. Ref.2 Ref.1 Ref.9

Subcellular location

Cell membrane; Single-pass type I membrane protein. Ref.1

Sequence similarities

Belongs to the immunoglobulin superfamily. CEA family.

Contains 3 Ig-like C2-type (immunoglobulin-like) domains.

Contains 1 Ig-like V-type (immunoglobulin-like) domain.

Ontologies

Keywords
   Biological processHost-virus interaction
   Cellular componentCell membrane
Membrane
   Coding sequence diversityAlternative splicing
   DomainImmunoglobulin domain
Repeat
Signal
Transmembrane
   Molecular functionReceptor
   PTMDisulfide bond
Glycoprotein
   Technical term3D-structure
Direct protein sequencing
Gene Ontology (GO)
   Biological processinterspecies interaction between organisms

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentintegral to membrane

Inferred from electronic annotation. Source: UniProtKB-KW

plasma membrane

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionreceptor activity

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Alternative products

This entry describes 3 isoforms produced by alternative splicing. [Align] [Select]
Isoform Long (identifier: P31809-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform Short (identifier: P31809-2)

The sequence of this isoform differs from the canonical sequence as follows:
     455-458: GSDQ → SGSF
     459-521: Missing.
Isoform 3 (identifier: P31809-3)

The sequence of this isoform differs from the canonical sequence as follows:
     142-142: P → Q
     143-322: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 3434 Ref.5
Chain35 – 521487Carcinoembryonic antigen-related cell adhesion molecule 1
PRO_0000014563

Regions

Topological domain35 – 428394Extracellular Potential
Transmembrane429 – 44719 Potential
Topological domain448 – 52174Cytoplasmic Potential
Domain35 – 142108Ig-like V-type
Domain147 – 23488Ig-like C2-type 1
Domain239 – 31981Ig-like C2-type 2
Domain323 – 41189Ig-like C2-type 3

Amino acid modifications

Glycosylation711N-linked (GlcNAc...) Ref.9
Glycosylation891N-linked (GlcNAc...) Ref.9
Glycosylation1041N-linked (GlcNAc...) Ref.9
Glycosylation1481N-linked (GlcNAc...) Potential
Glycosylation1521N-linked (GlcNAc...) Potential
Glycosylation1991N-linked (GlcNAc...) Potential
Glycosylation2061N-linked (GlcNAc...) Ref.8
Glycosylation2101N-linked (GlcNAc...) Potential
Glycosylation2261N-linked (GlcNAc...) Potential
Glycosylation2581N-linked (GlcNAc...) Potential
Glycosylation2901N-linked (GlcNAc...) Potential
Glycosylation2941N-linked (GlcNAc...) Potential
Glycosylation3041N-linked (GlcNAc...) Potential
Glycosylation3171N-linked (GlcNAc...) Potential
Glycosylation3331N-linked (GlcNAc...) Ref.9
Glycosylation3751N-linked (GlcNAc...) Potential
Disulfide bond167 ↔ 217 Probable
Disulfide bond261 ↔ 301 Probable
Disulfide bond346 ↔ 394 Ref.9

Natural variations

Alternative sequence1421P → Q in isoform 3.
VSP_036040
Alternative sequence143 – 322180Missing in isoform 3.
VSP_036041
Alternative sequence455 – 4584GSDQ → SGSF in isoform Short.
VSP_002484
Alternative sequence459 – 52163Missing in isoform Short.
VSP_002485

Experimental info

Sequence conflict361 – 3622SQ → RE in CAA47699. Ref.3

Sequences

Sequence LengthMass (Da)Tools
Isoform Long [UniParc].

Last modified July 1, 1993. Version 1.
Checksum: 1C8F71FAC47DD54E

FASTA52157,016
        10         20         30         40         50         60 
MELASAHLHK GQVPWGGLLL TASLLASWSP ATTAEVTIEA VPPQVAEDNN VLLLVHNLPL 

        70         80         90        100        110        120 
ALGAFAWYKG NTTAIDKEIA RFVPNSNMNF TGQAYSGREI IYSNGSLLFQ MITMKDMGVY 

       130        140        150        160        170        180 
TLDMTDENYR RTQATVRFHV HPILLKPNIT SNNSNPVEGD DSVSLTCDSY TDPDNINYLW 

       190        200        210        220        230        240 
SRNGESLSEG DRLKLSEGNR TLTLLNVTRN DTGPYVCETR NPVSVNRSDP FSLNIIYGPD 

       250        260        270        280        290        300 
TPIISPSDIY LHPGSNLNLS CHAASNPPAQ YFWLINEKPH ASSQELFIPN ITTNNSGTYT 

       310        320        330        340        350        360 
CFVNNSVTGL SRTTVKNITV LEPVTQPFLQ VTNTTVKELD SVTLTCLSND IGANIQWLFN 

       370        380        390        400        410        420 
SQSLQLTERM TLSQNNSILR IDPIKREDAG EYQCEISNPV SVRRSNSIKL DIIFDPTQGG 

       430        440        450        460        470        480 
LSDGAIAGIV IGVVAGVALI AGLAYFLYSR KSGGGSDQRD LTEHKPSTSN HNLAPSDNSP 

       490        500        510        520 
NKVDDVAYTV LNFNSQQPNR PTSAPSSPRA TETVYSEVKK K 

« Hide

Isoform Short.

Checksum: 63E0EC6AC58BA660
Show »

FASTA45850,057
Isoform 3.

Checksum: 0ABE851287A77290
Show »

FASTA34137,271

References

« Hide 'large scale' references
[1]"Several members of the mouse carcinoembryonic antigen-related glycoprotein family are functional receptors for the coronavirus mouse hepatitis virus-A59."
Dveksler G.S., Dieffenback C.B., Cardellichio C.B., McCuaig K., Pensiero M.N., Jiang G.-S., Beauchemin N., Holmes K.V.
J. Virol. 67:1-8(1993) [PubMed: 8380065] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA], SUBCELLULAR LOCATION, ALTERNATIVE SPLICING, INTERACTION WITH MHV SPIKE GLYCOPROTEIN.
Strain: CD-1.
Tissue: Colon.
[2]"Cloning of the mouse hepatitis virus (MHV) receptor: expression in human and hamster cell lines confers susceptibility to MHV."
Dveksler G.S., Pensiero M.N., Cardellichio C.B., Williams R.K., Jiang G.-S., Holmes K.V., Dieffenbach C.W.
J. Virol. 65:6881-6891(1991) [PubMed: 1719235] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM SHORT), INTERACTION WITH MHV SPIKE GLYCOPROTEIN, CHARACTERIZATION OF MHV RECEPTOR FUNCTION.
Strain: BALB/c and CD-1.
Tissue: Colon and Liver.
[3]"Expression of the Bgp gene and characterization of mouse colon biliary glycoprotein isoforms."
McCuaig K., Rosenberg M., Nedellec P., Turbide C., Beauchemin N.
Gene 127:173-183(1993) [PubMed: 8500759] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS LONG AND 3).
[4]"A mouse analogue of the human carcinoembryonic antigen."
Beauchemin N., Turbide C., Afar D., Raymond M., Bell J., Stanners C.P., Fuks A.
Cancer Res. 49:2017-2021(1989) [PubMed: 2702644] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM SHORT).
Strain: CD-1.
Tissue: Colon.
[5]"Receptor for mouse hepatitis virus is a member of the carcinoembryonic antigen family of glycoproteins."
Williams R.K., Jiang G.-S., Holmes K.V.
Proc. Natl. Acad. Sci. U.S.A. 88:5533-5536(1991) [PubMed: 1648219] [Abstract]
Cited for: PROTEIN SEQUENCE OF 35-59.
[6]Lubec G., Sunyer B., Chen W.-Q.
Submitted (JAN-2009) to UniProtKB
Cited for: PROTEIN SEQUENCE OF 116-130, MASS SPECTROMETRY.
Strain: OF1.
Tissue: Hippocampus.
[7]"Ceacam1a-/- mice are completely resistant to infection by murine coronavirus mouse hepatitis virus A59."
Hemmila E., Turbide C., Olson M., Jothy S., Holmes K.V., Beauchemin N.
J. Virol. 78:10156-10165(2004) [PubMed: 15331748] [Abstract]
Cited for: CHARACTERIZATION OF MHV RECEPTOR FUNCTION.
[8]"Proteome-wide characterization of N-glycosylation events by diagonal chromatography."
Ghesquiere B., Van Damme J., Martens L., Vandekerckhove J., Gevaert K.
J. Proteome Res. 5:2438-2447(2006) [PubMed: 16944957] [Abstract]
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-206, MASS SPECTROMETRY.
Tissue: Plasma.
[9]"Crystal structure of murine sCEACAM1a[1,4]: a coronavirus receptor in the CEA family."
Tan K., Zelus B.D., Meijers R., Liu J.-H., Bergelson J.M., Duke N., Zhang R., Joachimiak A., Holmes K.V., Wang J.-H.
EMBO J. 21:2076-2086(2002) [PubMed: 11980704] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (3.32 ANGSTROMS) OF 35-236 (ISOFORM 3), GLYCOSYLATION AT ASN-71; ASN-89; ASN-104 AND ASN-333, DISULFIDE BOND, INTERACTION WITH MURINE CORONAVIRUS MHV S1 SPIKE GLYCOPROTEIN.
+Additional computationally mapped references.

Cross-references

Sequence databases

X15351 mRNA. Translation: CAA33409.1.
M77196 mRNA. Translation: AAA37858.1.
X67279 mRNA. Translation: CAA47696.1.
X67282 mRNA. Translation: CAA47699.1.
IPIIPI00108535.
IPI00227804.
IPI00915472.
PIRWMMSR1. JC1505.
JC1508.
JC1511.
RefSeqNP_001034274.1.
NP_001034275.1.
UniGeneMm.322502

3D structure databases

EntryMethodResolution (Å)ChainPositionsPDBsum
1L6ZX-ray3.32A35-416[»]
ModBaseSearch...

PTM databases

PhosphoSiteP31809.

Proteomic databases

PRIDEP31809.

Genome annotation databases

EnsemblENSMUSG00000074272. Mus musculus. [Contig view]
GeneID26365.
KEGGmmu:26365.

Organism-specific databases

MGIMGI:1347245. Ceacam1.

Phylogenomic databases

HOGENOMP31809.
HOVERGENP31809.

Gene expression databases

ArrayExpressP31809.
BgeeP31809.
CleanExMM_CEACAM1.
GermOnlineENSMUSG00000074272. Mus musculus.

Family and domain databases

InterProIPR013151. Ig.
IPR007110. Ig-like.
IPR013783. Ig-like_fold.
IPR003598. Ig_sub2.
IPR013106. Ig_V-set.
[Graphical view]
Gene3DG3DSA:2.60.40.10. Ig-like_fold. 4 hits.
PfamPF00047. ig. 3 hits.
PF07686. V-set. 1 hit.
[Graphical view]
SMARTSM00408. IGc2. 3 hits.
[Graphical view]
PROSITEPS50835. IG_LIKE. 3 hits.
[Graphical view]
ProtoNetSearch...

Other Resources

NextBio304235.
SOURCESearch...

Entry information

Entry nameCEAM1_MOUSE
AccessionPrimary (citable) accession number: P31809
Secondary accession number(s): Q61353
Entry history
Integrated into UniProtKB/Swiss-Prot: July 1, 1993
Last sequence update: July 1, 1993
Last modified: June 16, 2009
This is version 86 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents