Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

Q63994 (CD33_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 115. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Web links·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Myeloid cell surface antigen CD33
Alternative name(s):
Sialic acid-binding Ig-like lectin 3
Short name=Siglec-3
CD_antigen=CD33
Gene names
Name:Cd33
Synonyms:Siglec3
OrganismMus musculus (Mouse) [Reference proteome]
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus

Protein attributes

Sequence length403 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at transcript level

General annotation (Comments)

Function

Putative adhesion molecule of myelomonocytic-derived cells that mediates sialic-acid dependent binding to cells. Preferentially binds to alpha-2,6-linked sialic acid By similarity. The sialic acid recognition site may be masked by cis interactions with sialic acids on the same cell surface.

Subcellular location

Cell membrane; Single-pass type I membrane protein.

Sequence similarities

Belongs to the immunoglobulin superfamily. SIGLEC (sialic acid binding Ig-like lectin) family.

Contains 1 Ig-like C2-type (immunoglobulin-like) domain.

Contains 1 Ig-like V-type (immunoglobulin-like) domain.

Ontologies

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 33-B (identifier: Q63994-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 33-A (identifier: Q63994-2)

The sequence of this isoform differs from the canonical sequence as follows:
     287-403: RQEAITSYNH...MLLCVSLTLS → AHQQDSKVHS...GGKPQEYSEI

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1616 Potential
Chain17 – 403387Myeloid cell surface antigen CD33
PRO_0000014879

Regions

Topological domain18 – 240223Extracellular Potential
Transmembrane241 – 26727Helical; Potential
Topological domain268 – 403136Cytoplasmic Potential
Domain17 – 120104Ig-like V-type
Domain145 – 22884Ig-like C2-type

Sites

Binding site1181Sialic acid By similarity

Amino acid modifications

Glycosylation1101N-linked (GlcNAc...) Potential
Glycosylation1601N-linked (GlcNAc...) Potential
Glycosylation2301N-linked (GlcNAc...) Potential
Disulfide bond36 ↔ 169 By similarity
Disulfide bond41 ↔ 100 By similarity
Disulfide bond163 ↔ 212 By similarity

Natural variations

Alternative sequence287 – 403117RQEAI…SLTLS → AHQQDSKVHSNPENPRPLQK DSPQEQSSVHTKISLDFMGG KPQEYSEI in isoform 33-A.
VSP_002534

Sequences

Sequence LengthMass (Da)Tools
Isoform 33-B [UniParc].

Last modified November 1, 1996. Version 1.
Checksum: F1FE6D5C393F0FF1

FASTA40344,824
        10         20         30         40         50         60 
MLWPLPLFLL CAGSLAQDLE FQLVAPESVT VEEGLCVHVP CSVFYPSIKL TLGPVTGSWL 

        70         80         90        100        110        120 
RKGVSLHEDS PVATSDPRQL VQKATQGRFQ LLGDPQKHDC SLFIRDAQKN DTGMYFFRVV 

       130        140        150        160        170        180 
REPFVRYSYK KSQLSLHVTS LSRTPDIIIP GTLEAGYPSN LTCSVPWACE QGTPPTFSWM 

       190        200        210        220        230        240 
STALTSLSSR TTDSSVLTFT PQPQDHGTKL TCLVTFSGAG VTVERTIQLN VTRKSGQMRE 

       250        260        270        280        290        300 
LVLVAVGEAT VKLLILGLCL VFLIVMFCRR KTTKLSVHMG CENPIKRQEA ITSYNHCLSP 

       310        320        330        340        350        360 
TASDAVTPGC SIHRLISRTP RCTAILRIQD PYRRTHLRNR AVSTLRFPWI SWEGSLRSTQ 

       370        380        390        400 
RSKCTKLCSP VKNLCPLWLP VDNSCIPLIP EWVMLLCVSL TLS 

« Hide

Isoform 33-A [UniParc].

Checksum: A3EC9608B5AFE617
Show »

FASTA33436,951

References

« Hide 'large scale' references
[1]"Molecular cloning of two isoforms of the murine homolog of the myeloid CD33 antigen."
Tchilian E.Z., Beverley P.C., Young B.D., Watt S.M.
Blood 83:3188-3198(1994) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 33-A AND 33-B).
Strain: BALB/c.
Tissue: Bone marrow.
[2]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 33-A).
Tissue: Brain.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
S71345 mRNA. Translation: AAB30842.1.
S71403 mRNA. Translation: AAB30843.2.
BC132379 mRNA. Translation: AAI32380.1.
CCDSCCDS21173.1. [Q63994-2]
CCDS52224.1. [Q63994-1]
RefSeqNP_001104528.1. NM_001111058.1. [Q63994-1]
NP_067268.1. NM_021293.3. [Q63994-2]
UniGeneMm.140157.

3D structure databases

ProteinModelPortalQ63994.
SMRQ63994. Positions 21-232.
ModBaseSearch...
MobiDBSearch...

PTM databases

PhosphoSiteQ63994.

Proteomic databases

PRIDEQ63994.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENSMUST00000004728; ENSMUSP00000004728; ENSMUSG00000004609. [Q63994-1]
ENSMUST00000039861; ENSMUSP00000045458; ENSMUSG00000004609. [Q63994-2]
GeneID12489.
KEGGmmu:12489.
UCSCuc009gmy.2. mouse. [Q63994-2]
uc009gna.1. mouse. [Q63994-1]

Organism-specific databases

CTD945.
MGIMGI:99440. Cd33.

Phylogenomic databases

eggNOGNOG320441.
GeneTreeENSGT00560000076846.
HOGENOMHOG000236324.
InParanoidQ63994.
KOK06473.
OMAHEDSPVA.
OrthoDBEOG73JKV5.
PhylomeDBQ63994.
TreeFamTF332441.

Gene expression databases

BgeeQ63994.
CleanExMM_CD33.
GenevestigatorQ63994.

Family and domain databases

Gene3D2.60.40.10. 2 hits.
InterProIPR007110. Ig-like_dom.
IPR013783. Ig-like_fold.
IPR003599. Ig_sub.
IPR013106. Ig_V-set.
[Graphical view]
PfamPF07686. V-set. 1 hit.
[Graphical view]
SMARTSM00409. IG. 1 hit.
[Graphical view]
PROSITEPS50835. IG_LIKE. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio281404.
PROQ63994.
SOURCESearch...

Entry information

Entry nameCD33_MOUSE
AccessionPrimary (citable) accession number: Q63994
Secondary accession number(s): A2RT59, Q63997
Entry history
Integrated into UniProtKB/Swiss-Prot: October 19, 2002
Last sequence update: November 1, 1996
Last modified: July 9, 2014
This is version 115 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot