Protein

C-X-C chemokine receptor type 4

Gene

Cxcr4

Organism
Mus musculus (Mouse)
Status
Reviewed
Annotation score: 5 out of 5
Experimental evidence at protein level

Function

Receptor for the C-X-C chemokine CXCL12/SDF-1 that transduces a signal by increasing intracellular calcium ion levels and enhancing MAPK1/MAPK3 activation (PubMed:8962122, PubMed:9295051, PubMed:9103415). Involved in the AKT signaling cascade (By similarity). Plays a role in regulation of cell migration, e.g. during wound healing. Acts as a receptor for extracellular ubiquitin, leading to increased intracellular calcium ion levels and reduced cellular cAMP levels. Binds bacterial lipopolysaccharide (LPS) and mediates the LPS-induced inflammatory response, including TNF secretion by monocytes (By similarity). Involved in hematopoiesis and in cardiac ventricular septum formation (PubMed:9634237, PubMed:9634238, PubMed:9689100). Also plays an essential role in vascularization of the gastrointestinal tract, probably by regulating vascular branching and/or remodeling processes in endothelial cells (PubMed:9634237). Involved in cerebellar development. In the CNS, could mediate hippocampal-neuron survival (PubMed:9634238, PubMed:9689100). (By similarity; 6 publications)

Sites

Feature key   Position(s)  Length  Evidence       Description
Binding site  173          1       By similarity  Chemokine
Binding site  295          1       By similarity  Chemokine

GO - Molecular function

GO - Biological process

Keywords

Molecular function: G-protein coupled receptor, Receptor, Transducer

Names & Taxonomy

Protein names
Recommended name:
C-X-C chemokine receptor type 4
Short name:
CXC-R4
Short name:
CXCR-4
Alternative name(s):
Fusin (1 publication)
Leukocyte-derived seven transmembrane domain receptor
Short name:
LESTR (1 publication)
Pre-B-cell-derived chemokine receptor
Short name:
PB-CKR
Stromal cell-derived factor 1 receptor
Short name:
SDF-1 receptor
CD_antigen: CD184
Gene names
Name: Cxcr4
Synonyms: Cmkar4, Lestr, Sdf1r
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Myomorpha > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Unplaced

Organism-specific databases

Mouse genome database (MGD) from Mouse Genome Informatics (MGI)

MGI: MGI:109563 Cxcr4

Subcellular location

[Figure: subcellular location diagram. Graphics by Christian Stolte; Source: COMPARTMENTS]

Topology

Feature key         Position(s)  Length  Evidence       Description
Topological domain  1 – 40       40      Curated        Extracellular
Transmembrane       41 – 65      25      By similarity  Helical; Name=1
Topological domain  66 – 79      14      Curated        Cytoplasmic
Transmembrane       80 – 101     22      By similarity  Helical; Name=2
Topological domain  102 – 112    11      Curated        Extracellular
Transmembrane       113 – 132    20      By similarity  Helical; Name=3
Topological domain  133 – 156    24      Curated        Cytoplasmic
Transmembrane       157 – 176    20      By similarity  Helical; Name=4
Topological domain  177 – 202    26      Curated        Extracellular
Transmembrane       203 – 223    21      By similarity  Helical; Name=5
Topological domain  224 – 248    25      Curated        Cytoplasmic
Transmembrane       249 – 268    20      By similarity  Helical; Name=6
Topological domain  269 – 289    21      Curated        Extracellular
Transmembrane       290 – 309    20      By similarity  Helical; Name=7
Topological domain  310 – 359    50      Curated        Cytoplasmic
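
The topology annotations above can be sanity-checked programmatically. The following is a minimal sketch, assuming the start and end positions are transcribed correctly from the table; the names `SEGMENTS`, `check_tiling`, `helix_count` and `alternates_sides` are illustrative and not part of any UniProt tooling. It checks that the segments tile the 359-residue chain without gaps or overlaps, that there are seven helices, and that the non-membrane regions alternate between the two sides of the membrane.

```python
# Topological domains and transmembrane helices transcribed from the table
# above: (start, end, kind), with 1-based inclusive positions on the
# canonical isoform. The data structure itself is an illustrative assumption.
SEGMENTS = [
    (1, 40, "Extracellular"),
    (41, 65, "Helical"),
    (66, 79, "Cytoplasmic"),
    (80, 101, "Helical"),
    (102, 112, "Extracellular"),
    (113, 132, "Helical"),
    (133, 156, "Cytoplasmic"),
    (157, 176, "Helical"),
    (177, 202, "Extracellular"),
    (203, 223, "Helical"),
    (224, 248, "Cytoplasmic"),
    (249, 268, "Helical"),
    (269, 289, "Extracellular"),
    (290, 309, "Helical"),
    (310, 359, "Cytoplasmic"),
]
PROTEIN_LENGTH = 359


def check_tiling(segments, length):
    """Return True if the segments cover 1..length with no gaps or overlaps."""
    expected_start = 1
    for start, end, _kind in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == length + 1


def helix_count(segments):
    """Count the transmembrane helices."""
    return sum(1 for _s, _e, kind in segments if kind == "Helical")


def alternates_sides(segments):
    """Return True if consecutive non-membrane regions lie on opposite sides."""
    loops = [kind for _s, _e, kind in segments if kind != "Helical"]
    return all(a != b for a, b in zip(loops, loops[1:]))


if __name__ == "__main__":
    print("contiguous tiling:", check_tiling(SEGMENTS, PROTEIN_LENGTH))
    print("helices:", helix_count(SEGMENTS))
    print("sides alternate:", alternates_sides(SEGMENTS))
```

A check like this is mainly useful for catching transcription errors when feature tables are copied by hand.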

Keywords - Cellular component

Cell junction, Cell membrane, Endosome, Lysosome, Membrane

Pathology & Biotech

Disruption phenotype

Half of the embryos die by E17.5-E18.5 and neonates die within a few hours. Mutants display defective vascular development, cerebellar development, B-lymphopoiesis, myelopoiesis, and cardiogenesis with defective formation of the large vessels supplying the gastrointestinal tract (3 publications).

Chemistry databases

ChEMBL database of bioactive drug-like small molecules

ChEMBL: CHEMBL1250365

IUPHAR/BPS Guide to PHARMACOLOGY

GuidetoPHARMACOLOGY: 71

PTM / Processing

Molecule processing

Feature key  Feature identifier  Position(s)  Length  Description
Chain        PRO_0000069355      1 – 359      359     C-X-C chemokine receptor type 4

Amino acid modifications

Feature key       Position(s)  Length  Evidence                    Description
Modified residue  9            1       By similarity               Sulfotyrosine
Glycosylation     13           1       By similarity               N-linked (GlcNAc...) asparagine
Modified residue  14           1       By similarity               Sulfotyrosine
Glycosylation     20           1       By similarity               O-linked (Xyl...) (chondroitin sulfate) serine
Modified residue  23           1       By similarity               Sulfotyrosine
Disulfide bond    30 ↔ 281     -       PROSITE-ProRule annotation  -
Disulfide bond    111 ↔ 193    -       PROSITE-ProRule annotation  -
Modified residue  326          1       Combined sources            Phosphoserine
Modified residue  328          1       By similarity               Phosphoserine
Modified residue  331          1       By similarity               Phosphoserine; by PKC and GRK6
Modified residue  332          1       By similarity               Phosphoserine; by PKC and GRK6
Modified residue  337          1       By similarity               Phosphoserine; by GRK6
Cross-link        338          -       By similarity               Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin)
Modified residue  346          1       By similarity               Phosphoserine; by GRK6
Modified residue  355          1       By similarity               Phosphoserine
Modified residue  358          1       By similarity               Phosphoserine
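
Each modification listed above implies a particular residue at its position (Tyr for sulfotyrosine, Asn for N-linked glycosylation, Ser for phosphoserine and the O-linked site, Cys for disulfide bonds, Lys for the isopeptide cross-link). The sketch below checks those implied residues against the canonical sequence given later in this entry. The names `CANONICAL`, `SITES` and `mismatches` are illustrative, and the sequence string is transcribed by hand, so it should be treated as an assumption rather than an authoritative copy.

```python
# Canonical isoform (CXCR4-B, P70658-1), transcribed from the Sequences
# section of this entry. Hand-transcribed, so treat it as an assumption.
CANONICAL = (
    "MEPISVSIYTSDNYSEEVGSGDYDSNKEPCFRDENVHFNRIFLPTIYFII"
    "FLTGIVGNGLVILVMGYQKKLRSMTDKYRLHLSVADLLFVITLPFWAVDA"
    "MADWYFGKFLCKAVHIIYTVNLYSSVLILAFISLDRYLAIVHATNSQRPR"
    "KLLAEKAVYVGVWIPALLLTIPDFIFADVSQGDISQGDDRYICDRLYPDS"
    "LWMVVFQFQHIMVGLILPGIVILSCYCIIISKLSHSKGHQKRKALKTTVI"
    "LILAFFACWLPYYVGISIDSFILLGVIKQGCDFESIVHKWISITEALAFF"
    "HCCLNPILYAFLGAKFKSSAQHALNSMSRGSSLKILSKGKRGGHSSVSTE"
    "SESSSFHSS"
)

# 1-based position -> one-letter residue implied by the annotation type.
SITES = {
    9: "Y", 14: "Y", 23: "Y",            # sulfotyrosine
    13: "N",                             # N-linked glycosylation
    20: "S",                             # O-linked glycosylation
    30: "C", 281: "C", 111: "C", 193: "C",  # disulfide bond partners
    326: "S", 328: "S", 331: "S", 332: "S",
    337: "S", 346: "S", 355: "S", 358: "S",  # phosphoserine
    338: "K",                            # isopeptide cross-link lysine
}


def mismatches(sequence, sites):
    """Return {position: (expected, found)} for sites that disagree."""
    return {
        pos: (expected, sequence[pos - 1])
        for pos, expected in sites.items()
        if sequence[pos - 1] != expected
    }


if __name__ == "__main__":
    problems = mismatches(CANONICAL, SITES)
    print("all sites consistent" if not problems else problems)
```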

Post-translational modification

Phosphorylated on agonist stimulation. Rapidly phosphorylated on serine and threonine residues in the C-terminal region. Phosphorylation at Ser-331 and Ser-332 leads to recruitment of ITCH, ubiquitination and protein degradation (By similarity).
Ubiquitinated after ligand binding, leading to its degradation. Ubiquitinated by ITCH at the cell membrane on agonist stimulation. The ubiquitin-dependent mechanism, endosomal sorting complex required for transport (ESCRT), then targets CXCR4 for lysosomal degradation. This process also depends on prior Ser-/Thr-phosphorylation in the C-terminal region of CXCR4. Binding of ARRB1 to STAM also negatively regulates CXCR4 sorting to lysosomes through modulating ubiquitination of SFR5S (By similarity).
Sulfation is required for efficient binding of CXCL12/SDF-1alpha and promotes its dimerization (By similarity).
O- and N-glycosylated. N-glycosylation can mask coreceptor function. The O-linked chondroitin sulfate attachment affects neither the interaction with CXCL12/SDF-1alpha nor coreceptor activity (By similarity).

Keywords - PTM

Disulfide bond, Glycoprotein, Isopeptide bond, Phosphoprotein, Proteoglycan, Sulfation, Ubl conjugation

Proteomic databases

MaxQB - The MaxQuant DataBase

MaxQB: P70658

PaxDb, a database of protein abundance averages across all three domains of life

PaxDb: P70658

PeptideAtlas

PeptideAtlas: P70658

PRoteomics IDEntifications database

PRIDE: P70658

PTM databases

iPTMnet integrated resource for PTMs in systems biology context

iPTMnet: P70658

Comprehensive resource for the study of protein post-translational modifications (PTMs) in human, mouse and rat.

PhosphoSitePlus: P70658

Expression

Tissue specificity

Lymphocytes, macrophages, neutrophils, microglial cells and astrocytes. Found in spleen, thymus, bone marrow, lymph nodes and, at lower levels, in brain, small intestine, stomach and kidney. CXCR4-A is predominant in all tissues tested. During embryonic development, high levels are detected in the endothelium of developing blood vessels and in many regions of the developing brain including the olfactory epithelium, olfactory bulb, hippocampus, cerebellum and spinal cord (2 publications).

Developmental stage

High expression during embryonic development does not seem to be associated with the differentiation of any particular cell type, but is widely utilized when there is a requirement for cell movement. Frequently associated with less differentiated cell types and down-regulated with subsequent differentiation. Detected in sites with hemopoietic potential: the yolk sac (7.5, 8.5 and 12.5 dpc) and fetal liver (12.5 dpc). During gastrulation, at 7.2 to 7.8 dpc, expressed in the mesoderm and the definitive endoderm. As the gastrulation pattern fades (8.5 dpc), expression in the mesoderm is down-regulated, while it becomes predominant in neural ectoderm. Endodermal expression is retained in the foregut and later in a subset of foregut derivatives, including the stomach (10.5 dpc), the cystic ducts of the gall bladder and the lung epithelium (12.5 dpc). In neuronal tissue: at 10.5 and 12.5 dpc, expressed in the dorsal root ganglia, in the ventral mantle layer of the spinal cord (or basal plates) and in the hindbrain. At 14.5 dpc, expression is more tightly confined to the neural epithelium lining the ventricular space and to the external granular layer of the ventral rhombic lip (the developing cerebellum). Expressed in the outpocketing of the diencephalic floor at 10.5 dpc and in the developing thalamus and, to a lesser extent, the developing hypothalamus. At 14.5 dpc, restricted to the region where thalamus and hypothalamus meet. Detected in a discrete band of cells at the edge of the olfactory bulb. In the vascular system: expressed in the endothelium of numerous, but not all, blood vessels at 10.5, 11.5 and 12.5 dpc, such as vitelline/umbilical vessels, cardiac ventricular wall capillaries, facial vessels and, at 14.5 dpc, in the vasculature of the herniated gut. Expression seems to be associated with expanding vascular networks. During heart development, expressed at 10.5 dpc in the precursor to the aortopulmonary (AP) septum. At 12.5 dpc, detected in the AP septum at the base of the outflow tract and in the atrioventricular valves. Detected in craniofacial ectoderm from 10.5 to 14.5 dpc. At 10.5 and 11.5 dpc, expressed in the Rathke pouch (1 publication).

Gene expression databases

CleanEx database of gene expression profiles

CleanEx: MM_CXCR4

Interaction

Subunit structure

Monomer. Can form homodimers. Interacts with CD164. Interacts with ARRB2; the interaction is dependent on the C-terminal phosphorylation of CXCR4 and allows activation of MAPK1 and MAPK3. Interacts with ARRC; the interaction is dependent on the C-terminal phosphorylation of CXCR4 and modulates calcium mobilization. Interacts with RNF113A; the interaction, enhanced by CXCL12, promotes CXCR4 ubiquitination and subsequent degradation. Interacts (via the cytoplasmic C-terminal) with ITCH (via the WW domains I and II); the interaction, enhanced by CXCL12, promotes CXCR4 ubiquitination and leads to its degradation. Interacts with extracellular ubiquitin. Interacts with DBN1; this interaction is enhanced by antigenic stimulation. Following LPS binding, may form a complex with GDF5, HSP90AA1 and HSPA8 (By similarity).

GO - Molecular function

Protein-protein interaction databases

The Biological General Repository for Interaction Datasets (BioGrid)

BioGrid: 198767, 2 interactors

Protein interaction database and analysis system

IntAct: P70658, 1 interactor

STRING: functional protein association networks

STRING: 10090.ENSMUSP00000053489

Chemistry databases

BindingDB database of measured binding affinities

BindingDB: P70658

Structure

3D structure databases

Protein Model Portal of the PSI-Nature Structural Biology Knowledgebase

ProteinModelPortal: P70658

SWISS-MODEL Repository - a database of annotated 3D protein structure models

SMR: P70658

Family & Domains

Region

Feature key  Position(s)  Length  Evidence       Description
Region       1 – 23       23      By similarity  Important for chemokine binding and signaling
Region       96 – 99      4       By similarity  Chemokine binding
Region       115 – 119    5       By similarity  Chemokine binding
Region       137 – 149    13      By similarity  Involved in dimerization; when bound to chemokine
Region       193 – 197    5       By similarity  Chemokine binding, important for signaling
Region       198 – 217    20      By similarity  Involved in dimerization
Region       273 – 275    3       By similarity  Involved in dimerization

Motif

Feature key  Position(s)  Length  Evidence       Description
Motif        135 – 137    3       By similarity  Important for signaling

Sequence similarities

Belongs to the G-protein coupled receptor 1 family (PROSITE-ProRule annotation).

Keywords - Domain

Transmembrane, Transmembrane helix

Phylogenomic databases

evolutionary genealogy of genes: Non-supervised Orthologous Groups

eggNOG:
KOG3656 Eukaryota
ENOG410XRW9 LUCA

The HOGENOM Database of Homologous Genes from Fully Sequenced Organisms

HOGENOM: HOG000234122

The HOVERGEN Database of Homologous Vertebrate Genes

HOVERGEN: HBG106917

InParanoid: Eukaryotic Ortholog Groups

InParanoid: P70658

KEGG Orthology (KO)

KO: K04189

Database for complete collections of gene phylogenies

PhylomeDB: P70658

TreeFam database of animal gene trees

TreeFam: TF330966

Family and domain databases

Integrated resource of protein families, domains and functional sites

InterPro:
IPR022726 Chemokine_CXCR4_N_dom
IPR000355 Chemokine_rcpt
IPR001277 CXCR4/ACKR2
IPR000276 GPCR_Rhodpsn
IPR017452 GPCR_Rhodpsn_7TM

Pfam protein domain database

Pfam:
PF00001 7tm_1, 1 hit
PF12109 CXCR4_N, 1 hit

Protein Motif fingerprint database; a protein domain database

PRINTS:
PR00657 CCCHEMOKINER
PR00645 CXCCHMKINER4
PR00237 GPCRRHODOPSN

PROSITE; a protein domain and family database

PROSITE:
PS00237 G_PROTEIN_RECEP_F1_1, 1 hit
PS50262 G_PROTEIN_RECEP_F1_2, 1 hit

Sequences (2+)

Sequence status: Complete.

This entry describes 2 isoforms produced by alternative splicing.

This entry has 2 described isoforms and 2 potential isoforms that are computationally mapped.

Isoform CXCR4-B (identifier: P70658-1)
Also known as: LESTR-B

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MEPISVSIYT SDNYSEEVGS GDYDSNKEPC FRDENVHFNR IFLPTIYFII
        60         70         80         90        100
FLTGIVGNGL VILVMGYQKK LRSMTDKYRL HLSVADLLFV ITLPFWAVDA
       110        120        130        140        150
MADWYFGKFL CKAVHIIYTV NLYSSVLILA FISLDRYLAI VHATNSQRPR
       160        170        180        190        200
KLLAEKAVYV GVWIPALLLT IPDFIFADVS QGDISQGDDR YICDRLYPDS
       210        220        230        240        250
LWMVVFQFQH IMVGLILPGI VILSCYCIII SKLSHSKGHQ KRKALKTTVI
       260        270        280        290        300
LILAFFACWL PYYVGISIDS FILLGVIKQG CDFESIVHKW ISITEALAFF
       310        320        330        340        350
HCCLNPILYA FLGAKFKSSA QHALNSMSRG SSLKILSKGK RGGHSSVSTE
SESSSFHSS
Length: 359
Mass (Da): 40,426
Last modified: November 1, 1997 - v2
Checksum: 33D1B5552A31595B (CRC64, generator polynomial x^64 + x^4 + x^3 + x + 1, as described in the ISO 3309 standard)
Isoform CXCR4-A (identifier: P70658-2)
Also known as: LESTR-A

The sequence of this isoform differs from the canonical sequence as follows:
     6-7: Missing.

Length: 357
Mass (Da): 40,240
Checksum: 648C8835307AD5EC
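
The isoform CXCR4-A sequence can be derived from the canonical sequence by applying the "6-7: Missing" instruction. The sketch below does this with a small helper; `apply_missing` and `CANONICAL` are illustrative names, and the sequence string is hand-transcribed from this entry, so treat it as an assumption.

```python
# Canonical isoform (CXCR4-B, P70658-1), transcribed from this entry.
CANONICAL = (
    "MEPISVSIYTSDNYSEEVGSGDYDSNKEPCFRDENVHFNRIFLPTIYFII"
    "FLTGIVGNGLVILVMGYQKKLRSMTDKYRLHLSVADLLFVITLPFWAVDA"
    "MADWYFGKFLCKAVHIIYTVNLYSSVLILAFISLDRYLAIVHATNSQRPR"
    "KLLAEKAVYVGVWIPALLLTIPDFIFADVSQGDISQGDDRYICDRLYPDS"
    "LWMVVFQFQHIMVGLILPGIVILSCYCIIISKLSHSKGHQKRKALKTTVI"
    "LILAFFACWLPYYVGISIDSFILLGVIKQGCDFESIVHKWISITEALAFF"
    "HCCLNPILYAFLGAKFKSSAQHALNSMSRGSSLKILSKGKRGGHSSVSTE"
    "SESSSFHSS"
)


def apply_missing(sequence: str, start: int, end: int) -> str:
    """Delete the 1-based inclusive range start..end from a sequence."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("range must lie within the sequence")
    return sequence[: start - 1] + sequence[end:]


# Isoform CXCR4-A: residues 6-7 of the canonical sequence are missing.
ISOFORM_A = apply_missing(CANONICAL, 6, 7)

if __name__ == "__main__":
    print("removed residues:", CANONICAL[5:7])
    print("isoform length:", len(ISOFORM_A))
```

Note that positional annotations in this entry refer to the canonical isoform, so positions after residue 7 shift by two when mapped onto CXCR4-A.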

Computationally mapped potential isoform sequences

There are 2 potential isoforms mapped to this entry.

Entry       Entry name        Protein names                    Gene names  Length  Annotation score
A0A0R4J0N8  A0A0R4J0N8_MOUSE  C-X-C chemokine receptor type 4  Cxcr4       359     3 out of 5
E9Q2D4      E9Q2D4_MOUSE      C-X-C chemokine receptor type 4  Cxcr4       272     1 out of 5

Experimental Info

Feature key        Position(s)  Length  Evidence  Description
Sequence conflict  216          1       Curated   I → V in CAA67893 (PubMed:9295051).
Sequence conflict  216          1       Curated   I → V in BAA19187 (Ref. 6)

Alternative sequence

Feature key           Feature identifier  Position(s)  Length  Evidence       Description
Alternative sequence  VSP_001891          6 – 7        2       1 publication  Missing in isoform CXCR4-A.

Sequence databases

EMBL nucleotide sequence database, GenBank nucleotide sequence database, and DNA Data Bank of Japan (EMBL / GenBank / DDBJ):
U59760 mRNA Translation: AAB07725.1
U65580 Genomic DNA Translation: AAC52953.1
D87747 mRNA Translation: BAA13451.1
Z80111 mRNA Translation: CAB02201.1
Z80112 mRNA Translation: CAB02202.1
X99581 Genomic DNA Translation: CAA67893.1
X99582 mRNA Translation: CAA67894.1
AB000803 mRNA Translation: BAA19187.1
BC031665 mRNA Translation: AAH31665.1
BC098322 mRNA Translation: AAH98322.1

The Consensus CDS (CCDS) project

CCDS: CCDS15254.1 [P70658-1]

NCBI Reference Sequences

RefSeq: NP_034041.2, NM_009911.3

UniGene gene-oriented nucleotide sequence clusters

UniGene: Mm.1401

Genome annotation databases

Database of genes from NCBI RefSeq genomes

GeneID: 12767

KEGG: Kyoto Encyclopedia of Genes and Genomes

KEGG: mmu:12767

UCSC genome browser

UCSC:
uc007cls.1 mouse [P70658-2]
uc007clt.1 mouse [P70658-1]

Keywords - Coding sequence diversity

Alternative splicing

Cross-references

3D structure databases

ProteinModelPortal: P70658
SMR: P70658
ModBase: Search...
MobiDB: Search...

Protein-protein interaction databases

BioGrid: 198767, 2 interactors
IntAct: P70658, 1 interactor
STRING: 10090.ENSMUSP00000053489

Chemistry databases

BindingDB: P70658
ChEMBL: CHEMBL1250365
GuidetoPHARMACOLOGY: 71

Protein family/group databases

GPCRDB: Search...

PTM databases

iPTMnet: P70658
PhosphoSitePlus: P70658

Proteomic databases

MaxQB: P70658
PaxDb: P70658
PeptideAtlas: P70658
PRIDE: P70658

Protocols and materials databases

Structural Biology Knowledgebase: Search...

Organism-specific databases

CTD: 7852
MGI: MGI:109563 Cxcr4

Phylogenomic databases

eggNOG: KOG3656 Eukaryota
ENOG410XRW9 LUCA
HOGENOM: HOG000234122
HOVERGEN: HBG106917
InParanoid: P70658
KO: K04189
PhylomeDB: P70658
TreeFam: TF330966

Miscellaneous databases

ChiTaRS: Cxcr4 mouse
PRO: PR:P70658
SOURCE: Search...

Gene expression databases

CleanEx: MM_CXCR4

Family and domain databases

InterPro:
IPR022726 Chemokine_CXCR4_N_dom
IPR000355 Chemokine_rcpt
IPR001277 CXCR4/ACKR2
IPR000276 GPCR_Rhodpsn
IPR017452 GPCR_Rhodpsn_7TM
Pfam:
PF00001 7tm_1, 1 hit
PF12109 CXCR4_N, 1 hit
PRINTS:
PR00657 CCCHEMOKINER
PR00645 CXCCHMKINER4
PR00237 GPCRRHODOPSN
PROSITE:
PS00237 G_PROTEIN_RECEP_F1_1, 1 hit
PS50262 G_PROTEIN_RECEP_F1_2, 1 hit
ProtoNet: Search...

Entry information

Entry name: CXCR4_MOUSE
Accession: Primary (citable) accession number: P70658
Secondary accession number(s): O09059, O09062, P70233, P70346, Q4KMW1
Entry history: Integrated into UniProtKB/Swiss-Prot: November 1, 1997
Last sequence update: November 1, 1997
Last modified: December 5, 2018
This is version 170 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
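
The primary and secondary accession numbers listed above are the stable identifiers for this entry, while the entry name is a mnemonic. As a minimal sketch, the accessions can be checked against the accession-number pattern documented on the UniProt help pages. The pattern is reproduced here on the assumption that it matches that documentation, and the helper name `is_accession` is hypothetical.

```python
import re

# Assumption: this is the accession pattern given in the UniProt help pages.
UNIPROT_ACC = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(text: str) -> bool:
    """Return True if the whole string has the shape of a UniProtKB accession."""
    return UNIPROT_ACC.fullmatch(text) is not None


primary = "P70658"
secondary = ["O09059", "O09062", "P70233", "P70346", "Q4KMW1"]

# Every accession listed for this entry should fit the pattern.
all_valid = all(is_accession(acc) for acc in [primary, *secondary])

# The entry name is a mnemonic, not an accession, so it should not match.
name_matches = is_accession("CXCR4_MOUSE")
```

A check like this only tests the shape of an identifier. It does not confirm that the accession exists in UniProtKB.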

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families
  2. 7-transmembrane G-linked receptors
    List of 7-transmembrane G-linked receptor entries
  3. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health
