Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Calmegin

Gene

CLGN

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: -Experimental evidence at protein leveli

Functioni

Functions during spermatogenesis as a chaperone for a range of client proteins that are important for sperm adhesion onto the egg zona pellucida and for subsequent penetration of the zona pellucida. Required for normal sperm migration from the uterus into the oviduct. Required for normal male fertility. Binds calcium ions (By similarity).By similarity

GO - Molecular functioni

  • calcium ion binding Source: Ensembl
  • protein binding involved in protein folding Source: Ensembl
  • unfolded protein binding Source: ProtInc

GO - Biological processi

Keywordsi

Molecular functionChaperone
LigandCalcium

Names & Taxonomyi

Protein namesi
Recommended name:
Calmegin
Gene namesi
Name:CLGN
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 4

Organism-specific databases

EuPathDBiHostDB:ENSG00000153132.12
HGNCiHGNC:2060 CLGN
MIMi601858 gene
neXtProtiNX_O14967

Subcellular locationi

Extracellular region or secreted Cytosol Plasma membrane Cytoskeleton Lysosome Endosome Peroxisome ER Golgi apparatus Nucleus Mitochondrion Manual annotation Automatic computational assertionGraphics by Christian Stolte; Source: COMPARTMENTS

Topology

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Topological domaini20 – 471LumenalSequence analysisAdd BLAST452
Transmembranei472 – 492HelicalSequence analysisAdd BLAST21
Topological domaini493 – 610CytoplasmicSequence analysisAdd BLAST118

Keywords - Cellular componenti

Endoplasmic reticulum, Membrane

Pathology & Biotechi

Organism-specific databases

DisGeNETi1047
OpenTargetsiENSG00000153132
PharmGKBiPA26587

Polymorphism and mutation databases

BioMutaiCLGN

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 19Sequence analysisAdd BLAST19
ChainiPRO_000000421020 – 610CalmeginAdd BLAST591

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei128N6-acetyllysineBy similarity1
Disulfide bondi151 ↔ 185By similarity
Disulfide bondi351 ↔ 355By similarity
Modified residuei560PhosphoserineCombined sources1
Modified residuei576PhosphoserineCombined sources1
Modified residuei579PhosphoserineBy similarity1
Modified residuei581PhosphoserineBy similarity1
Modified residuei591PhosphoserineBy similarity1
Modified residuei594PhosphoserineBy similarity1
Modified residuei601PhosphoserineBy similarity1

Keywords - PTMi

Acetylation, Disulfide bond, Phosphoprotein

Proteomic databases

EPDiO14967
MaxQBiO14967
PaxDbiO14967
PeptideAtlasiO14967
PRIDEiO14967
ProteomicsDBi48342

PTM databases

iPTMnetiO14967
PhosphoSitePlusiO14967
SwissPalmiO14967

Expressioni

Tissue specificityi

Detected in testis (at protein level). Detected in testis.1 Publication

Gene expression databases

BgeeiENSG00000153132 Expressed in 148 organ(s), highest expression level in heart right ventricle
CleanExiHS_CLGN
ExpressionAtlasiO14967 baseline and differential
GenevisibleiO14967 HS

Organism-specific databases

HPAiCAB020709
HPA048761
HPA058627

Interactioni

Subunit structurei

Interacts with PPIB. Interacts with ADAM2 (By similarity). Interacts with PDILT.By similarity1 Publication

GO - Molecular functioni

Protein-protein interaction databases

BioGridi107477, 38 interactors
IntActiO14967, 10 interactors
STRINGi9606.ENSP00000326699

Structurei

3D structure databases

ProteinModelPortaliO14967
SMRiO14967
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati267 – 2801-1Add BLAST14
Repeati284 – 2971-2Add BLAST14
Repeati303 – 3161-3Add BLAST14
Repeati322 – 3351-4Add BLAST14
Repeati339 – 3522-1Add BLAST14
Repeati356 – 3692-2Add BLAST14
Repeati370 – 3832-3Add BLAST14
Repeati384 – 3972-4Add BLAST14

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni317 – 350Interaction with PPIBBy similarityAdd BLAST34

Sequence similaritiesi

Belongs to the calreticulin family.Curated

Keywords - Domaini

Repeat, Signal, Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOGiKOG0675 Eukaryota
ENOG410XP7T LUCA
GeneTreeiENSGT00430000030841
HOGENOMiHOG000192436
HOVERGENiHBG005407
InParanoidiO14967
KOiK09551
OMAiIFRHKHP
OrthoDBiEOG091G04SS
PhylomeDBiO14967
TreeFamiTF300618

Family and domain databases

Gene3Di2.10.250.10, 1 hit
InterProiView protein in InterPro
IPR001580 Calret/calnex
IPR018124 Calret/calnex_CS
IPR009033 Calreticulin/calnexin_P_dom_sf
IPR013320 ConA-like_dom_sf
PANTHERiPTHR11073 PTHR11073, 1 hit
PfamiView protein in Pfam
PF00262 Calreticulin, 1 hit
PRINTSiPR00626 CALRETICULIN
SUPFAMiSSF49899 SSF49899, 1 hit
SSF63887 SSF63887, 1 hit
PROSITEiView protein in PROSITE
PS00803 CALRETICULIN_1, 1 hit
PS00804 CALRETICULIN_2, 1 hit
PS00805 CALRETICULIN_REPEAT, 2 hits

Sequences (2+)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

This entry has 2 described isoforms and 1 potential isoform that is computationally mapped.Show allAlign All

Isoform 1 (identifier: O14967-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide
        10         20         30         40         50
MHFQAFWLCL GLLFISINAE FMDDDVETED FEENSEEIDV NESELSSEIK
60 70 80 90 100
YKTPQPIGEV YFAETFDSGR LAGWVLSKAK KDDMDEEISI YDGRWEIEEL
110 120 130 140 150
KENQVPGDRG LVLKSRAKHH AISAVLAKPF IFADKPLIVQ YEVNFQDGID
160 170 180 190 200
CGGAYIKLLA DTDDLILENF YDKTSYIIMF GPDKCGEDYK LHFIFRHKHP
210 220 230 240 250
KTGVFEEKHA KPPDVDLKKF FTDRKTHLYT LVMNPDDTFE VLVDQTVVNK
260 270 280 290 300
GSLLEDVVPP IKPPKEIEDP NDKKPEEWDE RAKIPDPSAV KPEDWDESEP
310 320 330 340 350
AQIEDSSVVK PAGWLDDEPK FIPDPNAEKP DDWNEDTDGE WEAPQILNPA
360 370 380 390 400
CRIGCGEWKP PMIDNPKYKG VWRPPLVDNP NYQGIWSPRK IPNPDYFEDD
410 420 430 440 450
HPFLLTSFSA LGLELWSMTS DIYFDNFIIC SEKEVADHWA ADGWRWKIMI
460 470 480 490 500
ANANKPGVLK QLMAAAEGHP WLWLIYLVTA GVPIALITSF CWPRKVKKKH
510 520 530 540 550
KDTEYKKTDI CIPQTKGVLE QEEKEEKAAL EKPMDLEEEK KQNDGEMLEK
560 570 580 590 600
EEESEPEEKS EEEIEIIEGQ EESNQSNKSG SEDEMKEADE STGSGDGPIK
610
SVRKRRVRKD
Length:610
Mass (Da):70,039
Last modified:January 1, 1998 - v1
Checksum:iF024FC4010D42D7E
GO
Isoform 2 (identifier: O14967-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     54-211: Missing.
     378-424: Missing.

Note: No experimental confirmation available.
Show »
Length:405
Mass (Da):46,453
Checksum:i6E259D1577FA8BC6
GO

Computationally mapped potential isoform sequencesi

There is 1 potential isoform mapped to this entry.BLASTAlignShow allAdd to basket
EntryEntry nameProtein names
Gene namesLengthAnnotation
D6RAZ4D6RAZ4_HUMAN
Calmegin
CLGN
179Annotation score:

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti232V → A in BAG63520 (PubMed:14702039).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural variantiVAR_024400160A → S. Corresponds to variant dbSNP:rs2567241Ensembl.1
Natural variantiVAR_033776290V → I. Corresponds to variant dbSNP:rs2175563Ensembl.1
Natural variantiVAR_048590352R → W. Corresponds to variant dbSNP:rs12513290Ensembl.1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_05551754 – 211Missing in isoform 2. 1 PublicationAdd BLAST158
Alternative sequenceiVSP_055518378 – 424Missing in isoform 2. 1 PublicationAdd BLAST47

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
D86322 mRNA Translation: BAA22590.1
AK093096 mRNA Translation: BAG52652.1
AK302149 mRNA Translation: BAG63520.1
CH471056 Genomic DNA Translation: EAX05099.1
CH471056 Genomic DNA Translation: EAX05100.1
CH471056 Genomic DNA Translation: EAX05101.1
BC028357 mRNA Translation: AAH28357.1
CCDSiCCDS3751.1 [O14967-1]
RefSeqiNP_001124147.1, NM_001130675.1 [O14967-1]
NP_004353.1, NM_004362.2 [O14967-1]
UniGeneiHs.86368

Genome annotation databases

EnsembliENST00000325617; ENSP00000326699; ENSG00000153132 [O14967-1]
ENST00000414773; ENSP00000392782; ENSG00000153132 [O14967-1]
GeneIDi1047
KEGGihsa:1047
UCSCiuc003iii.4 human [O14967-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Similar proteinsi

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
D86322 mRNA Translation: BAA22590.1
AK093096 mRNA Translation: BAG52652.1
AK302149 mRNA Translation: BAG63520.1
CH471056 Genomic DNA Translation: EAX05099.1
CH471056 Genomic DNA Translation: EAX05100.1
CH471056 Genomic DNA Translation: EAX05101.1
BC028357 mRNA Translation: AAH28357.1
CCDSiCCDS3751.1 [O14967-1]
RefSeqiNP_001124147.1, NM_001130675.1 [O14967-1]
NP_004353.1, NM_004362.2 [O14967-1]
UniGeneiHs.86368

3D structure databases

ProteinModelPortaliO14967
SMRiO14967
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi107477, 38 interactors
IntActiO14967, 10 interactors
STRINGi9606.ENSP00000326699

PTM databases

iPTMnetiO14967
PhosphoSitePlusiO14967
SwissPalmiO14967

Polymorphism and mutation databases

BioMutaiCLGN

Proteomic databases

EPDiO14967
MaxQBiO14967
PaxDbiO14967
PeptideAtlasiO14967
PRIDEiO14967
ProteomicsDBi48342

Protocols and materials databases

DNASUi1047
Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000325617; ENSP00000326699; ENSG00000153132 [O14967-1]
ENST00000414773; ENSP00000392782; ENSG00000153132 [O14967-1]
GeneIDi1047
KEGGihsa:1047
UCSCiuc003iii.4 human [O14967-1]

Organism-specific databases

CTDi1047
DisGeNETi1047
EuPathDBiHostDB:ENSG00000153132.12
GeneCardsiCLGN
HGNCiHGNC:2060 CLGN
HPAiCAB020709
HPA048761
HPA058627
MIMi601858 gene
neXtProtiNX_O14967
OpenTargetsiENSG00000153132
PharmGKBiPA26587
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG0675 Eukaryota
ENOG410XP7T LUCA
GeneTreeiENSGT00430000030841
HOGENOMiHOG000192436
HOVERGENiHBG005407
InParanoidiO14967
KOiK09551
OMAiIFRHKHP
OrthoDBiEOG091G04SS
PhylomeDBiO14967
TreeFamiTF300618

Miscellaneous databases

ChiTaRSiCLGN human
GeneWikiiCalmegin
GenomeRNAii1047
PROiPR:O14967
SOURCEiSearch...

Gene expression databases

BgeeiENSG00000153132 Expressed in 148 organ(s), highest expression level in heart right ventricle
CleanExiHS_CLGN
ExpressionAtlasiO14967 baseline and differential
GenevisibleiO14967 HS

Family and domain databases

Gene3Di2.10.250.10, 1 hit
InterProiView protein in InterPro
IPR001580 Calret/calnex
IPR018124 Calret/calnex_CS
IPR009033 Calreticulin/calnexin_P_dom_sf
IPR013320 ConA-like_dom_sf
PANTHERiPTHR11073 PTHR11073, 1 hit
PfamiView protein in Pfam
PF00262 Calreticulin, 1 hit
PRINTSiPR00626 CALRETICULIN
SUPFAMiSSF49899 SSF49899, 1 hit
SSF63887 SSF63887, 1 hit
PROSITEiView protein in PROSITE
PS00803 CALRETICULIN_1, 1 hit
PS00804 CALRETICULIN_2, 1 hit
PS00805 CALRETICULIN_REPEAT, 2 hits
ProtoNetiSearch...

Entry informationi

Entry nameiCLGN_HUMAN
AccessioniPrimary (citable) accession number: O14967
Secondary accession number(s): B3KS90, B4DXV8, D3DNY8
Entry historyiIntegrated into UniProtKB/Swiss-Prot: July 15, 1998
Last sequence update: January 1, 1998
Last modified: September 12, 2018
This is version 177 of the entry and version 1 of the sequence. See complete history.
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 4
    Human chromosome 4: entries, gene names and cross-references to MIM
  2. SIMILARITY comments
    Index of protein domains and families
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again