Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Calmegin

Gene

CLGN

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Functions during spermatogenesis as a chaperone for a range of client proteins that are important for sperm adhesion onto the egg zona pellucida and for subsequent penetration of the zona pellucida. Required for normal sperm migration from the uterus into the oviduct. Required for normal male fertility. Binds calcium ions (By similarity).By similarity

GO - Molecular functioni

  • calcium ion binding Source: Ensembl
  • unfolded protein binding Source: ProtInc

GO - Biological processi

  • binding of sperm to zona pellucida Source: Ensembl
  • protein complex assembly Source: Ensembl
  • protein folding Source: InterPro
  • single fertilization Source: ProtInc
Complete GO annotation...

Keywords - Molecular functioni

Chaperone

Keywords - Ligandi

Calcium

Enzyme and pathway databases

BioCyciZFISH:ENSG00000153132-MONOMER.

Names & Taxonomyi

Protein namesi
Recommended name:
Calmegin
Gene namesi
Name:CLGN
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 4

Organism-specific databases

HGNCiHGNC:2060. CLGN.

Subcellular locationi

Topology

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Topological domaini20 – 471LumenalSequence analysisAdd BLAST452
Transmembranei472 – 492HelicalSequence analysisAdd BLAST21
Topological domaini493 – 610CytoplasmicSequence analysisAdd BLAST118

GO - Cellular componenti

  • endoplasmic reticulum Source: ProtInc
  • endoplasmic reticulum membrane Source: UniProtKB-SubCell
  • integral component of membrane Source: UniProtKB-KW
Complete GO annotation...

Keywords - Cellular componenti

Endoplasmic reticulum, Membrane

Pathology & Biotechi

Organism-specific databases

DisGeNETi1047.
OpenTargetsiENSG00000153132.
PharmGKBiPA26587.

Polymorphism and mutation databases

BioMutaiCLGN.

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 19Sequence analysisAdd BLAST19
ChainiPRO_000000421020 – 610CalmeginAdd BLAST591

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei128N6-acetyllysineBy similarity1
Disulfide bondi151 ↔ 185By similarity
Disulfide bondi351 ↔ 355By similarity
Modified residuei560PhosphoserineCombined sources1
Modified residuei576PhosphoserineCombined sources1
Modified residuei579PhosphoserineBy similarity1
Modified residuei581PhosphoserineBy similarity1
Modified residuei591PhosphoserineBy similarity1
Modified residuei594PhosphoserineBy similarity1
Modified residuei601PhosphoserineBy similarity1

Keywords - PTMi

Acetylation, Disulfide bond, Phosphoprotein

Proteomic databases

EPDiO14967.
MaxQBiO14967.
PaxDbiO14967.
PeptideAtlasiO14967.
PRIDEiO14967.

PTM databases

iPTMnetiO14967.
PhosphoSitePlusiO14967.
SwissPalmiO14967.

Expressioni

Tissue specificityi

Detected in testis (at protein level). Detected in testis.1 Publication

Gene expression databases

BgeeiENSG00000153132.
CleanExiHS_CLGN.
ExpressionAtlasiO14967. baseline and differential.
GenevisibleiO14967. HS.

Organism-specific databases

HPAiCAB020709.
HPA048761.
HPA058627.

Interactioni

Subunit structurei

Interacts with PPIB. Interacts with ADAM2 (By similarity). Interacts with PDILT.By similarity1 Publication

GO - Molecular functioni

  • unfolded protein binding Source: ProtInc

Protein-protein interaction databases

BioGridi107477. 37 interactors.
IntActiO14967. 7 interactors.
STRINGi9606.ENSP00000326699.

Structurei

3D structure databases

ProteinModelPortaliO14967.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati267 – 2801-1Add BLAST14
Repeati284 – 2971-2Add BLAST14
Repeati303 – 3161-3Add BLAST14
Repeati322 – 3351-4Add BLAST14
Repeati339 – 3522-1Add BLAST14
Repeati356 – 3692-2Add BLAST14
Repeati370 – 3832-3Add BLAST14
Repeati384 – 3972-4Add BLAST14

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni317 – 350Interaction with PPIBBy similarityAdd BLAST34

Sequence similaritiesi

Belongs to the calreticulin family.Curated

Keywords - Domaini

Repeat, Signal, Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOGiKOG0675. Eukaryota.
ENOG410XP7T. LUCA.
GeneTreeiENSGT00430000030841.
HOGENOMiHOG000192436.
HOVERGENiHBG005407.
InParanoidiO14967.
KOiK09551.
OMAiPACRIGC.
OrthoDBiEOG091G04SS.
PhylomeDBiO14967.
TreeFamiTF300618.

Family and domain databases

Gene3Di2.10.250.10. 1 hit.
2.60.120.200. 2 hits.
InterProiIPR001580. Calret/calnex.
IPR018124. Calret/calnex_CS.
IPR009033. Calreticulin/calnexin_P_dom.
IPR013320. ConA-like_dom.
[Graphical view]
PANTHERiPTHR11073. PTHR11073. 2 hits.
PfamiPF00262. Calreticulin. 1 hit.
[Graphical view]
PRINTSiPR00626. CALRETICULIN.
SUPFAMiSSF49899. SSF49899. 1 hit.
SSF63887. SSF63887. 1 hit.
PROSITEiPS00803. CALRETICULIN_1. 1 hit.
PS00804. CALRETICULIN_2. 1 hit.
PS00805. CALRETICULIN_REPEAT. 2 hits.
[Graphical view]

Sequences (2)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: O14967-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MHFQAFWLCL GLLFISINAE FMDDDVETED FEENSEEIDV NESELSSEIK
60 70 80 90 100
YKTPQPIGEV YFAETFDSGR LAGWVLSKAK KDDMDEEISI YDGRWEIEEL
110 120 130 140 150
KENQVPGDRG LVLKSRAKHH AISAVLAKPF IFADKPLIVQ YEVNFQDGID
160 170 180 190 200
CGGAYIKLLA DTDDLILENF YDKTSYIIMF GPDKCGEDYK LHFIFRHKHP
210 220 230 240 250
KTGVFEEKHA KPPDVDLKKF FTDRKTHLYT LVMNPDDTFE VLVDQTVVNK
260 270 280 290 300
GSLLEDVVPP IKPPKEIEDP NDKKPEEWDE RAKIPDPSAV KPEDWDESEP
310 320 330 340 350
AQIEDSSVVK PAGWLDDEPK FIPDPNAEKP DDWNEDTDGE WEAPQILNPA
360 370 380 390 400
CRIGCGEWKP PMIDNPKYKG VWRPPLVDNP NYQGIWSPRK IPNPDYFEDD
410 420 430 440 450
HPFLLTSFSA LGLELWSMTS DIYFDNFIIC SEKEVADHWA ADGWRWKIMI
460 470 480 490 500
ANANKPGVLK QLMAAAEGHP WLWLIYLVTA GVPIALITSF CWPRKVKKKH
510 520 530 540 550
KDTEYKKTDI CIPQTKGVLE QEEKEEKAAL EKPMDLEEEK KQNDGEMLEK
560 570 580 590 600
EEESEPEEKS EEEIEIIEGQ EESNQSNKSG SEDEMKEADE STGSGDGPIK
610
SVRKRRVRKD
Length:610
Mass (Da):70,039
Last modified:January 1, 1998 - v1
Checksum:iF024FC4010D42D7E
GO
Isoform 2 (identifier: O14967-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     54-211: Missing.
     378-424: Missing.

Note: No experimental confirmation available.
Show »
Length:405
Mass (Da):46,453
Checksum:i6E259D1577FA8BC6
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti232V → A in BAG63520 (PubMed:14702039).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural variantiVAR_024400160A → S.Corresponds to variant rs2567241dbSNPEnsembl.1
Natural variantiVAR_033776290V → I.Corresponds to variant rs2175563dbSNPEnsembl.1
Natural variantiVAR_048590352R → W.Corresponds to variant rs12513290dbSNPEnsembl.1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_05551754 – 211Missing in isoform 2. 1 PublicationAdd BLAST158
Alternative sequenceiVSP_055518378 – 424Missing in isoform 2. 1 PublicationAdd BLAST47

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
D86322 mRNA. Translation: BAA22590.1.
AK093096 mRNA. Translation: BAG52652.1.
AK302149 mRNA. Translation: BAG63520.1.
CH471056 Genomic DNA. Translation: EAX05099.1.
CH471056 Genomic DNA. Translation: EAX05100.1.
CH471056 Genomic DNA. Translation: EAX05101.1.
BC028357 mRNA. Translation: AAH28357.1.
CCDSiCCDS3751.1. [O14967-1]
RefSeqiNP_001124147.1. NM_001130675.1. [O14967-1]
NP_004353.1. NM_004362.2. [O14967-1]
UniGeneiHs.86368.

Genome annotation databases

EnsembliENST00000325617; ENSP00000326699; ENSG00000153132. [O14967-1]
ENST00000414773; ENSP00000392782; ENSG00000153132. [O14967-1]
GeneIDi1047.
KEGGihsa:1047.
UCSCiuc003iii.4. human. [O14967-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
D86322 mRNA. Translation: BAA22590.1.
AK093096 mRNA. Translation: BAG52652.1.
AK302149 mRNA. Translation: BAG63520.1.
CH471056 Genomic DNA. Translation: EAX05099.1.
CH471056 Genomic DNA. Translation: EAX05100.1.
CH471056 Genomic DNA. Translation: EAX05101.1.
BC028357 mRNA. Translation: AAH28357.1.
CCDSiCCDS3751.1. [O14967-1]
RefSeqiNP_001124147.1. NM_001130675.1. [O14967-1]
NP_004353.1. NM_004362.2. [O14967-1]
UniGeneiHs.86368.

3D structure databases

ProteinModelPortaliO14967.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi107477. 37 interactors.
IntActiO14967. 7 interactors.
STRINGi9606.ENSP00000326699.

PTM databases

iPTMnetiO14967.
PhosphoSitePlusiO14967.
SwissPalmiO14967.

Polymorphism and mutation databases

BioMutaiCLGN.

Proteomic databases

EPDiO14967.
MaxQBiO14967.
PaxDbiO14967.
PeptideAtlasiO14967.
PRIDEiO14967.

Protocols and materials databases

DNASUi1047.
Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000325617; ENSP00000326699; ENSG00000153132. [O14967-1]
ENST00000414773; ENSP00000392782; ENSG00000153132. [O14967-1]
GeneIDi1047.
KEGGihsa:1047.
UCSCiuc003iii.4. human. [O14967-1]

Organism-specific databases

CTDi1047.
DisGeNETi1047.
GeneCardsiCLGN.
HGNCiHGNC:2060. CLGN.
HPAiCAB020709.
HPA048761.
HPA058627.
MIMi601858. gene.
neXtProtiNX_O14967.
OpenTargetsiENSG00000153132.
PharmGKBiPA26587.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG0675. Eukaryota.
ENOG410XP7T. LUCA.
GeneTreeiENSGT00430000030841.
HOGENOMiHOG000192436.
HOVERGENiHBG005407.
InParanoidiO14967.
KOiK09551.
OMAiPACRIGC.
OrthoDBiEOG091G04SS.
PhylomeDBiO14967.
TreeFamiTF300618.

Enzyme and pathway databases

BioCyciZFISH:ENSG00000153132-MONOMER.

Miscellaneous databases

ChiTaRSiCLGN. human.
GeneWikiiCalmegin.
GenomeRNAii1047.
PROiO14967.
SOURCEiSearch...

Gene expression databases

BgeeiENSG00000153132.
CleanExiHS_CLGN.
ExpressionAtlasiO14967. baseline and differential.
GenevisibleiO14967. HS.

Family and domain databases

Gene3Di2.10.250.10. 1 hit.
2.60.120.200. 2 hits.
InterProiIPR001580. Calret/calnex.
IPR018124. Calret/calnex_CS.
IPR009033. Calreticulin/calnexin_P_dom.
IPR013320. ConA-like_dom.
[Graphical view]
PANTHERiPTHR11073. PTHR11073. 2 hits.
PfamiPF00262. Calreticulin. 1 hit.
[Graphical view]
PRINTSiPR00626. CALRETICULIN.
SUPFAMiSSF49899. SSF49899. 1 hit.
SSF63887. SSF63887. 1 hit.
PROSITEiPS00803. CALRETICULIN_1. 1 hit.
PS00804. CALRETICULIN_2. 1 hit.
PS00805. CALRETICULIN_REPEAT. 2 hits.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiCLGN_HUMAN
AccessioniPrimary (citable) accession number: O14967
Secondary accession number(s): B3KS90, B4DXV8, D3DNY8
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 15, 1998
Last sequence update: January 1, 1998
Last modified: November 30, 2016
This is version 161 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 4
    Human chromosome 4: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.