Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Procollagen galactosyltransferase 1

Gene

COLGALT1

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Has a beta-galactosyltransferase activity; transfers beta-galactose to hydroxylysine residues of collagen.1 Publication

Catalytic activityi

UDP-alpha-D-galactose + 5-hydroxy-L-lysine-[procollagen] = UDP + 5-(D-galactosyloxy)-L-lysine-[procollagen].1 Publication

Kineticsi

  1. KM=18.7 µM for UDP-galactose1 Publication

    GO - Molecular functioni

    GO - Biological processi

    Complete GO annotation...

    Keywords - Molecular functioni

    Glycosyltransferase, Transferase

    Enzyme and pathway databases

    BRENDAi2.4.1.50. 2681.
    ReactomeiREACT_121139. Collagen biosynthesis and modifying enzymes.
    SABIO-RKQ8NBJ5.

    Protein family/group databases

    CAZyiGT25. Glycosyltransferase Family 25.

    Names & Taxonomyi

    Protein namesi
    Recommended name:
    Procollagen galactosyltransferase 1 (EC:2.4.1.50)
    Alternative name(s):
    Collagen beta(1-O)galactosyltransferase 1
    Glycosyltransferase 25 family member 1
    Hydroxylysine galactosyltransferase 1
    Gene namesi
    Name:COLGALT1
    Synonyms:GLT25D1
    ORF Names:PSEC0241
    OrganismiHomo sapiens (Human)
    Taxonomic identifieri9606 [NCBI]
    Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
    ProteomesiUP000005640 Componenti: Chromosome 19

    Organism-specific databases

    HGNCiHGNC:26182. COLGALT1.

    Subcellular locationi

    • Endoplasmic reticulum lumen PROSITE-ProRule annotation1 Publication

    • Note: Colocalized with PLOD3 and mannose binding lectin.

    GO - Cellular componenti

    • endoplasmic reticulum lumen Source: UniProtKB
    • membrane Source: UniProtKB
    Complete GO annotation...

    Keywords - Cellular componenti

    Endoplasmic reticulum

    Pathology & Biotechi

    Organism-specific databases

    PharmGKBiPA134991138.

    Polymorphism and mutation databases

    BioMutaiCOLGALT1.
    DMDMi74715064.

    PTM / Processingi

    Molecule processing

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Signal peptidei1 – 2929Sequence AnalysisAdd
    BLAST
    Chaini30 – 622593Procollagen galactosyltransferase 1PRO_0000309536Add
    BLAST

    Amino acid modifications

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Glycosylationi96 – 961N-linked (GlcNAc...)1 Publication
    Glycosylationi184 – 1841N-linked (GlcNAc...)Sequence Analysis
    Glycosylationi381 – 3811N-linked (GlcNAc...)1 Publication

    Post-translational modificationi

    N-glycosylated.2 Publications

    Keywords - PTMi

    Glycoprotein

    Proteomic databases

    MaxQBiQ8NBJ5.
    PaxDbiQ8NBJ5.
    PeptideAtlasiQ8NBJ5.
    PRIDEiQ8NBJ5.

    PTM databases

    PhosphoSiteiQ8NBJ5.

    Expressioni

    Tissue specificityi

    Ubiquitous with higher levels in placenta, heart, lung and spleen.1 Publication

    Gene expression databases

    BgeeiQ8NBJ5.
    CleanExiHS_GLT25D1.
    ExpressionAtlasiQ8NBJ5. baseline and differential.
    GenevestigatoriQ8NBJ5.

    Organism-specific databases

    HPAiHPA047821.

    Interactioni

    Protein-protein interaction databases

    BioGridi122826. 12 interactions.
    IntActiQ8NBJ5. 4 interactions.
    STRINGi9606.ENSP00000252599.

    Structurei

    3D structure databases

    ProteinModelPortaliQ8NBJ5.
    ModBaseiSearch...
    MobiDBiSearch...

    Family & Domainsi

    Motif

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Motifi619 – 6224Prevents secretion from ER

    Sequence similaritiesi

    Belongs to the glycosyltransferase 25 family.Curated

    Keywords - Domaini

    Signal

    Phylogenomic databases

    eggNOGiNOG293154.
    GeneTreeiENSGT00550000074427.
    HOGENOMiHOG000007198.
    HOVERGENiHBG058097.
    InParanoidiQ8NBJ5.
    KOiK11703.
    OMAiMWADYIL.
    OrthoDBiEOG7060RC.
    PhylomeDBiQ8NBJ5.
    TreeFamiTF313826.

    Family and domain databases

    Gene3Di3.90.550.10. 2 hits.
    InterProiIPR002654. Glyco_trans_25.
    IPR029044. Nucleotide-diphossugar_trans.
    [Graphical view]
    PfamiPF01755. Glyco_transf_25. 1 hit.
    [Graphical view]
    SUPFAMiSSF53448. SSF53448. 1 hit.
    PROSITEiPS00014. ER_TARGET. 1 hit.
    [Graphical view]

    Sequencei

    Sequence statusi: Complete.

    Sequence processingi: The displayed sequence is further processed into a mature form.

    Q8NBJ5-1 [UniParc]FASTAAdd to basket

    « Hide

            10         20         30         40         50
    MAAAPRAGRR RGQPLLALLL LLLAPLPPGA PPGADAYFPE ERWSPESPLQ
    60 70 80 90 100
    APRVLIALLA RNAAHALPTT LGALERLRHP RERTALWVAT DHNMDNTSTV
    110 120 130 140 150
    LREWLVAVKS LYHSVEWRPA EEPRSYPDEE GPKHWSDSRY EHVMKLRQAA
    160 170 180 190 200
    LKSARDMWAD YILFVDADNL ILNPDTLSLL IAENKTVVAP MLDSRAAYSN
    210 220 230 240 250
    FWCGMTSQGY YKRTPAYIPI RKRDRRGCFA VPMVHSTFLI DLRKAASRNL
    260 270 280 290 300
    AFYPPHPDYT WSFDDIIVFA FSCKQAEVQM YVCNKEEYGF LPVPLRAHST
    310 320 330 340 350
    LQDEAESFMH VQLEVMVKHP PAEPSRFISA PTKTPDKMGF DEVFMINLRR
    360 370 380 390 400
    RQDRRERMLR ALQAQEIECR LVEAVDGKAM NTSQVEALGI QMLPGYRDPY
    410 420 430 440 450
    HGRPLTKGEL GCFLSHYNIW KEVVDRGLQK SLVFEDDLRF EIFFKRRLMN
    460 470 480 490 500
    LMRDVEREGL DWDLIYVGRK RMQVEHPEKA VPRVRNLVEA DYSYWTLAYV
    510 520 530 540 550
    ISLQGARKLL AAEPLSKMLP VDEFLPVMFD KHPVSEYKAH FSLRNLHAFS
    560 570 580 590 600
    VEPLLIYPTH YTGDDGYVSD TETSVVWNNE HVKTDWDRAK SQKMREQQAL
    610 620
    SREAKNSDVL QSPLDSAARD EL
    Length:622
    Mass (Da):71,636
    Last modified:October 1, 2002 - v1
    Checksum:iC430974CB1CF5280
    GO

    Experimental Info

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Sequence conflicti100 – 1001V → E in BAC11307 (PubMed:16303743).Curated
    Sequence conflicti220 – 2201I → T in BAC11307 (PubMed:16303743).Curated

    Sequence databases

    Select the link destinations:
    EMBLi
    GenBanki
    DDBJi
    Links Updated
    AK074941 mRNA. Translation: BAC11307.1.
    AK075541 mRNA. Translation: BAC11684.1.
    AC010618 Genomic DNA. No translation available.
    CH471106 Genomic DNA. Translation: EAW84622.1.
    BC108308 mRNA. Translation: AAI08309.1.
    CCDSiCCDS12363.1.
    RefSeqiNP_078932.2. NM_024656.2.
    UniGeneiHs.418795.

    Genome annotation databases

    EnsembliENST00000252599; ENSP00000252599; ENSG00000130309.
    GeneIDi79709.
    KEGGihsa:79709.
    UCSCiuc002nhc.1. human.

    Cross-referencesi

    Sequence databases

    Select the link destinations:
    EMBLi
    GenBanki
    DDBJi
    Links Updated
    AK074941 mRNA. Translation: BAC11307.1.
    AK075541 mRNA. Translation: BAC11684.1.
    AC010618 Genomic DNA. No translation available.
    CH471106 Genomic DNA. Translation: EAW84622.1.
    BC108308 mRNA. Translation: AAI08309.1.
    CCDSiCCDS12363.1.
    RefSeqiNP_078932.2. NM_024656.2.
    UniGeneiHs.418795.

    3D structure databases

    ProteinModelPortaliQ8NBJ5.
    ModBaseiSearch...
    MobiDBiSearch...

    Protein-protein interaction databases

    BioGridi122826. 12 interactions.
    IntActiQ8NBJ5. 4 interactions.
    STRINGi9606.ENSP00000252599.

    Protein family/group databases

    CAZyiGT25. Glycosyltransferase Family 25.

    PTM databases

    PhosphoSiteiQ8NBJ5.

    Polymorphism and mutation databases

    BioMutaiCOLGALT1.
    DMDMi74715064.

    Proteomic databases

    MaxQBiQ8NBJ5.
    PaxDbiQ8NBJ5.
    PeptideAtlasiQ8NBJ5.
    PRIDEiQ8NBJ5.

    Protocols and materials databases

    Structural Biology KnowledgebaseSearch...

    Genome annotation databases

    EnsembliENST00000252599; ENSP00000252599; ENSG00000130309.
    GeneIDi79709.
    KEGGihsa:79709.
    UCSCiuc002nhc.1. human.

    Organism-specific databases

    CTDi79709.
    GeneCardsiGC19P017916.
    HGNCiHGNC:26182. COLGALT1.
    HPAiHPA047821.
    neXtProtiNX_Q8NBJ5.
    PharmGKBiPA134991138.
    GenAtlasiSearch...

    Phylogenomic databases

    eggNOGiNOG293154.
    GeneTreeiENSGT00550000074427.
    HOGENOMiHOG000007198.
    HOVERGENiHBG058097.
    InParanoidiQ8NBJ5.
    KOiK11703.
    OMAiMWADYIL.
    OrthoDBiEOG7060RC.
    PhylomeDBiQ8NBJ5.
    TreeFamiTF313826.

    Enzyme and pathway databases

    BRENDAi2.4.1.50. 2681.
    ReactomeiREACT_121139. Collagen biosynthesis and modifying enzymes.
    SABIO-RKQ8NBJ5.

    Miscellaneous databases

    GenomeRNAii79709.
    NextBioi69027.
    PROiQ8NBJ5.

    Gene expression databases

    BgeeiQ8NBJ5.
    CleanExiHS_GLT25D1.
    ExpressionAtlasiQ8NBJ5. baseline and differential.
    GenevestigatoriQ8NBJ5.

    Family and domain databases

    Gene3Di3.90.550.10. 2 hits.
    InterProiIPR002654. Glyco_trans_25.
    IPR029044. Nucleotide-diphossugar_trans.
    [Graphical view]
    PfamiPF01755. Glyco_transf_25. 1 hit.
    [Graphical view]
    SUPFAMiSSF53448. SSF53448. 1 hit.
    PROSITEiPS00014. ER_TARGET. 1 hit.
    [Graphical view]
    ProtoNetiSearch...

    Publicationsi

    « Hide 'large scale' publications
    1. "Signal sequence and keyword trap in silico for selection of full-length human cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries."
      Otsuki T., Ota T., Nishikawa T., Hayashi K., Suzuki Y., Yamamoto J., Wakamatsu A., Kimura K., Sakamoto K., Hatano N., Kawai Y., Ishii S., Saito K., Kojima S., Sugiyama T., Ono T., Okano K., Yoshikawa Y.
      , Aotsuka S., Sasaki N., Hattori A., Okumura K., Nagai K., Sugano S., Isogai T.
      DNA Res. 12:117-126(2005) [PubMed] [Europe PMC] [Abstract]
      Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
      Tissue: Teratocarcinoma.
    2. "The DNA sequence and biology of human chromosome 19."
      Grimwood J., Gordon L.A., Olsen A.S., Terry A., Schmutz J., Lamerdin J.E., Hellsten U., Goodstein D., Couronne O., Tran-Gyamfi M., Aerts A., Altherr M., Ashworth L., Bajorek E., Black S., Branscomb E., Caenepeel S., Carrano A.V.
      , Caoile C., Chan Y.M., Christensen M., Cleland C.A., Copeland A., Dalin E., Dehal P., Denys M., Detter J.C., Escobar J., Flowers D., Fotopulos D., Garcia C., Georgescu A.M., Glavina T., Gomez M., Gonzales E., Groza M., Hammon N., Hawkins T., Haydu L., Ho I., Huang W., Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Larionov V., Leem S.-H., Lopez F., Lou Y., Lowry S., Malfatti S., Martinez D., McCready P.M., Medina C., Morgan J., Nelson K., Nolan M., Ovcharenko I., Pitluck S., Pollard M., Popkie A.P., Predki P., Quan G., Ramirez L., Rash S., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A., She X., Smith D., Slezak T., Solovyev V., Thayer N., Tice H., Tsai M., Ustaszewska A., Vo N., Wagner M., Wheeler J., Wu K., Xie G., Yang J., Dubchak I., Furey T.S., DeJong P., Dickson M., Gordon D., Eichler E.E., Pennacchio L.A., Richardson P., Stubbs L., Rokhsar D.S., Myers R.M., Rubin E.M., Lucas S.M.
      Nature 428:529-535(2004) [PubMed] [Europe PMC] [Abstract]
      Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    3. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    4. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
      The MGC Project Team
      Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
      Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
      Tissue: Cervix.
    5. "Glycoproteomics analysis of human liver tissue by combination of multiple enzyme digestion and hydrazide chemistry."
      Chen R., Jiang X., Sun D., Han G., Wang F., Ye M., Wang L., Zou H.
      J. Proteome Res. 8:651-661(2009) [PubMed] [Europe PMC] [Abstract]
      Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-96 AND ASN-381.
      Tissue: Liver.
    6. "Core glycosylation of collagen is initiated by two beta(1-O)galactosyltransferases."
      Schegg B., Huelsmeier A.J., Rutschmann C., Maag C., Hennet T.
      Mol. Cell. Biol. 29:943-952(2009) [PubMed] [Europe PMC] [Abstract]
      Cited for: FUNCTION, BIOPHYSICOCHEMICAL PROPERTIES, CATALYTIC ACTIVITY, TISSUE SPECIFICITY.
    7. "The human collagen beta(1-O)galactosyltransferase, GLT25D1, is a soluble endoplasmic reticulum localized protein."
      Liefhebber J.M., Punt S., Spaan W.J., van Leeuwen H.C.
      BMC Cell Biol. 11:33-33(2010) [PubMed] [Europe PMC] [Abstract]
      Cited for: SUBCELLULAR LOCATION, GLYCOSYLATION, IDENTIFICATION BY MASS SPECTROMETRY.
    8. Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].

    Entry informationi

    Entry nameiGT251_HUMAN
    AccessioniPrimary (citable) accession number: Q8NBJ5
    Secondary accession number(s): Q8NC64
    Entry historyi
    Integrated into UniProtKB/Swiss-Prot: December 4, 2007
    Last sequence update: October 1, 2002
    Last modified: April 29, 2015
    This is version 102 of the entry and version 1 of the sequence. [Complete history]
    Entry statusiReviewed (UniProtKB/Swiss-Prot)
    Annotation programChordata Protein Annotation Program
    DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

    Miscellaneousi

    Caution

    Has no glucosyltransferase activity.Curated

    Keywords - Technical termi

    Complete proteome, Reference proteome

    Documents

    1. Human chromosome 19
      Human chromosome 19: entries, gene names and cross-references to MIM
    2. SIMILARITY comments
      Index of protein domains and families

    External Data

    Dasty 3

    Similar proteinsi

    Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
    100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into Uniref entry.
    90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
    50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.