UniProt

Q95ZJ1 - GALT5_CAEEL


Protein: Polypeptide N-acetylgalactosaminyltransferase 5
Gene: gly-5, Y39E4B.12
Organism: Caenorhabditis elegans
Status: Reviewed - Annotation score: 5 out of 5 - Experimental evidence at transcript level

Function

Catalyzes the initial reaction in O-linked oligosaccharide biosynthesis, the transfer of an N-acetyl-D-galactosamine residue to a serine or threonine residue on the protein receptor.

Catalytic activity

UDP-N-acetyl-alpha-D-galactosamine + polypeptide = UDP + N-acetyl-alpha-D-galactosaminyl-polypeptide.

Cofactor

Manganese (by similarity).

Pathway

Sites

Feature key     Position(s)   Length   Description
Binding site    215 – 215     1        Substrate (by similarity)
Binding site    245 – 245     1        Substrate (by similarity)
Metal binding   268 – 268     1        Manganese (by similarity)
Binding site    269 – 269     1        Substrate (by similarity)
Metal binding   270 – 270     1        Manganese (by similarity)
Binding site    376 – 376     1        Substrate (by similarity)
Metal binding   404 – 404     1        Manganese (by similarity)
Binding site    407 – 407     1        Substrate (by similarity)
Binding site    412 – 412     1        Substrate (by similarity)

GO - Molecular function

  1. metal ion binding Source: UniProtKB-KW
  2. polypeptide N-acetylgalactosaminyltransferase activity Source: WormBase

GO - Biological process

  1. protein O-linked glycosylation via threonine Source: WormBase

Keywords - Molecular function

Glycosyltransferase, Transferase

Keywords - Ligand

Lectin, Manganese, Metal-binding

Enzyme and pathway databases

UniPathway: UPA00378.

Protein family/group databases

CAZy: CBM13. Carbohydrate-Binding Module Family 13.
GT27. Glycosyltransferase Family 27.

Names & Taxonomy

Protein names
Recommended name: Polypeptide N-acetylgalactosaminyltransferase 5 (EC:2.4.1.41)
Short name: pp-GaNTase 5
Alternative name(s):
Protein-UDP acetylgalactosaminyltransferase 5
UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 5
Gene names
Name: gly-5
ORF Names: Y39E4B.12
Organism: Caenorhabditis elegans
Taxonomic identifier: 6239 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Nematoda > Chromadorea > Rhabditida > Rhabditoidea > Rhabditidae > Peloderinae > Caenorhabditis
Proteomes: UP000001940: Chromosome III

Organism-specific databases

WormBase: Y39E4B.12a; CE24240; WBGene00001630; gly-5.
Y39E4B.12b; CE28119; WBGene00001630; gly-5.
Y39E4B.12c; CE28120; WBGene00001630; gly-5.

Subcellular location

Topology

Feature key          Position(s)   Length   Description
Topological domain   1 – 11        11       Cytoplasmic (reviewed prediction)
Transmembrane        12 – 31       20       Helical; Signal-anchor for type II membrane protein (reviewed prediction)
Topological domain   32 – 626      595      Lumenal (reviewed prediction)
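
The Length column in the topology table follows from the positions: with 1-based, inclusive coordinates, length = end - start + 1. The sketch below is illustrative only (the tuple layout and the helper names `feature_length` and `tiles_chain` are assumptions, not part of the entry). It checks the stated lengths and confirms that the three topology features cover the 626-residue chain with no gaps or overlaps.

```python
# Topology features from the table above: (key, start, end, stated_length).
# Coordinates are 1-based and inclusive, as in the entry.
TOPOLOGY = [
    ("Topological domain (cytoplasmic)", 1, 11, 11),
    ("Transmembrane", 12, 31, 20),
    ("Topological domain (lumenal)", 32, 626, 595),
]
CHAIN_LENGTH = 626  # length of the canonical chain


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def tiles_chain(features, chain_length: int) -> bool:
    """True if the features cover 1..chain_length with no gap or overlap."""
    expected_start = 1
    for _key, start, end, _stated in sorted(features, key=lambda f: f[1]):
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == chain_length + 1


for key, start, end, stated in TOPOLOGY:
    assert feature_length(start, end) == stated, key

print(tiles_chain(TOPOLOGY, CHAIN_LENGTH))  # True
```

The same arithmetic applies to the other feature tables in this entry, for example the catalytic subdomains and the ricin B-type lectin domain.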

GO - Cellular component

  1. Golgi membrane Source: UniProtKB-SubCell
  2. integral component of membrane Source: UniProtKB-KW

Keywords - Cellular component

Golgi apparatus, Membrane

PTM / Processing

Molecule processing

Feature key   Position(s)   Length   Description                                       Feature identifier
Chain         1 – 626       626      Polypeptide N-acetylgalactosaminyltransferase 5   PRO_0000059148

Amino acid modifications

Feature key      Position(s)   Length   Description
Glycosylation    32 – 32       1        N-linked (GlcNAc...) (reviewed prediction)
Disulfide bond   165 ↔ 399              (by similarity)
Glycosylation    338 – 338     1        N-linked (GlcNAc...) (reviewed prediction)
Disulfide bond   390 ↔ 466              (by similarity)
Disulfide bond   502 ↔ 521              (by similarity)
Disulfide bond   544 ↔ 557              (by similarity)
Disulfide bond   583 ↔ 598              (by similarity)

Keywords - PTM

Disulfide bond, Glycoprotein

Proteomic databases

PaxDb: Q95ZJ1.
PRIDE: Q95ZJ1.

Interaction

Protein-protein interaction databases

BioGrid: 41908. 4 interactions.
DIP: DIP-26207N.
IntAct: Q95ZJ1. 3 interactions.
MINT: MINT-1041440.
STRING: 6239.Y39E4B.12a.1.

Structure

3D structure databases

ProteinModelPortal: Q95ZJ1.
SMR: Q95ZJ1. Positions 154-607.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Length   Description
Domain        488 – 610     123      Ricin B-type lectin

Region

Feature key   Position(s)   Length   Description
Region        174 – 284     111      Catalytic subdomain A
Region        345 – 407     63       Catalytic subdomain B

Domain

There are two conserved domains in the glycosyltransferase region: the N-terminal domain (domain A, also called the GT1 motif), which is probably involved in manganese coordination and substrate binding, and the C-terminal domain (domain B, also called the Gal/GalNAc-T motif), which is probably involved in the catalytic reaction and UDP-Gal binding (by similarity).
The ricin B-type lectin domain binds to GalNAc and contributes to the glycopeptide specificity (by similarity).

Sequence similarities

Keywords - Domain

Signal-anchor, Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOG: NOG239675.
GeneTree: ENSGT00750000117385.
HOGENOM: HOG000038227.
InParanoid: Q95ZJ1.
KO: K00710.
OMA: YNENLPR.
PhylomeDB: Q95ZJ1.

Family and domain databases

Gene3D: 3.90.550.10. 1 hit.
InterPro: IPR001173. Glyco_trans_2-like.
IPR029044. Nucleotide-diphossugar_trans.
IPR000772. Ricin_B_lectin.
Pfam: PF00535. Glycos_transf_2. 1 hit.
PF00652. Ricin_B_lectin. 1 hit.
SMART: SM00458. RICIN. 1 hit.
SUPFAM: SSF50370. SSF50370. 1 hit.
SSF53448. SSF53448. 1 hit.
PROSITE: PS50231. RICIN_B_LECTIN. 1 hit.

Sequences (3)

Sequence status: Complete.

This entry describes 3 isoforms produced by alternative splicing.

Isoform a (identifier: Q95ZJ1-1)

Also known as: GLY5b, GLY-5b

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

MIIFKKKAIL KVLLLVPVFW ICSLIFFAAT SNDSSQIGSN NDLANKIAEA    50
NFHPKAAKQD VIQGFGPPIE PEPVVENNKV EEEEQPGGNL AKPKFMVDPN 100
DPIYKKGDAA QAGELGKAVV VDKTKLSTEE KAKYDKGMLN NAFNQYASDM 150
ISVHRTLPTN IDAECKTEKY NENLPRTSVI ICFHNEAWSV LLRTVHSVLE 200
RTPDHLLEEV VLVDDFSDMD HTKRPLEEYM SQFGGKVKIL RMEKREGLIR 250
ARLRGAAVAT GEVLTYLDSH CECMEGWMEP LLDRIKRDPT TVVCPVIDVI 300
DDNTFEYHHS KAYFTSVGGF DWGLQFNWHS IPERDRKNRT RPIDPVRSPT 350
MAGGLFSIDK KYFEKLGTYD PGFDIWGGEN LELSFKIWMC GGTLEIVPCS 400
HVGHVFRKRS PYKWRTGVNV LKRNSIRLAE VWLDDYKTYY YERINNQLGD 450
FGDISSRKKL REDLGCKSFK WYLDNIYPEL FVPGESVAKG EVRNSAVQPA 500
RCLDCMVGRH EKNRPVGTYQ CHGQGGNQYW MLSKDGEIRR DESCVDYAGS 550
DVMVFPCHGM KGNQEWRYNH DTGRLQHAVS QKCLGMTKDG AKLEMVACQY 600
DDPYQHWKFK EYNEAKAIEH GAKPPS 626
Length: 626
Mass (Da): 71,382
Last modified: August 16, 2004 - v2
Checksum: 561BD0576514B983
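
The canonical sequence above is displayed in groups of ten residues, with a running position count at the end of each line. The sketch below assumes that display layout and shows how such a line can be collapsed into a plain residue string before any position-based lookup. The helper name `strip_sequence_formatting` is hypothetical.

```python
def strip_sequence_formatting(block: str) -> str:
    """Keep only letters, dropping spaces, newlines and position counters."""
    return "".join(ch for ch in block if ch.isalpha())


# First and last display lines of isoform a, copied from the entry.
first_line = "MIIFKKKAIL KVLLLVPVFW ICSLIFFAAT SNDSSQIGSN NDLANKIAEA    50"
last_line = "DDPYQHWKFK EYNEAKAIEH GAKPPS 626"

print(len(strip_sequence_formatting(first_line)))  # 50
print(strip_sequence_formatting(last_line))        # DDPYQHWKFKEYNEAKAIEHGAKPPS
```

Applying the same function to the whole block should give a string whose length matches the stated Length of 626, which is a quick check that no residues were lost in copying.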
Isoform b (identifier: Q95ZJ1-2)

Also known as: GLY5a, GLY-5a

The sequence of this isoform differs from the canonical sequence as follows:
     492-523: VRNSAVQPARCLDCMVGRHEKNRPVGTYQCHG → MRNAGGKNRQCIDYKPSGGKTVGMYQCHN

Length: 623
Mass (Da): 71,014
Checksum: 722AC7E93EF5FE4D
Isoform c (identifier: Q95ZJ1-3)

Also known as: GLY5c, GLY-5c

The sequence of this isoform differs from the canonical sequence as follows:
     492-523: VRNSAVQPARCLDCMVGRHEKNRPVGTYQCHG → LRNAQTSQCLDSAVGEEVENKAITPYPCHE

Length: 624
Mass (Da): 71,103
Checksum: 4959436CAE9916D4
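
The isoform descriptions above define isoforms b and c as substitutions of residues 492-523 of the canonical sequence. The sketch below shows how such a substitution can be applied and why the stated lengths (623 and 624) follow from it. The function name `apply_alternative_sequence` is hypothetical, and the flanking residues are placeholder `X` characters rather than the real sequence, so only the lengths are meaningful here.

```python
CANONICAL_LENGTH = 626
ORIGINAL = "VRNSAVQPARCLDCMVGRHEKNRPVGTYQCHG"  # residues 492-523 of isoform a
REPLACEMENTS = {
    "b": "MRNAGGKNRQCIDYKPSGGKTVGMYQCHN",
    "c": "LRNAQTSQCLDSAVGEEVENKAITPYPCHE",
}


def apply_alternative_sequence(canonical, start, end, original, replacement):
    """Replace residues start..end (1-based, inclusive) of `canonical`."""
    if canonical[start - 1:end] != original:
        raise ValueError("canonical sequence does not match the stated segment")
    return canonical[:start - 1] + replacement + canonical[end:]


# Stand-in for isoform a: only residues 492-523 are real; the flanks are
# placeholder 'X' characters padded out to the canonical length of 626.
stand_in = "X" * 491 + ORIGINAL + "X" * (CANONICAL_LENGTH - 523)

for name, replacement in REPLACEMENTS.items():
    isoform = apply_alternative_sequence(stand_in, 492, 523, ORIGINAL, replacement)
    print(name, len(isoform))
# b 623
# c 624
```

Substituting the real 626-residue canonical sequence for `stand_in` would yield the actual isoform sequences. The guard on `original` catches off-by-one errors in the coordinates.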

Alternative sequence

Feature key            Position(s)   Length   Description                                                   Feature identifier
Alternative sequence   492 – 523     32       VRNSA…YQCHG → MRNAGGKNRQCIDYKPSGGKTVGMYQCHN in isoform b      VSP_011238
Alternative sequence   492 – 523     32       VRNSA…YQCHG → LRNAQTSQCLDSAVGEEVENKAITPYPCHE in isoform c     VSP_011239

Sequence conflict

Feature key         Position(s)   Length   Description
Sequence conflict   361 – 361     1        K → E in AAC13671 (1 publication)
Sequence conflict   361 – 361     1        K → E in AAC13672 (1 publication)
Sequence conflict   361 – 361     1        K → E in AAC13673 (1 publication)

Sequence databases

EMBL / GenBank / DDBJ:
AF031835 mRNA. Translation: AAC13671.1.
AF031836 mRNA. Translation: AAC13672.1.
AF031837 mRNA. Translation: AAC13673.1.
AL110487 Genomic DNA. Translation: CAB54435.1.
AL110487 Genomic DNA. Translation: CAC42369.1.
AL110487 Genomic DNA. Translation: CAC42368.1.
PIR: T42245.
T42246.
T42247.
RefSeq: NP_001022850.1. NM_001027679.4. [Q95ZJ1-1]
NP_001022851.1. NM_001027680.4. [Q95ZJ1-2]
NP_001022852.1. NM_001027681.5. [Q95ZJ1-3]
UniGene: Cel.19665.

Genome annotation databases

EnsemblMetazoa: Y39E4B.12a.1; Y39E4B.12a.1; WBGene00001630. [Q95ZJ1-1]
Y39E4B.12a.2; Y39E4B.12a.2; WBGene00001630. [Q95ZJ1-1]
GeneID: 176736.
KEGG: cel:CELE_Y39E4B.12.
UCSC: Y39E4B.12c. c. elegans. [Q95ZJ1-1]

Keywords - Coding sequence diversity

Alternative splicing

Cross-references


Organism-specific databases

CTD: 176736.


Miscellaneous databases

NextBio: 893796.


Publications

  1. "cDNA cloning and expression of a family of UDP-N-acetyl-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase sequence homologs from Caenorhabditis elegans."
    Hagen F.K., Nehrke K.
    J. Biol. Chem. 273:8268-8277(1998)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS A; B AND C).
    Strain: Bristol N2.
  2. "Genome sequence of the nematode C. elegans: a platform for investigating biology."
    The C. elegans sequencing consortium
    Science 282:2012-2018(1998)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: Bristol N2.

Entry information

Entry name: GALT5_CAEEL
Accession: Primary (citable) accession number: Q95ZJ1
Secondary accession number(s): O61391, O61392, O61393, Q95ZJ2, Q9U2J8
Entry history
Integrated into UniProtKB/Swiss-Prot: August 16, 2004
Last sequence update: August 16, 2004
Last modified: July 9, 2014
This is version 109 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Caenorhabditis annotation project

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Caenorhabditis elegans
    Caenorhabditis elegans: entries, gene names and cross-references to WormBase
  2. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  3. SIMILARITY comments
    Index of protein domains and families
