Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Polypeptide N-acetylgalactosaminyltransferase 3

Gene

gly-3

Organism
Caenorhabditis elegans
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at transcript leveli

Functioni

Catalyzes the initial reaction in O-linked oligosaccharide biosynthesis, the transfer of an N-acetyl-D-galactosamine residue to a serine or threonine residue on the protein receptor.

Catalytic activityi

UDP-N-acetyl-alpha-D-galactosamine + polypeptide = UDP + N-acetyl-alpha-D-galactosaminyl-polypeptide.1 Publication

Cofactori

Mn2+By similarity

Pathwayi: protein glycosylation

This protein is involved in the pathway protein glycosylation, which is part of Protein modification.
View all proteins of this organism that are known to be involved in the pathway protein glycosylation and in Protein modification.

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Binding sitei210SubstrateBy similarity1
Binding sitei240SubstrateBy similarity1
Metal bindingi263ManganeseBy similarity1
Metal bindingi265ManganeseBy similarity1
Binding sitei370SubstrateBy similarity1
Metal bindingi398ManganeseBy similarity1
Binding sitei401SubstrateBy similarity1
Binding sitei406SubstrateBy similarity1

GO - Molecular functioni

GO - Biological processi

  • protein O-linked glycosylation via threonine Source: WormBase
Complete GO annotation...

Keywords - Molecular functioni

Glycosyltransferase, Transferase

Keywords - Ligandi

Lectin, Manganese, Metal-binding

Enzyme and pathway databases

ReactomeiR-CEL-6811436. COPI-independent Golgi-to-ER retrograde traffic.
UniPathwayiUPA00378.

Protein family/group databases

CAZyiCBM13. Carbohydrate-Binding Module Family 13.
GT27. Glycosyltransferase Family 27.

Names & Taxonomyi

Protein namesi
Recommended name:
Polypeptide N-acetylgalactosaminyltransferase 3 (EC:2.4.1.41)
Alternative name(s):
GalNAc-T1
Protein-UDP acetylgalactosaminyltransferase 3
UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 3
Short name:
pp-GaNTase 3
Gene namesi
Name:gly-3
ORF Names:ZK688.8
OrganismiCaenorhabditis elegans
Taxonomic identifieri6239 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis
Proteomesi
  • UP000001940 Componenti: Chromosome III

Organism-specific databases

WormBaseiZK688.8; CE29649; WBGene00001628; gly-3.

Subcellular locationi

Topology

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Topological domaini1 – 13CytoplasmicSequence analysisAdd BLAST13
Transmembranei14 – 34Helical; Signal-anchor for type II membrane proteinSequence analysisAdd BLAST21
Topological domaini35 – 612LumenalSequence analysisAdd BLAST578

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Golgi apparatus, Membrane

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000591461 – 612Polypeptide N-acetylgalactosaminyltransferase 3Add BLAST612

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Glycosylationi146N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi157 ↔ 393PROSITE-ProRule annotation
Glycosylationi239N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi384 ↔ 462PROSITE-ProRule annotation
Disulfide bondi496 ↔ 513PROSITE-ProRule annotation
Disulfide bondi536 ↔ 554PROSITE-ProRule annotation
Disulfide bondi582 ↔ 595PROSITE-ProRule annotation

Keywords - PTMi

Disulfide bond, Glycoprotein

Proteomic databases

EPDiP34678.
PaxDbiP34678.
PeptideAtlasiP34678.
PRIDEiP34678.

Expressioni

Gene expression databases

BgeeiWBGene00001628.

Interactioni

Protein-protein interaction databases

STRINGi6239.ZK688.8.1.

Structurei

3D structure databases

ProteinModelPortaliP34678.
SMRiP34678.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini478 – 607Ricin B-type lectinPROSITE-ProRule annotationAdd BLAST130

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni169 – 279Catalytic subdomain AAdd BLAST111
Regioni339 – 401Catalytic subdomain BAdd BLAST63

Domaini

There are two conserved domains in the glycosyltransferase region: the N-terminal domain (domain A, also called GT1 motif), which is probably involved in manganese coordination and substrate binding and the C-terminal domain (domain B, also called Gal/GalNAc-T motif), which is probably involved in catalytic reaction and UDP-Gal binding.By similarity
The ricin B-type lectin domain binds to GalNAc and contributes to the glycopeptide specificity.By similarity

Sequence similaritiesi

Contains 1 ricin B-type lectin domain.PROSITE-ProRule annotation

Keywords - Domaini

Signal-anchor, Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOGiKOG3736. Eukaryota.
ENOG410XPMK. LUCA.
GeneTreeiENSGT00760000118828.
HOGENOMiHOG000038227.
InParanoidiP34678.
KOiK00710.
OMAiAGGNQAW.
OrthoDBiEOG091G085O.
PhylomeDBiP34678.

Family and domain databases

Gene3Di3.90.550.10. 1 hit.
InterProiIPR001173. Glyco_trans_2-like.
IPR029044. Nucleotide-diphossugar_trans.
IPR000772. Ricin_B_lectin.
[Graphical view]
PfamiPF00535. Glycos_transf_2. 1 hit.
PF00652. Ricin_B_lectin. 1 hit.
[Graphical view]
SMARTiSM00458. RICIN. 1 hit.
[Graphical view]
SUPFAMiSSF50370. SSF50370. 1 hit.
SSF53448. SSF53448. 1 hit.
PROSITEiPS50231. RICIN_B_LECTIN. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

P34678-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MLSVGGGRSA VCRAVIATSI VWLLIDVVIL FYYLDPSTSQ QQPFPEDNRI
60 70 80 90 100
LNRARRIEPL PPAAQHDSDP DAHPIQPEKQ EKQVYPVDKE TANQLRKLME
110 120 130 140 150
TQAFGPGYHG QGGTGVTVPE DKKTIKEKRF LENQFNVVAS EMISVNRTLP
160 170 180 190 200
DYRSDACRTS GNNLKTAGMP KTSIIIVFHN EAWTTLLRTL HSVINRSPRH
210 220 230 240 250
LLEEIILVDD KSDRDYLVKP LDSYIKMFPI PIHLVHLENR SGLIRARLTG
260 270 280 290 300
SEMAKGKILL FLDAHVEVTD GWLEPLVSRV AEDRKRVVAP IIDVISDDTF
310 320 330 340 350
EYVTASETTW GGFNWHLNFR WYAVPKRELN RRGSDRSMPI QTPTIAGGLF
360 370 380 390 400
AIDKQFFYDI GSYDEGMQVW GGENLEISFR VWMCGGSLEI HPCSRVGHVF
410 420 430 440 450
RKQTPYTFPG GTAKVIHHNA ARTAEVWMDE YKAFFYKMVP AARNVEAGDV
460 470 480 490 500
SERKKLRETL QCKSFKWYLE NIYPEAPLPA DFRSLGAIVN RFTEKCVDTN
510 520 530 540 550
GKKDGQAPGI QACHGAGGNQ AWSLTGKGEI RSDDLCLSSG HVYQIGSELK
560 570 580 590 600
LERCSVSKIN VKHVFVFDDQ AGTLLHKKTG KCVTGADQRV TLDECGLGRK
610
DQMWQLEGYQ SP
Length:612
Mass (Da):68,912
Last modified:May 27, 2002 - v2
Checksum:i3031C2FE933F9858
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF031833 mRNA. Translation: AAC13669.1.
FO080277 Genomic DNA. Translation: CCD62541.1.
PIRiA88515.
T42243.
RefSeqiNP_498722.1. NM_066321.5.
UniGeneiCel.17588.

Genome annotation databases

EnsemblMetazoaiZK688.8; ZK688.8; WBGene00001628.
GeneIDi176112.
KEGGicel:CELE_ZK688.8.
UCSCiZK688.8.1. c. elegans.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF031833 mRNA. Translation: AAC13669.1.
FO080277 Genomic DNA. Translation: CCD62541.1.
PIRiA88515.
T42243.
RefSeqiNP_498722.1. NM_066321.5.
UniGeneiCel.17588.

3D structure databases

ProteinModelPortaliP34678.
SMRiP34678.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi6239.ZK688.8.1.

Protein family/group databases

CAZyiCBM13. Carbohydrate-Binding Module Family 13.
GT27. Glycosyltransferase Family 27.

Proteomic databases

EPDiP34678.
PaxDbiP34678.
PeptideAtlasiP34678.
PRIDEiP34678.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaiZK688.8; ZK688.8; WBGene00001628.
GeneIDi176112.
KEGGicel:CELE_ZK688.8.
UCSCiZK688.8.1. c. elegans.

Organism-specific databases

CTDi176112.
WormBaseiZK688.8; CE29649; WBGene00001628; gly-3.

Phylogenomic databases

eggNOGiKOG3736. Eukaryota.
ENOG410XPMK. LUCA.
GeneTreeiENSGT00760000118828.
HOGENOMiHOG000038227.
InParanoidiP34678.
KOiK00710.
OMAiAGGNQAW.
OrthoDBiEOG091G085O.
PhylomeDBiP34678.

Enzyme and pathway databases

UniPathwayiUPA00378.
ReactomeiR-CEL-6811436. COPI-independent Golgi-to-ER retrograde traffic.

Miscellaneous databases

PROiP34678.

Gene expression databases

BgeeiWBGene00001628.

Family and domain databases

Gene3Di3.90.550.10. 1 hit.
InterProiIPR001173. Glyco_trans_2-like.
IPR029044. Nucleotide-diphossugar_trans.
IPR000772. Ricin_B_lectin.
[Graphical view]
PfamiPF00535. Glycos_transf_2. 1 hit.
PF00652. Ricin_B_lectin. 1 hit.
[Graphical view]
SMARTiSM00458. RICIN. 1 hit.
[Graphical view]
SUPFAMiSSF50370. SSF50370. 1 hit.
SSF53448. SSF53448. 1 hit.
PROSITEiPS50231. RICIN_B_LECTIN. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiGALT3_CAEEL
AccessioniPrimary (citable) accession number: P34678
Secondary accession number(s): Q9U003
Entry historyi
Integrated into UniProtKB/Swiss-Prot: February 1, 1994
Last sequence update: May 27, 2002
Last modified: November 30, 2016
This is version 133 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programCaenorhabditis annotation project

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Caenorhabditis elegans
    Caenorhabditis elegans: entries, gene names and cross-references to WormBase
  2. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.