Q22295 (UGT50_CAEEL) Reviewed, UniProtKB/Swiss-Prot

Last modified June 11, 2014. Version 89.

Names and origin

Protein names: Recommended name: Putative UDP-glucuronosyltransferase ugt-50
   Short name=UDPGT 50
   EC=2.4.1.17
Gene names: Name: ugt-50
   Synonyms: ugt16
   ORF Names: T07C5.1
Organism: Caenorhabditis elegans [Reference proteome]
Taxonomic identifier: 6239 [NCBI]
Taxonomic lineage: Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis

Protein attributes

Sequence length: 523 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level

General annotation (Comments)

Catalytic activity

UDP-glucuronate + acceptor = UDP + acceptor beta-D-glucuronoside.

Subcellular location

Membrane; Single-pass membrane protein (Potential).

Sequence similarities

Belongs to the UDP-glycosyltransferase family.

Ontologies

Keywords
   Cellular component: Membrane
   Coding sequence diversity: Alternative splicing
   Domain: Signal; Transmembrane; Transmembrane helix
   Molecular function: Glycosyltransferase; Transferase
   PTM: Glycoprotein
   Technical term: Complete proteome; Reference proteome
Gene Ontology (GO)
   Cellular_component: integral component of membrane
      Inferred from electronic annotation. Source: UniProtKB-KW
   Molecular_function: glucuronosyltransferase activity
      Inferred from electronic annotation. Source: UniProtKB-EC

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform c (identifier: Q22295-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: No experimental confirmation available.
Isoform b (identifier: Q22295-2)

The sequence of this isoform differs from the canonical sequence as follows:
     182-190: LPTLPSYVP → KPMTTFAES
     191-523: Missing.
Note: No experimental confirmation available.
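
The two differences listed for isoform b can be applied mechanically to the canonical sequence. The following is a minimal sketch, assuming the residue ranges are 1-based and inclusive and that "Missing" means the range is deleted; the names `CANONICAL_BLOCKS`, `ISOFORM_B_EDITS` and `apply_edits` are illustrative and not part of the entry. The canonical sequence is copied from the Sequences section of this entry.

```python
# Canonical sequence (isoform c, Q22295-1), copied from the Sequences section.
CANONICAL_BLOCKS = """
MHYSQMRWMF FCLTALLHGS FIVNAAKILV YCPSISKSHV LLCSKYADLL HNAGHDTVLF
IPSYSKLLDN YDGAKHAKVW RLHNVTEAYD TKLGTLANVM ENSHIGFIDR LTFDADFWID
MCADLLGKLP EMQHIIDYKF DLVIYNEIDP CTPAIVRLFN IPKTVLLSSE AIMDKVAWNL
GLPTLPSYVP SVEENPNHDR MSFFERMSNV YKFFQSIVVH YLQDIHVLNL FRKEVSSDFP
SIAEIIRNVS LVLVNTDEIF DLPRSYSSKF VYVGMLEAGK DENVTLPKKQ DDYFKKGKSG
SVFVSFGTVT PFRSLPERIQ LSILNAIQKL PDYHFVVKTT ADDESSAQFF STVQNVDLVD
WVPQKAVLRH ANLKLFVSHG GMNSVLETMY YGVPMVIMPV FTDQFRNGRN VERRGAGKMV
LRETVVKETF FDAIHSVLEE KSYSSSVKRI SHLMKNKPFT SEERVTKWID FVLKYETSEH
FDLESNNLSI IEHNHLDLFF YLCIISLLNF VVYRKIFKRK SQS
"""
CANONICAL = "".join(CANONICAL_BLOCKS.split())  # drop spaces and newlines

# (start, end, replacement) with 1-based inclusive coordinates (assumption);
# an empty replacement stands for "Missing".
ISOFORM_B_EDITS = [
    (182, 190, "KPMTTFAES"),
    (191, 523, ""),
]


def apply_edits(sequence, edits):
    """Apply range replacements, last range first so coordinates stay valid."""
    result = sequence
    for start, end, replacement in sorted(edits, reverse=True):
        result = result[:start - 1] + replacement + result[end:]
    return result


# Sanity check: the canonical residues at 182-190 match the entry.
assert CANONICAL[181:190] == "LPTLPSYVP"

isoform_b = apply_edits(CANONICAL, ISOFORM_B_EDITS)
print(len(CANONICAL), len(isoform_b))
```

Under these assumptions the derived sequence has 190 residues, which agrees with the length listed for isoform b in the Sequences section.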

Sequence annotation (Features)

Feature key           Position(s)  Length  Description                                   Feature identifier

Molecule processing

Signal peptide        1 – 25       25      Potential
Chain                 26 – 523     498     Putative UDP-glucuronosyltransferase ugt-50   PRO_0000036055

Regions

Transmembrane         490 – 508    19      Helical; Potential

Amino acid modifications

Glycosylation         84           1       N-linked (GlcNAc...) Ref.2
Glycosylation         248          1       N-linked (GlcNAc...) Ref.2
Glycosylation         283          1       N-linked (GlcNAc...) Ref.2
Glycosylation         487          1       N-linked (GlcNAc...) Potential
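
N-linked glycosylation usually occurs at an asparagine in an N-X-S/T sequon, where X is any residue except proline. The sketch below checks that each annotated position fits this motif. It is an illustration only: the ten-residue blocks are copied from the canonical sequence display in this entry, and the names `SITES`, `SEQUON` and `has_sequon` are hypothetical. A matching motif does not by itself show that a site is glycosylated.

```python
import re

# (annotated site, position of the block's first residue, ten-residue block).
# Blocks are copied from the canonical sequence display of this entry.
SITES = [
    (84, 81, "RLHNVTEAYD"),
    (248, 241, "SIAEIIRNVS"),
    (283, 281, "DENVTLPKKQ"),
    (487, 481, "FDLESNNLSI"),
]

# Simplified sequon: Asn, any residue except Pro, then Ser or Thr.
SEQUON = re.compile(r"N[^P][ST]")


def has_sequon(site, block_start, block):
    """Return True if an N-X-S/T motif starts exactly at the given site."""
    offset = site - block_start  # 0-based index of the site inside the block
    return SEQUON.match(block, offset) is not None


for site, block_start, block in SITES:
    print(site, has_sequon(site, block_start, block))
```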

Natural variations

Alternative sequence  182 – 190    9       LPTLPSYVP → KPMTTFAES in isoform b.           VSP_021638
Alternative sequence  191 – 523    333     Missing in isoform b.                         VSP_021639
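
The Length column follows from the positions if the ranges are read as 1-based and inclusive, so length = end - start + 1. A short sketch under that assumption, which also shows the equivalent 0-based Python slice; the helper names `feature_length` and `to_slice` are illustrative.

```python
# (feature key, start, end, length as listed in the feature table)
FEATURES = [
    ("Signal peptide", 1, 25, 25),
    ("Chain", 26, 523, 498),
    ("Transmembrane", 490, 508, 19),
    ("Alternative sequence", 182, 190, 9),
    ("Alternative sequence", 191, 523, 333),
]


def feature_length(start, end):
    """Length of a 1-based inclusive range."""
    return end - start + 1


def to_slice(start, end):
    """Convert a 1-based inclusive range to a 0-based Python slice."""
    return slice(start - 1, end)


for key, start, end, listed in FEATURES:
    # Each computed length should equal the listed one.
    print(key, feature_length(start, end), listed)
```

For example, `sequence[to_slice(26, 523)]` would select the mature chain that remains after the predicted signal peptide is removed.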

Sequences

Isoform c.

Last modified November 28, 2006. Version 2.
Checksum: 6D00E382BFF26C4D
Length: 523 AA. Mass: 60,286 Da.

        10         20         30         40         50         60 
MHYSQMRWMF FCLTALLHGS FIVNAAKILV YCPSISKSHV LLCSKYADLL HNAGHDTVLF 

        70         80         90        100        110        120 
IPSYSKLLDN YDGAKHAKVW RLHNVTEAYD TKLGTLANVM ENSHIGFIDR LTFDADFWID 

       130        140        150        160        170        180 
MCADLLGKLP EMQHIIDYKF DLVIYNEIDP CTPAIVRLFN IPKTVLLSSE AIMDKVAWNL 

       190        200        210        220        230        240 
GLPTLPSYVP SVEENPNHDR MSFFERMSNV YKFFQSIVVH YLQDIHVLNL FRKEVSSDFP 

       250        260        270        280        290        300 
SIAEIIRNVS LVLVNTDEIF DLPRSYSSKF VYVGMLEAGK DENVTLPKKQ DDYFKKGKSG 

       310        320        330        340        350        360 
SVFVSFGTVT PFRSLPERIQ LSILNAIQKL PDYHFVVKTT ADDESSAQFF STVQNVDLVD 

       370        380        390        400        410        420 
WVPQKAVLRH ANLKLFVSHG GMNSVLETMY YGVPMVIMPV FTDQFRNGRN VERRGAGKMV 

       430        440        450        460        470        480 
LRETVVKETF FDAIHSVLEE KSYSSSVKRI SHLMKNKPFT SEERVTKWID FVLKYETSEH 

       490        500        510        520 
FDLESNNLSI IEHNHLDLFF YLCIISLLNF VVYRKIFKRK SQS 
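
The display above interleaves position rulers with ten-residue blocks. A minimal sketch for turning such a display back into a plain sequence string, assuming ruler tokens are purely numeric and residue blocks purely alphabetic; `parse_blocked_sequence` is an illustrative name, and only the first row is used as sample input.

```python
def parse_blocked_sequence(text):
    """Join residue blocks and skip the numeric position rulers."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isalpha():  # residue block; numeric tokens are rulers
                residues.append(token.upper())
    return "".join(residues)


sample = """
        10         20         30         40         50         60
MHYSQMRWMF FCLTALLHGS FIVNAAKILV YCPSISKSHV LLCSKYADLL HNAGHDTVLF
"""

sequence = parse_blocked_sequence(sample)
print(len(sequence), sequence[:10])
```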

Isoform b.

Checksum: 1032764C783A68BB
Length: 190 AA. Mass: 21,650 Da.

References

[1] "Genome sequence of the nematode C. elegans: a platform for investigating biology."
    The C. elegans sequencing consortium
    Science 282:2012-2018(1998)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], ALTERNATIVE SPLICING.
    Strain: Bristol N2.
[2] "Proteomics reveals N-linked glycoprotein diversity in Caenorhabditis elegans and suggests an atypical translocation mechanism for integral membrane proteins."
    Kaji H., Kamiie J., Kawakami H., Kido K., Yamauchi Y., Shinkawa T., Taoka M., Takahashi N., Isobe T.
    Mol. Cell. Proteomics 6:2100-2109(2007)
    Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-84; ASN-248 AND ASN-283, IDENTIFICATION BY MASS SPECTROMETRY.
    Strain: Bristol N2.

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: Z50006 Genomic DNA. Translation: CAA90301.1.
                   Z50006 Genomic DNA. Translation: CAD44148.1.
PIR: T24647.
     T24652.
RefSeq: NP_510118.1. NM_077717.3. [Q22295-2]
        NP_741913.1. NM_171786.5. [Q22295-1]
UniGene: Cel.406.

3D structure databases

ProteinModelPortal: Q22295.
SMR: Q22295. Positions 286-456.

Protein-protein interaction databases

STRING: 6239.T07C5.1c.

Protein family/group databases

CAZy: GT1. Glycosyltransferase Family 1.

Proteomic databases

PaxDb: Q22295.
PRIDE: Q22295.

Genome annotation databases

EnsemblMetazoa: T07C5.1c; T07C5.1c; WBGene00011564. [Q22295-1]
GeneID: 181413.
KEGG: cel:CELE_T07C5.1.
UCSC: T07C5.1b. c. elegans. [Q22295-1]

Organism-specific databases

CTD: 181413.
WormBase: T07C5.1b; CE18217; WBGene00011564; ugt-50.
          T07C5.1c; CE31603; WBGene00011564; ugt-50.

Phylogenomic databases

eggNOG: COG1819.
GeneTree: ENSGT00740000115714.
HOGENOM: HOG000018870.
InParanoid: Q22295.
KO: K00699.
OMA: DGAKHAK.
PhylomeDB: Q22295.

Family and domain databases

InterPro: IPR002213. UDP_glucos_trans.
PANTHER: PTHR11926. PTHR11926. 1 hit.
Pfam: PF00201. UDPGT. 1 hit.
PROSITE: PS00375. UDPGT. 1 hit.

Other

NextBio: 913828.

Entry information

Entry name: UGT50_CAEEL
Accession: Primary (citable) accession number: Q22295
   Secondary accession number(s): O62371, Q8MPX8
Entry history:
   Integrated into UniProtKB/Swiss-Prot: November 1, 1997
   Last sequence update: November 28, 2006
   Last modified: June 11, 2014
   This is version 89 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Caenorhabditis annotation project

Relevant documents

SIMILARITY comments

Index of protein domains and families

Caenorhabditis elegans

Caenorhabditis elegans: entries, gene names and cross-references to WormBase