Skip Header

 
Contribute Send feedback
Read comments (1) or add your own

Unreviewed, UniProtKB/TrEMBL O33835 (O33835_THEMA)

Last modified May 5, 2009. Version 47. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · Ontologies · Sequences · References · Cross-references · Entry information

Names and origin

Protein namesSubmitted name:
    Alpha-galactosidase EMBL CAA04514.1
    EC=3.2.1.22
Gene names
Name: galA EMBL CAA04514.1
Ordered Locus Names: TM_1192
OrganismThermotoga maritima [Complete proteome] [HAMAP] EMBL CAA04514.1
Taxonomic identifier2336 [NCBI]
Taxonomic lineageBacteriaThermotogaeThermotogalesThermotogaceaeThermotoga

Protein attributes

Sequence length552 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

Ontologies

Keywords
   Molecular functionGlycosidase
Hydrolase
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processcarbohydrate metabolic process

Inferred from electronic annotation. Source: InterPro

   Molecular functionalpha-galactosidase activity

Inferred from electronic annotation. Source: EC

Complete GO annotation...

Sequences

Sequence LengthMass (Da)Tools
O33835-1 [UniParc].

Last modified January 1, 1998. Version 1.
Checksum: 91C6E6EFA24EA9D5

FASTA55263,657
        10         20         30         40         50         60 
MEIFGKTFRE GRFVLKEKNF TVEFAVEKIH LGWKISGRVK GSPGRLEVLR TKAPEKVLVN 

        70         80         90        100        110        120 
NWQSWGPCRV VDAFSFKPPE IDPNWRYTAS VVPDVLERNL QSDYFVAEEG KVYGFLSSKI 

       130        140        150        160        170        180 
AHPFFAVEDG ELVAYLEYFD VEFDDFVPLE PLVVLEDPNT PLLLEKYAEL VGMENNARVP 

       190        200        210        220        230        240 
KHTPTGWCSW YHYFLDLTWE ETLKNLKLAK NFPFEVFQID DAYEKDIGDW LVTRGDFPSV 

       250        260        270        280        290        300 
EEMAKVIAEN GFIPGIWTAP FSVSETSDVF NEHPDWVVKE NGEPKMAYRN WNKKIYALDL 

       310        320        330        340        350        360 
SKDEVLNWLF DLFSSLRKMG YRYFKIDFLF AGAVPGERKK NITPIQAFRK GIETIRKAVG 

       370        380        390        400        410        420 
EDSFILGCGS PLLPAVGCVD GMRIGPDTAP FWGEHIEDNG APAARWALRN AITRYFMHDR 

       430        440        450        460        470        480 
FWLNDPDCLI LREEKTDLTQ KEKELYSYTC GVLDNMIIES DDLSLVRDHG KKVLKETLEL 

       490        500        510        520        530        540 
LGGRPRVQNI MSEDLRYEIV SSGTLSGNVK IVVDLNSREY HLEKEGKSSL KKRVVKREDG 

       550 
RNFYFYEEGE RE 

« Hide

References

« Hide 'large scale' references
[1]"Properties of an alpha-galactosidase, and structure of its gene, galA, within an alpha- and beta-galactoside utilization gene cluster of the hyperthermophilic bacterium Thermotoga maritima."
Liebl W., Wagner B., Schellhase J.
Submitted (AUG-1997) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
Strain: MSB8 EMBL CAA04514.1.
[2]"Evidence for lateral gene transfer between Archaea and Bacteria from genome sequence of Thermotoga maritima."
Nelson K.E., Clayton R.A., Gill S.R., Gwinn M.L., Dodson R.J., Haft D.H., Hickey E.K., Peterson J.D., Nelson W.C., Ketchum K.A., McDonald L.A., Utterback T.R., Malek J.A., Linher K.D., Garrett M.M., Stewart A.M., Cotton M.D., Pratt M.S. expand/collapse author list , Phillips C.A., Richardson D.L., Heidelberg J.F., Sutton G.G., Fleischmann R.D., Eisen J.A., White O., Salzberg S.L., Smith H.O., Venter J.C., Fraser C.M.
Nature 399:323-329(1999) [PubMed: 10360571] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 43589 / MSB8 / DSM 3109 / JCM 10099.

Cross-references

Sequence databases

AE000512 Genomic DNA. Translation: AAD36267.1.
AJ001072 Genomic DNA. Translation: CAA04514.1.
PIRE72283.
RefSeqNP_228997.1.

3D structure databases

EntryMethodResolution (Å)ChainPositionsPDBsum
1ZY9X-ray2.34A1-552[»]
ModBaseSearch...

Protein family/group databases

CAZyGH36. Glycoside Hydrolase Family 36.

Genome annotation databases

GeneID898292.
GenomeReviewsGene locus TM_1192 in contig AE000512_GR.
KEGGtma:TM1192.
NMPDRfig|243274.1.peg.1181.
TIGRTM_1192.

Phylogenomic databases

HOGENOMO33835.
OMAO33835. YFKIDFL.

Enzyme and pathway databases

BioCycTMAR243274:TM_1192-MON.

Family and domain databases

InterProIPR013785. Aldolase_TIM.
IPR000322. Glyco_hydro_31.
[Graphical view]
Gene3DG3DSA:3.20.20.70. Aldolase_TIM. 1 hit.
PANTHERPTHR22762. Glyco_hydro_31. 1 hit.
ProtoNetSearch...

Entry information

Entry nameO33835_THEMA
AccessionPrimary (citable) accession number: O33835
Entry history
Integrated into UniProtKB/TrEMBL: January 1, 1998
Last sequence update: January 1, 1998
Last modified: May 5, 2009
This is version 47 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)
Names and origin · Protein attributes · Ontologies · Sequences · References · Cross-references · Entry information