Skip Header

Contribute Send feedback
Read comments (?) or add your own

A4XIJ3 (A4XIJ3_CALS8) Unreviewed, UniProtKB/TrEMBL

Last modified January 25, 2012. Version 25. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
Gene names
Ordered Locus Names:Csac_1118
OrganismCaldicellulosiruptor saccharolyticus (strain ATCC 43494 / DSM 8903) [Complete proteome] [HAMAP]
Taxonomic identifier351627 [NCBI]
Taxonomic lineageBacteriaFirmicutesClostridiaThermoanaerobacteralesThermoanaerobacterales Family III. Incertae SedisCaldicellulosiruptor

Protein attributes

Sequence length729 AA.
Sequence statusComplete.
Protein existencePredicted

Ontologies

Keywords
   Molecular functionHydrolase EMBL ABP66728.1
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processcarbohydrate metabolic process

Inferred from electronic annotation. Source: InterPro

   Molecular functionalpha-galactosidase activity

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequences

Sequence LengthMass (Da)Tools
A4XIJ3 [UniParc].

Last modified May 29, 2007. Version 1.
Checksum: C6FD7DF0A47BD580

FASTA72984,489
        10         20         30         40         50         60 
MSITFNPQTN MFFIEAKNTS YIIKLFKGKF LSHVYWGKKI KEFEWTDFDV TGGRVFGATP 

        70         80         90        100        110        120 
DPNDKTYSFD TMLLEYPAYG NSDFRHPAYQ VEQEDGSRIT NLVYKTHRIY DGKPKLEGLP 

       130        140        150        160        170        180 
ATYVESPDEA QTLEIELYDD LIDLKVTLIY TAFRDFDVIT RSVRFENLGK QTLKILRAMS 

       190        200        210        220        230        240 
VCVDFPEGDF DLLHLWGSWA RERYVERIPL IHGMQVIDSA RGESSHQHNP FIALLSKDAT 

       250        260        270        280        290        300 
EKHGDVYGFS LVYSGNFAAI VEKDQYNMLR VTMGINPFEF TWVLKPGESF QTPEVVMVNS 

       310        320        330        340        350        360 
QEGLGGMSRT YHKLYRKRLC RGVYRDKRRP ILINSWEATY FNFNEEKLLA LAKEAKELGI 

       370        380        390        400        410        420 
ELFVLDDGWF GKRDDDTSSL GDWFVDRRKL PNGLDGLGKK LNEMGLKFGL WFEPEMVSPD 

       430        440        450        460        470        480 
SELYRKHPDW CIQVRGRSLT QCRNQYVLDI TREDVRKEIL RMMKEILKAA PIEYIKWDMN 

       490        500        510        520        530        540 
RPLTEVGSLE LPPERQKEVF HRYVLGLYQM MEELTTEFPH ILFEGCSGGG GRFDPGILYY 

       550        560        570        580        590        600 
MPQIWTSDDT DAIERLKIQF GTSIVYPAST MGAHVSIVPN HQVGRVTPMK TRGVVALSGC 

       610        620        630        640        650        660 
FGYELDLTKL SCEDKEEIKR QIELYKRIWH IVFEGDLYRL ISPFDGNSAA WMYVTEDKKE 

       670        680        690        700        710        720 
AVVFYVEILR QPNPPIKRLK LDGLDPSKSY LIEGEQKTRF GDELMNIGLM IPQMWGDFNS 


HIWILKAVD 

« Hide

References

[1]"Genome sequence of the thermophilic hydrogen-producing bacterium Caldicellulosiruptor saccharolyticus DSM 8903."
Copeland A., Lucas S., Lapidus A., Barry K., Detter J.C., Glavina del Rio T., Hammon N., Israni S., Dalin E., Tice H., Pitluck S., Kiss H., Brettin T., Bruce D., Han C., Schmutz J., Larimer F., Land M. expand/collapse author list , Hauser L., Kyrpides N., Lykidis A., van de Werken H.J.G., Verhaart M.R.A., VanFossen A.L., Lewis D.L., Nichols J.D., Goorissen H.P., van Niel E.W.J., Stams F.J.M., Willquist K.U., Ward D.E., van der Oost J., Kelly R.M., Kengen S.M.W., Richardson P.
Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 43494 / DSM 8903.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP000679 Genomic DNA. Translation: ABP66728.1.
RefSeqYP_001179919.1. NC_009437.1.

3D structure databases

ProteinModelPortalA4XIJ3.
ModBaseSearch...

Protein-protein interaction databases

STRINGA4XIJ3.

Protein family/group databases

CAZyGH36. Glycoside Hydrolase Family 36.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID5088080.
GenomeReviewsGene locus Csac_1118 in contig CP000679_GR.
KEGGcsc:Csac_1118.
PATRIC21251884. VBICalSac56748_1257.

Organism-specific databases

CMRSearch...

Phylogenomic databases

eggNOGCOG3345.
HOGENOMHBG302872.
OMAMPQTWTS.
ProtClustDBCLSK2399973.

Family and domain databases

InterProIPR013785. Aldolase_TIM.
IPR002252. Glyco_hydro_36.
IPR000111. Glyco_hydro_GHD.
IPR017853. Glycoside_hydrolase_SF.
[Graphical view]
Gene3DG3DSA:3.20.20.70. Aldolase_TIM. 1 hit.
KOK07407.
PfamPF02065. Melibiase. 1 hit.
[Graphical view]
PIRSFPIRSF005536. Agal. 1 hit.
PRINTSPR00743. GLHYDRLASE36.
SUPFAMSSF51445. Glyco_hydro_cat. 1 hit.
PROSITEPS00512. ALPHA_GALACTOSIDASE. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameA4XIJ3_CALS8
AccessionPrimary (citable) accession number: A4XIJ3
Entry history
Integrated into UniProtKB/TrEMBL: May 29, 2007
Last sequence update: May 29, 2007
Last modified: January 25, 2012
This is version 25 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)