Skip Header

Contribute Send feedback
Read comments (?) or add your own

A7ZRF0 (A7ZRF0_ECO24) Unreviewed, UniProtKB/TrEMBL

Last modified December 14, 2011. Version 28. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
EC=3.2.1.22 EMBL ABV20016.1
Gene names
Name:rafA EMBL ABV20016.1
Ordered Locus Names:EcE24377A_3380
OrganismEscherichia coli O139:H28 (strain E24377A / ETEC) [Complete proteome] [HAMAP] EMBL ABV20016.1
Taxonomic identifier331111 [NCBI]
Taxonomic lineageBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia

Protein attributes

Sequence length708 AA.
Sequence statusComplete.
Protein existencePredicted

Ontologies

Keywords
   Molecular functionGlycosidase EMBL ABV20016.1
Hydrolase
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processcarbohydrate metabolic process

Inferred from electronic annotation. Source: InterPro

   Molecular functionraffinose alpha-galactosidase activity

Inferred from electronic annotation. Source: EC

Complete GO annotation...

Sequences

Sequence LengthMass (Da)Tools
A7ZRF0 [UniParc].

Last modified October 23, 2007. Version 1.
Checksum: AD9D7D1DB18E1214

FASTA70881,138
        10         20         30         40         50         60 
MVSKYCRLSS PRSDLIIKTR PHAEIIWWGS ALKHFSPDDC ASLERPVANG RLDIDTPLTL 

        70         80         90        100        110        120 
MAENALGLFS SPGLEGHRNG LDASPVFYTV DVEHTENTLR LTSEDSVAGL RLVSELVMTP 

       130        140        150        160        170        180 
SGILKVRHAL TNLREGDWQI NRFAITLPLA ERAEEVMAFH GRWTREFQPH RVRLTHDAFV 

       190        200        210        220        230        240 
LENRRGRTSH EHFPALIVGT PGFSEQQGEV WAVHLGWSGN HRMRCEAKTD GRRYVQAEAL 

       250        260        270        280        290        300 
WMPGEKALRK NETLYTPWLY ACHSADGLNG MSQQYHRFLR DEIIRFPEQK PRPVHLNTWE 

       310        320        330        340        350        360 
GIYFNHNPDY IMQMAERAAA LGVERFIIDD GWFKGRNDDR AALGDWYTDE QKYPNGLMPV 

       370        380        390        400        410        420 
IKHVKSLGME FGIWVEPEMI NPDSDLFRLH PDWVLSMPGY SQPTGRYQYV LNLNIPEAFA 

       430        440        450        460        470        480 
YIYERFLWLL GEHPVDYVKW DMNRELVQAG HEGRAAADAQ TRQFYRLLDL LRERFPHVEF 

       490        500        510        520        530        540 
ESCASGGGRI DFEVLKRTHR FWASDNNDAL ERCTIQRGMS YFFPPEVMGA HIGHRRCHAT 

       550        560        570        580        590        600 
FRQHSIAFRG LTALFGHMGL ELDPVAADAK ESDGYRRYAL LYKEWRQLIH TGVLWRVDMP 

       610        620        630        640        650        660 
DPSIQVQGVV SPDQSQALFM ISQLAMPDYT LPGILRFPGL AAEVRYRLRV IDHPDLQVVG 

       670        680        690        700 
EGGHTMRKLP VWMNQSLEAS GEWLAQGGIQ LPVLDPESAI LIALERAV 

« Hide

References

[1]"The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates."
Rasko D.A., Rosovitz M.J., Myers G.S.A., Mongodin E.F., Fricke W.F., Gajer P., Crabtree J., Sebaihia M., Thomson N.R., Chaudhuri R., Henderson I.R., Sperandio V., Ravel J.
J. Bacteriol. 190:6881-6893(2008) [PubMed: 18676672] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP000800 Genomic DNA. Translation: ABV20016.1.
RefSeqYP_001464379.1. NC_009801.1.

3D structure databases

ProteinModelPortalA7ZRF0.
ModBaseSearch...

Protein-protein interaction databases

STRINGA7ZRF0.

Protein family/group databases

CAZyGH36. Glycoside Hydrolase Family 36.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaEBESCT00000019341; EBESCP00000018403; EBESCG00000018395.
GeneID5589615.
GenomeReviewsGene locus EcE24377A_3380 in contig CP000800_GR.
KEGGecw:EcE24377A_3380.
NMPDRfig|331111.3.peg.1072.
PATRIC18296078. VBIEscCol31211_3654.

Organism-specific databases

CMRSearch...

Phylogenomic databases

eggNOGCOG3345.
GeneTreeEBGT00050000013654.
HOGENOMHBG299823.
OMATPWLYAS.
ProtClustDBCLSK888896.

Family and domain databases

InterProIPR013785. Aldolase_TIM.
IPR002252. Glyco_hydro_36.
IPR000111. Glyco_hydro_GHD.
IPR017853. Glycoside_hydrolase_SF.
[Graphical view]
Gene3DG3DSA:3.20.20.70. Aldolase_TIM. 1 hit.
KOK07407.
PfamPF02065. Melibiase. 1 hit.
[Graphical view]
PIRSFPIRSF005536. Agal. 1 hit.
PRINTSPR00743. GLHYDRLASE36.
SUPFAMSSF51445. Glyco_hydro_cat. 1 hit.
PROSITEPS00512. ALPHA_GALACTOSIDASE. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameA7ZRF0_ECO24
AccessionPrimary (citable) accession number: A7ZRF0
Entry history
Integrated into UniProtKB/TrEMBL: October 23, 2007
Last sequence update: October 23, 2007
Last modified: December 14, 2011
This is version 28 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)