A1R9U4 (A1R9U4_ARTAT) Unreviewed, UniProtKB/TrEMBL

Last modified January 25, 2012. Version 33.

Names and origin

Protein names
Recommended name: Beta-galactosidase (PIRNR PIRNR001084)
Short name: Beta-gal (PIRNR PIRNR001084)
EC: 3.2.1.23 (PIRNR PIRNR001084)
Gene names
Ordered locus names: AAur_3310
Organism: Arthrobacter aurescens (strain TC1) [Complete proteome] [HAMAP] (EMBL ABM06369.1)
Taxonomic identifier: 290340 [NCBI]
Taxonomic lineage: Bacteria > Actinobacteria > Actinobacteridae > Actinomycetales > Micrococcineae > Micrococcaceae > Arthrobacter

Protein attributes

Sequence length: 672 AA.
Sequence status: Complete.
Protein existence: Inferred from homology

General annotation (Comments)

Catalytic activity

Hydrolysis of terminal non-reducing beta-D-galactose residues in beta-D-galactosides. PIRNR PIRNR001084

Sequence similarities

Belongs to the glycosyl hydrolase 42 family. PIRNR PIRNR001084

Ontologies

Sequence annotation (Features)

Feature key    Position(s)  Length  Description

Sites

Active site    160          1       Proton donor. By similarity. PIRSR PIRSR001084-1
Active site    318          1       Nucleophile. By similarity. PIRSR PIRSR001084-1
Binding site   121          1       Substrate. By similarity. PIRSR PIRSR001084-2
Binding site   159          1       Substrate. By similarity. PIRSR PIRSR001084-2
Binding site   326          1       Substrate. By similarity. PIRSR PIRSR001084-2

Sequences

Sequence: A1R9U4 [UniParc]
Length: 672 AA
Mass (Da): 73,849
Last modified February 6, 2007. Version 1.
Checksum: 9C1E9014B5967D7F
        10         20         30         40         50         60 
MTSPEIPRSP SVWNSIHGLA YGGDYNPEQW PEDIRLEDIE LMKEAGVNFL SVGIFSWGLL 

        70         80         90        100        110        120 
EPAEGNYDFS WLDDVMDNLH GAGIKVALAT ATASPPAWLA RKYPEILPVT AEGIRLERGS 

       130        140        150        160        170        180 
RRHYTPSSSV YRRYATAMTR VIAERYKNHP ALALWHVDNE LGCHVGEFHG EEDAAAFRAW 

       190        200        210        220        230        240 
LERRYGSIEA LNEAWGTAFW SQHYASFDEI IPPGAAPTTL NPGQQLDFAR FNSWAFIDYY 

       250        260        270        280        290        300 
RELLAVIREV TPGIPATTNF MVSSATKALD YFDWSKDMDV VANDHYLVAA DPEREIELAF 

       310        320        330        340        350        360 
SADLTRGVAG GEPWILMEHS TSAVNWQPHN QPKMPGEMLR NSLTHVARGA DAVMFFQWRQ 

       370        380        390        400        410        420 
SKAGSEKFHS AMVPHGGRDT QVWRNVVDLG DALSKLETVK GSRVESRVAI VFDYEAWWAS 

       430        440        450        460        470        480 
ELDSHPNNSL KYLDTMRAFH RSLYRRGITA DFVHPSSDLS GYDLILVCTL YSVADAAAAS 

       490        500        510        520        530        540 
IAAAAEGGAT VLISYFSGIV DERDHVRLGG YPGAFRELLG VRSEEFHPLF PGTSVTLSDG 

       550        560        570        580        590        600 
TVGSVWSEHV HAAEGTEILA TFTDYPLGEV PALTRRSVGT GSAWYLATLP DADGIDSLTA 

       610        620        630        640        650        660 
RLVEEAGVRA VSEMSAGVEL TRRRAADGRT FLFAINHSQE DVTVKADGGE LLSGARFTGV 

       670 
VPAGAVAVIA ED 
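
The annotated sites can be cross-checked against the sequence block above. The following is a minimal sketch, not part of the UniProt entry: the names `RAW_BLOCK`, `parse_sequence_block`, `residue_at`, and `SITES` are hypothetical helpers introduced here for illustration. It strips the position rulers and spacing from the block as displayed, then looks up the residue at each 1-based position listed under Sequence annotation.

```python
import re

# Sequence block copied from the entry as displayed, including position rulers.
RAW_BLOCK = """
        10         20         30         40         50         60
MTSPEIPRSP SVWNSIHGLA YGGDYNPEQW PEDIRLEDIE LMKEAGVNFL SVGIFSWGLL
        70         80         90        100        110        120
EPAEGNYDFS WLDDVMDNLH GAGIKVALAT ATASPPAWLA RKYPEILPVT AEGIRLERGS
       130        140        150        160        170        180
RRHYTPSSSV YRRYATAMTR VIAERYKNHP ALALWHVDNE LGCHVGEFHG EEDAAAFRAW
       190        200        210        220        230        240
LERRYGSIEA LNEAWGTAFW SQHYASFDEI IPPGAAPTTL NPGQQLDFAR FNSWAFIDYY
       250        260        270        280        290        300
RELLAVIREV TPGIPATTNF MVSSATKALD YFDWSKDMDV VANDHYLVAA DPEREIELAF
       310        320        330        340        350        360
SADLTRGVAG GEPWILMEHS TSAVNWQPHN QPKMPGEMLR NSLTHVARGA DAVMFFQWRQ
       370        380        390        400        410        420
SKAGSEKFHS AMVPHGGRDT QVWRNVVDLG DALSKLETVK GSRVESRVAI VFDYEAWWAS
       430        440        450        460        470        480
ELDSHPNNSL KYLDTMRAFH RSLYRRGITA DFVHPSSDLS GYDLILVCTL YSVADAAAAS
       490        500        510        520        530        540
IAAAAEGGAT VLISYFSGIV DERDHVRLGG YPGAFRELLG VRSEEFHPLF PGTSVTLSDG
       550        560        570        580        590        600
TVGSVWSEHV HAAEGTEILA TFTDYPLGEV PALTRRSVGT GSAWYLATLP DADGIDSLTA
       610        620        630        640        650        660
RLVEEAGVRA VSEMSAGVEL TRRRAADGRT FLFAINHSQE DVTVKADGGE LLSGARFTGV
       670
VPAGAVAVIA ED
"""

# Sites from the entry's feature table: 1-based position -> (feature key, description).
SITES = {
    121: ("Binding site", "Substrate"),
    159: ("Binding site", "Substrate"),
    160: ("Active site", "Proton donor"),
    318: ("Active site", "Nucleophile"),
    326: ("Binding site", "Substrate"),
}


def parse_sequence_block(block: str) -> str:
    """Remove ruler digits and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def residue_at(seq: str, position: int) -> str:
    """Return the residue at a 1-based position, as used in the feature table."""
    return seq[position - 1]


SEQ = parse_sequence_block(RAW_BLOCK)

if __name__ == "__main__":
    print(f"length: {len(SEQ)} AA")
    for pos in sorted(SITES):
        key, desc = SITES[pos]
        print(f"{key} {pos} ({desc}): {residue_at(SEQ, pos)}")
```

Run as a script, this reports a length of 672 AA, matching the stated sequence length, and shows glutamate (E) at both annotated active-site positions, 160 and 318.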

References

[1]"Secrets of soil survival revealed by the genome sequence of Arthrobacter aurescens TC1."
Mongodin E.F., Shapir N., Daugherty S.C., DeBoy R.T., Emerson J.B., Shvartzbeyn A., Radune D., Vamathevan J., Riggs F., Grinberg V., Khouri H.M., Wackett L.P., Nelson K.E., Sadowsky M.J.
PLoS Genet. 2:2094-2106(2006) [PubMed: 17194220]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: CP000474 Genomic DNA. Translation: ABM06369.1.
RefSeq: YP_949006.1. NC_008711.1.

3D structure databases

ProteinModelPortal: A1R9U4.
ModBase: Search...

Protein-protein interaction databases

STRING: A1R9U4.

Protein family/group databases

CAZy: GH42. Glycoside Hydrolase Family 42.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

GeneID: 4638672.
GenomeReviews: Gene locus AAur_3310 in contig CP000474_GR.
KEGG: aau:AAur_3310.
PATRIC: 20982136. VBIArtAur67810_3372.
TIGR: AAur_3310.

Phylogenomic databases

eggNOG: COG1874.
HOGENOM: HBG476453.
OMA: PLAWFRA.
ProtClustDB: CLSK780343.

Family and domain databases

InterPro: IPR013739. Beta_galactosidase_C.
          IPR013738. Beta_galactosidase_Trimer.
          IPR003476. Glyco_hydro_42.
          IPR013529. Glyco_hydro_42_N.
          IPR013781. Glyco_hydro_subgr_catalytic.
          IPR017853. Glycoside_hydrolase_SF.
Gene3D: G3DSA:3.20.20.80. Glyco_hydro_cat. 1 hit.
KO: K01190.
Pfam: PF02449. Glyco_hydro_42. 1 hit.
      PF08533. Glyco_hydro_42C. 1 hit.
      PF08532. Glyco_hydro_42M. 1 hit.
PIRSF: PIRSF001084. B-galactosidase. 1 hit.
SUPFAM: SSF51445. Glyco_hydro_cat. 1 hit.
ProtoNet: Search...

Entry information

Entry name: A1R9U4_ARTAT
Accession: Primary (citable) accession number: A1R9U4
Entry history:
Integrated into UniProtKB/TrEMBL: February 6, 2007
Last sequence update: February 6, 2007
Last modified: January 25, 2012
This is version 33 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)