Protein

Beta-galactosidase 4

Gene

Os02g0219200

Organism
Oryza sativa subsp. japonica (Rice)
Status
Reviewed. Annotation score: 2 out of 5. Experimental evidence at transcript level.

Function

Catalytic activity

Hydrolysis of terminal non-reducing beta-D-galactose residues in beta-D-galactosides.

Sites

Feature key  Position(s)  Length  Description   Evidence
Active site  194 – 194    1       Proton donor  Sequence analysis
Active site  263 – 263    1       Nucleophile   Sequence analysis

GO - Molecular function

  1. beta-galactosidase activity Source: UniProtKB-EC

GO - Biological process

  1. carbohydrate metabolic process Source: InterPro

Keywords - Molecular function

Glycosidase, Hydrolase

Protein family/group databases

CAZy: GH35. Glycoside Hydrolase Family 35.

Names & Taxonomy

Protein names
Recommended name:
Beta-galactosidase 4 (EC:3.2.1.23)
Short name:
Lactase 4
Gene names
Ordered Locus Names: Os02g0219200, LOC_Os02g12730
ORF Names: P0027A02.24
Organism: Oryza sativa subsp. japonica (Rice)
Taxonomic identifier: 39947 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > Liliopsida > Poales > Poaceae > BEP clade > Ehrhartoideae > Oryzeae > Oryza
Proteomes: UP000000763. Component: Chromosome 2

Organism-specific databases

Gramene: Q6Z6K4.

Subcellular location

GO - Cellular component

  1. apoplast Source: UniProtKB-SubCell

Keywords - Cellular component

Apoplast, Secreted

PTM / Processing

Molecule processing

Feature key     Position(s)  Length  Description           Evidence           Feature identifier
Signal peptide  1 – 35       35      -                     Sequence analysis  -
Chain           36 – 729     694     Beta-galactosidase 4  -                  PRO_0000294156
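The feature positions in this entry are 1-based and inclusive, so each length equals end minus start plus one. The short Python sketch below checks that arithmetic for the two molecule-processing features and confirms that together they cover the 729-residue precursor without gaps or overlaps. The `Feature` class and `tiles_sequence` helper are hypothetical names introduced for illustration only; they are not part of any UniProt tool.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    """A sequence feature with 1-based, inclusive coordinates."""
    key: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: positions 1..35 span 35 residues.
        return self.end - self.start + 1


# Values copied from the molecule-processing table of this entry.
FEATURES = [
    Feature("Signal peptide", 1, 35),
    Feature("Chain", 36, 729),
]


def tiles_sequence(features: List[Feature], total_length: int) -> bool:
    """Return True if the features cover 1..total_length exactly once."""
    expected_start = 1
    for feature in sorted(features, key=lambda f: f.start):
        if feature.start != expected_start:
            return False  # gap or overlap before this feature
        expected_start = feature.end + 1
    return expected_start == total_length + 1


if __name__ == "__main__":
    for feature in FEATURES:
        print(feature.key, feature.length)
    print("covers precursor:", tiles_sequence(FEATURES, 729))
```

The same check can be reused for any set of processing features, provided the coordinates follow the 1-based inclusive convention used here.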

Interaction

Protein-protein interaction databases

STRING: 39947.LOC_Os02g12730.1.

Structure

3D structure databases

ProteinModelPortal: Q6Z6K4.

Family & Domains

Sequence similarities

Belongs to the glycosyl hydrolase 35 family. (Curated)

Keywords - Domain

Signal

Phylogenomic databases

eggNOG: COG1874.
InParanoid: Q6Z6K4.
OMA: GHSMQVF.

Family and domain databases

Gene3D: 2.60.120.260. 2 hits.
        3.20.20.80. 1 hit.
InterPro: IPR025300. BetaGal_jelly_roll_dom.
          IPR008979. Galactose-bd-like.
          IPR019801. Glyco_hydro_35_CS.
          IPR013781. Glyco_hydro_catalytic_dom.
          IPR001944. Glycoside_Hdrlase_35.
          IPR017853. Glycoside_hydrolase_SF.
PANTHER: PTHR23421. 1 hit.
Pfam: PF13364. BetaGal_dom4_5. 1 hit.
      PF01301. Glyco_hydro_35. 1 hit.
PRINTS: PR00742. GLHYDRLASE35.
SUPFAM: SSF49785. 2 hits.
        SSF51445. 1 hit.
PROSITE: PS01182. GLYCOSYL_HYDROL_F35. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q6Z6K4-1 [UniParc]

        10         20         30         40         50
MAPAPTPAAA AGRRVAVLAA ALVAASLAAS VGVANAAVSY DRRSLVINGR
        60         70         80         90        100
RRILLSGSIH YPRSTPEMWP GLIQKAKDGG LDVIQTYVFW NGHEPVQGQY
       110        120        130        140        150
YFSDRYDLVR FVKLVKQAGL YVHLRIGPYV CAEWNFGGFP VWLKYVPGVS
       160        170        180        190        200
FRTDNGPFKA EMQKFVEKIV SMMKSEGLFE WQGGPIIMSQ VENEFGPMES
       210        220        230        240        250
VGGSGAKPYA NWAAKMAVGT NTGVPWVMCK QDDAPDPVIN TCNGFYCDYF
       260        270        280        290        300
SPNKNYKPSM WTEAWTGWFT SFGGGVPHRP VEDLAFAVAR FIQKGGSFVN
       310        320        330        340        350
YYMYHGGTNF GRTAGGPFIA TSYDYDAPID EFGLLRQPKW GHLRDLHRAI
       360        370        380        390        400
KQAEPVLVSA DPTIESIGSY EKAYVFKAKN GACAAFLSNY HMNTAVKVRF
       410        420        430        440        450
NGQQYNLPAW SISILPDCKT AVFNTATVKE PTLMPKMNPV VRFAWQSYSE
       460        470        480        490        500
DTNSLSDSAF TKDGLVEQLS MTWDKSDYLW YTTYVNIGTN DLRSGQSPQL
       510        520        530        540        550
TVYSAGHSMQ VFVNGKSYGS VYGGYDNPKL TYNGRVKMWQ GSNKISILSS
       560        570        580        590        600
AVGLPNVGNH FENWNVGVLG PVTLSSLNGG TKDLSHQKWT YQVGLKGETL
       610        620        630        640        650
GLHTVTGSSA VEWGGPGGYQ PLTWHKAFFN APAGNDPVAL DMGSMGKGQL
       660        670        680        690        700
WVNGHHVGRY WSYKASGGCG GCSYAGTYHE DKCRSNCGDL SQRWYHVPRS
       710        720
WLKPGGNLLV VLEEYGGDLA GVSLATRTT
Length: 729
Mass (Da): 79,996
Last modified: July 5, 2004 - v1
Checksum: 877E85CECA84EEEE
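
As a quick consistency check on the annotations in this entry, the Python sketch below strips the ruler numbers and spacing from the displayed sequence, then looks up the two annotated active-site positions and splits the precursor at the annotated signal-peptide boundary. The helper names `parse_display` and `residue_at` are hypothetical, chosen for illustration only; they do not belong to any UniProt tool or library.

```python
# Sequence rows copied from the display above (ruler lines omitted).
DISPLAYED_SEQUENCE = """
MAPAPTPAAA AGRRVAVLAA ALVAASLAAS VGVANAAVSY DRRSLVINGR
RRILLSGSIH YPRSTPEMWP GLIQKAKDGG LDVIQTYVFW NGHEPVQGQY
YFSDRYDLVR FVKLVKQAGL YVHLRIGPYV CAEWNFGGFP VWLKYVPGVS
FRTDNGPFKA EMQKFVEKIV SMMKSEGLFE WQGGPIIMSQ VENEFGPMES
VGGSGAKPYA NWAAKMAVGT NTGVPWVMCK QDDAPDPVIN TCNGFYCDYF
SPNKNYKPSM WTEAWTGWFT SFGGGVPHRP VEDLAFAVAR FIQKGGSFVN
YYMYHGGTNF GRTAGGPFIA TSYDYDAPID EFGLLRQPKW GHLRDLHRAI
KQAEPVLVSA DPTIESIGSY EKAYVFKAKN GACAAFLSNY HMNTAVKVRF
NGQQYNLPAW SISILPDCKT AVFNTATVKE PTLMPKMNPV VRFAWQSYSE
DTNSLSDSAF TKDGLVEQLS MTWDKSDYLW YTTYVNIGTN DLRSGQSPQL
TVYSAGHSMQ VFVNGKSYGS VYGGYDNPKL TYNGRVKMWQ GSNKISILSS
AVGLPNVGNH FENWNVGVLG PVTLSSLNGG TKDLSHQKWT YQVGLKGETL
GLHTVTGSSA VEWGGPGGYQ PLTWHKAFFN APAGNDPVAL DMGSMGKGQL
WVNGHHVGRY WSYKASGGCG GCSYAGTYHE DKCRSNCGDL SQRWYHVPRS
WLKPGGNLLV VLEEYGGDLA GVSLATRTT
"""


def parse_display(text: str) -> str:
    """Keep only letters, dropping spaces, newlines and ruler digits."""
    return "".join(ch for ch in text if ch.isalpha()).upper()


def residue_at(sequence: str, position: int) -> str:
    """Return the residue at a 1-based position, as used in the entry."""
    if not 1 <= position <= len(sequence):
        raise IndexError(f"position {position} outside 1..{len(sequence)}")
    return sequence[position - 1]


SEQUENCE = parse_display(DISPLAYED_SEQUENCE)

# Annotated signal peptide is 1-35, so the mature chain starts at 36.
SIGNAL_PEPTIDE = SEQUENCE[:35]
MATURE_CHAIN = SEQUENCE[35:]

if __name__ == "__main__":
    print("length:", len(SEQUENCE))
    print("proton donor (194):", residue_at(SEQUENCE, 194))
    print("nucleophile (263):", residue_at(SEQUENCE, 263))
    print("mature chain length:", len(MATURE_CHAIN))
```

With the sequence shown, the parsed length is 729, positions 194 and 263 both hold glutamate (E), and cutting after residue 35 leaves a 694-residue mature chain, in agreement with the feature tables of this entry.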

Sequence caution

The sequence BAF08224.1 differs from that shown. Reason: Erroneous gene model prediction. (Curated)

Sequence databases

EMBL / GenBank / DDBJ:
AP004996 Genomic DNA. Translation: BAD17189.1.
AP008208 Genomic DNA. Translation: BAF08224.1. Sequence problems.
AK059059 mRNA. No translation available.
RefSeq: NP_001046310.1. NM_001052845.1.
UniGene: Os.14358.

Genome annotation databases

GeneID: 4328745.
KEGG: osa:4328745.

Publications

  1. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: cv. Nipponbare.
  2. "The rice annotation project database (RAP-DB): 2008 update."
    The rice annotation project (RAP)
    Nucleic Acids Res. 36:D1028-D1033(2008) [PubMed] [Europe PMC] [Abstract]
    Cited for: GENOME REANNOTATION.
    Strain: cv. Nipponbare.
  3. "Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice."
    The rice full-length cDNA consortium
    Science 301:376-379(2003) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 230-729.
    Strain: cv. Nipponbare.

Entry information

Entry name: BGAL4_ORYSJ
Accession: Primary (citable) accession number: Q6Z6K4
Secondary accession number(s): Q0E2R4
Entry history
Integrated into UniProtKB/Swiss-Prot: July 10, 2007
Last sequence update: July 5, 2004
Last modified: January 7, 2015
This is version 67 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Plant Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. Oryza sativa (rice)
    Index of Oryza sativa entries and their corresponding gene designations
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (also known as the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
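
The two thresholds in the UniRef90 and UniRef50 descriptions (identity and overlap relative to the seed) can be illustrated with a small Python sketch. This is not the UniRef clustering procedure itself; it only shows how a single pairwise alignment might be tested against the stated cut-offs. The `AlignmentStats` class, the `passes_threshold` function, and the exact definitions of identity (identical positions divided by aligned positions) and overlap (aligned positions divided by seed length) are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class AlignmentStats:
    """Summary of one pairwise alignment against a cluster seed (hypothetical)."""
    identical: int    # aligned positions with the same residue
    aligned: int      # seed positions covered by the alignment
    seed_length: int  # length of the seed (longest) sequence


def passes_threshold(stats: AlignmentStats,
                     min_identity: float,
                     min_overlap: float = 0.80) -> bool:
    """Check the identity and overlap cut-offs quoted in the text.

    Assumed definitions: identity = identical / aligned,
    overlap = aligned / seed_length.
    """
    if stats.aligned == 0 or stats.seed_length == 0:
        return False
    identity = stats.identical / stats.aligned
    overlap = stats.aligned / stats.seed_length
    return identity >= min_identity and overlap >= min_overlap


if __name__ == "__main__":
    # Made-up numbers for a 729-residue seed, for demonstration only.
    candidate = AlignmentStats(identical=660, aligned=700, seed_length=729)
    print("UniRef90-style check:", passes_threshold(candidate, 0.90))
    print("UniRef50-style check:", passes_threshold(candidate, 0.50))
```

The alignment figures in the usage example are invented; real cluster membership depends on the alignments and software used by the UniRef pipeline.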