Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Regulatory protein GAL4

Gene

GAL4

Organism
Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

This protein is a positive regulator for the gene expression of the galactose-induced genes such as GAL1, GAL2, GAL7, GAL10, and MEL1 which code for the enzymes used to convert galactose to glucose. It recognizes a 17 base pair sequence in (5'-CGGRNNRCYNYNCNCCG-3') the upstream activating sequence (UAS-G) of these genes.

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Metal bindingi11Zinc 11
Metal bindingi11Zinc 21
Metal bindingi14Zinc 11
Metal bindingi21Zinc 11
Metal bindingi28Zinc 11
Metal bindingi28Zinc 21
Metal bindingi31Zinc 21
Metal bindingi38Zinc 21

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
DNA bindingi11 – 38Zn(2)-C6 fungal-typePROSITE-ProRule annotationAdd BLAST28

GO - Molecular functioni

GO - Biological processi

  • galactose metabolic process Source: SGD
  • positive regulation of transcription from RNA polymerase II promoter by galactose Source: SGD
Complete GO annotation...

Keywords - Molecular functioni

Activator

Keywords - Biological processi

Carbohydrate metabolism, Galactose metabolism, Transcription, Transcription regulation

Keywords - Ligandi

DNA-binding, Metal-binding, Zinc

Enzyme and pathway databases

BioCyciYEAST:G3O-34134-MONOMER.

Names & Taxonomyi

Protein namesi
Recommended name:
Regulatory protein GAL4
Gene namesi
Name:GAL4
Ordered Locus Names:YPL248C
OrganismiSaccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast)
Taxonomic identifieri559292 [NCBI]
Taxonomic lineageiEukaryotaFungiDikaryaAscomycotaSaccharomycotinaSaccharomycetesSaccharomycetalesSaccharomycetaceaeSaccharomyces
Proteomesi
  • UP000002311 Componenti: Chromosome XVI

Organism-specific databases

EuPathDBiFungiDB:YPL248C.
SGDiS000006169. GAL4.

Subcellular locationi

GO - Cellular componenti

  • nucleus Source: SGD
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Mutagenesis

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Mutagenesisi26P → L: Loss of DNA-binding. 1 Publication1

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00001149511 – 881Regulatory protein GAL4Add BLAST881

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei694PhosphotyrosineCombined sources1
Modified residuei696PhosphoserineCombined sources1
Modified residuei699PhosphoserineCombined sources1
Modified residuei703PhosphoserineCombined sources1
Modified residuei712PhosphoserineCombined sources1

Post-translational modificationi

Association between GAL11 and GAL4 may serve to expedite phosphorylation of GAL4.

Keywords - PTMi

Phosphoprotein

PTM databases

iPTMnetiP04386.

Interactioni

Subunit structurei

Binds DNA as a homodimer. Interacts directly with the mediator subunits GAL11/MED15 and SRB4/MED17.2 Publications

GO - Molecular functioni

  • RNA polymerase II transcription factor binding Source: SGD

Protein-protein interaction databases

BioGridi35915. 88 interactors.
DIPiDIP-593N.
IntActiP04386. 5 interactors.
MINTiMINT-1510289.

Structurei

Secondary structure

1881
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Helixi12 – 17Combined sources6
Beta strandi25 – 27Combined sources3
Helixi29 – 33Combined sources5
Helixi51 – 71Combined sources21
Beta strandi73 – 75Combined sources3
Helixi77 – 82Combined sources6
Helixi86 – 93Combined sources8

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
1AW6NMR-A1-43[»]
1D66X-ray2.70A/B1-65[»]
1HBWNMR-A/B50-106[»]
3BTSX-ray2.70E/F854-874[»]
3COQX-ray2.40A/B8-96[»]
ProteinModelPortaliP04386.
SMRiP04386.
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiP04386.

Family & Domainsi

Motif

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Motifi862 – 8709aaTAD9

Domaini

the 9aaTAD motif (residues 862 to 870) is a transactivation domain present in a large number of yeast and animal transcription factors.1 Publication

Sequence similaritiesi

Contains 1 Zn(2)-C6 fungal-type DNA-binding domain.PROSITE-ProRule annotation

Phylogenomic databases

HOGENOMiHOG000112689.
InParanoidiP04386.
KOiK09241.
OMAiRSFIWES.
OrthoDBiEOG092C4I3T.

Family and domain databases

CDDicd14654. ZIP_Gal4. 1 hit.
Gene3Di4.10.240.10. 1 hit.
InterProiIPR005600. Gal4_dimer_dom.
IPR007219. Transcription_factor_dom_fun.
IPR001138. Zn2-C6_fun-type_DNA-bd.
[Graphical view]
PfamiPF04082. Fungal_trans. 1 hit.
PF03902. Gal4_dimer. 1 hit.
PF00172. Zn_clus. 1 hit.
[Graphical view]
SMARTiSM00906. Fungal_trans. 1 hit.
SM00066. GAL4. 1 hit.
[Graphical view]
SUPFAMiSSF57701. SSF57701. 1 hit.
PROSITEiPS00463. ZN2_CY6_FUNGAL_1. 1 hit.
PS50048. ZN2_CY6_FUNGAL_2. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

P04386-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MKLLSSIEQA CDICRLKKLK CSKEKPKCAK CLKNNWECRY SPKTKRSPLT
60 70 80 90 100
RAHLTEVESR LERLEQLFLL IFPREDLDMI LKMDSLQDIK ALLTGLFVQD
110 120 130 140 150
NVNKDAVTDR LASVETDMPL TLRQHRISAT SSSEESSNKG QRQLTVSIDS
160 170 180 190 200
AAHHDNSTIP LDFMPRDALH GFDWSEEDDM SDGLPFLKTD PNNNGFFGDG
210 220 230 240 250
SLLCILRSIG FKPENYTNSN VNRLPTMITD RYTLASRSTT SRLLQSYLNN
260 270 280 290 300
FHPYCPIVHS PTLMMLYNNQ IEIASKDQWQ ILFNCILAIG AWCIEGESTD
310 320 330 340 350
IDVFYYQNAK SHLTSKVFES GSIILVTALH LLSRYTQWRQ KTNTSYNFHS
360 370 380 390 400
FSIRMAISLG LNRDLPSSFS DSSILEQRRR IWWSVYSWEI QLSLLYGRSI
410 420 430 440 450
QLSQNTISFP SSVDDVQRTT TGPTIYHGII ETARLLQVFT KIYELDKTVT
460 470 480 490 500
AEKSPICAKK CLMICNEIEE VSRQAPKFLQ MDISTTALTN LLKEHPWLSF
510 520 530 540 550
TRFELKWKQL SLIIYVLRDF FTNFTQKKSQ LEQDQNDHQS YEVKRCSIML
560 570 580 590 600
SDAAQRTVMS VSSYMDNHNV TPYFAWNCSY YLFNAVLVPI KTLLSNSKSN
610 620 630 640 650
AENNETAQLL QQINTVLMLL KKLATFKIQT CEKYIQVLEE VCAPFLLSQC
660 670 680 690 700
AIPLPHISYN NSNGSAIKNI VGSATIAQYP TLPEENVNNI SVKYVSPGSV
710 720 730 740 750
GPSPVPLKSG ASFSDLVKLL SNRPPSRNSP VTIPRSTPSH RSVTPFLGQQ
760 770 780 790 800
QQLQSLVPLT PSALFGGANF NQSGNIADSS LSFTFTNSSN GPNLITTQTN
810 820 830 840 850
SQALSQPIAS SNVHDNFMNN EITASKIDDG NNSKPLSPGW TDQTAYNAFG
860 870 880
ITTGMFNTTT MDDVYNYLFD DEDTPPNPKK E
Length:881
Mass (Da):99,403
Last modified:November 1, 1995 - v2
Checksum:i29D7FF68B0B05880
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
K01486 Genomic DNA. Translation: AAA34626.1.
Z67751 Genomic DNA. Translation: CAA91596.1.
Z73604 Genomic DNA. Translation: CAA97969.1.
BK006949 Genomic DNA. Translation: DAA11189.1.
PIRiA05022. RGBYG4.
RefSeqiNP_015076.1. NM_001184062.1.

Genome annotation databases

EnsemblFungiiYPL248C; YPL248C; YPL248C.
GeneIDi855828.
KEGGisce:YPL248C.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
K01486 Genomic DNA. Translation: AAA34626.1.
Z67751 Genomic DNA. Translation: CAA91596.1.
Z73604 Genomic DNA. Translation: CAA97969.1.
BK006949 Genomic DNA. Translation: DAA11189.1.
PIRiA05022. RGBYG4.
RefSeqiNP_015076.1. NM_001184062.1.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
1AW6NMR-A1-43[»]
1D66X-ray2.70A/B1-65[»]
1HBWNMR-A/B50-106[»]
3BTSX-ray2.70E/F854-874[»]
3COQX-ray2.40A/B8-96[»]
ProteinModelPortaliP04386.
SMRiP04386.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi35915. 88 interactors.
DIPiDIP-593N.
IntActiP04386. 5 interactors.
MINTiMINT-1510289.

PTM databases

iPTMnetiP04386.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblFungiiYPL248C; YPL248C; YPL248C.
GeneIDi855828.
KEGGisce:YPL248C.

Organism-specific databases

EuPathDBiFungiDB:YPL248C.
SGDiS000006169. GAL4.

Phylogenomic databases

HOGENOMiHOG000112689.
InParanoidiP04386.
KOiK09241.
OMAiRSFIWES.
OrthoDBiEOG092C4I3T.

Enzyme and pathway databases

BioCyciYEAST:G3O-34134-MONOMER.

Miscellaneous databases

EvolutionaryTraceiP04386.
PROiP04386.

Family and domain databases

CDDicd14654. ZIP_Gal4. 1 hit.
Gene3Di4.10.240.10. 1 hit.
InterProiIPR005600. Gal4_dimer_dom.
IPR007219. Transcription_factor_dom_fun.
IPR001138. Zn2-C6_fun-type_DNA-bd.
[Graphical view]
PfamiPF04082. Fungal_trans. 1 hit.
PF03902. Gal4_dimer. 1 hit.
PF00172. Zn_clus. 1 hit.
[Graphical view]
SMARTiSM00906. Fungal_trans. 1 hit.
SM00066. GAL4. 1 hit.
[Graphical view]
SUPFAMiSSF57701. SSF57701. 1 hit.
PROSITEiPS00463. ZN2_CY6_FUNGAL_1. 1 hit.
PS50048. ZN2_CY6_FUNGAL_2. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiGAL4_YEAST
AccessioniPrimary (citable) accession number: P04386
Secondary accession number(s): D6W3C3
Entry historyi
Integrated into UniProtKB/Swiss-Prot: March 20, 1987
Last sequence update: November 1, 1995
Last modified: November 2, 2016
This is version 171 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programFungal Protein Annotation Program

Miscellaneousi

Miscellaneous

Present with 166 molecules/cell in log phase SD medium.1 Publication

Keywords - Technical termi

3D-structure, Complete proteome, Reference proteome

Documents

  1. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  2. SIMILARITY comments
    Index of protein domains and families
  3. Yeast
    Yeast (Saccharomyces cerevisiae): entries, gene names and cross-references to SGD
  4. Yeast chromosome XVI
    Yeast (Saccharomyces cerevisiae) chromosome XVI: entries and gene names

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.