Protein

MADS-box protein SOC1

Gene

SOC1

Organism
Arabidopsis thaliana (Mouse-ear cress)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Transcription activator active in flowering time control. May integrate signals from the photoperiod, vernalization and autonomous floral induction pathways. Can modulate the expression of class B and C homeotic genes. When associated with AGL24, mediates the effect of gibberellins on flowering under short-day conditions, and regulates the expression of LEAFY (LFY), which links floral induction and floral development. (4 publications)

GO - Molecular function

GO - Biological process

  • cell differentiation Source: UniProtKB-KW
  • flower development Source: TAIR
  • maintenance of inflorescence meristem identity Source: TAIR
  • MAPK cascade Source: InterPro
  • positive regulation of flower development Source: TAIR
  • positive regulation of transcription, DNA-templated Source: UniProtKB
  • positive regulation of transcription from RNA polymerase II promoter Source: InterPro
  • protein import into nucleus, translocation Source: UniProtKB
  • response to cold Source: TAIR
  • response to gibberellin Source: UniProtKB
  • transcription, DNA-templated Source: UniProtKB-KW

Keywords - Molecular function

Activator, Developmental protein

Keywords - Biological process

Differentiation, Flowering, Transcription, Transcription regulation

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name:
MADS-box protein SOC1
Alternative name(s):
Agamous-like MADS-box protein AGL20
Protein SUPPRESSOR OF CONSTANS OVEREXPRESSION 1
Gene names
Name: SOC1
Synonyms: AGL20
Ordered Locus Names: At2g45660
ORF Names: F17K2.19
Organism: Arabidopsis thaliana (Mouse-ear cress)
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > eudicotyledons > Gunneridae > Pentapetalae > rosids > malvids > Brassicales > Brassicaceae > Camelineae > Arabidopsis
Proteomes
  • UP000006548 Component: Chromosome 2

Organism-specific databases

TAIR: AT2G45660.

Subcellular location

  • Nucleus (PROSITE-ProRule annotation, 1 publication)
  • Cytoplasm (1 publication)

Note: Translocates from the cytoplasm to the nucleus in the presence of AGL24.

GO - Cellular component

  • cytoplasm Source: UniProtKB
  • nucleus Source: UniProtKB

Keywords - Cellular component

Cytoplasm, Nucleus

Pathology & Biotech

Disruption phenotype

Plants are late-flowering. (1 publication)

Mutagenesis

Feature key | Position(s) | Description | Length
Mutagenesis | 24 | R → K in sso36; suppression of early flowering time mediated by SOC1 overexpression. (1 publication) | 1
Mutagenesis | 34 | E → K in sso11; partial suppression of early flowering time mediated by SOC1 overexpression. (1 publication) | 1
Mutagenesis | 113 | G → E in sso4; partial suppression of early flowering time mediated by SOC1 overexpression. (1 publication) | 1
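
The three substitutions listed above can be checked against the canonical sequence given later in this entry. The following is a minimal sketch, not part of the UniProt record: the helper name `apply_substitution` and the `MUTATIONS` list are illustrative assumptions, and positions are treated as 1-based, as in the table.

```python
# Canonical SOC1 sequence (O64645), copied from the Sequence section of this entry.
SOC1 = (
    "MVRGKTQMKRIENATSRQVTFSKRRNGLLKKAFELSVLCDAEVSLIIFSP"
    "KGKLYEFASSNMQDTIDRYLRHTKDRVSTKPVSEENMQHLKYEAANMMKK"
    "IEQLEASKRKLLGEGIGTCSIEELQQIEQQLEKSVKCIRARKTQVFKEQI"
    "EQLKQKEKALAAENEKLSEKWGSHESEVWSNKNQESTGRGDEESSPSSEV"
    "ETQLFIGLPCSSRK"
)

# (allele, 1-based position, wild-type residue, mutant residue) from the table above.
MUTATIONS = [
    ("sso36", 24, "R", "K"),
    ("sso11", 34, "E", "K"),
    ("sso4", 113, "G", "E"),
]


def apply_substitution(seq, position, wild_type, mutant):
    """Return seq with one residue replaced, after checking the wild-type residue."""
    index = position - 1  # convert the 1-based position to a 0-based index
    if seq[index] != wild_type:
        raise ValueError(
            f"expected {wild_type} at position {position}, found {seq[index]}"
        )
    return seq[:index] + mutant + seq[index + 1:]


# Build one variant sequence per allele (hypothetical variable name).
variants = {
    allele: apply_substitution(SOC1, pos, wt, mut)
    for allele, pos, wt, mut in MUTATIONS
}
```

The wild-type check guards against off-by-one errors when positions are copied from a table like this one.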

PTM / Processing

Molecule processing

Feature key | Position(s) | Description | Length
Chain (PRO_0000199486) | 1 – 214 | MADS-box protein SOC1 | 214

Proteomic databases

PaxDb: O64645.
PRIDE: O64645.

Expression

Tissue specificity

Widely expressed. Not found in the apical meristem of short-day grown plants in vegetative stage.

Developmental stage

Rapidly up-regulated in apical meristems during the transition to flowering. Transiently expressed in inflorescence meristem. Re-appears in stage 3 flowers, in the central dome that later will develop into stamens and carpels. (1 publication)

Induction

Up-regulated by gibberellins, vernalization and under long-day conditions. Gradual increase during vegetative growth. Induced by AGL24 at the shoot apex at the floral transitional stage. Repressed by SVP during the early stages of flower development. Inhibited by AP1 in emerging floral meristems (PubMed:17428825, PubMed:18339670, PubMed:19656343). Repressed by SHL to prevent flowering (PubMed:25281686). (4 publications)

Gene expression databases

Genevisible: O64645. AT.

Interaction

Subunit structure

Forms a heterodimer with AGL24 through the MADS-box domain. Interacts with AGL15, AGL16 and AGL19. (2 publications)

Binary interactions

With | Entry | #Exp. | IntAct | Notes
AGL8 | Q38876 | 4 | EBI-592041,EBI-621912 |
AP1 | P35631 | 5 | EBI-592041,EBI-592003 |
CAL | Q39081 | 4 | EBI-592041,EBI-592136 |
PIN1 | Q9SL42 | 3 | EBI-592041,EBI-2618990 |

GO - Molecular function

  • transcription factor binding Source: UniProtKB

Protein-protein interaction databases

BioGrid: 4510. 34 interactors.
DIP: DIP-33799N.
IntAct: O64645. 35 interactors.
STRING: 3702.AT2G45660.1.

Structure

3D structure databases

ProteinModelPortal: O64645.
SMR: O64645.

Family & Domains

Domains and Repeats

Feature key | Position(s) | Description | Length
Domain | 3 – 57 | MADS-box (PROSITE-ProRule annotation) | 55
Domain | 87 – 177 | K-box (PROSITE-ProRule annotation) | 91
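
The domain coordinates above are 1-based and inclusive, so a domain spanning positions a to b has length b - a + 1. As a minimal sketch (the `DOMAINS` mapping and the `extract` helper are illustrative names, not part of the entry), the two domains can be sliced out of the canonical sequence and their lengths compared with the Length column:

```python
# Canonical SOC1 sequence (O64645), copied from the Sequence section of this entry.
SOC1 = (
    "MVRGKTQMKRIENATSRQVTFSKRRNGLLKKAFELSVLCDAEVSLIIFSP"
    "KGKLYEFASSNMQDTIDRYLRHTKDRVSTKPVSEENMQHLKYEAANMMKK"
    "IEQLEASKRKLLGEGIGTCSIEELQQIEQQLEKSVKCIRARKTQVFKEQI"
    "EQLKQKEKALAAENEKLSEKWGSHESEVWSNKNQESTGRGDEESSPSSEV"
    "ETQLFIGLPCSSRK"
)

# Domain name -> (start, end), 1-based inclusive, as listed in the table above.
DOMAINS = {
    "MADS-box": (3, 57),
    "K-box": (87, 177),
}


def extract(seq, start, end):
    """Return the residues from start to end (1-based, inclusive)."""
    return seq[start - 1:end]


domain_seqs = {name: extract(SOC1, s, e) for name, (s, e) in DOMAINS.items()}

for name, sub in domain_seqs.items():
    print(name, len(sub), sub)
```

The only subtlety is the index conversion: Python slices are 0-based and end-exclusive, hence `start - 1` with an unchanged `end`.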

Sequence similarities

Contains 1 K-box domain. (PROSITE-ProRule annotation)
Contains 1 MADS-box domain. (PROSITE-ProRule annotation)

Phylogenomic databases

eggNOG: KOG0014. Eukaryota.
  COG5068. LUCA.
HOGENOM: HOG000155301.
InParanoid: O64645.
OMA: WSNKNQE.
OrthoDB: EOG0936106C.
PhylomeDB: O64645.

Family and domain databases

CDD: cd00265. MADS_MEF2_like. 1 hit.
InterPro: IPR033896. MADS_MEF2-like.
  IPR002487. TF_Kbox.
  IPR002100. TF_MADSbox.
Pfam: PF01486. K-box. 1 hit.
  PF00319. SRF-TF. 1 hit.
PRINTS: PR00404. MADSDOMAIN.
SMART: SM00432. MADS. 1 hit.
SUPFAM: SSF55455. SSF55455. 1 hit.
PROSITE: PS51297. K_BOX. 1 hit.
  PS00350. MADS_BOX_1. 1 hit.
  PS50066. MADS_BOX_2. 1 hit.

Sequence

Sequence status: Complete.

O64645-1 [UniParc]

        10         20         30         40         50
MVRGKTQMKR IENATSRQVT FSKRRNGLLK KAFELSVLCD AEVSLIIFSP
        60         70         80         90        100
KGKLYEFASS NMQDTIDRYL RHTKDRVSTK PVSEENMQHL KYEAANMMKK
       110        120        130        140        150
IEQLEASKRK LLGEGIGTCS IEELQQIEQQ LEKSVKCIRA RKTQVFKEQI
       160        170        180        190        200
EQLKQKEKAL AAENEKLSEK WGSHESEVWS NKNQESTGRG DEESSPSSEV
       210
ETQLFIGLPC SSRK
Length: 214
Mass (Da): 24,533
Last modified: August 1, 1998 - v1
Checksum: B4D39151DE541F8D
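
The sequence above is displayed in groups of ten residues with position rulers, which must be stripped before the sequence can be used programmatically. The sketch below is one possible way to do this; the function name `parse_sequence_block` is an assumption for illustration. It recovers the plain sequence and confirms the stated length of 214.

```python
from collections import Counter

# The displayed block, copied from this entry (rulers plus grouped residues).
RAW_BLOCK = """
        10         20         30         40         50
MVRGKTQMKR IENATSRQVT FSKRRNGLLK KAFELSVLCD AEVSLIIFSP
        60         70         80         90        100
KGKLYEFASS NMQDTIDRYL RHTKDRVSTK PVSEENMQHL KYEAANMMKK
       110        120        130        140        150
IEQLEASKRK LLGEGIGTCS IEELQQIEQQ LEKSVKCIRA RKTQVFKEQI
       160        170        180        190        200
EQLKQKEKAL AAENEKLSEK WGSHESEVWS NKNQESTGRG DEESSPSSEV
       210
ETQLFIGLPC SSRK
"""


def parse_sequence_block(text):
    """Join the residue groups, skipping the numeric position rulers."""
    groups = []
    for line in text.splitlines():
        # Ruler tokens are digits only; residue groups are letters only.
        groups.extend(token for token in line.split() if token.isalpha())
    return "".join(groups)


sequence = parse_sequence_block(RAW_BLOCK)
composition = Counter(sequence)  # residue -> number of occurrences

print(len(sequence))
print(composition.most_common(3))
```

Filtering on token type rather than on line position keeps the parser indifferent to how the rulers are spaced.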

Sequence databases

EMBL / GenBank / DDBJ:
  AC003680 Genomic DNA. Translation: AAC06175.1.
  CP002685 Genomic DNA. Translation: AEC10583.1.
  AY007726 mRNA. Translation: AAG16297.1.
  AF385731 mRNA. Translation: AAK60321.1.
  AY093967 mRNA. Translation: AAM16228.1.
PIR: T00879.
RefSeq: NP_182090.1. NM_130128.4.
UniGene: At.25546.

Genome annotation databases

EnsemblPlants: AT2G45660.1; AT2G45660.1; AT2G45660.
GeneID: 819174.
Gramene: AT2G45660.1; AT2G45660.1; AT2G45660.
KEGG: ath:AT2G45660.

Cross-references

Miscellaneous databases

PRO: O64645.

Entry information

Entry name: SOC1_ARATH
Accession: Primary (citable) accession number: O64645
Entry history
Integrated into UniProtKB/Swiss-Prot: December 5, 2001
Last sequence update: August 1, 1998
Last modified: November 30, 2016
This is version 133 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Plant Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Arabidopsis thaliana
    Arabidopsis thaliana: entries and gene names
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
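
The UniRef90 and UniRef50 criteria described above reduce to two thresholds: a minimum sequence identity and a minimum overlap with the seed sequence. The sketch below only illustrates that membership test. It is not UniRef's actual clustering procedure, and the names `Alignment`, `THRESHOLDS` and `joins_cluster` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Summary of a pairwise alignment against a cluster's seed sequence."""
    identity: float  # fraction of identical residues, 0.0 to 1.0
    overlap: float   # fraction of the seed sequence covered, 0.0 to 1.0


# Cluster level -> (minimum identity, minimum overlap), from the text above.
THRESHOLDS = {
    "UniRef90": (0.90, 0.80),
    "UniRef50": (0.50, 0.80),
}


def joins_cluster(alignment, level):
    """Return True if the alignment meets both thresholds for the given level."""
    min_identity, min_overlap = THRESHOLDS[level]
    return alignment.identity >= min_identity and alignment.overlap >= min_overlap


# Hypothetical example: 93% identity over 85% of the seed sequence.
example = Alignment(identity=0.93, overlap=0.85)
print(joins_cluster(example, "UniRef90"))
print(joins_cluster(example, "UniRef50"))
```

UniRef100 is left out of the sketch because its criterion (identical sequences and sub-fragments of 11 or more residues) is an exact-match rule rather than a threshold.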