Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Regulatory protein SoxS

Gene

soxS

Organism
Escherichia coli (strain K12)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at protein leveli

Functioni

Transcriptional activator of the superoxide response regulon of E.coli that includes at least 10 genes such as sodA, nfo, zwf and micF. Binds the DNA sequence 5'-GCACN7CAA-3'. It also facilitates the subsequent binding of RNA polymerase to the micF and the nfo promoters.

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
DNA bindingi24 – 43H-T-H motifPROSITE-ProRule annotationAdd BLAST20

GO - Molecular functioni

GO - Biological processi

  • regulation of transcription, DNA-templated Source: EcoCyc
  • transcription, DNA-templated Source: UniProtKB-KW

Keywordsi

Molecular functionActivator, DNA-binding
Biological processTranscription, Transcription regulation

Enzyme and pathway databases

BioCyciEcoCyc:PD00406.

Names & Taxonomyi

Protein namesi
Recommended name:
Regulatory protein SoxS
Gene namesi
Name:soxS
Ordered Locus Names:b4062, JW4023
OrganismiEscherichia coli (strain K12)
Taxonomic identifieri83333 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaGammaproteobacteriaEnterobacteralesEnterobacteriaceaeEscherichia
Proteomesi
  • UP000000318 Componenti: Chromosome
  • UP000000625 Componenti: Chromosome

Organism-specific databases

EcoGeneiEG10958. soxS.

Subcellular locationi

GO - Cellular componenti

Keywords - Cellular componenti

Cytoplasm

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Initiator methionineiRemoved1 Publication
ChainiPRO_00001945832 – 107Regulatory protein SoxSAdd BLAST106

Proteomic databases

PaxDbiP0A9E2.
PRIDEiP0A9E2.

Expressioni

Inductioni

By paraquat.

Interactioni

GO - Molecular functioni

  • bacterial-type RNA polymerase holo enzyme binding Source: EcoCyc

Protein-protein interaction databases

BioGridi4262018. 126 interactors.
DIPiDIP-10904N.
IntActiP0A9E2. 10 interactors.
STRINGi511145.b4062.

Structurei

3D structure databases

ProteinModelPortaliP0A9E2.
SMRiP0A9E2.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Phylogenomic databases

eggNOGiENOG4108W3Q. Bacteria.
ENOG4111IXA. LUCA.
HOGENOMiHOG000120762.
InParanoidiP0A9E2.
KOiK13631.
OMAiLTHWIDQ.
PhylomeDBiP0A9E2.

Family and domain databases

Gene3Di1.10.10.60. 2 hits.
InterProiView protein in InterPro
IPR009057. Homeobox-like.
IPR018060. HTH_AraC.
IPR018062. HTH_AraC-typ_CS.
IPR020449. Tscrpt_reg_HTH_AraC-type.
PfamiView protein in Pfam
PF12833. HTH_18. 1 hit.
PRINTSiPR00032. HTHARAC.
SMARTiView protein in SMART
SM00342. HTH_ARAC. 1 hit.
SUPFAMiSSF46689. SSF46689. 2 hits.
PROSITEiView protein in PROSITE
PS00041. HTH_ARAC_FAMILY_1. 1 hit.
PS01124. HTH_ARAC_FAMILY_2. 1 hit.

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P0A9E2-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MSHQKIIQDL IAWIDEHIDQ PLNIDVVAKK SGYSKWYLQR MFRTVTHQTL
60 70 80 90 100
GDYIRQRRLL LAAVELRTTE RPIFDIAMDL GYVSQQTFSR VFRRQFDRTP

SDYRHRL
Length:107
Mass (Da):12,911
Last modified:January 23, 2007 - v2
Checksum:i7341326FBAB819D6
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M60111 Genomic DNA. Translation: AAA24640.1.
X59593 Genomic DNA. Translation: CAA42161.1.
U00006 Genomic DNA. Translation: AAC43156.1.
U00096 Genomic DNA. Translation: AAC77032.1.
AP009048 Genomic DNA. Translation: BAE78064.1.
PIRiJS0578.
RefSeqiNP_418486.1. NC_000913.3.
WP_000019358.1. NZ_LN832404.1.

Genome annotation databases

EnsemblBacteriaiAAC77032; AAC77032; b4062.
BAE78064; BAE78064; BAE78064.
GeneIDi948567.
KEGGiecj:JW4023.
eco:b4062.
PATRICi32123669. VBIEscCol129921_4183.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
M60111 Genomic DNA. Translation: AAA24640.1.
X59593 Genomic DNA. Translation: CAA42161.1.
U00006 Genomic DNA. Translation: AAC43156.1.
U00096 Genomic DNA. Translation: AAC77032.1.
AP009048 Genomic DNA. Translation: BAE78064.1.
PIRiJS0578.
RefSeqiNP_418486.1. NC_000913.3.
WP_000019358.1. NZ_LN832404.1.

3D structure databases

ProteinModelPortaliP0A9E2.
SMRiP0A9E2.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi4262018. 126 interactors.
DIPiDIP-10904N.
IntActiP0A9E2. 10 interactors.
STRINGi511145.b4062.

Proteomic databases

PaxDbiP0A9E2.
PRIDEiP0A9E2.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiAAC77032; AAC77032; b4062.
BAE78064; BAE78064; BAE78064.
GeneIDi948567.
KEGGiecj:JW4023.
eco:b4062.
PATRICi32123669. VBIEscCol129921_4183.

Organism-specific databases

EchoBASEiEB0951.
EcoGeneiEG10958. soxS.

Phylogenomic databases

eggNOGiENOG4108W3Q. Bacteria.
ENOG4111IXA. LUCA.
HOGENOMiHOG000120762.
InParanoidiP0A9E2.
KOiK13631.
OMAiLTHWIDQ.
PhylomeDBiP0A9E2.

Enzyme and pathway databases

BioCyciEcoCyc:PD00406.

Miscellaneous databases

PROiP0A9E2.

Family and domain databases

Gene3Di1.10.10.60. 2 hits.
InterProiView protein in InterPro
IPR009057. Homeobox-like.
IPR018060. HTH_AraC.
IPR018062. HTH_AraC-typ_CS.
IPR020449. Tscrpt_reg_HTH_AraC-type.
PfamiView protein in Pfam
PF12833. HTH_18. 1 hit.
PRINTSiPR00032. HTHARAC.
SMARTiView protein in SMART
SM00342. HTH_ARAC. 1 hit.
SUPFAMiSSF46689. SSF46689. 2 hits.
PROSITEiView protein in PROSITE
PS00041. HTH_ARAC_FAMILY_1. 1 hit.
PS01124. HTH_ARAC_FAMILY_2. 1 hit.
ProtoNetiSearch...

Entry informationi

Entry nameiSOXS_ECOLI
AccessioniPrimary (citable) accession number: P0A9E2
Secondary accession number(s): P22539, Q2M6P2
Entry historyiIntegrated into UniProtKB/Swiss-Prot: July 19, 2005
Last sequence update: January 23, 2007
Last modified: February 15, 2017
This is version 94 of the entry and version 2 of the sequence. See complete history.
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.