UniProtKB - Q61079 (SIM2_MOUSE)
Protein
Single-minded homolog 2
Gene
Sim2
Organism
Mus musculus (Mouse)
Function
Transcription factor that may be a master gene of CNS development in cooperation with Arnt. It may have pleiotropic effects in the tissues in which it is expressed during development.
GO - Molecular function
- DNA binding Source: MGI
- DNA-binding transcription factor activity, RNA polymerase II-specific Source: GO_Central
- protein heterodimerization activity Source: UniProtKB
- RNA polymerase II transcription regulatory region sequence-specific DNA binding Source: GO_Central
GO - Biological process
- cell differentiation Source: UniProtKB-KW
- embryonic pattern specification Source: MGI
- lung development Source: MGI
- negative regulation of transcription, DNA-templated Source: MGI
- negative regulation of transcription by RNA polymerase II Source: MGI
- nervous system development Source: UniProtKB-KW
- regulation of transcription by RNA polymerase II Source: GO_Central
Keywords
Molecular function | Developmental protein, DNA-binding |
Biological process | Differentiation, Neurogenesis, Transcription, Transcription regulation |
Names & Taxonomy
Protein names | Recommended name: Single-minded homolog 2; Alternative name(s): SIM transcription factor; Short name: mSIM |
Gene names | Name: Sim2 |
Organism | Mus musculus (Mouse) |
Taxonomic identifier | 10090 [NCBI] |
Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Myomorpha › Muroidea › Muridae › Murinae › Mus › Mus |
Proteomes |
Organism-specific databases
MGI | MGI:98307, Sim2 |
VEuPathDB | HostDB:ENSMUSG00000062713 |
Subcellular location
Nucleus
- Nucleus PROSITE-ProRule annotation
Nucleus
- nuclear body Source: MGI
- nucleoplasm Source: MGI
- nucleus Source: MGI
Keywords - Cellular component
Nucleus
PTM / Processing
Molecule processing
Feature key | Position(s) | Description | Length |
---|---|---|---|
Chain PRO_0000127442 | 1 – 657 | Single-minded homolog 2 | 657 |
Proteomic databases
PaxDb | Q61079 |
PRIDE | Q61079 |
ProteomicsDB | 261236 |
PTM databases
iPTMnet | Q61079 |
PhosphoSitePlus | Q61079 |
Expression
Tissue specificity
Transcripts were detected at high levels in kidney, followed by skeletal muscle and lung. Low levels were found in testis, brain and heart. In early fetal development, it is found in the CNS, developing kidney, tongue epithelium and cartilage primordia.
Gene expression databases
Bgee | ENSMUSG00000062713, Expressed in esophagus and 126 other tissues |
ExpressionAtlas | Q61079, baseline and differential |
Genevisible | Q61079, MM |
Interaction
Subunit structure
Efficient DNA binding requires dimerization with another bHLH protein. Heterodimer of SIM2 and ARNT.
1 Publication
GO - Molecular function
- protein heterodimerization activity Source: UniProtKB
Protein-protein interaction databases
BioGRID | 203255, 1 interactor |
CORUM | Q61079 |
IntAct | Q61079, 2 interactors |
STRING | 10090.ENSMUSP00000072043 |
Miscellaneous databases
RNAct | Q61079, protein |
Family & Domains
Domains and Repeats
Feature key | Position(s) | Description | Length |
---|---|---|---|
Domain | 1 – 53 | bHLH (PROSITE-ProRule annotation) | 53 |
Domain | 77 – 147 | PAS 1 (PROSITE-ProRule annotation) | 71 |
Domain | 218 – 288 | PAC | 71 |
Domain | 218 – 288 | PAS 2 (PROSITE-ProRule annotation) | 71 |
Domain | 336 – 657 | Single-minded C-terminal (PROSITE-ProRule annotation) | 322 |
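The Length column follows from the positions, which are 1-based and inclusive, so length = end - start + 1. A minimal sketch of that arithmetic, assuming a hypothetical helper `feature_length` (not part of any UniProt tooling) and the coordinates copied from the domain table above:

```python
# Domain coordinates copied from the table above (1-based, inclusive).
DOMAINS = [
    ("bHLH", 1, 53),
    ("PAS 1", 77, 147),
    ("PAC", 218, 288),
    ("PAS 2", 218, 288),
    ("Single-minded C-terminal", 336, 657),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based start and end positions."""
    if start < 1 or end < start:
        raise ValueError(f"invalid feature range: {start}-{end}")
    return end - start + 1


# Map each domain name to its computed length.
lengths = {name: feature_length(start, end) for name, start, end in DOMAINS}
```

The computed values can be compared against the Length column as a quick consistency check on a parsed copy of the table.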
Region
Feature key | Position(s) | Description | Length |
---|---|---|---|
Region | 354 – 387 | Disordered (Sequence analysis) | 34 |
Region | 612 – 641 | Disordered (Sequence analysis) | 30 |
Motif
Feature key | Position(s) | Description | Length |
---|---|---|---|
Motif | 367 – 386 | Nuclear localization signal (By similarity) | 20 |
Compositional bias
Feature key | Position(s) | Description | Length |
---|---|---|---|
Compositional bias | 354 – 368 | Polar residues (Sequence analysis) | 15 |
Keywords - Domain
Repeat
Phylogenomic databases
eggNOG | KOG3559, Eukaryota |
GeneTree | ENSGT00940000159985 |
HOGENOM | CLU_010044_4_1_1 |
InParanoid | Q61079 |
OMA | SECQWHY |
OrthoDB | 231698at2759 |
PhylomeDB | Q61079 |
TreeFam | TF317772 |
Family and domain databases
CDD | cd00130, PAS, 2 hits |
Gene3D | 4.10.280.10, 1 hit |
InterPro | IPR011598, bHLH_dom; IPR036638, HLH_DNA-bd_sf; IPR001610, PAC; IPR000014, PAS; IPR035965, PAS-like_dom_sf; IPR013767, PAS_fold; IPR013655, PAS_fold_3; IPR010578, SIM_C |
Pfam | PF00989, PAS, 1 hit; PF08447, PAS_3, 1 hit; PF06621, SIM_C, 1 hit |
SMART | SM00353, HLH, 1 hit; SM00086, PAC, 1 hit; SM00091, PAS, 2 hits |
SUPFAM | SSF47459, SSF47459, 1 hit; SSF55785, SSF55785, 2 hits |
PROSITE | PS50888, BHLH, 1 hit; PS50112, PAS, 2 hits; PS51302, SIM_C, 1 hit |
Sequence
Sequence status: Complete.
This entry has 1 described isoform and 1 potential isoform that is computationally mapped.
Q61079-1
10 20 30 40 50
MKEKSKNAAK TRREKENGEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL
60 70 80 90 100
KMRAVFPEGL GDAWGQPSRT GPLDSVAKEL GSHLLQTLDG FVFVVASDGK
110 120 130 140 150
IMYISETASV HLGLSQVELT GNSIYEYIHP SDHDEMTAVL TAHPPLHHHL
160 170 180 190 200
LQEYEIERSF FLRMKCVLAK RNAGLTCSGY KVIHCSGYLK IRQYMLDMSL
210 220 230 240 250
YDSCYQIVGL VAVGQSLPPS AITEIKLHSN MFMFRASLDL KLIFLDSRVT
260 270 280 290 300
ELTGYEPQDL IEKTLYHHVH GCDTFHLRYA HHLLLVKGQV TTKYYRLLSK
310 320 330 340 350
LGGWVWVQSY ATVVHNSRSS RPHCIVSVNY VLTDVEYKEL QLSLDQVSTS
360 370 380 390 400
KSQESWRTTL STSQETRKSA KPKNTKMKTK LRTNPYPPQQ YSSFQMDKLE
410 420 430 440 450
CSQVGNWRTS PPTNAVAPPE QQLHSEASDL LYGPPYSLPF SYHYGHFPLD
460 470 480 490 500
SHVFSSKKPG LPAKFGQPQG SPCEVARFFL STLPASSECQ WHCANSLVPS
510 520 530 540 550
SSSPAKNLSE PSPVNAARHG LVPNYEAPSA AARRFCEDPA PPSFPSCGHY
560 570 580 590 600
REEPALGPAK APRQASRDAA RLALARAPPE CCAPPAPEPQ APAQLPFVLL
610 620 630 640 650
NYHRVLARRG PLGSAAPGAP EAAGSLRPRH PGPVAASAPG APRPHYLGAS
VIITNGR
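The listing above interleaves ruler lines (position numbers) with residues in space-separated groups of ten. A minimal sketch of turning that layout into a plain sequence string, assuming a hypothetical helper `parse_grouped_sequence` and using only the first two rows of the listing as sample input:

```python
def parse_grouped_sequence(text: str) -> str:
    """Join residue groups, skipping ruler lines that hold only position numbers."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines such as "10 20 30 40 50" carry no residues.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


# First two rows of the listing above (residues 1-100), used as sample input.
SAMPLE = """\
10 20 30 40 50
MKEKSKNAAK TRREKENGEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL
60 70 80 90 100
KMRAVFPEGL GDAWGQPSRT GPLDSVAKEL GSHLLQTLDG FVFVVASDGK
"""

sequence = parse_grouped_sequence(SAMPLE)

# Positions in the entry are 1-based and inclusive, so residues 1-53
# (the bHLH domain in the Domains table) map to the slice [0:53].
bhlh = sequence[0:53]
```

Applied to the full listing, the same function should yield a string whose length matches the chain length of 657 given under Molecule processing.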
Computationally mapped potential isoform sequences
There is 1 potential isoform mapped to this entry.
A0A338P771 | A0A338P771_MOUSE | Single-minded homolog 2 | Sim2 | 406 |
Sequence caution
The sequence AAA91202 differs from that shown. Reason: Frameshift. (Curated)
Experimental Info
Feature key | Position(s) | Description | Length |
---|---|---|---|
Sequence conflict | 263 | K → R in BAA09700 (PubMed:8561800). Curated | 1 |
Sequence conflict | 263 | K → R in AAB84099 (PubMed:8812055). Curated | 1 |
Sequence conflict | 336 | E → G in AAA91202 (PubMed:8812055). Curated | 1 |
Sequence conflict | 501 | S → T in AAA91202 (PubMed:8812055). Curated | 1 |
Sequence conflict | 512 | S → P in AAB84099 (PubMed:8812055). Curated | 1 |
Sequence conflict | 541 | P → R in AAA91202 (PubMed:8812055). Curated | 1 |
Sequence conflict | 561 – 585 | APRQA…CCAPP → VLARRPGRARCMWES in AAA91202 (PubMed:8812055). Curated | 25 |
Sequence conflict | 590 – 591 | QA → HG in AAA91202 (PubMed:8812055). Curated | 2 |
Sequence conflict | 638 | A → R in BAA09700 (PubMed:8561800). Curated | 1 |
Sequence conflict | 638 | A → R in AAB84099 (PubMed:8812055). Curated | 1 |
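Each single-residue conflict in the table names a 1-based position, the residue in the displayed sequence, and the residue reported by the other submission. A minimal sketch of applying one such conflict, assuming a hypothetical helper `apply_substitution` and using the 251-300 row of the sequence listing as the fragment:

```python
def apply_substitution(fragment: str, fragment_start: int,
                       position: int, ref: str, alt: str) -> str:
    """Return the fragment with the residue at a 1-based position replaced.

    fragment_start is the 1-based position of the fragment's first residue.
    The reference residue is checked first so a coordinate mix-up fails loudly.
    """
    index = position - fragment_start
    if not 0 <= index < len(fragment):
        raise ValueError(f"position {position} lies outside the fragment")
    if fragment[index] != ref:
        raise ValueError(
            f"expected {ref} at position {position}, found {fragment[index]}"
        )
    return fragment[:index] + alt + fragment[index + 1:]


# Residues 251-300 from the sequence listing, with the group spaces removed.
ROW_251_300 = "ELTGYEPQDLIEKTLYHHVHGCDTFHLRYAHHLLLVKGQVTTKYYRLLSK"

# K -> R at position 263, as reported for BAA09700 and AAB84099.
variant = apply_substitution(ROW_251_300, 251, 263, "K", "R")
```

The multi-residue conflict at 561 – 585 replaces 25 residues with 15 and would need a range-based variant of this helper; it is not covered by the sketch.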
Sequence databases
EMBL / GenBank / DDBJ | U42554 mRNA, Translation: AAB19098.1; D63383 mRNA, Translation: BAA09700.1; U40576 mRNA, Translation: AAA91202.1 (Frameshift); AF023873, AF023864, AF023865, AF023869, AF023871, AF023870, AF023868, AF023867, AF023866, AF023872 Genomic DNA, Translation: AAB84099.1; D64135 mRNA, Translation: BAA11013.1 |
CCDS | CCDS37406.1 |
RefSeq | NP_035507.2, NM_011377.2 |
Genome annotation databases
Ensembl | ENSMUST00000072182; ENSMUSP00000072043; ENSMUSG00000062713 |
GeneID | 20465 |
KEGG | mmu:20465 |
UCSC | uc008aae.1, mouse |
Cross-references
3D structure databases
AlphaFoldDB | Q61079 |
SMR | Q61079 |
ModBase | Search... |
Protocols and materials databases
Antibodypedia | 23108, 176 antibodies from 30 providers |
DNASU | 20465 |
Organism-specific databases
CTD | 6493 |
MGI | MGI:98307, Sim2 |
VEuPathDB | HostDB:ENSMUSG00000062713 |
Miscellaneous databases
BioGRID-ORCS | 20465, 3 hits in 74 CRISPR screens |
ChiTaRS | Sim2, mouse |
PRO | PR:Q61079 |
RNAct | Q61079, protein |
SOURCE | Search... |
Family and domain databases
MobiDB | Search... |
Entry information
Entry name | SIM2_MOUSE |
Accession | Primary (citable) accession number: Q61079; Secondary accession number(s): O35391, Q61046, Q61904 |
Entry history | Integrated into UniProtKB/Swiss-Prot: November 1, 1997 |
Last sequence update | November 1, 1997 |
Last modified | May 25, 2022 |
This is version 178 of the entry and version 1 of the sequence.
Entry status | Reviewed (UniProtKB/Swiss-Prot) |
Annotation program | Chordata Protein Annotation Program |
Miscellaneous
Keywords - Technical term
Reference proteome
Documents
- MGD cross-references
Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot