The new UniProt website is here!
Take me to UniProt BETA
UniProtKB - D4AA36 (D4AA36_RAT)
Protein
Submitted name:
SIM bHLH transcription factor 2
Gene
Sim2
Organism
Rattus norvegicus (Rat)
Status
Functioni
GO - Molecular functioni
- DNA binding Source: RGD
- DNA-binding transcription factor activity, RNA polymerase II-specific Source: GO_Central
- protein heterodimerization activity Source: RGD
- RNA polymerase II transcription regulatory region sequence-specific DNA binding Source: GO_Central
GO - Biological processi
- cell differentiation Source: UniProtKB-KW
- embryonic pattern specification Source: RGD
- lung development Source: RGD
- negative regulation of transcription, DNA-templated Source: RGD
- negative regulation of transcription by RNA polymerase II Source: RGD
- nervous system development Source: UniProtKB-KW
- regulation of transcription by RNA polymerase II Source: GO_Central
Keywordsi
Molecular function | Developmental proteinARBA annotation, DNA-bindingARBA annotation |
Biological process | DifferentiationARBA annotation, NeurogenesisARBA annotation |
Names & Taxonomyi
Protein namesi | Submitted name: SIM bHLH transcription factor 2Submitted name: Single-minded 2 (Predicted)Imported |
Gene namesi | ORF Names:rCG_53059Imported |
Organismi | Rattus norvegicus (Rat)Imported |
Taxonomic identifieri | 10116 [NCBI] |
Taxonomic lineagei | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Myomorpha › Muroidea › Muridae › Murinae › Rattus |
Proteomesi |
|
Organism-specific databases
RGDi | 1308016, Sim2 |
Subcellular locationi
Nucleus
- Nucleus ARBA annotation
Nucleus
- nuclear body Source: Ensembl
- nucleus Source: RGD
Expressioni
Gene expression databases
Bgeei | ENSRNOG00000054203, Expressed in esophagus and 17 other tissues |
Interactioni
GO - Molecular functioni
- protein heterodimerization activity Source: RGD
Protein-protein interaction databases
STRINGi | 10116.ENSRNOP00000002294 |
Structurei
Family & Domainsi
Domains and Repeats
Feature key | Position(s) | DescriptionActions | Graphical view | Length |
---|---|---|---|---|
Domaini | 1 – 53 | BHLHInterPro annotationAdd BLAST | 53 | |
Domaini | 77 – 140 | PASInterPro annotationAdd BLAST | 64 | |
Domaini | 233 – 288 | PASInterPro annotationAdd BLAST | 56 | |
Domaini | 336 – 656 | Single-minded C-terminalInterPro annotationAdd BLAST | 321 |
Region
Feature key | Position(s) | DescriptionActions | Graphical view | Length |
---|---|---|---|---|
Regioni | 354 – 378 | DisorderedSequence analysisAdd BLAST | 25 | |
Regioni | 533 – 567 | DisorderedSequence analysisAdd BLAST | 35 | |
Regioni | 611 – 640 | DisorderedSequence analysisAdd BLAST | 30 |
Compositional bias
Feature key | Position(s) | DescriptionActions | Graphical view | Length |
---|---|---|---|---|
Compositional biasi | 354 – 368 | Polar residuesSequence analysisAdd BLAST | 15 |
Keywords - Domaini
RepeatARBA annotationPhylogenomic databases
eggNOGi | KOG3559, Eukaryota |
GeneTreei | ENSGT00940000159985 |
HOGENOMi | CLU_010044_4_1_1 |
OMAi | SECQWHY |
OrthoDBi | 231698at2759 |
Family and domain databases
CDDi | cd00130, PAS, 2 hits |
Gene3Di | 4.10.280.10, 1 hit |
InterProi | View protein in InterPro IPR011598, bHLH_dom IPR036638, HLH_DNA-bd_sf IPR001610, PAC IPR000014, PAS IPR035965, PAS-like_dom_sf IPR013767, PAS_fold IPR013655, PAS_fold_3 IPR010578, SIM_C |
Pfami | View protein in Pfam PF00989, PAS, 1 hit PF08447, PAS_3, 1 hit PF06621, SIM_C, 1 hit |
SMARTi | View protein in SMART SM00353, HLH, 1 hit SM00086, PAC, 1 hit SM00091, PAS, 2 hits |
SUPFAMi | SSF47459, SSF47459, 1 hit SSF55785, SSF55785, 2 hits |
PROSITEi | View protein in PROSITE PS50888, BHLH, 1 hit PS50112, PAS, 2 hits PS51302, SIM_C, 1 hit |
i Sequence
Sequence statusi: Complete.
D4AA36-1 [UniParc]FASTAAdd to basket
10 20 30 40 50
MKEKSKNAAK TRREKENGEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL
60 70 80 90 100
KMRAVFPEGL GDAWGQPSRT GPLDSVAKEL GSHLLQTLDG FVFVVASDGK
110 120 130 140 150
IMYISETASV HLGLSQVELT GNSIYEYIHP SDHDEMTAVL TAHPPLHHHL
160 170 180 190 200
LQEYEIERSF FLRMKCVLAK RNAGLTCSGY KVIHCSGYLK IRQYMLDMSL
210 220 230 240 250
YDSCYQIVGL VAVGQSLPPS AITEIKLHNN MFMFRASLDL KLIFLDSRVT
260 270 280 290 300
ELTGYEPQDL IEKTLYHHVH GCDTFHLRYA HHLLLVKGQV TTKYYRLLSK
310 320 330 340 350
LGGWVWVQSY ATVVHNSRSS RPHCIVSVNY VLTEVEYKEL QLSLDQVSTS
360 370 380 390 400
KSQESWRTTL STSQETRKSA KPKTTKMRTK LRTNPYPSQQ YSSFQMDKLE
410 420 430 440 450
CSQAGNWRTS PPTSAVAPPE QQLHSEASDL LYGPPYSLPF SYRYGHFPLD
460 470 480 490 500
SHVFSSKKPG LPAKFGQPQG SPCEVARFFL STLPASSECQ WHYANSLMPS
510 520 530 540 550
SPSPAKNLSE SPVNAARHGL VPNYEAPAAT ARRFGEDPAP SSFPSCGHYR
560 570 580 590 600
EEPTLGSAKA ARQASRDTAR LALARATPEC CAPPAPEPQA PAQLPFVLLN
610 620 630 640 650
YHRVLARRGP LGSAAPGAPE AAGSLRPRHP GPVAASAPGA PRPHYLGASV
IITNGR
Sequence databases
Select the link destinations: EMBLi GenBanki DDBJi Links Updated | CH474083 Genomic DNA Translation: EDL76725.1 |
RefSeqi | NP_001100578.1, NM_001107108.3 |
Genome annotation databases
Ensembli | ENSRNOT00000081834.2; ENSRNOP00000069809.1; ENSRNOG00000054203.2 |
GeneIDi | 304071 |
KEGGi | rno:304071 |
UCSCi | RGD:1308016, rat |
Similar proteinsi
Cross-referencesi
Sequence databases
Select the link destinations: EMBLi GenBanki DDBJi Links Updated | CH474083 Genomic DNA Translation: EDL76725.1 |
RefSeqi | NP_001100578.1, NM_001107108.3 |
3D structure databases
ModBasei | Search... |
SWISS-MODEL-Workspacei | Submit a new modelling project... |
Protein-protein interaction databases
STRINGi | 10116.ENSRNOP00000002294 |
Genome annotation databases
Ensembli | ENSRNOT00000081834.2; ENSRNOP00000069809.1; ENSRNOG00000054203.2 |
GeneIDi | 304071 |
KEGGi | rno:304071 |
UCSCi | RGD:1308016, rat |
Organism-specific databases
CTDi | 6493 |
RGDi | 1308016, Sim2 |
Phylogenomic databases
eggNOGi | KOG3559, Eukaryota |
GeneTreei | ENSGT00940000159985 |
HOGENOMi | CLU_010044_4_1_1 |
OMAi | SECQWHY |
OrthoDBi | 231698at2759 |
Gene expression databases
Bgeei | ENSRNOG00000054203, Expressed in esophagus and 17 other tissues |
Family and domain databases
CDDi | cd00130, PAS, 2 hits |
Gene3Di | 4.10.280.10, 1 hit |
InterProi | View protein in InterPro IPR011598, bHLH_dom IPR036638, HLH_DNA-bd_sf IPR001610, PAC IPR000014, PAS IPR035965, PAS-like_dom_sf IPR013767, PAS_fold IPR013655, PAS_fold_3 IPR010578, SIM_C |
Pfami | View protein in Pfam PF00989, PAS, 1 hit PF08447, PAS_3, 1 hit PF06621, SIM_C, 1 hit |
SMARTi | View protein in SMART SM00353, HLH, 1 hit SM00086, PAC, 1 hit SM00091, PAS, 2 hits |
SUPFAMi | SSF47459, SSF47459, 1 hit SSF55785, SSF55785, 2 hits |
PROSITEi | View protein in PROSITE PS50888, BHLH, 1 hit PS50112, PAS, 2 hits PS51302, SIM_C, 1 hit |
MobiDBi | Search... |
Entry informationi
Entry namei | D4AA36_RAT | |
Accessioni | D4AA36Primary (citable) accession number: D4AA36 | |
Entry historyi | Integrated into UniProtKB/TrEMBL: | April 20, 2010 |
Last sequence update: | April 20, 2010 | |
Last modified: | May 25, 2022 | |
This is version 103 of the entry and version 1 of the sequence. See complete history. | ||
Entry statusi | Unreviewed (UniProtKB/TrEMBL) |