Protein

Spermatogenesis- and oogenesis-specific basic helix-loop-helix-containing protein 2

Gene

Sohlh2

Organism
Rattus norvegicus (Rat)
Status
Reviewed - Annotation score: 2 out of 5 - Experimental evidence at transcript level

Function

Probable transcription factor, which may be involved in spermatogenesis and oogenesis (by similarity).

Keywords - Molecular function

Developmental protein

Keywords - Biological process

Differentiation, Oogenesis, Spermatogenesis, Transcription, Transcription regulation

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name: Spermatogenesis- and oogenesis-specific basic helix-loop-helix-containing protein 2
Gene names
Name: Sohlh2
Organism: Rattus norvegicus (Rat)
Taxonomic identifier: 10116 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Rattus
Proteomes
  • UP000002494 Component: Chromosome 2

Organism-specific databases

RGD: 1589577. Sohlh2.

Subcellular location

  • Nucleus (PROSITE-ProRule annotation)

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Chain | 1 – 462 | 462 | Spermatogenesis- and oogenesis-specific basic helix-loop-helix-containing protein 2 | PRO_0000315702

Proteomic databases

PaxDb: Q3MHT3.
PRIDE: Q3MHT3.

Structure

3D structure databases

ProteinModelPortal: Q3MHT3.

Family & Domains

Domains and Repeats

Feature key | Position(s) | Length | Description
Domain | 200 – 251 | 52 | bHLH (PROSITE-ProRule annotation)

Sequence similarities

Contains 1 bHLH (basic helix-loop-helix) domain (PROSITE-ProRule annotation).

Phylogenomic databases

eggNOG: ENOG410IUEE. Eukaryota.
eggNOG: ENOG410ZGUF. LUCA.
GeneTree: ENSGT00390000016050.
HOGENOM: HOG000070147.
HOVERGEN: HBG103652.
InParanoid: Q3MHT3.
OMA: LPQHCNS.
OrthoDB: EOG7HB59C.
PhylomeDB: Q3MHT3.
TreeFam: TF336841.

Family and domain databases

Gene3D: 4.10.280.10. 1 hit.
InterPro: IPR011598. bHLH_dom.
InterPro: IPR032669. SOHLH2.
PANTHER: PTHR16223:SF16. PTHR16223:SF16. 1 hit.
Pfam: PF00010. HLH. 1 hit.
SMART: SM00353. HLH. 1 hit.
SUPFAM: SSF47459. SSF47459. 1 hit.
PROSITE: PS50888. BHLH. 1 hit.

Sequence

Sequence status: Complete.

Q3MHT3-1

        10         20         30         40         50
MADRISTGEL GRRPGQGRVD LLLVGDATRY FLAGSVQKFF SSTAQITLTI
        60         70         80         90        100
SNVKKVAALL AANSFDIIFL KVTSTLTAEE QEAVRLIRSG KKKNTHLLFA
       110        120        130        140        150
FVIPEKLRGY ISDYGADISF NEPLTLEKVN TVINYWKTYF TNTDMGNTEL
       160        170        180        190        200
PPECRLYFQT SCSELGGHFP TDLFLCSELL NNDTGLGLKA PLSSPERNKK
       210        220        230        240        250
ASFLHSSKEK LRRERIKFCC EQLRTLLPYV KGRKSDVASV IEATVDYVKQ
       260        270        280        290        300
VRESLSPAIM AQITESLQSN KRFSKRQMPI ELFLPCTATS QRGDAMLTSA
       310        320        330        340        350
FSPVQEIQLL ADQGLNVYSM PAAGGPLEEA VRGQPGSVSE DLYKTRVPST
       360        370        380        390        400
TLSLNSFHAV RYCSGPVSPH EAAARTNQNI SIYLPPTGPS VSSFTPQHCN
       410        420        430        440        450
AMLCPTRPAS SSCLCTSGHE LPASSRTASS SIFRGFRESD SGHQASQQPT
       460
GPSLQPQDSS YF
Length: 462
Mass (Da): 50,669
Last modified: October 25, 2005 - v1
Checksum: 589BADC7ED81958F
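
Positions in this entry are 1-based and inclusive, so a feature spanning 200 – 251 covers 251 - 200 + 1 = 52 residues. The following minimal Python sketch is not part of the UniProt record; it joins the sequence blocks, checks the stated length, slices out the bHLH domain, and estimates the average mass from standard average residue masses plus one water molecule. The helper names (`feature`, `average_mass`) and the mass table are illustrative assumptions, and the mass estimate should land near the reported value but may differ slightly through rounding.

```python
# Sequence blocks copied from the entry (ruler lines omitted).
BLOCKS = """
MADRISTGEL GRRPGQGRVD LLLVGDATRY FLAGSVQKFF SSTAQITLTI
SNVKKVAALL AANSFDIIFL KVTSTLTAEE QEAVRLIRSG KKKNTHLLFA
FVIPEKLRGY ISDYGADISF NEPLTLEKVN TVINYWKTYF TNTDMGNTEL
PPECRLYFQT SCSELGGHFP TDLFLCSELL NNDTGLGLKA PLSSPERNKK
ASFLHSSKEK LRRERIKFCC EQLRTLLPYV KGRKSDVASV IEATVDYVKQ
VRESLSPAIM AQITESLQSN KRFSKRQMPI ELFLPCTATS QRGDAMLTSA
FSPVQEIQLL ADQGLNVYSM PAAGGPLEEA VRGQPGSVSE DLYKTRVPST
TLSLNSFHAV RYCSGPVSPH EAAARTNQNI SIYLPPTGPS VSSFTPQHCN
AMLCPTRPAS SSCLCTSGHE LPASSRTASS SIFRGFRESD SGHQASQQPT
GPSLQPQDSS YF
"""

# Removing all whitespace yields the contiguous protein sequence.
sequence = "".join(BLOCKS.split())

# Average (not monoisotopic) residue masses in daltons; assumed standard values.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added for the free N- and C-termini


def feature(seq, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


def average_mass(seq):
    """Estimate the average molecular mass of an unmodified chain, in daltons."""
    return sum(AVERAGE_RESIDUE_MASS[residue] for residue in seq) + WATER_MASS


bhlh = feature(sequence, 200, 251)

print("Length:", len(sequence))
print("bHLH domain:", len(bhlh), "residues:", bhlh)
print("Estimated mass (Da):", round(average_mass(sequence)))
```

Converting a 1-based inclusive range to a Python slice only requires subtracting one from the start, because Python's slice end is already exclusive.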

Sequence databases

EMBL / GenBank / DDBJ: BC104697 mRNA. Translation: AAI04698.1.
RefSeq: NP_001030133.1. NM_001034961.1.
UniGene: Rn.137450.

Genome annotation databases

Ensembl: ENSRNOT00000044424; ENSRNOP00000049772; ENSRNOG00000038091.
GeneID: 619575.
KEGG: rno:619575.
UCSC: RGD:1589577. rat.

Cross-references

Organism-specific databases

CTD: 54937.
RGD: 1589577. Sohlh2.

Miscellaneous databases

NextBio: 714666.
PRO: Q3MHT3.

Publications

  1. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127 (2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Tissue: Testis.

Entry information

Entry name: SOLH2_RAT
Accession: Primary (citable) accession number: Q3MHT3
Entry history
Integrated into UniProtKB/Swiss-Prot: January 15, 2008
Last sequence update: October 25, 2005
Last modified: May 11, 2016
This is version 71 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
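
The identity and overlap criteria described above can be illustrated with a short Python sketch. This is not UniRef's actual clustering pipeline; it assumes the two sequences have already been aligned into equal-length rows with `-` as the gap character, and the function names (`identity_and_overlap`, `joins_cluster`) are hypothetical. Identity is taken over columns where both rows carry a residue, and overlap is the fraction of the seed sequence covered by those columns.

```python
def identity_and_overlap(aligned_query, aligned_seed):
    """Return (identity, overlap) for two gapped alignment rows of equal length.

    identity = matching columns / columns where both rows have a residue
    overlap  = columns where both rows have a residue / ungapped seed length
    """
    if len(aligned_query) != len(aligned_seed):
        raise ValueError("alignment rows must have equal length")
    # Keep only columns where neither row has a gap.
    paired = [(q, s) for q, s in zip(aligned_query, aligned_seed)
              if q != "-" and s != "-"]
    seed_length = sum(1 for s in aligned_seed if s != "-")
    if not paired or seed_length == 0:
        return 0.0, 0.0
    matches = sum(1 for q, s in paired if q == s)
    return matches / len(paired), len(paired) / seed_length


def joins_cluster(aligned_query, aligned_seed, min_identity, min_overlap=0.80):
    """Check whether a query meets both thresholds relative to the seed."""
    identity, overlap = identity_and_overlap(aligned_query, aligned_seed)
    return identity >= min_identity and overlap >= min_overlap


seed = "MADRISTGEL"          # first ten residues of this entry, used as a toy seed
one_mismatch = "MADKISTGEL"  # 9 of 10 aligned residues identical
short_fragment = "MADRIS----"  # identical, but covers only 60% of the seed

print(joins_cluster(one_mismatch, seed, min_identity=0.90))
print(joins_cluster(short_fragment, seed, min_identity=0.90))
print(joins_cluster(short_fragment, seed, min_identity=0.50))
```

The fragment example shows why the overlap requirement matters: a perfectly identical but short fragment fails the 80% overlap test at every identity level.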