Q24167 (SIMA_DROME) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 115.


Names and origin

Protein names: Recommended name: Protein similar
Gene names: Name: sima; ORF Names: CG7951
Organism: Drosophila melanogaster (Fruit fly) [Reference proteome]
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora

Protein attributes

Sequence length: 1507 AA.
Sequence status: Complete.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

Possible DNA-binding transcriptional activator.

Subunit structure

Efficient DNA binding requires dimerization with another bHLH protein. Interacts with VHL.

Subcellular location

Nucleus (potential).

Tissue specificity

Ubiquitously expressed in the embryo.

Sequence similarities

Contains 1 bHLH (basic helix-loop-helix) domain.

Contains 1 PAC (PAS-associated C-terminal) domain.

Contains 2 PAS (PER-ARNT-SIM) domains.

Sequence annotation (Features)

Feature key         Position(s)   Length  Description      Feature identifier

Molecule processing

Chain               1 – 1507      1507    Protein similar  PRO_0000127444

Regions

Domain              72 – 125      54      bHLH
Domain              167 – 240     74      PAS 1
Domain              307 – 377     71      PAS 2
Domain              381 – 422     42      PAC
Coiled coil         880 – 908     29      Potential
Coiled coil         982 – 1054    73      Potential
Coiled coil         1110 – 1162   53      Potential
Compositional bias  26 – 39       14      Poly-Ser
Compositional bias  577 – 587     11      Pro-rich
Compositional bias  718 – 725     8       Poly-Ser
Compositional bias  759 – 763     5       Poly-Gln
Compositional bias  767 – 776     10      Poly-Gln
Compositional bias  907 – 918     12      Poly-Gln
Compositional bias  945 – 948     4       Poly-Gln
Compositional bias  990 – 998     9       Poly-Gln
Compositional bias  1020 – 1038   19      Poly-Gln
Compositional bias  1113 – 1126   14      Poly-Gln
Compositional bias  1146 – 1162   17      Poly-Gln
Compositional bias  1205 – 1208   4       Poly-Gln
Compositional bias  1277 – 1284   8       Poly-Gln
Compositional bias  1298 – 1301   4       Poly-Asp
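
The positions in this table are 1-based and inclusive, so each listed length should equal end - start + 1. The following Python sketch checks that for a handful of rows copied from the table; the names FEATURES, span_length and mismatches are illustrative and not part of any UniProt tool.

```python
# A few rows from the feature table as (label, start, end, listed_length).
# The selection is arbitrary; any other row can be added the same way.
FEATURES = [
    ("Domain bHLH", 72, 125, 54),
    ("Domain PAS 1", 167, 240, 74),
    ("Domain PAS 2", 307, 377, 71),
    ("Domain PAC", 381, 422, 42),
    ("Coiled coil", 880, 908, 29),
    ("Compositional bias Poly-Ser", 26, 39, 14),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def mismatches(features):
    """Return the labels whose listed length disagrees with their positions."""
    return [label for label, start, end, listed in features
            if span_length(start, end) != listed]


if __name__ == "__main__":
    print(mismatches(FEATURES))
```

An empty list means every checked row is internally consistent.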

Experimental info

Feature key         Position(s)   Length  Description
Sequence conflict   38            1       S → A in AAC47303. Ref.1
Sequence conflict   345           1       S → L in AAC47303. Ref.1
Sequence conflict   492           1       A → V in AAC47303. Ref.1
Sequence conflict   588           1       T → I in AAC47303. Ref.1
Sequence conflict   709           1       T → K in AAC47303. Ref.1
Sequence conflict   776           1       Q → QQQQ Ref.1
Sequence conflict   895           1       Q → QQ in AAC47303. Ref.1
Sequence conflict   902           1       G → S in AAC47303. Ref.1
Sequence conflict   982           1       A → T in AAC47303. Ref.1
Sequence conflict   1125 – 1126   2       Missing Ref.1
Sequence conflict   1154 – 1157   4       Missing Ref.1
Sequence conflict   1444          1       F → L in AAC47303. Ref.1
Sequence conflict   1447          1       G → C in AAC47303. Ref.1
Sequence conflict   1451          1       S → N in AAC47303. Ref.1
Sequence conflict   1494          1       D → G in AAC47303. Ref.1
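
Each sequence conflict above describes how the sequence reported in Ref.1 differs from the canonical one at a 1-based position. As a minimal sketch, the Python below applies one such conflict (position 38, S → A) to the first 60 residues of the entry. The function name apply_conflict and the constant N_TERM are hypothetical helpers introduced for illustration.

```python
# First 60 residues of Q24167, copied from the Sequences section.
N_TERM = "MVSLIDTIEAAAEKQKQSQAVVTNTSASSSSCSSSFSSSPPSSSVGSPSPGAPKTNLTAS"


def apply_conflict(seq: str, position: int, original: str, replacement: str) -> str:
    """Replace `original` at 1-based `position` with `replacement`.

    Substitutions (S -> A), insertions (Q -> QQQQ) and deletions
    ("Missing", i.e. replacement == "") all fit this shape.
    """
    index = position - 1
    found = seq[index:index + len(original)]
    if found != original:
        raise ValueError(f"expected {original!r} at {position}, found {found!r}")
    return seq[:index] + replacement + seq[index + len(original):]


if __name__ == "__main__":
    variant = apply_conflict(N_TERM, 38, "S", "A")
    print(N_TERM[30:40], "->", variant[30:40])
```

Insertions and deletions shift every later coordinate, so when several conflicts are applied to one sequence it is safest to work from the highest position down to the lowest.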

Sequences

Q24167 [UniParc].

Last modified February 21, 2001. Version 2.
Checksum: 4102939C8FBFB0C6

Length: 1,507 AA. Mass: 165,824 Da.
        10         20         30         40         50         60 
MVSLIDTIEA AAEKQKQSQA VVTNTSASSS SCSSSFSSSP PSSSVGSPSP GAPKTNLTAS 

        70         80         90        100        110        120 
GKPKEKRRNN EKRKEKSRDA ARCRRSKETE IFMELSAALP LKTDDVNQLD KASVMRITIA 

       130        140        150        160        170        180 
FLKIREMLQF VPSLRDCNDD IKQDIETAED QQEVKPKLEV GTEDWLNGAE ARELLKQTMD 

       190        200        210        220        230        240 
GFLLVLSHEG DITYVSENVV EYLGITKIDT LGQQIWEYSH QCDHAEIKEA LSLKRELAQK 

       250        260        270        280        290        300 
VKDEPQQNSG VSTHHRDLFV RLKCTLTSRG RSINIKSASY KVIHITGHLV VNAKGERLLM 

       310        320        330        340        350        360 
AIGRPIPHPS NIEIPLGTST FLTKHSLDMR FTYVDDKMHD LLGYSPKDLL DTSLFSCQHG 

       370        380        390        400        410        420 
ADSERLMATF KSVLSKGQGE TSRYRFLGKY GGYCWILSQA TIVYDKLKPQ SVVCVNYVIS 

       430        440        450        460        470        480 
NLENKHEIYS LAQQTAASEQ KEQHHQAAET EKEPEKAADP EIIAQETKET VNTPIHTSEL 

       490        500        510        520        530        540 
QAKPLQLESE KAEKTIEETK TIATIPPVTA TSTADQIKQL PESNPYKQIL QAELLIKREN 

       550        560        570        580        590        600 
HSPGPRTITA QLLSGSSSGL RPEEKRPKSV TASVLRPSPA PPLTPPPTAV LCKKTPLGVE 

       610        620        630        640        650        660 
PNLPPTTTAT AAIISSSNQQ LQIAQQTQLQ NPQQPAQDMS KGFCSLFADD GRGLTMLKEE 

       670        680        690        700        710        720 
PDDLSHHLAS TNCIQLDEMT PFSDMLVGLM GTCLLPEDIN SLDSTTCSTT ASGQHYQSPS 

       730        740        750        760        770        780 
SSSTSAPSNT SSSNNSYANS PLSPLTPNST ATASNPSHQQ QQQHHNQQQQ QQQQQQHHPQ 

       790        800        810        820        830        840 
HHDNSNSSSN IDPLFNYREE SNDTSCSQHL HSPSITSKSP EDSSLPSLCS PNSLTQEDDF 

       850        860        870        880        890        900 
SFEAFAMRAP YIPIDDDMPL LTETDLMWCP PEDLQTMVPK EIDAIQQQLQ QLQQQHHQQY 

       910        920        930        940        950        960 
AGNTGYQQQQ QQPQLQQQHF SNSLCSSPAS TVSSLSPSPV QQHHQQQQAA VFTSDSSELA 

       970        980        990       1000       1010       1020 
ALLCGSGNGT LSILAGSGVT VAEECNERLQ QHQQQQQQTS GNEFRTFQQL QQELQLQEEQ 

      1030       1040       1050       1060       1070       1080 
QQRQQQQQQQ QQQQQQQQLL SLNIECKKEK YDVQMGGSLC HPMEDAFEND YSKDSANLDC 

      1090       1100       1110       1120       1130       1140 
WDLIQMQVVD TEPVSPNAAS PTPCKVSAIQ LLQQQQQLQQ QQQQQQNIIL NAVPLITIQN 

      1150       1160       1170       1180       1190       1200 
NKELMQQQQQ QQQQQQQEQL QQPAIKLLNG ASIAPVNTKA TIRLVESKPP TTTQSRMAKV 

      1210       1220       1230       1240       1250       1260 
NLVPQQQQHG NKRHLNSATG AGNPVESKRL KSGTLCLDVQ SPQLLQQLIG KDPAQQQTQA 

      1270       1280       1290       1300       1310       1320 
AKRAGSERWQ LSAESKQQKQ QQQQSNSVLK NLLVSGRDDD DSEAMIIDED NSLVQPIPLG 

      1330       1340       1350       1360       1370       1380 
KYGLPLHCHT STSSVLRDYH NNPLISGTNF QLSPVFGGSD SSGGDGETGS VVSLDDSVPP 

      1390       1400       1410       1420       1430       1440 
GLTACDTDAS SDSGIDENSL MDGASGSPRK RLSSTSNSTN QAESAPPALD VETPVTQKSV 

      1450       1460       1470       1480       1490       1500 
EEEFEGGGSG SNAPSRKTSI SFLDSSNPLL HTPAMMDLVN DDYIMGEGGF EFSDNQLEQV 


LGWPEIA 
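
The sequence above is laid out in blocks of ten residues under a ruler of position numbers. A minimal Python sketch for turning that layout back into one contiguous string is shown below. It assumes that ruler lines contain only digits and whitespace; parse_sequence_block and EXCERPT are illustrative names, and the excerpt holds only the first and last rows of the listing.

```python
def parse_sequence_block(text: str) -> str:
    """Join residue groups into one string, skipping ruler and blank lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines consist only of position numbers; blank lines have no tokens.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First ruler/residue pair and the final, unnumbered row of the listing.
EXCERPT = """\
        10         20         30         40         50         60
MVSLIDTIEA AAEKQKQSQA VVTNTSASSS SCSSSFSSSP PSSSVGSPSP GAPKTNLTAS

LGWPEIA
"""

if __name__ == "__main__":
    sequence = parse_sequence_block(EXCERPT)
    print(len(sequence), sequence[:10], sequence[-7:])
```

Applied to the full listing, the last ruler value (1500) plus the seven residues of the final row accounts for the stated length of 1,507.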

References

[1] "The Drosophila melanogaster similar bHLH-PAS gene encodes a protein related to human hypoxia-inducible factor 1 alpha and Drosophila single-minded."
Nambu J.R., Chen W., Hu S., Crews S.T.
Gene 172:249-254(1996)
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
[2] "The genome sequence of Drosophila melanogaster."
Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.
Science 287:2185-2195(2000)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Berkeley.
[3] "Annotation of the Drosophila melanogaster euchromatic genome: a systematic review."
Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.
Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002)
Cited for: GENOME REANNOTATION.
Strain: Berkeley.

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: U43090 mRNA. Translation: AAC47303.1.
EMBL/GenBank/DDBJ: AE014297 Genomic DNA. Translation: AAF57008.2.
PIR: JC4851.
RefSeq: NP_524584.2. NM_079845.4.
UniGene: Dm.6953.

3D structure databases

ProteinModelPortal: Q24167.
SMR: Q24167. Positions 74-427.

Protein-protein interaction databases

DIP: DIP-21002N.
IntAct: Q24167. 3 interactions.
MINT: MINT-304504.
STRING: 7227.FBpp0084931.

Proteomic databases

PaxDb: Q24167.

Genome annotation databases

EnsemblMetazoa: FBtr0085565; FBpp0084931; FBgn0015542.
GeneID: 43580.
KEGG: dme:Dmel_CG7951.

Organism-specific databases

CTD: 43580.
FlyBase: FBgn0015542. sima.

Phylogenomic databases

eggNOG: NOG289264.
GeneTree: ENSGT00650000093281.
InParanoid: Q24167.
KO: K08268.
OMA: PGPRTIT.
OrthoDB: EOG4JQ2CF.
PhylomeDB: Q24167.

Gene expression databases

Bgee: Q24167.
GermOnline: CG7951. Drosophila melanogaster.

Family and domain databases

InterPro: IPR011598. bHLH_dom.
InterPro: IPR001610. PAC.
InterPro: IPR000014. PAS.
InterPro: IPR013767. PAS_fold.
InterPro: IPR013655. PAS_fold_3.
Pfam: PF00989. PAS. 1 hit.
Pfam: PF08447. PAS_3. 1 hit.
SMART: SM00353. HLH. 1 hit.
SMART: SM00086. PAC. 1 hit.
SMART: SM00091. PAS. 2 hits.
PROSITE: PS50888. BHLH. 1 hit.
PROSITE: PS50113. PAC. False negative.
PROSITE: PS50112. PAS. 2 hits.

Other

ChiTaRS: sima. drosophila.
GenomeRNAi: 43580.
NextBio: 834672.

Entry information

Entry name: SIMA_DROME
Accession: Primary (citable) accession number: Q24167
Secondary accession number(s): Q9VAA5
Entry history:
Integrated into UniProtKB/Swiss-Prot: December 15, 1998
Last sequence update: February 21, 2001
Last modified: May 1, 2013
This is version 115 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Relevant documents

Drosophila

Drosophila: entries, gene names and cross-references to FlyBase

SIMILARITY comments

Index of protein domains and families