Q62255 (SALL3_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 118.


Names and origin

Protein names
Recommended name: Sal-like protein 3
Alternative name(s): MSal; Spalt-like protein 3
Gene names
Name: Sall3
Synonyms: Sal
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 1320 AA.
Sequence status: Complete.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

Probable transcription factor.

Subcellular location

Nucleus (probable).

Tissue specificity

Expressed in adult brain, testis and kidney, and at lower levels in adult ovaries and embryonic stem cells. In the embryo, expressed in the developing neuroectoderm of the brain, inner ear and spinal cord. Also weakly and transiently expressed in embryonic branchial arches, notochord, limb buds and heart.

Developmental stage

During embryogenesis, detected from 7 dpc onward in tissues derived from mesoderm and ectoderm.

Sequence similarities

Belongs to the sal C2H2-type zinc-finger protein family.

Contains 9 C2H2-type zinc fingers.

Sequence caution

The sequence BAC32197.1 differs from that shown. Reason: Erroneous initiation.

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q62255-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: No experimental confirmation available.
Isoform 2 (identifier: Q62255-2)

The sequence of this isoform differs from the canonical sequence as follows:
     993-1064: Missing.
Note: Lacks two zinc finger domains (6 and 7) and is the major isoform.
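The "993-1064: Missing" rule can be applied mechanically to the canonical sequence. The following is a minimal sketch, not UniProt tooling: the function name `apply_missing` is hypothetical, and the length check uses a placeholder string of the canonical length rather than the real residues.

```python
def apply_missing(canonical: str, start: int, end: int) -> str:
    """Drop residues start..end (1-based, inclusive) from a canonical sequence."""
    if not 1 <= start <= end <= len(canonical):
        raise ValueError("range lies outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return canonical[:start - 1] + canonical[end:]


# Toy check on a short string: removing positions 3-5 of ABCDEFGH.
toy = apply_missing("ABCDEFGH", 3, 5)

# Length check with a placeholder of the canonical length (not the real sequence).
placeholder = "X" * 1320
isoform2_length = len(apply_missing(placeholder, 993, 1064))
print(toy, isoform2_length)
```

Removing 72 residues from the 1320-residue canonical sequence gives 1248, which agrees with the length this entry reports for isoform 2.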

Sequence annotation (Features)

Feature key           Position(s)   Length  Description              Feature identifier

Molecule processing

Chain                 1 – 1320      1320    Sal-like protein 3       PRO_0000047025

Regions

Zinc finger           427 – 449     23      C2H2-type 1
Zinc finger           455 – 477     23      C2H2-type 2
Zinc finger           692 – 714     23      C2H2-type 3
Zinc finger           720 – 742     23      C2H2-type 4
Zinc finger           752 – 774     23      C2H2-type 5
Zinc finger           997 – 1019    23      C2H2-type 6
Zinc finger           1025 – 1047   23      C2H2-type 7
Zinc finger           1133 – 1155   23      C2H2-type 8
Zinc finger           1161 – 1183   23      C2H2-type 9
Compositional bias    142 – 160     19      Pro-rich
Compositional bias    217 – 220     4       Poly-Gln
Compositional bias    374 – 377     4       Poly-Ser
Compositional bias    829 – 932     104     Ser-rich

Amino acid modifications

Modified residue      109           1       Phosphoserine (by similarity)
Modified residue      932           1       Phosphoserine (by similarity)
Modified residue      1197          1       Phosphoserine (by similarity)

Natural variations

Alternative sequence  993 – 1064    72      Missing in isoform 2.    VSP_006834
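The feature coordinates above can be cross-checked against the isoform 2 note. The sketch below is illustrative only: the variable and function names are assumptions, and the coordinates are copied from the zinc finger and alternative sequence rows of this entry. It checks which C2H2 zinc fingers lie entirely inside the 993-1064 region and where the downstream fingers would sit in isoform 2 numbering.

```python
# Zinc finger spans (1-based, inclusive), copied from the feature table.
ZINC_FINGERS = {
    1: (427, 449), 2: (455, 477), 3: (692, 714),
    4: (720, 742), 5: (752, 774), 6: (997, 1019),
    7: (1025, 1047), 8: (1133, 1155), 9: (1161, 1183),
}
MISSING = (993, 1064)  # region absent from isoform 2


def within(span, region):
    """True if span lies entirely inside region (both inclusive)."""
    return region[0] <= span[0] and span[1] <= region[1]


# Feature length is end - start + 1 for inclusive coordinates.
lengths = {n: end - start + 1 for n, (start, end) in ZINC_FINGERS.items()}

# Fingers removed together with the missing region.
lost = [n for n, span in ZINC_FINGERS.items() if within(span, MISSING)]

# Fingers downstream of the deletion shift left by its length (72 residues).
offset = MISSING[1] - MISSING[0] + 1
shifted = {
    n: (start - offset, end - offset)
    for n, (start, end) in ZINC_FINGERS.items()
    if start > MISSING[1]
}
print(lost, shifted)
```

Under these coordinates the lost fingers are 6 and 7, consistent with the isoform 2 note, and fingers 8 and 9 move to 1061-1083 and 1089-1111 in isoform 2 numbering.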

Experimental info

Sequence conflict     131 – 132     2       EP → S in CAA66196. Ref.3
Sequence conflict     236           1       A → G in CAA66196. Ref.3
Sequence conflict     255 – 256     2       TA → NT in CAA66196. Ref.3
Sequence conflict     307           1       C → CC in CAA66196. Ref.3
Sequence conflict     497 – 498     2       NV → KC in CAA66196. Ref.3
Sequence conflict     744           1       A → G in CAA66196. Ref.3
Sequence conflict     765           1       V → I in BAC32197. Ref.4

Sequences

Isoform 1 [UniParc].

Last modified January 9, 2007. Version 2.
Checksum: 33A19F56C69D5EEB

Length: 1,320 AA. Mass: 138,760 Da.
        10         20         30         40         50         60 
MSRRKQAKPQ HLKSDEELPP QDGASEHGVP GDGAEDADSG SESRSGSEET SVCEKCCAEF 

        70         80         90        100        110        120 
FKWADFLQHK KTCTKNPLVL IVHDDEPAPP SEDFPEPSPA SSPSDRTESE VAEEVAPTEG 

       130        140        150        160        170        180 
SEVKAATKEA EPMDVEVSTD KGPPGPSVPP PPPALPPQPE PAAFSMPSTN VTLETLLSTK 

       190        200        210        220        230        240 
VAVAQFSQGA RAGGTTGAGG SVGAVAIPMI LEQLVALQQQ QIHQLQLIEQ IRSQVALMSR 

       250        260        270        280        290        300 
QPGPPLKPSA SAPGTASVQL QGLTPHAALQ LSAGPATASA GSGSTLPAAF DGPQHLSQPA 

       310        320        330        340        350        360 
SGTSTPCSTS AAPPDSGAHP ACSTGPAPGA VAAASSTVGN AVQPQNASTP PALGPGPLLS 

       370        380        390        400        410        420 
SASNLPNPLL PQTSSSSVIF PNPLVSIAAT ANALDPLSAL MKHRKGKPPN VSVFEPKASA 

       430        440        450        460        470        480 
EDPFFKHKCR FCAKVFGSDS ALQIHLRSHT GERPFKCNIC GNRFSTKGNL KVHFQRHKEK 

       490        500        510        520        530        540 
YPHIQMNPYP VPEYLDNVPT CSGIPYGMSL PPEKPVTTWL DSKPVLPTVP TSVGLQLPPT 

       550        560        570        580        590        600 
VPGTHNYTDS PSITPVSRSP QRPSPASSEC TSLSPGLNNT ESGITVRPES PQPLLGGPSL 

       610        620        630        640        650        660 
TKAEPVSLPC TSTRTGDAPV VGGQVSGLPT SAATAVTDSA CTSLGSPGLP AVSDQFKAQF 

       670        680        690        700        710        720 
PFGGLLDSMQ TSETSKLQQL VENIDKKMTD PNQCVICHRV LSCQSALKMH YRTHTGERPF 

       730        740        750        760        770        780 
KCKICGRAFT TKGNLKTHFG VHRAKPPLRV QHSCPICQKK FTNAVVLQQH IRMHMGGQIP 

       790        800        810        820        830        840 
NTPLPEGLQE AMDADLPFDE KNAETLSSFD DDIDENSMEE DSELKDTASD SSKPLLSYSG 

       850        860        870        880        890        900 
SCPPSPPSVI SSIAALENQM KMIDSVMNCQ QLANLKSVEN GSGESDRLSN DSSSAVGDLE 

       910        920        930        940        950        960 
SRSAGSPALS ESSSSQALSP AHSNGESFRS KSPGLGHQED PQEIPLKTER LDSPPPGPGN 

       970        980        990       1000       1010       1020 
GGALDLTAGH PGRPLIKEEA PFSLLFLSRE RGKCASTVCG VCGKPFACKS ALEIHYRSHT 

      1030       1040       1050       1060       1070       1080 
KERRFVCTVC RRGCSTMGNL KQHLLTHKLK ELPSQVFDPN FTLGPSHSTP SLASSPAPTM 

      1090       1100       1110       1120       1130       1140 
IKMEVNGHSK AIALGEGPAL PAGVQVPTGP QTVMSPGLAP MLAPPPRRTP KQHNCQSCGK 

      1150       1160       1170       1180       1190       1200 
TFSSASALQI HERTHTGEKP FGCTICGRAF TTKGNLKVHM GTHMWNNAPA RRGRRLSVEN 

      1210       1220       1230       1240       1250       1260 
PMALLGGDAL KFSEMFQKDL AARAMNVDPS FWNQYAAAIT NGLAMKNNEI SVIQNGGIPQ 

      1270       1280       1290       1300       1310       1320 
LPVSLGGGAI PPLGAMASGV DKARTGSSPP IVSLDKASSE TGASRPFARF IEDNKEIGIN 
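The sequence is displayed in blocks of ten residues under a position ruler, so it needs cleaning before use in other tools. A minimal sketch, assuming the display layout shown above (the function name `parse_sequence_block` is hypothetical); it is demonstrated on the first two rows only:

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join residue lines, skipping blank lines and numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Ruler lines contain only digits and whitespace.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        residues.append(stripped.replace(" ", ""))
    return "".join(residues)


block = """
        10         20         30         40         50         60
MSRRKQAKPQ HLKSDEELPP QDGASEHGVP GDGAEDADSG SESRSGSEET SVCEKCCAEF

        70         80         90        100        110        120
FKWADFLQHK KTCTKNPLVL IVHDDEPAPP SEDFPEPSPA SSPSDRTESE VAEEVAPTEG
"""

sequence = parse_sequence_block(block)
print(len(sequence), sequence[:10])
```

Applied to the full block, the result should be 1,320 residues long; a different length would suggest a ruler or spacing line was not filtered.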

Isoform 2 [UniParc].

Checksum: C1CFD01A0AB4B3B3

Length: 1,248 AA. Mass: 130,708 Da.

References

[1] "Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[2] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 6-1320 (ISOFORM 2).
Strain: C57BL/6.
Tissue: Brain.
[3] "The mouse homolog of the region specific homeotic gene spalt of Drosophila is expressed in the developing nervous system and in mesoderm-derived structures."
Ott T., Kaestner K.H., Monaghan A.P., Schuetz G.
Mech. Dev. 56:117-128(1996)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 30-1320 (ISOFORMS 1 AND 2).
Tissue: Brain and Embryo.
[4] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 692-1320.
Strain: C57BL/6J.
Tissue: Embryo.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  AC125210 Genomic DNA. No translation available.
  BC072631 mRNA. Translation: AAH72631.1.
  AK045051 mRNA. Translation: BAC32197.1. Different initiation.
  X97581 mRNA. Translation: CAA66196.1. Different termination.
CCDS: CCDS29371.1. [Q62255-2]
PIR: T30253.
RefSeq: NP_840064.2. NM_178280.3. [Q62255-2]
UniGene: Mm.215917.

3D structure databases

ProteinModelPortal: Q62255.
SMR: Q62255. Positions 410-483, 671-806, 994-1184.

Protein-protein interaction databases

BioGrid: 203420. 3 interactions.
IntAct: Q62255. 5 interactions.
STRING: 10090.ENSMUSP00000025457.

PTM databases

PhosphoSite: Q62255.

Proteomic databases

PRIDE: Q62255.

Genome annotation databases

Ensembl: ENSMUST00000057950; ENSMUSP00000056967; ENSMUSG00000024565. [Q62255-2]
GeneID: 20689.
KEGG: mmu:20689.
UCSC: uc008ftl.2. mouse. [Q62255-1]
  uc008ftm.1. mouse. [Q62255-2]

Organism-specific databases

CTD: 27164.
MGI: MGI:109295. Sall3.

Phylogenomic databases

eggNOG: NOG293478.
GeneTree: ENSGT00550000074555.
HOGENOM: HOG000231986.
HOVERGEN: HBG058921.
InParanoid: Q62255.
OMA: MEVNGHS.
OrthoDB: EOG7NCV2P.
PhylomeDB: Q62255.
TreeFam: TF317003.

Gene expression databases

Bgee: Q62255.
CleanEx: MM_SALL3.
Genevestigator: Q62255.

Family and domain databases

Gene3D: 3.30.160.60. 8 hits.
InterPro: IPR007087. Znf_C2H2.
  IPR015880. Znf_C2H2-like.
  IPR013087. Znf_C2H2/integrase_DNA-bd.
Pfam: PF00096. zf-C2H2. 1 hit.
SMART: SM00355. ZnF_C2H2. 9 hits.
PROSITE: PS00028. ZINC_FINGER_C2H2_1. 9 hits.
  PS50157. ZINC_FINGER_C2H2_2. 8 hits.

Other

NextBio: 299221.
PRO: Q62255.

Entry information

Entry name: SALL3_MOUSE
Accession: Primary (citable) accession number: Q62255
Secondary accession number(s): Q08EB0, Q52KR5, Q6GQT8, Q8BRD9
Entry history
Integrated into UniProtKB/Swiss-Prot: May 2, 2002
Last sequence update: January 9, 2007
Last modified: July 9, 2014
This is version 118 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments: Index of protein domains and families

MGD cross-references: Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot