Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q8BX22 (SALL4_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 93. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Interactions·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Sal-like protein 4
Alternative name(s):
Zinc finger protein SALL4
Gene names
Name:Sall4
OrganismMus musculus (Mouse) [Reference proteome]
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus

Protein attributes

Sequence length1067 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Probable transcription factor By similarity.

Subunit structure

Interacts with NANOG. Ref.6

Subcellular location

Nucleus By similarity.

Sequence similarities

Belongs to the sal C2H2-type zinc-finger protein family.

Contains 7 C2H2-type zinc fingers.

Binary interactions

Alternative products

This entry describes 3 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q8BX22-1)

Also known as: Sall4a;

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q8BX22-2)

Also known as: Sall4b;

The sequence of this isoform differs from the canonical sequence as follows:
     386-829: Missing.
Note: No experimental confirmation available.
Isoform 3 (identifier: Q8BX22-3)

Also known as: Sall4c;

The sequence of this isoform differs from the canonical sequence as follows:
     40-828: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 10671067Sal-like protein 4
PRO_0000261416

Regions

Zinc finger387 – 40923C2H2-type 1
Zinc finger415 – 43723C2H2-type 2
Zinc finger573 – 59523C2H2-type 3
Zinc finger601 – 62323C2H2-type 4
Zinc finger633 – 65523C2H2-type 5
Zinc finger880 – 90223C2H2-type 6
Zinc finger908 – 93023C2H2-type 7
Compositional bias217 – 2259Poly-Gln

Amino acid modifications

Modified residue3081Phosphoserine By similarity
Modified residue7551Phosphoserine By similarity
Modified residue7851Phosphoserine By similarity
Modified residue7981Phosphoserine By similarity
Modified residue10291Phosphoserine By similarity

Natural variations

Alternative sequence40 – 828789Missing in isoform 3.
VSP_021686
Alternative sequence386 – 829444Missing in isoform 2.
VSP_021687

Experimental info

Sequence conflict8651P → H in AAR91797. Ref.1
Sequence conflict9501A → V in AAR91797. Ref.1
Sequence conflict9501A → V in CAD32912. Ref.5
Sequence conflict9691S → Y in AAR91796. Ref.1
Sequence conflict9691S → Y in AAR91798. Ref.1
Sequence conflict9691S → Y in BAC33598. Ref.2
Sequence conflict9971I → T in AAR91797. Ref.1
Sequence conflict10221T → M in AAR91797. Ref.1

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 (Sall4a) [UniParc].

Last modified July 27, 2011. Version 2.
Checksum: 23B92F488AAF53E5

FASTA1,067113,110
        10         20         30         40         50         60 
MSRRKQAKPQ HINWEEGQGE QPQQLPSPDL AEALAAEEPG APVNSPGNCD EASEDSIPVK 

        70         80         90        100        110        120 
RPRREDTHIC NKCCAEFFSL SEFMEHKKSC TKTPPVLIMN DSEGPVPSED FSRAALSHQL 

       130        140        150        160        170        180 
GSPSNKDSLQ ENGSSSGDLK KLGTDSILYL KTEATQPSTP QDISYLPKGK VANTNVTLQA 

       190        200        210        220        230        240 
LRGTKVAVNQ RGAEAPMAPM PAAQGIPWVL EQILCLQQQQ LQQIQLTEQI RVQVNMWAAH 

       250        260        270        280        290        300 
ALHSGVAGAD TLKALSSHVS QQVSVSQQVS AAVALLSQKA SNPALSLDAL KQAKLPHASV 

       310        320        330        340        350        360 
PSAASPLSSG LTSFTLKPDG TRVLPNFVSR LPSALLPQTP GSVLLQSPFS AVTLDQSKKG 

       370        380        390        400        410        420 
KGKPQNLSAS ASVLDVKAKD EVVLGKHKCR YCPKVFGTDS SLQIHLRSHT GERPYVCPIC 

       430        440        450        460        470        480 
GHRFTTKGNL KVHLQRHPEV KANPQLLAEF QDKGAVSAAS HYALPVPVPA DESSLSVDAE 

       490        500        510        520        530        540 
PVPVTGTPSL GLPQKLTSGP NSRDLMGGSL PNDMQPGPSP ESEAGLPLLG VGMIHNPPKA 

       550        560        570        580        590        600 
GGFQGTGAPE SGSETLKLQQ LVENIDKATT DPNECLICHR VLSCQSSLKM HYRTHTGERP 

       610        620        630        640        650        660 
FQCKICGRAF STKGNLKTHL GVHRTNTTVK TQHSCPICQK KFTNAVMLQQ HIRMHMGGQI 

       670        680        690        700        710        720 
PNTPLPESPC DFTAPEPVAV SENGSASGVC QDDAAEGMEA EEVCSQDVPS GPSTVSLPVP 

       730        740        750        760        770        780 
SAHLASPSLG FSVLASLDTQ GKGALPALAL QRQSSRENSS LEGGDTGPAN DSSLLVGDQE 

       790        800        810        820        830        840 
CQSRSPDATE TMCYQAVSPA NSQAGSVKSR SPEGHKAEGV ESCRVDTEGR TSLPPTFIRA 

       850        860        870        880        890        900 
QPTFVKVEVP GTFVGPPSMP SGMPPLLASQ PQPRRQAKQH CCTRCGKNFS SASALQIHER 

       910        920        930        940        950        960 
THTGEKPFVC NICGRAFTTK GNLKVHYMTH GANNNSARRG RKLAIENPMA ALSAEGKRAP 

       970        980        990       1000       1010       1020 
EVFSKELLSP AVSVDPASWN QYTSVLNGGL AMKTNEISVI QSGGIPTLPV SLGASSVVSN 

      1030       1040       1050       1060 
GTISKLDGSQ TGVSMPMSGN GEKLAVPDGM AKHQFPHFLE ENKIAVS 

« Hide

Isoform 2 (Sall4b) [UniParc].

Checksum: 8C762461A0483EF7
Show »

FASTA62366,215
Isoform 3 (Sall4c) [UniParc].

Checksum: D8DCBBFE0618EF02
Show »

FASTA27829,725

References

« Hide 'large scale' references
[1]"Characterization of the murine Okihiro syndrome gene (Sall4): sequence, expression and alternative splicing."
Ma Y., Di C., Kang Q., Lai R., Theus J., Chai L.
Submitted (NOV-2003) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1; 2 AND 3).
[2]"The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J. expand/collapse author list , Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Strain: C57BL/6J.
[3]"Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S. expand/collapse author list , Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[4]Mural R.J., Adams M.D., Myers E.W., Smith H.O., Venter J.C.
Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[5]"Cloning and expression of Sall4."
Kohlhase J., Kispert A., Heinrich M.
Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 324-958.
Strain: 129/Sv.
[6]"Sall4 interacts with Nanog and co-occupies Nanog genomic sites in embryonic stem cells."
Wu Q., Chen X., Zhang J., Loh Y.-H., Low T.-Y., Zhang W., Zhang W., Sze S.-K., Lim B., Ng H.-H.
J. Biol. Chem. 281:24090-24094(2006) [PubMed] [Europe PMC] [Abstract]
Cited for: INTERACTION WITH NANOG.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AY463371 mRNA. Translation: AAR91796.1.
AY463372 mRNA. Translation: AAR91797.1.
AY463373 mRNA. Translation: AAR91798.1.
AK049188 mRNA. Translation: BAC33598.1.
AL929248 Genomic DNA. Translation: CAM21964.1.
CH466551 Genomic DNA. Translation: EDL06559.1.
AJ488904 Genomic DNA. Translation: CAD32912.1.
IPIIPI00474168.
IPI00475164.
IPI00761180.
RefSeqNP_780512.2. NM_175303.3.
NP_958797.2. NM_201395.2.
NP_958798.2. NM_201396.2.
UniGeneMm.434054.
Mm.490784.

3D structure databases

HSSPHSSP built from PDB template 1SRK based on UniProtKB O35615.
ProteinModelPortalQ8BX22.
SMRQ8BX22. Positions 387-448, 572-657, 869-930.
ModBaseSearch...

Protein-protein interaction databases

DIPDIP-29926N.
IntActQ8BX22. 42 interactions.

PTM databases

PhosphoSiteQ8BX22.

Proteomic databases

PRIDEQ8BX22.

Protocols and materials databases

DNASU99377.
StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENSMUST00000029061; ENSMUSP00000029061; ENSMUSG00000027547.
ENSMUST00000075044; ENSMUSP00000074556; ENSMUSG00000027547.
ENSMUST00000103074; ENSMUSP00000099363; ENSMUSG00000027547.
GeneID99377.
KEGGmmu:99377.

Organism-specific databases

CTD57167.
MGIMGI:2139360. Sall4.

Phylogenomic databases

eggNOGCOG5048.
GeneTreeENSGT00550000074555.
HOGENOMHOG000231986.
HOVERGENHBG058921.
InParanoidA2AV00.
OMAPHANIPS.
OrthoDBEOG4WWRHV.

Gene expression databases

ArrayExpressQ8BX22.
BgeeQ8BX22.
CleanExMM_SALL4.
GenevestigatorQ8BX22.
GermOnlineENSMUSG00000027547. Mus musculus.

Family and domain databases

Gene3D3.30.160.60. 7 hits.
InterProIPR007087. Znf_C2H2.
IPR015880. Znf_C2H2-like.
IPR013087. Znf_C2H2/integrase_DNA-bd.
[Graphical view]
SMARTSM00355. ZnF_C2H2. 8 hits.
[Graphical view]
PROSITEPS00028. ZINC_FINGER_C2H2_1. 7 hits.
PS50157. ZINC_FINGER_C2H2_2. 7 hits.
[Graphical view]
ProtoNetSearch...

Other

NextBio353905.
SOURCESearch...

Entry information

Entry nameSALL4_MOUSE
AccessionPrimary (citable) accession number: Q8BX22
Secondary accession number(s): A2AV00 expand/collapse secondary AC list , Q6S7E8, Q6S7E9, Q7TST6
Entry history
Integrated into UniProtKB/Swiss-Prot: November 28, 2006
Last sequence update: July 27, 2011
Last modified: May 1, 2013
This is version 93 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families