Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q9CWL2 (CASZ1_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 86. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Zinc finger protein castor homolog 1
Alternative name(s):
Castor-related protein
Gene names
Name:Casz1
Synonyms:Cst, D4Ertd432e, Kiaa3026
OrganismMus musculus (Mouse) [Reference proteome]
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus

Protein attributes

Sequence length1761 AA.
Sequence statusComplete.
Protein existenceEvidence at transcript level

General annotation (Comments)

Function

Probable transcription factor By similarity.

Subcellular location

Nucleus Probable.

Tissue specificity

Expressed in the brain stem and the thalamencephalon. Ref.4 Ref.5

Developmental stage

First expressed at E8.0 in the developing heart and throughout development. By E8.5, it is expressed in the lateral neural folds of the hindbrain and extends anteriorly and posteriorly to eventually cover the dorsal neural tube from the isthmus to its caudal end. From E9.5, it is expressed in the dorsomedial telencephalon. In the hindbrain, it is confined to trigeminal motor neurons and to migrating facial branchiomotor neurons. In the peripheral nervous system, it is expressed in cranial and in dorsal root ganglia. Also expressed in the developing eye and in the nasal placode. Ref.4

Induction

Up-regulated during myoblast differentiation. Ref.5

Sequence similarities

Contains 8 C2H2-type zinc fingers.

Sequence caution

The sequence BAB27027.1 differs from that shown. Reason: Frameshift at position 946.

Ontologies

Keywords
   Biological processTranscription
Transcription regulation
   Cellular componentNucleus
   Coding sequence diversityAlternative splicing
   DomainRepeat
Zinc-finger
   LigandDNA-binding
Metal-binding
Zinc
   PTMPhosphoprotein
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Biological_processregulation of transcription, DNA-dependent

Inferred from electronic annotation. Source: UniProtKB-KW

transcription, DNA-dependent

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular_componentnucleus

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular_functionDNA binding

Inferred from electronic annotation. Source: UniProtKB-KW

zinc ion binding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q9CWL2-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: Gene prediction based on similarity to human ortholog.
Isoform 2 (identifier: Q9CWL2-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1166-1166: N → K
     1167-1761: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 17611761Zinc finger protein castor homolog 1
PRO_0000046913

Regions

Zinc finger550 – 57425C2H2-type 1
Zinc finger609 – 63325C2H2-type 2
Zinc finger667 – 69125C2H2-type 3
Zinc finger1031 – 105525C2H2-type 4
Zinc finger1300 – 132425C2H2-type 5
Zinc finger1457 – 148125C2H2-type 6
Zinc finger1515 – 153723C2H2-type 7
Zinc finger1571 – 159525C2H2-type 8
Compositional bias383 – 41634Pro-rich
Compositional bias1080 – 114364Pro-rich
Compositional bias1669 – 168113Poly-Thr
Compositional bias1702 – 172928Asp-rich

Amino acid modifications

Modified residue9811Phosphoserine By similarity

Natural variations

Alternative sequence11661N → K in isoform 2.
VSP_027095
Alternative sequence1167 – 1761595Missing in isoform 2.
VSP_027096

Experimental info

Sequence conflict8841A → S in BAD32619. Ref.2
Sequence conflict9121Q → E in BAB27027. Ref.3
Sequence conflict10871L → P in BAB27027. Ref.3
Sequence conflict10871L → P in BAD32619. Ref.2

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified July 24, 2007. Version 3.
Checksum: C0C1959D68D62D50

FASTA1,761191,188
        10         20         30         40         50         60 
MDLGTAESTR CTDPPAGKPP MAAKRKGGLK LNAICAKLSR QVVVEKGAEA GSQAEGSPLH 

        70         80         90        100        110        120 
PRDKERSGPE SGVSRAPRSE EDKRRAVIEK WVNGEYCEDP APTPVLGRIA RDQELPPEGV 

       130        140        150        160        170        180 
YMVQPQGCSD EEDHAEEPSK DNSVLEEKES DGTASKDDSG PSTRQASGET SSLRDYAAST 

       190        200        210        220        230        240 
MTEFLGMFGY DDQNTRDELA KKISFEKPHA GSTPEVAASS MLPSSEDTLS KRARFSKYEE 

       250        260        270        280        290        300 
YIRKLKAGEQ LPWPAHGSKA EDRAGKEVVG PLPSLRLPSN TAHLETKATI LPLPSHSSVQ 

       310        320        330        340        350        360 
MQNLVARASK YDFFIHKLKT GENLRPQNGS TYKKPSKYDL ENVKYLHLFK PGEGSPDMGG 

       370        380        390        400        410        420 
AIAFKTGKVG RPSKYDVRGI QKPGPTKIPP APSLVPTPLT NVPSAPSTPG PGPEPPASLS 

       430        440        450        460        470        480 
FNTPEYLKST FSKTDSITTG TVSTVKNGLP TDKPAVTEDV NIYQKYIARF SGSQHCGHIH 

       490        500        510        520        530        540 
CAYQYREHYH CLDPECNYQR FTSKQDVIRH YNMHKKRDNS LQHGFMRFSP LDDCSVYYHG 

       550        560        570        580        590        600 
CHLNGKSTHY HCMQVGCNKV YTSTSDVMTH ENFHKKNTQL INDGFQRFRA TEDCGTADCQ 

       610        620        630        640        650        660 
FYGQKTTHFH CRRPGCTFTF KNKCDIEKHK SYHIKDDAYA KDGFKKFYKY EECKYEGCMY 

       670        680        690        700        710        720 
SKATNHFHCI RAGCGFTFTS TSQMTSHKRK HERRHIRSSG ALGLPASLLG AKDTEHEESS 

       730        740        750        760        770        780 
NDDLVDFSAL SSKNSSLSAS PTSQQSSASL AAAAAATTAE AIPSATKPPN SKMAGLLPQG 

       790        800        810        820        830        840 
LSGSIPLALA LSNSGLPTTT PYFPLLPNRG SASLPVGSPG LLGSMSSGAT TSATPDMPAL 

       850        860        870        880        890        900 
MASRAGDSAP TAATSLSVPP ASIIERISAS KGLISPMMAR LAAAALKPSA TFDPGSGQQP 

       910        920        930        940        950        960 
TPTKFPQAQV KQEPDSAGTP GPHEASQDRS LDLTVKDPSN ESNGHAVSAN SSLLSSLMNK 

       970        980        990       1000       1010       1020 
MSQGNPSLES FLSIKTEAEG SPAGEPSPFL GKAVKALVQE KLSEPWKVYL RRFGTKDFCD 

      1030       1040       1050       1060       1070       1080 
AQCDFLHKAH FHCVVEECGA LFSTLDGAIK HANFHFRTEG GTAKGTPEAS FPTSAAETKP 

      1090       1100       1110       1120       1130       1140 
PLAPSSLPAP PGTMVAGSSL EGPAPSPVSV PSTPTLLAWK QLASTIPQMP QIPSSVPHLP 

      1150       1160       1170       1180       1190       1200 
TSPLATTSLE SAKPQVKPGF LQFQDNDPCL ATDCKYASKF HFHCLFGNCK YVCKTSGKAE 

      1210       1220       1230       1240       1250       1260 
SHCLDHINPS NSLVNVRDQF AYYSLQCLCP NQHCEFRMRG HYHCLRTGCY FVTNITTKLP 

      1270       1280       1290       1300       1310       1320 
WHIKKHEKAE RRAANGFKYF TKREECGRLG CKYNQVNSHF HCIREGCQFS FLLKHQMTSH 

      1330       1340       1350       1360       1370       1380 
ARKHMRRMLG KNFDRVPPSQ GPPSLMDAET DEGMDYTGCS PGAASSESST MDRSCSSTPV 

      1390       1400       1410       1420       1430       1440 
GNESTAAGNT ISMPTASGAK KRFWIIEDMS PFGKRRKTAS SRKMLDEGMM LEGFRRFDLY 

      1450       1460       1470       1480       1490       1500 
EDCKDTACQF SLKVTHYHCT RENCGYKFCG RTHMYKHAQH HDRVDNLVLD DFKRFKASLS 

      1510       1520       1530       1540       1550       1560 
CHFADCPFSG TSTHFHCLRC RFRCTDSTKV TAHRKHHGKQ DVISAAGFCQ FSSSADCAVP 

      1570       1580       1590       1600       1610       1620 
DCKYKLKCSH FHCTYPGCRH TVVGMSQMDS HKRKHEKQER GEPPAASPGA PVNLDGSLTL 

      1630       1640       1650       1660       1670       1680 
AAEQGSLLFL QTAAAGLGLL GDTGDPGPPV TASGTRDGPA APTPAAAATT TTTTTATATA 

      1690       1700       1710       1720       1730       1740 
TAGESSQEDD EELELPEEEA EDDDEDDDEE DDDDEDDDDD DDDEDLRTDS EESLPEAAGE 

      1750       1760 
AGARTPLAAL GGPGPAPTAA S 

« Hide

Isoform 2 [UniParc].

Checksum: 4804ED3BB3A77BD2
Show »

FASTA1,166125,596

References

« Hide 'large scale' references
[1]"Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S. expand/collapse author list , Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[2]"Prediction of the coding sequences of mouse homologues of KIAA gene: IV. The complete nucleotide sequences of 500 mouse KIAA-homologous cDNAs identified by screening of terminal sequences of cDNA clones randomly sampled from size-fractionated libraries."
Okazaki N., Kikuno R., Ohara R., Inamoto S., Koseki H., Hiraoka S., Saga Y., Seino S., Nishimura M., Kaisho T., Hoshino K., Kitamura H., Nagase T., Ohara O., Koga H.
DNA Res. 11:205-218(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 286-1761 (ISOFORM 2).
Tissue: Pancreatic islet.
[3]"The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J. expand/collapse author list , Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 912-1761 (ISOFORM 2).
Strain: C57BL/6J.
Tissue: Embryonic stem cell.
[4]"Cst, a novel mouse gene related to Drosophila Castor, exhibits dynamic expression patterns during neurogenesis and heart development."
Vacalla C.M.H., Theil T.
Mech. Dev. 118:265-268(2002) [PubMed] [Europe PMC] [Abstract]
Cited for: TISSUE SPECIFICITY, DEVELOPMENTAL STAGE.
[5]"Molecular cloning and characterization of human Castor, a novel human gene upregulated during cell differentiation."
Liu Z., Yang X., Tan F., Cullion K., Thiele C.J.
Biochem. Biophys. Res. Commun. 344:834-844(2006) [PubMed] [Europe PMC] [Abstract]
Cited for: INDUCTION, TISSUE SPECIFICITY.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AL607145 Genomic DNA. Translation: CAM18127.1.
AL611967 Genomic DNA. No translation available.
AK173341 mRNA. Translation: BAD32619.1.
AK010559 mRNA. Translation: BAB27027.1. Frameshift.
IPIIPI00342841.
IPI00347203.
RefSeqNP_081471.2. NM_027195.2.
UniGeneMm.233879.

3D structure databases

ProteinModelPortalQ9CWL2.
SMRQ9CWL2. Positions 550-577, 610-636, 1571-1598.
ModBaseSearch...

PTM databases

PhosphoSiteQ9CWL2.

Proteomic databases

PaxDbQ9CWL2.
PRIDEQ9CWL2.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENSMUST00000094464; ENSMUSP00000092035; ENSMUSG00000028977.
GeneID69743.
KEGGmmu:69743.
UCSCuc008vvl.1. mouse.

Organism-specific databases

CTD54897.
MGIMGI:1196251. Casz1.
RougeSearch...

Phylogenomic databases

eggNOGNOG324200.
GeneTreeENSGT00390000008187.
HOVERGENHBG080122.
InParanoidQ9CWL2.
OMATAEGTRC.

Gene expression databases

ArrayExpressQ9CWL2.
BgeeQ9CWL2.
CleanExMM_CASZ1.
GenevestigatorQ9CWL2.
GermOnlineENSMUSG00000028977. Mus musculus.

Family and domain databases

Gene3D1.25.10.10. 1 hit.
InterProIPR011989. ARM-like.
IPR007087. Znf_C2H2.
IPR015880. Znf_C2H2-like.
[Graphical view]
SMARTSM00355. ZnF_C2H2. 11 hits.
[Graphical view]
PROSITEPS00028. ZINC_FINGER_C2H2_1. 9 hits.
PS50157. ZINC_FINGER_C2H2_2. 3 hits.
[Graphical view]
ProtoNetSearch...

Other

NextBio330234.
SOURCESearch...

Entry information

Entry nameCASZ1_MOUSE
AccessionPrimary (citable) accession number: Q9CWL2
Secondary accession number(s): A2A8A1, Q69Z25
Entry history
Integrated into UniProtKB/Swiss-Prot: May 24, 2004
Last sequence update: July 24, 2007
Last modified: May 1, 2013
This is version 86 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families