Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q8CFC7 (CLASR_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified April 3, 2013. Version 73. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
CLK4-associating serine/arginine rich protein
Alternative name(s):
Clk4-associating SR-related protein
Serine/arginine-rich splicing factor 16
Splicing factor, arginine/serine-rich 16
Suppressor of white-apricot homolog 2
Gene names
Name:Clasrp
Synonyms:Clasp, Sfrs16, Srsf16, Swap2
OrganismMus musculus (Mouse) [Reference proteome]
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus

Protein attributes

Sequence length668 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Probably functions as an alternative splicing regulator. May regulate the mRNA splicing of genes such as CLK1. May act by regulating members of the CLK kinase family.

Subunit structure

Probably interacts with CLK4.

Subcellular location

Nucleus. Note: Located in nuclear dots.

Isoform 2: Nucleusnucleoplasm.

Tissue specificity

Highly expressed in brain. Expressed at intermediate level in lung and liver. In brain, it is expressed in the hippocampus, cerebellum and olfactory bulb.

Post-translational modification

Phosphorylated in vitro by CLK4.

Sequence similarities

Belongs to the splicing factor SR family.

Caution

It is uncertain whether Met-1 or Met-16 is the initiator.

Sequence caution

The sequence AAC82338.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

The sequence BAB26225.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

Ontologies

Keywords
   Biological processmRNA processing
mRNA splicing
   Cellular componentNucleus
   Coding sequence diversityAlternative splicing
   DomainCoiled coil
   PTMPhosphoprotein
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Biological_processRNA splicing

Inferred from electronic annotation. Source: UniProtKB-KW

mRNA processing

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular_componentnucleoplasm

Inferred from electronic annotation. Source: UniProtKB-SubCell

Complete GO annotation...

Alternative products

This entry describes 3 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q8CFC7-1)

Also known as: L;

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q8CFC7-2)

Also known as: S;

The sequence of this isoform differs from the canonical sequence as follows:
     564-588: PKLTPQERLKLRMQKALNRQFKADK → VTQADPTGEAEASDAEGSEPPVQGG
     589-668: Missing.
Isoform 3 (identifier: Q8CFC7-3)

The sequence of this isoform differs from the canonical sequence as follows:
     564-627: PKLTPQERLK...MKERERREKE → VSKNLELARL...TPAGPQVPDS
     628-668: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 668668CLK4-associating serine/arginine rich protein
PRO_0000081946

Regions

Coiled coil579 – 64163 Potential
Compositional bias369 – 667299Arg-rich
Compositional bias377 – 529153Ser-rich

Amino acid modifications

Modified residue2851Phosphoserine By similarity
Modified residue2941Phosphoserine Ref.5
Modified residue3271Phosphothreonine By similarity
Modified residue3311Phosphoserine By similarity
Modified residue3351Phosphoserine By similarity
Modified residue5411Phosphoserine By similarity

Natural variations

Alternative sequence564 – 62764PKLTP…RREKE → VSKNLELARLTPVSSSVQSW ARWACRGPRGRGEPAQRLGK GYPGKLSPIPASHGTPAGPQ VPDS in isoform 3.
VSP_008213
Alternative sequence564 – 58825PKLTP…FKADK → VTQADPTGEAEASDAEGSEP PVQGG in isoform 2.
VSP_008211
Alternative sequence589 – 66880Missing in isoform 2.
VSP_008212
Alternative sequence628 – 66841Missing in isoform 3.
VSP_008214

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 (L) [UniParc].

Last modified April 14, 2009. Version 3.
Checksum: 1A0322702F7A9C40

FASTA66876,825
        10         20         30         40         50         60 
MWHEARKHER KLRGMMVDYK KRAERRREYY EKIKKDPAQF LQVHGRACKV HLDSAVALAA 

        70         80         90        100        110        120 
ESPVNMMPWQ GDTNNMIDRF DVRAHLDHIP DYTPPLLTTI SPEQESDERK CNYERYRGLV 

       130        140        150        160        170        180 
QNDFAGISEE QCLYQIYIDE LYGGLQRPSE DEKKKLAEKK ASIGYTYEDS TVAEVEKVAE 

       190        200        210        220        230        240 
KPEEEESPAE EESNSDEDEV IPDIDVEVDV DELNQEQVAD LNKQATTYGM ADGDFVRMLR 

       250        260        270        280        290        300 
KDKEEAEAIK HAKALEEEKA MYSGRRSRRQ RREFREKRLR GRKISPPSYA RRDSPTYDPY 

       310        320        330        340        350        360 
KRSPSESSSE SRSRSRSPSP GREEKITFIT SFGGSDEEAA AAAAAAAASG AAPGKPPAPP 

       370        380        390        400        410        420 
QTGGPAPGRN ASTRRRSSSS SASRTSSSRS SSRSSSRSRR GYYRSGRHAR SRSRSWSRSR 

       430        440        450        460        470        480 
SRSRRYSRSR SRGRRHSDGG SRDGHRYSRS PARRGGYVPR RRSRSRSRSG DRYKRGARGP 

       490        500        510        520        530        540 
RHHSSSHSRS SWSLSPSRSR SVTRSGSRSQ SRSRSRSQSH SQSQSHSPSP PREKLTRPAA 

       550        560        570        580        590        600 
SPAVGEKLKK TEPAAGKETG AAKPKLTPQE RLKLRMQKAL NRQFKADKKA AQEKMIQQEH 

       610        620        630        640        650        660 
ERQEREDELR AMARKIRMKE RERREKEREE WERQYSRQSR SPSPRYSREY SSSRRRSRSR 


SRSPHYRH 

« Hide

Isoform 2 (S) [UniParc].

Checksum: 6085DFC8C8F7FFEE
Show »

FASTA58865,969
Isoform 3 [UniParc].

Checksum: 2656F106CBD80607
Show »

FASTA62770,289

References

« Hide 'large scale' references
[1]"A transcriptional map in the region of 19q13 derived using direct sequencing and exon trapping."
Yoshiura K., Murray J.C.
Submitted (JAN-1998) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 3).
[2]"The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J. expand/collapse author list , Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
Strain: C57BL/6J.
Tissue: Tongue.
[3]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Strain: C57BL/6.
Tissue: Brain.
[4]"Novel SR-rich-related protein Clasp specifically interacts with inactivated Clk4 and induces the exon EB inclusion of Clk."
Katsu R., Onogi H., Wada K., Kawaguchi Y., Hagiwara M.
J. Biol. Chem. 277:44220-44228(2002) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 16-668 (ISOFORMS 1 AND 2).
Strain: BALB/c.
Tissue: Brain.
[5]"Solid tumor proteome and phosphoproteome analysis by high resolution mass spectrometry."
Zanivan S., Gnad F., Wickstroem S.A., Geiger T., Macek B., Cox J., Faessler R., Mann M.
J. Proteome Res. 7:5314-5326(2008) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-294, MASS SPECTROMETRY.
Tissue: Melanoma.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AF042799 mRNA. Translation: AAC82338.1. Different initiation.
AK009334 mRNA. Translation: BAB26225.1. Different initiation.
BC051916 mRNA. Translation: AAH51916.2.
AB080582 mRNA. Translation: BAC15600.1.
AB080583 mRNA. Translation: BAC15601.1.
IPIIPI00131244.
IPI00172022.
IPI00229282.
RefSeqNP_057889.3. NM_016680.4.
UniGeneMm.20913.

3D structure databases

ModBaseSearch...

PTM databases

PhosphoSiteQ8CFC7.

Proteomic databases

PaxDbQ8CFC7.
PRIDEQ8CFC7.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENSMUST00000086041; ENSMUSP00000083205; ENSMUSG00000061028.
GeneID53609.
KEGGmmu:53609.
UCSCuc009fml.1. mouse.

Organism-specific databases

CTD11129.
MGIMGI:1855695. Clasrp.

Phylogenomic databases

eggNOGNOG316380.
GeneTreeENSGT00700000104518.
HOVERGENHBG065901.
InParanoidQ5RKW4.
KOK13168.
OMAHQLYLEE.
OrthoDBEOG41VK3T.

Gene expression databases

BgeeQ8CFC7.
GenevestigatorQ8CFC7.
GermOnlineENSMUSG00000061028. Mus musculus.

Family and domain databases

InterProIPR019147. SWAP_N_domain.
[Graphical view]
PfamPF09750. DRY_EERY. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

ChiTaRSCLASRP. mouse.
NextBio310315.
SOURCESearch...

Entry information

Entry nameCLASR_MOUSE
AccessionPrimary (citable) accession number: Q8CFC7
Secondary accession number(s): Q5RKW4, Q8CFC8, Q9Z2N3
Entry history
Integrated into UniProtKB/Swiss-Prot: September 19, 2003
Last sequence update: April 14, 2009
Last modified: April 3, 2013
This is version 73 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families