Q8CH02 (SUGP1_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified November 16, 2011. Version 68.


Names and origin

Protein names
   Recommended name: SURP and G-patch domain-containing protein 1
   Alternative name(s): Splicing factor 4
Gene names
   Name: Sugp1
   Synonyms: Sf4
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 643 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Plays a role in pre-mRNA splicing (by similarity).

Subunit structure

Component of the spliceosome (by similarity).

Subcellular location

Nucleus (probable).

Sequence similarities

Contains 1 G-patch domain.

Contains 2 SURP motif repeats.

Ontologies

Keywords
   Biological process: mRNA processing, mRNA splicing
   Cellular component: Nucleus, Spliceosome
   Domain: Repeat
   PTM: Phosphoprotein
   Technical term: 3D-structure, Complete proteome, Reference proteome
Gene Ontology (GO)
   Biological process
      RNA splicing. Inferred from electronic annotation. Source: UniProtKB-KW
      mRNA processing. Inferred from electronic annotation. Source: UniProtKB-KW
   Cellular component
      spliceosomal complex. Inferred from electronic annotation. Source: UniProtKB-KW
   Molecular function
      RNA binding. Inferred from electronic annotation. Source: InterPro

Sequence annotation (Features)

Feature key          Position(s)  Length  Description

Molecule processing

Chain                1 – 643      643     SURP and G-patch domain-containing protein 1 (feature identifier: PRO_0000097702)

Regions

Repeat               187 – 229    43      SURP motif 1
Repeat               262 – 305    44      SURP motif 2
Domain               560 – 607    48      G-patch
Motif                378 – 384    7       Nuclear localization signal (potential)
Compositional bias   324 – 371    48      Pro-rich
Compositional bias   439 – 478    40      Gln/Met-rich

Amino acid modifications

Modified residue     407          1       Phosphoserine (by similarity)
Modified residue     409          1       Phosphoserine (by similarity)
Modified residue     412          1       Phosphoserine (by similarity)
Modified residue     483          1       Phosphoserine. Ref.4

Experimental info

Sequence conflict    326          1       P → L in AAI20920. Ref.3
Sequence conflict    326          1       P → L in AAI20921. Ref.3
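The Length column in the feature table follows from the Position(s) column, because the coordinates are 1-based and inclusive. The following is a minimal sketch of that arithmetic, assuming the rows are transcribed by hand into a Python list; the names `FEATURES`, `feature_length` and `mismatches` are illustrative only and are not part of any UniProt tool.

```python
# Rows transcribed from the feature table: (feature key, start, end, listed length).
FEATURES = [
    ("Chain", 1, 643, 643),
    ("Repeat", 187, 229, 43),
    ("Repeat", 262, 305, 44),
    ("Domain", 560, 607, 48),
    ("Motif", 378, 384, 7),
    ("Compositional bias", 324, 371, 48),
    ("Compositional bias", 439, 478, 40),
]


def feature_length(start: int, end: int) -> int:
    """Length of a residue range given 1-based, inclusive coordinates."""
    return end - start + 1


def mismatches(features):
    """Return the rows whose listed length disagrees with their coordinates."""
    return [row for row in features if feature_length(row[1], row[2]) != row[3]]


if __name__ == "__main__":
    for key, start, end, listed in FEATURES:
        print(f"{key:20s} {start:>3d}-{end:<3d} computed={feature_length(start, end)} listed={listed}")
    print("rows with mismatched length:", mismatches(FEATURES))
```

A check like this is mainly useful for catching transcription errors when a flattened table is rebuilt by hand.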

Secondary structure

[Graphical view of helix, strand and turn positions along residues 1 to 643; not reproduced here.]

Sequences

Sequence: Q8CH02 [UniParc]
Length: 643
Mass (Da): 72,649
Last modified March 1, 2003. Version 1.
Checksum: A1FCD38A26998E78
        10         20         30         40         50         60 
MSLKMDNRDV AGKANRWFGM AQPKSGKMNM NILHQEELIA QKKREIEARM EQKARQSHVP 

        70         80         90        100        110        120 
SPQPPHPGEI ADAHNSCISN KFANDGSFLQ QFLKLQKAQT STDSAPRAPP SMPTPSSLKK 

       130        140        150        160        170        180 
PLVLSKRTGL GLSSPTGPVK NYSHAKQLPV AHRPSVFQSP DDDEEEDYEQ WLEIKVSPPE 

       190        200        210        220        230        240 
GAETRRVIEK LARFVAEGGP ELEKVAMEDY KDNPAFTFLH DKNSREFLYY RRKVAEIRKE 

       250        260        270        280        290        300 
AQKPQAATQK VSPPEDEEAK NLAEKLARFI ADGGPEVETI ALQNNRENQA FSFLYDPNSQ 

       310        320        330        340        350        360 
GYRYYRQKLD EFRKAKAGST GSFPAPAPNP SLRRKSAPEA LSGAVPPITA CPTPVAPAPA 

       370        380        390        400        410        420 
VNPTPSIPGK PTATAAVKRK RKSRWGPEED KVELPPAELA QRDIDASPSP LSVQDLKGLG 

       430        440        450        460        470        480 
YEKGKPVGLV GVTELSDAQK KQLKEQQEMQ QMYDMIMQHK RAMQDMQLLW EKALQQHQHG 

       490        500        510        520        530        540 
YDSDEEVDSE LGTWEHQLRR MEMDKTREWA EQLTQMGRGK HFIGDFLPPD ELEKFMETFK 

       550        560        570        580        590        600 
ALKEGREPDY SEYKEFKLTV ENIGYQMLMK MGWKEGEGLG TEGQGIKNPV NKGATTIDGA 

       610        620        630        640 
GFGIDRPAEL SKEDDEYEAF RKRMMLAYRF RPNPLNNPRR PYY 
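The sequence is displayed in rows of 60 residues, grouped in blocks of 10, with a ruler line above each row. To look up an annotated position, the ruler and spacing have to be stripped first. The following is a minimal sketch, assuming only one row (residues 361 to 420) is pasted in; `clean_sequence`, `residues_at` and `ROW_361_420` are hypothetical names introduced for illustration.

```python
def clean_sequence(block: str) -> str:
    """Drop ruler lines (any line containing digits) and all whitespace."""
    parts = []
    for line in block.splitlines():
        if any(ch.isdigit() for ch in line):
            continue  # position ruler such as "370  380  ..."
        parts.append("".join(line.split()))
    return "".join(parts)


def residues_at(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive).

    fragment_start is the sequence position of the first residue in fragment.
    """
    lo = start - fragment_start
    hi = end - fragment_start + 1
    if lo < 0 or hi > len(fragment) or end < start:
        raise ValueError("requested range lies outside the fragment")
    return fragment[lo:hi]


# One display row copied from the entry; it covers residues 361 to 420.
ROW_361_420 = """
       370        380        390        400        410        420
VNPTPSIPGK PTATAAVKRK RKSRWGPEED KVELPPAELA QRDIDASPSP LSVQDLKGLG
"""

fragment = clean_sequence(ROW_361_420)

if __name__ == "__main__":
    # Nuclear localization signal motif annotated at 378-384.
    print("378-384:", residues_at(fragment, 361, 378, 384))
    # Modified residues annotated as phosphoserine at 407, 409 and 412.
    for pos in (407, 409, 412):
        print(pos, residues_at(fragment, 361, pos, pos))
```

Working from a single row keeps the example short; the same two functions apply unchanged to the full 643-residue block with `fragment_start` set to 1.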


References

[1] "SF4 and SFRS14, two related putative splicing factors on human chromosome 19p13.11."
Sampson N.D., Hewitt J.E.
Gene 305:91-100(2003) [PubMed: 12594045] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
Strain: C57BL/6J.
[2] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed: 16141072] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Strain: NOD.
Tissue: Spleen.
[3] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
[4] "Large-scale phosphorylation analysis of mouse liver."
Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007) [PubMed: 17242355] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-483, MASS SPECTROMETRY.
Tissue: Liver.
[5] "Solution structure of SURP domain in BAB30904."
RIKEN structural genomics initiative (RSGI)
Submitted (AUG-2004) to the PDB data bank
Cited for: STRUCTURE BY NMR OF 165-239.
[6] "Solution structure of SURP domain in splicing factor 4."
RIKEN structural genomics initiative (RSGI)
Submitted (NOV-2005) to the PDB data bank
Cited for: STRUCTURE BY NMR OF 250-314.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ
   AF521129 mRNA. Translation: AAN77124.1.
   AK156508 mRNA. Translation: BAE33738.1.
   BC120919 mRNA. Translation: AAI20920.1.
   BC120920 mRNA. Translation: AAI20921.1.
IPI: IPI00454015.
RefSeq: NP_081757.1. NM_027481.2.
UniGene: Mm.17665.

3D structure databases

PDBe / RCSB PDB / PDBj
   Entry   Method   Resolution (Å)   Chain   Positions
   1UG0    NMR      -                A       165-239
   1X4O    NMR      -                A       250-314
ProteinModelPortal: Q8CH02.
SMR: Q8CH02. Positions 165-241, 250-319.
ModBase: Search...

Protein-protein interaction databases

STRING: Q8CH02.

PTM databases

PhosphoSite: Q8CH02.

Proteomic databases

PRIDE: Q8CH02.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

Ensembl: ENSMUST00000011450; ENSMUSP00000011450; ENSMUSG00000011306.
GeneID: 70616.
KEGG: mmu:70616.
UCSC: uc009lyn.1. mouse.

Organism-specific databases

CTD: 57794.
MGI: MGI:1917866. Sugp1.

Phylogenomic databases

eggNOG: roNOG11907.
GeneTree: ENSGT00410000025695.
HOGENOM: HBG356498.
HOVERGEN: HBG079172.
InParanoid: Q8CH02.
OMA: EKVAMEN.
OrthoDB: EOG4VX24Z.
PhylomeDB: Q8CH02.

Gene expression databases

ArrayExpress: Q8CH02.
Bgee: Q8CH02.
CleanEx: MM_SF4.
Genevestigator: Q8CH02.
GermOnline: ENSMUSG00000011306. Mus musculus.

Family and domain databases

InterPro
   IPR000467. G_patch.
   IPR000061. Surp.
KO: K13096.
Pfam
   PF01585. G-patch. 1 hit.
   PF01805. Surp. 2 hits.
SMART
   SM00443. G_patch. 1 hit.
   SM00648. SWAP. 2 hits.
PROSITE
   PS50174. G_PATCH. 1 hit.
   PS50128. SURP. 2 hits.
ProtoNet: Search...

Other

NextBio: 331984.
SOURCE: Search...

Entry information

Entry name: SUGP1_MOUSE
Accession
   Primary (citable) accession number: Q8CH02
   Secondary accession number(s): Q0VAT9, Q3U0W3, Q8R094
Entry history
   Integrated into UniProtKB/Swiss-Prot: March 15, 2005
   Last sequence update: March 1, 2003
   Last modified: November 16, 2011
   This is version 68 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families