Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q8IX01 (SUGP2_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 79. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (6) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
SURP and G-patch domain-containing protein 2
Alternative name(s):
Arginine/serine-rich-splicing factor 14
Splicing factor, arginine/serine-rich 14
Gene names
Name:SUGP2
Synonyms:KIAA0365, SFRS14
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length1082 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

May play a role in mRNA splicing Potential.

Subcellular location

Nucleus Potential.

Tissue specificity

Detected in adult testis, and in fetal brain and kidney. Ref.1

Post-translational modification

Phosphorylated upon DNA damage, probably by ATM or ATR. Ref.7 Ref.8 Ref.9 Ref.10 Ref.11

Sequence similarities

Contains 1 G-patch domain.

Contains 2 SURP motif repeats.

Sequence caution

The sequence AAC06129.1 differs from that shown. Reason: Erroneous gene model prediction.

The sequence AAH20586.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

The sequence BAA20820.2 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened.

Ontologies

Keywords
   Biological processmRNA processing
mRNA splicing
   Cellular componentNucleus
   Coding sequence diversityAlternative splicing
Polymorphism
   DomainRepeat
   PTMPhosphoprotein
   Technical term3D-structure
Complete proteome
Reference proteome
Gene Ontology (GO)
   Biological processRNA splicing

Inferred from electronic annotation. Source: UniProtKB-KW

mRNA processing

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentnucleus

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionRNA binding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Alternative products

This entry describes 3 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q8IX01-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 3 (identifier: Q8IX01-3)

The sequence of this isoform differs from the canonical sequence as follows:
     890-941: Missing.
     1044-1082: GTPSEGEGLG...QMYRHKRANK → YAAGSLGWEW...LNRRHLQMRS
Note: No experimental confirmation available.
Isoform 4 (identifier: Q8IX01-4)

The sequence of this isoform differs from the canonical sequence as follows:
     890-941: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 10821082SURP and G-patch domain-containing protein 2
PRO_0000097708

Regions

Repeat590 – 63344SURP motif 1
Repeat787 – 83044SURP motif 2
Domain1011 – 105747G-patch
Motif995 – 10006Nuclear localization signal Potential
Compositional bias738 – 77639Pro/Ser-rich
Compositional bias869 – 90537Asp/Glu-rich

Amino acid modifications

Modified residue71Phosphothreonine Ref.9
Modified residue2751Phosphothreonine Ref.10
Modified residue2771Phosphoserine Ref.7 Ref.8 Ref.10 Ref.11
Modified residue3151Phosphoserine Ref.10
Modified residue7731Phosphoserine Ref.11
Modified residue9401Phosphoserine Ref.9

Natural variations

Alternative sequence890 – 94152Missing in isoform 3 and isoform 4.
VSP_013112
Alternative sequence1044 – 108239GTPSE…KRANK → YAAGSLGWEWVGPQSFHLQP AAWLLHSQDGLQLAVDFCFL NRRHLQMRS in isoform 3.
VSP_013113
Natural variant2061G → S. Ref.1 Ref.4 Ref.6
Corresponds to variant rs4808907 [ dbSNP | Ensembl ].
VAR_023711
Natural variant5521M → T.
Corresponds to variant rs10404860 [ dbSNP | Ensembl ].
VAR_051341
Natural variant6491Q → R.
Corresponds to variant rs10414535 [ dbSNP | Ensembl ].
VAR_051342
Natural variant7221Q → R.
Corresponds to variant rs34540303 [ dbSNP | Ensembl ].
VAR_051343
Natural variant8811R → Q.
Corresponds to variant rs35646935 [ dbSNP | Ensembl ].
VAR_051344

Secondary structure

........... 1082
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified October 5, 2010. Version 2.
Checksum: 619AE0AC087761BD

FASTA1,082120,207
        10         20         30         40         50         60 
MAARRITQET FDAVLQEKAK RYHMDASGEA VSETLQFKAQ DLLRAVPRSR AEMYDDVHSD 

        70         80         90        100        110        120 
GRYSLSGSVA HSRDAGREGL RSDVFPGPSF RSSNPSISDD SYFRKECGRD LEFSHSDSRD 

       130        140        150        160        170        180 
QVIGHRKLGH FRSQDWKFAL RGSWEQDFGH PVSQESSWSQ EYSFGPSAVL GDFGSSRLIE 

       190        200        210        220        230        240 
KECLEKESRD YDVDHPGEAD SVLRGGSQVQ ARGRALNIVD QEGSLLGKGE TQGLLTAKGG 

       250        260        270        280        290        300 
VGKLVTLRNV STKKIPTVNR ITPKTQGTNQ IQKNTPSPDV TLGTNPGTED IQFPIQKIPL 

       310        320        330        340        350        360 
GLDLKNLRLP RRKMSFDIID KSDVFSRFGI EIIKWAGFHT IKDDIKFSQL FQTLFELETE 

       370        380        390        400        410        420 
TCAKMLASFK CSLKPEHRDF CFFTIKFLKH SALKTPRVDN EFLNMLLDKG AVKTKNCFFE 

       430        440        450        460        470        480 
IIKPFDKYIM RLQDRLLKSV TPLLMACNAY ELSVKMKTLS NPLDLALALE TTNSLCRKSL 

       490        500        510        520        530        540 
ALLGQTFSLA SSFRQEKILE AVGLQDIAPS PAAFPNFEDS TLFGREYIDH LKAWLVSSGC 

       550        560        570        580        590        600 
PLQVKKAEPE PMREEEKMIP PTKPEIQAKA PSSLSDAVPQ RADHRVVGTI DQLVKRVIEG 

       610        620        630        640        650        660 
SLSPKERTLL KEDPAYWFLS DENSLEYKYY KLKLAEMQRM SENLRGADQK PTSADCAVRA 

       670        680        690        700        710        720 
MLYSRAVRNL KKKLLPWQRR GLLRAQGLRG WKARRATTGT QTLLSSGTRL KHHGRQAPGL 

       730        740        750        760        770        780 
SQAKPSLPDR NDAAKDCPPD PVGPSPQDPS LEASGPSPKP AGVDISEAPQ TSSPCPSADI 

       790        800        810        820        830        840 
DMKTMETAEK LARFVAQVGP EIEQFSIENS TDNPDLWFLH DQNSSAFKFY RKKVFELCPS 

       850        860        870        880        890        900 
ICFTSSPHNL HTGGGDTTGS QESPVDLMEG EAEFEDEPPP REAELESPEV MPEEEDEDDE 

       910        920        930        940        950        960 
DGGEEAPAPG GAGKSEGSTP ADGLPGEAAE DDLAGAPALS QASSGTCFPR KRISSKSLKV 

       970        980        990       1000       1010       1020 
GMIPAPKRVC LIQEPKVHEP VRIAYDRPRG RPMSKKKKPK DLDFAQQKLT DKNLGFQMLQ 

      1030       1040       1050       1060       1070       1080 
KMGWKEGHGL GSLGKGIREP VSVGTPSEGE GLGADGQEHK EDTFDVFRQR MMQMYRHKRA 


NK 

« Hide

Isoform 3 [UniParc].

Checksum: 4C4AF9C44F68DA88
Show »

FASTA1,040116,286
Isoform 4 [UniParc].

Checksum: DBC51336F08F75F9
Show »

FASTA1,030115,186

References

« Hide 'large scale' references
[1]"SF4 and SFRS14, two related putative splicing factors on human chromosome 19p13.11."
Sampson N.D., Hewitt J.E.
Gene 305:91-100(2003) [PubMed: 12594045] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), ALTERNATIVE SPLICING, TISSUE SPECIFICITY, VARIANT SER-206.
[2]"Prediction of the coding sequences of unidentified human genes. VII. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro."
Nagase T., Ishikawa K., Nakajima D., Ohira M., Seki N., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
DNA Res. 4:141-150(1997) [PubMed: 9205841] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Tissue: Brain.
[3]Ohara O., Nagase T., Kikuno R., Nomura N.
Submitted (AUG-2005) to the EMBL/GenBank/DDBJ databases
Cited for: SEQUENCE REVISION.
[4]"The full-ORF clone resource of the German cDNA consortium."
Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A., Wiemann S., Schupp I.
BMC Genomics 8:399-399(2007) [PubMed: 17974005] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 4), VARIANT SER-206.
Tissue: Endometrial tumor and Testis.
[5]"The DNA sequence and biology of human chromosome 19."
Grimwood J., Gordon L.A., Olsen A.S., Terry A., Schmutz J., Lamerdin J.E., Hellsten U., Goodstein D., Couronne O., Tran-Gyamfi M., Aerts A., Altherr M., Ashworth L., Bajorek E., Black S., Branscomb E., Caenepeel S., Carrano A.V. expand/collapse author list , Caoile C., Chan Y.M., Christensen M., Cleland C.A., Copeland A., Dalin E., Dehal P., Denys M., Detter J.C., Escobar J., Flowers D., Fotopulos D., Garcia C., Georgescu A.M., Glavina T., Gomez M., Gonzales E., Groza M., Hammon N., Hawkins T., Haydu L., Ho I., Huang W., Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Larionov V., Leem S.-H., Lopez F., Lou Y., Lowry S., Malfatti S., Martinez D., McCready P.M., Medina C., Morgan J., Nelson K., Nolan M., Ovcharenko I., Pitluck S., Pollard M., Popkie A.P., Predki P., Quan G., Ramirez L., Rash S., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A., She X., Smith D., Slezak T., Solovyev V., Thayer N., Tice H., Tsai M., Ustaszewska A., Vo N., Wagner M., Wheeler J., Wu K., Xie G., Yang J., Dubchak I., Furey T.S., DeJong P., Dickson M., Gordon D., Eichler E.E., Pennacchio L.A., Richardson P., Stubbs L., Rokhsar D.S., Myers R.M., Rubin E.M., Lucas S.M.
Nature 428:529-535(2004) [PubMed: 15057824] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[6]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3), VARIANT SER-206.
Tissue: Duodenum.
[7]"Large-scale characterization of HeLa cell nuclear phosphoproteins."
Beausoleil S.A., Jedrychowski M., Schwartz D., Elias J.E., Villen J., Li J., Cohn M.A., Cantley L.C., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 101:12130-12135(2004) [PubMed: 15302935] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-277, MASS SPECTROMETRY.
Tissue: Cervix carcinoma.
[8]"Phosphoproteome analysis of the human mitotic spindle."
Nousiainen M., Sillje H.H.W., Sauer G., Nigg E.A., Koerner R.
Proc. Natl. Acad. Sci. U.S.A. 103:5391-5396(2006) [PubMed: 16565220] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-277, MASS SPECTROMETRY.
Tissue: Cervix adenocarcinoma.
[9]"ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage."
Matsuoka S., Ballif B.A., Smogorzewska A., McDonald E.R. III, Hurov K.E., Luo J., Bakalarski C.E., Zhao Z., Solimini N., Lerenthal Y., Shiloh Y., Gygi S.P., Elledge S.J.
Science 316:1160-1166(2007) [PubMed: 17525332] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-7 AND SER-940, MASS SPECTROMETRY.
Tissue: Embryonic kidney.
[10]"A quantitative atlas of mitotic phosphorylation."
Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008) [PubMed: 18669648] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-275; SER-277 AND SER-315, MASS SPECTROMETRY.
Tissue: Cervix carcinoma.
[11]"Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach."
Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., Mohammed S.
Anal. Chem. 81:4493-4501(2009) [PubMed: 19413330] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-277 AND SER-773, MASS SPECTROMETRY.
Tissue: Embryonic kidney.
[12]"Solution structure of SURP domain in SFRS14 protein."
RIKEN structural genomics initiative (RSGI)
Submitted (NOV-2005) to the PDB data bank
Cited for: STRUCTURE BY NMR OF 587-639.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AF518874 mRNA. Translation: AAN77117.1.
AB002363 mRNA. Translation: BAA20820.2. Different initiation.
BX647813 mRNA. Translation: CAI46012.1.
AL832488 mRNA. Translation: CAI46117.1.
AC004447 Genomic DNA. Translation: AAC06129.1. Sequence problems.
BC020586 mRNA. Translation: AAH20586.1. Different initiation.
IPIIPI00158020.
IPI00293162.
IPI00554438.
PIRT03030.
RefSeqNP_001017392.2. NM_001017392.3.
NP_055699.2. NM_014884.3.
UniGeneHs.515271.

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
1X4PNMR-A587-639[»]
ProteinModelPortalQ8IX01.
SMRQ8IX01. Positions 589-639, 778-841.
ModBaseSearch...

Protein-protein interaction databases

IntActQ8IX01. 5 interactions.
MINTMINT-2867991.
STRINGQ8IX01.

PTM databases

PhosphoSiteQ8IX01.

Polymorphism databases

DMDM308153496.

Proteomic databases

PRIDEQ8IX01.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000337018; ENSP00000337926; ENSG00000064607.
ENST00000392342; ENSP00000376153; ENSG00000064607.
ENST00000452918; ENSP00000389380; ENSG00000064607.
GeneID10147.
KEGGhsa:10147.
UCSCuc002nkx.1. human.

Organism-specific databases

CTD10147.
GeneCardsGC19M019103.
HGNCHGNC:18641. SUGP2.
HPAHPA041557.
MIM607993. gene.
neXtProtNX_Q8IX01.
PharmGKBPA134983967.
HUGESearch...
GenAtlasSearch...

Phylogenomic databases

eggNOGprNOG17702.
GeneTreeENSGT00410000025695.
HOGENOMHBG402969.
HOVERGENHBG079176.
InParanoidQ8IX01.
OMAETTNSLC.
OrthoDBEOG4QFWCG.
PhylomeDBQ8IX01.

Gene expression databases

ArrayExpressQ8IX01.
BgeeQ8IX01.
CleanExHS_SFRS14.
GenevestigatorQ8IX01.
GermOnlineENSG00000064607. Homo sapiens.

Family and domain databases

InterProIPR000467. G_patch.
IPR000061. Surp.
[Graphical view]
KOK13166.
PfamPF01585. G-patch. 1 hit.
PF01805. Surp. 1 hit.
[Graphical view]
SMARTSM00443. G_patch. 1 hit.
SM00648. SWAP. 2 hits.
[Graphical view]
PROSITEPS50174. G_PATCH. 1 hit.
PS50128. SURP. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio38397.
SOURCESearch...

Entry information

Entry nameSUGP2_HUMAN
AccessionPrimary (citable) accession number: Q8IX01
Secondary accession number(s): C9JI71 expand/collapse secondary AC list , O15071, O60369, Q5JPH7, Q8WUF7
Entry history
Integrated into UniProtKB/Swiss-Prot: March 15, 2005
Last sequence update: October 5, 2010
Last modified: January 25, 2012
This is version 79 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 19

Human chromosome 19: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families