Protein

Symplekin

Gene

Sympk

Organism
Mus musculus (Mouse)
Status
Reviewed - Annotation score: 3 out of 5 - Experimental evidence at protein level

Function

Scaffold protein that functions as a component of a multimolecular complex involved in histone mRNA 3'-end processing. Specific component of the tight junction (TJ) plaque, but might not be an exclusively junctional component. May have a housekeeping role. Is involved in pre-mRNA polyadenylation. Enhances SSU72 phosphatase activity (By similarity).

Keywords - Biological process

Cell adhesion, mRNA processing

Names & Taxonomy

Protein names
Recommended name: Symplekin
Gene names
Name: Sympk
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Unplaced

Organism-specific databases

MGI: MGI:1915438. Sympk.

Subcellular location

Keywords - Cellular component

Cell junction, Cell membrane, Cytoplasm, Cytoskeleton, Membrane, Nucleus, Tight junction

PTM / Processing

Molecule processing

Feature key              Position(s)  Description  Length
Chain (PRO_0000072386)   1 – 1284     Symplekin    1284

Amino acid modifications

Feature key       Position(s)  Description                                                           Evidence          Length
Modified residue  13           Phosphoserine                                                         By similarity     1
Cross-link        361          Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO1)  By similarity
Cross-link        361          Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)  By similarity
Cross-link        483          Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)  By similarity
Modified residue  494          Phosphoserine                                                         By similarity     1
Modified residue  1234         Phosphoserine                                                         By similarity     1
Modified residue  1235         Phosphoserine                                                         By similarity     1
Cross-link        1252         Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO1)  By similarity
Modified residue  1256         Phosphoserine                                                         Combined sources  1
Modified residue  1270         Phosphothreonine                                                      By similarity     1
Modified residue  1272         Phosphoserine                                                         By similarity     1
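
The annotated positions can be sanity-checked against the sequence given later in this entry. The following is a minimal sketch, not part of the UniProt record: it assumes the C-terminal fragment (residues 1231 to 1284) copied from the sequence block, and the names `C_TERM`, `C_TERM_START` and `residue_at` are hypothetical helpers introduced for illustration. It checks that each phosphoserine position holds S, the phosphothreonine holds T, and the SUMO cross-link holds K.

```python
# Residues 1231-1284 of Q80X82, copied from the sequence block of this entry.
C_TERM_START = 1231
C_TERM = "LLDSSLEGPLPKEAAAVGSSSKDERSPQNLSHAVEEALKTSSPETREPESKGNS"

# Annotated position -> residue expected from the modification type
# (phosphoserine = S, phosphothreonine = T, Lys-Gly cross-link = K).
EXPECTED = {1234: "S", 1235: "S", 1252: "K", 1256: "S", 1270: "T", 1272: "S"}


def residue_at(position: int) -> str:
    """Return the residue at a 1-based sequence position inside the fragment."""
    return C_TERM[position - C_TERM_START]


# Collect any annotated position whose residue does not match the expectation.
mismatches = {
    pos: (expected, residue_at(pos))
    for pos, expected in EXPECTED.items()
    if residue_at(pos) != expected
}
print(mismatches or "all annotated C-terminal sites match the sequence")
```

Only the C-terminal sites are covered here because the fragment is short enough to check by eye; the same offset arithmetic applies to any other stretch of the sequence.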

Keywords - PTM

Isopeptide bond, Phosphoprotein, Ubl conjugation

Proteomic databases

EPD: Q80X82.
MaxQB: Q80X82.
PaxDb: Q80X82.
PeptideAtlas: Q80X82.
PRIDE: Q80X82.

PTM databases

iPTMnet: Q80X82.
PhosphoSitePlus: Q80X82.

Expression

Gene expression databases

Bgee: ENSMUSG00000023118.
CleanEx: MM_SYMPK.

Interaction

Subunit structure

Found in a heat-sensitive complex composed at least of several cleavage and polyadenylation specificity factors and cleavage stimulation factors. Interacts with CPSF2, CPSF3 and CSTF2. Interacts with HSF1 in heat-stressed cells. Interacts with SSU72 (By similarity).

Protein-protein interaction databases

IntAct: Q80X82. 1 interactor.
STRING: 10090.ENSMUSP00000023882.

Structure

3D structure databases

ProteinModelPortal: Q80X82.
SMR: Q80X82.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description  Length
Repeat       31 – 64      HEAT 1       34
Repeat       67 – 101     HEAT 2       35
Repeat       104 – 146    HEAT 3       43
Repeat       153 – 192    HEAT 4       40
Repeat       227 – 266    HEAT 5       40
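
The Length column follows from the positions, which are 1-based and inclusive, so a feature spanning start to end covers end - start + 1 residues. A short sketch of that arithmetic, with the coordinates copied from the table above and the variable names chosen only for illustration:

```python
# HEAT repeat coordinates as listed in this entry (1-based, inclusive).
heat_repeats = {
    "HEAT 1": (31, 64),
    "HEAT 2": (67, 101),
    "HEAT 3": (104, 146),
    "HEAT 4": (153, 192),
    "HEAT 5": (227, 266),
}

# Inclusive coordinates: both endpoints count, hence the +1.
lengths = {name: end - start + 1 for name, (start, end) in heat_repeats.items()}

for name, length in lengths.items():
    print(name, length)
```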

Region

Feature key  Position(s)  Description            Evidence       Length
Region       1 – 124      Interaction with HSF1  By similarity  124

Motif

Feature key  Position(s)  Description                  Evidence           Length
Motif        345 – 360    Nuclear localization signal  Sequence analysis  16
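
To see which residues the motif annotation refers to, the coordinates can be applied to the sequence given later in this entry. This is an illustrative sketch only: `feature_slice` is a hypothetical helper, and the fragment is residues 341 to 360 copied from the sequence block. It converts the 1-based inclusive positions into a Python slice.

```python
def feature_slice(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment.

    fragment_start is the sequence position of the fragment's first residue.
    """
    return fragment[start - fragment_start : end - fragment_start + 1]


# Residues 341-360 of Q80X82, copied from the sequence block of this entry.
fragment = "SSKDSRKRPRDDTDSTLKKM"

nls = feature_slice(fragment, fragment_start=341, start=345, end=360)
print(nls, len(nls))
```

The extracted stretch is 16 residues long, matching the Length column, and is rich in the basic residues K and R.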

Domain

The HEAT repeats have been determined based on 3D-structure analysis of the D. melanogaster ortholog and are not detected by sequence-based prediction programs.

Sequence similarities

Belongs to the Symplekin family. (Curated)
Contains 5 HEAT repeats. (Curated)

Keywords - Domain

Repeat

Phylogenomic databases

eggNOG: KOG1895. Eukaryota.
  ENOG410XQAS. LUCA.
HOGENOM: HOG000046745.
HOVERGEN: HBG062441.
InParanoid: Q80X82.
PhylomeDB: Q80X82.

Family and domain databases

Gene3D: 1.25.10.10. 2 hits.
InterPro: IPR011989. ARM-like.
  IPR016024. ARM-type_fold.
  IPR021850. Symplekin/Pta1.
  IPR032460. Symplekin/Pta1_N.
  IPR022075. Symplekin_C.
PANTHER: PTHR15245:SF20. PTHR15245:SF20. 1 hit.
Pfam: PF11935. DUF3453. 1 hit.
  PF12295. Symplekin_C. 1 hit.
SUPFAM: SSF48371. SSF48371. 3 hits.

Sequence

Sequence status: Complete.

Q80X82-1

        10         20         30         40         50
MASSSGDSVT RRSVASQFFT QEEGPSIDGM TTSERVVDLL NQAALITNDS
        60         70         80         90        100
KITVLKQVQE LIINKDPTLL DNFLDEIIAF QADKSIEVRK FVIGFIEEAC
       110        120        130        140        150
KRDIELLLKL IANLNMLLRD ENVNVVKKAI LTMTQLYKVA LQWMVKSRVI
       160        170        180        190        200
SDLQEACWDM VSSMAGEIIL LLDSDNDGIR THAIKFVEGL IVTLSPRMAD
       210        220        230        240        250
SEVPRRQEHD ISLDRIPRDH PYIQYNVLWE EGKAAVEQLL KFMVHPAISS
       260        270        280        290        300
INLTTALGSL ANIARQRPMF MSEVIQAYET LHANLPPTLA KSQVSSVRKN
       310        320        330        340        350
LKLHLLSVLK HPASLEFQAQ ITTLLVDLGT PQAEIARNMP SSKDSRKRPR
       360        370        380        390        400
DDTDSTLKKM KLEPNLGEDD EDKDLEPGPS GTSKASAQIS GQSDTDITAE
       410        420        430        440        450
FLQPLLTPDN VANLVLISMV YLPETMPASF QAIYTPVESA GTEAQIKHLA
       460        470        480        490        500
RLMATQMTAA GLGPGVEQTK QCKEEPKEEK VVKPESVLIK RRLSVQGQAI
       510        520        530        540        550
SVVGSQSTMS PLEEEVPQAK RRPEPIIPVT QPRLAGAGGR KKIFRLSDVL
       560        570        580        590        600
KPLTDAQVEA MKLGAVKRIL RAEKAVACSG AAQVRIKILA SLVTQFDSGF
       610        620        630        640        650
KAEVLSFILE DVRARLDLAF AWLYQEYNAY LAAGTSGTLD KYEDCLICLL
       660        670        680        690        700
SGLQEKPDQK DGIFTKVVLE APLITESALE VIRKYCEDES RAYLGMSTLG
       710        720        730        740        750
DLIFKRPSRQ FQYLHVLLDL SSHEKDRVRS QALLFIKRMY EKEQLREYVE
       760        770        780        790        800
KFALNYLQLL VHPNPPSVLF GADKDTEVAA PWTEETVKQC LYLYLALLPQ
       810        820        830        840        850
NHKLIHELAA VYTEAIADIK RTVLRVIEQP IRGMGMNSPE LLLLVENCPK
       860        870        880        890        900
GAETLVTRCL HSLTDKVPPS PELVKRVRDL YHKRLPDVRF LIPVLNGLEK
       910        920        930        940        950
KEVIQALPKL IKLNPIVVKE VFNRLLGTQH GEGNSALSPL NPGELLIALH
       960        970        980        990       1000
NIDSVKCDMK SIIKATNLCF AERNVYTSEV LAVVMQQLME QSPLPMLLMR
      1010       1020       1030       1040       1050
TVIQSLTMYP RLGGFVMNIL ARLIMKQVWK YPKVWEGFIK CCQRTKPQSF
      1060       1070       1080       1090       1100
QVILQLPPQQ LGAVFDKCPE LREPLLAHVR SFTPHQQAHI PNSIMTILEA
      1110       1120       1130       1140       1150
TGKQEPEVKE APSGPLEEDD LEPLALALAP APAPAPAPAP APRPPQDLIG
      1160       1170       1180       1190       1200
LRLAQEKALK RQLEEEQKQK PTGIGAPAAC VSSTPSVPAA ARAGPTPAEE
      1210       1220       1230       1240       1250
VMEYREEGPE CETPAIFISM DDDSGLAETT LLDSSLEGPL PKEAAAVGSS
      1260       1270       1280
SKDERSPQNL SHAVEEALKT SSPETREPES KGNS
Length: 1,284
Mass (Da): 142,284
Last modified: June 1, 2003 - v1
Checksum: 983FA5300374DC5B
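
The sequence above is laid out in groups of ten residues with ruler lines of position numbers, which most tools cannot read directly. The sketch below shows one way to flatten such a block into a plain residue string. It is an assumption-laden illustration rather than an official UniProt parser: `parse_sequence_block` is a hypothetical helper, and the excerpt holds only the first 100 residues copied from this entry.

```python
def parse_sequence_block(text: str) -> str:
    """Flatten a grouped sequence block into one residue string.

    Lines made up only of numbers are treated as position rulers and skipped;
    spaces between the ten-residue groups are removed.
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue  # blank line or ruler line
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the sequence block (residues 1-100).
excerpt = """\
        10         20         30         40         50
MASSSGDSVT RRSVASQFFT QEEGPSIDGM TTSERVVDLL NQAALITNDS
        60         70         80         90        100
KITVLKQVQE LIINKDPTLL DNFLDEIIAF QADKSIEVRK FVIGFIEEAC
"""

sequence = parse_sequence_block(excerpt)
print(len(sequence))
print(sequence[13 - 1])  # residue at 1-based position 13
```

Applied to the full block, the resulting string should have the stated length of 1,284 residues, which makes a convenient check that nothing was lost in copying.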

Sequence databases

EMBL / GenBank / DDBJ: BC049852 mRNA. Translation: AAH49852.1.
UniGene: Mm.130902.

Genome annotation databases

UCSC: uc009fkg.2. mouse.

Cross-references

Miscellaneous databases

PRO: Q80X82.

Entry information

Entry name: SYMPK_MOUSE
Accession: Primary (citable) accession number: Q80X82
Entry history
Integrated into UniProtKB/Swiss-Prot: July 19, 2005
Last sequence update: June 1, 2003
Last modified: November 2, 2016
This is version 102 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
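
The UniRef90 and UniRef50 criteria stated above combine two thresholds: a minimum sequence identity and a minimum overlap with the seed sequence. The sketch below only restates those thresholds as a predicate. It does not reproduce the actual clustering procedure, and the function name and its inputs (identity and overlap as fractions, assumed to come from a prior alignment) are hypothetical.

```python
# Minimum identity per cluster level, as stated in the descriptions above.
MIN_IDENTITY = {90: 0.90, 50: 0.50}
# Minimum overlap with the seed (longest) sequence, the same for both levels.
MIN_OVERLAP = 0.80


def meets_uniref_criteria(identity: float, overlap: float, level: int) -> bool:
    """Check a member-to-seed comparison against the stated cutoffs.

    identity and overlap are fractions between 0 and 1; level is 90 or 50.
    """
    return identity >= MIN_IDENTITY[level] and overlap >= MIN_OVERLAP


print(meets_uniref_criteria(identity=0.93, overlap=0.85, level=90))
print(meets_uniref_criteria(identity=0.93, overlap=0.70, level=90))
print(meets_uniref_criteria(identity=0.62, overlap=0.85, level=50))
```

A sequence that is highly similar but aligns over too little of the seed fails the overlap test, which is why the second call returns False despite 93% identity.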