Protein

Symplekin

Gene

Sym

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed. Annotation score: 4 out of 5. Experimental evidence at protein level.

Function

Component of a protein complex required for cotranscriptional processing of the 3'-ends of polyadenylated and histone pre-mRNAs. (2 publications)

GO - Molecular function

  • DNA binding (Source: FlyBase)
  • RNA binding (Source: UniProtKB-KW)

GO - Biological process

  • mRNA 3'-end processing by stem-loop binding and cleavage (Source: FlyBase)
  • mRNA cleavage (Source: FlyBase)
  • mRNA polyadenylation (Source: FlyBase)

Keywords - Biological process

mRNA processing

Keywords - Ligand

RNA-binding

Names & Taxonomy

Protein names
Recommended name: Symplekin (Imported)
Gene names
Name: Sym
ORF names: CG2097
Organism: Drosophila melanogaster (Fruit fly)
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora
Proteomes
  • UP000000803. Component: Chromosome 3R

Organism-specific databases

FlyBase: FBgn0037371. Sym.

Subcellular location

GO - Cellular component

  • histone locus body (Source: UniProtKB)
  • mRNA cleavage stimulating factor complex (Source: FlyBase)
  • nucleus (Source: FlyBase)

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key  Position(s)  Length  Description  Feature identifier
Chain        1 – 1165     1165    Symplekin    PRO_0000421978

Proteomic databases

PaxDb: Q8MSU4.
PRIDE: Q8MSU4.

Expression

Gene expression databases

Genevisible: Q8MSU4. DM.

Interaction

Subunit structure

Interacts with Cpsf73 and Cpsf100, forming a core cleavage factor required for both polyadenylated and histone mRNA processing. Interacts with Slbp and Lsm11. (1 publication)

Protein-protein interaction databases

BioGrid: 65914. 3 interactions.
IntAct: Q8MSU4. 12 interactions.
MINT: MINT-754132.
STRING: 7227.FBpp0078372.

Structure

Secondary structure

Feature key  Position(s)  Length  Evidence
Helix        22 – 37      16      Combined sources
Helix        42 – 56      15      Combined sources
Turn         57 – 60      4       Combined sources
Helix        61 – 67      7       Combined sources
Helix        68 – 72      5       Combined sources
Helix        73 – 75      3       Combined sources
Helix        80 – 96      17      Combined sources
Helix        98 – 103     6       Combined sources
Helix        105 – 111    7       Combined sources
Helix        117 – 140    24      Combined sources
Beta strand  141 – 143    3       Combined sources
Helix        146 – 164    19      Combined sources
Helix        165 – 167    3       Combined sources
Helix        171 – 187    17      Combined sources
Beta strand  193 – 195    3       Combined sources
Helix        204 – 206    3       Combined sources
Helix        216 – 234    19      Combined sources
Helix        241 – 257    17      Combined sources
Helix        259 – 261    3       Combined sources
Helix        262 – 274    13      Combined sources
Helix        282 – 300    19      Combined sources
Helix        303 – 308    6       Combined sources
Helix        309 – 318    10      Combined sources
Helix        323 – 327    5       Combined sources
Helix        335 – 348    14      Combined sources

3D structure databases

Entry  Method  Resolution (Å)  Chain  Positions
3GS3   X-ray   2.40            A      19-270
4IMI   X-ray   2.35            A/C    19-351
4IMJ   X-ray   2.58            A/C    19-351
4YGX   X-ray   2.95            A/C    19-351

ProteinModelPortal: Q8MSU4.
SMR: Q8MSU4. Positions 19-350.
ModBase: Search...
MobiDB: Search...

Miscellaneous databases

EvolutionaryTrace: Q8MSU4.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Length  Description  Evidence
Repeat       23 – 58      36      HEAT 1       1 publication
Repeat       61 – 95      35      HEAT 2       1 publication
Repeat       98 – 140     43      HEAT 3       1 publication
Repeat       147 – 186    40      HEAT 4       1 publication
Repeat       218 – 257    40      HEAT 5       1 publication
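
The Length column in the feature table above follows from the 1-based, inclusive residue positions (length = end - start + 1). The following is a minimal sketch of that arithmetic for the five HEAT repeats. The names `HEAT_REPEATS`, `feature_length`, `repeat_lengths` and `gap_lengths` are illustrative assumptions and are not part of the entry or of any UniProt tool.

```python
# HEAT repeat boundaries copied from the feature table of this entry
# (1-based, inclusive residue positions).
HEAT_REPEATS = {
    "HEAT 1": (23, 58),
    "HEAT 2": (61, 95),
    "HEAT 3": (98, 140),
    "HEAT 4": (147, 186),
    "HEAT 5": (218, 257),
}


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def repeat_lengths(repeats: dict) -> dict:
    """Map each repeat name to its length in residues."""
    return {name: feature_length(s, e) for name, (s, e) in repeats.items()}


def gap_lengths(repeats: dict) -> list:
    """Number of residues between consecutive repeats (hypothetical helper)."""
    spans = sorted(repeats.values())
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(spans, spans[1:])]


if __name__ == "__main__":
    for name, length in repeat_lengths(HEAT_REPEATS).items():
        print(f"{name}: {length} residues")
    print("Residues between consecutive repeats:", gap_lengths(HEAT_REPEATS))
```

The computed lengths can be compared directly with the Length column; the gap helper simply shows how many residues separate one annotated repeat from the next.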

Domain

The HEAT repeats have been determined based on 3D-structure analysis and are not detected by sequence-based prediction programs. (1 publication)

Sequence similarities

Belongs to the Symplekin family. (Curated)
Contains 5 HEAT repeats. (1 publication)

Keywords - Domain

Coiled coil, Repeat

Phylogenomic databases

eggNOG: KOG1895. Eukaryota.
    ENOG410XQAS. LUCA.
GeneTree: ENSGT00390000017045.
HOGENOM: HOG000247061.
InParanoid: Q8MSU4.
KO: K06100.
OMA: NDGIRTN.
OrthoDB: EOG7RZ5PC.
PhylomeDB: Q8MSU4.

Family and domain databases

Gene3D: 1.25.10.10. 4 hits.
InterPro: IPR011989. ARM-like.
    IPR016024. ARM-type_fold.
    IPR021850. Symplekin/Pta1.
    IPR032460. Symplekin/Pta1_N.
    IPR022075. Symplekin_C.
PANTHER: PTHR15245:SF20. PTHR15245:SF20. 1 hit.
Pfam: PF11935. DUF3453. 1 hit.
    PF12295. Symplekin_C. 1 hit.
SUPFAM: SSF48371. SSF48371. 7 hits.

Sequence

Sequence status: Complete.

Q8MSU4-1 [UniParc]

        10         20         30         40         50
MDSIIGRSQF VSETANLFTD EKTATARAKV VDWCNELVIA SPSTKCELLA
        60         70         80         90        100
KVQETVLGSC AELAEEFLES VLSLAHDSNM EVRKQVVAFV EQVCKVKVEL
       110        120        130        140        150
LPHVINVVSM LLRDNSAQVI KRVIQACGSI YKNGLQYLCS LMEPGDSAEQ
       160        170        180        190        200
AWNILSLIKA QILDMIDNEN DGIRTNAIKF LEGVVVLQSF ADEDSLKRDG
       210        220        230        240        250
DFSLADVPDH CTLFRREKLQ EEGNNILDIL LQFHGTTHIS SVNLIACTSS
       260        270        280        290        300
LCTIAKMRPI FMGAVVEAFK QLNANLPPTL TDSQVSSVRK SLKMQLQTLL
       310        320        330        340        350
KNRGAFEFAS TIRGMLVDLG SSTNEIQKLI PKMDKQEMAR RQKRILENAA
       360        370        380        390        400
QSLAKRARLA CEQQDQQQRE MELDTEELER QKQKSTRVNE KFLAEHFRNP
       410        420        430        440        450
ETVVTLVLEF LPSLPTEVPQ KFLQEYTPIR EMSIQQQVTN ISRFFGEQLS
       460        470        480        490        500
EKRLGPGAAT FSREPPMRVK KVQAIESTLT AMEVDEDAVQ KLSEEEFQRK
       510        520        530        540        550
EEATKKLRET MERAKGEQTV IEKMKERAKT LKLQEITKPL PRNLKEKFLT
       560        570        580        590        600
DAVRRILNSE RQCIKGGVSS KRRKLVTVIA ATFPDNVRYG IMEFILEDIK
       610        620        630        640        650
QRIDLAFSWL FEEYSLLQGF TRHTYVKTEN RPDHAYNELL NKLIFGIGER
       660        670        680        690        700
CDHKDKIILI RRVYLEAPIL PEVSIGHLVQ LSLDDEFSQH GLELIKDLAV
       710        720        730        740        750
LRPPRKNRFV RVLLNFSVHE RLDLRDLAQA HLVSLYHVHK ILPARIDEFA
       760        770        780        790        800
LEWLKFIEQE SPPAAVFSQD FGRPTEEPDW REDTTKVCFG LAFTLLPYKP
       810        820        830        840        850
EVYLQQICQV FVSTSAELKR TILRSLDIPI KKMGVESPTL LQLIEDCPKG
       860        870        880        890        900
METLVIRIIY ILTERVPSPH EELVRRVRDL YQNKVKDVRV MIPVLSGLTR
       910        920        930        940        950
SELISVLPKL IKLNPAVVKE VFNRLLGIGA EFAHQTMAMT PTDILVALHT
       960        970        980        990       1000
IDTSVCDIKA IVKATSLCLA ERDLYTQEVL MAVLQQLVEV TPLPTLMMRT
      1010       1020       1030       1040       1050
TIQSLTLYPR LANFVMNLLQ RLIIKQVWRQ KVIWEGFLKT VQRLKPQSMP
      1060       1070       1080       1090       1100
ILLHLPPAQL VDALQQCPDL RPALSEYAES MQDEPMNGSG ITQQVLDIIS
      1110       1120       1130       1140       1150
GKSVDVFVTD ESGGYISAEH IKKEAPDPSE ISVISTVPVL TSLVPLPVPP
      1160
PIGSDLNQPL PPGED
Length: 1,165
Mass (Da): 132,077
Last modified: October 1, 2002 - v1
Checksum: CFA818C50B2CC847

Sequence databases

EMBL / GenBank / DDBJ:
    AE014297 Genomic DNA. Translation: AAF51962.2.
    AY118592 mRNA. Translation: AAM49961.1.
RefSeq: NP_649580.1. NM_141323.2.
UniGene: Dm.31227.

Genome annotation databases

EnsemblMetazoa: FBtr0078723; FBpp0078372; FBgn0037371.
GeneID: 40709.
KEGG: dme:Dmel_CG2097.
UCSC: CG2097-RA. d. melanogaster.

Cross-references

Protocols and materials databases

Structural Biology Knowledgebase: Search...

Organism-specific databases

CTD: 40709.
FlyBase: FBgn0037371. Sym.

Miscellaneous databases

EvolutionaryTrace: Q8MSU4.
GenomeRNAi: 40709.
NextBio: 820175.
PRO: Q8MSU4.

Family and domain databases

ProtoNet: Search...

Publications

  1. "The genome sequence of Drosophila melanogaster."
    Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.
    Science 287:2185-2195(2000)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: Berkeley.
  2. Cited for: GENOME REANNOTATION.
    Strain: Berkeley.
  3. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Strain: Berkeley (1 publication).
    Tissue: Embryo (1 publication).
  4. "A genome-wide RNA interference screen reveals that variant histones are necessary for replication-dependent histone pre-mRNA processing."
    Wagner E.J., Burch B.D., Godfrey A.C., Salzler H.R., Duronio R.J., Marzluff W.F.
    Mol. Cell 28:692-699(2007)
    Cited for: FUNCTION, SUBCELLULAR LOCATION.
  5. "A core complex of CPSF73, CPSF100, and Symplekin may form two different cleavage factors for processing of poly(A) and histone mRNAs."
    Sullivan K.D., Steiniger M., Marzluff W.F.
    Mol. Cell 34:322-332(2009)
    Cited for: FUNCTION, INTERACTION WITH CPSF73; CPSF100; SLBP AND LSM11.
  6. "Crystal structure of the HEAT domain from the Pre-mRNA processing factor Symplekin."
    Kennedy S.A., Frazier M.L., Steiniger M., Mast A.M., Marzluff W.F., Redinbo M.R.
    J. Mol. Biol. 392:115-128(2009)
    Cited for: X-RAY CRYSTALLOGRAPHY (2.40 ANGSTROMS) OF 19-270, HEAT REPEATS.

Entry information

Entry name: SYMPK_DROME
Accession: Primary (citable) accession number: Q8MSU4
Secondary accession number(s): Q9VNH4
Entry history
Integrated into UniProtKB/Swiss-Prot: April 3, 2013
Last sequence update: October 1, 2002
Last modified: May 11, 2016
This is version 113 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Miscellaneous

Keywords - Technical term

3D-structure, Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
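
The UniRef90 and UniRef50 descriptions above amount to a two-part membership test against a cluster's seed sequence: a minimum sequence identity and a minimum overlap. The sketch below only illustrates that test and does not reproduce the actual UniRef clustering pipeline. The `Alignment` class, the `THRESHOLDS` table and the `joins_cluster` function are hypothetical names, and representing identity and overlap as fractions between 0 and 1 is an assumption made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Comparison of a candidate sequence with a cluster seed (hypothetical type)."""
    identity: float  # assumed: fraction of identical residues, 0.0 to 1.0
    overlap: float   # assumed: fraction of the seed sequence covered, 0.0 to 1.0


# Thresholds taken from the UniRef descriptions in this entry:
# (minimum identity, minimum overlap with the seed sequence).
THRESHOLDS = {
    "UniRef90": (0.90, 0.80),
    "UniRef50": (0.50, 0.80),
}


def joins_cluster(aln: Alignment, level: str) -> bool:
    """Return True if the alignment meets both thresholds for the given level."""
    min_identity, min_overlap = THRESHOLDS[level]
    return aln.identity >= min_identity and aln.overlap >= min_overlap


if __name__ == "__main__":
    candidate = Alignment(identity=0.62, overlap=0.85)
    for level in THRESHOLDS:
        print(level, joins_cluster(candidate, level))
```

In this sketch a candidate with 62% identity and 85% overlap would qualify at the UniRef50 level but not at UniRef90, because both conditions must hold at once.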