Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot Q9VK33 (SMBT_DROME)

Last modified June 16, 2009. Version 63. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Binary interactions · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Polycomb protein Sfmbt
Alternative name(s):
    Scm-like with four MBT domain-containing protein 1
    dSfmbt
Gene names
Name: Sfmbt
ORF Names: CG16975
OrganismDrosophila melanogaster (Fruit fly) [Complete proteome]
Taxonomic identifier7227 [NCBI]
Taxonomic lineageEukaryotaMetazoaArthropodaHexapodaInsectaPterygotaNeopteraEndopterygotaDipteraBrachyceraMuscomorphaEphydroideaDrosophilidaeDrosophilaSophophora

Protein attributes

Sequence length1220 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Polycomb group (PcG) protein that binds to the Polycomb response elements (PREs) found in the regulatory regions of many genes. PcG proteins act by forming multiprotein complexes, which are required to maintain the transcriptionally repressive state of homeotic genes throughout development. PcG proteins are not required to initiate repression, but to maintain it during later stages of development. They probably act via the methylation of histones, rendering chromatin heritably changed in its expressibility. Necessary but not sufficient to recruit a functional PcG repressive complex that represses target genes, suggesting that the recruitment of the distinct PRC1 complex is also required to allow a subsequent repression. Ref.4

Subunit structure

Interacts with pho as a component of the pho-repressive complex (PhoRC). Ref.4

Subcellular location

Nucleus. Ref.4

Domain

MBT repeats have unique discriminatory binding activity for methylated Lys residues in H3 and H4; the MBT repeats bind mono- and dimethylated H3K9Me1, H3K9Me2, H4K20Me1 and H4K20Me2 but fail to interact with these residues if they are unmodified or trimethylated. Ref.4

Sequence similarities

Contains 1 FCS-type zinc finger.

Contains 4 MBT repeats.

Contains 1 SAM (sterile alpha motif) domain.

Ontologies

Keywords
   Biological processTranscription
Transcription regulation
   Cellular componentNucleus
   Coding sequence diversityAlternative splicing
   DomainRepeat
Zinc-finger
   LigandDNA-binding
Metal-binding
Zinc
   Molecular functionChromatin regulator
Repressor
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processchromatin modification

Inferred from electronic annotation. Source: UniProtKB-KW

chromatin silencing Ref.4

Inferred from direct assay. Source: FlyBase

imaginal disc growth Ref.4

Inferred from mutant phenotype. Source: FlyBase

oogenesis Ref.4

Inferred from mutant phenotype. Source: FlyBase

transcription

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentnucleus Ref.4

Inferred from direct assay. Source: FlyBase

   Molecular functionDNA binding

Inferred from electronic annotation. Source: UniProtKB-KW

methylated histone residue binding Ref.4

Inferred from direct assay. Source: FlyBase

zinc ion binding

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Binary interactions

With

Entry

#Exp.

IntAct

Notes

Q9VQD01EBI-117801,EBI-138843

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform B Ref.1 (identifier: Q9VK33-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: No experimental confirmation available.
Isoform A Ref.1 (identifier: Q9VK33-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-352: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 12201220Polycomb protein Sfmbt
PRO_0000306371

Regions

Repeat536 – 647112MBT 1
Repeat655 – 75399MBT 2
Repeat761 – 871111MBT 3
Repeat879 – 97597MBT 4
Domain1140 – 120364SAM
Zinc finger322 – 35736FCS-type

Natural variations

Alternative sequence1 – 352352Missing in isoform A. Ref.1
VSP_052548

Sequences

Sequence LengthMass (Da)Tools
Isoform B [UniParc].

Last modified June 1, 2003. Version 2.
Checksum: 2BE2785144CA051F

FASTA1,220133,666
        10         20         30         40         50         60 
MNPSELRMMW MSSQYNSERI TLEDAATLLG HPTVGLSVME DLSAHQPTLD MNPMMSLMGG 

        70         80         90        100        110        120 
DFTGQAAATA AALGVQPGTL IATNSNNLYG FAHMGGLQQQ LLQQSAAAAV FQNYAEAMDN 

       130        140        150        160        170        180 
DVENGMVGMA MEAVVDDDDQ VYGQRDNNFD DNGSELEPKQ EIINIDDFVM MNEDNNSYDG 

       190        200        210        220        230        240 
TDFMTSSDKD ISQSSSSCMA QMPGSLGVPG VEHDLLVPLP DGLLHHKLLG TTLVPAMGTL 

       250        260        270        280        290        300 
NGNAFGNIMV STENTSSKQM QRTYSTAKGA NSTATTATCS ASTSSALRSQ RKTRKIEPVN 

       310        320        330        340        350        360 
RPGLVLKTPI AYRGNIDPSV IPIQKDGMAV CKRCGAIGVK HTFYTKSRRF CSMACARGEL 

       370        380        390        400        410        420 
YSLVLNTKME GDQATTSSPD PGAGSESADL PGDQQQSQSD IELDLHAAHI KNANYRFRIT 

       430        440        450        460        470        480 
DQSKITQLNS FGEPMSMGGD AAANNVQMAA DETIAALNGG AVGDATAPGS TEEGASTPNS 

       490        500        510        520        530        540 
YLSAAPTPKA LRLFKDIYPQ DDLPQIPKYE RLPVPCPQME KIISIRRRMY DPTHSYDWLP 

       550        560        570        580        590        600 
RLSKENFNAA PVTCFPHAPG CEVWDNLGVG MKVEVENTDC DSIEVIQPGQ TPTSFWVATI 

       610        620        630        640        650        660 
LEIKGYKALM SYEGFDTDSH DFWVNLCNAE VHSVGWCATR GKPLIPPRTI EHKYKDWKDF 

       670        680        690        700        710        720 
LVGRLSGART LPSNFYNKIN DSLQSRFRLG LNLECVDKDR ISQVRLATVT KIVGKRLFLR 

       730        740        750        760        770        780 
YFDSDDGFWC HEDSPIIHPV GWATTVGHNL AAPQDYLERM LAGREAMIEV HEDDATIELF 

       790        800        810        820        830        840 
KMNFTFDEYY SDGKTNSFVE GMKLEAVDPL NLSSICPATV MAVLKFGYMM IRIDSYQPDA 

       850        860        870        880        890        900 
SGSDWFCYHE KSPCIFPAGF CSVNNISVTP PNGYDSRTFT WEGYLRDTGA VAAGQHLFHR 

       910        920        930        940        950        960 
IIPDHGFEVG MSLECADLMD PRLVCVATVA RVVGRLLKVH FDGWTDEYDQ WLDCESADIY 

       970        980        990       1000       1010       1020 
PVGWCVLVNH KLEGPPRVAH QQAPKPAPKP KIQRKRKPKK GAAGGKTPTD NNTQSVKSRT 

      1030       1040       1050       1060       1070       1080 
IALKTTPHLP KLSIKLELKP EHHNAAFYEN NQPEEEGDEE DPDADGDGDG STSHISEQST 

      1090       1100       1110       1120       1130       1140 
TQSSSDLIAG SGSGSGSASL VTLATGSNKT NSSATNNKYI PRLADIDSSE PHLELVPDTW 

      1150       1160       1170       1180       1190       1200 
NVYDVSQFLR VNDCTAHCDT FSRNKIDGKR LLQLTKDDIM PLLGMKVGPA LKISDLIAQL 

      1210       1220 
KCKVNPGRAR SHKTNKSPFL 

« Hide

Isoform A.

Checksum: A8D56440315A78E4
Show »

FASTA86895,867

References

« Hide 'large scale' references
[1]"The genome sequence of Drosophila melanogaster."
Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., Sutton G.G., Wortman J.R., Yandell M.D. expand/collapse author list , Zhang Q., Chen L.X., Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.
Science 287:2185-2195(2000) [PubMed: 10731132] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Berkeley.
[2]"Annotation of the Drosophila melanogaster euchromatic genome: a systematic review."
Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., Bettencourt B.R., Celniker S.E., de Grey A.D.N.J. expand/collapse author list , Drysdale R.A., Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.
Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002) [PubMed: 12537572] [Abstract]
Cited for: GENOME REANNOTATION, ALTERNATIVE SPLICING.
[3]Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J.W., Champe M., Chavez C., Dorsett V., Dresnek D., Farfan D., Frise E., George R.A., Gonzalez M., Guarin H., Kronmiller B., Li P.W., Liao G., Miranda A. expand/collapse author list , Mungall C.J., Nunoo J., Pacleb J.M., Paragas V., Park S., Patel S., Phouanenavong S., Wan K.H., Yu C., Lewis S.E., Rubin G.M., Celniker S.E.
Submitted (MAR-2003) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM B).
Strain: Berkeley.
Tissue: Embryo.
[4]"A Polycomb group protein complex with sequence-specific DNA-binding and selective methyl-lysine-binding activities."
Klymenko T., Papp B., Fischle W., Koecher T., Schelder M., Fritsch C., Wild B., Wilm M., Mueller J.
Genes Dev. 20:1110-1122(2006) [PubMed: 16618800] [Abstract]
Cited for: FUNCTION, INTERACTION WITH PHO, SUBCELLULAR LOCATION.
Tissue: Embryo.

Cross-references

Sequence databases

AE014134 Genomic DNA. Translation: AAF53249.2.
AE014134 Genomic DNA. Translation: AAF53250.2.
BT006011 mRNA. Translation: AAO74694.1.
RefSeqNP_609606.2.
NP_723786.1.
UniGeneDm.11543

3D structure databases

HSSPHSSP built from PDB template 1OI1 based on UniProtKB Q9UQR0.
ModBaseSearch...

Protein-protein interaction databases

IntActQ9VK33. 8 interactions.

Genome annotation databases

EnsemblFBgn0032475. Drosophila melanogaster. [Contig view]
GeneID34709.
KEGGdme:Dmel_CG16975.

Organism-specific databases

FlyBaseFBgn0032475. Sfmbt.

Phylogenomic databases

HOGENOMQ9VK33.
OMAQ9VK33. HEKSPCI.

Gene expression databases

ArrayExpressQ9VK33.

Family and domain databases

InterProIPR004092. Mbt.
IPR001660. SAM.
IPR013761. SAM_type.
IPR012313. Znf_FCS.
[Graphical view]
Gene3DG3DSA:1.10.150.50. SAM_type. 1 hit.
PfamPF02820. MBT. 4 hits.
PF00536. SAM_1. 1 hit.
[Graphical view]
SMARTSM00561. MBT. 4 hits.
SM00454. SAM. 1 hit.
[Graphical view]
PROSITEPS51079. MBT. 4 hits.
PS50105. SAM_DOMAIN. 1 hit.
PS51024. ZF_FCS. 1 hit.
[Graphical view]
ProtoNetSearch...

Other Resources

NextBio789812.

Entry information

Entry nameSMBT_DROME
AccessionPrimary (citable) accession number: Q9VK33
Secondary accession number(s): Q7KTB5, Q9VK32
Entry history
Integrated into UniProtKB/Swiss-Prot: October 2, 2007
Last sequence update: June 1, 2003
Last modified: June 16, 2009
This is version 63 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectDrosophila annotation project

Relevant documents

Drosophila

Drosophila: entries, gene names and cross-references to FlyBase

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Binary interactions · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents