
Q08D57 (SET1B_XENTR) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 54.


Names and origin

Protein names
Recommended name: Histone-lysine N-methyltransferase SETD1B
EC=2.1.1.43
Alternative name(s): SET domain-containing protein 1B
Gene names
Name: setd1b
Organism: Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) [Reference proteome]
Taxonomic identifier: 8364 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Amphibia > Batrachia > Anura > Pipoidea > Pipidae > Xenopodinae > Xenopus > Silurana

Protein attributes

Sequence length: 1956 AA.
Sequence status: Complete.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Function

Histone methyltransferase that, when part of the SET1 histone methyltransferase (HMT) complex, specifically methylates 'Lys-4' of histone H3, but not if the neighboring 'Lys-9' residue is already methylated. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation (By similarity).

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone].

Subcellular location

Nucleus speckle (By similarity). Chromosome (By similarity).

Sequence similarities

Belongs to the class V-like SAM-binding methyltransferase superfamily.

Contains 1 post-SET domain.

Contains 1 RRM (RNA recognition motif) domain.

Contains 1 SET domain.

Sequence annotation (Features)

Feature key          Position(s)    Length  Description                                Feature identifier

Molecule processing

Chain                1 – 1956       1956    Histone-lysine N-methyltransferase SETD1B  PRO_0000316998

Regions

Domain               111 – 199      89      RRM
Domain               1817 – 1934    118     SET
Domain               1940 – 1956    17      Post-SET
Compositional bias   657 – 722      66      Pro-rich
Compositional bias   987 – 1223     237     Glu-rich
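
The Length column follows from the Position(s) column because feature positions are 1-based and inclusive at both ends. A minimal Python sketch of that arithmetic, checked against the rows listed above; the function name `feature_length` and the `FEATURES` list are illustrative assumptions and not part of the entry:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    if start < 1 or end < start:
        raise ValueError(f"invalid feature range: {start}-{end}")
    return end - start + 1


# (description, start, end, length as listed in the feature table)
FEATURES = [
    ("Chain", 1, 1956, 1956),
    ("RRM domain", 111, 199, 89),
    ("SET domain", 1817, 1934, 118),
    ("Post-SET domain", 1940, 1956, 17),
    ("Pro-rich bias", 657, 722, 66),
    ("Glu-rich bias", 987, 1223, 237),
]

# Collect any row whose listed length disagrees with its positions.
mismatches = [
    name for name, start, end, listed in FEATURES
    if feature_length(start, end) != listed
]
print("mismatching rows:", mismatches)
```

An empty mismatch list means every listed length agrees with its start and end positions.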

Sequences

Sequence: Q08D57 [UniParc].
Length: 1,956
Mass (Da): 218,069

Last modified October 31, 2006. Version 1.
Checksum: 9AFF263E87E53DCD
        10         20         30         40         50         60 
MSFKEAKPGE RGKNPEDHGR KQTASWINGM EAGNQPSTSG EKKSHHWRSY KLIIDPALRK 

        70         80         90        100        110        120 
GQHKLYRYDG LSFSMPNSGV PPVDSVRDPR IGRIWTKTKE LDLPVPKLKI DEFYVGPVPP 

       130        140        150        160        170        180 
KQVTFAKLND NIRENFLGDM CKKYGEVEEV EILYNPKNKK HLGIAKVIFA SVKGARDAVK 

       190        200        210        220        230        240 
HLHNTSVMGN IIHVELDTKG ETRMRFYDLL VNGFYTPQTL PVGSDLDASP TVNETPQVVE 

       250        260        270        280        290        300 
SVKRTKETAI GPSVTPNSST PFSHDTAYSS SRQGTPNSYS QFTPQSQGTP HTPRLGTPFS 

       310        320        330        340        350        360 
QDSSYSSRQT TPAFHYGQDS GFKPRRHENK FTDAYNRRPG HHYVHSSGSY RGTEHTFNVT 

       370        380        390        400        410        420 
RPQPEPVQVP RTPPLSHSSG NYKSAFSPYQ GNTVFPQTDE SQYPQTSRDM EYRRTGPQTS 

       430        440        450        460        470        480 
DSYSDAGCNS ASLELKPVKE KPEEPPPPEP DSTTEQKASF SQTPERCETP GTPTLEAELQ 

       490        500        510        520        530        540 
HNSLDTRIAM LLKEQRTQLH LIAGDQNSDN EIRMEGSPIS SSSSQLSPIP PYSSGSRYQD 

       550        560        570        580        590        600 
VTPSSRPSST GLEDISPTPL PDSDDDDEPI PGTASLCQNS RSASPIDQIN QSGRKTESLD 

       610        620        630        640        650        660 
KKELVAGDET PTSEKMDEGH PSSGEDMEIS DDEVTPSPIT SAECAITSSS VISSVIPIPP 

       670        680        690        700        710        720 
PGFPPLPPPP PPQPGFPMPP PLPPPPPPTH PSVTVPPPPL PAPPGVPPHH ILHHPPPYHH 

       730        740        750        760        770        780 
FPVMQGEMMN VLGNHWGGMT MSFQMQTQML SRMMQGQGSY PYHHFMGGSM QFGNQHPYRP 

       790        800        810        820        830        840 
FAISAHLTRG QPWPPFPKFD PSVPPPGYEH KKEDPHKATV DGVLQVIVKE LKAIMKRDLN 

       850        860        870        880        890        900 
RKMVEVVAFR AFDEWWDKKE RLAKQSLTPV KSGESKEEDK QKTKEHITSS LLESWNKGEG 

       910        920        930        940        950        960 
LGFEGIGLGI GLRGAIRLPS FKVKRKEPPD AALAGDQKRI RPSHSVDDED EESERDRDIS 

       970        980        990       1000       1010       1020 
STASDLSKKD ADAVNNRRRP ARPLDSEGEE EVESEGDDGE TSDKEDSSSE KEDQDDGSVS 

      1030       1040       1050       1060       1070       1080 
ALSSKKQLYG DKEGDDEDDD TQSSGKEEDL VSEEEDTTSV ASSRAEMDSS DESEESSEYE 

      1090       1100       1110       1120       1130       1140 
SSSDSDEKEE EDDEEEELVF GDDQSEDQDL GQEYEVETDR EEDFFRENLS ECSSLPKAGD 

      1150       1160       1170       1180       1190       1200 
VELEDEMQKV EEDVARQTTQ ETLHLRKKNL DVPLVESKEC KQDTLDKVEK LFAVPMQEEV 

      1210       1220       1230       1240       1250       1260 
FKEHEKAPSP MNEEEEYIEL QLEPVPLVPE GAAPAAQEPV IIRPLTPTGA FGETGPVLKL 

      1270       1280       1290       1300       1310       1320 
EEPKLQVNLT QFATEDEELY PRTPGRDTAA HSDTEVTFQP GLKVAPSSLP FLPSHNKEEE 

      1330       1340       1350       1360       1370       1380 
CLLPPEKHAG HLTVTKMLSE EDLPRTPGRD IVVKSSHLGK SQSTETVPAT PGSDAPLTGS 

      1390       1400       1410       1420       1430       1440 
SLTLTSPHIP GSPFSYLSQS PGIINSGIPR TPGRDFNFTP TFPESNSIFP CHPSGKKPSV 

      1450       1460       1470       1480       1490       1500 
DEPDEKSFKE PTSASLTMNS VPSPIPFASP PRGLPHMDIR LGADDLESSD TPAYLSDKLL 

      1510       1520       1530       1540       1550       1560 
SEESECEFTK VHLTSTDESA PSPPLPPAEK RKGDRSKKPL SAHEFETEKN YETSSAVAMS 

      1570       1580       1590       1600       1610       1620 
EGALGKQMFI GQPDAVSGIK DPAAVPLDFR NDSLSENTVH EPIIQKVPLK ELENQWNEVL 

      1630       1640       1650       1660       1670       1680 
KEEEDITKHK KSRNSRHNNR YDEFSTVPSP EFSPPRAMFK PRSEFEEMTI LYDIWNGGID 

      1690       1700       1710       1720       1730       1740 
DEDIKYMCIT YDRLLQQDNG MDWLNDTLWV YHPSTSVYSP KKKKRDDGLR EHVTGCARSE 

      1750       1760       1770       1780       1790       1800 
GYYKIDKKDK LKYLINNRSL TEELPIDTQG KSIPAQPQAS TRAGSERRSE QRRLLSSFTG 

      1810       1820       1830       1840       1850       1860 
SCDSDLLKFN QLKFRKKKLR FCKSHIHDWG LFAMEPIIAD EMVIEYVGQN IRQVIADMRE 

      1870       1880       1890       1900       1910       1920 
KRYEDEGIGS SYMFRVDHDT IIDATKCGNF ARFINHSCNP NCYAKVITVE SQKKIVIYSK 

      1930       1940       1950 
QYINVNEEIT YDYKFPIEDV KIPCLCGAEN CRGTLN 
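
The sequence above is laid out in groups of 10 residues, with a ruler line of position numbers above each row. To work with it programmatically, the ruler lines and spaces have to be removed first. The following is a hedged Python sketch, assuming the block is available as plain text; `parse_numbered_block` and `extract_region` are hypothetical helper names, and only the first row of the sequence is used as sample input:

```python
def parse_numbered_block(text: str) -> str:
    """Join residue groups into one string, skipping position-ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines hold only integers (10, 20, ...); blank lines hold nothing.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive positions."""
    return sequence[start - 1:end]


# First row of the sequence block, copied from the entry.
block = """
        10         20         30         40         50         60
MSFKEAKPGE RGKNPEDHGR KQTASWINGM EAGNQPSTSG EKKSHHWRSY KLIIDPALRK
"""

first_60 = parse_numbered_block(block)
print(len(first_60))
print(extract_region(first_60, 11, 20))
```

Applied to the full block, the parsed string should have the 1,956 residues stated in the entry, and `extract_region(sequence, 111, 199)` would return the span annotated as the RRM domain.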


References

[1] NIH - Xenopus Gene Collection (XGC) project
Submitted (SEP-2006) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Brain.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: BC123932 mRNA. Translation: AAI23933.1.
RefSeq: NP_001072649.1. NM_001079181.1.
UniGene: Str.52478.

3D structure databases

ProteinModelPortal: Q08D57.

Protein-protein interaction databases

STRING: 8364.ENSXETP00000008143.

Proteomic databases

PaxDb: Q08D57.

Genome annotation databases

GeneID: 780106.
KEGG: xtr:780106.

Organism-specific databases

CTD: 23067.
Xenbase: XB-GENE-5842344. setd1b.

Phylogenomic databases

eggNOG: COG2940.
HOGENOM: HOG000168216.
HOVERGEN: HBG055596.
KO: K11422.

Family and domain databases

Gene3D: 3.30.70.330. 1 hit.
InterPro: IPR024657. COMPASS_Set1_N-SET.
          IPR015722. Histone-lysine_MeTfrase.
          IPR012677. Nucleotide-bd_a/b_plait.
          IPR003616. Post-SET_dom.
          IPR000504. RRM_dom.
          IPR001214. SET_dom.
PANTHER: PTHR22884:SF10. PTHR22884:SF10. 1 hit.
Pfam: PF11764. N-SET. 1 hit.
      PF00076. RRM_1. 1 hit.
      PF00856. SET. 1 hit.
SMART: SM00508. PostSET. 1 hit.
       SM00360. RRM. 1 hit.
       SM00317. SET. 1 hit.
PROSITE: PS50868. POST_SET. 1 hit.
         PS50102. RRM. 1 hit.
         PS50280. SET. 1 hit.

Entry information

Entry name: SET1B_XENTR
Accession: Primary (citable) accession number: Q08D57
Entry history
Integrated into UniProtKB/Swiss-Prot: February 5, 2008
Last sequence update: October 31, 2006
Last modified: April 16, 2014
This is version 54 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families