Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

G4VJ01 (G4VJ01_SCHMA) Unreviewed, UniProtKB/TrEMBL

Last modified July 9, 2014. Version 19. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
Gene names
ORF Names:Smp_150850.2 EMBL CCD79402.1
OrganismSchistosoma mansoni (Blood fluke) [Reference proteome]
Taxonomic identifier6183 [NCBI]
Taxonomic lineageEukaryotaMetazoaPlatyhelminthesTrematodaDigeneaStrigeididaSchistosomatoideaSchistosomatidaeSchistosoma

Protein attributes

Sequence length1032 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone]. RuleBase RU004542

S-adenosyl-L-methionine + histone L-lysine = S-adenosyl-L-homocysteine + histone N(6)-methyl-L-lysine.

Subcellular location

Nucleus By similarity RuleBase RU004542.

Sequence similarities

Contains 1 SET domain. RuleBase RU004538

Contains 1 pre-SET domain. RuleBase RU004542

Contains SET domain. SAAS SAAS001214

Contains pre-SET domain. SAAS SAAS001214

Sequences

Sequence LengthMass (Da)Tools
G4VJ01 [UniParc].

Last modified December 14, 2011. Version 1.
Checksum: 5E28402369A697ED

FASTA1,032118,377
        10         20         30         40         50         60 
MNVLKHEHRV LIDELTESIY DALDSDDIFD VIFAFVESKY QAELKDVNNP KLRLHIIQQR 

        70         80         90        100        110        120 
FSLALEECGL IDEIIESFPE LNLNARLDKL KKEQSTLQSN ILELNGKIDT IVDEMTRLQI 

       130        140        150        160        170        180 
NSFGRVLRLF DNRVIDLAKD EEEIVVIERP NNRIPESTIN HSTAPVWRTL QPAELLLGLQ 

       190        200        210        220        230        240 
LTPSQPVLYR KTQHTWFPGR VLSCHLPNSD PSTPKSDKDR AKENENAQYT IALDTSSKSK 

       250        260        270        280        290        300 
SEICFAVTSS LALAFSASEL RQKYPVGARV VSIYRDDTGG IGYYSGLIAE PPSERNNQRY 

       310        320        330        340        350        360 
LLFFDDGYTQ YSPPNEVYRI CHQSKENWKE ATESSQEFIK RYLAQYPQRP MVRLKPGQTV 

       370        380        390        400        410        420 
ETELDGDWMK ATVEKVDASL ALMKFSRTHS EWIYRGSTRL EPLFSDLLSS QSKSVPTHNQ 

       430        440        450        460        470        480 
RVEVEYSSVD VPLDSSAKRR KARKSATGNQ NTAMSVRALS SGDSHKISNS IHQTNKSLSN 

       490        500        510        520        530        540 
YNNEALKTTS KKGSEKNINE QCLSIDPITN LSELESTGTK TSYLDNLFKE FDYKPFESHK 

       550        560        570        580        590        600 
CGHYCLKSYR KPSSALSPIL CSENPVDYKG LNPLEIPFHC GWLRYLAKYD PPVNNKCNII 

       610        620        630        640        650        660 
YTAPCGRSLR SMHEVERFLD KTNSQLTADL FSFDSTLIIN QEFRAEKTLT NIVDLSYGKE 

       670        680        690        700        710        720 
NVPIPCVNSV DNEVPGYIDY TPQRQPIGNV PLLKDSKFLV CCDCTDNCRD RTKCACQQLT 

       730        740        750        760        770        780 
VEASSLTNPN GLVDSQAGYR YRRLSQFTVG GVYECNSNCQ CDRRCSNRVV QQGLWVRLQV 

       790        800        810        820        830        840 
FKTARKGWGI RALNAIPKGT FICTYAGAIY DEAMAVQEGF DCGDEYQAEL DYIETVEKDK 

       850        860        870        880        890        900 
EGYESTVEDL NEILNETTVA LIQRSASLEP EIKDRKPLER SKSHRKDSEK QHRRQHHHHH 

       910        920        930        940        950        960 
HHHRNHRQQQ LQQLSQSHQQ NQQQTKRMYP VAQKDWLRAR TYFNDINPYI MDAKKMGNLG 

       970        980        990       1000       1010       1020 
RYFNHSCNPN VFVQNVFIDT HDPRFPEVAF FAKRNIDVGE EMTWDYGYTV DAVPFKVLYC 

      1030 
YCGEPNCRIR LL 

« Hide

References

[1]"A systematically improved high quality genome and transcriptome of the human blood fluke Schistosoma mansoni."
Protasio A.V., Tsai I.J., Babbage A., Nichol S., Hunt M., Aslett M.A., De Silva N., Velarde G.S., Anderson T.J., Clark R.C., Davidson C., Dillon G.P., Holroyd N.E., LoVerde P.T., Lloyd C., McQuillan J., Oliveira G., Otto T.D. expand/collapse author list , Parker-Manuel S.J., Quail M.A., Wilson R.A., Zerlotini A., Dunne D.W., Berriman M.
PLoS Negl. Trop. Dis. 6:E1455-E1455(2012) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Puerto Rican.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
HE601627 Genomic DNA. Translation: CCD79402.1.
UniGeneSma.10940.

3D structure databases

ProteinModelPortalG4VJ01.
ModBaseSearch...
MobiDBSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaSmp_150850.2; Smp_150850.2:pep; Smp_150850.
KEGGsmm:Smp_150850.2.

Organism-specific databases

CTD8344287.

Phylogenomic databases

KOK11421.
OMAQCWQLTI.
PhylomeDBG4VJ01.

Family and domain databases

Gene3D3.30.890.10. 1 hit.
InterProIPR016177. DNA-bd_dom.
IPR001739. Methyl_CpG_DNA-bd.
IPR007728. Pre-SET_dom.
IPR001214. SET_dom.
[Graphical view]
PfamPF01429. MBD. 1 hit.
PF05033. Pre-SET. 1 hit.
PF00856. SET. 1 hit.
[Graphical view]
SMARTSM00317. SET. 1 hit.
[Graphical view]
SUPFAMSSF54171. SSF54171. 1 hit.
PROSITEPS50982. MBD. 1 hit.
PS50867. PRE_SET. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameG4VJ01_SCHMA
AccessionPrimary (citable) accession number: G4VJ01
Entry history
Integrated into UniProtKB/TrEMBL: December 14, 2011
Last sequence update: December 14, 2011
Last modified: July 9, 2014
This is version 19 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)