Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot Q5RGA4 (MYSM1_DANRE)

Last modified November 3, 2009. Version 40. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Histone H2A deubiquitinase MYSM1
      Short name=2A-DUB
    EC=3.1.2.15
Alternative name(s):
    Myb-like, SWIRM and MPN domain-containing protein 1
Gene names
Name: mysm1
ORF Names: si:ch211-59d15.8
OrganismDanio rerio (Zebrafish) (Brachydanio rerio)
Taxonomic identifier7955 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiActinopterygiiNeopterygiiTeleosteiOstariophysiCypriniformesCyprinidaeDanio

Protein attributes

Sequence length822 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at transcript level.

General annotation (Comments)

Function

Metalloprotease that specifically deubiquitinates monoubiquitinated histone H2A, a specific tag for epigenetic transcriptional repression, thereby acting as a coactivator. Preferentially deubiquitinates monoubiquitinated H2A in hyperacetylated nucleosomes. Deubiquitination of histone H2A leads to facilitate the phosphorylation and dissociation of histone H1 from the nucleosome. Acts as a coactivator by participating in the initiation and elongation steps of androgen receptor (AR)-induced gene activation By similarity.

Catalytic activity

Ubiquitin C-terminal thioester + H2O = ubiquitin + a thiol.

Subcellular location

Nucleus By similarity.

Domain

Binds double-stranded DNA via the SANT domain. The SWIRM domain does not bind double-stranded DNA By similarity.

Sequence similarities

Belongs to the peptidase M67A family. MYSM1 subfamily.

Contains 1 MPN (JAB/Mov34) domain.

Contains 1 SANT domain.

Contains 1 SWIRM domain.

Sequence caution

The sequence AAI34240.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

The sequence AAI34240.1 differs from that shown. Reason: Miscellaneous discrepancy. Contaminating sequence. Potential poly-A sequence.

The sequence CAI12031.1 differs from that shown. Reason: Erroneous gene model prediction.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 822822Histone H2A deubiquitinase MYSM1
PRO_0000373925

Regions

Domain97 – 14852SANT
Domain313 – 41199SWIRM
Motif596 – 60914JAMM motif

Sites

Metal binding5961Zinc; catalytic By similarity
Metal binding5981Zinc; catalytic By similarity
Metal binding6091Zinc; catalytic By similarity

Experimental info

Sequence conflict381R → K in AAI34240. Ref.1
Sequence conflict411S → G in AAI34240. Ref.1
Sequence conflict2081R → C in AAI34240. Ref.1
Sequence conflict2211R → S in AAI34240. Ref.1
Sequence conflict2421A → V in AAI34240. Ref.1
Sequence conflict2461F → I in AAI34240. Ref.1
Sequence conflict2951A → G in AAI34240. Ref.1

Sequences

Sequence LengthMass (Da)Tools
Q5RGA4-1 [UniParc].

Last modified May 5, 2009. Version 2.
Checksum: D6F0BB5E0266BA5A

FASTA82292,803
        10         20         30         40         50         60 
MADELDVVDI EGDECDEIVG DLSSAEILQD QYLQSAWRTN SSVLPWTLDS SISDENRQVI 

        70         80         90        100        110        120 
ESMLLEEQYY FASKKAPKVA WTNEKTKVKK PLVKSSAAQT RWAEEEKELF EKGLAQFGRR 

       130        140        150        160        170        180 
WTKIAKLIGT RTVLQVKSYA KQYFKNKPKA EPAAEVTSAN VTSVSSIQPH VSALTNAVRI 

       190        200        210        220        230        240 
ERLSDDEDVD ITDDFSDSEL QSKKQPERSV SPDCNHHGEL RPSLSDALLH LPSESTAADG 

       250        260        270        280        290        300 
QADPDFSEDT EIHHSEIDSE AVEESGNPFI NLDSPSKHSL TGEEETELAD KCESAECLEE 

       310        320        330        340        350        360 
EVEDQEEDEE EELRAPEQEV ELDLNTITEE EKQAISEFFE GRPSKTPERY LKIRNYILDQ 

       370        380        390        400        410        420 
WRRSKPKYLN KTSVRPGLKN CGDVNCIGRI HTYLELIGAI NFNCDQAIYN RPRLVDRSRL 

       430        440        450        460        470        480 
KESKDSLEAY HLAQRLQSMR TRKRRIRDVW GNWCDAKDLE GQTYEHLSAE ELAIRREEMK 

       490        500        510        520        530        540 
KKGPRPSKLP KQRGSFDPFQ LIPCKTFGEE RQEPYSVIVC AEALIVMDIH AHVSMGEVIG 

       550        560        570        580        590        600 
LLGGTYEEED KVLKICSAEP CNSLSTGLQC EMDPVSQTQA SEVLGVKGLS VVGWYHSHPA 

       610        620        630        640        650        660 
FDPNPSLRDI DTQAKYQSYF SRGGAPFIGM IVSPYNPSNS SPQSQSTCLL VQEEPGPSGS 

       670        680        690        700        710        720 
HKFPYQFNMQ CSAEPPDWSE VMRRAEWVVF KYRLCHGSVP MDRLFRRDSS LTCLEKMLLS 

       730        740        750        760        770        780 
IRTVLERLSE VDIETFLVQL ESLFNTHFLS ETGSSSTHLY ETATQPQLLS FLSSEPISSS 

       790        800        810        820 
ATEETSDGHD EPNTTGPDKL EQSCVEEEHD ETVMQSTSAE TV 

« Hide

References

[1]The Danio rerio sequencing project at the Sanger Institute
Submitted (JAN-2009) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Tuebingen.
[2]NIH - Zebrafish Gene Collection (ZGC) project
Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-457.
Tissue: Embryo.

Cross-references

Sequence databases

BX942841 Genomic DNA. Translation: CAI12031.1. Sequence problems.
BC134239 mRNA. Translation: AAI34240.1. Sequence problems.
IPIIPI00494402.
RefSeqNP_001157501.1.
UniGeneDr.108451

3D structure databases

SMRQ5RGA4. Positions 101-149, 309-411.
ModBaseSearch...

Genome annotation databases

EnsemblENSDART00000044655; ENSDARP00000044654; ENSDARG00000034693; Danio rerio. [Genome view]
GeneID561225.
KEGGdre:561225.

Organism-specific databases

ZFINZDB-GENE-041014-28. si:ch211-59d15.8.

Phylogenomic databases

HOVERGENQ5RGA4.
OMAVPMDKIF.

Gene expression databases

BgeeQ5RGA4.

Family and domain databases

InterProIPR000555. Mov34_MPN_PAD1.
IPR014778. Myb_DNA-bd.
IPR001005. SANT_DNA-bd.
IPR017884. SANT_eukarya.
IPR007526. SWIRM.
[Graphical view]
PfamPF01398. Mov34. 1 hit.
PF00249. Myb_DNA-binding. 1 hit.
PF04433. SWIRM. 1 hit.
[Graphical view]
SMARTSM00232. JAB_MPN. 1 hit.
SM00717. SANT. 1 hit.
[Graphical view]
PROSITEPS51293. SANT. 1 hit.
PS50934. SWIRM. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameMYSM1_DANRE
AccessionPrimary (citable) accession number: Q5RGA4
Secondary accession number(s): A3KPA5
Entry history
Integrated into UniProtKB/Swiss-Prot: May 5, 2009
Last sequence update: May 5, 2009
Last modified: November 3, 2009
This is version 40 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectZebrafish annotation project

Relevant documents

Peptidase families

Classification of peptidase families and list of entries

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents