Skip Header

Contribute Send feedback
Read comments (?) or add your own

O95243 (MBD4_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 91. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (6) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Interactions·Alt products·Sequence annotation·Sequences·References·Web links·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Methyl-CpG-binding domain protein 4

EC=3.2.2.-
Alternative name(s):
Methyl-CpG-binding endonuclease 1
Methyl-CpG-binding protein MBD4
Mismatch-specific DNA N-glycosylase
Gene names
Name:MBD4
Synonyms:MED1
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length580 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Mismatch-specific DNA N-glycosylase involved in DNA repair. Has thymine glycosylase activity and is specific for G:T mismatches within methylated and unmethylated CpG sites. Can also remove uracil or 5-fluorouracil in G:U mismatches. Has no lyase activity. Was first identified as methyl-CpG-binding protein. Ref.3 Ref.9

Subunit structure

Interacts with MLH1. Ref.3 Ref.10

Subcellular location

Nucleus.

Sequence similarities

Contains 1 MBD (methyl-CpG-binding) domain.

Ontologies

Keywords
   Biological processDNA damage
DNA repair
   Cellular componentNucleus
   Coding sequence diversityAlternative splicing
Polymorphism
   LigandDNA-binding
   Molecular functionHydrolase
   Technical term3D-structure
Complete proteome
Reference proteome
Gene Ontology (GO)
   Biological processdepyrimidination

Traceable author statement. Source: Reactome

   Cellular componentnucleoplasm

Traceable author statement. Source: Reactome

   Molecular functionendodeoxyribonuclease activity

Traceable author statement. Source: ProtInc

protein binding

Inferred from physical interaction Ref.10. Source: IntAct

satellite DNA binding

Traceable author statement. Source: ProtInc

Complete GO annotation...

Binary interactions

With

Entry

#Exp.

IntAct

Notes

FADDQ131586EBI-348011,EBI-494804

Alternative products

This entry describes 3 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: O95243-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: O95243-2)

The sequence of this isoform differs from the canonical sequence as follows:
     395-400: Missing.
Isoform 3 (identifier: O95243-3)

The sequence of this isoform differs from the canonical sequence as follows:
     539-540: KY → AP
     541-580: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 580580Methyl-CpG-binding domain protein 4
PRO_0000096264

Regions

Domain76 – 14873MBD

Sites

Active site5601 By similarity

Natural variations

Alternative sequence395 – 4006Missing in isoform 2.
VSP_010816
Alternative sequence539 – 5402KY → AP in isoform 3.
VSP_010817
Alternative sequence541 – 58040Missing in isoform 3.
VSP_010818
Natural variant611C → R.
Corresponds to variant rs2307296 [ dbSNP | Ensembl ].
VAR_029306
Natural variant2731A → S. Ref.6
Corresponds to variant rs10342 [ dbSNP | Ensembl ].
VAR_019357
Natural variant2731A → T.
Corresponds to variant rs10342 [ dbSNP | Ensembl ].
VAR_019514
Natural variant3421S → P. Ref.6
Corresponds to variant rs2307289 [ dbSNP | Ensembl ].
VAR_019358
Natural variant3461E → K. Ref.6
Corresponds to variant rs140693 [ dbSNP | Ensembl ].
VAR_019359
Natural variant3581I → T.
Corresponds to variant rs2307298 [ dbSNP | Ensembl ].
VAR_019515
Natural variant5681D → H. Ref.6
Corresponds to variant rs2307293 [ dbSNP | Ensembl ].
VAR_019360

Secondary structure

................... 580
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified May 1, 1999. Version 1.
Checksum: BF16FB21A34B8E5F

FASTA58066,051
        10         20         30         40         50         60 
MGTTGLESLS LGDRGAAPTV TSSERLVPDP PNDLRKEDVA MELERVGEDE EQMMIKRSSE 

        70         80         90        100        110        120 
CNPLLQEPIA SAQFGATAGT ECRKSVPCGW ERVVKQRLFG KTAGRFDVYF ISPQGLKFRS 

       130        140        150        160        170        180 
KSSLANYLHK NGETSLKPED FDFTVLSKRG IKSRYKDCSM AALTSHLQNQ SNNSNWNLRT 

       190        200        210        220        230        240 
RSKCKKDVFM PPSSSSELQE SRGLSNFTST HLLLKEDEGV DDVNFRKVRK PKGKVTILKG 

       250        260        270        280        290        300 
IPIKKTKKGC RKSCSGFVQS DSKRESVCNK ADAESEPVAQ KSQLDRTVCI SDAGACGETL 

       310        320        330        340        350        360 
SVTSEENSLV KKKERSLSSG SNFCSEQKTS GIINKFCSAK DSEHNEKYED TFLESEEIGT 

       370        380        390        400        410        420 
KVEVVERKEH LHTDILKRGS EMDNNCSPTR KDFTGEKIFQ EDTIPRTQIE RRKTSLYFSS 

       430        440        450        460        470        480 
KYNKEALSPP RRKAFKKWTP PRSPFNLVQE TLFHDPWKLL IATIFLNRTS GKMAIPVLWK 

       490        500        510        520        530        540 
FLEKYPSAEV ARTADWRDVS ELLKPLGLYD LRAKTIVKFS DEYLTKQWKY PIELHGIGKY 

       550        560        570        580 
GNDSYRIFCV NEWKQVHPED HKLNKYHDWL WENHEKLSLS 

« Hide

Isoform 2 [UniParc].

Checksum: 33809A26A2E61A26
Show »

FASTA57465,348
Isoform 3 [UniParc].

Checksum: 3131CE4F9A488371
Show »

FASTA54060,949

References

« Hide 'large scale' references
[1]"Identification and characterization of a family of mammalian methyl-CpG binding proteins."
Hendrich B., Bird A.
Mol. Cell. Biol. 18:6538-6547(1998) [PubMed: 9774669] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1).
[2]"Genomic structure and chromosomal mapping of the murine and human mbd1, mbd2, mbd3, and mbd4 genes."
Hendrich B., Abbott C., McQueen H., Chambers D., Cross S.H., Bird A.
Mamm. Genome 10:906-912(1999) [PubMed: 10441743] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[3]"MED1, a novel human methyl-CpG-binding endonuclease, interacts with DNA mismatch repair protein MLH1."
Bellacosa A., Cicchillitti L., Schepis F., Riccio A., Yeung A.T., Matsumoto Y., Golemis E.A., Genuardi M., Neri G.
Proc. Natl. Acad. Sci. U.S.A. 96:3969-3974(1999) [PubMed: 10097147] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), FUNCTION, INTERACTION WITH MLH1.
Tissue: Fetal brain.
[4]Guo J.H., Chen L., Yu L.
Submitted (JUL-2002) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
Tissue: Lung.
[5]"Cloning of human full open reading frames in Gateway(TM) system entry vector (pDONR201)."
Ebert L., Schick M., Neubert P., Schatten R., Henze S., Korn B.
Submitted (MAY-2004) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
[6]NIEHS SNPs program
Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANTS SER-273; PRO-342; LYS-346 AND HIS-568.
[7]Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., Turner R. expand/collapse author list , Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., Venter J.C.
Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[8]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Tissue: Lung.
[9]"Biphasic kinetics of the human DNA repair protein MED1 (MBD4), a mismatch-specific DNA N-glycosylase."
Petronzelli F., Riccio A., Markham G.D., Seeholzer S.H., Stoerker J., Genuardi M., Yeung A.T., Matsumoto Y., Bellacosa A.
J. Biol. Chem. 275:32422-32429(2000) [PubMed: 10930409] [Abstract]
Cited for: FUNCTION.
[10]"Fas-associated death domain protein interacts with methyl-CpG binding domain protein 4: a potential link between genome surveillance and apoptosis."
Screaton R.A., Kiessling S., Sansom O.J., Millar C.B., Maddison K., Bird A., Clarke A.R., Frisch S.M.
Proc. Natl. Acad. Sci. U.S.A. 100:5211-5216(2003) [PubMed: 12702765] [Abstract]
Cited for: INTERACTION WITH FADD.
+Additional computationally mapped references.

Web resources

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AF072250 mRNA. Translation: AAC68879.1.
AF120999, AF120997, AF120998 Genomic DNA. Translation: AAD50374.1.
AF114784 mRNA. Translation: AAD22195.1.
AF532602 mRNA. Translation: AAP97338.1.
CR450305 mRNA. Translation: CAG29301.1.
AF494057 Genomic DNA. Translation: AAM00008.1.
CH471052 Genomic DNA. Translation: EAW79251.1.
CH471052 Genomic DNA. Translation: EAW79253.1.
CH471052 Genomic DNA. Translation: EAW79254.1.
CH471052 Genomic DNA. Translation: EAW79255.1.
BC011752 mRNA. Translation: AAH11752.1.
IPIIPI00426727.
IPI00426728.
IPI00426729.
RefSeqNP_003916.1. NM_003925.1.
UniGeneHs.35947.

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
3IHOX-ray2.70A437-574[»]
ProteinModelPortalO95243.
SMRO95243. Positions 79-154, 437-574.
ModBaseSearch...

Protein-protein interaction databases

IntActO95243. 3 interactions.
MINTMINT-264766.
STRINGO95243.

PTM databases

PhosphoSiteO95243.

Proteomic databases

PRIDEO95243.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000249910; ENSP00000249910; ENSG00000129071.
GeneID8930.
KEGGhsa:8930.
UCSCuc003emh.1. human.
uc003emi.1. human.
uc003emj.1. human.

Organism-specific databases

CTD8930.
GeneCardsGC03M129149.
H-InvDBHIX0003669.
HGNCHGNC:6919. MBD4.
HPAHPA002031.
MIM603574. gene.
neXtProtNX_O95243.
PharmGKBPA30663.
GenAtlasSearch...

Phylogenomic databases

eggNOGprNOG07000.
GeneTreeENSGT00530000063687.
HOGENOMHBG716172.
HOVERGENHBG052418.
InParanoidO95243.
OMADFTVLSK.
PhylomeDBO95243.

Enzyme and pathway databases

ReactomeREACT_216. DNA Repair.

Gene expression databases

ArrayExpressO95243.
BgeeO95243.
CleanExHS_MBD4.
HS_MED1.
GenevestigatorO95243.
GermOnlineENSG00000129071. Homo sapiens.

Family and domain databases

InterProIPR016177. DNA-bd_integrase-typ.
IPR011257. DNA_glycosylase.
IPR003265. HhH-GPD_domain.
IPR017352. Me_CpG-bd_MBD4.
IPR001739. Methyl_CpG_DNA-bd.
[Graphical view]
Gene3DG3DSA:1.10.340.30. DNA_glycosylase. 1 hit.
G3DSA:3.30.890.10. Methyl_CpG_DNA-bd. 1 hit.
KOK10801.
PfamPF00730. HhH-GPD. 1 hit.
PF01429. MBD. 1 hit.
[Graphical view]
PIRSFPIRSF038005. Methyl_CpG_bd_MBD4. 1 hit.
SMARTSM00391. MBD. 1 hit.
[Graphical view]
SUPFAMSSF54171. DNA-binding_integrase-type. 1 hit.
SSF48150. DNA_glycsylse. 1 hit.
PROSITEPS50982. MBD. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio33578.
SOURCESearch...

Entry information

Entry nameMBD4_HUMAN
AccessionPrimary (citable) accession number: O95243
Secondary accession number(s): D3DNC3 expand/collapse secondary AC list , D3DNC4, Q7Z4T3, Q96F09
Entry history
Integrated into UniProtKB/Swiss-Prot: July 19, 2004
Last sequence update: May 1, 1999
Last modified: January 25, 2012
This is version 91 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 3

Human chromosome 3: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families