Protein
Submitted name:

DNA mismatch repair protein Msh2

Gene

MSH2

Organism
Homo sapiens (Human)
Status
Unreviewed - Annotation score: 3 out of 5 - Experimental evidence at protein level

Function

GO - Molecular function

GO - Biological process

Keywords - Ligand

DNA-binding (UniRule annotation)

Names & Taxonomy

Protein names
Submitted name:
DNA mismatch repair protein Msh2 (Imported)
Gene names
Name: MSH2 (Imported)
Organism: Homo sapiens (Human) (Imported)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes: UP000005640. Component: Chromosome 2

Organism-specific databases

HGNC: HGNC:7325. MSH2.

Subcellular location

GO - Cellular component

PTM / Processing

Proteomic databases

PRIDE: E9PHA6.

Expression

Gene expression databases

Bgee: E9PHA6.
ExpressionAtlas: E9PHA6. baseline and differential.

Interaction

Protein-protein interaction databases

STRING: 9606.ENSP00000233146.

Structure

3D structure databases

ProteinModelPortal: E9PHA6.
SMR: E9PHA6. Positions 1-854.

Family & Domains

Sequence similarities

Belongs to the DNA mismatch repair MutS family. (UniRule annotation)

Phylogenomic databases

GeneTree: ENSGT00550000074867.
KO: K08735.

Family and domain databases

Gene3D: 3.40.50.300. 1 hit.
InterPro: IPR011184. DNA_mismatch_repair_MutS.
    IPR007695. DNA_mismatch_repair_MutS-lik_N.
    IPR000432. DNA_mismatch_repair_MutS_C.
    IPR007861. DNA_mismatch_repair_MutS_clamp.
    IPR007696. DNA_mismatch_repair_MutS_core.
    IPR007860. DNA_mmatch_repair_MutS_con_dom.
    IPR027417. P-loop_NTPase.
Pfam: PF01624. MutS_I. 1 hit.
    PF05188. MutS_II. 1 hit.
    PF05192. MutS_III. 1 hit.
    PF05190. MutS_IV. 1 hit.
    PF00488. MutS_V. 1 hit.
PIRSF: PIRSF005813. MSH2. 1 hit.
SMART: SM00534. MUTSac. 1 hit.
    SM00533. MUTSd. 1 hit.
SUPFAM: SSF48334. SSF48334. 1 hit.
    SSF52540. SSF52540. 1 hit.
PROSITE: PS00486. DNA_MISMATCH_REPAIR_2. 1 hit.

Sequence

Sequence status: Complete.

E9PHA6-1 [UniParc]

        10         20         30         40         50
MAVQPKETLQ LESAAEVGFV RFFQGMPEKP TTTVRLFDRG DFYTAHGEDA
        60         70         80         90        100
LLAAREVFKT QGVIKYMGPA GAKNLQSVVL SKMNFESFVK DLLLVRQYRV
       110        120        130        140        150
EVYKNRAGNK ASKENDWYLA YKASPGNLSQ FEDILFGNND MSASIGVVGV
       160        170        180        190        200
KMSAVDGQRQ VGVGYVDSIQ RKLGLCEFPD NDQFSNLEAL LIQIGPKECV
       210        220        230        240        250
LPGGETAGDM GKLRQIIQRG GILITERKKA DFSTKDIYQD LNRLLKGKKG
       260        270        280        290        300
EQMNSAVLPE MENQVAVSSL SAVIKFLELL SDDSNFGQFE LTTFDFSQYM
       310        320        330        340        350
KLDIAAVRAL NLFQGSVEDT TGSQSLAALL NKCKTPQGQR LVNQWIKQPL
       360        370        380        390        400
MDKNRIEERL NLVEAFVEDA ELRQTLQEDL LRRFPDLNRL AKKFQRQAAN
       410        420        430        440        450
LQDCYRLYQG INQLPNVIQA LEKHEGKHQK LLLAVFVTPL TDLRSDFSKF
       460        470        480        490        500
QEMIETTLDM DQVENHEFLV KPSFDPNLSE LREIMNDLEK KMQSTLISAA
       510        520        530        540        550
RDLGLDPGKQ IKLDSSAQFG YYFRVTCKEE KVLRNNKNFS TVDIQKNGVK
       560        570        580        590        600
FTNSKLTSLN EEYTKNKTEY EEAQDAIVKE IVNISSGYVE PMQTLNDVLA
       610        620        630        640        650
QLDAVVSFAH VSNGAPVPYV RPAILEKGQG RIILKASRHA CVEVQDEIAF
       660        670        680        690        700
IPNDVYFEKD KQMFHIITGP NMGGKSTYIR QTGVIVLMAQ IGCFVPCESA
       710        720        730        740        750
EVSIVDCILA RVGAGDSQLK GVSTFMAEML ETASILRSAT KDSLIIIDEL
       760        770        780        790        800
GRGTSTYDGF GLAWAISEYI ATKIGAFCMF ATHFHELTAL ANQIPTVNNL
       810        820        830        840        850
HVTALTTEET LTMLYQVKKG VCDQSFGIHV AELANFPKHV IECAKQKALE
       860        870        880        890        900
LEEFQYIGES QGYDIMEPAA KKCYLERENL RVTEPKDQCL ILLTWKRKLR
       910        920
GGKRSACSRP ERQNQGSGEL S
Length: 921
Mass (Da): 103,187
Last modified: April 5, 2011 - v1
Checksum: 21E3F502FDB212CC
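
The sequence display above interleaves position rulers with blocks of ten residues, so it cannot be pasted directly into most tools. As a minimal sketch, the snippet below flattens that layout into a plain residue string. Only the first two sequence rows are embedded to keep the example short; `RAW_ROWS` and `flatten_sequence` are illustrative names and are not part of the entry.

```python
import re

# First two rows of the sequence display, with their position rulers.
# (Illustrative excerpt only; the full entry has 921 residues.)
RAW_ROWS = """
        10         20         30         40         50
MAVQPKETLQ LESAAEVGFV RFFQGMPEKP TTTVRLFDRG DFYTAHGEDA
60 70 80 90 100
LLAAREVFKT QGVIKYMGPA GAKNLQSVVL SKMNFESFVK DLLLVRQYRV
"""


def flatten_sequence(text: str) -> str:
    """Drop ruler digits and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


sequence = flatten_sequence(RAW_ROWS)
print(len(sequence))   # number of residues recovered from the excerpt
print(sequence[:10])   # first ten residues
```

Applied to all rows of the display, the same function should yield a string whose length can be compared against the Length field as a quick integrity check.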

Sequence databases

EMBL/GenBank/DDBJ: AC079775 Genomic DNA. No translation available.
    AC138655 Genomic DNA. No translation available.

Genome annotation databases

Ensembl: ENST00000406134; ENSP00000384199; ENSG00000095002.
KEGG: hsa:4436.
UCSC: uc002rvz.4. human.

Cross-references

Organism-specific databases

CTD: 4436.
HGNC: HGNC:7325. MSH2.

Miscellaneous databases

ChiTaRS: MSH2. human.
GenomeRNAi: 4436.
NextBio: 35502032.

Publications

  1. "Generation and annotation of the DNA sequences of human chromosomes 2 and 4."
    Hillier L.W., Graves T.A., Fulton R.S., Fulton L.A., Pepin K.H., Minx P., Wagner-McPherson C., Layman D., Wylie K., Sekhon M., Becker M.C., Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Kremitzki C., Oddy L., Du H., Sun H., Bradshaw-Cordum H., Ali J., Carter J., Cordes M., Harris A., Isak A., van Brunt A., Nguyen C., Du F., Courtney L., Kalicki J., Ozersky P., Abbott S., Armstrong J., Belter E.A., Caruso L., Cedroni M., Cotton M., Davidson T., Desai A., Elliott G., Erb T., Fronick C., Gaige T., Haakenson W., Haglund K., Holmes A., Harkins R., Kim K., Kruchowski S.S., Strong C.M., Grewal N., Goyea E., Hou S., Levy A., Martinka S., Mead K., McLellan M.D., Meyer R., Randall-Maher J., Tomlinson C., Dauphin-Kohlberg S., Kozlowicz-Reilly A., Shah N., Swearengen-Shahid S., Snider J., Strong J.T., Thompson J., Yoakum M., Leonard S., Pearman C., Trani L., Radionenko M., Waligorski J.E., Wang C., Rock S.M., Tin-Wollam A.-M., Maupin R., Latreille P., Wendl M.C., Yang S.-P., Pohl C., Wallis J.W., Spieth J., Bieri T.A., Berkowicz N., Nelson J.O., Osborne J., Ding L., Meyer R., Sabo A., Shotland Y., Sinha P., Wohldmann P.E., Cook L.L., Hickenbotham M.T., Eldred J., Williams D., Jones T.A., She X., Ciccarelli F.D., Izaurralde E., Taylor J., Schmutz J., Myers R.M., Cox D.R., Huang X., McPherson J.D., Mardis E.R., Clifton S.W., Warren W.C., Chinwalla A.T., Eddy S.R., Marra M.A., Ovcharenko I., Furey T.S., Miller W., Eichler E.E., Bork P., Suyama M., Torrents D., Waterston R.H., Wilson R.K.
    Nature 434:724-731(2005)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  2. Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
  3. "Lysine acetylation targets protein complexes and co-regulates major cellular functions."
    Choudhary C., Kumar C., Gnad F., Nielsen M.L., Rehman M., Walther T.C., Olsen J.V., Mann M.
    Science 325:834-840(2009)
    Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
  4. Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
  5. Ensembl
    Submitted (JUL-2011) to UniProtKB
    Cited for: IDENTIFICATION.
  6. Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].

Entry information

Entry name: E9PHA6_HUMAN
Accession: Primary (citable) accession number: E9PHA6
Entry history
Integrated into UniProtKB/TrEMBL: April 5, 2011
Last sequence update: April 5, 2011
Last modified: June 24, 2015
This is version 29 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.
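
For readers handling this record programmatically, the entry name listed above can be split into its accession-like prefix and species mnemonic. The sketch below does that and checks the prefix against the accession pattern described in UniProt's help pages; the pattern is reproduced here as an assumption and should be checked against the current documentation. `ACCESSION_RE` and `split_entry_name` are hypothetical names introduced for illustration.

```python
import re

# Accession pattern as described in UniProt's help pages (assumption:
# reproduced from memory, verify against the current documentation).
ACCESSION_RE = re.compile(
    r"^(?:[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2})$"
)


def split_entry_name(entry_name: str) -> tuple[str, str]:
    """Split 'PREFIX_SPECIES' at the first underscore."""
    prefix, _, mnemonic = entry_name.partition("_")
    return prefix, mnemonic


prefix, species = split_entry_name("E9PHA6_HUMAN")
is_accession = bool(ACCESSION_RE.match(prefix))
print(prefix, species, is_accession)
```

In this unreviewed (TrEMBL) entry the prefix is the same string as the primary accession; reviewed entries generally use a protein mnemonic instead, so the pattern check is what tells the two cases apart.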

Miscellaneous

Caution

The sequence shown here is derived from an Ensembl automatic analysis pipeline and should be considered as preliminary data. (Imported)

Keywords - Technical term

Complete proteome, Proteomics identification (Combined sources), Reference proteome (Imported)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
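
The UniRef90 and UniRef50 membership rules described above reduce to two thresholds, one on identity and one on overlap with the seed sequence. The following is a minimal sketch of that test only, not of the UniRef clustering procedure itself. It assumes identity and overlap have already been computed by some alignment step and are given as fractions between 0 and 1; `meets_uniref_threshold` is a hypothetical helper name.

```python
# Thresholds taken from the descriptions above: (min identity, min overlap).
UNIREF_THRESHOLDS = {
    90: (0.90, 0.80),
    50: (0.50, 0.80),
}


def meets_uniref_threshold(identity: float, overlap: float, level: int) -> bool:
    """Return True if a sequence could join a cluster at the given level.

    identity: fraction of identical residues relative to the seed sequence
    overlap:  fraction of the seed sequence covered by the alignment
    level:    90 for UniRef90, 50 for UniRef50
    """
    min_identity, min_overlap = UNIREF_THRESHOLDS[level]
    return identity >= min_identity and overlap >= min_overlap


print(meets_uniref_threshold(0.93, 0.85, 90))
print(meets_uniref_threshold(0.93, 0.70, 90))
```

A sequence that is 93% identical but covers only 70% of the seed fails the UniRef90 test, which shows why the overlap requirement matters for fragments.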