
Q8K4Q6 (NEIL1_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 98.


Names and origin

Protein names
Recommended name:
Endonuclease 8-like 1

EC=3.2.2.-
EC=4.2.99.18
Alternative name(s):
DNA glycosylase/AP lyase Neil1
DNA-(apurinic or apyrimidinic site) lyase Neil1
Endonuclease VIII-like 1
Nei homolog 1
Short name=NEH1
Nei-like protein 1
Gene names
Name: Neil1
Synonyms: Nei1
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 389 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

Involved in base excision repair of DNA damaged by oxidation or by mutagenic agents. Acts as DNA glycosylase that recognizes and removes damaged bases. Has a preference for oxidized pyrimidines, such as thymine glycol, formamidopyrimidine (Fapy) and 5-hydroxyuracil. Has marginal activity towards 8-oxoguanine. Has AP (apurinic/apyrimidinic) lyase activity and introduces nicks in the DNA strand. Cleaves the DNA backbone by beta-delta elimination to generate a single-strand break at the site of the removed base with both 3'- and 5'-phosphates. Has DNA glycosylase/lyase activity towards mismatched uracil and thymine, in particular in U:C and T:C mismatches. Specifically binds 5-hydroxymethylcytosine (5hmC), suggesting that it acts as a specific reader of 5hmC. Ref.1 Ref.4

Catalytic activity

Removes damaged bases from DNA, leaving an abasic site.

The C-O-P bond 3' to the apurinic or apyrimidinic site in DNA is broken by a beta-elimination reaction, leaving a 3'-terminal unsaturated sugar and a product with a terminal 5'-phosphate.

Subcellular location

Cytoplasm > cytoskeleton > microtubule organizing center > centrosome. Nucleus. Chromosome. Note: During mitosis, associates with centrosomes and condensed chromatin (By similarity).

Tissue specificity

Detected in heart, spleen and lung. Ref.1

Induction

Up-regulated during S-phase (By similarity).

Sequence similarities

Belongs to the FPG family.

Sequence caution

The sequence BAB28790.1 differs from that shown. Reason: Frameshift at position 344.

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q8K4Q6-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q8K4Q6-2)

The sequence of this isoform differs from the canonical sequence as follows:
     241-389: GKGYGPERGE...PREAGESSAS → EAWGGQDGRRPLP
Note: No experimental confirmation available.
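
The replacement rule above can be applied mechanically to the canonical sequence. The following is a minimal sketch, not part of the UniProt entry: the helper name `apply_alternative_sequence` is hypothetical, the canonical sequence is copied from the Sequences section of this entry, and positions are assumed to be 1-based and inclusive, as they are throughout the entry.

```python
# Canonical sequence (isoform 1, Q8K4Q6-1), copied from the Sequences section.
CANONICAL = (
    "MPEGPELHLASHFVNETCKGLVFGGCVEKSSVSRNPEVPFESSAYHISALARGKELRLTL"
    "SPLPGSQPPQKPLSLVFRFGMSGSFQLVPAEALPRHAHLRFYTAPPAPRLALCFVDIRRF"
    "GHWDPGGEWQPGRGPCVLLEYERFRENVLRNLSDKAFDRPICEALLDQRFFNGIGNYLRA"
    "EILYRLKIPPFEKARTVLEALQQCRPSPELTLSQKIKAKLQNPDLLELCHLVPKEVVQLG"
    "GKGYGPERGEEDFAAFRAWLRCYGVPGMSSLRDRHGRTIWFQGDPGPLAPKGGRSQKKKS"
    "QETQLGAEDRKEDLPLSSKSVSRMRRARKHPPKRIAQQSEGAGLQQNQETPTAPEKGKRR"
    "GQRASTGHRRRPKTIPDTRPREAGESSAS"
)


def apply_alternative_sequence(seq, start, end, replacement):
    """Replace 1-based inclusive positions start..end of seq (hypothetical helper)."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("positions out of range for this sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return seq[:start - 1] + replacement + seq[end:]


# Isoform 2 (Q8K4Q6-2): residues 241-389 are replaced by a 13-residue tail.
isoform2 = apply_alternative_sequence(CANONICAL, 241, 389, "EAWGGQDGRRPLP")
print(len(CANONICAL), len(isoform2))
```

The derived sequence keeps residues 1-240 and appends the 13-residue replacement, giving 253 residues, which agrees with the length listed for isoform 2 under Sequences.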

Sequence annotation (Features)

Feature key           Position(s)  Length  Description                                             Feature identifier

Molecule processing

Initiator methionine  1            1       Removed (By similarity)
Chain                 2 – 389      388     Endonuclease 8-like 1                                   PRO_0000170906

Sites

Active site           2            1       Schiff-base intermediate with DNA (Probable)
Active site           3            1       Proton donor (Probable)
Active site           54           1       Proton donor; for beta-elimination activity (Probable)
Binding site          176          1       DNA (By similarity)
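
Feature positions in this entry are 1-based and refer to the canonical sequence, so looking up the annotated residues needs an off-by-one adjustment in most programming languages. The sketch below is illustrative only: `residue_at` and `feature_length` are hypothetical helper names, and the sequence prefix is copied from the Sequences section.

```python
# First 180 residues of the canonical sequence (isoform 1), copied from the
# Sequences section; this prefix covers every site annotated above.
CANONICAL_PREFIX = (
    "MPEGPELHLASHFVNETCKGLVFGGCVEKSSVSRNPEVPFESSAYHISALARGKELRLTL"
    "SPLPGSQPPQKPLSLVFRFGMSGSFQLVPAEALPRHAHLRFYTAPPAPRLALCFVDIRRF"
    "GHWDPGGEWQPGRGPCVLLEYERFRENVLRNLSDKAFDRPICEALLDQRFFNGIGNYLRA"
)

# (position, description) pairs taken from the Sites table.
SITES = [
    (2, "Active site: Schiff-base intermediate with DNA"),
    (3, "Active site: proton donor"),
    (54, "Active site: proton donor for beta-elimination activity"),
    (176, "Binding site: DNA"),
]


def residue_at(seq, position):
    """Return the residue at a 1-based position (hypothetical helper)."""
    if not 1 <= position <= len(seq):
        raise IndexError("position outside the sequence")
    return seq[position - 1]


def feature_length(start, end):
    """Length of a feature spanning 1-based inclusive positions start..end."""
    return end - start + 1


for position, description in SITES:
    print(position, residue_at(CANONICAL_PREFIX, position), description)

# Mature chain after removal of the initiator methionine (positions 2-389).
print(feature_length(2, 389))
```

With the sequence as displayed in this entry, positions 2, 3, 54 and 176 fall on P, E, K and N respectively, and the chain 2 – 389 spans 388 residues, matching the Length column.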

Natural variations

Alternative sequence  241 – 389    149     GKGYG…ESSAS → EAWGGQDGRRPLP in isoform 2.               VSP_012207

Sequences

Isoform 1 [UniParc].

Last modified January 23, 2007. Version 3.
Checksum: E3F1A21DDE2CB5AD
Length: 389 AA
Mass: 43,586 Da
        10         20         30         40         50         60 
MPEGPELHLA SHFVNETCKG LVFGGCVEKS SVSRNPEVPF ESSAYHISAL ARGKELRLTL 

        70         80         90        100        110        120 
SPLPGSQPPQ KPLSLVFRFG MSGSFQLVPA EALPRHAHLR FYTAPPAPRL ALCFVDIRRF 

       130        140        150        160        170        180 
GHWDPGGEWQ PGRGPCVLLE YERFRENVLR NLSDKAFDRP ICEALLDQRF FNGIGNYLRA 

       190        200        210        220        230        240 
EILYRLKIPP FEKARTVLEA LQQCRPSPEL TLSQKIKAKL QNPDLLELCH LVPKEVVQLG 

       250        260        270        280        290        300 
GKGYGPERGE EDFAAFRAWL RCYGVPGMSS LRDRHGRTIW FQGDPGPLAP KGGRSQKKKS 

       310        320        330        340        350        360 
QETQLGAEDR KEDLPLSSKS VSRMRRARKH PPKRIAQQSE GAGLQQNQET PTAPEKGKRR 

       370        380 
GQRASTGHRR RPKTIPDTRP REAGESSAS 
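
The display format above interleaves ruler lines (position numbers) with rows of ten-residue blocks, so it has to be cleaned before the sequence can be used programmatically. The following is a minimal sketch under that assumption; `parse_display_block` is a hypothetical helper name, and only the first two rows are embedded to keep the example short.

```python
# First two rows of the displayed sequence, including their ruler lines.
DISPLAY_EXCERPT = """
        10         20         30         40         50         60
MPEGPELHLA SHFVNETCKG LVFGGCVEKS SVSRNPEVPF ESSAYHISAL ARGKELRLTL

        70         80         90        100        110        120
SPLPGSQPPQ KPLSLVFRFG MSGSFQLVPA EALPRHAHLR FYTAPPAPRL ALCFVDIRRF
"""


def parse_display_block(text):
    """Strip ruler lines and spacing from a displayed sequence block."""
    rows = []
    for line in text.splitlines():
        compact = "".join(line.split())  # drop all whitespace in the line
        # Skip blank lines and ruler lines, which contain only digits.
        if not compact or compact.isdigit():
            continue
        rows.append(compact)
    return "".join(rows)


sequence = parse_display_block(DISPLAY_EXCERPT)
print(len(sequence))
```

Run over all seven rows of the block, the same function should yield the 389 residues stated for isoform 1.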


Isoform 2 [UniParc].

Checksum: 8BE7DE61651A166A
Length: 253 AA
Mass: 28,453 Da

References

[1] "A back-up glycosylase in Nth1 knock-out mice is a functional Nei (endonuclease VIII) homologue."
Takao M., Kanno S., Kobayashi K., Zhang Q.-M., Yonei S., van der Horst G.T.J., Yasui A.
J. Biol. Chem. 277:42205-42213(2002)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), FUNCTION, TISSUE SPECIFICITY.
Tissue: Liver.
[2] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Strain: C57BL/6J.
Tissue: Embryo.
[3] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Strain: FVB/N.
Tissue: Mammary gland.
[4] "Dynamic readers for 5-(hydroxy)methylcytosine and its oxidized derivatives."
Spruijt C.G., Gnerlich F., Smits A.H., Pfaffeneder T., Jansen P.W., Bauer C., Munzel M., Wagner M., Muller M., Khan F., Eberl H.C., Mensinga A., Brinkman A.B., Lephikov K., Muller U., Walter J., Boelens R., van Ingen H., Leonhardt H., Carell T., Vermeulen M.
Cell 152:1146-1159(2013)
Cited for: FUNCTION.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
    AB079069 mRNA. Translation: BAC06477.1.
    AK013322 mRNA. Translation: BAB28790.1. Frameshift.
    BC043297 mRNA. Translation: AAH43297.1.
RefSeq:
    NP_082623.1. NM_028347.2.
    XP_006511547.1. XM_006511484.1.
    XP_006511548.1. XM_006511485.1.
    XP_006511549.1. XM_006511486.1.
UniGene:
    Mm.35749.
    Mm.488747.

3D structure databases

ProteinModelPortal: Q8K4Q6.
SMR: Q8K4Q6. Positions 2-290.

Protein-protein interaction databases

STRING: 10090.ENSMUSP00000034842.

PTM databases

PhosphoSite: Q8K4Q6.

Proteomic databases

PRIDE: Q8K4Q6.

Genome annotation databases

Ensembl: ENSMUST00000034842; ENSMUSP00000034842; ENSMUSG00000032298. [Q8K4Q6-1]
GeneID: 72774.
KEGG: mmu:72774.
UCSC:
    uc009puh.1. mouse. [Q8K4Q6-1]
    uc009puk.1. mouse. [Q8K4Q6-2]

Organism-specific databases

CTD: 79661.
MGI: MGI:1920024. Neil1.

Phylogenomic databases

eggNOG: NOG75119.
GeneTree: ENSGT00390000016671.
HOGENOM: HOG000067872.
HOVERGEN: HBG052592.
InParanoid: Q8K4Q6.
KO: K10567.
OMA: HLASHFV.
OrthoDB: EOG7H1JM2.
PhylomeDB: Q8K4Q6.
TreeFam: TF333272.

Gene expression databases

Bgee: Q8K4Q6.
CleanEx: MM_NEIL1.
Genevestigator: Q8K4Q6.

Family and domain databases

InterPro:
    IPR015886. DNA_glyclase/AP_lyase_DNA-bd.
    IPR012319. DNA_glycosylase/AP_lyase_cat.
    IPR015371. Endonuclease-VIII_DNA-bd.
    IPR010979. Ribosomal_S13-like_H2TH.
Pfam:
    PF01149. Fapy_DNA_glyco. 1 hit.
    PF06831. H2TH. 1 hit.
    PF09292. Neil1-DNA_bind. 1 hit.
SMART:
    SM00898. Fapy_DNA_glyco. 1 hit.
SUPFAM:
    SSF46946. SSF46946. 1 hit.
    SSF81624. SSF81624. 1 hit.
PROSITE:
    PS51068. FPG_CAT. 1 hit.

Other

NextBio: 336897.
PRO: Q8K4Q6.

Entry information

Entry name: NEIL1_MOUSE
Accession
Primary (citable) accession number: Q8K4Q6
Secondary accession number(s): Q80V58, Q9CYT9
Entry history
Integrated into UniProtKB/Swiss-Prot: December 7, 2004
Last sequence update: January 23, 2007
Last modified: April 16, 2014
This is version 98 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments: Index of protein domains and families

MGD cross-references: Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot