Q80TG1 (KANL1_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified March 19, 2014. Version 71.

Names and origin

Protein names
Recommended name: KAT8 regulatory NSL complex subunit 1
Alternative name(s): NSL complex protein NSL1; Non-specific lethal 1 homolog
Gene names
Name: Kansl1
Synonyms: Kiaa1267, Nsl1
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 1036 AA.
Sequence status: Complete.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

As part of the NSL complex, it is involved in acetylation of nucleosomal histone H4 on several lysine residues and may therefore be involved in the regulation of transcription (by similarity).

Subunit structure

Component of some MLL1/MLL complex, at least composed of the core components KMT2A/MLL1, ASH2L, HCFC1, WDR5 and RBBP5, as well as the facultative components BAP18, CHD8, E2F6, HSP70, INO80C, KANSL1, LAS1L, MAX, MCRS1, MGA, KAT8/MOF, PELP1, PHF20, PRP31, RING2, RUVB1/TIP49A, RUVB2/TIP49B, SENP3, TAF1, TAF4, TAF6, TAF7, TAF9 and TEX10. Component of the NSL complex, at least composed of MOF/KAT8, KANSL1, KANSL2, KANSL3, MCRS1, PHF20, OGT1/OGT, WDR5 and HCFC1. Interacts with KAT8; the interaction is direct (by similarity).

Subcellular location

Nucleus (by similarity).

Alternative products

This entry describes 5 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q80TG1-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q80TG1-2)

The sequence of this isoform differs from the canonical sequence as follows:
     615-643: HRNVRSGCDVNPSCALCGSGSVNTMPPEI → GAQRTGLRSALILSRVGEPPSSPVNLQNY
     644-1036: Missing.
Note: No experimental confirmation available.
Isoform 3 (identifier: Q80TG1-3)

The sequence of this isoform differs from the canonical sequence as follows:
     478-510: Missing.
     615-643: HRNVRSGCDVNPSCALCGSGSVNTMPPEI → GAQRTGLRSALILSRVGEPPSSPVNLQNY
     644-1036: Missing.
Note: No experimental confirmation available.
Isoform 4 (identifier: Q80TG1-4)

The sequence of this isoform differs from the canonical sequence as follows:
     431-1036: Missing.
Note: No experimental confirmation available.
Isoform 5 (identifier: Q80TG1-5)

The sequence of this isoform differs from the canonical sequence as follows:
     1-770: Missing.
     771-779: GSQVAASTS → MCSLRSWNQ
Note: No experimental confirmation available.
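
The isoform descriptions above are coordinate edits against the canonical sequence, so isoform lengths can be derived by simple arithmetic. The following is a minimal sketch, not UniProt's own tooling: the names `ISOFORM_EDITS`, `isoform_length` and `apply_edits` are hypothetical, and the coordinates and replacement strings are copied from the isoform list above.

```python
# Canonical length and per-isoform edits copied from the entry text.
# Each edit is (start, end, replacement) in 1-based inclusive coordinates;
# an empty replacement stands for "Missing".
CANONICAL_LENGTH = 1036
SWAP_615_643 = "GAQRTGLRSALILSRVGEPPSSPVNLQNY"

ISOFORM_EDITS = {
    "Q80TG1-2": [(615, 643, SWAP_615_643), (644, 1036, "")],
    "Q80TG1-3": [(478, 510, ""), (615, 643, SWAP_615_643), (644, 1036, "")],
    "Q80TG1-4": [(431, 1036, "")],
    "Q80TG1-5": [(1, 770, ""), (771, 779, "MCSLRSWNQ")],
}


def isoform_length(canonical_length, edits):
    """Length after applying the edits: add each replacement, subtract each span."""
    length = canonical_length
    for start, end, replacement in edits:
        length += len(replacement) - (end - start + 1)
    return length


def apply_edits(canonical, edits):
    """Apply non-overlapping edits to a sequence string.

    Edits are applied from the highest start position down so that earlier
    coordinates stay valid while the string changes length.
    """
    seq = canonical
    for start, end, replacement in sorted(edits, reverse=True):
        seq = seq[:start - 1] + replacement + seq[end:]
    return seq


for isoform_id, edits in ISOFORM_EDITS.items():
    print(isoform_id, isoform_length(CANONICAL_LENGTH, edits))
```

Passing the full canonical sequence string to `apply_edits` would yield the isoform sequence itself; the length-only function is enough to cross-check the lengths listed in the Sequences section.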

Sequence annotation (Features)

Feature key           Position(s)   Length  Feature identifier  Description

Molecule processing

Chain                 1 – 1036      1036    PRO_0000234566      KAT8 regulatory NSL complex subunit 1

Regions

Region                781 – 813     33      -                   Required for activation of KAT8 histone acetyltransferase activity (by similarity)
Region                814 – 1036    223     -                   Sufficient for interaction with KAT8 (by similarity)
Coiled coil           285 – 312     28      -                   Potential

Amino acid modifications

Modified residue      104           1       -                   N6-acetyllysine (by similarity)
Modified residue      249           1       -                   Phosphoserine (by similarity)
Modified residue      268           1       -                   Phosphoserine (by similarity)
Modified residue      922           1       -                   Phosphoserine (by similarity)
Modified residue      934           1       -                   Phosphothreonine (by similarity)
Modified residue      976           1       -                   Phosphoserine (by similarity)

Natural variations

Alternative sequence  1 – 770       770     VSP_018360          Missing in isoform 5.
Alternative sequence  431 – 1036    606     VSP_018361          Missing in isoform 4.
Alternative sequence  478 – 510     33      VSP_018362          Missing in isoform 3.
Alternative sequence  615 – 643     29      VSP_018363          HRNVR…MPPEI → GAQRTGLRSALILSRVGEPPSSPVNLQNY in isoform 2 and isoform 3.
Alternative sequence  644 – 1036    393     VSP_018364          Missing in isoform 2 and isoform 3.
Alternative sequence  771 – 779     9       VSP_018365          GSQVAASTS → MCSLRSWNQ in isoform 5.

Experimental info

Sequence conflict     197           1       -                   D → N in BAE36437. Ref.2
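
The positions and lengths in the feature table follow the usual 1-based inclusive convention (length = end - start + 1), and the same coordinates show whether an annotated region falls inside a span removed in an isoform. This is a small illustrative sketch; `span_length`, `overlaps` and the dictionary names are hypothetical helpers, and an overlap here is purely coordinate arithmetic, not an experimental statement about any isoform.

```python
def span_length(start, end):
    """Length of a 1-based inclusive span."""
    return end - start + 1


def overlaps(feature, removed):
    """True if two 1-based inclusive spans share at least one residue."""
    return feature[0] <= removed[1] and removed[0] <= feature[1]


# Spans copied from the Regions rows of the feature table.
FEATURES = {
    "KAT8 activation region": (781, 813),
    "KAT8 interaction region": (814, 1036),
    "Coiled coil": (285, 312),
}

# Spans listed as "Missing" for two of the isoforms.
MISSING = {
    "isoform 4": (431, 1036),
    "isoform 5": (1, 770),
}

for name, span in FEATURES.items():
    affected = [iso for iso, gap in MISSING.items() if overlaps(span, gap)]
    print(name, span_length(*span), affected)
```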

Sequences

Isoform 1 [UniParc].

Last modified June 1, 2003. Version 1.
Length: 1,036 AA. Mass: 113,179 Da.
Checksum: F619B60DEEE93A62

        10         20         30         40         50         60 
MAAMAPALTD AAAEAHHIRF KLAPPSSTLS PGSAENNGNA NILISANGTK RKAIAAEDPS 

        70         80         90        100        110        120 
LDFRNNPTKE DLGKLQPLVA SYLCSDVTSV PAKESLKLQG VFSKQTVLKS HPLLSQSYEL 

       130        140        150        160        170        180 
RAELLGRQPV LEFSLENLRT MNTSGQTALP QAPVNGLAKK LTKSSTHSDH DNSSSLNGGK 

       190        200        210        220        230        240 
RSLTSSSLQG GEVGGPDSGN LKGGMTNCTL PHRSLDIQHT TLYSNNSTAN KSSVNSMDQP 

       250        260        270        280        290        300 
ALQGSSRLSP STDSSSNLTN VKLEVKKSPL SSILFSALDS DTRITALLRR QADIEIRARR 

       310        320        330        340        350        360 
LQKRLQVVQA KQVERHLQHQ LGGFLETTLS KLPNLESLRS RSQLMLTRKA EAALRKAASE 

       370        380        390        400        410        420 
SATSEGLSNF LKSDSISEEL ERFTASGIAN LRCSEQAFDS DVTDSSSGGE SDIEEEELTR 

       430        440        450        460        470        480 
ADPEQCHVPL KRRSEWRWAA DRAAIVSRWN WLQAHVSDLE YRIRQQTDIY KQIRANKGLI 

       490        500        510        520        530        540 
VLGEAPFPDH TTDLLSLSSE VKTDHGRDKL IESVSQPSEN HGILVSNITE SLSTKSCGAP 

       550        560        570        580        590        600 
RPVNGVVNSL QPVLADQVPG DSSDAEEQLH KKQRLNLVSS SDGTCVAART RPVLTCKKRR 

       610        620        630        640        650        660 
LVRPSSIVPL SKKVHRNVRS GCDVNPSCAL CGSGSVNTMP PEIHYEAPLL ERLSQLDSCV 

       670        680        690        700        710        720 
HPVLAFPDDV PTSLHFQSML KSQWQNKPFD KIKPTKKFSL KHRATMPCSL SDPVRKDRHK 

       730        740        750        760        770        780 
LVNSFLTTAM LKHHTDMSSP SYLTATHHPP HSPLVRQLST SSDTSTPTSS GSQVAASTSQ 

       790        800        810        820        830        840 
PVRRRRGESS FDINNIVIPM SVAATTRVEK LQYKEILTPS WREVDVQSLK GSPDEENEEI 

       850        860        870        880        890        900 
EDLSDAAFAA LHAKCEEMER ARWLWTTSVP PQRRGSRSYR SSDGRTTPQL GSANPSTPQP 

       910        920        930        940        950        960 
ASPDVSSSHS LSEFSHGQSP RSPISPELHS APLTPVARDS LRHLASEDTR CSTPELGLDE 

       970        980        990       1000       1010       1020 
QSVQPWERRT FPLAYSPQAE CEEQLDAQDT AARCTRRTSG SKTGREAEVA PTSPPVVPLK 

      1030 
SRHLAATVTA QRPAHR 
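
The sequence above is shown in display layout: ruler lines with position numbers, and residues in space-separated groups of ten. A minimal sketch for turning that layout into one uninterrupted string follows; `clean_sequence` is a hypothetical helper, and the example text is only an excerpt (the first and last residue lines of the block), not the full sequence.

```python
def clean_sequence(block):
    """Join a display-formatted sequence block into one residue string."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Ruler lines hold only position numbers (10, 20, ...); skip them
        # along with blank lines.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# Excerpt only: first and last residue lines of the display block.
excerpt = """
        10         20         30         40         50         60
MAAMAPALTD AAAEAHHIRF KLAPPSSTLS PGSAENNGNA NILISANGTK RKAIAAEDPS

      1030
SRHLAATVTA QRPAHR
"""

sequence = clean_sequence(excerpt)
print(len(sequence), sequence[:10], sequence[-6:])
```

Run on the complete block, the result should have the 1,036 residues stated for isoform 1, which makes a quick sanity check after copying the sequence out of the page.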

Isoform 2 [UniParc].

Length: 643 AA. Mass: 69,724 Da.
Checksum: 74B228E59E1B3DA6

Isoform 3 [UniParc].

Length: 610 AA. Mass: 66,180 Da.
Checksum: EF19107E16ABE383

Isoform 4 [UniParc].

Length: 430 AA. Mass: 46,184 Da.
Checksum: 832BDE9011CBA181

Isoform 5 [UniParc].

Length: 266 AA. Mass: 29,518 Da.
Checksum: EE0BA4E50B0A6286

References

[1] "Prediction of the coding sequences of mouse homologues of KIAA gene: II. The complete nucleotide sequences of 400 mouse KIAA-homologous cDNAs identified by screening of terminal sequences of cDNA clones randomly sampled from size-fractionated libraries."
Okazaki N., Kikuno R., Ohara R., Inamoto S., Aizawa H., Yuasa S., Nakajima D., Nagase T., Ohara O., Koga H.
DNA Res. 10:35-48(2003)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Tissue: Brain.
[2] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2; 4 AND 5).
Strain: C57BL/6J.
Tissue: Testis and Thymus.
[3] "Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[4] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 3).
Strain: C3H/He, C57BL/6 and FVB/N.
Tissue: Embryonic brain, Mammary gland and Mesenchymal stem cell.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
AK122484 mRNA. Translation: BAC65766.1.
AK006970 mRNA. Translation: BAB24813.1.
AK037800 mRNA. Translation: BAE20523.1.
AK153679 mRNA. Translation: BAE32141.1.
AK161514 mRNA. Translation: BAE36437.1.
AL593843 Genomic DNA. Translation: CAM14793.1.
BC025052 mRNA. Translation: AAH25052.1.
BC043121 mRNA. Translation: AAH43121.2.
BC053389 mRNA. Translation: AAH53389.2.
BC054752 mRNA. Translation: AAH54752.2.
BC079594 mRNA. Translation: AAH79594.1.
RefSeq: NP_001074514.1. NM_001081045.1.
RefSeq: XP_006534516.1. XM_006534453.1.
UniGene: Mm.136418.
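
The EMBL/GenBank/DDBJ lines above share one pattern: nucleotide accession, molecule type, then the translated protein identifier. A minimal sketch for pulling those three fields out of such a line is shown below. The regular expression and the `parse_xref` name are assumptions based only on the lines in this entry (molecule types "mRNA" and "Genomic DNA"); other entries may use values this pattern does not cover.

```python
import re

# Pattern inferred from the cross-reference lines in this entry only.
XREF_LINE = re.compile(
    r"^(?P<accession>\S+) (?P<molecule>mRNA|Genomic DNA)\. "
    r"Translation: (?P<protein_id>\S+)\.$"
)


def parse_xref(line):
    """Return a dict of accession, molecule and protein_id, or None if no match."""
    match = XREF_LINE.match(line.strip())
    return match.groupdict() if match else None


lines = [
    "AK122484 mRNA. Translation: BAC65766.1.",
    "AL593843 Genomic DNA. Translation: CAM14793.1.",
]
for line in lines:
    print(parse_xref(line))
```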

3D structure databases

ProteinModelPortal: Q80TG1.

PTM databases

PhosphoSite: Q80TG1.

Proteomic databases

PaxDb: Q80TG1.
PRIDE: Q80TG1.

Genome annotation databases

Ensembl: ENSMUST00000018556; ENSMUSP00000018556; ENSMUSG00000018412. [Q80TG1-1]
Ensembl: ENSMUST00000106977; ENSMUSP00000102590; ENSMUSG00000018412. [Q80TG1-1]
GeneID: 76719.
KEGG: mmu:76719.
UCSC: uc007lwj.1. mouse. [Q80TG1-1]
UCSC: uc007lwm.1. mouse. [Q80TG1-3]
UCSC: uc007lwn.1. mouse. [Q80TG1-2]
UCSC: uc007lwo.1. mouse. [Q80TG1-4]

Organism-specific databases

CTD: 284058.
MGI: MGI:1923969. Kansl1.

Phylogenomic databases

eggNOG: NOG87566.
GeneTree: ENSGT00530000063688.
HOVERGEN: HBG080054.
OrthoDB: EOG7VQJF6.

Gene expression databases

ArrayExpress: Q80TG1.
Bgee: Q80TG1.
CleanEx: MM_1700081L11RIK.
Genevestigator: Q80TG1.

Family and domain databases

InterPro: IPR026180. NSL1.
PANTHER: PTHR22443. PTHR22443. 1 hit.

Other

NextBio: 345681.
PRO: Q80TG1.

Entry information

Entry name: KANL1_MOUSE
Accession
Primary (citable) accession number: Q80TG1
Secondary accession number(s): A2A5Y5, Q3TT88, Q3U5D8, Q3V3N3, Q7TMU3, Q80XP7, Q8R3L6, Q9D9G0
Entry history
Integrated into UniProtKB/Swiss-Prot: May 16, 2006
Last sequence update: June 1, 2003
Last modified: March 19, 2014
This is version 71 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot