Entry version 154 (07 Oct 2020)
Sequence version 4 (31 Jan 2018)
Protein

Transcription-associated protein 1

Gene

Nipped-A

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed

Annotation score: 5 out of 5

Experimental evidence at protein level

Function

Part of the Tip60 chromatin-remodeling complex which is involved in DNA repair (PubMed:15528408). Upon induction of DNA double-strand breaks, this complex acetylates phosphorylated H2AV in nucleosomes and exchanges it with unmodified H2AV (PubMed:15528408). During wing development, required for activity of Notch and its coactivator mam (PubMed:16508010). Function in promoting mam function is likely to involve both the Tip60 and SAGA complexes (PubMed:16508010).

2 Publications

Miscellaneous

Although strongly related to the PI3/PI4-kinase family, it lacks the typical motifs that constitute the catalytic site of PI3/PI4-kinase proteins, suggesting that it probably lacks such activity.

GO - Molecular function

GO - Biological process

Keywords

Molecular function: Activator, Chromatin regulator
Biological process: Transcription, Transcription regulation

Enzyme and pathway databases

Reactome - a knowledgebase of biological pathways and processes

Reactome
R-DME-201722, Formation of the beta-catenin:TCF transactivating complex
R-DME-5689880, Ub-specific processing proteases

SignaLink: a signaling pathway resource with multi-layered regulatory networks

SignaLink
Q8I8U7

Names & Taxonomy

Protein names
Recommended name:
Transcription-associated protein 1
Alternative name(s):
dTRA1
Gene names
Name: Nipped-A
Synonyms: Tra1
ORF names: CG2905
Organism: Drosophila melanogaster (Fruit fly)
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Holometabola > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora
Proteomes
  • UP000000803 Component: Chromosome 2R

Organism-specific databases

Drosophila genome database

FlyBase
FBgn0053554, Nipped-A

Subcellular location

Keywords - Cellular component

Chromosome, Cytoplasm, Nucleus

PTM / Processing

Molecule processing

Feature key  Identifier      Position(s)  Description                         Length
Chain        PRO_0000088853  1 – 3790     Transcription-associated protein 1  3790

Proteomic databases

PaxDb, a database of protein abundance averages across all three domains of life

PaxDb
Q8I8U7

PRoteomics IDEntifications database

PRIDE
Q8I8U7

Expression

Tissue specificity

Ubiquitous.

1 Publication

Developmental stage

Expressed both maternally and zygotically.

1 Publication

Gene expression databases

Bgee dataBase for Gene Expression Evolution

Bgee
FBgn0053554, Expressed in embryo and 39 other tissues

ExpressionAtlas, Differential and Baseline Expression

ExpressionAtlas
Q8I8U7, baseline and differential

Genevisible search portal to normalized and curated expression data from Genevestigator

Genevisible
Q8I8U7, DM

Interaction

Subunit structure

Component of the Tip60 chromatin-remodeling complex which contains the catalytic subunit Tip60 and the subunits Domino, Tra1, Brd8, E(Pc), DMAP1, Pontin, Reptin, Ing3, Act87E, BAP55, Mrg15, MrgBP, Gas41 and YL-1 (PubMed:15528408). Probable component of some SAGA complex (PubMed:12697829).

Interacts with Spt3, Gcn5, Ada3 and Ada2b (PubMed:12697829).

2 Publications

Protein-protein interaction databases

The Biological General Repository for Interaction Datasets (BioGRID)

BioGRID
61398, 29 interactors

Protein interaction database and analysis system

IntAct
Q8I8U7, 20 interactors

Molecular INTeraction database

MINT
Q8I8U7

STRING: functional protein association networks

STRING
7227.FBpp0085431

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description  Evidence                    Length
Repeat       98 – 136     HEAT 1       Sequence analysis           39
Repeat       335 – 381    HEAT 2       Sequence analysis           47
Repeat       740 – 778    HEAT 3       Sequence analysis           39
Repeat       1185 – 1223  HEAT 4       Sequence analysis           39
Repeat       1332 – 1370  HEAT 5       Sequence analysis           39
Repeat       1826 – 1864  HEAT 6       Sequence analysis           39
Domain       2610 – 3173  FAT          PROSITE-ProRule annotation  564
Domain       3457 – 3756  PI3K/PI4K    PROSITE-ProRule annotation  300
Domain       3758 – 3790  FATC         PROSITE-ProRule annotation  33
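
The length given for each repeat and domain above follows from its positions, which are 1-based and inclusive on both ends (for example, 98 to 136 spans 39 residues). The sketch below is illustrative only: the names `FEATURES`, `span_length` and `check_features` are hypothetical, and the rows are copied by hand from the feature list rather than read from any UniProt interface.

```python
# Feature rows copied from the list above:
# (feature key, start, end, description, listed length)
FEATURES = [
    ("Repeat", 98, 136, "HEAT 1", 39),
    ("Repeat", 335, 381, "HEAT 2", 47),
    ("Repeat", 740, 778, "HEAT 3", 39),
    ("Repeat", 1185, 1223, "HEAT 4", 39),
    ("Repeat", 1332, 1370, "HEAT 5", 39),
    ("Repeat", 1826, 1864, "HEAT 6", 39),
    ("Domain", 2610, 3173, "FAT", 564),
    ("Domain", 3457, 3756, "PI3K/PI4K", 300),
    ("Domain", 3758, 3790, "FATC", 33),
]


def span_length(start: int, end: int) -> int:
    """Number of residues in a span with 1-based, inclusive bounds."""
    return end - start + 1


def check_features(features, sequence_length: int):
    """Return (key, description) for rows whose bounds or length disagree."""
    problems = []
    for key, start, end, description, listed in features:
        in_bounds = 1 <= start <= end <= sequence_length
        if not in_bounds or span_length(start, end) != listed:
            problems.append((key, description))
    return problems


if __name__ == "__main__":
    # 3790 is the canonical sequence length stated in this entry.
    print(check_features(FEATURES, 3790))
```

An empty list means every listed length agrees with its positions and every feature lies within the 3,790-residue canonical sequence.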

Sequence similarities

Belongs to the PI3/PI4-kinase family. TRA1 subfamily. (Curated)

Keywords - Domain

Repeat

Phylogenomic databases

evolutionary genealogy of genes: Non-supervised Orthologous Groups

eggNOG
KOG0889, Eukaryota

Ensembl GeneTree

GeneTree
ENSGT00390000017961

InParanoid: Eukaryotic Ortholog Groups

InParanoid
Q8I8U7

KEGG Orthology (KO)

KO
K08874

Identification of Orthologs from Complete Genome Data

OMA
HLSYLMR

Database for complete collections of gene phylogenies

PhylomeDB
Q8I8U7

Family and domain databases

Integrated resource of protein families, domains and functional sites

InterPro
IPR016024, ARM-type_fold
IPR003152, FATC_dom
IPR011009, Kinase-like_dom_sf
IPR000403, PI3/4_kinase_cat_dom
IPR003151, PIK-rel_kinase_FAT
IPR014009, PIK_FAT
IPR033317, TRA1/TRRAP

The PANTHER Classification System

PANTHER
PTHR11139:SF1, PTHR11139:SF1, 1 hit

Pfam protein domain database

Pfam
PF02259, FAT, 1 hit
PF02260, FATC, 1 hit
PF00454, PI3_PI4_kinase, 1 hit

Simple Modular Architecture Research Tool; a protein domain database

SMART
SM01343, FATC, 1 hit
SM00146, PI3Kc, 1 hit

Superfamily database of structural and functional annotation

SUPFAM
SSF48371, SSF48371, 2 hits
SSF56112, SSF56112, 1 hit

PROSITE; a protein domain and family database

PROSITE
PS51189, FAT, 1 hit
PS51190, FATC, 1 hit
PS50290, PI3_4_KINASE_3, 1 hit

Sequences (2)

Sequence status: Complete.

This entry describes 2 isoforms produced by alternative splicing.
Isoform E (Imported) (identifier: Q8I8U7-1)

This isoform has been chosen as the canonical sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
        10         20         30         40         50
MSVIENVPVN TFRNYLNILN DSSSKDELKL KATQELSEHF EMIMQSPAYP
60 70 80 90 100
SFLDNSLKIF MRILQDGEPQ FIQENTMQHI RKLILEMIHR LPITESLRQH
110 120 130 140 150
VKTIITMMLK ILKTDNEENV LVCLRIIIEL HKHFRPSFNS EIQLFLGFVK
160 170 180 190 200
EIYTNLPNHL TSIFETSNDV WVTDLKDLNL EVLLSESYSV RTIHVEKALD
210 220 230 240 250
SNSQQQIYNL LPRGILSLKV LQELPIIVVL MYQIYKNAVH QEVSEFIPLI
260 270 280 290 300
LTTINLQPTV TRRNSPQKEI YVEFMGAQIK TLSFLAYIVR IFQEVVIASS
310 320 330 340 350
LSVTSGMLNL MKNCPKEAAH LRKELLIAAR HIFATDLRQK FIPSIEQLFD
360 370 380 390 400
EDLLIGKGVT LDSIRPLAYS TLADLAHHVR QSLNIDVLIK AVNLFSKNVH
410 420 430 440 450
DESLAVGIQT MSCKLLLNLV DCLRHHSETE PQRSKALLSK LLKVFVKKFE
460 470 480 490 500
TIAKIQLPLI IQKCKGHAFS GALVNSSGNA SLSHINAPDL KDDISNIQVS
510 520 530 540 550
ASGSQWIYSV NVAEFRSLVK TLVGGVKTIT WGFFNSKFQL TDTKLANHEK
560 570 580 590 600
IFGPEIVCSY IDLVYYAMEA LDIYTINVNP NQQRTSGLIS RSKEEKEVLE
610 620 630 640 650
HFSGIFLMMH SQNFQEIFST TINFLVERIY KNQSLQVIAN SFLANPTTSP
660 670 680 690 700
LFATVLVEYL LNKMEEMGSN LERSNLYLRL FKLVFGSVSL FPVENEQMLR
710 720 730 740 750
PHLHKIVNRS MELALISEEP YNYFLLLRAL FRSIGGGSHD LLYQEFLPLL
760 770 780 790 800
PNLLEGLNRL QSGFHKQHMR DLFVELCLTV PVRLSSLLPY LPMLMDPLVS
810 820 830 840 850
ALNGSPTLIS QGLRTLELCV DNLQPDFLYD HIQPVRAALM QALWKTLRNQ
860 870 880 890 900
DNAALVAFRV LGKFGGGNRK MMVEPQALSY IINDKPTISI VTYFQEYETP
910 920 930 940 950
IDFPVDEAIK SAFRALGSNS TDQFYRRQSW EVIRCFLAAF ISLDDEKHML
960 970 980 990 1000
LKLFTHVDFV ENKIMNWSTF QHKAGNETVR ETHQTALIGM LVASATKDLR
1010 1020 1030 1040 1050
DSVCPVMAAV VRHYTMVAIA QQAGPFPQKG YQATHGIDPM ILIDALASCM
1060 1070 1080 1090 1100
GHEEKELCKP GIACMGIILD TATNIMGNKD RACKLPIIQY LAEKMVSLCY
1110 1120 1130 1140 1150
DRPWYSKVGG CQAIQFLCKH MSLRALFQNL FNFLKAFMFV LMDLEGDVSN
1160 1170 1180 1190 1200
GAIEITKSYM KSMLEICLTP INECYKNIDL KDLQAKATYE VIHELVRHIT
1210 1220 1230 1240 1250
SPNTIVREES MVLLKHIGTI QSKTVSEVMD PHKDVLADII PPKKHLLRHQ
1260 1270 1280 1290 1300
PANAQIGLMD GNTFCTTLEP RLFTIDLTNT YHKLFFHELL TLSEAEDATL
1310 1320 1330 1340 1350
AKLDCYKNVP NLIPLRTSAL RALAACHYIS DIGYKEKIIN IIFKVMESDK
1360 1370 1380 1390 1400
SELQTTAFHC MKHFITGVTL EKEKVQSAMR PLLLKLGDHR NLSIPAIKRL
1410 1420 1430 1440 1450
SYFTQIFPQM FNEKLSEQIL QHCSKIMEIF VSEYKSTSPN VNFFASSKGG
1460 1470 1480 1490 1500
EYEQKIVILI EMFFYISASV KYIEKLCQLV LKTEKNLMIE ASSPYREALI
1510 1520 1530 1540 1550
KFLQRFPTET VDLFLTESLM IDPQWNRLFI YLLKHETGVS FRAVIKSSRY
1560 1570 1580 1590 1600
NNLIHYLNTH TEFPEALKYE IQHQAVLIIF TLMESDDQWI PTRQDIVDAL
1610 1620 1630 1640 1650
KNCWQNYLST LSSEDVLCDL WHLIGKILLH YFSNNTNDIE LLFQLLRALC
1660 1670 1680 1690 1700
FRFIPDVYFL RDFLQHTVAQ SFTVNWKRNA FFYFVENFNN SFLSEELKAK
1710 1720 1730 1740 1750
IITAVIIPCF AVSFDKGEGN KLIGAPPTPY QEDEKNIVSV FINKVFDPDK
1760 1770 1780 1790 1800
QYDDAVRIAL LQLACLLVER ASQHIHDGDA NNKRQGNKLR RLMTFAWPCL
1810 1820 1830 1840 1850
LSKSSVDPTA RYHGHLLLSH IIARLAIHKK IVLQVFHSLL KGHALEARSI
1860 1870 1880 1890 1900
VKQALDVLTP AMPLRMEDGN TMLTHWTKKI IVEEGHAMQQ LFHILQLIIR
1910 1920 1930 1940 1950
HYKVYFPVRH QLVQHLINYM QRLGFPPTAS IEHKKLAVDL AEVIIKWELH
1960 1970 1980 1990 2000
RIKDDRETKT DGTEEELIQE SSVKRSGIDL VETRKKSFDI IRETTVQGVG
2010 2020 2030 2040 2050
SHTKPDDILR SIDKSYCDTV LNFLIRLACQ VNDPQAPILS PGESLSRRCV
2060 2070 2080 2090 2100
MLLKMAMRPE IWPQPFDIKL NWLDKVLATV ETPHHNLNNI CTGIDFLTFL
2110 2120 2130 2140 2150
TTILSPDQLV SIIRPVQRGL SLCIIHQNTR IVRLMHMFLT RIMAIFPPDT
2160 2170 2180 2190 2200
QHKHEDLDLL YTAVSKMIAE NLTSYEKSPQ PNASSLFGTL MILKACTTNN
2210 2220 2230 2240 2250
ASYIDRILVQ FIRVLNHLTR DHINTIGGNT VISQSPDSNA LPLELLVLSL
2260 2270 2280 2290 2300
ELIKNRIFVM SVEIRKLFIG TILVSLIEKS TEVKIIKCII KMLDEWIKTK
2310 2320 2330 2340 2350
EPNVMTQVPS IREKSALLVK LMQNVEKKFT DEIELNIQFL EIINFIYRDE
2360 2370 2380 2390 2400
ILKQTELTNK LEGAFLNGLR FQNPNVRSKF FEILDSSMRR RLHDRLLYII
2410 2420 2430 2440 2450
CSQAWDTIGS HYWIKQCIEL LILTANTMMQ IQCSNEQFKI PSITSVIPVN
2460 2470 2480 2490 2500
SSETQENSFV SFLSSHSESF DIIQTVDDKD DVYDIDLNAD RKEDCQQILP
2510 2520 2530 2540 2550
NRRVTLVELV YKQAEFLEAN RNIRTDQMLV ATSQLCHIDT QLAQSVWLSM
2560 2570 2580 2590 2600
FPRIWSIFTE DQRCNITKEL IPFLSSGTNV NQKDCHPSTL NTFVESLTKC
2610 2620 2630 2640 2650
APPIYIPPNL LAYLGKSHNL WHRAILVLED MAVNQSMQSK DIDGGENQFS
2660 2670 2680 2690 2700
DLDVQQSNNI FDSLSKMYSS MHEEDLWAGL WLKFAHYPET NIAVSYEQMG
2710 2720 2730 2740 2750
FFEEAQGAYD LAMTKFKQDL SNGVVNTYVN SELLLWENHW MRCAKELNQW
2760 2770 2780 2790 2800
DILLDYAQTN KDKNMFLILE SSWRVPDWNL MKIALAKTEQ CYLKHYGFKI
2810 2820 2830 2840 2850
NLYKGYLSIL HQEERQTGNI ERYVEIASSL CIREWRRLPN IVSHIHLPYL
2860 2870 2880 2890 2900
QASQQIMELH EASQIHQGLA QSRNNSLHDM KAIVKTWRNR LPIISDDLSH
2910 2920 2930 2940 2950
WSDIFTWRQH HYQIITQHLE QQSDQGSTML GVHASAQAII SFGKIARKHN
2960 2970 2980 2990 3000
LTGVCQETLS RIYTIPSVPI VDCFQKIRQQ VKCYLQMPST SGKNEINEAL
3010 3020 3030 3040 3050
EVIESTNLKY FTGEMNAEFY ALKGLLLAQI GRSEEAGKSF SVAAQLHDGL
3060 3070 3080 3090 3100
TKAWAMWGDY MEQIFLKERK ITLAVDALIC YLQASRNQIE SKTRKYIAKV
3110 3120 3130 3140 3150
LWFLSYDNNT KILISTLEKH VAGIPPSYWL PWIPQLLCCL EQFEGDVILN
3160 3170 3180 3190 3200
LLSQIGRLYP QAVYFPIRTL YLTLKIEQRE KHKTAEQAVK SSCSNIDGTT
3210 3220 3230 3240 3250
LSFGRGASHG NIPSINPIKA TPPMWRCSKV MQLQREVHPT ILSSLEGIVD
3260 3270 3280 3290 3300
QMVWFRESWT EEVLRQLRQG LIKCYAIAFE KRDTVQHSTI TPHTLHFVKK
3310 3320 3330 3340 3350
LGSTFGIGIE NVPGSVTSSI SNSAASESLA RRAQVTFQDP VFQKMKEQFT
3360 3370 3380 3390 3400
NDFDFSKPGA MKLHNLISKL KTWIKVLETK VKKLPTSFLI EDKCRFLSNF
3410 3420 3430 3440 3450
SQKTAEVELP GELLIPLSSH YYVRIARFMP RVEIVQKNNT AARRLYIRGT
3460 3470 3480 3490 3500
NGKIYPYLVV LDSGLGDARR EERVLQLKRM LNYYLEKQKE TSRRFLNITV
3510 3520 3530 3540 3550
PRVVPISPQM RLAEDNPNSI SLLKIFKKCC QSMQVDYDMP IVKYYDRLSE
3560 3570 3580 3590 3600
VQARGTPTTH TLLREIFSEI QWTMVPKTLL KHWALKTFLA ATDFWHFRKM
3610 3620 3630 3640 3650
LTLQLALAFL CEHALNLTRL NADMMYLHQD SGLMNISYFK FDVNDDKCQL
3660 3670 3680 3690 3700
NQHRPVPFRL TPNVGEFITH FGITGPLSAA IVATARCFIQ PNYKLSSILQ
3710 3720 3730 3740 3750
TILRDEIIAL QKKGFRECKL IEGSEDRYSD GNCMEHSVNI VNSAVDIIMT
3760 3770 3780 3790
RFNKISYFDS IENKKISVLV QSATNIDNLC RMDPAWHPWL
Length: 3,790
Mass (Da): 435,330
Last modified: January 31, 2018 - v4
Checksum: 366CF25BBC7C5435
The checksum is the 64-bit Cyclic Redundancy Check value (CRC64) of the sequence, computed with the generator polynomial x^64 + x^4 + x^3 + x + 1 (algorithm described in the ISO 3309 standard). It is useful for tracking sequence updates.
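
The checksum above is described as a CRC64 over the sequence with generator polynomial x^64 + x^4 + x^3 + x + 1. The following is a minimal sketch of how such a value could be computed from the displayed sequence block. It assumes the common bit-reflected, table-driven formulation with a zero initial value and no final XOR; that choice is an assumption and has not been checked here against the checksum values listed in this entry. The helper names `sequence_from_display` and `crc64` are hypothetical.

```python
def sequence_from_display(block: str) -> str:
    """Join the 10-residue groups of the display, skipping position rulers."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Ruler lines hold only position numbers (10, 20, ...); skip them.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


# x^64 + x^4 + x^3 + x + 1 with the x^64 term implicit, bit-reflected
# (low-order terms 0x1B mirrored into the top bits).
_POLY = 0xD800000000000000


def _build_table():
    """Precompute the CRC contribution of every possible byte value."""
    table = []
    for i in range(256):
        value = i
        for _ in range(8):
            value = (value >> 1) ^ _POLY if value & 1 else value >> 1
        table.append(value)
    return table


_TABLE = _build_table()


def crc64(sequence: str) -> str:
    """Return the CRC64 of an ASCII sequence as 16 uppercase hex digits."""
    crc = 0  # assumed initial value
    for byte in sequence.encode("ascii"):
        crc = (crc >> 8) ^ _TABLE[(crc ^ byte) & 0xFF]
    return f"{crc:016X}"


if __name__ == "__main__":
    display = """        10         20
MSVIENVPVN TFRNYLNILN"""
    fragment = sequence_from_display(display)
    print(fragment, crc64(fragment))
```

To use it on this entry, pass the whole displayed block through `sequence_from_display`, confirm the result has 3,790 residues, and compare the `crc64` output with the listed checksum before relying on the sketch.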
Isoform F (Imported) (identifier: Q8I8U7-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1999-2048: VGSHTKPDDILRSIDKSYCDTVLNFLIRLACQVNDPQAPILSPGESLSRR → K

Length: 3,741
Mass (Da): 429,936
Checksum: F8E04B8E5873D5D8
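
The isoform F description above replaces canonical residues 1999-2048 (50 residues) with a single K, which is consistent with the stated lengths: 3,790 - 50 + 1 = 3,741. A small sketch of that substitution follows, assuming 1-based inclusive positions as used throughout the entry; `apply_replacement` and `isoform_length` are hypothetical helper names, not part of any UniProt tool.

```python
def apply_replacement(canonical: str, start: int, end: int, replacement: str) -> str:
    """Replace residues start..end (1-based, inclusive) with `replacement`."""
    if not (1 <= start <= end <= len(canonical)):
        raise ValueError("positions fall outside the canonical sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return canonical[:start - 1] + replacement + canonical[end:]


def isoform_length(canonical_length: int, start: int, end: int, replacement: str) -> int:
    """Length of the isoform after the replacement, without building it."""
    return canonical_length - (end - start + 1) + len(replacement)


if __name__ == "__main__":
    # Toy sequence: replace positions 3..6 ("CDEF") with "K".
    print(apply_replacement("ABCDEFGHIJ", 3, 6, "K"))
    # Figures taken from this entry: canonical length 3790, variant 1999-2048 -> K.
    print(isoform_length(3790, 1999, 2048, "K"))
```

Applied to the full canonical sequence with `start=1999`, `end=2048` and `replacement="K"`, the first function would yield the isoform F sequence.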

Sequence caution

The sequence AAM11122 differs from that shown. Reason: Erroneous initiation. Truncated N-terminus. (Curated)
The sequence AAN52145 differs from that shown. Contaminating sequence. Insertion of several transposable element sequences. (Curated)

Experimental Info

Feature key        Position(s)  Description                           Evidence  Length
Sequence conflict  468          A → T in AAM11122 (PubMed:12537569).  Curated   1

Alternative sequence

Feature key           Identifier  Position(s)  Description                    Length
Alternative sequence  VSP_059312  1999 – 2048  VGSHT…SLSRR → K in isoform F.  50

Sequence databases

EMBL nucleotide sequence database; GenBank nucleotide sequence database; DNA Data Bank of Japan, a nucleotide sequence database

EMBL / GenBank / DDBJ
AY142217 mRNA Translation: AAN52145.1 Sequence problems.
AE013599 Genomic DNA Translation: ABI31023.2
AE013599 Genomic DNA Translation: ABV53702.2
AY094769 mRNA Translation: AAM11122.1 Different initiation.
DQ352451 mRNA Translation: ABD22987.1

NCBI Reference Sequences

RefSeq
NP_001097192.2, NM_001103722.3 [Q8I8U7-1]
NP_001303335.1, NM_001316406.1 [Q8I8U7-2]

Genome annotation databases

Ensembl metazoan genome annotation project

EnsemblMetazoa
FBtr0303293; FBpp0292385; FBgn0053554 [Q8I8U7-1]
FBtr0347556; FBpp0312589; FBgn0053554 [Q8I8U7-2]

Database of genes from NCBI RefSeq genomes

GeneID
35483

KEGG: Kyoto Encyclopedia of Genes and Genomes

KEGG
dme:Dmel_CG33554

UCSC genome browser

UCSC
CG33554-RA, d. melanogaster
CG33554-RD, d. melanogaster

Keywords - Coding sequence diversity

Alternative splicing

Similar proteins

Cross-references

Organism-specific databases

Comparative Toxicogenomics Database

CTD
35483

Miscellaneous databases

BioGRID ORCS database of CRISPR phenotype screens

BioGRID-ORCS
35483, 1 hit in 5 CRISPR screens

ChiTaRS: a database of human, mouse and fruit fly chimeric transcripts and RNA-sequencing data

ChiTaRS
Nipped-A, fly

Database of phenotypes from RNA interference screens in Drosophila and Homo sapiens

GenomeRNAi
35483

Protein Ontology

PRO
PR:Q8I8U7

Entry information

Entry name: TRA1_DROME
Accession: Primary (citable) accession number: Q8I8U7
Secondary accession number(s): A0A140SQB4, A8DY44, Q2EZ47, Q8T3L7, Q9V9E9
Entry history: Integrated into UniProtKB/Swiss-Prot: November 28, 2003
Last sequence update: January 31, 2018
Last modified: October 7, 2020
This is version 154 of the entry and version 4 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Miscellaneous

Keywords - Technical term

Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. SIMILARITY comments
    Index of protein domains and families
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health
