Protein

CCR4-NOT transcription complex subunit 1

Gene

cnot1

Organism
Danio rerio (Zebrafish) (Brachydanio rerio)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at transcript level

Function

Scaffolding component of the CCR4-NOT complex which is one of the major cellular mRNA deadenylases and is linked to various cellular processes including bulk mRNA degradation, miRNA-mediated repression, translational repression during translational initiation and general transcription regulation. Additional complex functions may be a consequence of its influence on mRNA expression. Its scaffolding function implies its interaction with the catalytic complex module and diverse RNA-binding proteins mediating the complex recruitment to selected mRNA 3'UTRs. Acts as a transcriptional repressor. Represses the ligand-dependent transcriptional activation by nuclear receptors (By similarity).

Keywords - Molecular function

Repressor

Keywords - Biological process

RNA-mediated gene silencing, Transcription, Transcription regulation, Translation regulation

Names & Taxonomy

Protein names
Recommended name:
CCR4-NOT transcription complex subunit 1
Alternative name(s):
CCR4-associated factor 1
Gene names
Name: cnot1
ORF Names: zgc:152902
Organism: Danio rerio (Zebrafish) (Brachydanio rerio)
Taxonomic identifier: 7955 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Actinopterygii > Neopterygii > Teleostei > Ostariophysi > Cypriniformes > Cyprinidae > Danio
Proteomes
  • UP000000437 Component: Unplaced

Organism-specific databases

ZFIN: ZDB-GENE-040915-1. cnot1.

Subcellular location

Keywords - Cellular component

Cytoplasm, Nucleus

PTM / Processing

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Chain | 1 – 2374 | 2374 | CCR4-NOT transcription complex subunit 1 | PRO_0000315543

Proteomic databases

PaxDb: A1A5H6.
PRIDE: A1A5H6.

Interaction

Subunit structure

Component of the CCR4-NOT complex (By similarity).

Protein-protein interaction databases

STRING: 7955.ENSDARP00000004298.

Structure

3D structure databases

ProteinModelPortal: A1A5H6.

Family & Domains

Region

Feature key | Position(s) | Length | Description
Region | 1082 – 1604 | 523 | Interaction with CCR4-NOT complex catalytic subunits (By similarity)

Motif

Feature key | Position(s) | Length | Description
Motif | 153 – 157 | 5 | LXXLL
Motif | 181 – 185 | 5 | LXXLL
Motif | 223 – 227 | 5 | LXXLL
Motif | 570 – 574 | 5 | LXXLL
Motif | 1638 – 1642 | 5 | LXXLL
Motif | 1940 – 1944 | 5 | LXXLL
Motif | 2094 – 2098 | 5 | LXXLL

Compositional bias

Feature key | Position(s) | Length | Description
Compositional bias | 1029 – 1077 | 49 | Thr-rich

Domain

Contains Leu-Xaa-Xaa-Leu-Leu (LXXLL) motifs, which are known to be important for association with nuclear receptors (By similarity).
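
The LXXLL pattern can be checked against the sequence with a short script. The following is a minimal sketch, not the procedure used to curate this entry: the function names `find_lxxll` and `is_lxxll` are hypothetical, and the fragment is residues 151 to 200 copied from the sequence listed in this entry. Because the scan reports overlapping candidates, it can return more hits than the annotated motif features, so a pattern match alone should not be read as an annotated motif.

```python
import re

# Residues 151-200 of the sequence shown in this entry (fragment only).
FRAGMENT = "QKLPDLLRSYVDADLGGNQEGGFQDIAIEVLHLLLSHLLFGQKGSSGVGQ"
FRAGMENT_START = 151  # 1-based position of the first residue in FRAGMENT

# A lookahead is used so that overlapping candidates are also reported.
LXXLL = re.compile(r"(?=(L..LL))")


def find_lxxll(seq, start=1):
    """Return 1-based (first, last) positions of every L-x-x-L-L match."""
    return [(m.start() + start, m.start() + start + 4) for m in LXXLL.finditer(seq)]


def is_lxxll(seq, first, start=1):
    """Check whether the five residues beginning at 1-based `first` fit L-x-x-L-L."""
    idx = first - start
    return re.fullmatch(r"L..LL", seq[idx:idx + 5]) is not None


hits = find_lxxll(FRAGMENT, FRAGMENT_START)
print(hits)
```

On this fragment the scan reports the two annotated motifs (153 – 157 and 181 – 185) plus an overlapping candidate at 185 – 189 that is not listed as a feature in this entry.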

Sequence similarities

Belongs to the CNOT1 family (Curated).

Phylogenomic databases

eggNOG: KOG1831. Eukaryota.
COG5103. LUCA.
HOGENOM: HOG000265103.
HOVERGEN: HBG060834.
InParanoid: A1A5H6.
KO: K12604.
PhylomeDB: A1A5H6.

Family and domain databases

InterPro: IPR007196. CCR4-Not_Not1_C.
IPR024557. CCR4-Not_Not1su_DUF3819.
IPR032191. CNOT1_CAF1_bind.
IPR032194. CNOT1_HEAT.
IPR032193. CNOT1_TTP_bind.
Pfam: PF16415. CNOT1_CAF1_bind. 1 hit.
PF16418. CNOT1_HEAT. 1 hit.
PF16417. CNOT1_TTP_bind. 1 hit.
PF12842. DUF3819. 1 hit.
PF04054. Not1. 1 hit.

Sequence

Sequence status: Complete.

A1A5H6-1

        10         20         30         40         50
MNLDSLSLAL SQISYLVDNL TKKNYRASQQ EIQHIVNRHG PEADRHLLRC
60 70 80 90 100
LFSHVDFSGD GKSSGKDFHQ TQFLIQECVS LITKPNFIST LCYAIDNPLH
110 120 130 140 150
YQKSLKPSPH LFTQLSKVLK LSKVQEVILG LALSNSSNAD LRGFAAQFVK
160 170 180 190 200
QKLPDLLRSY VDADLGGNQE GGFQDIAIEV LHLLLSHLLF GQKGSSGVGQ
210 220 230 240 250
EQIDAFLKTL CRDFPQERCP VVLAPLLYPD KRDILMDRIL PDSGDLNKTM
260 270 280 290 300
MESSLADFMQ EVGYGFCASL EECRNIILQY GVREVTASQV ARVLGMMART
310 320 330 340 350
HSGLSDGISL QTITNPVGGG GIWSDGKDKS DSSQAWNVEV LIDVVKEVNP
360 370 380 390 400
NLNFKEVTYE LDHPGFLIRD SKGLQIVVYG IQRGLGMEVF PVDLIYRPWK
410 420 430 440 450
HAEGQLSFIQ HSLLSPEVFC FADNPCHTVA IDTLKAPPED DNREIATWKS
460 470 480 490 500
LDLVESLLRL SEVGHYEQVK QLFSFPIKHC PDMLVLALLQ ISTSWHTLRH
510 520 530 540 550
ELISTLMPIF LGNHPNSAII LHYAWHGQGQ SPSIRQLIMH SMAEWYMRGE
560 570 580 590 600
QYDQAKLSRI LDVAQDLKSL SMLLNGTPFA FVIDLAALAS RREYLKLDKW
610 620 630 640 650
LTDKIREHGE PFIQACVTFL KRRCPSIMGG LAPEKDQPKS AQLPPETLAT
660 670 680 690 700
MLACLQSCAG SVSQELSETI LTMVANCSNV MNKARQPPPG VLPKGRAPST
710 720 730 740 750
SSLDAISPVQ MDPLSAMGSL SLGVSSTSHT PSMQGFPSLQ GSAFSNPQSP
760 770 780 790 800
AKAFSNLPNP NPSTAFPGIN PLSSQLQGPL STSLSGIGSG LGMPTVSSDV
810 820 830 840 850
FSARKMSTPG LNPPTFQQTD LSQVWPEANQ HFSKEIDDEA NSYFQRIYNH
860 870 880 890 900
PPHPTMSVDE VLEMLQRFKD SNIKREREVF NCMLRNLFEE YRFFPQYPDK
910 920 930 940 950
ELHITACLFG GIIEKGLVTY MALGLALRYV LEALRKPFGS KMYYFGIAAL
960 970 980 990 1000
DRFKNRLKDY PQYCQHLASI AHFLQFPHHL QEYIEYGQQS RDPPVKMQGS
1010 1020 1030 1040 1050
ITTPGSLALA QAQAQSQPPK APQPGQASTL VTTATTTTTA AKTTTITRPT
1060 1070 1080 1090 1100
AVGPKKDVPP SINTTNIDTL LVATDQTERI VEPPENVQEK IAFIFNNLSQ
1110 1120 1130 1140 1150
SNMSQKVEEL KETVKEEFMP WVSQYLVMKR VSIEPNFHSL YSNFLDTLKN
1160 1170 1180 1190 1200
PEFVKMVLNE TYRNIKVLLT SDKAAANFSD RSLLKNLGHW LGMITLAKNK
1210 1220 1230 1240 1250
PILYTDLELK SLLLEAYVKG QQELLYVVPF VAKVLESSLR SVIFRPQNPW
1260 1270 1280 1290 1300
TMGIMNVLAE LHQEHDLKLN LKFEIEVLCK NLSMDITDLK PGNLLRDKDK
1310 1320 1330 1340 1350
LKTLEEQLSA PKKETKPPEE LLPIVTTDSV PFTAAPSTPA TTTACTATGP
1360 1370 1380 1390 1400
PTPQFSYHDI NVYALAGLAP HININVNIPL LQAHPQLKQC VRPAIERAVQ
1410 1420 1430 1440 1450
ELVHPVVDRS IKIAMTTCEQ IVRKDFALDS EESHMRVAAH HMMRNLTAGM
1460 1470 1480 1490 1500
AMITCREPLL MSIATNLKNS FAAALRAPTP QQREMMEEAA ARIAQDNCEL
1510 1520 1530 1540 1550
ACCFIQKTAV EKAGPEMDKR LATEFELRKH ARQEGRRYCD PMVLTYQAER
1560 1570 1580 1590 1600
MPEQIRLKVG GVDPKQLAVY EEFARNVPGF LPSNDLSQPT GFLAQPMKQQ
1610 1620 1630 1640 1650
AWPTDDVAHI YEKCISDLEQ HLHAIPPALA MNPQTQAIRS LLEAVVMARN
1660 1670 1680 1690 1700
SRDGIAALGL LQKAVEGLLD ATSGADPELL LSYRECHLLV LKALQDGRAY
1710 1720 1730 1740 1750
GPQWCNKQIT RCLIECRDEY KYNVEAVELL IRNHLVNMQQ YDLHLAQSME
1760 1770 1780 1790 1800
NGLNYMAVAF AMQLVKLLLV DERSVSHITE ADLFHTIETL MRTSAHSRAN
1810 1820 1830 1840 1850
APEGLPQLMD VVRSNYEAMI DRHHGGPNFM MHSGISQASE YDDPPGLREK
1860 1870 1880 1890 1900
AEYLLREWVN LYHSAAAGRD STKAFSAFVG QMHQQGILKT DDLITRFFRL
1910 1920 1930 1940 1950
CTEMCVEISY RAQAEQQHPT TSPAIIRAKC YHNLDAFVRL IALLVKHSGE
1960 1970 1980 1990 2000
ATNTVTKINL LNKVLGIVVG VLIQDHDVRQ TEFQQLPYHR IFIMLLLELN
2010 2020 2030 2040 2050
APEHVLETIN FQTLTAFCNT FHILRPTKAP GFVYAWLELI SHRIFIARML
2060 2070 2080 2090 2100
AHTPQQKGWP MYAQLLIDLF KYLAPFLRNV ELNKPMQILY KGTLRVLLVL
2110 2120 2130 2140 2150
LHDFPEFLCD YHYGFCDVIP PNCIQLRNLI LSAFPRNMRL PDPFTPNLKV
2160 2170 2180 2190 2200
DMLSEINIAP RILTNFTGVM PSQFKKDLDS YLKTRSPVTF LSELRSNLQV
2210 2220 2230 2240 2250
SNEPGNRYNI QLINALVPYV GTQAIAHIHN KGSTPSMSTI THSAHMDIFQ
2260 2270 2280 2290 2300
NLAVDLDTEG RYLFLNAIAN QLRYPNSHTH YFSCTMLYLF AEANAEAIQE
2310 2320 2330 2340 2350
QITRVLLERL IVNRPHPWGL LITFIELIKN PAFKFWSHDF VHCAPEIEKL
2360 2370
FQSVAQCCMG QKQAQQVMEG TGAS
Length: 2,374
Mass (Da): 266,604
Last modified: January 23, 2007 - v1
Checksum: CAE9127310F44516
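
The listing above interleaves position rulers with groups of ten residues, so it has to be flattened before it can be used as a plain sequence string. The sketch below shows one way to do that, under the assumption that any line consisting only of numbers is a ruler; the helper name `flatten_sequence` is hypothetical, and only the first 100 residues of the entry are included as sample input.

```python
def flatten_sequence(block):
    """Join residue groups from a numbered sequence listing into one string.

    Lines made up only of numbers are treated as position rulers and skipped;
    every other non-blank line is assumed to hold residue letters.
    """
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue  # blank line or ruler such as "60 70 80 90 100"
        residues.extend(tokens)
    return "".join(residues)


# First 100 residues as listed in this entry.
listing = """\
        10         20         30         40         50
MNLDSLSLAL SQISYLVDNL TKKNYRASQQ EIQHIVNRHG PEADRHLLRC
60 70 80 90 100
LFSHVDFSGD GKSSGKDFHQ TQFLIQECVS LITKPNFIST LCYAIDNPLH
"""

sequence = flatten_sequence(listing)
print(len(sequence))  # 100
```

Run on the full listing, the length of the result can be compared with the Length field as a basic consistency check.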

Experimental Info

Feature key | Position(s) | Length | Description
Sequence conflict | 424 – 424 | 1 | N → Y in AAT68167 (PubMed:15256591). (Curated)
Sequence conflict | 433 – 433 | 1 | T → I in AAT68167 (PubMed:15256591). (Curated)
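
To see what the conflicting sequence looks like at these positions, the two substitutions can be applied to the canonical residues. This is a minimal sketch: the helper `apply_substitutions` is hypothetical, and the fragment is residues 421 to 440 copied from the sequence in this entry. The helper refuses to apply a substitution if the expected residue is not found at the stated position, which guards against off-by-one errors in the coordinates.

```python
def apply_substitutions(seq, substitutions, start=1):
    """Apply (position, expected, replacement) substitutions to a sequence.

    Positions are 1-based protein coordinates; `start` is the coordinate of
    the first residue in `seq`, so fragments can be used directly.
    """
    residues = list(seq)
    for pos, expected, replacement in substitutions:
        idx = pos - start
        if residues[idx] != expected:
            raise ValueError(f"expected {expected} at {pos}, found {residues[idx]}")
        residues[idx] = replacement
    return "".join(residues)


# Residues 421-440 of the sequence shown in this entry (fragment only).
FRAGMENT = "FADNPCHTVAIDTLKAPPED"
CONFLICTS = [(424, "N", "Y"), (433, "T", "I")]  # as listed for AAT68167

variant = apply_substitutions(FRAGMENT, CONFLICTS, start=421)
print(variant)  # FADYPCHTVAIDILKAPPED
```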

Sequence databases

EMBL/GenBank/DDBJ: BC128655 mRNA. Translation: AAI28656.1.
AY648849 mRNA. Translation: AAT68167.1.
RefSeq: NP_001073420.1. NM_001079951.2.
UniGene: Dr.75497.

Genome annotation databases

GeneID: 448949.
KEGG: dre:448949.

Cross-references

Organism-specific databases

CTD: 23019.
ZFIN: ZDB-GENE-040915-1. cnot1.

Miscellaneous databases

PRO: A1A5H6.

Publications

  1. NIH - Zebrafish Gene Collection (ZGC) project
    Submitted (DEC-2006) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Strain: AB.
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-1595.

Entry information

Entry name: CNOT1_DANRE
Accession: Primary (citable) accession number: A1A5H6
Secondary accession number(s): Q6DRB0
Entry history
Integrated into UniProtKB/Swiss-Prot: January 15, 2008
Last sequence update: January 23, 2007
Last modified: June 8, 2016
This is version 63 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
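
The UniRef90 and UniRef50 criteria described above amount to two thresholds applied together. The sketch below only restates those criteria as a predicate; it is not the clustering procedure itself, and it assumes that identity and overlap with the seed sequence have already been computed by an alignment tool and are given as fractions between 0 and 1. The names `THRESHOLDS` and `meets_threshold` are hypothetical.

```python
# (minimum identity, minimum overlap) with the seed sequence, as stated in the text.
THRESHOLDS = {
    "UniRef90": (0.90, 0.80),
    "UniRef50": (0.50, 0.80),
}


def meets_threshold(level, identity, overlap):
    """Return True if a sequence satisfies both criteria for the given level."""
    min_identity, min_overlap = THRESHOLDS[level]
    return identity >= min_identity and overlap >= min_overlap


print(meets_threshold("UniRef90", identity=0.93, overlap=0.85))  # True
print(meets_threshold("UniRef90", identity=0.93, overlap=0.70))  # False
```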