Skip Header

Contribute Send feedback
Read comments (?) or add your own

A5YKK6 (CNOT1_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 46. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (5) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Interactions·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
CCR4-NOT transcription complex subunit 1
Alternative name(s):
CCR4-associated factor 1
Negative regulator of transcription subunit 1 homolog
Short name=NOT1H
Short name=hNOT1
Gene names
Name:CNOT1
Synonyms:CDC39, KIAA1007, NOT1
ORF Names:AD-005
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length2376 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Belongs to the CCR4-NOT complex that functions as general transcription regulation complex. Acts as a transcriptional repressor. Represses the ligand-dependent transcriptional activation by nuclear receptors. Ref.7 Ref.8

Subunit structure

Subunit of the CCR4-NOT core complex that contains CNOT1, CNOT2, CNOT3, CNOT4 and CNOT8. Interacts in a ligand-dependent fashion with ESR1 and RXRA. Interacts with NANOS2 By similarity. Ref.7 Ref.8

Subcellular location

CytoplasmP-body By similarity. Note: NANOS2 promotes its localization to P-body By similarity.

Tissue specificity

Strongly expressed in brain, heart, thymus, spleen, kidney, liver, placenta and lung. Weakly expressed in skeletal muscle and colon. Ref.7

Sequence similarities

Belongs to the CNOT1 family.

Sequence caution

The sequence AAF14861.1 differs from that shown. Reason: Frameshift at positions 1969 and 2273.

The sequence CAH18093.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

Binary interactions

With

Entry

#Exp.

IntAct

Notes

TOB1P506163EBI-1222758,EBI-723281

Alternative products

This entry describes 4 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: A5YKK6-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: A5YKK6-2)

The sequence of this isoform differs from the canonical sequence as follows:
     822-827: SKMKPS → T
Isoform 3 (identifier: A5YKK6-3)

The sequence of this isoform differs from the canonical sequence as follows:
     822-827: SKMKPS → T
     2152-2155: VDML → SLTL
     2156-2376: Missing.
Isoform 4 (identifier: A5YKK6-4)

The sequence of this isoform differs from the canonical sequence as follows:
     1479-1551: TASPQQREMM...DPVVLTYQAE → VSWLFPWYRY...FFLLYSQHDV
     1552-2376: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 23762376CCR4-NOT transcription complex subunit 1
PRO_0000315541

Regions

Compositional bias1333 – 135018Thr-rich

Amino acid modifications

Modified residue19331Phosphotyrosine Ref.9

Natural variations

Alternative sequence822 – 8276SKMKPS → T in isoform 2 and isoform 3.
VSP_030559
Alternative sequence1479 – 155173TASPQ…TYQAE → VSWLFPWYRYKTYYCLSVII FFFVYIWHWALPLILNNHHI CLMSSIILDCNSVRQSIMSV CFFFFLLYSQHDV in isoform 4.
VSP_030560
Alternative sequence1552 – 2376825Missing in isoform 4.
VSP_030561
Alternative sequence2152 – 21554VDML → SLTL in isoform 3.
VSP_030562
Alternative sequence2156 – 2376221Missing in isoform 3.
VSP_030563
Natural variant6031D → A. Ref.3
Corresponds to variant rs17854028 [ dbSNP | Ensembl ].
VAR_038254

Experimental info

Sequence conflict181D → G in CAD97851. Ref.1
Sequence conflict3231D → G in CAD97851. Ref.1
Sequence conflict9221E → G in ABQ66268. Ref.1
Sequence conflict10631K → R in CAH18093. Ref.1
Sequence conflict14171M → V in CAD97851. Ref.1
Sequence conflict17991R → G in CAD97851. Ref.1
Sequence conflict19361L → P in CAH18093. Ref.1
Sequence conflict2198 – 22003SNL → RTV in AAF14861. Ref.6
Sequence conflict23581Q → R in ABQ66268. Ref.1

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified January 15, 2008. Version 2.
Checksum: 937A0899537BA615

FASTA2,376266,939
        10         20         30         40         50         60 
MNLDSLSLAL SQISYLVDNL TKKNYRASQQ EIQHIVNRHG PEADRHLLRC LFSHVDFSGD 

        70         80         90        100        110        120 
GKSSGKDFHQ TQFLIQECAL LITKPNFIST LSYAIDNPLH YQKSLKPAPH LFAQLSKVLK 

       130        140        150        160        170        180 
LSKVQEVIFG LALLNSSSSD LRGFAAQFIK QKLPDLLRSY IDADVSGNQE GGFQDIAIEV 

       190        200        210        220        230        240 
LHLLLSHLLF GQKGAFGVGQ EQIDAFLKTL RRDFPQERCP VVLAPLLYPE KRDILMDRIL 

       250        260        270        280        290        300 
PDSGGVAKTM MESSLADFMQ EVGYGFCASI EECRNIIVQF GVREVTAAQV ARVLGMMART 

       310        320        330        340        350        360 
HSGLTDGIPL QSISAPGSGI WSDGKDKSDG AQAHTWNVEV LIDVLKELNP SLNFKEVTYE 

       370        380        390        400        410        420 
LDHPGFQIRD SKGLHNVVYG IQRGLGMEVF PVDLIYRPWK HAEGQLSFIQ HSLINPEIFC 

       430        440        450        460        470        480 
FADYPCHTVA TDILKAPPED DNREIATWKS LDLIESLLRL AEVGQYEQVK QLFSFPIKHC 

       490        500        510        520        530        540 
PDMLVLALLQ INTSWHTLRH ELISTLMPIF LGNHPNSAII LHYAWHGQGQ SPSIRQLIMH 

       550        560        570        580        590        600 
AMAEWYMRGE QYDQAKLSRI LDVAQDLKAL SMLLNGTPFA FVIDLAALAS RREYLKLDKW 

       610        620        630        640        650        660 
LTDKIREHGE PFIQACMTFL KRRCPSILGG LAPEKDQPKS AQLPPETLAT MLACLQACAG 

       670        680        690        700        710        720 
SVSQELSETI LTMVANCSNV MNKARQPPPG VMPKGRPPSA SSLDAISPVQ IDPLAGMTSL 

       730        740        750        760        770        780 
SIGGSAAPHT QSMQGFPPNL GSAFSTPQSP AKAFPPLSTP NQTTAFSGIG GLSSQLPVGG 

       790        800        810        820        830        840 
LGTGSLTGIG TGALGLPAVN NDPFVQRKLG TSGLNQPTFQ QSKMKPSDLS QVWPEANQHF 

       850        860        870        880        890        900 
SKEIDDEANS YFQRIYNHPP HPTMSVDEVL EMLQRFKDST IKREREVFNC MLRNLFEEYR 

       910        920        930        940        950        960 
FFPQYPDKEL HITACLFGGI IEKGLVTYMA LGLALRYVLE ALRKPFGSKM YYFGIAALDR 

       970        980        990       1000       1010       1020 
FKNRLKDYPQ YCQHLASISH FMQFPHHLQE YIEYGQQSRD PPVKMQGSIT TPGSIALAQA 

      1030       1040       1050       1060       1070       1080 
QAQAQVPAKA PLAGQVSTMV TTSTTTTVAK TVTVTRPTGV SFKKDVPPSI NTTNIDTLLV 

      1090       1100       1110       1120       1130       1140 
ATDQTERIVE PPENIQEKIA FIFNNLSQSN MTQKVEELKE TVKEEFMPWV SQYLVMKRVS 

      1150       1160       1170       1180       1190       1200 
IEPNFHSLYS NFLDTLKNPE FNKMVLNETY RNIKVLLTSD KAAANFSDRS LLKNLGHWLG 

      1210       1220       1230       1240       1250       1260 
MITLAKNKPI LHTDLDVKSL LLEAYVKGQQ ELLYVVPFVA KVLESSIRSV VFRPPNPWTM 

      1270       1280       1290       1300       1310       1320 
AIMNVLAELH QEHDLKLNLK FEIEVLCKNL ALDINELKPG NLLKDKDRLK NLDEQLSAPK 

      1330       1340       1350       1360       1370       1380 
KDVKQPEELP PITTTTTSTT PATNTTCTAT VPPQPQYSYH DINVYSLAGL APHITLNPTI 

      1390       1400       1410       1420       1430       1440 
PLFQAHPQLK QCVRQAIERA VQELVHPVVD RSIKIAMTTC EQIVRKDFAL DSEESRMRIA 

      1450       1460       1470       1480       1490       1500 
AHHMMRNLTA GMAMITCREP LLMSISTNLK NSFASALRTA SPQQREMMDQ AAAQLAQDNC 

      1510       1520       1530       1540       1550       1560 
ELACCFIQKT AVEKAGPEMD KRLATEFELR KHARQEGRRY CDPVVLTYQA ERMPEQIRLK 

      1570       1580       1590       1600       1610       1620 
VGGVDPKQLA VYEEFARNVP GFLPTNDLSQ PTGFLAQPMK QAWATDDVAQ IYDKCITELE 

      1630       1640       1650       1660       1670       1680 
QHLHAIPPTL AMNPQAQALR SLLEVVVLSR NSRDAIAALG LLQKAVEGLL DATSGADADL 

      1690       1700       1710       1720       1730       1740 
LLRYRECHLL VLKALQDGRA YGSPWCNKQI TRCLIECRDE YKYNVEAVEL LIRNHLVNMQ 

      1750       1760       1770       1780       1790       1800 
QYDLHLAQSM ENGLNYMAVA FAMQLVKILL VDERSVAHVT EADLFHTIET LMRINAHSRG 

      1810       1820       1830       1840       1850       1860 
NAPEGLPQLM EVVRSNYEAM IDRAHGGPNF MMHSGISQAS EYDDPPGLRE KAEYLLREWV 

      1870       1880       1890       1900       1910       1920 
NLYHSAAAGR DSTKAFSAFV GQMHQQGILK TDDLITRFFR LCTEMCVEIS YRAQAEQQHN 

      1930       1940       1950       1960       1970       1980 
PAANPTMIRA KCYHNLDAFV RLIALLVKHS GEATNTVTKI NLLNKVLGIV VGVLLQDHDV 

      1990       2000       2010       2020       2030       2040 
RQSEFQQLPY HRIFIMLLLE LNAPEHVLET INFQTLTAFC NTFHILRPTK APGFVYAWLE 

      2050       2060       2070       2080       2090       2100 
LISHRIFIAR MLAHTPQQKG WPMYAQLLID LFKYLAPFLR NVELTKPMQI LYKGTLRVLL 

      2110       2120       2130       2140       2150       2160 
VLLHDFPEFL CDYHYGFCDV IPPNCIQLRN LILSAFPRNM RLPDPFTPNL KVDMLSEINI 

      2170       2180       2190       2200       2210       2220 
APRILTNFTG VMPPQFKKDL DSYLKTRSPV TFLSDLRSNL QVSNEPGNRY NLQLINALVL 

      2230       2240       2250       2260       2270       2280 
YVGTQAIAHI HNKGSTPSMS TITHSAHMDI FQNLAVDLDT EGRYLFLNAI ANQLRYPNSH 

      2290       2300       2310       2320       2330       2340 
THYFSCTMLY LFAEANTEAI QEQITRVLLE RLIVNRPHPW GLLITFIELI KNPAFKFWNH 

      2350       2360       2370 
EFVHCAPEIE KLFQSVAQCC MGQKQAQQVM EGTGAS 

« Hide

Isoform 2 [UniParc].

Checksum: 23D129C7F9CE775B
Show »

FASTA2,371266,381
Isoform 3 [UniParc].

Checksum: BBB731593591523C
Show »

FASTA2,150241,260
Isoform 4 [UniParc].

Checksum: 00329322012ABAD2
Show »

FASTA1,551173,792

References

« Hide 'large scale' references
[1]"The full-ORF clone resource of the German cDNA consortium."
Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A., Wiemann S., Schupp I.
BMC Genomics 8:399-399(2007) [PubMed: 17974005] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3), NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 244-2376 (ISOFORM 1).
Tissue: Cervix, Fetal kidney and Testis.
[2]"The sequence and analysis of duplication-rich human chromosome 16."
Martin J., Han C., Gordon L.A., Terry A., Prabhakar S., She X., Xie G., Hellsten U., Chan Y.M., Altherr M., Couronne O., Aerts A., Bajorek E., Black S., Blumer H., Branscomb E., Brown N.C., Bruno W.J. expand/collapse author list , Buckingham J.M., Callen D.F., Campbell C.S., Campbell M.L., Campbell E.W., Caoile C., Challacombe J.F., Chasteen L.A., Chertkov O., Chi H.C., Christensen M., Clark L.M., Cohn J.D., Denys M., Detter J.C., Dickson M., Dimitrijevic-Bussod M., Escobar J., Fawcett J.J., Flowers D., Fotopulos D., Glavina T., Gomez M., Gonzales E., Goodstein D., Goodwin L.A., Grady D.L., Grigoriev I., Groza M., Hammon N., Hawkins T., Haydu L., Hildebrand C.E., Huang W., Israni S., Jett J., Jewett P.B., Kadner K., Kimball H., Kobayashi A., Krawczyk M.-C., Leyba T., Longmire J.L., Lopez F., Lou Y., Lowry S., Ludeman T., Manohar C.F., Mark G.A., McMurray K.L., Meincke L.J., Morgan J., Moyzis R.K., Mundt M.O., Munk A.C., Nandkeshwar R.D., Pitluck S., Pollard M., Predki P., Parson-Quintana B., Ramirez L., Rash S., Retterer J., Ricke D.O., Robinson D.L., Rodriguez A., Salamov A., Saunders E.H., Scott D., Shough T., Stallings R.L., Stalvey M., Sutherland R.D., Tapia R., Tesmer J.G., Thayer N., Thompson L.S., Tice H., Torney D.C., Tran-Gyamfi M., Tsai M., Ulanovsky L.E., Ustaszewska A., Vo N., White P.S., Williams A.L., Wills P.L., Wu J.-R., Wu K., Yang J., DeJong P., Bruce D., Doggett N.A., Deaven L., Schmutz J., Grimwood J., Richardson P., Rokhsar D.S., Eichler E.E., Gilna P., Lucas S.M., Myers R.M., Rubin E.M., Pennacchio L.A.
Nature 432:988-994(2004) [PubMed: 15616553] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[3]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4), NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 757-2376 (ISOFORM 1), VARIANT ALA-603.
Tissue: Choriocarcinoma, Leiomyosarcoma and Testis.
[4]"Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro."
Nagase T., Ishikawa K., Suyama M., Kikuno R., Hirosawa M., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
DNA Res. 6:63-70(1999) [PubMed: 10231032] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 537-2376 (ISOFORM 2).
Tissue: Brain.
[5]"Construction of expression-ready cDNA clones for KIAA genes: manual curation of 330 KIAA cDNA clones."
Nakajima D., Okazaki N., Yamakawa H., Kikuno R., Ohara O., Nagase T.
DNA Res. 9:99-106(2002) [PubMed: 12168954] [Abstract]
Cited for: SEQUENCE REVISION.
[6]"Gene expression profiling in the human hypothalamus-pituitary-adrenal axis and full-length cDNA cloning."
Hu R.-M., Han Z.-G., Song H.-D., Peng Y.-D., Huang Q.-H., Ren S.-X., Gu Y.-J., Huang C.-H., Li Y.-B., Jiang C.-L., Fu G., Zhang Q.-H., Gu B.-W., Dai M., Mao Y.-F., Gao G.-F., Rong R., Ye M. expand/collapse author list , Zhou J., Xu S.-H., Gu J., Shi J.-X., Jin W.-R., Zhang C.-K., Wu T.-M., Huang G.-Y., Chen Z., Chen M.-D., Chen J.-L.
Proc. Natl. Acad. Sci. U.S.A. 97:9543-9548(2000) [PubMed: 10931946] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1964-2374 (ISOFORM 1).
Tissue: Adrenal gland.
[7]"Isolation and characterization of human orthologs of yeast CCR4-NOT complex subunits."
Albert T.K., Lemaire M., van Berkum N.L., Gentz R., Collart M.A., Timmers H.T.M.
Nucleic Acids Res. 28:809-817(2000) [PubMed: 10637334] [Abstract]
Cited for: FUNCTION, INTERACTION WITH CNOT2; CNOT3; CNOT4 AND CNOT8, TISSUE SPECIFICITY.
[8]"Human Ccr4-Not complex is a ligand-dependent repressor of nuclear receptor-mediated transcription."
Winkler G.S., Mulder K.W., Bardwell V.J., Kalkhoven E., Timmers H.T.
EMBO J. 25:3089-3099(2006) [PubMed: 16778766] [Abstract]
Cited for: FUNCTION, INTERACTION WITH CNOT2; CNOT3; CNOT8; ESR1 AND RXRA.
[9]"Quantitative phosphoproteome profiling of Wnt3a-mediated signaling network: indicating the involvement of ribonucleoside-diphosphate reductase M2 subunit phosphorylation at residue serine 20 in canonical Wnt signal transduction."
Tang L.-Y., Deng N., Wang L.-S., Dai J., Wang Z.-L., Jiang X.-S., Li S.-J., Li L., Sheng Q.-H., Wu D.-Q., Li L., Zeng R.
Mol. Cell. Proteomics 6:1952-1967(2007) [PubMed: 17693683] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT TYR-1933, MASS SPECTROMETRY.
Tissue: Embryonic kidney.
[10]"Initial characterization of the human central proteome."
Burkard T.R., Planyavsky M., Kaupe I., Breitwieser F.P., Buerckstuemmer T., Bennett K.L., Superti-Furga G., Colinge J.
BMC Syst. Biol. 5:17-17(2011) [PubMed: 21269460] [Abstract]
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AL117492 mRNA. Translation: CAB55960.1.
BX537840 mRNA. Translation: CAD97851.1.
CR749237 mRNA. Translation: CAH18093.1. Different initiation.
EF553522 mRNA. Translation: ABQ66268.1.
AC009118 Genomic DNA. No translation available.
AC010287 Genomic DNA. No translation available.
BC000779 mRNA. Translation: AAH00779.2.
BC024317 mRNA. Translation: AAH24317.1.
BC040523 mRNA. Translation: AAH40523.1.
AB023224 mRNA. Translation: BAA76851.2.
AF110778 mRNA. Translation: AAF14861.1. Frameshift.
IPIIPI00166010.
IPI00410157.
IPI00642900.
IPI00880052.
PIRT17270.
RefSeqNP_057368.3. NM_016284.3.
NP_996882.1. NM_206999.1.
UniGeneHs.460923.
Hs.725184.

3D structure databases

ProteinModelPortalA5YKK6.
ModBaseSearch...

Protein-protein interaction databases

DIPDIP-44954N.
IntActA5YKK6. 14 interactions.
MINTMINT-2835018.
STRINGA5YKK6.

Proteomic databases

PeptideAtlasA5YKK6.
PRIDEA5YKK6.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000317147; ENSP00000320949; ENSG00000125107.
GeneID23019.
KEGGhsa:23019.
UCSCuc002enu.2. human.
uc002env.1. human.
uc002enx.1. human.

Organism-specific databases

CTD23019.
GeneCardsGC16M058553.
HGNCHGNC:7877. CNOT1.
MIM604917. gene.
neXtProtNX_A5YKK6.
PharmGKBPA26672.
HUGESearch...
GenAtlasSearch...

Phylogenomic databases

eggNOGprNOG06926.
HOVERGENHBG060834.
InParanoidA5YKK6.
OMAEAYHKGQ.
OrthoDBEOG495ZQV.
PhylomeDBA5YKK6.

Enzyme and pathway databases

ReactomeREACT_111217. Metabolism.

Gene expression databases

BgeeA5YKK6.
GenevestigatorA5YKK6.

Family and domain databases

InterProIPR007196. CCR4-Not_Not1_C.
IPR024557. CCR4-Not_Not1su_DUF3819.
[Graphical view]
KOK12604.
PfamPF12842. DUF3819. 1 hit.
PF04054. Not1. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio43960.
SOURCESearch...

Entry information

Entry nameCNOT1_HUMAN
AccessionPrimary (citable) accession number: A5YKK6
Secondary accession number(s): Q68DX7 expand/collapse secondary AC list , Q7Z3K2, Q8IWB8, Q8TB53, Q9BVZ6, Q9UFR8, Q9UI27, Q9Y2L0
Entry history
Integrated into UniProtKB/Swiss-Prot: January 15, 2008
Last sequence update: January 15, 2008
Last modified: January 25, 2012
This is version 46 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 16

Human chromosome 16: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families