Skip Header

Contribute Send feedback
Read comments (?) or add your own

O60308 (CE104_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 82. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (4) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Centrosomal protein of 104 kDa
Gene names
Name:CEP104
Synonyms:KIAA0562
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length925 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Subcellular location

Cytoplasmcytoskeletoncentrosomecentriole Ref.5.

Sequence similarities

Contains 2 HEAT repeats.

Sequence caution

The sequence AAH01640.1 differs from that shown. Reason: Contaminating sequence. Potential poly-A sequence.

The sequence BAA25488.2 differs from that shown. Reason: Erroneous initiation.

Ontologies

Keywords
   Cellular componentCytoplasm
Cytoskeleton
   Coding sequence diversityAlternative splicing
Polymorphism
   DomainCoiled coil
Repeat
   PTMPhosphoprotein
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Cellular componentcentriole

Inferred from direct assay Ref.5. Source: UniProtKB

   Molecular functionbinding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Alternative products

This entry describes 3 isoforms produced by alternative splicing. [Align] [Select]

Note: Additional isoforms may exist.
Isoform 1 (identifier: O60308-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: O60308-2)

The sequence of this isoform differs from the canonical sequence as follows:
     189-244: RKSDYISPLD...KLKQAIADLQ → SSVRTGGEST...EGTPFQRCLV
     245-925: Missing.
Note: No experimental confirmation available.
Isoform 3 (identifier: O60308-3)

The sequence of this isoform differs from the canonical sequence as follows:
     554-554: E → V
     555-925: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 925925Centrosomal protein of 104 kDa
PRO_0000050763

Regions

Repeat529 – 56739HEAT 1
Repeat604 – 64037HEAT 2
Coiled coil209 – 28981 Potential
Coiled coil677 – 72549 Potential

Amino acid modifications

Modified residue3231Phosphoserine By similarity

Natural variations

Alternative sequence189 – 24456RKSDY…IADLQ → SSVRTGGESTFGELKGPAVP SSVTLSVLGTSLGQWFPCHL PAVDDNEGTPFQRCLV in isoform 2.
VSP_014364
Alternative sequence245 – 925681Missing in isoform 2.
VSP_014365
Alternative sequence5541E → V in isoform 3.
VSP_014366
Alternative sequence555 – 925371Missing in isoform 3.
VSP_014367
Natural variant4141L → I.
Corresponds to variant rs2275824 [ dbSNP | Ensembl ].
VAR_034036
Natural variant6861A → V.
Corresponds to variant rs2275831 [ dbSNP | Ensembl ].
VAR_020042

Experimental info

Sequence conflict511V → L in AAH01640. Ref.4
Sequence conflict2661Y → F in AAH47450. Ref.4
Sequence conflict3831P → S in AAH47450. Ref.4

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified August 1, 1998. Version 1.
Checksum: 6B2BBD5068136887

FASTA925104,448
        10         20         30         40         50         60 
MPHKIGFVVV SSSGHEDGFS ARELMIHAPT VSGWRSPRFC QFPQEIVLQM VERCRIRKLQ 

        70         80         90        100        110        120 
LLAHQYMISS KIEFYISESL PEYFAPYQAE RFRRLGYVSL CDNEKTGCKA RELKSVYVDA 

       130        140        150        160        170        180 
VGQFLKLIFH QNHVNKYNIY NQVALVAINI IGDPADFSDE SNTASREKLI DHYLGHNSED 

       190        200        210        220        230        240 
PALEGTYARK SDYISPLDDL AFDMYQDPEV AQIIRKLDER KREAVQKERY DYAKKLKQAI 

       250        260        270        280        290        300 
ADLQKVGERL GRYEVEKRCA VEKEDYDLAK EKKQQMEQYR AEVYEQLELH SLLDAELMRR 

       310        320        330        340        350        360 
PFDLPLQPLA RSGSPCHQKP MPSLPQLEER GTENQFAEPF LQEKPSSYSL TISPQHSAVD 

       370        380        390        400        410        420 
PLLPATDPHP KINAESLPYD ERPLPAIRKH YGEAVVEPEM SNADISDARR GGMLGEPEPL 

       430        440        450        460        470        480 
TEKALREASS AIDVLGETLV AEAYCKTWSY REDALLALSK KLMEMPVGTP KEDLKNTLRA 

       490        500        510        520        530        540 
SVFLVRRAIK DIVTSVFQAS LKLLKMIITQ YIPKHKLSKL ETAHCVERTI PVLLTRTGDS 

       550        560        570        580        590        600 
SARLRVTAAN FIQEMALFKE VKSLQIIPSY LVQPLKANSS VHLAMSQMGL LARLLKDLGT 

       610        620        630        640        650        660 
GSSGFTIDNV MKFSVSALEH RVYEVRETAV RIILDMYRQH QASILEYLPP DDSNTRRNIL 

       670        680        690        700        710        720 
YKTIFEGFAK IDGRATDAEM RARRKAATEE AEKQKKEEIK ALQGQLAALK EIQAEVQEKE 

       730        740        750        760        770        780 
SDAVKPKNQD IQGGKAAPAE ALGIPDEHYL DNLCIFCGER SESFTEEGLD LHYWKHCLML 

       790        800        810        820        830        840 
TRCDHCKQVV EISSLTEHLL TECDKKDGFG KCYRCSEAVF KEELPRHIKH KDCNPAKPEK 

       850        860        870        880        890        900 
LANRCPLCHE NFSPGEEAWK AHLMGPAGCT MNLRKTHILQ KAPALQPGKS SAVAASGPLG 

       910        920 
SKAGSKIPTP KGGLSKSSSR TYAKR 

« Hide

Isoform 2 [UniParc].

Checksum: 91F314638FBB292C
Show »

FASTA24427,350
Isoform 3 [UniParc].

Checksum: 45407773F3D60E53
Show »

FASTA55463,187

References

« Hide 'large scale' references
[1]"Prediction of the coding sequences of unidentified human genes. IX. The complete sequences of 100 new cDNA clones from brain which can code for large proteins in vitro."
Nagase T., Ishikawa K., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
DNA Res. 5:31-39(1998) [PubMed: 9628581] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Tissue: Brain.
[2]Nagase T., Ishikawa K., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
Submitted (JAN-2004) to the EMBL/GenBank/DDBJ databases
Cited for: SEQUENCE REVISION TO N-TERMINUS.
[3]"The DNA sequence and biological annotation of human chromosome 1."
Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A., Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C., Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K. expand/collapse author list , Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C., Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W., Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J., Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J., Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y., Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J., Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H., Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L., Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J., Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S., Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K., Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R., Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M., Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S., Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J., Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W., McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N., Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V., Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J., Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E., Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S., Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M., White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H., Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E., Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G., Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.
Nature 441:315-321(2006) [PubMed: 16710414] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[4]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1; 2 AND 3).
Tissue: Kidney, Skin and Testis.
[5]"Novel asymmetrically localizing components of human centrosomes identified by complementary proteomics methods."
Jakobsen L., Vanselow K., Skogs M., Toyoda Y., Lundberg E., Poser I., Falkenby L.G., Bennetzen M., Westendorf J., Nigg E.A., Uhlen M., Hyman A.A., Andersen J.S.
EMBO J. 30:1520-1535(2011) [PubMed: 21399614] [Abstract]
Cited for: IDENTIFICATION BY MASS SPECTROMETRY, SUBCELLULAR LOCATION.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AB011134 mRNA. Translation: BAA25488.2. Different initiation.
AL691523 Genomic DNA. Translation: CAI17368.1.
AL691523, AL365330 Genomic DNA. Translation: CAI17369.1.
AL365330, AL691523 Genomic DNA. Translation: CAI42527.1.
AL365330 Genomic DNA. Translation: CAI42529.1.
BC001640 mRNA. Translation: AAH01640.1. Sequence problems.
BC047450 mRNA. Translation: AAH47450.1.
BC050721 mRNA. Translation: AAH50721.1.
IPIIPI00006014.
IPI00384358.
IPI00607758.
PIRT00334.
RefSeqNP_055519.1. NM_014704.3.
UniGeneHs.133089.
Hs.509017.

3D structure databases

ProteinModelPortalO60308.
SMRO60308. Positions 767-834.
ModBaseSearch...

Protein-protein interaction databases

IntActO60308. 1 interaction.

PTM databases

PhosphoSiteO60308.

Proteomic databases

PRIDEO60308.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000378230; ENSP00000367476; ENSG00000116198.
GeneID9731.
KEGGhsa:9731.
UCSCuc001aky.1. human.
uc001akz.2. human.

Organism-specific databases

CTD9731.
GeneCardsGC01M003728.
HGNCHGNC:24866. CEP104.
HPAHPA010126.
neXtProtNX_O60308.
PharmGKBPA144596418.
HUGESearch...
GenAtlasSearch...

Phylogenomic databases

eggNOGprNOG14085.
GeneTreeENSGT00390000013405.
HOVERGENHBG080375.
InParanoidO60308.
OMAYEQLELH.
OrthoDBEOG4TXBRF.
PhylomeDBO60308.

Gene expression databases

ArrayExpressO60308.
BgeeO60308.
CleanExHS_KIAA0562.
GenevestigatorO60308.
GermOnlineENSG00000116198. Homo sapiens.

Family and domain databases

InterProIPR011989. ARM-like.
IPR016024. ARM-type_fold.
IPR008979. Galactose-bd-like.
[Graphical view]
Gene3DG3DSA:1.25.10.10. ARM-like. 1 hit.
SUPFAMSSF48371. ARM-type_fold. 1 hit.
SSF49785. Gal_bind_like. 1 hit.
PROSITEPS50077. HEAT_REPEAT. False negative.
[Graphical view]
ProtoNetSearch...

Other

NextBio36608.

Entry information

Entry nameCE104_HUMAN
AccessionPrimary (citable) accession number: O60308
Secondary accession number(s): Q5JSQ3 expand/collapse secondary AC list , Q5SR24, Q5SR25, Q6PKF5, Q86W32, Q86X14
Entry history
Integrated into UniProtKB/Swiss-Prot: January 23, 2002
Last sequence update: August 1, 1998
Last modified: January 25, 2012
This is version 82 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 1

Human chromosome 1: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

SIMILARITY comments

Index of protein domains and families