Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q6P1W5 (CA094_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 54. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Interactions·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Uncharacterized protein C1orf94
Gene names
Name:C1orf94
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length598 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

Ontologies

Keywords
   Coding sequence diversityAlternative splicing
Polymorphism
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Molecular functionprotein binding

Inferred from physical interaction. Source: IntAct

Complete GO annotation...

Binary interactions

With

Entry

#Exp.

IntAct

Notes

ATXN1P542533EBI-946029,EBI-930964

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q6P1W5-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q6P1W5-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-190: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 598598Uncharacterized protein C1orf94
PRO_0000280097

Regions

Compositional bias467 – 4704Poly-Pro

Natural variations

Alternative sequence1 – 190190Missing in isoform 2.
VSP_042032
Natural variant2351Q → E. Ref.4
Corresponds to variant rs1382602 [ dbSNP | Ensembl ].
VAR_031051
Natural variant3021D → E. Ref.4
Corresponds to variant rs1414474 [ dbSNP | Ensembl ].
VAR_031052
Natural variant4381Y → H.
Corresponds to variant rs17556981 [ dbSNP | Ensembl ].
VAR_050702

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified November 16, 2011. Version 2.
Checksum: AAD349072B925A8E

FASTA59865,353
        10         20         30         40         50         60 
MRGGGGCVLA LGGQRGFQKE RRRMASGNGL PSSSALVAKG PCALGPFPRY IWIHQDTPQD 

        70         80         90        100        110        120 
SLDKTCHEIW KRVQGLPEAS QPWTSMEQLS VPVVGTLRGN ELSFQEEALE LSSGKDEISL 

       130        140        150        160        170        180 
LVEQEFLSLT KEHSILVEES SGELEVPGSS PEGTRELAPC ILAPPLVAGS NERPRASIIV 

       190        200        210        220        230        240 
GDKLLKQKVA MPVISSRQDC DSATSTVTDI LCAAEVKSSK GTEDRGRILG DSNLQVSKLL 

       250        260        270        280        290        300 
SQFPLKSTET SKVPDNKNVL DKTRVTKDFL QDNLFSGPGP KEPTGLSPFL LLPPRPPPAR 

       310        320        330        340        350        360 
PDKLPELPAQ KRQLPVFAKI CSKPKADPAV ERHHLMEWSP GTKEPKKGQG SLFLSQWPQS 

       370        380        390        400        410        420 
QKDACGEEGC CDAVGTASLT LPPKKPTCPA EKNLLYEFLG ATKNPSGQPR LRNKVEVDGP 

       430        440        450        460        470        480 
ELKFNAPVTV ADKNNPKYTG NVFTPHFPTA MTSATLNQPL WLNLNYPPPP VFTNHSTFLQ 

       490        500        510        520        530        540 
YQGLYPQQAA RMPYQQALHP QLGCYSQQVM PYNPQQMGQQ IFRSSYTPLL SYIPFVQPNY 

       550        560        570        580        590 
PYPQRTPPKM SANPRDPPLM AGDGPQYLFP QGYGFGSTSG GPLMHSPYFS SSGNGINF 

« Hide

Isoform 2 [UniParc].

Checksum: 5B1E356339E5941C
Show »

FASTA40844,902

References

[1]"Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. expand/collapse author list , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004) [PubMed: 14702039] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Tissue: Caudate nucleus.
[2]"The DNA sequence and biological annotation of human chromosome 1."
Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A., Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C., Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K. expand/collapse author list , Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C., Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W., Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J., Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J., Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y., Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J., Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H., Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L., Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J., Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S., Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K., Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R., Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M., Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S., Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J., Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W., McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N., Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V., Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J., Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E., Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S., Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M., White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H., Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E., Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G., Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.
Nature 441:315-321(2006) [PubMed: 16710414] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[3]Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., Turner R. expand/collapse author list , Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., Venter J.C.
Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[4]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2), VARIANTS GLU-235 AND GLU-302.
Tissue: Muscle and Testis.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AK123355 mRNA. Translation: BAG53893.1.
AC115286 Genomic DNA. No translation available.
CH471059 Genomic DNA. Translation: EAX07445.1.
CH471059 Genomic DNA. Translation: EAX07446.1.
BC007637 mRNA. Translation: AAH07637.1.
BC064845 mRNA. Translation: AAH64845.1.
IPIIPI00645901.
RefSeqNP_001128206.1. NM_001134734.1.
NP_116273.2. NM_032884.3.
UniGeneHs.194610.

3D structure databases

ProteinModelPortalQ6P1W5.
ModBaseSearch...

Protein-protein interaction databases

IntActQ6P1W5. 8 interactions.
MINTMINT-2855222.
STRINGQ6P1W5.

Polymorphism databases

DMDM74758237.

Proteomic databases

PRIDEQ6P1W5.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000373374; ENSP00000362472; ENSG00000142698.
ENST00000398041; ENSP00000381121; ENSG00000251514.
GeneID84970.
KEGGhsa:84970.
UCSCuc001bxs.2. human.

Organism-specific databases

CTD84970.
GeneCardsGC01P034632.
HGNCHGNC:28250. C1orf94.
HPAHPA045805.
neXtProtNX_Q6P1W5.
PharmGKBPA142672478.
GenAtlasSearch...

Phylogenomic databases

eggNOGprNOG12523.
GeneTreeENSGT00390000017672.
HOGENOMHBG506161.
HOVERGENHBG080475.
InParanoidQ6P1W5.
OrthoDBEOG4JT054.
PhylomeDBQ6P1W5.

Gene expression databases

ArrayExpressQ6P1W5.
BgeeQ6P1W5.
CleanExHS_C1orf94.
GenevestigatorQ6P1W5.

Family and domain databases

ProtoNetSearch...

Other

NextBio75514.

Entry information

Entry nameCA094_HUMAN
AccessionPrimary (citable) accession number: Q6P1W5
Secondary accession number(s): B3KVT1 expand/collapse secondary AC list , D3DPR3, E9PJ76, Q96IC8
Entry history
Integrated into UniProtKB/Swiss-Prot: March 6, 2007
Last sequence update: November 16, 2011
Last modified: January 25, 2012
This is version 54 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 1

Human chromosome 1: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations