Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q96HA4 (CA159_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified April 3, 2013. Version 77. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (1) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Uncharacterized protein C1orf159
Gene names
Name:C1orf159
ORF Names:UNQ2998/PRO9739
OrganismHomo sapiens (Human) [Reference proteome]
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length380 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at transcript level

General annotation (Comments)

Subcellular location

Membrane; Single-pass membrane protein Potential.

Ontologies

Keywords
   Cellular componentMembrane
   Coding sequence diversityAlternative splicing
   DomainSignal
Transmembrane
Transmembrane helix
   PTMGlycoprotein
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Cellular_componentintegral to membrane

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Alternative products

This entry describes 5 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q96HA4-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: Gene prediction based on EST data.
Isoform 2 (identifier: Q96HA4-2)

The sequence of this isoform differs from the canonical sequence as follows:
     25-60: Missing.
     185-221: APALQPGEAAAMIPPPQSSGNSSCRIPLWGFPSLGQS → GPAPAGSLPGRWSSQQFGPQAPALQPGEAVSNPHHPG
     222-380: Missing.
Note: No experimental confirmation available.
Isoform 3 (identifier: Q96HA4-3)

The sequence of this isoform differs from the canonical sequence as follows:
     25-60: Missing.
     204-225: GNSSCRIPLWGFPSLGQSQGAL → DVGSAGKEDPPRQGRPPIPAPP
     226-380: Missing.
Note: No experimental confirmation available.
Isoform 4 (identifier: Q96HA4-4)

The sequence of this isoform differs from the canonical sequence as follows:
     25-60: Missing.
     204-349: Missing.
Note: No experimental confirmation available.
Isoform 5 (identifier: Q96HA4-5)

The sequence of this isoform differs from the canonical sequence as follows:
     1-126: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1818 Potential
Chain19 – 380362Uncharacterized protein C1orf159
PRO_0000255250

Regions

Transmembrane148 – 16821Helical; Potential
Compositional bias229 – 29163Pro-rich

Amino acid modifications

Glycosylation1041N-linked (GlcNAc...) Potential
Glycosylation1111N-linked (GlcNAc...) Potential
Glycosylation1281N-linked (GlcNAc...) Potential

Natural variations

Alternative sequence1 – 126126Missing in isoform 5.
VSP_021280
Alternative sequence25 – 6036Missing in isoform 2, isoform 3 and isoform 4.
VSP_021281
Alternative sequence185 – 22137APALQ…SLGQS → GPAPAGSLPGRWSSQQFGPQ APALQPGEAVSNPHHPG in isoform 2.
VSP_021282
Alternative sequence204 – 349146Missing in isoform 4.
VSP_021283
Alternative sequence204 – 22522GNSSC…SQGAL → DVGSAGKEDPPRQGRPPIPA PP in isoform 3.
VSP_021284
Alternative sequence222 – 380159Missing in isoform 2.
VSP_021285
Alternative sequence226 – 380155Missing in isoform 3.
VSP_021286

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified October 31, 2006. Version 2.
Checksum: D82F85CF3D903B81

FASTA38040,283
        10         20         30         40         50         60 
MALRHLALLA GLLVGVASKS MENTVTRNST AVINTQAEGT LSPPGLSSLP VVREWALTHT 

        70         80         90        100        110        120 
AQLPECCVDV VGVNASCPGA SLCGPGCYRR WNADGSASCV RCGNGTLPAY NGSECRSFAG 

       130        140        150        160        170        180 
PGAPFPMNRS SGTPGRPHPG APRVAASLFL GTFFISSGLI LSVAGFFYLK RSSKLPRACY 

       190        200        210        220        230        240 
RRNKAPALQP GEAAAMIPPP QSSGNSSCRI PLWGFPSLGQ SQGALWVCPQ TGLPGSGSRP 

       250        260        270        280        290        300 
PLPGSPGDPP TRQGQGRIWL VPPALDLSWI WPAPPARPPL IPVTSMLFPV PETWGLQERR 

       310        320        330        340        350        360 
THHDRADPQY LLLLEVQLHP RTDAAGLRQA LLSSHRFSGA GSGGPKSQPV RKPRYVRRER 

       370        380 
PLDRATDPAA FPGEARISNV 

« Hide

Isoform 2 [UniParc].

Checksum: 2488F40B8F1239C3
Show »

FASTA18519,097
Isoform 3 [UniParc].

Checksum: B442CB2A09470157
Show »

FASTA18919,464
Isoform 4 [UniParc].

Checksum: 734467FBB65598A9
Show »

FASTA19820,820
Isoform 5 [UniParc].

Checksum: 76175CE950FF64C1
Show »

FASTA25427,411

References

[1]"The secreted protein discovery initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: a bioinformatics assessment."
Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D.T., Brush J., Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P., Eaton D., Foster J.S., Grimaldi C., Gu Q., Hass P.E. expand/collapse author list , Heldens S., Huang A., Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J., Lewis L., Liao D., Mark M.R., Robbie E., Sanchez C., Schoenfeld J., Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A., Vandlen R.L., Watanabe C., Wieand D., Woods K., Xie M.-H., Yansura D.G., Yi S., Yu G., Yuan J., Zhang M., Zhang Z., Goddard A.D., Wood W.I., Godowski P.J., Gray A.M.
Genome Res. 13:2265-2270(2003) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
[2]"Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. expand/collapse author list , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1; 5 AND 4).
Tissue: Carcinoma, Testis and Thymus.
[3]"The DNA sequence and biological annotation of human chromosome 1."
Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A., Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C., Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K. expand/collapse author list , Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C., Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W., Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J., Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J., Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y., Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J., Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H., Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L., Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J., Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S., Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K., Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R., Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M., Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S., Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J., Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W., McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N., Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V., Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J., Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E., Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S., Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M., White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H., Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E., Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G., Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.
Nature 441:315-321(2006) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[4]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Tissue: Uterus.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AY358490 mRNA. Translation: AAQ88854.1.
AK000591 mRNA. Translation: BAA91276.1.
AK057368 mRNA. Translation: BAG51908.1.
AK128434 mRNA. Translation: BAC87438.1.
AL390719 Genomic DNA. Translation: CAI14316.1.
AL390719 Genomic DNA. Translation: CAI14318.1.
BC008788 mRNA. Translation: AAH08788.1.
IPIIPI00016627.
IPI00062955.
IPI00514936.
IPI00515131.
IPI00883596.
RefSeqNP_060361.4. NM_017891.4.
UniGeneHs.235095.

3D structure databases

ProteinModelPortalQ96HA4.
ModBaseSearch...

Protein-protein interaction databases

STRING9606.ENSP00000368623.

PTM databases

PhosphoSiteQ96HA4.

Polymorphism databases

DMDM119371554.

Proteomic databases

PaxDbQ96HA4.
PRIDEQ96HA4.

Protocols and materials databases

DNASU54991.
StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000379319; ENSP00000368623; ENSG00000131591.
ENST00000379325; ENSP00000368629; ENSG00000131591.
ENST00000379339; ENSP00000368644; ENSG00000131591.
ENST00000421241; ENSP00000400736; ENSG00000131591.
ENST00000437760; ENSP00000399027; ENSG00000131591.
ENST00000448924; ENSP00000392290; ENSG00000131591.
GeneID54991.
KEGGhsa:54991.
UCSCuc001act.2. human.
uc001acu.2. human.

Organism-specific databases

CTD54991.
GeneCardsGC01M001018.
H-InvDBHIX0000013.
HGNCHGNC:26062. C1orf159.
HPAHPA010019.
neXtProtNX_Q96HA4.
PharmGKBPA142672410.
GenAtlasSearch...

Phylogenomic databases

eggNOGNOG39970.
HOGENOMHOG000231900.
HOVERGENHBG058261.
InParanoidQ96HA4.
OMAPGCYRHW.
OrthoDBEOG46DM4C.

Gene expression databases

ArrayExpressQ96HA4.
BgeeQ96HA4.
CleanExHS_C1orf159.
GenevestigatorQ96HA4.
GermOnlineENSG00000131591. Homo sapiens.

Family and domain databases

ProtoNetSearch...

Other

ChiTaRSC1orf159. human.
GenomeRNAi54991.
NextBio58290.

Entry information

Entry nameCA159_HUMAN
AccessionPrimary (citable) accession number: Q96HA4
Secondary accession number(s): B3KQ46 expand/collapse secondary AC list , Q5T2W6, Q6UX67, Q6ZR77, Q9NWV0
Entry history
Integrated into UniProtKB/Swiss-Prot: October 31, 2006
Last sequence update: October 31, 2006
Last modified: April 3, 2013
This is version 77 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 1

Human chromosome 1: entries, gene names and cross-references to MIM