Q96HA4 (CA159_HUMAN) Reviewed, UniProtKB/Swiss-Prot
Last modified
April 3, 2013.
Version 77.
History...
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Uncharacterized protein C1orf159 | ||||
| Gene names |
| ||||
| Organism | Homo sapiens (Human) [Reference proteome] | ||||
| Taxonomic identifier | 9606 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo![]() |
Protein attributes
| Sequence length | 380 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at transcript level |
General annotation (Comments)
| Subcellular location | Membrane; Single-pass membrane protein Potential. |
Ontologies
| Keywords | |
|---|---|
| Cellular component | Membrane |
| Coding sequence diversity | Alternative splicing |
| Domain | Signal Transmembrane Transmembrane helix |
| PTM | Glycoprotein |
| Technical term | Complete proteome Reference proteome |
| Gene Ontology (GO) | |
| Cellular_component | integral to membrane Inferred from electronic annotation. Source: UniProtKB-KW |
| Complete GO annotation... | |
Alternative products
| This entry describes 5 isoforms produced by alternative splicing. [Align] [Select] | ||||||
| Isoform 1 (identifier: Q96HA4-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Note: Gene prediction based on EST data. | ||||||
| Isoform 2 (identifier: Q96HA4-2) The sequence of this isoform differs from the canonical sequence as follows: 25-60: Missing. 185-221: APALQPGEAAAMIPPPQSSGNSSCRIPLWGFPSLGQS → GPAPAGSLPGRWSSQQFGPQAPALQPGEAVSNPHHPG 222-380: Missing. | ||||||
| Note: No experimental confirmation available. | ||||||
| Isoform 3 (identifier: Q96HA4-3) The sequence of this isoform differs from the canonical sequence as follows: 25-60: Missing. 204-225: GNSSCRIPLWGFPSLGQSQGAL → DVGSAGKEDPPRQGRPPIPAPP 226-380: Missing. | ||||||
| Note: No experimental confirmation available. | ||||||
| Isoform 4 (identifier: Q96HA4-4) The sequence of this isoform differs from the canonical sequence as follows: 25-60: Missing. 204-349: Missing. | ||||||
| Note: No experimental confirmation available. | ||||||
| Isoform 5 (identifier: Q96HA4-5) The sequence of this isoform differs from the canonical sequence as follows: 1-126: Missing. | ||||||
| Note: No experimental confirmation available. |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||
Molecule processing | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 18 | 18 | Potential | ||||||
| Chain | 19 – 380 | 362 | Uncharacterized protein C1orf159 | PRO_0000255250 | |||||
Regions | |||||||||
| Transmembrane | 148 – 168 | 21 | Helical; Potential | ||||||
| Compositional bias | 229 – 291 | 63 | Pro-rich | ||||||
Amino acid modifications | |||||||||
| Glycosylation | 104 | 1 | N-linked (GlcNAc...) Potential | ||||||
| Glycosylation | 111 | 1 | N-linked (GlcNAc...) Potential | ||||||
| Glycosylation | 128 | 1 | N-linked (GlcNAc...) Potential | ||||||
Natural variations | |||||||||
| Alternative sequence | 1 – 126 | 126 | Missing in isoform 5. | VSP_021280 | |||||
| Alternative sequence | 25 – 60 | 36 | Missing in isoform 2, isoform 3 and isoform 4. | VSP_021281 | |||||
| Alternative sequence | 185 – 221 | 37 | APALQ…SLGQS → GPAPAGSLPGRWSSQQFGPQ APALQPGEAVSNPHHPG in isoform 2. | VSP_021282 | |||||
| Alternative sequence | 204 – 349 | 146 | Missing in isoform 4. | VSP_021283 | |||||
| Alternative sequence | 204 – 225 | 22 | GNSSC…SQGAL → DVGSAGKEDPPRQGRPPIPA PP in isoform 3. | VSP_021284 | |||||
| Alternative sequence | 222 – 380 | 159 | Missing in isoform 2. | VSP_021285 | |||||
| Alternative sequence | 226 – 380 | 155 | Missing in isoform 3. | VSP_021286 | |||||
Sequences
| ||||||||||||||||||||||||||||||||||||||||||
References
| [1] | "The secreted protein discovery initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: a bioinformatics assessment." Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D.T., Brush J., Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P., Eaton D., Foster J.S., Grimaldi C., Gu Q., Hass P.E. Gray A.M.Genome Res. 13:2265-2270(2003) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3). |
| [2] | "Complete sequencing and characterization of 21,243 full-length human cDNAs." Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. Sugano S.Nat. Genet. 36:40-45(2004) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1; 5 AND 4). Tissue: Carcinoma, Testis and Thymus. |
| [3] | "The DNA sequence and biological annotation of human chromosome 1." Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A., Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C., Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K. Bentley D.R.Nature 441:315-321(2006) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. |
| [4] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). Tissue: Uterus. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | AY358490 mRNA. Translation: AAQ88854.1. AK000591 mRNA. Translation: BAA91276.1. AK057368 mRNA. Translation: BAG51908.1. AK128434 mRNA. Translation: BAC87438.1. AL390719 Genomic DNA. Translation: CAI14316.1. AL390719 Genomic DNA. Translation: CAI14318.1. BC008788 mRNA. Translation: AAH08788.1. |
| IPI | IPI00016627. IPI00062955. IPI00514936. IPI00515131. IPI00883596. |
| RefSeq | NP_060361.4. NM_017891.4. |
| UniGene | Hs.235095. |
3D structure databases | |
| ProteinModelPortal | Q96HA4. |
| ModBase | Search... |
Protein-protein interaction databases | |
| STRING | 9606.ENSP00000368623. |
PTM databases | |
| PhosphoSite | Q96HA4. |
Polymorphism databases | |
| DMDM | 119371554. |
Proteomic databases | |
| PaxDb | Q96HA4. |
| PRIDE | Q96HA4. |
Protocols and materials databases | |
| DNASU | 54991. |
| StructuralBiologyKnowledgebase | Search... |
Genome annotation databases | |
| Ensembl | ENST00000379319; ENSP00000368623; ENSG00000131591. ENST00000379325; ENSP00000368629; ENSG00000131591. ENST00000379339; ENSP00000368644; ENSG00000131591. ENST00000421241; ENSP00000400736; ENSG00000131591. ENST00000437760; ENSP00000399027; ENSG00000131591. ENST00000448924; ENSP00000392290; ENSG00000131591. |
| GeneID | 54991. |
| KEGG | hsa:54991. |
| UCSC | uc001act.2. human. uc001acu.2. human. |
Organism-specific databases | |
| CTD | 54991. |
| GeneCards | GC01M001018. |
| H-InvDB | HIX0000013. |
| HGNC | HGNC:26062. C1orf159. |
| HPA | HPA010019. |
| neXtProt | NX_Q96HA4. |
| PharmGKB | PA142672410. |
| GenAtlas | Search... |
Phylogenomic databases | |
| eggNOG | NOG39970. |
| HOGENOM | HOG000231900. |
| HOVERGEN | HBG058261. |
| InParanoid | Q96HA4. |
| OMA | PGCYRHW. |
| OrthoDB | EOG46DM4C. |
Gene expression databases | |
| ArrayExpress | Q96HA4. |
| Bgee | Q96HA4. |
| CleanEx | HS_C1orf159. |
| Genevestigator | Q96HA4. |
| GermOnline | ENSG00000131591. Homo sapiens. |
Family and domain databases | |
| ProtoNet | Search... |
Other | |
| ChiTaRS | C1orf159. human. |
| GenomeRNAi | 54991. |
| NextBio | 58290. |
Entry information
| Entry name | CA159_HUMAN | ||||||||
| Accession | Primary (citable) accession number: Q96HA4 Secondary accession number(s): B3KQ46 Q9NWV0 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Human chromosome 1 Human chromosome 1: entries, gene names and cross-references to MIM |

Clusters with
