Q49AR2 (CE022_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 65.

Names and origin

Protein names: Recommended name: UPF0489 protein C5orf22
Gene names: Name: C5orf22
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo

Protein attributes

Sequence length: 442 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Sequence similarities

Belongs to the UPF0489 family.

Ontologies

Keywords
   Coding sequence diversity: Alternative splicing; Polymorphism
   Technical term: Complete proteome; Reference proteome
Gene Ontology (GO)
   None.

Alternative products

This entry describes 3 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q49AR2-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q49AR2-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-265: Missing.
Note: No experimental confirmation available.
Isoform 3 (identifier: Q49AR2-3)

The sequence of this isoform differs from the canonical sequence as follows:
     1-279: Missing.
     280-353: QFKKPGTNLT...KRMEVPDYEM → MEVPDYEMFP...PVPKLLPHFQ
Note: No experimental confirmation available.
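The isoform differences above can be checked against the isoform lengths given in the Sequences section (177 AA for isoform 2, 163 AA for isoform 3). The following is a minimal sketch of that arithmetic, not part of the UniProt entry itself. The function name `isoform_length` and the tuple layout are illustrative assumptions; the replacement segment is the full 74-residue sequence listed for VSP_028184 in the feature table of this entry.

```python
# Length of the canonical isoform (Q49AR2-1), as stated in this entry.
CANONICAL_LENGTH = 442

# Replacement for canonical positions 280-353 in isoform 3 (VSP_028184),
# copied from the feature table of this entry with the spacing removed.
ISOFORM_3_REPLACEMENT = (
    "MEVPDYEMFPASSSSPSETT"
    "SAWISLSMSLSAFWSKPFNK"
    "SLGSSKLSNISLPSSEPSKL"
    "FQPLPVPKLLPHFQ"
)


def isoform_length(canonical_length, edits):
    """Return the isoform length after applying (start, end, replacement) edits.

    Positions are 1-based and inclusive, as in the entry.
    An empty replacement string stands for "Missing".
    (Hypothetical helper written for this illustration.)
    """
    length = canonical_length
    for start, end, replacement in edits:
        removed = end - start + 1
        length += len(replacement) - removed
    return length


# Isoform 2: residues 1-265 are missing.
isoform_2 = isoform_length(CANONICAL_LENGTH, [(1, 265, "")])

# Isoform 3: residues 1-279 are missing and 280-353 are replaced.
isoform_3 = isoform_length(
    CANONICAL_LENGTH,
    [(1, 279, ""), (280, 353, ISOFORM_3_REPLACEMENT)],
)

print(isoform_2, isoform_3)  # 177 163
```

Because the replaced region (280-353) and its replacement are both 74 residues long, the length of isoform 3 is determined entirely by the 279 missing N-terminal residues.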

Sequence annotation (Features)

Feature key | Position(s) | Length | Description | Feature identifier

Molecule processing

Chain | 1–442 | 442 | UPF0489 protein C5orf22 | PRO_0000305002

Natural variations

Alternative sequence | 1–279 | 279 | Missing in isoform 3. | VSP_028182
Alternative sequence | 1–265 | 265 | Missing in isoform 2. | VSP_028183
Alternative sequence | 280–353 | 74 | QFKKP…PDYEM → MEVPDYEMFPASSSSPSETT SAWISLSMSLSAFWSKPFNK SLGSSKLSNISLPSSEPSKL FQPLPVPKLLPHFQ in isoform 3. | VSP_028184
Natural variant | 235 | 1 | T → P. Corresponds to variant rs17410000. | VAR_035149
Natural variant | 405 | 1 | D → E. Corresponds to variant rs16901277. | VAR_035150
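The two natural variants are single-residue substitutions in the canonical sequence: T → P at position 235 and D → E at position 405. As a rough illustration of how such a substitution maps onto the sequence, the sketch below applies each one to the 10-residue group that contains it (residues 231-240 and 401-410 of isoform 1, copied from the Sequences section). The helper `apply_variant` and its parameters are assumptions made for this example and are not part of any UniProt tool.

```python
def apply_variant(fragment, fragment_start, position, ref, alt):
    """Substitute one residue in a sequence fragment.

    fragment_start is the 1-based canonical position of fragment[0];
    position is the 1-based canonical position of the variant.
    (Hypothetical helper written for this illustration.)
    """
    index = position - fragment_start
    if not 0 <= index < len(fragment):
        raise ValueError("position lies outside the fragment")
    if fragment[index] != ref:
        raise ValueError(
            f"expected {ref} at position {position}, found {fragment[index]}"
        )
    return fragment[:index] + alt + fragment[index + 1:]


# Residues 231-240 of the canonical sequence, variant T235P (VAR_035149).
print(apply_variant("TAASTGEILE", 231, 235, "T", "P"))  # TAASPGEILE

# Residues 401-410 of the canonical sequence, variant D405E (VAR_035150).
print(apply_variant("SSLDDYCPSD", 401, 405, "D", "E"))  # SSLDEYCPSD
```

Checking the reference residue before substituting guards against off-by-one errors when converting the entry's 1-based positions to 0-based string indices.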

Experimental info

Sequence conflict | 199 | 1 | S → P in BAA92060 (Ref. 1)

Sequences

Isoform 1
Last modified October 2, 2007. Version 2.
Checksum: A79A014E358C7367
Length: 442 AA. Mass: 49,967 Da.

        10         20         30         40         50         60 
MSDSAGGRAG LRRYPKLPVW VVEDHQEVLP FIYRAIGSKH LPASNVSFLH FDSHPDLLIP 

        70         80         90        100        110        120 
VNMPADTVFD KETLFGELSI ENWIMPAVYA GHFSHVIWFH PTWAQQIREG RHHFLVGKDT 

       130        140        150        160        170        180 
STTTIRVTST DHYFLSDGLY VPEDQLENQK PLQLDVIMVK PYKLCNNQEE NDAVSSAKKP 

       190        200        210        220        230        240 
KLALEDSENT ASTNCDSSSE GLEKDTATQR SDQTCLEPSC SCSSENQECQ TAASTGEILE 

       250        260        270        280        290        300 
ILKKGKAFVL DIDLDFFSVK NPFKEMFTQE EYKILQELYQ FKKPGTNLTE EDLVDIVDTR 

       310        320        330        340        350        360 
IHQLEDLEAT FADLCDGDDE ETVQRWASNP GMESLVPLVQ SLKKRMEVPD YEMVHQAGLT 

       370        380        390        400        410        420 
CDYSELPHHI STEQEIECLI QSVHYLLKNL PNPTLVTIAR SSLDDYCPSD QVDTIQEKVL 

       430        440 
NMLRALYGNL DLQVYAAESP PS 
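The sequence above is displayed in groups of ten residues with a position ruler over each row. A minimal sketch of turning that display back into a plain sequence string follows; the function name `parse_sequence_block` is an assumption for this example, and only the first two rows (residues 1-120) are included to keep it short.

```python
def parse_sequence_block(text):
    """Join a ruler-annotated sequence display into one residue string.

    Lines made up only of numbers are treated as position rulers and skipped;
    all other whitespace-separated tokens are treated as residue groups.
    (Hypothetical helper written for this illustration.)
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue  # blank line between rows
        if all(token.isdigit() for token in tokens):
            continue  # ruler line such as "10 20 30 ..."
        residues.extend(tokens)
    return "".join(residues)


# First two display rows of isoform 1 (residues 1-120), copied from above.
block = """
        10         20         30         40         50         60
MSDSAGGRAG LRRYPKLPVW VVEDHQEVLP FIYRAIGSKH LPASNVSFLH FDSHPDLLIP

        70         80         90        100        110        120
VNMPADTVFD KETLFGELSI ENWIMPAVYA GHFSHVIWFH PTWAQQIREG RHHFLVGKDT
"""

sequence = parse_sequence_block(block)
print(len(sequence))  # 120
```

Applied to all eight rows of the display, the same function should yield a string whose length can be compared with the stated sequence length of 442 AA.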

Isoform 2
Checksum: 7703ABFA6BE14AA0
Length: 177 AA. Mass: 20,249 Da.

Isoform 3
Checksum: EC2BAF4545DD8097
Length: 163 AA. Mass: 18,068 Da.

References

[1] "Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Tissue: Placenta.
[2] "The full-ORF clone resource of the German cDNA consortium."
Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A., Wiemann S., Schupp I.
BMC Genomics 8:399-399(2007)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
Tissue: Lymph node.
[3] Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., Venter J.C.
Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[4] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
Tissue: Testis and Uterus.
[5] "A quantitative atlas of mitotic phosphorylation."
Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
Tissue: Cervix carcinoma.
[6] "Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach."
Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., Mohammed S.
Anal. Chem. 81:4493-4501(2009)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].

Cross-references

Sequence databases

EMBL/GenBank/DDBJ:
   AK002055 mRNA. Translation: BAA92060.1.
   AL834427 mRNA. Translation: CAD39088.2.
   CH471118 Genomic DNA. Translation: EAX10771.1.
   CH471118 Genomic DNA. Translation: EAX10772.1.
   BC021215 mRNA. Translation: AAH21215.1.
   BC032845 mRNA. Translation: AAH32845.1.
RefSeq:
   NP_060826.2. NM_018356.2.
   XP_005248376.1. XM_005248319.1.
UniGene: Hs.519246.

3D structure databases

ProteinModelPortal: Q49AR2.

Protein-protein interaction databases

BioGrid: 120603. 4 interactions.
IntAct: Q49AR2. 2 interactions.
MINT: MINT-1485452.
STRING: 9606.ENSP00000326879.

Proteomic databases

PaxDb: Q49AR2.
PeptideAtlas: Q49AR2.
PRIDE: Q49AR2.

Genome annotation databases

Ensembl:
   ENST00000325366; ENSP00000326879; ENSG00000082213. [Q49AR2-1]
   ENST00000355907; ENSP00000348171; ENSG00000082213. [Q49AR2-3]
GeneID: 55322.
KEGG: hsa:55322.
UCSC:
   uc003jhj.4. human. [Q49AR2-1]
   uc003jhk.4. human. [Q49AR2-2]
   uc010iuj.3. human. [Q49AR2-3]

Organism-specific databases

CTD: 55322.
GeneCards: GC05P031532.
H-InvDB: HIX0004781.
HGNC: HGNC:25639. C5orf22.
HPA: HPA043062.
neXtProt: NX_Q49AR2.
PharmGKB: PA144596503.

Phylogenomic databases

eggNOG: NOG71438.
HOGENOM: HOG000008429.
HOVERGEN: HBG057060.
InParanoid: Q49AR2.
OMA: PHHVSTE.
OrthoDB: EOG7VTDQ0.
PhylomeDB: Q49AR2.
TreeFam: TF324160.

Gene expression databases

ArrayExpress: Q49AR2.
Bgee: Q49AR2.
CleanEx: HS_C5orf22.
Genevestigator: Q49AR2.

Family and domain databases

InterPro: IPR024131. UPF0489.
PANTHER: PTHR13225. PTHR13225. 1 hit.
Pfam: PF12640. UPF0489. 1 hit.

Other

GenomeRNAi: 55322.
NextBio: 59582.
PMAP-CutDB: Q49AR2.

Entry information

Entry name: CE022_HUMAN
Accession:
   Primary (citable) accession number: Q49AR2
   Secondary accession number(s): Q8ND28, Q8WU61, Q9NUR1
Entry history:
   Integrated into UniProtKB/Swiss-Prot: October 2, 2007
   Last sequence update: October 2, 2007
   Last modified: April 16, 2014
   This is version 65 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Uncharacterized protein families (UPF)

List of uncharacterized protein family (UPF) entries

SIMILARITY comments

Index of protein domains and families

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human chromosome 5

Human chromosome 5: entries, gene names and cross-references to MIM