ID GPA33_HUMAN Reviewed; 319 AA. AC Q99795; Q5VZP6; DT 01-NOV-1997, integrated into UniProtKB/Swiss-Prot. DT 01-MAY-1997, sequence version 1. DT 27-MAR-2024, entry version 194. DE RecName: Full=Cell surface A33 antigen; DE AltName: Full=Glycoprotein A33; DE Flags: Precursor; GN Name=GPA33; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA], AND PARTIAL PROTEIN SEQUENCE. RC TISSUE=Colon carcinoma; RX PubMed=9012807; DOI=10.1073/pnas.94.2.469; RA Heath J.K., White S.J., Johnstone C.N., Catimel B., Simpson R.J., RA Moritz R.L., Tu G.-F., Ji H., Whitehead R.H., Groenen L.C., Scott A.M., RA Ritter G., Cohen L., Welt S., Old L.J., Nice E.C., Burgess A.W.; RT "The human A33 antigen is a transmembrane glycoprotein and a novel member RT of the immunoglobulin superfamily."; RL Proc. Natl. Acad. Sci. U.S.A. 94:469-474(1997). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Thymus; RX PubMed=14702039; DOI=10.1038/ng1285; RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., RA Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., RA Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., RA Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., RA Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., RA Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., RA Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., RA Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., RA Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., RA Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., RA Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., RA Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., RA Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., RA Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., RA Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., RA Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., RA Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., RA Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., RA Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., RA Isogai T., Sugano S.; RT "Complete sequencing and characterization of 21,243 full-length human RT cDNAs."; RL Nat. Genet. 36:40-45(2004). RN [3] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=16710414; DOI=10.1038/nature04727; RA Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A., RA Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C., RA Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K., RA Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C., RA Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W., RA Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J., RA Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J., RA Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y., RA Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J., RA Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H., RA Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L., RA Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J., RA Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S., RA Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K., RA Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R., RA Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M., RA Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S., RA Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J., RA Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W., RA McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N., RA Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V., RA Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J., RA Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E., RA Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S., RA Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M., RA White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H., RA Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E., RA Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G., RA Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.; RT "The DNA sequence and biological annotation of human chromosome 1."; RL Nature 441:315-321(2006). RN [4] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RA Mural R.J., Istrail S., Sutton G., Florea L., Halpern A.L., Mobarry C.M., RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H., RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K., RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D., RA Hunkapiller M.W., Myers E.W., Venter J.C.; RL Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases. RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC TISSUE=Lung; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA project: RT the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [6] RP GLYCOSYLATION AT ASN-112, AND PALMITOYLATION. RX PubMed=9245713; DOI=10.1006/bbrc.1997.6966; RA Ritter G., Cohen L.S., Nice E.C., Catimel B., Burgess A.W., Moritz R.L., RA Ji H., Heath J.K., White S.J., Welt S., Old L.J., Simpson R.J.; RT "Characterization of posttranslational modifications of human A33 antigen, RT a novel palmitoylated surface glycoprotein of human gastrointestinal RT epithelium."; RL Biochem. Biophys. Res. Commun. 236:682-686(1997). CC -!- FUNCTION: May play a role in cell-cell recognition and signaling. CC -!- INTERACTION: CC Q99795; P54852: EMP3; NbExp=3; IntAct=EBI-4289554, EBI-3907816; CC Q99795; O75355-2: ENTPD3; NbExp=3; IntAct=EBI-4289554, EBI-12279764; CC Q99795; Q13021: MALL; NbExp=3; IntAct=EBI-4289554, EBI-750078; CC Q99795; Q8IZ57: NRSN1; NbExp=3; IntAct=EBI-4289554, EBI-10264528; CC Q99795; P42857: NSG1; NbExp=3; IntAct=EBI-4289554, EBI-6380741; CC Q99795; Q9NUX5: POT1; NbExp=2; IntAct=EBI-4289554, EBI-752420; CC Q99795; Q9NRQ5: SMCO4; NbExp=3; IntAct=EBI-4289554, EBI-8640191; CC Q99795; O00526: UPK2; NbExp=3; IntAct=EBI-4289554, EBI-10179682; CC -!- SUBCELLULAR LOCATION: Membrane; Single-pass type I membrane protein. CC -!- TISSUE SPECIFICITY: Expressed in normal gastrointestinal epithelium and CC in 95% of colon cancers. CC -!- PTM: N-glycosylated, contains approximately 8 kDa of N-linked CC carbohydrate. {ECO:0000269|PubMed:9245713}. CC -!- PTM: Palmitoylated. {ECO:0000269|PubMed:9245713}. CC -!- WEB RESOURCE: Name=Atlas of Genetics and Cytogenetics in Oncology and CC Haematology; CC URL="https://atlasgeneticsoncology.org/gene/40735/GPA33"; CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; U79725; AAC50957.1; -; mRNA. DR EMBL; AK312833; BAG35687.1; -; mRNA. DR EMBL; AL158837; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; CH471067; EAW90783.1; -; Genomic_DNA. DR EMBL; BC069705; AAH69705.1; -; mRNA. DR EMBL; BC069723; AAH69723.1; -; mRNA. DR EMBL; BC069745; AAH69745.1; -; mRNA. DR EMBL; BC069761; AAH69761.1; -; mRNA. DR EMBL; BC069789; AAH69789.1; -; mRNA. DR EMBL; BC074830; AAH74830.1; -; mRNA. DR EMBL; BC074876; AAH74876.1; -; mRNA. DR EMBL; BC107164; AAI07165.1; -; mRNA. DR EMBL; BC107165; AAI07166.1; -; mRNA. DR CCDS; CCDS1258.1; -. DR RefSeq; NP_005805.1; NM_005814.2. DR AlphaFoldDB; Q99795; -. DR SMR; Q99795; -. DR BioGRID; 115517; 17. DR IntAct; Q99795; 8. DR STRING; 9606.ENSP00000356842; -. DR ChEMBL; CHEMBL3712927; -. DR GlyCosmos; Q99795; 3 sites, No reported glycans. DR GlyGen; Q99795; 4 sites, 1 O-linked glycan (1 site). DR iPTMnet; Q99795; -. DR PhosphoSitePlus; Q99795; -. DR SwissPalm; Q99795; -. DR BioMuta; GPA33; -. DR DMDM; 2842765; -. DR jPOST; Q99795; -. DR MassIVE; Q99795; -. DR MaxQB; Q99795; -. DR PaxDb; 9606-ENSP00000356842; -. DR PeptideAtlas; Q99795; -. DR ProteomicsDB; 78476; -. DR ABCD; Q99795; 5 sequenced antibodies. DR Antibodypedia; 1116; 404 antibodies from 31 providers. DR DNASU; 10223; -. DR Ensembl; ENST00000367868.4; ENSP00000356842.3; ENSG00000143167.12. DR GeneID; 10223; -. DR KEGG; hsa:10223; -. DR MANE-Select; ENST00000367868.4; ENSP00000356842.3; NM_005814.3; NP_005805.1. DR UCSC; uc001gea.2; human. DR AGR; HGNC:4445; -. DR CTD; 10223; -. DR DisGeNET; 10223; -. DR GeneCards; GPA33; -. DR HGNC; HGNC:4445; GPA33. DR HPA; ENSG00000143167; Tissue enriched (intestine). DR MIM; 602171; gene. DR neXtProt; NX_Q99795; -. DR OpenTargets; ENSG00000143167; -. DR PharmGKB; PA28826; -. DR VEuPathDB; HostDB:ENSG00000143167; -. DR eggNOG; ENOG502QR0Y; Eukaryota. DR GeneTree; ENSGT00940000160248; -. DR HOGENOM; CLU_040549_2_0_1; -. DR InParanoid; Q99795; -. DR OMA; TEMSGYY; -. DR OrthoDB; 3024056at2759; -. DR PhylomeDB; Q99795; -. DR TreeFam; TF330875; -. DR PathwayCommons; Q99795; -. DR SignaLink; Q99795; -. DR BioGRID-ORCS; 10223; 7 hits in 1137 CRISPR screens. DR ChiTaRS; GPA33; human. DR GeneWiki; GPA33; -. DR GenomeRNAi; 10223; -. DR Pharos; Q99795; Tbio. DR PRO; PR:Q99795; -. DR Proteomes; UP000005640; Chromosome 1. DR RNAct; Q99795; Protein. DR Bgee; ENSG00000143167; Expressed in ileal mucosa and 82 other cell types or tissues. DR ExpressionAtlas; Q99795; baseline and differential. DR GO; GO:0070062; C:extracellular exosome; HDA:UniProtKB. DR GO; GO:0005886; C:plasma membrane; IBA:GO_Central. DR GO; GO:0038023; F:signaling receptor activity; TAS:ProtInc. DR Gene3D; 2.60.40.10; Immunoglobulins; 2. DR InterPro; IPR042474; A33. DR InterPro; IPR007110; Ig-like_dom. DR InterPro; IPR036179; Ig-like_dom_sf. DR InterPro; IPR013783; Ig-like_fold. DR InterPro; IPR003599; Ig_sub. DR InterPro; IPR003598; Ig_sub2. DR InterPro; IPR013106; Ig_V-set. DR PANTHER; PTHR44969; CELL SURFACE A33 ANTIGEN; 1. DR PANTHER; PTHR44969:SF1; CELL SURFACE A33 ANTIGEN; 1. DR Pfam; PF13927; Ig_3; 1. DR Pfam; PF07686; V-set; 1. DR SMART; SM00409; IG; 2. DR SMART; SM00408; IGc2; 2. DR SMART; SM00406; IGv; 1. DR SUPFAM; SSF48726; Immunoglobulin; 2. DR PROSITE; PS50835; IG_LIKE; 2. DR Genevisible; Q99795; HS. PE 1: Evidence at protein level; KW Direct protein sequencing; Disulfide bond; Glycoprotein; KW Immunoglobulin domain; Lipoprotein; Membrane; Palmitate; KW Reference proteome; Signal; Transmembrane; Transmembrane helix. FT SIGNAL 1..21 FT CHAIN 22..319 FT /note="Cell surface A33 antigen" FT /id="PRO_0000014770" FT TOPO_DOM 22..235 FT /note="Extracellular" FT /evidence="ECO:0000255" FT TRANSMEM 236..256 FT /note="Helical" FT /evidence="ECO:0000255" FT TOPO_DOM 257..319 FT /note="Cytoplasmic" FT /evidence="ECO:0000255" FT DOMAIN 22..134 FT /note="Ig-like V-type" FT DOMAIN 140..227 FT /note="Ig-like C2-type" FT REGION 267..319 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT CARBOHYD 112 FT /note="N-linked (GlcNAc...) asparagine" FT /evidence="ECO:0000269|PubMed:9245713" FT CARBOHYD 200 FT /note="N-linked (GlcNAc...) asparagine" FT /evidence="ECO:0000255" FT CARBOHYD 223 FT /note="N-linked (GlcNAc...) asparagine" FT /evidence="ECO:0000255" FT DISULFID 43..117 FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00114" FT DISULFID 146..222 FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00114" FT DISULFID 162..211 FT /evidence="ECO:0000255|PROSITE-ProRule:PRU00114" FT VARIANT 20 FT /note="D -> N (in dbSNP:rs2274531)" FT /id="VAR_020079" FT VARIANT 165 FT /note="K -> N (in dbSNP:rs2228399)" FT /id="VAR_049874" SQ SEQUENCE 319 AA; 35632 MW; 9BFC7AAF45C2408E CRC64; MVGKMWPVLW TLCAVRVTVD AISVETPQDV LRASQGKSVT LPCTYHTSTS SREGLIQWDK LLLTHTERVV IWPFSNKNYI HGELYKNRVS ISNNAEQSDA SITIDQLTMA DNGTYECSVS LMSDLEGNTK SRVRLLVLVP PSKPECGIEG ETIIGNNIQL TCQSKEGSPT PQYSWKRYNI LNQEQPLAQP ASGQPVSLKN ISTDTSGYYI CTSSNEEGTQ FCNITVAVRS PSMNVALYVG IAVGVVAALI IIGIIIYCCC CRGKDDNTED KEDARPNREA YEEPPEQLRE LSREREEEDD YRQEEQRSTG RESPDHLDQ //