Q8IZF2 (GP116_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 120.

Names and origin

Protein names: Recommended name: Probable G-protein coupled receptor 116
Gene names: Name: GPR116; Synonyms: KIAA0758
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 1346 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

May have a role in the regulation of acid-base balance (By similarity).

Subunit structure

Exists as disulfide-linked dimers at the cell surface (By similarity).

Subcellular location

Cell membrane; Multi-pass membrane protein (Potential).

Post-translational modification

Proteolytically cleaved at 2 highly conserved sites: one in the SEA domain and the other in the stalk domain region preceding the first transmembrane domain. The resulting 2 subunits, the extracellular subunit and the seven-transmembrane subunit, remain tightly associated and non-covalently linked (By similarity).

Sequence similarities

Belongs to the G-protein coupled receptor 2 family. LN-TM7 subfamily.

Contains 1 GPS domain.

Contains 3 Ig-like (immunoglobulin-like) domains.

Contains 1 SEA domain.

Sequence caution

The sequence BAA34478.2 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened.

The sequence CAB43394.1 differs from that shown. Reason: Contaminating sequence. Potential poly-A sequence.

Alternative products

This entry describes 3 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q8IZF2-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q8IZF2-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1273-1292: Missing.
     1322-1346: Missing.
Isoform 3 (identifier: Q8IZF2-3)

The sequence of this isoform differs from the canonical sequence as follows:
     272-414: NESNFFVTPE...KQEGKINIPG → R
Note: No experimental confirmation available.
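
The isoform differences listed above imply a length for each isoform. The sketch below applies those edits to the canonical length of 1346 AA; the names `ISOFORM_EDITS` and `isoform_length` are illustrative and are not part of any UniProt tooling.

```python
# Canonical (isoform 1) length, as stated in the entry.
CANONICAL_LENGTH = 1346

# Differences from the canonical sequence: (start, end, replacement).
# An empty replacement string means the range is missing in that isoform.
ISOFORM_EDITS = {
    "Q8IZF2-2": [(1273, 1292, ""), (1322, 1346, "")],
    "Q8IZF2-3": [(272, 414, "R")],
}


def isoform_length(canonical_length, edits):
    """Apply each (start, end, replacement) edit to the canonical length."""
    length = canonical_length
    for start, end, replacement in edits:
        removed = end - start + 1  # positions are 1-based and inclusive
        length += len(replacement) - removed
    return length


lengths = {
    isoform: isoform_length(CANONICAL_LENGTH, edits)
    for isoform, edits in ISOFORM_EDITS.items()
}
print(lengths)
```

The computed values (1301 for isoform 2, 1204 for isoform 3) agree with the lengths given for these isoforms in the Sequences section.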

Sequence annotation (Features)

Feature key           Position(s)    Length  Description                               Feature identifier

Molecule processing

Signal peptide        1 – 21         21      Potential
Chain                 22 – 1346      1325    Probable G-protein coupled receptor 116   PRO_0000012896

Regions

Topological domain    22 – 1006      985     Extracellular; Potential
Transmembrane         1007 – 1027    21      Helical; Name=1; Potential
Topological domain    1028 – 1053    26      Cytoplasmic; Potential
Transmembrane         1054 – 1074    21      Helical; Name=2; Potential
Topological domain    1075 – 1090    16      Extracellular; Potential
Transmembrane         1091 – 1111    21      Helical; Name=3; Potential
Topological domain    1112 – 1128    17      Cytoplasmic; Potential
Transmembrane         1129 – 1149    21      Helical; Name=4; Potential
Topological domain    1150 – 1173    24      Extracellular; Potential
Transmembrane         1174 – 1194    21      Helical; Name=5; Potential
Topological domain    1195 – 1220    26      Cytoplasmic; Potential
Transmembrane         1221 – 1241    21      Helical; Name=6; Potential
Topological domain    1242 – 1244    3       Extracellular; Potential
Transmembrane         1245 – 1265    21      Helical; Name=7; Potential
Topological domain    1266 – 1346    81      Cytoplasmic; Potential
Domain                166 – 273      108     SEA
Domain                267 – 368      102     Ig-like 1
Domain                369 – 466      98      Ig-like 2
Domain                471 – 561      91      Ig-like 3
Domain                951 – 1002     52      GPS
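
The topological domain and transmembrane rows above should tile the mature chain (22 to 1346) without gaps or overlaps. The following sketch checks that, using coordinates transcribed from the table; `tiles_chain` and `segment_length` are hypothetical helper names introduced only for this illustration.

```python
# Topology segments transcribed from the Regions table: (feature key, start, end).
SEGMENTS = [
    ("Topological domain", 22, 1006),
    ("Transmembrane", 1007, 1027),
    ("Topological domain", 1028, 1053),
    ("Transmembrane", 1054, 1074),
    ("Topological domain", 1075, 1090),
    ("Transmembrane", 1091, 1111),
    ("Topological domain", 1112, 1128),
    ("Transmembrane", 1129, 1149),
    ("Topological domain", 1150, 1173),
    ("Transmembrane", 1174, 1194),
    ("Topological domain", 1195, 1220),
    ("Transmembrane", 1221, 1241),
    ("Topological domain", 1242, 1244),
    ("Transmembrane", 1245, 1265),
    ("Topological domain", 1266, 1346),
]


def tiles_chain(segments, chain_start, chain_end):
    """Return True if segments cover chain_start..chain_end with no gap or overlap."""
    expected_start = chain_start
    for _key, start, end in segments:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == chain_end + 1


def segment_length(start, end):
    """Length of a 1-based, inclusive range."""
    return end - start + 1


helix_count = sum(1 for key, _s, _e in SEGMENTS if key == "Transmembrane")
covered = sum(segment_length(s, e) for _k, s, e in SEGMENTS)
print(tiles_chain(SEGMENTS, 22, 1346), helix_count, covered)
```

With these coordinates the check passes, the helix count is 7, and the covered length equals the 1325 residues listed for the chain.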

Sites

Site                  226 – 227      2       Cleavage; By similarity
Site                  990 – 991      2       Cleavage; By similarity

Amino acid modifications

Glycosylation         73             1       N-linked (GlcNAc...); Potential
Glycosylation         94             1       N-linked (GlcNAc...); Potential
Glycosylation         106            1       N-linked (GlcNAc...); Potential
Glycosylation         188            1       N-linked (GlcNAc...); Potential
Glycosylation         256            1       N-linked (GlcNAc...); Ref.9
Glycosylation         272            1       N-linked (GlcNAc...); Potential
Glycosylation         301            1       N-linked (GlcNAc...); Potential
Glycosylation         315            1       N-linked (GlcNAc...); Ref.9
Glycosylation         328            1       N-linked (GlcNAc...); Potential
Glycosylation         398            1       N-linked (GlcNAc...); Potential
Glycosylation         472            1       N-linked (GlcNAc...); Potential
Glycosylation         487            1       N-linked (GlcNAc...); Potential
Glycosylation         505            1       N-linked (GlcNAc...); Potential
Glycosylation         540            1       N-linked (GlcNAc...); Potential
Glycosylation         627            1       N-linked (GlcNAc...); Potential
Glycosylation         649            1       N-linked (GlcNAc...); Potential
Glycosylation         666            1       N-linked (GlcNAc...); Potential
Glycosylation         820            1       N-linked (GlcNAc...); Potential
Glycosylation         931            1       N-linked (GlcNAc...); Potential
Glycosylation         963            1       N-linked (GlcNAc...); Potential
Glycosylation         982            1       N-linked (GlcNAc...); Potential
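
Sites marked "Potential" above are predictions. A common way to find candidate N-linked glycosylation sites is to scan for the consensus sequon N-x-[S/T], where x is any residue except proline. The sketch below applies that rule to residues 61 to 120 of the canonical sequence, copied from this entry. It is an illustration of the general sequon rule, not necessarily the procedure used to annotate this entry; `sequon_positions` is a hypothetical helper name.

```python
import re

# Residues 61-120 of the canonical sequence, copied from the entry.
FRAGMENT = "EEYTVNIEISFENASFLDPIKAYLNSLSFPIHGNNTDQITDILSINVTTVCRPAGNEIWC"
FRAGMENT_START = 61

# N followed by any residue except P, then S or T.
# The lookahead keeps matches zero-width after N, so adjacent
# candidates such as the two N residues in "NNT" are both tested.
SEQUON = re.compile(r"N(?=[^P][ST])")


def sequon_positions(fragment, offset):
    """Return 1-based positions of N residues that start an N-x-[S/T] sequon."""
    return [match.start() + offset for match in SEQUON.finditer(fragment)]


print(sequon_positions(FRAGMENT, FRAGMENT_START))
```

For this fragment the scan reports positions 73, 94 and 106, which are the first three glycosylation sites in the table.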

Natural variations

Alternative sequence  272 – 414      143     NESNF…INIPG → R in isoform 3.     VSP_039130
Alternative sequence  1273 – 1292    20      Missing in isoform 2.             VSP_010837
Alternative sequence  1322 – 1346    25      Missing in isoform 2.             VSP_010838
Natural variant       604            1       T → M. Ref.2 Ref.3 Ref.5 Ref.7    VAR_025326
                                             Corresponds to variant rs586024 [dbSNP | Ensembl].
Natural variant       801            1       V → I. Ref.5                      VAR_055291
                                             Corresponds to variant rs9395218 [dbSNP | Ensembl].
Natural variant       856            1       M → T. Ref.3 Ref.7                VAR_024477
                                             Corresponds to variant rs547499 [dbSNP | Ensembl].

Experimental info

Sequence conflict     272            1       Missing in AAN46672. Ref.1
Sequence conflict     573            1       V → I in CAB43394. Ref.7
Sequence conflict     1075           1       Q → R in AL832125. Ref.5

Sequences

Sequence               Length  Mass (Da)
Isoform 1 [UniParc]    1,346   149,457

Last modified May 5, 2009. Version 3.
Checksum: 72A9D02B08218A60

        10         20         30         40         50         60 
MKSPRRTTLC LMFIVIYSSK AALNWNYEST IHPLSLHEHE PAGEEALRQK RAVATKSPTA 

        70         80         90        100        110        120 
EEYTVNIEIS FENASFLDPI KAYLNSLSFP IHGNNTDQIT DILSINVTTV CRPAGNEIWC 

       130        140        150        160        170        180 
SCETGYGWPR ERCLHNLICQ ERDVFLPGHH CSCLKELPPN GPFCLLQEDV TLNMRVRLNV 

       190        200        210        220        230        240 
GFQEDLMNTS SALYRSYKTD LETAFRKGYG ILPGFKGVTV TGFKSGSVVV TYEVKTTPPS 

       250        260        270        280        290        300 
LELIHKANEQ VVQSLNQTYK MDYNSFQAVT INESNFFVTP EIIFEGDTVS LVCEKEVLSS 

       310        320        330        340        350        360 
NVSWRYEEQQ LEIQNSSRFS IYTALFNNMT SVSKLTIHNI TPGDAGEYVC KLILDIFEYE 

       370        380        390        400        410        420 
CKKKIDVMPI QILANEEMKV MCDNNPVSLN CCSQGNVNWS KVEWKQEGKI NIPGTPETDI 

       430        440        450        460        470        480 
DSSCSRYTLK ADGTQCPSGS SGTTVIYTCE FISAYGARGS ANIKVTFISV ANLTITPDPI 

       490        500        510        520        530        540 
SVSEGQNFSI KCISDVSNYD EVYWNTSAGI KIYQRFYTTR RYLDGAESVL TVKTSTREWN 

       550        560        570        580        590        600 
GTYHCIFRYK NSYSIATKDV IVHPLPLKLN IMVDPLEATV SCSGSHHIKC CIEEDGDYKV 

       610        620        630        640        650        660 
TFHTGSSSLP AAKEVNKKQV CYKHNFNASS VSWCSKTVDV CCHFTNAANN SVWSPSMKLN 

       670        680        690        700        710        720 
LVPGENITCQ DPVIGVGEPG KVIQKLCRFS NVPSSPESPI GGTITYKCVG SQWEEKRNDC 

       730        740        750        760        770        780 
ISAPINSLLQ MAKALIKSPS QDEMLPTYLK DLSISIDKAE HEISSSPGSL GAIINILDLL 

       790        800        810        820        830        840 
STVPTQVNSE MMTHVLSTVN VILGKPVLNT WKVLQQQWTN QSSQLLHSVE RFSQALQSGD 

       850        860        870        880        890        900 
SPPLSFSQTN VQMSSMVIKS SHPETYQQRF VFPYFDLWGN VVIDKSYLEN LQSDSSIVTM 

       910        920        930        940        950        960 
AFPTLQAILA QDIQENNFAE SLVMTTTVSH NTTMPFRISM TFKNNSPSGG ETKCVFWNFR 

       970        980        990       1000       1010       1020 
LANNTGGWDS SGCYVEEGDG DNVTCICDHL TSFSILMSPD SPDPSSLLGI LLDIISYVGV 

      1030       1040       1050       1060       1070       1080 
GFSILSLAAC LVVEAVVWKS VTKNRTSYMR HTCIVNIAAS LLVANTWFIV VAAIQDNRYI 

      1090       1100       1110       1120       1130       1140 
LCKTACVAAT FFIHFFYLSV FFWMLTLGLM LFYRLVFILH ETSRSTQKAI AFCLGYGCPL 

      1150       1160       1170       1180       1190       1200 
AISVITLGAT QPREVYTRKN VCWLNWEDTK ALLAFAIPAL IIVVVNITIT IVVITKILRP 

      1210       1220       1230       1240       1250       1260 
SIGDKPCKQE KSSLFQISKS IGVLTPLLGL TWGFGLTTVF PGTNLVFHII FAILNVFQGL 

      1270       1280       1290       1300       1310       1320 
FILLFGCLWD LKVQEALLNK FSLSRWSSQH SKSTSLGSST PVFSMSSPIS RRFNNLFGKT 

      1330       1340 
GTYNVSTPEA TSSSLENSSS ASSLLN 
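
The sequence above is displayed in groups of 10 residues under ruler lines of position numbers. A minimal sketch for turning such a block back into a plain residue string is shown here on the last two rows of the display (residues 1261 to 1346); `parse_sequence_block` is a hypothetical helper name, and the approach assumes ruler lines contain only digits.

```python
def parse_sequence_block(text):
    """Join residue groups from a ruler-formatted block into one string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines, which hold only position numbers.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# Last two rows of the displayed sequence, copied from the entry.
BLOCK = """\
      1270       1280       1290       1300       1310       1320
FILLFGCLWD LKVQEALLNK FSLSRWSSQH SKSTSLGSST PVFSMSSPIS RRFNNLFGKT

      1330       1340
GTYNVSTPEA TSSSLENSSS ASSLLN
"""

tail = parse_sequence_block(BLOCK)
print(len(tail), tail[-6:])
```

The parsed string has 86 residues, which matches the span 1261 to 1346 of the 1346-residue canonical sequence.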

Isoform 2 [UniParc]    1,301   144,601
Checksum: 9DEB380A0BEDA44B

Isoform 3 [UniParc]    1,204   133,307
Checksum: 0106B9A59C2D5B0E

References

[1] "Novel human G protein-coupled receptors with long N-terminals containing GPS domains and Ser/Thr-rich regions."
Fredriksson R., Lagerstroem M.C., Hoeglund P.J., Schioeth H.B.
FEBS Lett. 531:407-414(2002)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 2).
[2] "Complete coding sequence of GPR116."
Bonner T.I., Nagle J.W., Kauffman D.
Submitted (DEC-2003) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), VARIANT MET-604.
Tissue: Brain.
[3] "Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro."
Nagase T., Ishikawa K., Suyama M., Kikuno R., Hirosawa M., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
DNA Res. 5:355-364(1998)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), VARIANTS MET-604 AND THR-856.
Tissue: Brain.
[4] Ohara O., Nagase T., Kikuno R., Ishikawa K., Suyama M.
Submitted (AUG-2005) to the EMBL/GenBank/DDBJ databases
Cited for: SEQUENCE REVISION.
[5] "The full-ORF clone resource of the German cDNA consortium."
Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A., Wiemann S., Schupp I.
BMC Genomics 8:399-399(2007)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3), NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-613 (ISOFORM 1), VARIANTS MET-604 AND ILE-801.
Tissue: Brain and Cervix.
[6] "The DNA sequence and analysis of human chromosome 6."
Mungall A.J., Palmer S.A., Sims S.K., Edwards C.A., Ashurst J.L., Wilming L., Jones M.C., Horton R., Hunt S.E., Scott C.E., Gilbert J.G.R., Clamp M.E., Bethel G., Milne S., Ainscough R., Almeida J.P., Ambrose K.D., Andrews T.D., Ashwell R.I.S., Babbage A.K., Bagguley C.L., Bailey J., Banerjee R., Barker D.J., Barlow K.F., Bates K., Beare D.M., Beasley H., Beasley O., Bird C.P., Blakey S.E., Bray-Allen S., Brook J., Brown A.J., Brown J.Y., Burford D.C., Burrill W., Burton J., Carder C., Carter N.P., Chapman J.C., Clark S.Y., Clark G., Clee C.M., Clegg S., Cobley V., Collier R.E., Collins J.E., Colman L.K., Corby N.R., Coville G.J., Culley K.M., Dhami P., Davies J., Dunn M., Earthrowl M.E., Ellington A.E., Evans K.A., Faulkner L., Francis M.D., Frankish A., Frankland J., French L., Garner P., Garnett J., Ghori M.J., Gilby L.M., Gillson C.J., Glithero R.J., Grafham D.V., Grant M., Gribble S., Griffiths C., Griffiths M.N.D., Hall R., Halls K.S., Hammond S., Harley J.L., Hart E.A., Heath P.D., Heathcott R., Holmes S.J., Howden P.J., Howe K.L., Howell G.R., Huckle E., Humphray S.J., Humphries M.D., Hunt A.R., Johnson C.M., Joy A.A., Kay M., Keenan S.J., Kimberley A.M., King A., Laird G.K., Langford C., Lawlor S., Leongamornlert D.A., Leversha M., Lloyd C.R., Lloyd D.M., Loveland J.E., Lovell J., Martin S., Mashreghi-Mohammadi M., Maslen G.L., Matthews L., McCann O.T., McLaren S.J., McLay K., McMurray A., Moore M.J.F., Mullikin J.C., Niblett D., Nickerson T., Novik K.L., Oliver K., Overton-Larty E.K., Parker A., Patel R., Pearce A.V., Peck A.I., Phillimore B.J.C.T., Phillips S., Plumb R.W., Porter K.M., Ramsey Y., Ranby S.A., Rice C.M., Ross M.T., Searle S.M., Sehra H.K., Sheridan E., Skuce C.D., Smith S., Smith M., Spraggon L., Squares S.L., Steward C.A., Sycamore N., Tamlyn-Hall G., Tester J., Theaker A.J., Thomas D.W., Thorpe A., Tracey A., Tromans A., Tubby B., Wall M., Wallis J.M., West A.P., White S.S., Whitehead S.L., Whittaker H., Wild A., Willey D.J., Wilmer T.E., Wood J.M., Wray P.W., Wyatt J.C., Young L., Younger R.M., Bentley D.R., Coulson A., Durbin R.M., Hubbard T., Sulston J.E., Dunham I., Rogers J., Beck S.
Nature 425:805-811(2003)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[7] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), VARIANTS MET-604 AND THR-856.
[8] "The G protein-coupled receptor repertoires of human and mouse."
Vassilatis D.K., Hohmann J.G., Zeng H., Li F., Ranchalis J.E., Mortrud M.T., Brown A., Rodriguez S.S., Weller J.R., Wright A.C., Bergmann J.E., Gaitanaris G.A.
Proc. Natl. Acad. Sci. U.S.A. 100:4903-4908(2003)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 949-1062.
[9] "Human plasma N-glycoproteome analysis by immunoaffinity subtraction, hydrazide chemistry, and mass spectrometry."
Liu T., Qian W.-J., Gritsenko M.A., Camp D.G. II, Monroe M.E., Moore R.J., Smith R.D.
J. Proteome Res. 4:2070-2080(2005)
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-256 AND ASN-315.
Tissue: Plasma.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
AY140958 mRNA. Translation: AAN46672.1.
AY498875 mRNA. Translation: AAS21061.1.
AB018301 mRNA. Translation: BAA34478.2. Different initiation.
AL050295 mRNA. Translation: CAB43394.1. Sequence problems.
AL832125 mRNA. No translation available.
AL096772 Genomic DNA. Translation: CAB61578.1.
BC066121 mRNA. Translation: AAH66121.1.
AY255552 mRNA. Translation: AAO85064.1.
CCDS: CCDS4919.1. [Q8IZF2-1]
PIR: T08685.
RefSeq: NP_001091988.1. NM_001098518.1. [Q8IZF2-1]
NP_056049.4. NM_015234.4. [Q8IZF2-1]
UniGene: Hs.362806.

3D structure databases

ProteinModelPortal: Q8IZF2.
ModBase: Search...
MobiDB: Search...

Protein-protein interaction databases

IntAct: Q8IZF2. 1 interaction.
MINT: MINT-4715450.
STRING: 9606.ENSP00000265417.

Protein family/group databases

MEROPS: S63.022.
GPCRDB: Search...

PTM databases

PhosphoSite: Q8IZF2.

Polymorphism databases

DMDM: 229462973.

Proteomic databases

PaxDb: Q8IZF2.
PRIDE: Q8IZF2.

Protocols and materials databases

DNASU: 221395.
StructuralBiologyKnowledgebase: Search...

Genome annotation databases

Ensembl: ENST00000265417; ENSP00000265417; ENSG00000069122. [Q8IZF2-1]
ENST00000283296; ENSP00000283296; ENSG00000069122. [Q8IZF2-1]
ENST00000456426; ENSP00000412866; ENSG00000069122. [Q8IZF2-3]
GeneID: 221395.
KEGG: hsa:221395.
UCSC: uc003oyo.3. human. [Q8IZF2-1]
uc003oyp.3. human. [Q8IZF2-3]
uc010jzi.1. human. [Q8IZF2-2]

Organism-specific databases

CTD: 221395.
GeneCards: GC06M046820.
HGNC: HGNC:19030. GPR116.
neXtProt: NX_Q8IZF2.
PharmGKB: PA134886165.
HUGE: Search...
GenAtlas: Search...

Phylogenomic databases

eggNOG: NOG235397.
HOGENOM: HOG000112764.
HOVERGEN: HBG051772.
InParanoid: Q8IZF2.
KO: K08458.
OMA: STPVFSM.
PhylomeDB: Q8IZF2.
TreeFam: TF316380.

Gene expression databases

ArrayExpress: Q8IZF2.
Bgee: Q8IZF2.
CleanEx: HS_GPR116.
Genevestigator: Q8IZF2.

Family and domain databases

Gene3D: 2.60.40.10. 2 hits.
InterPro: IPR017981. GPCR_2-like.
IPR008078. GPCR_2_Ig-hepta_rcpt.
IPR000832. GPCR_2_secretin-like.
IPR017983. GPCR_2_secretin-like_CS.
IPR000203. GPS.
IPR007110. Ig-like_dom.
IPR013783. Ig-like_fold.
IPR013098. Ig_I-set.
IPR003599. Ig_sub.
IPR003598. Ig_sub2.
IPR000082. SEA_dom.
Pfam: PF00002. 7tm_2. 1 hit.
PF01825. GPS. 1 hit.
PF07679. I-set. 1 hit.
PF01390. SEA. 1 hit.
PRINTS: PR00249. GPCRSECRETIN.
PR01695. IGHEPTARCPTR.
SMART: SM00303. GPS. 1 hit.
SM00409. IG. 1 hit.
SM00408. IGc2. 1 hit.
SM00200. SEA. 1 hit.
SUPFAM: SSF82671. SSF82671. 1 hit.
PROSITE: PS00650. G_PROTEIN_RECEP_F2_2. 1 hit.
PS50261. G_PROTEIN_RECEP_F2_4. 1 hit.
PS50221. GPS. 1 hit.
PS50835. IG_LIKE. 3 hits.
PS50024. SEA. 1 hit.
ProtoNet: Search...

Other

ChiTaRS: GPR116. human.
GeneWiki: GPR116.
GenomeRNAi: 221395.
NextBio: 91311.
PRO: Q8IZF2.

Entry information

Entry name: GP116_HUMAN
Accession: Primary (citable) accession number: Q8IZF2
Secondary accession number(s): O94858, Q5TF06, Q6RGN2, Q86SP0, Q9Y3Z2
Entry history:
Integrated into UniProtKB/Swiss-Prot: July 19, 2004
Last sequence update: May 5, 2009
Last modified: July 9, 2014
This is version 120 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human chromosome 6

Human chromosome 6: entries, gene names and cross-references to MIM

7-transmembrane G-linked receptors

List of 7-transmembrane G-linked receptor entries