P10163 (PRB4_HUMAN) Reviewed, UniProtKB/Swiss-Prot
Last modified April 3, 2013. Version 96.
Names and origin
| Protein names | Recommended name: Basic salivary proline-rich protein 4. Short name: Salivary proline-rich protein Po. Alternative name(s): Parotid o protein; Salivary proline-rich protein II-1. Cleaved into the following 3 chains: Protein N1; Glycosylated protein A; Peptide P-D. |
| Gene names | PRB4 |
| Organism | Homo sapiens (Human) [Reference proteome] |
| Taxonomic identifier | 9606 [NCBI] |
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo |
Protein attributes
| Sequence length | 310 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Subcellular location | Secreted. |
| Post-translational modification | N-glycosylated. Ref.11 Proteolytically cleaved at the tripeptide Xaa-Pro-Gln, where Xaa in the P3 position is mostly lysine. The endoprotease may be of microbial origin. Pyroglutamate formation is found on at least Gln-46, Gln-48, Gln-67, Gln-88, Gln-90, Gln-193, Gln-214, Gln-288 and Gln-295, preferentially in diabetic patients and in head and neck cancer patients. Ref.10 |
| Polymorphism | The number of repeats is polymorphic and varies among different alleles. Allele S (short), allele M (medium) and allele L (long) contain 6, 7 and 9 tandem repeats respectively. |
| Sequence caution | The sequence CAA30543.1 differs from that shown. Reason: Erroneous gene model prediction. The sequence CAA30729.1 differs from that shown. Reason: Erroneous gene model prediction. |
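
The Xaa-Pro-Gln cleavage motif described in the post-translational modification row can be located with a simple pattern scan. The following is a minimal sketch, not the method used in the cited studies: the function name `find_xpq_sites` and the toy peptide are assumptions introduced for illustration, and the peptide is one hypothetical instance of the 21-residue repeat consensus, not the actual PRB4 sequence.

```python
import re


def find_xpq_sites(sequence, offset=1):
    """Return (position, tripeptide) for every Xaa-Pro-Gln in `sequence`.

    Positions refer to the Xaa (P3) residue and are 1-based by default.
    A lookahead is used so that overlapping tripeptides (for example the
    two hits in "KPQPQ") are all reported.
    """
    return [
        (match.start() + offset, match.group(1))
        for match in re.finditer(r"(?=(.PQ))", sequence)
    ]


# Hypothetical peptide built from the repeat consensus (NOT the real sequence).
toy_peptide = "KPQGPPPQGGNQPQRPPPPPG"

sites = find_xpq_sites(toy_peptide)
# Keep only the sites with lysine in the P3 position, the most common case
# according to the annotation.
lysine_sites = [site for site in sites if site[1].startswith("K")]

print(sites)         # [(1, 'KPQ'), (6, 'PPQ'), (12, 'QPQ')]
print(lysine_sites)  # [(1, 'KPQ')]
```

To report positions in full-length numbering, pass the 1-based position of the fragment's first residue as `offset`.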
Ontologies
| Keywords | |
|---|---|
| Cellular component | Secreted |
| Coding sequence diversity | Polymorphism |
| Domain | Repeat Signal |
| PTM | Glycoprotein Pyrrolidone carboxylic acid |
| Technical term | Complete proteome Direct protein sequencing Reference proteome |
| Gene Ontology (GO) | |
| Cellular_component | extracellular region (non-traceable author statement, Ref.5; source: UniProtKB) |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Feature identifier |
|---|---|---|---|---|
| Molecule processing | | | | |
| Signal peptide | 1 – 16 | 16 | Ref.5 | |
| Chain | 17 – 310 | 294 | Basic salivary proline-rich protein 4 | PRO_0000022102 |
| Peptide | 17 – 39 | 23 | Protein N1 | PRO_0000022103 |
| Chain | 40 – 177 | 138 | Glycosylated protein A | PRO_0000022104 |
| Chain | 241 – 310 | 70 | Peptide P-D | PRO_0000022099 |
| Regions | | | | |
| Repeat | 35 – 55 | 21 | 1 | |
| Repeat | 56 – 76 | 21 | 2 | |
| Repeat | 77 – 97 | 21 | 3 | |
| Repeat | 98 – 118 | 21 | 4 | |
| Repeat | 119 – 139 | 21 | 5 | |
| Repeat | 140 – 160 | 21 | 6 | |
| Repeat | 161 – 181 | 21 | 7 | |
| Repeat | 182 – 202 | 21 | 8 | |
| Repeat | 203 – 223 | 21 | 9 | |
| Repeat | 224 – 234 | 11 | 10; truncated | |
| Region | 35 – 234 | 200 | 9.5 X 21 AA tandem repeats of K-P-[EQ]-[GR]-[PR]-[PR]-P-Q-G-G-N-Q-[PS]-[QH]-[RG]-[PT]-P-P-[PH]-P-G | |
| Amino acid modifications | | | | |
| Glycosylation | 66 | 1 | N-linked (GlcNAc...) Potential | |
| Glycosylation | 87 | 1 | N-linked (GlcNAc...) Ref.11 | |
| Glycosylation | 108 | 1 | N-linked (GlcNAc...) Potential | |
| Glycosylation | 150 | 1 | N-linked (GlcNAc...) Potential | |
| Glycosylation | 171 | 1 | N-linked (GlcNAc...) Potential | |
| Glycosylation | 192 | 1 | N-linked (GlcNAc...) Potential | |
| Glycosylation | 213 | 1 | N-linked (GlcNAc...) Potential | |
| Glycosylation | 234 | 1 | N-linked (GlcNAc...) Potential | |
| Natural variations | | | | |
| Natural variant | 113 – 154 | 42 | Missing in allele M and allele S. | VAR_035034 |
| Natural variant | 164 – 184 | 21 | Missing in allele S. | VAR_035035 |
| Natural variant | 185 | 1 | R → G. Corresponds to variant rs11054244. | VAR_031548 |
| Natural variant | 186 | 1 | P → R. Corresponds to variant rs11054243. | VAR_031549 |
| Natural variant | 200 | 1 | P → H. Corresponds to variant rs12308244. | VAR_031550 |
| Natural variant | 272 | 1 | P → A. Ref.1 Ref.2 Ref.3 Corresponds to variant rs1052808. | VAR_031551 |
| Experimental info | | | | |
| Sequence conflict | 28 | 1 | S → P AA sequence Ref.5 | |
| Sequence conflict | 31 – 39 | 9 | LISGKPEGR → IIPPKPPG AA sequence Ref.5 | |
| Sequence conflict | 31 – 33 | 3 | LIS → PPP in AAB50687. Ref.6 | |
| Sequence conflict | 37 | 1 | E → Q in CAA30543. Ref.2 | |
| Sequence conflict | 37 | 1 | E → Q in CAA30542. Ref.7 | |
| Sequence conflict | 66 | 1 | N → D AA sequence Ref.5 | |
| Sequence conflict | 74 – 94 | 21 | Missing in CAA30542. Ref.7 | |
| Sequence conflict | 96 | 1 | P → PP AA sequence Ref.5 | |
| Sequence conflict | 101 | 1 | R → E AA sequence Ref.5 | |
| Sequence conflict | 122 – 123 | 2 | SR → RP in CAA30542. Ref.7 | |
| Sequence conflict | 129 | 1 | H → N in CAA30542. Ref.7 | |
| Sequence conflict | 154 – 174 | 21 | Missing in CAA30542. Ref.7 | |
| Sequence conflict | 169 – 171 | 3 | GGN → QGG AA sequence Ref.5 | |
| Sequence conflict | 192 | 1 | N → D AA sequence Ref.5 | |
| Sequence conflict | 213 | 1 | N → D AA sequence Ref.5 | |
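
The Length column, the allele-specific deletions and the repeat consensus in the feature table follow simple interval and pattern arithmetic. The sketch below is one way to check them against each other. It rests on two assumptions that the entry does not state outright: that the displayed 310-AA sequence corresponds to allele L, and that the consensus can be read as a regular expression by dropping the dashes. All names (`span_length`, `allele_summary`, `consensus_to_regex`) are hypothetical helpers, and the test peptide is an invented instance of the consensus, not the PRB4 sequence.

```python
import re

# Values copied from the entry. Treating the displayed sequence as allele L
# is an assumption of this sketch.
FULL_LENGTH = 310
REPEAT_LENGTH = 21
FULL_REPEATS_DISPLAYED = 9
CONSENSUS = (
    "K-P-[EQ]-[GR]-[PR]-[PR]-P-Q-G-G-N-Q-"
    "[PS]-[QH]-[RG]-[PT]-P-P-[PH]-P-G"
)

# Inclusive residue ranges missing in each allele (VAR_035034, VAR_035035).
DELETIONS = {
    "L": [],
    "M": [(113, 154)],
    "S": [(113, 154), (164, 184)],
}


def span_length(start, end):
    """Length of an inclusive residue range, as in the Length column."""
    return end - start + 1


def allele_summary(allele):
    """Return (protein length, number of full 21-residue repeats)."""
    removed = sum(span_length(s, e) for s, e in DELETIONS[allele])
    repeats_lost = removed // REPEAT_LENGTH
    return FULL_LENGTH - removed, FULL_REPEATS_DISPLAYED - repeats_lost


def consensus_to_regex(consensus):
    """Compile a dash-separated consensus; bracketed sets become classes."""
    return re.compile("".join(consensus.split("-")))


for allele in ("L", "M", "S"):
    length, repeats = allele_summary(allele)
    print(allele, length, repeats)
# L 310 9
# M 268 7
# S 247 6

# The repeat region 35-234 is nine full repeats plus one truncated repeat.
print(span_length(35, 234) == 9 * REPEAT_LENGTH + span_length(224, 234))  # True

# Invented peptide that satisfies the consensus (NOT the real sequence).
pattern = consensus_to_regex(CONSENSUS)
print(bool(pattern.fullmatch("KPQGPPPQGGNQPQRPPPPPG")))  # True
```

Under these assumptions the computed repeat counts (9, 7 and 6) agree with the Polymorphism note for alleles L, M and S.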
Sequences
References
| [1] | "Differential RNA splicing and post-translational cleavages in the human salivary proline-rich protein gene system." Maeda N., Kim H.-S., Azen E.A., Smithies O. J. Biol. Chem. 260:11123-11130(1985). Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ALLELE S), VARIANT ALA-272, POLYMORPHISM. |
| [2] | "Length polymorphisms in human proline-rich protein genes generated by intragenic unequal crossing over." Lyons K.M., Stein J.H., Smithies O. Genetics 120:267-278(1988). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] (ALLELES L AND S), VARIANT ALA-272. |
| [3] | "The finished DNA sequence of human chromosome 12." Scherer S.E., Muzny D.M., Buhay C.J., Chen R., Cree A., Ding Y., Dugan-Rocha S., Gill R., Gunaratne P., Harris R.A., Hawes A.C., Hernandez J., Hodgson A.V., Hume J., Jackson A., Khan Z.M., Kovar-Smith C., Lewis L.R. ... Gibbs R.A. Nature 440:346-351(2006). Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], VARIANT ALA-272. |
| [4] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team. Genome Res. 14:2121-2127(2004). Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ALLELE M). Tissue: Cerebellum. |
| [5] | "Alignment of amino acid and DNA sequences of human proline-rich proteins." Kauffman D.L., Keller P.J., Bennick A., Blum M. Crit. Rev. Oral Biol. Med. 4:287-292(1993). Cited for: PROTEIN SEQUENCE OF 17-112 AND 155-240. Tissue: Saliva. |
| [6] | "PRB1, PRB2, and PRB4 coded polymorphisms among human salivary concanavalin-A binding, II-1, and Po proline-rich proteins." Azen E.A., Amberger E., Fisher S., Prakobphol A., Niece R.L. Am. J. Hum. Genet. 58:143-153(1996). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 31-310 (ALLELE M). |
| [7] | "Many protein products from a few loci: assignment of human salivary proline-rich proteins to specific loci." Lyons K.M., Stein J.H., Smithies O. Genetics 120:255-265(1988). Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 35-310. |
| [8] | "Complete amino acid sequence of a basic proline-rich peptide, P-D, from human parotid saliva." Saitoh E., Isemura S., Sanada K. J. Biochem. 93:495-502(1983). Cited for: PROTEIN SEQUENCE OF 241-310. Tissue: Saliva. |
| [9] | "Basic proline-rich proteins from human parotid saliva: relationships of the covalent structures of ten proteins from a single individual." Kauffman D.L., Bennick A., Blum M., Keller P.J. Biochemistry 30:3351-3356(1991). Cited for: PROTEIN SEQUENCE OF 241-310. Tissue: Saliva. |
| [10] | "Identification of Lys-Pro-Gln as a novel cleavage site specificity of saliva-associated proteases." Helmerhorst E.J., Sun X., Salih E., Oppenheim F.G. J. Biol. Chem. 283:19957-19966(2008). Cited for: PROTEOLYTIC PROCESSING, MASS SPECTROMETRY. |
| [11] | "Finding new posttranslational modifications in salivary proline-rich proteins." Vitorino R., Alves R., Barros A., Caseiro A., Ferreira R., Lobo M.C., Bastos A., Duarte J., Carvalho D., Santos L.L., Amado F.L. Proteomics 10:3732-3742(2010). Cited for: GLYCOSYLATION AT ASN-87, PYROGLUTAMATE FORMATION, VARIANTS ALLELE L AND M, MASS SPECTROMETRY. |
Web resources
| SHMPD The Singapore human mutation and polymorphism database |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | K03207 mRNA. Translation: AAA60188.1. X07882 Genomic DNA. Translation: CAA30729.1. Sequence problems. X07715 Genomic DNA. Translation: CAA30543.1. Sequence problems. AC010176 Genomic DNA. No translation available. BC130386 mRNA. Translation: AAI30387.1. S80916 Genomic DNA. Translation: AAB50687.2. X07704 Genomic DNA. Translation: CAA30542.1. |
| IPI | IPI00019482. |
| PIR | PIHUSD. S03176. |
| RefSeq | NP_001248328.1. NM_001261399.1. NP_002714.2. NM_002723.4. |
| UniGene | Hs.528651. |
3D structure databases | |
| DisProt | DP00119. |
Protein-protein interaction databases | |
| STRING | 9606.ENSP00000279575. |
Polymorphism databases | |
| DMDM | 158517854. |
Proteomic databases | |
| PaxDb | P10163. |
| PRIDE | P10163. |
Genome annotation databases | |
| GeneID | 5545. |
| KEGG | hsa:5545. |
Organism-specific databases | |
| CTD | 5545. |
| GeneCards | GC12M011460. |
| H-InvDB | HIX0079490. |
| HGNC | HGNC:9340. PRB4. |
| MIM | 180990. gene. |
| neXtProt | NX_P10163. |
| PharmGKB | PA33702. |
Gene expression databases | |
| CleanEx | HS_PRB4. |
| Genevestigator | P10163. |
| GermOnline | ENSG00000121335. Homo sapiens. |
Family and domain databases | |
| InterPro | IPR026086. Pro-rich. |
| PANTHER | PTHR23203. PTHR23203. 1 hit. |
Other | |
| GenomeRNAi | 5545. |
| NextBio | 21484. |
Entry information
| Entry name | PRB4_HUMAN |
| Accession | Primary (citable) accession number: P10163. Secondary accession number(s): A1L439, P81489 |
| Entry status | Reviewed (UniProtKB/Swiss-Prot) |
| Annotation program | Chordata Protein Annotation Program |
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. |
Relevant documents
| Human chromosome 12 Human chromosome 12: entries, gene names and cross-references to MIM |
| Human entries with polymorphisms or disease mutations List of human entries with polymorphisms or disease mutations |
| Human polymorphisms and disease mutations Index of human polymorphisms and disease mutations |
| MIM cross-references Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot |

