Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q04118 (PRB3_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 80. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (4) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Basic salivary proline-rich protein 3
Alternative name(s):
Parotid salivary glycoprotein G1
Proline-rich protein G1
Gene names
Name:PRB3
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length309 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Acts as a receptor for the Gram-negative bacterium F.nucleatum. Ref.4

Subunit structure

The Gl-8 variant forms a disulfide-bonded heterodimer with salivary perodixase.

Subcellular location

Secreted.

Post-translational modification

The Gl-8 variant contains an interchain disulfide bond with salivary peroxidase.

N- and O-glycosylated; contains about 50% carbohydrate. This is composed of highly fucosylated N-linked saccharides, the major structure is a biantennary asialosaccharide containing 2 fucose residues on one antenna and an unsubstituted terminal lactosamine sequence on the other. The Gram-negative bacterium F.nucleatum binds to carbohydrates containing unsubstituted GalBeta1,4GlcNAc residues. Ref.4 Ref.6

Proteolytically cleaved at the tripeptide Xaa-Pro-Gln, where Xaa in the P3 position is mostly lysine. The endoprotease may be of microbial origin. Besides on the N-terminal of mature PRB3, pyroglutamate formation found on at least Gln-67, Gln-88, Gln-214 and Gln-295. Ref.5

Polymorphism

The number of repeats is polymorphic and varies among different alleles. The sequence shown is that of allele L (long). There is an allele S (short) which contains 6 tandem repeats.

Sequence caution

The sequence CAA30477.1 differs from that shown. Reason: Erroneous translation. Wrong choice of frame.

Ontologies

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1616 Potential
Chain17 – 309293Basic salivary proline-rich protein 3
PRO_0000022097

Regions

Repeat53 – 73211
Repeat74 – 94212
Repeat95 – 115213
Repeat116 – 136214
Repeat137 – 157215
Repeat158 – 178216
Repeat179 – 199217
Repeat200 – 220218
Repeat221 – 241219
Repeat242 – 2622110
Region53 – 26221010 X 21 AA tandem repeats of [RH]-P-G-K-P-[EQ]-G-[PQS]-P-[PS]-Q-[GE]-G-N-[QK]-[SP]-[QR]-[GR]-P-P-P
Compositional bias36 – 308273Pro-rich

Amino acid modifications

Modified residue171Pyrrolidone carboxylic acid
Modified residue241Phosphoserine Ref.6
Glycosylation661N-linked (GlcNAc...) Potential
Glycosylation871N-linked (GlcNAc...); prevelant in head and neck cancer patients Ref.6
Glycosylation891O-linked (Hex) Ref.6
Glycosylation1081N-linked (GlcNAc...) Potential
Glycosylation1291N-linked (GlcNAc...) Potential
Glycosylation1501N-linked (GlcNAc...) Potential
Glycosylation1711N-linked (GlcNAc...) Potential
Glycosylation1921N-linked (GlcNAc...) Potential
Glycosylation2131N-linked (GlcNAc...) Potential
Glycosylation2551N-linked (GlcNAc...) Ref.6

Natural variations

Natural variant531P → C in Gl-8; requires 2 nucleotide substitutions. Ref.3
VAR_019696
Natural variant158 – 22063Missing in allele S.
VAR_055031
Natural variant1861P → Q. Ref.2
Corresponds to variant rs11054208 [ dbSNP | Ensembl ].
VAR_055032

Experimental info

Sequence conflict401R → P in CAA30728. Ref.1
Sequence conflict47 – 537PQRTPPP → SQGPPPR in CAA30728. Ref.1
Sequence conflict511P → S in CAA30477. Ref.1
Sequence conflict601R → P in CAA30728. Ref.1
Sequence conflict811P → Q in CAA30728. Ref.1
Sequence conflict1021Q → P in CAA30728. Ref.1
Sequence conflict1191K → E in CAA30728. Ref.1
Sequence conflict1371R → H in CAA30728. Ref.1
Sequence conflict1581H → R in CAA30728. Ref.1
Sequence conflict1581H → R in AAH96211. Ref.2
Sequence conflict2371Q → R in CAA30728. Ref.1
Sequence conflict2641G → R in CAA30728. Ref.1
Sequence conflict2641G → R in CAA30477. Ref.1

Sequences

Sequence LengthMass (Da)Tools
Q04118 [UniParc].

Last modified May 5, 2009. Version 2.
Checksum: A8FCBEBA784618C6

FASTA30930,980
        10         20         30         40         50         60 
MLLILLSVAL LALSSAQSLN EDVSQEESPS VISGKPEGRR PQGGNQPQRT PPPPGKPEGR 

        70         80         90        100        110        120 
PPQGGNQSQG PPPRPGKPEG PPPQGGNQSQ GPPPRPGKPE GQPPQGGNQS QGPPPRPGKP 

       130        140        150        160        170        180 
EGPPPQGGNQ SQGPPPRPGK PEGPPPQGGN QSQGPPPHPG KPEGPPPQGG NQSQGPPPRP 

       190        200        210        220        230        240 
GKPEGPPPQG GNQSQGPPPR PGKPEGPPPQ GGNQSQGPPP RPGKPEGSPS QGGNKPQGPP 

       250        260        270        280        290        300 
PHPGKPQGPP PQEGNKPQRP PPPGRPQGPP PPGGNPQQPL PPPAGKPQGP PPPPQGGRPH 


RPPQGQPPQ 

« Hide

References

« Hide 'large scale' references
[1]"Length polymorphisms in human proline-rich protein genes generated by intragenic unequal crossing over."
Lyons K.M., Stein J.H., Smithies O.
Genetics 120:267-278(1988) [PubMed: 2851479] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] (ALLELES L AND S), POLYMORPHISM.
[2]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ALLELE L), VARIANT GLN-186.
[3]"Alleles at the PRB3 locus coding for a disulfide-bonded human salivary proline-rich glycoprotein (Gl 8) and a null in an Ashkenazi Jew."
Azen E.A., Minaguchi K., Latreille P., Kim H.-S.
Am. J. Hum. Genet. 47:686-697(1990) [PubMed: 2171329] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 35-309 (ALLELE S), VARIANT GL-8 CYS-53, INTERACTION WITH SALIVARY PEROXIDASE, DISULFIDE BOND.
[4]"Structure and bacterial receptor activity of a human salivary proline-rich glycoprotein."
Gillece-Castro B.L., Prakobphol A., Burlingame A.L., Leffler H., Fisher S.J.
J. Biol. Chem. 266:17358-17368(1991) [PubMed: 1894623] [Abstract]
Cited for: PROTEIN SEQUENCE OF 68-92; 110-127; 131-148; 173-190 AND 194-211, FUNCTION, GLYCOSYLATION.
Tissue: Saliva.
[5]"Identification of Lys-Pro-Gln as a novel cleavage site specificity of saliva-associated proteases."
Helmerhorst E.J., Sun X., Salih E., Oppenheim F.G.
J. Biol. Chem. 283:19957-19966(2008) [PubMed: 18463091] [Abstract]
Cited for: PROTEOLYTIC PROCESSING, MASS SPECTROMETRY.
[6]"Finding new posttranslational modifications in salivary proline-rich proteins."
Vitorino R., Alves R., Barros A., Caseiro A., Ferreira R., Lobo M.C., Bastos A., Duarte J., Carvalho D., Santos L.L., Amado F.L.
Proteomics 10:3732-3742(2010) [PubMed: 20879038] [Abstract]
Cited for: GLYCOSYLATION AT ASN-87; SER-89 AND ASN-255, PYROGLUTAMATE FORMATION AT GLN-17, PHOSPHORYLATION AT SER-24, MASS SPECTROMETRY.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
X07637 Genomic DNA. Translation: CAA30477.1. Sequence problems.
X07881 Genomic DNA. Translation: CAA30728.1.
BC096209 mRNA. Translation: AAH96209.1.
BC096210 mRNA. Translation: AAH96210.1.
BC096211 mRNA. Translation: AAH96211.1.
IPIIPI00006699.
PIRA36298.
B36298.
S10889.
RefSeqNP_006240.4. NM_006249.4.
UniGeneHs.73031.

3D structure databases

ModBaseSearch...

Protein-protein interaction databases

STRINGQ04118.

Polymorphism databases

DMDM229462763.

Proteomic databases

PRIDEQ04118.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000381842; ENSP00000371264; ENSG00000197870.
GeneID5544.
KEGGhsa:5544.

Organism-specific databases

CTD5544.
GeneCardsGC12M011418.
H-InvDBHIX0201930.
HGNCHGNC:9339. PRB3.
MIM168840. gene.
neXtProtNX_Q04118.
PharmGKBPA33701.
GenAtlasSearch...

Phylogenomic databases

GeneTreeENSGT00540000070914.

Gene expression databases

ArrayExpressQ04118.
BgeeQ04118.
CleanExHS_PRB3.
GenevestigatorQ04118.
GermOnlineENSG00000197870. Homo sapiens.

Family and domain databases

ProtoNetSearch...

Other

SOURCESearch...

Entry information

Entry namePRB3_HUMAN
AccessionPrimary (citable) accession number: Q04118
Secondary accession number(s): Q15188 expand/collapse secondary AC list , Q4VAY3, Q4VAY4, Q7M4M9, Q9UCT9
Entry history
Integrated into UniProtKB/Swiss-Prot: September 13, 2004
Last sequence update: May 5, 2009
Last modified: January 25, 2012
This is version 80 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 12

Human chromosome 12: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot