Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P00790 (PEPA_HUMAN)

Last modified July 7, 2009. Version 117. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (7) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Pepsin A
    EC=3.4.23.1
Gene names
Name: PGA3
AND
Name: PGA4
AND
Name: PGA5
OrganismHomo sapiens (Human) [Complete proteome]
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length388 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Shows particularly broad specificity; although bonds involving phenylalanine and leucine are preferred, many others are also cleaved to some extent.

Catalytic activity

Preferential cleavage: hydrophobic, preferably aromatic, residues in P1 and P1' positions. Cleaves 1-Phe-|-Val-2, 4-Gln-|-His-5, 13-Glu-|-Ala-14, 14-Ala-|-Leu-15, 15-Leu-|-Tyr-16, 16-Tyr-|-Leu-17, 23-Gly-|-Phe-24, 24-Phe-|-Phe-25 and 25-Phe-|-Tyr-26 bonds in the B chain of insulin.

Subcellular location

Secreted.

Sequence similarities

Belongs to the peptidase A1 family.

Ontologies

Keywords
   Biological processDigestion
   Cellular componentSecreted
   Coding sequence diversityPolymorphism
   DomainSignal
   Molecular functionAspartyl protease
Hydrolase
Protease
   PTMDisulfide bond
Phosphoprotein
Zymogen
   Technical term3D-structure
Complete proteome
Direct protein sequencing
Gene Ontology (GO)
   Biological processdigestion

Non-traceable author statement. Source: UniProtKB

proteolysis

Non-traceable author statement. Source: UniProtKB

   Cellular componentextracellular region

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionaspartic-type endopeptidase activity Ref.1

Traceable author statement. Source: UniProtKB

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1515 Ref.6 Ref.7 Ref.5
Propeptide16 – 6247Activation peptide
PRO_0000026013
Chain63 – 388326Pepsin A
PRO_0000026014

Sites

Active site941
Active site2771

Amino acid modifications

Modified residue1301Phosphoserine By similarity
Disulfide bond107 ↔ 112
Disulfide bond268 ↔ 272
Disulfide bond311 ↔ 344

Natural variations

Natural variant281L → F in isozyme 5.
VAR_006481
Natural variant581E → K in isozyme 3A, isozyme 4 and isozyme 5.
VAR_006482
Natural variant921V → L in isozyme 4 and isozyme 5.
VAR_006483
Natural variant2221Q → K Ref.4
VAR_006484
Natural variant2651A → T Ref.4
VAR_006485
Natural variant3531L → V: dbSNP rs17595. Ref.4
VAR_006486
Natural variant3761D → E
VAR_006487

Experimental info

Sequence conflict2911A → T in BAF84553. Ref.3
Sequence conflict371 – 3722YF → FY AA sequence Ref.8

Secondary structure

........................................................................ 388
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P00790-1 [UniParc].

Last modified July 21, 1986. Version 1.
Checksum: C9CB89BA08F4D78B

FASTA38841,977
        10         20         30         40         50         60 
MKWLLLLGLV ALSECIMYKV PLIRKKSLRR TLSERGLLKD FLKKHNLNPA RKYFPQWEAP 

        70         80         90        100        110        120 
TLVDEQPLEN YLDMEYFGTI GIGTPAQDFT VVFDTGSSNL WVPSVYCSSL ACTNHNRFNP 

       130        140        150        160        170        180 
EDSSTYQSTS ETVSITYGTG SMTGILGYDT VQVGGISDTN QIFGLSETEP GSFLYYAPFD 

       190        200        210        220        230        240 
GILGLAYPSI SSSGATPVFD NIWNQGLVSQ DLFSVYLSAD DQSGSVVIFG GIDSSYYTGS 

       250        260        270        280        290        300 
LNWVPVTVEG YWQITVDSIT MNGEAIACAE GCQAIVDTGT SLLTGPTSPI ANIQSDIGAS 

       310        320        330        340        350        360 
ENSDGDMVVS CSAISSLPDI VFTINGVQYP VPPSAYILQS EGSCISGFQG MNLPTESGEL 

       370        380 
WILGDVFIRQ YFTVFDRANN QVGLAPVA 

« Hide

References

« Hide 'large scale' references
[1]"Primary structure of human pepsinogen gene."
Sogawa K., Fujii-Kuriyama Y., Mizukami Y., Ichihara Y., Takahashi K.
J. Biol. Chem. 258:5306-5311(1983) [PubMed: 6300126] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[2]"Nucleotide sequence comparison of five human pepsinogen A (PGA) genes: evolution of the PGA multigene family."
Evers M.P.J., Zelle B., Bebelman J.-P., van Beusechem V., Kraakman L., Hoffer M.J.V., Pronk J.C., Mager W.H., Planta R.J., Eriksson A.W., Frants R.R.
Genomics 4:232-239(1989) [PubMed: 2714789] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] (ISOZYME 5).
Tissue: Placenta.
[3]"Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. expand/collapse author list , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004) [PubMed: 14702039] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Skeletal muscle.
[4]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA], VARIANTS LYS-222; THR-265 AND VAL-353.
Tissue: Colon and Kidney.
[5]"Isolation of human, swine, and rat prepepsinogens and calf preprochymosin, and determination of the primary structures of their NH2-terminal signal sequences."
Ichihara Y., Sogawa K., Takahashi K.
J. Biochem. 98:483-492(1985) [PubMed: 2415509] [Abstract]
Cited for: PARTIAL PROTEIN SEQUENCE OF 1-28.
[6]"A comparative study on the NH2-terminal amino acid sequences and some other properties of six isozymic forms of human pepsinogens and pepsins."
Athauda S.B.P., Tanji M., Kageyama T., Takahashi K.
J. Biochem. 106:920-927(1989) [PubMed: 2515193] [Abstract]
Cited for: PROTEIN SEQUENCE OF 16-100 (ISOZYMES 2; 3; 3A; 4 AND 5).
[7]"Activation of human pepsinogens."
Foltmann B.
FEBS Lett. 241:69-72(1988) [PubMed: 3197840] [Abstract]
Cited for: PROTEIN SEQUENCE OF 16-68 (ISOZYMES 3 AND 5).
[8]"Carboxyl-terminal sequence of human gastricsin and pepsin."
Huang W.-Y., Tang J.
J. Biol. Chem. 245:2189-2193(1970) [PubMed: 4909888] [Abstract]
Cited for: PROTEIN SEQUENCE OF 362-388.
[9]"Crystal structure of human pepsin and its complex with pepstatin."
Fujinaga M., Chernaia M.M., Tarasova N.I., Mosimann S.C., James M.N.G.
Protein Sci. 4:960-972(1995) [PubMed: 7663352] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.2 ANGSTROMS) OF 63-388.
+Additional computationally mapped references.

Cross-references

Sequence databases

J00287 expand/collapse EMBL AC list , J00279, J00280, J00281, J00282, J00283, J00284, J00285, J00286 Genomic DNA. Translation: AAA98529.1.
M26032 expand/collapse EMBL AC list , M26025, M26026, M26027, M26028, M26029, M26030, M26031 Genomic DNA. Translation: AAA60061.1.
AK291864 mRNA. Translation: BAF84553.1.
BC029055 mRNA. Translation: AAH29055.1.
IPIIPI00019641.
PIRPEHU. A00980.
A30142.
A92058.
B30142.
RefSeqNP_001073276.1.
NP_055039.1.
UniGeneHs.432854
Hs.601055
Hs.661883

3D structure databases

EntryMethodResolution (Å)ChainPositionsPDBsum
1FLHX-ray2.45A63-388[»]
1PSNX-ray2.20A63-388[»]
1PSOX-ray2.00E63-388[»]
1QRPX-ray1.96E63-388[»]
ModBaseSearch...

Protein family/group databases

MEROPSA01.001.
A01.070.
A01.071.

Proteomic databases

PRIDEP00790.

Genome annotation databases

EnsemblENSG00000204973. Homo sapiens. [Contig view]
GeneID5222.
643847.
KEGGhsa:5222.
hsa:643847.
UCSCuc001nqy.1. human.

Organism-specific databases

GeneCardsGC11P060715.
GC11P060739.
GC11P060767.
H-InvDBHIX0023228.
HIX0079380.
HGNCHGNC:8885. PGA3.
HGNC:8886. PGA4.
HGNC:8887. PGA5.
MIM169700. gene.
169710. gene.
169720. gene.
169730. gene.
PharmGKBPA33222.
GenAtlasSearch...

Phylogenomic databases

HOGENOMP00790.
HOVERGENP00790.
OMAP00790. MVISCSA.

Enzyme and pathway databases

BRENDA3.4.23.1. 247.

Gene expression databases

BgeeP00790.
CleanExHS_PGA3.
HS_PGA4.
HS_PGA5.
GermOnlineENSG00000204973. Homo sapiens.

Family and domain databases

InterProIPR001461. Peptidase_A1.
IPR001969. Peptidase_aspartic_AS.
IPR009007. Peptidase_aspartic_catalytic.
IPR012848. Propep_A1.
[Graphical view]
Gene3DG3DSA:2.40.70.10. Pept_Aspartc_cat. 2 hits.
PANTHERPTHR13683. Peptidase_A1. 1 hit.
PfamPF07966. A1_Propeptide. 1 hit.
PF00026. Asp. 1 hit.
[Graphical view]
PRINTSPR00792. PEPSIN.
PROSITEPS00141. ASP_PROTEASE. 2 hits.
[Graphical view]
ProtoNetSearch...

Other Resources

BindingDBP00790.
NextBio20188.
SOURCESearch...

Entry information

Entry namePEPA_HUMAN
AccessionPrimary (citable) accession number: P00790
Secondary accession number(s): A8K749, Q7M4R0, Q8N1E3
Entry history
Integrated into UniProtKB/Swiss-Prot: July 21, 1986
Last sequence update: July 21, 1986
Last modified: July 7, 2009
This is version 117 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

Human chromosome 11

Human chromosome 11: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

Peptidase families

Classification of peptidase families and list of entries

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents