Reviewed,
UniProtKB/Swiss-Prot P98073 (ENTK_HUMAN)
Last modified
January 19, 2010.
Version 112.
History...
Clusters with 100%,
90%,
50% identity |
Documents (6) |
Third-party data |
Customize display | text xml rdf/xml gff fasta |
Names and origin
| Protein names | Recommended name: Enteropeptidase EC=3.4.21.9 Alternative name(s): Enterokinase Serine protease 7 Cleaved into the following 2 chains: 1- Recommended name: Enteropeptidase non-catalytic heavy chain 2- Recommended name: Enteropeptidase catalytic light chain | ||||
| Gene names |
| ||||
| Organism | Homo sapiens (Human) [Complete proteome] | ||||
| Taxonomic identifier | 9606 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo |
Protein attributes
| Sequence length | 1019 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at transcript level. |
General annotation (Comments)
| Function | Responsible for initiating activation of pancreatic proteolytic proenzymes (trypsin, chymotrypsin and carboxypeptidase A). It catalyzes the conversion of trypsinogen to trypsin which in turn activates other proenzymes including chymotrypsinogen, procarboxypeptidases, and proelastases. |
| Catalytic activity | Activation of trypsinogen by selective cleavage of 6-Lys-|-Ile-7 bond. |
| Subunit structure | Heterodimer of a catalytic (light) chain and a multidomain (heavy) chain linked by a disulfide bond. |
| Subcellular location | Membrane; Single-pass type II membrane protein Probable. |
| Tissue specificity | Intestinal brush border. |
| Post-translational modification | The chains are derived from a single precursor that is cleaved by a trypsin-like protease. |
| Involvement in disease | Defects in PRSS7 are a cause of enterokinase deficiency [MIM:226200]; a life-threatening intestinal malabsorption disorder characterized by diarrhea and failure to thrive. Ref.2 |
| Sequence similarities | Belongs to the peptidase S1 family. Contains 2 CUB domains. Contains 2 LDL-receptor class A domains. Contains 1 MAM domain. Contains 1 peptidase S1 domain. Contains 1 SEA domain. Contains 1 SRCR domain. |
| Sequence caution | The sequence CAB90389.1 differs from that shown. Reason: Erroneous gene model prediction. |
Ontologies
| Keywords | |
|---|---|
| Cellular component | Membrane |
| Coding sequence diversity | Polymorphism |
| Domain | Repeat Signal-anchor Transmembrane |
| Molecular function | Hydrolase Protease Serine protease |
| PTM | Disulfide bond Glycoprotein Lipoprotein Myristate Zymogen |
| Technical term | Complete proteome |
| Gene Ontology (GO) | |
| Biological process | proteolysis Inferred from electronic annotation. Source: InterPro |
| Cellular component | brush border Ref.5 Traceable author statement. Source: ProtInc integral to membraneInferred from electronic annotation. Source: UniProtKB-SubCell |
| Molecular function | scavenger receptor activity Inferred from electronic annotation. Source: InterPro serine-type endopeptidase activityInferred from electronic annotation. Source: InterPro |
| Complete GO annotation... | |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 784 | 784 | Enteropeptidase non-catalytic heavy chain | PRO_0000027719 | |||||||
| Chain | 785 – 1019 | 235 | Enteropeptidase catalytic light chain | PRO_0000027720 | |||||||
Regions | |||||||||||
| Topological domain | 1 – 18 | 18 | Cytoplasmic Potential | ||||||||
| Transmembrane | 19 – 47 | 29 | Signal-anchor for type II membrane protein Potential | ||||||||
| Topological domain | 48 – 1019 | 972 | Extracellular Potential | ||||||||
| Domain | 52 – 169 | 118 | SEA | ||||||||
| Domain | 182 – 223 | 42 | LDL-receptor class A 1 | ||||||||
| Domain | 225 – 334 | 110 | CUB 1 | ||||||||
| Domain | 342 – 504 | 163 | MAM | ||||||||
| Domain | 524 – 634 | 111 | CUB 2 | ||||||||
| Domain | 641 – 679 | 39 | LDL-receptor class A 2 | ||||||||
| Domain | 678 – 771 | 94 | SRCR | ||||||||
| Domain | 785 – 1019 | 235 | Peptidase S1 | ||||||||
Sites | |||||||||||
| Active site | 825 | 1 | Charge relay system By similarity | ||||||||
| Active site | 876 | 1 | Charge relay system By similarity | ||||||||
| Active site | 971 | 1 | Charge relay system By similarity | ||||||||
Amino acid modifications | |||||||||||
| Lipidation | 2 | 1 | N-myristoyl glycine Potential | ||||||||
| Glycosylation | 116 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 147 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 179 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 328 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 335 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 388 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 440 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 470 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 503 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 534 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 630 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 682 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 706 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 725 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 848 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 887 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 909 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 949 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Disulfide bond | 184 ↔ 197 | By similarity | |||||||||
| Disulfide bond | 191 ↔ 210 | By similarity | |||||||||
| Disulfide bond | 204 ↔ 221 | By similarity | |||||||||
| Disulfide bond | 225 ↔ 253 | By similarity | |||||||||
| Disulfide bond | 524 ↔ 552 | By similarity | |||||||||
| Disulfide bond | 643 ↔ 655 | By similarity | |||||||||
| Disulfide bond | 650 ↔ 668 | By similarity | |||||||||
| Disulfide bond | 662 ↔ 677 | By similarity | |||||||||
| Disulfide bond | 757 ↔ 767 | By similarity | |||||||||
| Disulfide bond | 772 ↔ 896 | Interchain (between heavy and light chains) By similarity | |||||||||
| Disulfide bond | 810 ↔ 826 | By similarity | |||||||||
| Disulfide bond | 910 ↔ 977 | By similarity | |||||||||
| Disulfide bond | 941 ↔ 956 | By similarity | |||||||||
| Disulfide bond | 967 ↔ 995 | By similarity | |||||||||
Natural variations | |||||||||||
| Natural variant | 65 | 1 | T → I: dbSNP rs35987974. | VAR_031686 | |||||||
| Natural variant | 77 | 1 | K → R: dbSNP rs2824804. | VAR_021940 | |||||||
| Natural variant | 134 | 1 | E → Q: dbSNP rs2824790. Ref.2 Ref.1 Ref.4 | VAR_031687 | |||||||
| Natural variant | 545 | 1 | S → C: dbSNP rs8134187. | VAR_031688 | |||||||
| Natural variant | 641 | 1 | E → K: dbSNP rs2273204. | VAR_020175 | |||||||
| Natural variant | 660 | 1 | N → H: dbSNP rs11088674. | VAR_024292 | |||||||
| Natural variant | 732 | 1 | S → P: dbSNP rs2824721. Ref.3 | VAR_031689 | |||||||
| Natural variant | 828 | 1 | Y → C: dbSNP rs8130110. | VAR_031690 | |||||||
Sequences
| ||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "cDNA sequence and chromosomal localization of human enterokinase, the proteolytic activator of trypsinogen." Kitamoto Y., Veile R.A., Donis-Keller H., Sadler J.E. Biochemistry 34:4562-4568(1995) [PubMed: 7718557] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA], VARIANT GLN-134. Tissue: Duodenum. |
| [2] | "Mutations in the proenteropeptidase gene are the molecular cause of congenital enteropeptidase deficiency." Holzinger A., Maier E.M., Buck C., Mayerhofer P.U., Kappler M., Haworth J.C., Moroz S.P., Hadorn H.-B., Sadler J.E., Roscher A.A. Am. J. Hum. Genet. 70:20-25(2002) [PubMed: 11719902] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANT GLN-134, INVOLVEMENT IN ENTEROKINASE DEFICIENCY. |
| [3] | "The DNA sequence of human chromosome 21." Hattori M., Fujiyama A., Taylor T.D., Watanabe H., Yada T., Park H.-S., Toyoda A., Ishii K., Totoki Y., Choi D.-K., Groner Y., Soeda E., Ohki M., Takagi T., Sakaki Y., Taudien S., Blechschmidt K., Polley A. Yaspo M.-L.Nature 405:311-319(2000) [PubMed: 10830953] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], VARIANT PRO-732. |
| [4] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA], VARIANT GLN-134. |
| [5] | "Enterokinase, the initiator of intestinal digestion, is a mosaic protease composed of a distinctive assortment of domains." Kitamoto Y., Yuan X., Wu Q., McCourt D.W., Sadler J.E. Proc. Natl. Acad. Sci. U.S.A. 91:7588-7592(1994) [PubMed: 8052624] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 749-1019. Tissue: Duodenum. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | U09860 mRNA. Translation: AAC50138.1. Y19124 Y19143 Genomic DNA. Translation: CAB65555.1. AL163217 Genomic DNA. Translation: CAB90389.1. Sequence problems. AL163218 Genomic DNA. Translation: CAB90392.1. BC111749 mRNA. Translation: AAI11750.1. |
| IPI | IPI00023788. |
| PIR | A56318. |
| RefSeq | NP_002763.2. |
| UniGene | Hs.149473 |
3D structure databases | |
| ModBase | Search... |
Protein-protein interaction databases | |
| STRING | P98073. |
Protein family/group databases | |
| MEROPS | S01.156. |
Genome annotation databases | |
| Ensembl | ENST00000284885; ENSP00000284885; ENSG00000154646; Homo sapiens. [Genome view] |
| GeneID | 5651. |
| KEGG | hsa:5651. |
| UCSC | uc002ykw.1. human. |
Organism-specific databases | |
| CTD | 5651. |
| GeneCards | GC21M018563. |
| H-InvDB | HIX0040924. |
| HGNC | HGNC:9490. PRSS7. |
| HPA | HPA015611. |
| MIM | 226200. phenotype. 606635. gene. |
| Orphanet | 168601. Congenital enteropathy due to enteropeptidase deficiency. |
| PharmGKB | PA33839. |
| GenAtlas | Search... |
Phylogenomic databases | |
| eggNOG | prNOG08056. |
| HOGENOM | HBG445315. |
| HOVERGEN | P98073. |
| InParanoid | P98073. |
Enzyme and pathway databases | |
| BRENDA | 3.4.21.9. 247. |
Gene expression databases | |
| ArrayExpress | P98073. |
| Bgee | P98073. |
| CleanEx | HS_PRSS7. |
| Genevestigator | P98073. |
| GermOnline | ENSG00000154646. Homo sapiens. |
Family and domain databases | |
| InterPro | IPR000859. CUB. IPR002172. LDL_rcpt_classA_cys-rich_rpt. IPR000998. MAM. IPR018114. Peptidase_S1/S6_AS. IPR001254. Peptidase_S1_S6. IPR001314. Peptidase_S1A. IPR000082. SEA. IPR009003. Ser/Cys_Pept_Trypsin-like. IPR001190. Srcr_rcpt. IPR017448. Srcr_rcpt-rel. [Graphical view] |
| Gene3D | G3DSA:2.60.120.290. CUB. 2 hits. G3DSA:4.10.400.10. LDL_rcpt_classA_cys-rich. 1 hit. |
| Pfam | PF00431. CUB. 2 hits. PF00057. Ldl_recept_a. 2 hits. PF00629. MAM. 1 hit. PF01390. SEA. 1 hit. PF00530. SRCR. 1 hit. PF00089. Trypsin. 1 hit. [Graphical view] |
| PRINTS | PR00722. CHYMOTRYPSIN. PR00261. LDLRECEPTOR. |
| SMART | SM00042. CUB. 2 hits. SM00192. LDLa. 2 hits. SM00137. MAM. 1 hit. SM00200. SEA. 1 hit. SM00202. SR. 1 hit. SM00020. Tryp_SPc. 1 hit. [Graphical view] |
| PROSITE | PS01180. CUB. 2 hits. PS01209. LDLRA_1. 2 hits. PS50068. LDLRA_2. 2 hits. PS00740. MAM_1. 1 hit. PS50060. MAM_2. 1 hit. PS50024. SEA. 1 hit. PS00420. SRCR_1. False negative. PS50287. SRCR_2. 1 hit. PS50240. TRYPSIN_DOM. 1 hit. PS00134. TRYPSIN_HIS. 1 hit. PS00135. TRYPSIN_SER. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Other Resources | |
| NextBio | 21958. |
| SOURCE | Search... |
Entry information
| Entry name | ENTK_HUMAN | ||||||||
| Accession | Primary (citable) accession number: P98073 Secondary accession number(s): Q2NKL7 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation project | HPI (Human Proteome Initiative) | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Human chromosome 21 Human chromosome 21: entries, gene names and cross-references to MIM |
| Human entries with polymorphisms or disease mutations List of human entries with polymorphisms or disease mutations |
| Human polymorphisms and disease mutations Index of human polymorphisms and disease mutations |
| MIM cross-references Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot |
| Peptidase families Classification of peptidase families and list of entries |
| SIMILARITY comments Index of protein domains and families |

Clusters with


