Q14508 (WFDC2_HUMAN) Reviewed, UniProtKB/Swiss-Prot
Last modified May 1, 2013. Version 125.
Names and origin
| Protein names | Recommended name: WAP four-disulfide core domain protein 2 Alternative name(s): Epididymal secretory protein E4 Major epididymis-specific protein E4 Putative protease inhibitor WAP5 | ||||
| Gene names | Name: WFDC2 | ||||
| Organism | Homo sapiens (Human) [Reference proteome] | ||||
| Taxonomic identifier | 9606 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo |
Protein attributes
| Sequence length | 124 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Subcellular location | |
| Tissue specificity | Expressed in a number of normal tissues, including the male reproductive system, regions of the respiratory tract and the nasopharynx. Highly expressed in a number of tumor cell lines, such as ovarian, colon, breast, lung and renal cell lines. Initially described as being exclusively transcribed in the epididymis. Ref.8 |
| Sequence similarities | Contains 2 WAP domains. |
Ontologies
| Keywords | |
|---|---|
| Cellular component | Secreted |
| Coding sequence diversity | Alternative splicing |
| Domain | Repeat Signal |
| Molecular function | Protease inhibitor Serine protease inhibitor |
| PTM | Disulfide bond Glycoprotein |
| Technical term | Complete proteome Reference proteome |
| Gene Ontology (GO) | |
| Biological_process | proteolysis (Traceable author statement Ref.1. Source: ProtInc); spermatogenesis (Traceable author statement Ref.1. Source: ProtInc) |
| Cellular_component | extracellular space (Traceable author statement Ref.1. Source: ProtInc) |
| Molecular_function | endopeptidase inhibitor activity (Traceable author statement Ref.1. Source: ProtInc); serine-type endopeptidase inhibitor activity (Inferred from electronic annotation. Source: UniProtKB-KW) |
Alternative products
| This entry describes 5 isoforms produced by alternative splicing. Note: Additional isoforms seem to exist. | ||||||
| Isoform 1 (identifier: Q14508-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Isoform 2 (identifier: Q14508-2) Also known as: HE4-V3; The sequence of this isoform differs from the canonical sequence as follows: 2-23: PACRLGPLAAALLLSLLLFGFT → LQVQVNLPVSPLPTYPYSFFYP 24-74: Missing. | ||||||
| Isoform 3 (identifier: Q14508-3) Also known as: HE4-V2; The sequence of this isoform differs from the canonical sequence as follows: 27-74: Missing. | ||||||
| Isoform 4 (identifier: Q14508-4) Also known as: HE4-V1; The sequence of this isoform differs from the canonical sequence as follows: 71-79: SLPNDKEGS → LLCPNGQLAE 80-124: Missing. | ||||||
| Isoform 5 (identifier: Q14508-5) Also known as: HE4-V4; The sequence of this isoform differs from the canonical sequence as follows: 75-102: DKEGSCPQVNINFPQLGLCRDQCQVDSQ → ALFHWHLKTRRLWEISGPRPRRPTWDSS 103-124: Missing. |
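The isoform descriptions above use 1-based, inclusive positions on the canonical sequence, with "Missing" meaning the range is deleted. A minimal Python sketch of how such edits could be applied is given here. The function names `apply_edits` and `isoform_length` are hypothetical, the ten-letter string is a toy input rather than the WFDC2 sequence, and the edits are assumed not to overlap. The coordinate table is transcribed from the isoform rows of this entry.

```python
from typing import Dict, List, Tuple

# (start, end, replacement): 1-based inclusive positions on the canonical
# sequence; an empty replacement means the range is "Missing".
Edit = Tuple[int, int, str]


def apply_edits(canonical: str, edits: List[Edit]) -> str:
    """Return the sequence obtained by applying non-overlapping edits."""
    seq = canonical
    # Work from the highest position down so lower coordinates stay valid.
    for start, end, replacement in sorted(edits, reverse=True):
        seq = seq[:start - 1] + replacement + seq[end:]
    return seq


def isoform_length(canonical_length: int, edits: List[Edit]) -> int:
    """Isoform length computed from the edit coordinates alone."""
    delta = sum(len(rep) - (end - start + 1) for start, end, rep in edits)
    return canonical_length + delta


# Edits transcribed from the isoform descriptions in this entry.
ISOFORM_EDITS: Dict[str, List[Edit]] = {
    "Q14508-2": [(2, 23, "LQVQVNLPVSPLPTYPYSFFYP"), (24, 74, "")],
    "Q14508-3": [(27, 74, "")],
    "Q14508-4": [(71, 79, "LLCPNGQLAE"), (80, 124, "")],
    "Q14508-5": [(75, 102, "ALFHWHLKTRRLWEISGPRPRRPTWDSS"), (103, 124, "")],
}

# Hypothetical toy sequence (NOT the WFDC2 sequence), used only to show
# the coordinate handling: replace 2-3 with "XYZ", delete 8-10.
toy = apply_edits("ABCDEFGHIJ", [(2, 3, "XYZ"), (8, 10, "")])

# Canonical length of 124 AA is taken from the "Sequence length" row.
lengths = {iso: isoform_length(124, e) for iso, e in ISOFORM_EDITS.items()}

if __name__ == "__main__":
    print(toy)
    for iso, n in lengths.items():
        print(iso, n)
```

Applying edits from the highest start position downward avoids recomputing offsets after each insertion or deletion. To obtain real isoform sequences, `apply_edits` would need the canonical sequence from a downloadable version of the entry.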
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Feature identifier |
|---|---|---|---|---|
| Molecule processing | | | | |
| Signal peptide | 1 – 30 | 30 | Potential | |
| Chain | 31 – 124 | 94 | WAP four-disulfide core domain protein 2 | PRO_0000041370 |
| Regions | | | | |
| Domain | 31 – 73 | 43 | WAP 1 | |
| Domain | 74 – 123 | 50 | WAP 2 | |
| Amino acid modifications | | | | |
| Glycosylation | 44 | 1 | N-linked (GlcNAc...) Ref.9 | |
| Disulfide bond | 36 ↔ 62 | | By similarity | |
| Disulfide bond | 45 ↔ 66 | | By similarity | |
| Disulfide bond | 49 ↔ 61 | | By similarity | |
| Disulfide bond | 55 ↔ 70 | | By similarity | |
| Disulfide bond | 80 ↔ 110 | | By similarity | |
| Disulfide bond | 93 ↔ 114 | | By similarity | |
| Disulfide bond | 97 ↔ 109 | | By similarity | |
| Disulfide bond | 103 ↔ 119 | | By similarity | |
| Natural variations | | | | |
| Alternative sequence | 2 – 23 | 22 | PACRL…LFGFT → LQVQVNLPVSPLPTYPYSFFYP in isoform 2. | VSP_007666 |
| Alternative sequence | 24 – 74 | 51 | Missing in isoform 2. | VSP_007667 |
| Alternative sequence | 27 – 74 | 48 | Missing in isoform 3. | VSP_007668 |
| Alternative sequence | 71 – 79 | 9 | SLPNDKEGS → LLCPNGQLAE in isoform 4. | VSP_007669 |
| Alternative sequence | 75 – 102 | 28 | DKEGS…QVDSQ → ALFHWHLKTRRLWEISGPRPRRPTWDSS in isoform 5. | VSP_007670 |
| Alternative sequence | 80 – 124 | 45 | Missing in isoform 4. | VSP_007671 |
| Alternative sequence | 103 – 124 | 22 | Missing in isoform 5. | VSP_007672 |
| Experimental info | | | | |
| Sequence conflict | 71 – 72 | 2 | SL → LLC in CAA44869. Ref.1 | |
| Sequence conflict | 71 – 72 | 2 | SL → LLC in AAL37485. Ref.2 | |
| Sequence conflict | 101 | 1 | S → T in CAA44869. Ref.1 | |
References
| [1] | "A major human epididymis-specific cDNA encodes a protein with sequence homology to extracellular proteinase inhibitors." Kirchhoff C., Habben L., Ivell R., Krull N. Biol. Reprod. 45:350-357(1991). Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1). Tissue: Epididymis. |
| [2] | "The putative ovarian tumour marker gene HE4 (WFDC2), is expressed in normal tissues and undergoes complex alternative splicing to yield multiple protein isoforms." Bingle L., Singleton V., Bingle C.D. Oncogene 21:2768-2773(2002). Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1; 2; 3; 4 AND 5). |
| [3] | "The HE4 (WFDC2) protein is a biomarker for ovarian carcinoma." Hellstrom I., Raycraft J., Hayden-Ledbetter M., Ledbetter J.A., Schummer M., McIntosh M., Drescher C., Urban N., Hellstrom K.E. Cancer Res. 63:3695-3700(2003). Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1). |
| [4] | "Cloning of human full open reading frames in Gateway(TM) system entry vector (pDONR201)." Ebert L., Schick M., Neubert P., Schatten R., Henze S., Korn B. Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). |
| [5] | "The DNA sequence and comparative analysis of human chromosome 20." Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., Beasley O.P., Bird C.P., Blakey S.E. Rogers J. Nature 414:865-871(2001). Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. |
| [6] | Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., Turner R. Venter J.C. Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. |
| [7] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team. Genome Res. 14:2121-2127(2004). Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). Tissue: Colon. |
| [8] | "Human epididymis protein 4 (HE4) is a secreted glycoprotein that is overexpressed by serous and endometrioid ovarian carcinomas." Drapkin R., von Horsten H.H., Lin Y., Mok S.C., Crum C.P., Welch W.R., Hecht J.L. Cancer Res. 65:2162-2169(2005). Cited for: SUBCELLULAR LOCATION, TISSUE SPECIFICITY. |
| [9] | "Identification of N-linked glycoproteins in human saliva by glycoprotein capture and mass spectrometry." Ramachandran P., Boontheung P., Xie Y., Sondej M., Wong D.T., Loo J.A. J. Proteome Res. 5:1493-1503(2006). Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-44, MASS SPECTROMETRY. Tissue: Saliva. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | X63187 mRNA. Translation: CAA44869.1. AF330259 mRNA. Translation: AAL37485.1. AF330260 mRNA. Translation: AAL37486.1. AF330261 mRNA. Translation: AAL37487.1. AF330262 mRNA. Translation: AAL37488.1. AY212888 mRNA. Translation: AAO52683.1. CR456977 mRNA. Translation: CAG33258.1. AL031663 Genomic DNA. Translation: CAB37641.1. AL031663 Genomic DNA. Translation: CAM28246.1. AL031663 Genomic DNA. Translation: CAM28247.1. AL031663 Genomic DNA. Translation: CAO03535.1. CH471077 Genomic DNA. Translation: EAW75836.1. CH471077 Genomic DNA. Translation: EAW75837.1. CH471077 Genomic DNA. Translation: EAW75839.1. BC046106 mRNA. Translation: AAH46106.1. |
| IPI | IPI00103633. IPI00103636. IPI00103639. IPI00183629. IPI00291488. |
| PIR | S25454. |
| RefSeq | NP_006094.3. NM_006103.3. |
| UniGene | Hs.2719. |
3D structure databases | |
| ProteinModelPortal | Q14508. |
Protein-protein interaction databases | |
| IntAct | Q14508. 2 interactions. |
| MINT | MINT-1429295. |
Polymorphism databases | |
| DMDM | 20141958. |
Proteomic databases | |
| PaxDb | Q14508. |
| PeptideAtlas | Q14508. |
| PRIDE | Q14508. |
Protocols and materials databases | |
| DNASU | 10406. |
Genome annotation databases | |
| Ensembl | ENST00000217425; ENSP00000217425; ENSG00000101443. ENST00000339946; ENSP00000340215; ENSG00000101443. ENST00000342873; ENSP00000342890; ENSG00000101443. ENST00000372676; ENSP00000361761; ENSG00000101443. |
| GeneID | 10406. |
| KEGG | hsa:10406. |
| UCSC | uc002xoo.3. human. uc002xop.3. human. uc002xor.3. human. |
Organism-specific databases | |
| CTD | 10406. |
| GeneCards | GC20P044098. |
| HGNC | HGNC:15939. WFDC2. |
| neXtProt | NX_Q14508. |
| PharmGKB | PA38059. |
Phylogenomic databases | |
| eggNOG | NOG27860. |
| HOVERGEN | HBG018073. |
| InParanoid | Q14508. |
| OMA | NEKQGSC. |
| OrthoDB | EOG454913. |
| PhylomeDB | Q14508. |
Gene expression databases | |
| ArrayExpress | Q14508. |
| Bgee | Q14508. |
| CleanEx | HS_WFDC2. |
| Genevestigator | Q14508. |
| GermOnline | ENSG00000101443. Homo sapiens. |
Family and domain databases | |
| Gene3D | 4.10.75.10. 2 hits. |
| InterPro | IPR008197. WAP-type_4-diS_core. |
| Pfam | PF00095. WAP. 2 hits. |
| PRINTS | PR00003. 4DISULPHCORE. |
| SMART | SM00217. WAP. 2 hits. |
| SUPFAM | SSF57256. WAP. 2 hits. |
| PROSITE | PS51390. WAP. 2 hits. |
Other | |
| GenomeRNAi | 10406. |
| NextBio | 39431. |
Entry information
| Entry name | WFDC2_HUMAN | ||||||||
| Accession | Primary (citable) accession number: Q14508 Secondary accession number(s): A2A2A5 Q96KJ1 | ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Human chromosome 20 | Human chromosome 20: entries, gene names and cross-references to MIM |
| SIMILARITY comments | Index of protein domains and families |

Clusters with
