ID   WFD2_HUMAN     STANDARD;      PRT;   124 AA.
AC   Q14508; Q8WXV9; Q8WXW0; Q8WXW1; Q8WXW2; Q96KJ1;
DT   15-JUL-1998 (Rel. 36, Created)
DT   28-FEB-2003 (Rel. 41, Last sequence update)
DT   15-JUN-2004 (Rel. 44, Last annotation update)
DE   WAP four-disulfide core domain protein 2 precursor (Major epididymis-
DE   specific protein E4) (Epididymal secretory protein E4) (Putative
DE   protease inhibitor WAP5).
GN   WFDC2 OR HE4 OR WAP5.
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   SEQUENCE FROM N.A. (ISOFORM 1).
RC   TISSUE=Epididymis;
RX   MEDLINE=92153963; PubMed=1686187;
RA   Kirchhoff C., Habben L., Ivell R., Krull N.;
RT   "A major human epididymis-specific cDNA encodes a protein with
RT   sequence homology to extracellular proteinase inhibitors.";
RL   Biol. Reprod. 45:350-357(1991).
RN   [2]
RP   SEQUENCE FROM N.A. (ISOFORMS 1; 2; 3; 4 AND 5).
RX   MEDLINE=21962329; PubMed=11965550; DOI=10.1038/sj/onc/1205363;
RA   Bingle L., Singleton V., Bingle C.D.;
RT   "The putative ovarian tumour marker gene HE4 (WFDC2), is expressed in
RT   normal tissues and undergoes complex alternative splicing to yield
RT   multiple protein isoforms.";
RL   Oncogene 21:2768-2773(2002).
RN   [3]
RP   SEQUENCE FROM N.A. (ISOFORM 1).
RA   Hellstrom I., Raycraft J., Ledbetter M., Ledbetter J.A., Schummer M.,
RA   McIntosh M., Drescher C., Urban N., Hellstrom K.E.;
RT   "The HE4 (WFDC2) protein is a biomarker for ovarian carcinoma.";
RL   Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases.
RN   [4]
RP   SEQUENCE FROM N.A.
RX   MEDLINE=21638749; PubMed=11780052; DOI=10.1038/414865a;
RA   Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R.,
RA   Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L.,
RA   Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M.,
RA   Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M., Brown A.J.,
RA   Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P.,
RA   Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M.,
RA   Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R.,
RA   Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M.,
RA   Ellington A.G., Frankland J.A., Fraser A., French L., Garner P.,
RA   Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E.,
RA   Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J.,
RA   Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D.,
RA   Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S.,
RA   Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D.,
RA   Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A.,
RA   Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T.,
RA   Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I.,
RA   Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H.,
RA   Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S.,
RA   Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E.,
RA   Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A.,
RA   Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M.,
RA   Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A.,
RA   Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S.,
RA   Rogers J.;
RT   "The DNA sequence and comparative analysis of human chromosome 20.";
RL   Nature 414:865-871(2001).
RN   [5]
RP   SEQUENCE FROM N.A. (ISOFORM 1).
RC   TISSUE=Colon;
RX   MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA   Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA   Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA   Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA   Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA   Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA   Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA   Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA   Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA   Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA   Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA   Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA   Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA   Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA   Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA   Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human
RT   and mouse cDNA sequences.";
RL   Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).
CC   -!- SUBCELLULAR LOCATION: Secreted (Potential).
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Alternative splicing; Named isoforms=5;
CC         Comment=Additional isoforms seem to exist;
CC       Name=1;
CC         IsoId=Q14508-1; Sequence=Displayed;
CC       Name=2; Synonyms=HE4-V3;
CC         IsoId=Q14508-2; Sequence=VSP_007666, VSP_007667;
CC       Name=3; Synonyms=HE4-V2;
CC         IsoId=Q14508-3; Sequence=VSP_007668;
CC       Name=4; Synonyms=HE4-V1;
CC         IsoId=Q14508-4; Sequence=VSP_007669, VSP_007671;
CC       Name=5; Synonyms=HE4-V4;
CC         IsoId=Q14508-5; Sequence=VSP_007670, VSP_007672;
CC   -!- TISSUE SPECIFICITY: Expressed in a number of normal tissues,
CC       including male reproductive system, regions of the respiratory
CC       tract and nasopharynx. Highly expressed in a number of tumor
CC       cell lines, such as ovarian, colon, breast, lung and renal cell
CC       lines. Initially described as being exclusively transcribed in
CC       the epididymis.
CC   -!- SIMILARITY: Contains 2 WAP-type domains.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; X63187; CAA44869.1; -.
DR   EMBL; A18924; CAA01433.1; -.
DR   EMBL; AF330259; AAL37485.1; -.
DR   EMBL; AF330260; AAL37486.1; -.
DR   EMBL; AF330261; AAL37487.1; -.
DR   EMBL; AF330262; AAL37488.1; -.
DR   EMBL; AY212888; AAO52683.1; -.
DR   EMBL; AL031663; CAB37641.1; -.
DR   EMBL; BC046106; AAH46106.1; -.
DR   PIR; S25454; S25454.
DR   HSSP; Q9N0L8; 1TWP.
DR   Genew; HGNC:15939; WFDC2.
DR   GO; GO:0005615; C:extracellular space; TAS.
DR   GO; GO:0004866; F:endopeptidase inhibitor activity; TAS.
DR   GO; GO:0006508; P:proteolysis and peptidolysis; TAS.
DR   GO; GO:0007283; P:spermatogenesis; TAS.
DR   InterPro; IPR008197; WAP.
DR   InterPro; IPR008198; WAP_C.
DR   Pfam; PF00095; WAP; 2.
DR   PRINTS; PR00003; 4DISULPHCORE.
DR   ProDom; PD001224; WAP_C; 1.
DR   SMART; SM00217; WAP; 2.
DR   PROSITE; PS00317; 4_DISULFIDE_CORE; 2.
KW   Serine protease inhibitor; Repeat; Signal; Glycoprotein;
KW   Alternative splicing.
FT   SIGNAL        1     30       Potential.
FT   CHAIN        31    124       WAP four-disulfide core domain protein 2.
FT   DOMAIN       32     74       WAP 1.
FT   DOMAIN       76    124       WAP 2.
FT   DISULFID     36     62       By similarity.
FT   DISULFID     45     66       By similarity.
FT   DISULFID     49     61       By similarity.
FT   DISULFID     55     70       By similarity.
FT   DISULFID     80    110       By similarity.
FT   DISULFID     93    114       By similarity.
FT   DISULFID     97    109       By similarity.
FT   DISULFID    103    119       By similarity.
FT   CARBOHYD     44     44       N-linked (GlcNAc...) (Potential).
FT   VARSPLIC      2     23       PACRLGPLAAALLLSLLLFGFT ->
FT                                LQVQVNLPVSPLPTYPYSFFYP (in isoform 2).
FT                                /FTId=VSP_007666.
FT   VARSPLIC     24     74       Missing (in isoform 2).
FT                                /FTId=VSP_007667.
FT   VARSPLIC     27     74       Missing (in isoform 3).
FT                                /FTId=VSP_007668.
FT   VARSPLIC     71     79       SLPNDKEGS -> LLCPNGQLAE (in isoform 4).
FT                                /FTId=VSP_007669.
FT   VARSPLIC     75    102       DKEGSCPQVNINFPQLGLCRDQCQVDSQ -> ALFHWHLKT
FT                                RRLWEISGPRPRRPTWDSS (in isoform 5).
FT                                /FTId=VSP_007670.
FT   VARSPLIC     80    124       Missing (in isoform 4).
FT                                /FTId=VSP_007671.
FT   VARSPLIC    103    124       Missing (in isoform 5).
FT                                /FTId=VSP_007672.
FT   CONFLICT     71     72       SL -> LLC (in Ref. 1 and 2; AAL37485).
FT   CONFLICT    101    101       S -> T (in Ref. 1).
SQ   SEQUENCE   124 AA;  12993 MW;  9536B00B385259AD CRC64;
     MPACRLGPLA AALLLSLLLF GFTLVSGTGA EKTGVCPELQ ADQNCTQECV SDSECADNLK
     CCSAGCATFC SLPNDKEGSC PQVNINFPQL GLCRDQCQVD SQCPGQMKCC RNGCGKVSCV
     TPNF
//