Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

Q6UW88 (EPGN_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 89. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Epigen
Alternative name(s):
Epithelial mitogen
Short name=EPG
Gene names
Name:EPGN
ORF Names:UNQ3072/PRO9904
OrganismHomo sapiens (Human) [Reference proteome]
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length154 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Promotes the growth of epithelial cells. May stimulate the phosphorylation of EGFR and mitogen-activated protein kinases. Ref.5

Subcellular location

Isoform 1: Membrane; Single-pass type I membrane protein Probable.

Isoform 2: Membrane; Single-pass type I membrane protein Probable.

Isoform 3: Secreted Probable.

Isoform 4: Secreted Probable.

Isoform 5: Secreted Probable.

Isoform 6: Secreted Probable.

Sequence similarities

Contains 1 EGF-like domain.

Alternative products

This entry describes 7 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q6UW88-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q6UW88-2)

The sequence of this isoform differs from the canonical sequence as follows:
     36-44: Missing.
     137-154: CLKLKSPYNVCSGERRPL → YEKDKI
Isoform 3 (identifier: Q6UW88-3)

Also known as: B;

The sequence of this isoform differs from the canonical sequence as follows:
     96-137: Missing.
Isoform 4 (identifier: Q6UW88-4)

Also known as: E;

The sequence of this isoform differs from the canonical sequence as follows:
     87-137: Missing.
Isoform 5 (identifier: Q6UW88-5)

Also known as: F;

The sequence of this isoform differs from the canonical sequence as follows:
     36-44: Missing.
     96-137: Missing.
Isoform 6 (identifier: Q6UW88-6)

Also known as: G;

The sequence of this isoform differs from the canonical sequence as follows:
     36-44: Missing.
     87-137: Missing.
Isoform 7 (identifier: Q6UW88-7)

Also known as: D;

The sequence of this isoform differs from the canonical sequence as follows:
     15-44: Missing.
     87-137: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 2222 Ref.4
Chain23 – 154132Epigen
PRO_0000045462

Regions

Topological domain23 – 11088Extracellular Potential
Transmembrane111 – 13121Helical; Potential
Topological domain132 – 15423Cytoplasmic Potential
Domain56 – 9641EGF-like

Amino acid modifications

Glycosylation371N-linked (GlcNAc...) Potential
Glycosylation411N-linked (GlcNAc...) Potential
Disulfide bond60 ↔ 73 By similarity
Disulfide bond68 ↔ 84 By similarity
Disulfide bond86 ↔ 95 By similarity

Natural variations

Alternative sequence15 – 4430Missing in isoform 7.
VSP_036654
Alternative sequence36 – 449Missing in isoform 2, isoform 5 and isoform 6.
VSP_036655
Alternative sequence87 – 13751Missing in isoform 4, isoform 6 and isoform 7.
VSP_036656
Alternative sequence96 – 13742Missing in isoform 3 and isoform 5.
VSP_036657
Alternative sequence137 – 15418CLKLK…ERRPL → YEKDKI in isoform 2.
VSP_036658

Experimental info

Sequence conflict91V → A in ABB60048. Ref.1
Sequence conflict621E → G in ABB60047. Ref.1

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified March 24, 2009. Version 2.
Checksum: BC78016BE3026474

FASTA15417,091
        10         20         30         40         50         60 
MALGVPISVY LLFNAMTALT EEAAVTVTPP ITAQQGNWTV NKTEADNIEG PIALKFSHLC 

        70         80         90        100        110        120 
LEDHNSYCIN GACAFHHELE KAICRCFTGY TGERCEHLTL TSYAVDSYEK YIAIGIGVGL 

       130        140        150 
LLSGFLVIFY CYIRKRCLKL KSPYNVCSGE RRPL 

« Hide

Isoform 2 [UniParc].

Checksum: 7A5B3257D0484182
Show »

FASTA13314,792
Isoform 3 (B) [UniParc].

Checksum: 8D0EE50785FA2891
Show »

FASTA11212,319
Isoform 4 (E) [UniParc].

Checksum: 062973EAC9EA4570
Show »

FASTA10311,304
Isoform 5 (F) [UniParc].

Checksum: 1C1FB863C2FE3419
Show »

FASTA10311,289
Isoform 6 (G) [UniParc].

Checksum: 13193DA6BF249099
Show »

FASTA9410,274
Isoform 7 (D) [UniParc].

Checksum: 860BFE12C853D368
Show »

FASTA738,149

References

« Hide 'large scale' references
[1]"Human keratinocytes express several alternatively spliced forms of EPIGEN, encoding directly secreted and intracellular proteins."
Johnstone L.S., Abumaree M., Martin J., Webster G., Murison G.
Submitted (OCT-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1; 3; 4; 5; 6 AND 7).
[2]"The secreted protein discovery initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: a bioinformatics assessment."
Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D.T., Brush J., Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P., Eaton D., Foster J.S., Grimaldi C., Gu Q., Hass P.E. expand/collapse author list , Heldens S., Huang A., Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J., Lewis L., Liao D., Mark M.R., Robbie E., Sanchez C., Schoenfeld J., Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A., Vandlen R.L., Watanabe C., Wieand D., Woods K., Xie M.-H., Yansura D.G., Yi S., Yu G., Yuan J., Zhang M., Zhang Z., Goddard A.D., Wood W.I., Godowski P.J., Gray A.M.
Genome Res. 13:2265-2270(2003) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
[3]"Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S. expand/collapse author list , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
[4]"Signal peptide prediction based on analysis of experimentally verified cleavage sites."
Zhang Z., Henzel W.J.
Protein Sci. 13:2819-2824(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: PROTEIN SEQUENCE OF 23-37.
[5]"Epigen, the last ligand of ErbB receptors, reveals intricate relationships between affinity and mitogenicity."
Kochupurakkal B.S., Harari D., Di-Segni A., Maik-Rachline G., Lyass L., Gur G., Kerber G., Citri A., Lavi S., Eilam R., Chalifa-Caspi V., Eshhar Z., Pikarsky E., Pinkas-Kramarski R., Bacus S.S., Yarden Y.
J. Biol. Chem. 280:8503-8512(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: FUNCTION.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
DQ235264 mRNA. Translation: ABB60043.1.
DQ235265 mRNA. Translation: ABB60044.1.
DQ235266 mRNA. Translation: ABB60045.1.
DQ235267 mRNA. Translation: ABB60046.1.
DQ235268 mRNA. Translation: ABB60047.1.
DQ235269 mRNA. Translation: ABB60048.1.
AY358920 mRNA. Translation: AAQ89279.1.
AK289455 mRNA. Translation: BAF82144.1.
RefSeqNP_001257918.1. NM_001270989.1.
NP_001257919.1. NM_001270990.1.
NP_001257920.1. NM_001270991.1.
NP_001257921.1. NM_001270992.1.
NP_001257922.1. NM_001270993.1.
XP_005265723.1. XM_005265666.2.
UniGeneHs.401237.

3D structure databases

ProteinModelPortalQ6UW88.
SMRQ6UW88. Positions 60-100.
ModBaseSearch...
MobiDBSearch...

Protein-protein interaction databases

STRING9606.ENSP00000330375.

Polymorphism databases

DMDM229464464.

Protocols and materials databases

DNASU255324.
StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000332112; ENSP00000330375; ENSG00000182585. [Q6UW88-2]
ENST00000413830; ENSP00000411898; ENSG00000182585. [Q6UW88-1]
ENST00000502358; ENSP00000426678; ENSG00000182585. [Q6UW88-4]
ENST00000503098; ENSP00000425890; ENSG00000182585. [Q6UW88-3]
ENST00000505212; ENSP00000424392; ENSG00000182585. [Q6UW88-6]
ENST00000509145; ENSP00000426630; ENSG00000182585. [Q6UW88-7]
ENST00000514968; ENSP00000426550; ENSG00000182585. [Q6UW88-5]
GeneID255324.
KEGGhsa:255324.
UCSCuc003hhw.4. human. [Q6UW88-2]
uc003hhy.2. human. [Q6UW88-6]
uc003hhz.2. human. [Q6UW88-4]
uc003hia.2. human. [Q6UW88-5]
uc003hib.2. human. [Q6UW88-3]
uc003hic.2. human. [Q6UW88-1]
uc010iin.2. human. [Q6UW88-7]

Organism-specific databases

CTD255324.
GeneCardsGC04P075174.
HGNCHGNC:17470. EPGN.
HPAHPA014369.
HPA014420.
neXtProtNX_Q6UW88.
PharmGKBPA162385143.
GenAtlasSearch...

Phylogenomic databases

eggNOGNOG43616.
HOVERGENHBG079616.
InParanoidQ6UW88.
OMASNWTVNK.
OrthoDBEOG761BWF.
PhylomeDBQ6UW88.
TreeFamTF335931.

Gene expression databases

BgeeQ6UW88.
CleanExHS_EPGN.
GenevestigatorQ6UW88.

Family and domain databases

InterProIPR000742. EG-like_dom.
IPR013032. EGF-like_CS.
IPR015497. EGF_rcpt_ligand.
[Graphical view]
PANTHERPTHR10740. PTHR10740. 1 hit.
SMARTSM00181. EGF. 1 hit.
[Graphical view]
PROSITEPS00022. EGF_1. 1 hit.
PS01186. EGF_2. 1 hit.
PS50026. EGF_3. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

GenomeRNAi255324.
NextBio92547.
PROQ6UW88.

Entry information

Entry nameEPGN_HUMAN
AccessionPrimary (citable) accession number: Q6UW88
Secondary accession number(s): A1BMM3 expand/collapse secondary AC list , A1BMM4, A1BMM5, A1BMM6, A1BMM7, A1BMM8, A8K090
Entry history
Integrated into UniProtKB/Swiss-Prot: January 10, 2006
Last sequence update: March 24, 2009
Last modified: April 16, 2014
This is version 89 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

Human chromosome 4

Human chromosome 4: entries, gene names and cross-references to MIM