P04279 (SEMG1_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 152.

Names and origin

Protein names
Recommended name: Semenogelin-1
Alternative name(s): Semenogelin I
Short name: SGI

Cleaved into the following 3 chains:

  1. Alpha-inhibin-92
  2. Alpha-inhibin-31
  3. Seminal basic protein

Gene names
Name: SEMG1
Synonyms: SEMG
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 462 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level.

General annotation (Comments)

Function

Predominant protein in semen. It participates in the formation of a gel matrix entrapping the accessory gland secretions and ejaculated spermatozoa. Fragments of semenogelin and/or fragments of the related proteins may contribute to the activation of progressive sperm movements as the gel-forming proteins are fragmented by KLK3/PSA. Ref.18

Alpha-inhibin-92 and alpha-inhibin-31, derived from the proteolytic degradation of semenogelin, inhibit the secretion of pituitary follicle-stimulating hormone. Ref.18

Subunit structure

Occurs in disulfide-linked complexes which may also contain two less abundant 71- and 76-kDa semenogelin-related polypeptides. Interacts with EPPIN (via C-terminus); Cys-239 is critical for binding to EPPIN. Ref.16 Ref.17

Subcellular location

Secreted.

Tissue specificity

Seminal vesicle.

Post-translational modification

Transglutaminase substrate.

Rapidly cleaved after ejaculation by KLK3/PSA, resulting in liquefaction of the semen coagulum and the progressive release of motile spermatozoa.

Sequence similarities

Belongs to the semenogelin family.

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: P04279-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: P04279-2)

The sequence of this isoform differs from the canonical sequence as follows:
     312-371: Missing.
Note: No experimental confirmation available.
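
The relationship between the two isoforms can be sketched in a few lines of Python. This is only an illustration: the sequence is the canonical one from the Sequences section of this entry, and the helper name `drop_range` is hypothetical. Positions are treated as 1-based and inclusive, as in the entry.

```python
# Canonical (isoform 1) sequence of P04279, copied from the Sequences section.
# Whitespace between the 10-residue blocks is stripped when joining.
CANONICAL = "".join("""
MKPNIIFVLS LLLILEKQAA VMGQKGGSKG RLPSEFSQFP HGQKGQHYSG QKGKQQTESK
GSFSIQYTYH VDANDHDQSR KSQQYDLNAL HKTTKSQRHL GGSQQLLHNK QEGRDHDKSK
GHFHRVVIHH KGGKAHRGTQ NPSQDQGNSP SGKGISSQYS NTEERLWVHG LSKEQTSVSG
AQKGRKQGGS QSSYVLQTEE LVANKQQRET KNSHQNKGHY QNVVEVREEH SSKVQTSLCP
AHQDKLQHGS KDIFSTQDEL LVYNKNQHQT KNLNQDQQHG RKANKISYQS SSTEERRLHY
GENGVQKDVS QSSIYSQTEE KAQGKSQKQI TIPSQEQEHS QKANKISYQS SSTEERRLHY
GENGVQKDVS QRSIYSQTEK LVAGKSQIQA PNPKQEPWHG ENAKGESGQS TNREQDLLSH
EQKGRHQHGS HGGLDIVIIE QEDDSDRHLA QHLNNDRNPL FT
""".split())


def drop_range(seq: str, start: int, end: int) -> str:
    """Return seq without residues start..end (1-based, inclusive)."""
    return seq[:start - 1] + seq[end:]


# Isoform 2 is described as the canonical sequence with 312-371 missing.
isoform2 = drop_range(CANONICAL, 312, 371)
print(len(CANONICAL), len(isoform2))  # 462 402
```

The resulting length agrees with the 402 AA listed for isoform 2 in the Sequences section.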

Sequence annotation (Features)

Feature key | Position(s) | Length | Description | Feature identifier

Molecule processing

Signal peptide | 1 – 23 | 23
Chain | 24 – 462 | 439 | Semenogelin-1 | PRO_0000032351
Peptide | 68 – 159 | 92 | Alpha-inhibin-92 Ref.14 | PRO_0000032352
Peptide | 108 – 159 | 52 | Seminal basic protein Ref.9 | PRO_0000032353
Peptide | 108 – 138 | 31 | Alpha-inhibin-31 Ref.10 Ref.11 | PRO_0000032354
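
As a rough sketch of how these coordinates map onto the sequence, the following Python snippet slices each processed product out of the canonical sequence. The dictionary `FEATURES` and the helper `subseq` are hypothetical names introduced for illustration; the coordinates are the ones listed above, read as 1-based and inclusive.

```python
# Canonical sequence of P04279 from the Sequences section of this entry.
SEQ = "".join("""
MKPNIIFVLS LLLILEKQAA VMGQKGGSKG RLPSEFSQFP HGQKGQHYSG QKGKQQTESK
GSFSIQYTYH VDANDHDQSR KSQQYDLNAL HKTTKSQRHL GGSQQLLHNK QEGRDHDKSK
GHFHRVVIHH KGGKAHRGTQ NPSQDQGNSP SGKGISSQYS NTEERLWVHG LSKEQTSVSG
AQKGRKQGGS QSSYVLQTEE LVANKQQRET KNSHQNKGHY QNVVEVREEH SSKVQTSLCP
AHQDKLQHGS KDIFSTQDEL LVYNKNQHQT KNLNQDQQHG RKANKISYQS SSTEERRLHY
GENGVQKDVS QSSIYSQTEE KAQGKSQKQI TIPSQEQEHS QKANKISYQS SSTEERRLHY
GENGVQKDVS QRSIYSQTEK LVAGKSQIQA PNPKQEPWHG ENAKGESGQS TNREQDLLSH
EQKGRHQHGS HGGLDIVIIE QEDDSDRHLA QHLNNDRNPL FT
""".split())

# (start, end) positions as given in the Molecule processing table.
FEATURES = {
    "Signal peptide": (1, 23),
    "Semenogelin-1 chain": (24, 462),
    "Alpha-inhibin-92": (68, 159),
    "Seminal basic protein": (108, 159),
    "Alpha-inhibin-31": (108, 138),
}


def subseq(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq (1-based, inclusive)."""
    return seq[start - 1:end]


for name, (start, end) in FEATURES.items():
    fragment = subseq(SEQ, start, end)
    print(f"{name}: {start}-{end}, {len(fragment)} aa")
```

Each printed length can be compared with the Length column of the table.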

Regions

Repeat | 70 – 129 | 60 | 3-1
Repeat | 141 – 200 | 60 | 2-1
Repeat | 201 – 260 | 60 | 2-2
Repeat | 381 – 439 | 59 | 3-2
Region | 70 – 439 | 370 | Repeat-rich region (By similarity)
Region | 164 – 283 | 120 | Interaction with EPPIN
Region | 261 – 380 | 120 | 2 X 60 AA tandem repeats, type 1
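
To get a feel for the type 1 tandem repeats, the sketch below compares the two halves of region 261–380 position by position. It assumes that the two 60-residue repeats are simply the consecutive halves 261–320 and 321–380, which the entry implies but does not state explicitly. The comparison is ungapped, so it is not a substitute for a real alignment. The function name `identity` is hypothetical.

```python
# Canonical sequence of P04279 from the Sequences section of this entry.
SEQ = "".join("""
MKPNIIFVLS LLLILEKQAA VMGQKGGSKG RLPSEFSQFP HGQKGQHYSG QKGKQQTESK
GSFSIQYTYH VDANDHDQSR KSQQYDLNAL HKTTKSQRHL GGSQQLLHNK QEGRDHDKSK
GHFHRVVIHH KGGKAHRGTQ NPSQDQGNSP SGKGISSQYS NTEERLWVHG LSKEQTSVSG
AQKGRKQGGS QSSYVLQTEE LVANKQQRET KNSHQNKGHY QNVVEVREEH SSKVQTSLCP
AHQDKLQHGS KDIFSTQDEL LVYNKNQHQT KNLNQDQQHG RKANKISYQS SSTEERRLHY
GENGVQKDVS QSSIYSQTEE KAQGKSQKQI TIPSQEQEHS QKANKISYQS SSTEERRLHY
GENGVQKDVS QRSIYSQTEK LVAGKSQIQA PNPKQEPWHG ENAKGESGQS TNREQDLLSH
EQKGRHQHGS HGGLDIVIIE QEDDSDRHLA QHLNNDRNPL FT
""".split())


def identity(a: str, b: str) -> float:
    """Fraction of identical residues between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("ungapped comparison needs equal lengths")
    return sum(x == y for x, y in zip(a, b)) / len(a)


# Assumed split of region 261-380 into two consecutive 60-residue repeats.
repeat_a = SEQ[260:320]  # residues 261-320
repeat_b = SEQ[320:380]  # residues 321-380

print(repeat_a)
print(repeat_b)
print(f"ungapped identity: {identity(repeat_a, repeat_b):.1%}")
```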

Amino acid modifications

Modified residue | 24 | 1 | Pyrrolidone carboxylic acid (Probable)
Disulfide bond | 239 | | Interchain

Natural variations

Alternative sequence | 312 – 371 | 60 | Missing in isoform 2. | VSP_004385
Natural variant | 58 | 1 | E → G. Corresponds to variant rs11559137. | VAR_053650
Natural variant | 79 | 1 | S → T. Less common genetic variant. Ref.3 Ref.8 Corresponds to variant rs2301366. | VAR_005610
Natural variant | 108 | 1 | H → R. Corresponds to variant rs2233884. | VAR_053651
Natural variant | 372 | 1 | R → L. Ref.3 Corresponds to variant rs2233887. | VAR_022679
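
A minimal sketch of how the single-residue variants above could be applied to the canonical sequence. The helper `apply_substitution` is a hypothetical name; it checks that the reference residue at the stated position matches the sequence before substituting, which is a useful sanity check when positions are read from a flattened table.

```python
# Canonical sequence of P04279 from the Sequences section of this entry.
SEQ = "".join("""
MKPNIIFVLS LLLILEKQAA VMGQKGGSKG RLPSEFSQFP HGQKGQHYSG QKGKQQTESK
GSFSIQYTYH VDANDHDQSR KSQQYDLNAL HKTTKSQRHL GGSQQLLHNK QEGRDHDKSK
GHFHRVVIHH KGGKAHRGTQ NPSQDQGNSP SGKGISSQYS NTEERLWVHG LSKEQTSVSG
AQKGRKQGGS QSSYVLQTEE LVANKQQRET KNSHQNKGHY QNVVEVREEH SSKVQTSLCP
AHQDKLQHGS KDIFSTQDEL LVYNKNQHQT KNLNQDQQHG RKANKISYQS SSTEERRLHY
GENGVQKDVS QSSIYSQTEE KAQGKSQKQI TIPSQEQEHS QKANKISYQS SSTEERRLHY
GENGVQKDVS QRSIYSQTEK LVAGKSQIQA PNPKQEPWHG ENAKGESGQS TNREQDLLSH
EQKGRHQHGS HGGLDIVIIE QEDDSDRHLA QHLNNDRNPL FT
""".split())

# (position, reference residue, variant residue, dbSNP id) from the table.
VARIANTS = [
    (58, "E", "G", "rs11559137"),
    (79, "S", "T", "rs2301366"),
    (108, "H", "R", "rs2233884"),
    (372, "R", "L", "rs2233887"),
]


def apply_substitution(seq: str, pos: int, ref: str, alt: str) -> str:
    """Replace the residue at 1-based position pos, verifying ref first."""
    if seq[pos - 1] != ref:
        raise ValueError(f"expected {ref} at {pos}, found {seq[pos - 1]}")
    return seq[:pos - 1] + alt + seq[pos:]


for pos, ref, alt, rsid in VARIANTS:
    variant_seq = apply_substitution(SEQ, pos, ref, alt)
    print(f"{rsid}: {ref}{pos}{alt}, length {len(variant_seq)}")
```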

Experimental info

Mutagenesis | 239 | 1 | C → G: Abrogates binding to EPPIN and does not inhibit sperm motility. Ref.18
Sequence conflict | 100 | 1 | L → Q in AAP82463. Ref.3
Sequence conflict | 235 – 237 | 3 | QTS → LRT in AAO20112. Ref.8
Sequence conflict | 235 – 237 | 3 | QTS → LRT in AAO20113. Ref.8
Sequence conflict | 321 | 1 | K → L in AA sequence. Ref.12
Sequence conflict | 423 | 1 | K → N in CAA87636. Ref.2
Sequence conflict | 423 | 1 | K → N in AAA18168. Ref.2
Sequence conflict | 457 | 1 | R → Q. Ref.4
Sequence conflict | 457 | 1 | R → Q. Ref.7

Sequences

Isoform 1 [UniParc].

Last modified January 1, 1990. Version 2.
Checksum: 760F48EFCF2FA702
Length: 462 AA. Mass: 52,131 Da.
        10         20         30         40         50         60 
MKPNIIFVLS LLLILEKQAA VMGQKGGSKG RLPSEFSQFP HGQKGQHYSG QKGKQQTESK 

        70         80         90        100        110        120 
GSFSIQYTYH VDANDHDQSR KSQQYDLNAL HKTTKSQRHL GGSQQLLHNK QEGRDHDKSK 

       130        140        150        160        170        180 
GHFHRVVIHH KGGKAHRGTQ NPSQDQGNSP SGKGISSQYS NTEERLWVHG LSKEQTSVSG 

       190        200        210        220        230        240 
AQKGRKQGGS QSSYVLQTEE LVANKQQRET KNSHQNKGHY QNVVEVREEH SSKVQTSLCP 

       250        260        270        280        290        300 
AHQDKLQHGS KDIFSTQDEL LVYNKNQHQT KNLNQDQQHG RKANKISYQS SSTEERRLHY 

       310        320        330        340        350        360 
GENGVQKDVS QSSIYSQTEE KAQGKSQKQI TIPSQEQEHS QKANKISYQS SSTEERRLHY 

       370        380        390        400        410        420 
GENGVQKDVS QRSIYSQTEK LVAGKSQIQA PNPKQEPWHG ENAKGESGQS TNREQDLLSH 

       430        440        450        460 
EQKGRHQHGS HGGLDIVIIE QEDDSDRHLA QHLNNDRNPL FT 
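
The numbered display above can be turned into a plain sequence string, and an average molecular mass estimated from it, with a short Python sketch. The residue mass table below holds commonly used average residue masses and is an assumption of this example, not something taken from the entry. The function name `average_mass` is likewise hypothetical.

```python
# Sequence blocks copied from the display above (position rulers omitted).
SEQ = "".join("""
MKPNIIFVLS LLLILEKQAA VMGQKGGSKG RLPSEFSQFP HGQKGQHYSG QKGKQQTESK
GSFSIQYTYH VDANDHDQSR KSQQYDLNAL HKTTKSQRHL GGSQQLLHNK QEGRDHDKSK
GHFHRVVIHH KGGKAHRGTQ NPSQDQGNSP SGKGISSQYS NTEERLWVHG LSKEQTSVSG
AQKGRKQGGS QSSYVLQTEE LVANKQQRET KNSHQNKGHY QNVVEVREEH SSKVQTSLCP
AHQDKLQHGS KDIFSTQDEL LVYNKNQHQT KNLNQDQQHG RKANKISYQS SSTEERRLHY
GENGVQKDVS QSSIYSQTEE KAQGKSQKQI TIPSQEQEHS QKANKISYQS SSTEERRLHY
GENGVQKDVS QRSIYSQTEK LVAGKSQIQA PNPKQEPWHG ENAKGESGQS TNREQDLLSH
EQKGRHQHGS HGGLDIVIIE QEDDSDRHLA QHLNNDRNPL FT
""".split())

# Average residue masses in Da (assumed standard values, not from the entry).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # one water per chain accounts for the free N- and C-termini


def average_mass(seq: str) -> float:
    """Sum of average residue masses plus one water molecule."""
    return sum(AVG_RESIDUE_MASS[aa] for aa in seq) + WATER


print(len(SEQ))
print(round(average_mass(SEQ)))
```

The printed length and mass can be compared with the 462 AA and 52,131 Da listed for isoform 1. The estimate covers the unprocessed precursor, including the signal peptide, and ignores any modifications.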

Isoform 2 [UniParc].

Checksum: 4D7F264E7C7FC15D
Length: 402 AA. Mass: 45,322 Da.

References

[1] "Semenogelin, the predominant protein in human semen. Primary structure and identification of closely related proteins in the male accessory sex glands and on the spermatozoa."
Lilja H., Abrahamsson P.-A., Lundwall A.
J. Biol. Chem. 264:1894-1900(1989)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1).
[2] "Gene structure of semenogelin I and II. The predominant proteins in human semen are encoded by two homologous genes on chromosome 20."
Ulvsbaeck M., Lazure C., Lilja H., Spurr N.K., Rao V.V., Loeffler C., Hansmann I., Lundwall A.
J. Biol. Chem. 267:18080-18084(1992)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Tissue: Blood.
[3] "Evolution of the hominoid semenogelin genes, the major proteins of ejaculated semen."
Jensen-Seaman M.I., Li W.-H.
J. Mol. Evol. 57:261-270(2003)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANTS THR-79 AND LEU-372.
[4] "Cloning of human full-length CDSs in BD Creator(TM) system donor vector."
Kalnine N., Chen X., Rolfs A., Halleck A., Hines L., Eisenstein S., Koundinya M., Raphael J., Moreira D., Kelley T., LaBaer J., Lin Y., Phelan M., Farmer A.
Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
[5] "The DNA sequence and comparative analysis of human chromosome 20."
Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M., Brown A.J., Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., Rogers J.
Nature 414:865-871(2001)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[6] Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., Venter J.C.
Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[7] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Tissue: Prostate.
[8] "Reduced polymorphism in the chimpanzee semen coagulating protein, semenogelin I."
Kingan S.B., Tatar M., Rand D.M.
J. Mol. Evol. 57:159-169(2003)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 62-237 AND 241-449, VARIANT THR-79.
[9] "Amino acid sequence of the predominant basic protein in human seminal plasma."
Lilja H., Jeppsson J.-O.
FEBS Lett. 182:181-184(1985)
Cited for: PROTEIN SEQUENCE OF 108-159.
[10] "Partial amino acid sequence of a human seminal plasma peptide with inhibin-like activity."
Seidah N.G., Ramasharma K., Sairam M.R., Chretien M.
FEBS Lett. 167:98-102(1984)
Cited for: PROTEIN SEQUENCE OF 108-138.
[11] "Isolation, structure, and synthesis of a human seminal plasma peptide with inhibin-like activity."
Ramasharma K., Sairam M.R., Seidah N.G., Chretien M., Manjunath P., Schiller P.W., Yamashiro D., Li C.H.
Science 223:1199-1202(1984)
Cited for: PROTEIN SEQUENCE OF 108-138.
[12] "Isolation and structure determination of two peptides occurring in human seminal plasma."
Schneider K., Kausler W., Tripier D., Jouvenal K., Spiteller G.
Biol. Chem. Hoppe-Seyler 370:353-356(1989)
Cited for: PROTEIN SEQUENCE OF 316-344.
[13] "Isolation and identification of N-terminally extended forms of 5-oxoprolylglutamylprolinamide (Glp-Glu-Pro-NH2), a thyrotropin-releasing-hormone (TRH)-like peptide present in human semen."
Khan Z., Smyth D.G.
Eur. J. Biochem. 212:35-40(1993)
Cited for: PROTEIN SEQUENCE OF 373-397.
[14] "Human seminal alpha inhibins: isolation, characterization, and structure."
Li C.H., Hammonds R.G., Ramasharma K., Chung D.
Proc. Natl. Acad. Sci. U.S.A. 82:4041-4044(1985)
Cited for: PROTEIN SEQUENCE OF 68-159.
[15] "Semenogelin I: a coagulum forming, multifunctional seminal vesicle protein."
Robert M., Gagnon C.
Cell. Mol. Life Sci. 55:944-960(1999)
Cited for: REVIEW.
[16] "Association of eppin with semenogelin on human spermatozoa."
Wang Z., Widgren E.E., Sivashanmugam P., O'Rand M.G., Richardson R.T.
Biol. Reprod. 72:1064-1070(2005)
Cited for: INTERACTION WITH EPPIN.
[17] "Characterization of an eppin protein complex from human semen and spermatozoa."
Wang Z., Widgren E.E., Richardson R.T., O'Rand M.G.
Biol. Reprod. 77:476-484(2007)
Cited for: IDENTIFICATION IN A COMPLEX WITH LTF; CLU AND EPPIN.
[18] "Analysis of recombinant human semenogelin as an inhibitor of human sperm motility."
Mitra A., Richardson R.T., O'Rand M.G.
Biol. Reprod. 82:489-496(2010)
Cited for: FUNCTION, MUTAGENESIS OF CYS-239.

Web resources

Protein Spotlight

Shackled sperm - Issue 62 of September 2005

Cross-references

Sequence databases

EMBL/GenBank/DDBJ
J04440 mRNA. Translation: AAB59506.1.
Z47556 Genomic DNA. Translation: CAA87636.1.
M81650 Genomic DNA. Translation: AAA18168.1.
AY256465 Genomic DNA. Translation: AAP82462.1.
AY256466 Genomic DNA. Translation: AAP82463.1.
AY256467 Genomic DNA. Translation: AAP82464.1.
AY256468 Genomic DNA. Translation: AAP82465.1.
AY256469 Genomic DNA. Translation: AAP82466.1.
BT007177 mRNA. Translation: AAP35841.1.
AL049767 Genomic DNA. Translation: CAB53523.1.
CH471077 Genomic DNA. Translation: EAW75871.1.
BC007096 mRNA. Translation: AAH07096.1.
BC055416 mRNA. Translation: AAH55416.1.
AY174423 Genomic DNA. Translation: AAO20112.1.
AY174424 Genomic DNA. Translation: AAO20113.1.
AY174437 Genomic DNA. Translation: AAO20126.1.
CCDS: CCDS13345.1. [P04279-1]
PIR: WTHUB. B43412.
RefSeq: NP_002998.1. NM_003007.3. [P04279-1]
UniGene: Hs.1968.

3D structure databases

ProteinModelPortal: P04279.

Protein-protein interaction databases

BioGrid: 112306. 7 interactions.
IntAct: P04279. 2 interactions.
MINT: MINT-2862979.

PTM databases

PhosphoSite: P04279.

Polymorphism databases

DMDM: 134426.

Proteomic databases

MaxQB: P04279.
PaxDb: P04279.
PRIDE: P04279.

Protocols and materials databases

DNASU: 6406.

Genome annotation databases

Ensembl: ENST00000244069; ENSP00000244069; ENSG00000124233. [P04279-2]
Ensembl: ENST00000372781; ENSP00000361867; ENSG00000124233. [P04279-1]
GeneID: 6406.
KEGG: hsa:6406.
UCSC: uc002xni.2. human. [P04279-1]
UCSC: uc002xnj.2. human. [P04279-2]

Organism-specific databases

CTD: 6406.
GeneCards: GC20P043835.
HGNC: HGNC:10742. SEMG1.
HPA: HPA042476.
MIM: 182140. gene.
neXtProt: NX_P04279.
PharmGKB: PA35664.

Phylogenomic databases

eggNOG: NOG74001.
HOVERGEN: HBG054194.
InParanoid: P04279.
OMA: QNPNQDQ.
OrthoDB: EOG7J4469.
PhylomeDB: P04279.
TreeFam: TF342360.

Enzyme and pathway databases

Reactome: REACT_116125. Disease.

Gene expression databases

ArrayExpress: P04279.
Bgee: P04279.
CleanEx: HS_SEMG1.
Genevestigator: P04279.

Family and domain databases

InterPro: IPR008836. Semenogelin.
PANTHER: PTHR10547. PTHR10547. 1 hit.
Pfam: PF05474. Semenogelin. 2 hits.

Other

GeneWiki: Semenogelin_I.
GenomeRNAi: 6406.
NextBio: 24890.
PMAP-CutDB: P04279.
PRO: P04279.

Entry information

Entry name: SEMG1_HUMAN
Accession
Primary (citable) accession number: P04279
Secondary accession number(s): Q53ZV0, Q53ZV1, Q53ZV2, Q6X4I9, Q6Y809, Q6Y822, Q6Y823, Q86U64, Q96QM3
Entry history
Integrated into UniProtKB/Swiss-Prot: March 20, 1987
Last sequence update: January 1, 1990
Last modified: July 9, 2014
This is version 152 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

Protein Spotlight

Protein Spotlight articles and cited UniProtKB/Swiss-Prot entries

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human chromosome 20

Human chromosome 20: entries, gene names and cross-references to MIM