Protein

Semenogelin-2

Gene

SEMG2

Organism
Homo sapiens (Human)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Participates in the formation of a gel matrix (sperm coagulum) entrapping the accessory gland secretions and ejaculated spermatozoa.

GO - Molecular function

  1. structural molecule activity Source: InterPro

GO - Biological process

  1. sexual reproduction Source: InterPro

Names & Taxonomy

Protein names
Recommended name:
Semenogelin-2
Alternative name(s):
Semenogelin II
Short name:
SGII
Gene names
Name: SEMG2
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes: UP000005640. Component: Chromosome 20

Organism-specific databases

HGNC: HGNC:10743. SEMG2.

Subcellular location

GO - Cellular component

  1. extracellular space Source: ProtInc
  2. extracellular vesicular exosome Source: UniProtKB
  3. nucleus Source: UniProtKB
  4. secretory granule Source: InterPro

Keywords - Cellular component

Secreted

Pathology & Biotech

Organism-specific databases

PharmGKB: PA35665.

Polymorphism and mutation databases

BioMuta: SEMG2.
DMDM: 401079.

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description     Feature identifier
Signal peptide   1 – 23        23
Chain            24 – 582      559      Semenogelin-2   PRO_0000032359

Amino acid modifications

Feature key      Position(s)   Length   Description
Glycosylation    272 – 272     1        N-linked (GlcNAc...) (1 Publication)
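
As a rough illustration of the glycosylation annotation, the sketch below scans a short stretch of the displayed sequence (residues 261 to 280) for the canonical N-glycosylation sequon, Asn-X-Ser/Thr with X not Pro. The sequon rule is general biochemistry rather than something stated in this entry, and `FRAGMENT`, `FRAGMENT_START` and `find_sequons` are names chosen here for illustration only. A motif match does not by itself show that a site is glycosylated.

```python
import re

# Residues 261-280 of the displayed sequence, copied from the entry.
FRAGMENT = "LVYNKNQHQTKNLSQDQEHG"
FRAGMENT_START = 261  # 1-based position of the first residue in FRAGMENT

# Assumed motif: Asn, then any residue except Pro, then Ser or Thr.
# The lookahead keeps matches zero-width so overlapping sequons are all found.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(fragment: str, start: int) -> list[int]:
    """Return 1-based positions of Asn residues that begin an N-X-[S/T] motif."""
    return [start + match.start() for match in SEQUON.finditer(fragment)]


sequon_positions = find_sequons(FRAGMENT, FRAGMENT_START)
print(sequon_positions)
```

Of the three asparagines in this fragment, only the one at position 272 is followed by the motif, which agrees with the annotated N-linked site.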

Post-translational modification

Semenogelin-2 is thought to form both the 71 kDa polypeptide and, in its glycosylated form, the 76 kDa polypeptide. (1 Publication)

Keywords - PTM

Glycoprotein

Proteomic databases

PaxDb: Q02383.
PeptideAtlas: Q02383.
PRIDE: Q02383.

PTM databases

PhosphoSite: Q02383.

Miscellaneous databases

PMAP-CutDB: Q02383.

Expression

Tissue specificity

Seminal vesicles and, to a much lesser extent, epididymis.

Gene expression databases

Bgee: Q02383.
CleanEx: HS_SEMG2.
Genevestigator: Q02383.

Organism-specific databases

HPA: HPA042767.
HPA042835.

Interaction

Subunit structure

Interacts with SERPINA5. (1 Publication)

Protein-protein interaction databases

BioGrid: 112307. 11 interactions.
IntAct: Q02383. 4 interactions.
STRING: 9606.ENSP00000361855.

Structure

3D structure databases

ProteinModelPortal: Q02383.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Length   Description
Repeat        70 – 129      60       3-1
Repeat        141 – 200     60       2-1
Repeat        201 – 260     60       2-2
Repeat        501 – 559     59       3-2

Region

Feature key   Position(s)   Length   Description
Region        70 – 559      490      Repeat-rich region
Region        261 – 500     240      4 X 60 AA tandem repeats, type I
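
The lengths in the repeat and region tables follow from inclusive 1-based coordinates (length = end - start + 1). The sketch below re-derives them from the positions listed above and flags any mismatch. Splitting the type I region into four 60-residue units is done here by even division only; the individual unit boundaries are not listed in this entry, so they are an assumption of the sketch. `FEATURES`, `span_length` and `split_tandem` are illustrative names.

```python
# (feature key, start, end, listed length, description), copied from the tables.
FEATURES = [
    ("Repeat", 70, 129, 60, "3-1"),
    ("Repeat", 141, 200, 60, "2-1"),
    ("Repeat", 201, 260, 60, "2-2"),
    ("Repeat", 501, 559, 59, "3-2"),
    ("Region", 70, 559, 490, "Repeat-rich region"),
    ("Region", 261, 500, 240, "4 X 60 AA tandem repeats, type I"),
]


def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based start and end positions."""
    return end - start + 1


def split_tandem(start: int, end: int, unit: int) -> list[tuple[int, int]]:
    """Cut an inclusive range into consecutive units of equal size."""
    return [(s, s + unit - 1) for s in range(start, end + 1, unit)]


# Features whose listed length disagrees with their coordinates.
mismatches = [f for f in FEATURES if span_length(f[1], f[2]) != f[3]]

# Assumed boundaries of the four type I units (even division of 261-500).
type_i_units = split_tandem(261, 500, 60)

print(mismatches)
print(type_i_units)
```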

Sequence similarities

Belongs to the semenogelin family. (Curated)

Keywords - Domain

Repeat, Signal

Phylogenomic databases

eggNOG: NOG87347.
GeneTree: ENSGT00390000020321.
HOGENOM: HOG000263413.
HOVERGEN: HBG054194.
InParanoid: Q02383.
OMA: GHKENKI.
OrthoDB: EOG7J4469.
PhylomeDB: Q02383.
TreeFam: TF342360.

Family and domain databases

InterPro: IPR008836. Semenogelin.
PANTHER: PTHR10547. PTHR10547. 1 hit.
Pfam: PF05474. Semenogelin. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q02383-1 [UniParc]

        10         20         30         40         50
MKSIILFVLS LLLILEKQAA VMGQKGGSKG QLPSGSSQFP HGQKGQHYFG
        60         70         80         90        100
QKDQQHTKSK GSFSIQHTYH VDINDHDWTR KSQQYDLNAL HKATKSKQHL
       110        120        130        140        150
GGSQQLLNYK QEGRDHDKSK GHFHMIVIHH KGGQAHHGTQ NPSQDQGNSP
       160        170        180        190        200
SGKGLSSQCS NTEKRLWVHG LSKEQASASG AQKGRTQGGS QSSYVLQTEE
       210        220        230        240        250
LVVNKQQRET KNSHQNKGHY QNVVDVREEH SSKLQTSLHP AHQDRLQHGP
       260        270        280        290        300
KDIFTTQDEL LVYNKNQHQT KNLSQDQEHG RKAHKISYPS SRTEERQLHH
       310        320        330        340        350
GEKSVQKDVS KGSISIQTEE KIHGKSQNQV TIHSQDQEHG HKENKISYQS
       360        370        380        390        400
SSTEERHLNC GEKGIQKGVS KGSISIQTEE QIHGKSQNQV RIPSQAQEYG
       410        420        430        440        450
HKENKISYQS SSTEERRLNS GEKDVQKGVS KGSISIQTEE KIHGKSQNQV
       460        470        480        490        500
TIPSQDQEHG HKENKMSYQS SSTEERRLNY GGKSTQKDVS QSSISFQIEK
       510        520        530        540        550
LVEGKSQIQT PNPNQDQWSG QNAKGKSGQS ADSKQDLLSH EQKGRYKQES
       560        570        580
SESHNIVITE HEVAQDDHLT QQYNEDRNPI ST
Length: 582
Mass (Da): 65,444
Last modified: July 1, 1993 - v1
Checksum: EBF63FBF3A8EC45B
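
The length, signal peptide and chain figures can be cross-checked against the displayed sequence. The following is a minimal sketch: it strips the spacing from the sequence block, splits it at the annotated signal peptide boundary (positions 1 to 23), and estimates an average molecular mass. The residue masses are standard average values supplied here as an assumption; the entry does not say how its listed mass was computed, so the estimate is only for comparison with the 65,444 Da shown above.

```python
# Sequence lines copied from the entry, without the position rulers.
SEQUENCE_BLOCK = """
MKSIILFVLS LLLILEKQAA VMGQKGGSKG QLPSGSSQFP HGQKGQHYFG
QKDQQHTKSK GSFSIQHTYH VDINDHDWTR KSQQYDLNAL HKATKSKQHL
GGSQQLLNYK QEGRDHDKSK GHFHMIVIHH KGGQAHHGTQ NPSQDQGNSP
SGKGLSSQCS NTEKRLWVHG LSKEQASASG AQKGRTQGGS QSSYVLQTEE
LVVNKQQRET KNSHQNKGHY QNVVDVREEH SSKLQTSLHP AHQDRLQHGP
KDIFTTQDEL LVYNKNQHQT KNLSQDQEHG RKAHKISYPS SRTEERQLHH
GEKSVQKDVS KGSISIQTEE KIHGKSQNQV TIHSQDQEHG HKENKISYQS
SSTEERHLNC GEKGIQKGVS KGSISIQTEE QIHGKSQNQV RIPSQAQEYG
HKENKISYQS SSTEERRLNS GEKDVQKGVS KGSISIQTEE KIHGKSQNQV
TIPSQDQEHG HKENKMSYQS SSTEERRLNY GGKSTQKDVS QSSISFQIEK
LVEGKSQIQT PNPNQDQWSG QNAKGKSGQS ADSKQDLLSH EQKGRYKQES
SESHNIVITE HEVAQDDHLT QQYNEDRNPI ST
"""

SIGNAL_PEPTIDE_END = 23  # last residue of the annotated signal peptide

# Assumed standard average residue masses in Da (not taken from the entry).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water for the free N- and C-termini


def parse_sequence(block: str) -> str:
    """Join the 10-residue groups into one continuous string."""
    return "".join(block.split())


def average_mass(seq: str) -> float:
    """Sum average residue masses and add one water molecule."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


sequence = parse_sequence(SEQUENCE_BLOCK)
signal_peptide = sequence[:SIGNAL_PEPTIDE_END]
mature_chain = sequence[SIGNAL_PEPTIDE_END:]

print(len(sequence), len(signal_peptide), len(mature_chain))
print(round(average_mass(sequence)))
```

The first print gives the full length followed by the signal peptide and chain lengths, which can be compared with the 582, 23 and 559 residues in the entry.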

Natural variant

Feature key       Position(s)   Length   Description                                                 Feature identifier
Natural variant   43 – 43       1        Q → K. Corresponds to variant rs2233896.                    VAR_034489
Natural variant   57 – 57       1        T → A. Corresponds to variant rs2233897.                    VAR_034490
Natural variant   274 – 274     1        S → N. Corresponds to variant rs2233901. (1 Publication)    VAR_024630
Natural variant   279 – 279     1        H → Y. Corresponds to variant rs2233903.                    VAR_034491
Natural variant   368 – 368     1        G → R. Corresponds to variant rs2071650. (1 Publication)    VAR_024631
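
The variant table uses one-letter codes ("S → N" at 274), while the publication list refers to the same changes as ASN-274 and ARG-368. As a small sketch of how the two notations relate, the code below converts each row into a three-letter protein-level string in the common HGVS-like pattern and into the residue-position style used in the citations. The helper names are hypothetical, and the HGVS-like strings are a formatting illustration only, not identifiers taken from this entry.

```python
# One-letter to three-letter codes for the residues that occur in the table.
THREE_LETTER = {
    "Q": "Gln", "K": "Lys", "T": "Thr", "A": "Ala", "S": "Ser",
    "N": "Asn", "H": "His", "Y": "Tyr", "G": "Gly", "R": "Arg",
}

# (position, reference residue, variant residue, dbSNP id), copied from the table.
VARIANTS = [
    (43, "Q", "K", "rs2233896"),
    (57, "T", "A", "rs2233897"),
    (274, "S", "N", "rs2233901"),
    (279, "H", "Y", "rs2233903"),
    (368, "G", "R", "rs2071650"),
]


def hgvs_like(position: int, ref: str, alt: str) -> str:
    """Format a substitution as p.<Ref><pos><Alt> with three-letter codes."""
    return f"p.{THREE_LETTER[ref]}{position}{THREE_LETTER[alt]}"


def citation_style(position: int, alt: str) -> str:
    """Format the variant residue as in the 'Cited for' lines, e.g. ASN-274."""
    return f"{THREE_LETTER[alt].upper()}-{position}"


hgvs_names = [hgvs_like(pos, ref, alt) for pos, ref, alt, _ in VARIANTS]
citation_names = [citation_style(pos, alt) for pos, _, alt, _ in VARIANTS]

for name, (_, _, _, rsid) in zip(hgvs_names, VARIANTS):
    print(name, rsid)
```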

Sequence databases

EMBL / GenBank / DDBJ:
M81652 mRNA. Translation: AAA60562.1.
M81651 Genomic DNA. Translation: AAA60313.1.
Z47556 Genomic DNA. Translation: CAA87637.1.
AY259284 Genomic DNA. Translation: AAP86625.1.
AY259285 Genomic DNA. Translation: AAP86626.1.
AY259286 Genomic DNA. Translation: AAP86627.1.
AL049767 Genomic DNA. Translation: CAB53522.1.
CH471077 Genomic DNA. Translation: EAW75870.1.
CCDS: CCDS13346.1.
PIR: A43412.
RefSeq: NP_002999.1. NM_003008.2.
UniGene: Hs.537218.

Genome annotation databases

Ensembl: ENST00000372769; ENSP00000361855; ENSG00000124157.
GeneID: 6407.
KEGG: hsa:6407.
UCSC: uc002xnk.3. human.

Polymorphism and mutation databases

BioMuta: SEMG2.

Keywords - Coding sequence diversity

Polymorphism

Cross-references

Web resources

Protein Spotlight

Shackled sperm - Issue 62 of September 2005

Organism-specific databases

CTD: 6407.
GeneCards: GC20P043849.
HGNC: HGNC:10743. SEMG2.
HPA: HPA042767.
HPA042835.
MIM: 182141. gene.
neXtProt: NX_Q02383.
PharmGKB: PA35665.

Miscellaneous databases

GeneWiki: SEMG2.
GenomeRNAi: 6407.
NextBio: 24896.
PMAP-CutDB: Q02383.
PRO: Q02383.

Publications

  1. "Molecular cloning of epididymal and seminal vesicular transcripts encoding a semenogelin-related protein."
    Lundwall A., Lilja H.
    Proc. Natl. Acad. Sci. U.S.A. 89:4559-4563(1992) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [MRNA].
    Tissue: Seminal vesicle.
  2. "Gene structure of semenogelin I and II. The predominant proteins in human semen are encoded by two homologous genes on chromosome 20."
    Ulvsbaeck M., Lazure C., Lilja H., Spurr N.K., Rao V.V., Loeffler C., Hansmann I., Lundwall A.
    J. Biol. Chem. 267:18080-18084(1992) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
  3. "Evolution of the hominoid semenogelin genes, the major proteins of ejaculated semen."
    Jensen-Seaman M.I., Li W.-H.
    J. Mol. Evol. 57:261-270(2003) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANTS ASN-274 AND ARG-368.
  4. "The DNA sequence and comparative analysis of human chromosome 20."
    Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M., Brown A.J., Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., Rogers J.
    Nature 414:865-871(2001) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  5. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  6. "Isolation and structure determination of two peptides occurring in human seminal plasma."
    Schneider K., Kausler W., Tripier D., Jouvenal K., Spiteller G.
    Biol. Chem. Hoppe-Seyler 370:353-356(1989) [PubMed] [Europe PMC] [Abstract]
    Cited for: PARTIAL PROTEIN SEQUENCE.
  7. "Isolation and characterization of the major gel proteins in human semen, semenogelin I and semenogelin II."
    Malm J., Hellman J., Magnusson H., Laurell C.B., Lilja H.
    Eur. J. Biochem. 238:48-53(1996) [PubMed] [Europe PMC] [Abstract]
    Cited for: CHARACTERIZATION.
  8. "Characterization of semenogelin II and its molecular interaction with prostate-specific antigen and protein C inhibitor."
    Kise H., Nishioka J., Kawamura J., Suzuki K.
    Eur. J. Biochem. 238:88-96(1996) [PubMed] [Europe PMC] [Abstract]
    Cited for: GLYCOSYLATION, INTERACTION WITH SERPINA5.

Entry information

Entry name: SEMG2_HUMAN
Accession: Primary (citable) accession number: Q02383
Secondary accession number(s): Q53ZU2, Q6X2M5, Q6X2M6
Entry history
Integrated into UniProtKB/Swiss-Prot: July 1, 1993
Last sequence update: July 1, 1993
Last modified: April 29, 2015
This is version 118 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Human chromosome 20
    Human chromosome 20: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. Protein Spotlight
    Protein Spotlight articles and cited UniProtKB/Swiss-Prot entries
  6. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
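
The UniRef90 and UniRef50 rules above reduce to two thresholds applied against the seed sequence: a minimum sequence identity and a minimum overlap. The sketch below encodes only that membership test. It is a simplification under stated assumptions: identity and overlap are taken as precomputed fractions, and the actual clustering procedure (alignment, seed selection, ordering) is not described in this entry and is not modelled. `ClusterLevel` and `meets_threshold` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClusterLevel:
    """Thresholds a sequence must meet relative to a cluster's seed sequence."""
    name: str
    min_identity: float  # fraction of identical residues, 0.0 to 1.0
    min_overlap: float   # fraction of the seed sequence covered, 0.0 to 1.0


# Thresholds as stated in the descriptions above.
UNIREF90 = ClusterLevel("UniRef90", min_identity=0.90, min_overlap=0.80)
UNIREF50 = ClusterLevel("UniRef50", min_identity=0.50, min_overlap=0.80)


def meets_threshold(identity: float, overlap: float, level: ClusterLevel) -> bool:
    """True if both the identity and the overlap requirement are satisfied."""
    return identity >= level.min_identity and overlap >= level.min_overlap


# Hypothetical candidate: 60% identical to the seed, covering 90% of it.
print(meets_threshold(0.60, 0.90, UNIREF90))
print(meets_threshold(0.60, 0.90, UNIREF50))
```

A candidate with high identity but short overlap fails both levels, which is why the overlap requirement is listed alongside the identity figure.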