Q8R1W2 (GSG1_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 68.

Names and origin

Protein names
  Recommended name: Germ cell-specific gene 1 protein
  Alternative name(s): Germ cell-associated protein 1
Gene names
  Name: Gsg1
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus › Mus

Protein attributes

Sequence length: 324 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

May cause the redistribution of PAPOLB from the cytosol to the endoplasmic reticulum. Ref.6

Subunit structure

Interacts with PAPOLB. Ref.6

Subcellular location

Endoplasmic reticulum membrane; Multi-pass membrane protein. Note: Colocalizes with PAPOLB in the endoplasmic reticulum. Ref.6

Tissue specificity

Expressed in spermatogenic cells (at protein level). Expressed in germ cells within the testis from day 21 onwards. Ref.1 Ref.6

Sequence similarities

Belongs to the GSG1 family.

Sequence caution

The sequence AAH23009.1 differs from that shown. Reason: Erroneous initiation.

The sequence BAA37087.1 differs from that shown. Reason: Frameshift at several positions.

Binary interactions

With    Entry   #Exp.  IntAct                    Notes
Papolb  Q9WVP6  8      EBI-7842142, EBI-7842113

Alternative products

This entry describes 4 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q8R1W2-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q8R1W2-2)

The sequence of this isoform differs from the canonical sequence as follows:
     125-125: R → RGEKGLLEFATLQGSCHPTLRFGGEWLMEKASLLHLPWGPVAKVF
Isoform 3 (identifier: Q8R1W2-3)

The sequence of this isoform differs from the canonical sequence as follows:
     1-107: Missing.
     108-125: EPGEKCRRFIELTPPAQR → MEKASLLHLPWGPVAKVF
Isoform 4 (identifier: Q8R1W2-4)

The sequence of this isoform differs from the canonical sequence as follows:
     1-4: MAKM → M
     125-125: R → RGEKGLLEFATLQGSCHPTLRFGGEWLMEKASLLHLPWGPVAKVF
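
The isoform descriptions above are edits against the canonical sequence, so each isoform can be rebuilt mechanically. The following is a minimal sketch, assuming the canonical sequence is pasted in by hand from the Sequences section; the names `CANONICAL`, `ISOFORM_EDITS`, `apply_edits`, and `build_isoforms` are illustrative and are not part of any UniProt tool. Edits are applied from the C-terminal end backwards so that earlier positions stay valid.

```python
# Canonical isoform 1 (Q8R1W2-1), copied in 10-residue blocks from the
# Sequences section of this entry; whitespace is stripped by split()/join().
CANONICAL = "".join("""
MAKMEFQKGS SDQRTFISAI LNMLSLGLST ASLLSSEWFV GTQKVPKPLC GQSLAAKCFD
MPMSLDGGIA NTSAQEVVQY TWETGDDRFS FLAFRSGMWL SCEETMEEPG EKCRRFIELT
PPAQRWLSLG AQTAYIGLQL ISFLLLLTDL LLTTNPGCGL KLSAFAAVSL VLSGLLGMVA
HMLYSQVFQA TANLGPEDWR PHSWNYGWAF YTAWVSFTCC MASAVTTFNM YTRMVLEFKC
RHSKSFNTNP SCLAQHHRCF LPPPLTCTTH AGEPLSSCHQ YPSHPIRSVS EAIDLYSALQ
DKEFQQGISQ ELKEVVEPSV EEQR
""".split())

# Replacement for position 125 shared by isoforms 2 and 4.
INSERT_125 = "RGEKGLLEFATLQGSCHPTLRFGGEWLMEKASLLHLPWGPVAKVF"

# Each edit is (start, end, replacement) with 1-based inclusive positions
# on the canonical sequence; an empty replacement means "Missing".
ISOFORM_EDITS = {
    "Q8R1W2-2": [(125, 125, INSERT_125)],
    "Q8R1W2-3": [(1, 107, ""), (108, 125, "MEKASLLHLPWGPVAKVF")],
    "Q8R1W2-4": [(1, 4, "M"), (125, 125, INSERT_125)],
}


def apply_edits(canonical, edits):
    """Apply non-overlapping edits, highest start position first."""
    seq = canonical
    for start, end, replacement in sorted(edits, reverse=True):
        seq = seq[:start - 1] + replacement + seq[end:]
    return seq


def build_isoforms(canonical=CANONICAL):
    """Return a dict mapping isoform identifier to its derived sequence."""
    return {name: apply_edits(canonical, edits)
            for name, edits in ISOFORM_EDITS.items()}


if __name__ == "__main__":
    for name, seq in build_isoforms().items():
        print(name, len(seq))
```

The derived lengths can be compared with the lengths listed for isoforms 2, 3, and 4 in the Sequences section as a sanity check on the pasted sequence.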

Sequence annotation (Features)

Feature key           Position(s)  Length  Description  [Feature identifier]

Molecule processing

Chain                 1 – 324      324     Germ cell-specific gene 1 protein  [PRO_0000329462]

Regions

Transmembrane         15 – 35      21      Helical; Potential
Transmembrane         127 – 147    21      Helical; Potential
Transmembrane         164 – 184    21      Helical; Potential
Transmembrane         208 – 228    21      Helical; Potential

Natural variations

Alternative sequence  1 – 107      107     Missing in isoform 3.  [VSP_032999]
Alternative sequence  1 – 4        4       MAKM → M in isoform 4.  [VSP_032998]
Alternative sequence  108 – 125    18      EPGEK…PPAQR → MEKASLLHLPWGPVAKVF in isoform 3.  [VSP_033000]
Alternative sequence  125          1       R → RGEKGLLEFATLQGSCHPTLRFGGEWLMEKASLLHLPWGPVAKVF in isoform 2 and isoform 4.  [VSP_033001]

Experimental info

Sequence conflict     178          1       M → I in BAC40258. Ref.3
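
Feature positions in this entry are 1-based and inclusive, which differs from the 0-based, end-exclusive slicing used by most programming languages. As a small sketch of the conversion, assuming only the first 60 residues of isoform 1 are needed, the helper below (the name `feature_slice` is hypothetical) extracts the first predicted transmembrane segment at positions 15 to 35.

```python
# First 60 residues of isoform 1, copied from the Sequences section.
N_TERMINUS = "MAKMEFQKGSSDQRTFISAILNMLSLGLSTASLLSSEWFVGTQKVPKPLCGQSLAAKCFD"


def feature_slice(sequence, start, end):
    """Return residues start..end, using 1-based inclusive positions."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"positions {start}-{end} fall outside the sequence")
    # Shift only the start: Python's end index is already exclusive.
    return sequence[start - 1:end]


if __name__ == "__main__":
    tm1 = feature_slice(N_TERMINUS, 15, 35)
    print(tm1, len(tm1))
```

The length of the returned segment should match the Length column of the feature table (21 for each transmembrane region).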

Sequences

Isoform 1
Length: 324 AA. Mass: 36,108 Da.
Last modified April 29, 2008. Version 2.
Checksum: 9AB3A42C147F2B90

        10         20         30         40         50         60 
MAKMEFQKGS SDQRTFISAI LNMLSLGLST ASLLSSEWFV GTQKVPKPLC GQSLAAKCFD 

        70         80         90        100        110        120 
MPMSLDGGIA NTSAQEVVQY TWETGDDRFS FLAFRSGMWL SCEETMEEPG EKCRRFIELT 

       130        140        150        160        170        180 
PPAQRWLSLG AQTAYIGLQL ISFLLLLTDL LLTTNPGCGL KLSAFAAVSL VLSGLLGMVA 

       190        200        210        220        230        240 
HMLYSQVFQA TANLGPEDWR PHSWNYGWAF YTAWVSFTCC MASAVTTFNM YTRMVLEFKC 

       250        260        270        280        290        300 
RHSKSFNTNP SCLAQHHRCF LPPPLTCTTH AGEPLSSCHQ YPSHPIRSVS EAIDLYSALQ 

       310        320 
DKEFQQGISQ ELKEVVEPSV EEQR 

Isoform 2
Length: 368 AA. Mass: 40,943 Da.
Checksum: 0ADF2ADD7DC5E4C3

Isoform 3
Length: 217 AA. Mass: 24,215 Da.
Checksum: C9775ACA6478AF1B

Isoform 4
Length: 365 AA. Mass: 40,613 Da.
Checksum: 2D0A0AEC0EA971FD

References

[1] "Isolation and characterization of cDNA clones specifically expressed in testicular germ cells."
Tanaka H., Yoshimura Y., Nishina Y., Nozaki M., Nojima H., Nishimune Y.
FEBS Lett. 355:4-10(1994)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 4), TISSUE SPECIFICITY.
Strain: C57BL/6.
Tissue: Testis.
[2] "Mapping of six germ-cell-specific genes to mouse chromosomes."
Matsui M., Ichihara H., Kobayashi S., Tanaka H., Tsuchida J., Nozaki M., Yoshimura Y., Nojima H., Rochelle J.M., Nishimune Y., Taketo M.M., Seldin M.F.
Mamm. Genome 8:873-874(1997)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 4).
Strain: C57BL/6.
Tissue: Testis.
[3] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
Strain: C57BL/6J and NOD.
Tissue: Testis and Thymus.
[4] "Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S., Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[5] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Tissue: Eye.
[6] "Germ cell-specific gene 1 targets testis-specific poly(A) polymerase to the endoplasmic reticulum through protein-protein interactions."
Choi H.-S., Lee S.-H., Kim H., Lee Y.
FEBS Lett. 582:1203-1209(2008)
Cited for: FUNCTION, INTERACTION WITH PAPOLB, SUBCELLULAR LOCATION, TISSUE SPECIFICITY.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  D87325 mRNA. Translation: BAA37087.1. Frameshift.
  AK006326 mRNA. Translation: BAB24527.1.
  AK088285 mRNA. Translation: BAC40258.1.
  AC122820 Genomic DNA. No translation available.
  BC023009 mRNA. Translation: AAH23009.1. Different initiation.
RefSeq:
  NP_001074021.1. NM_001080552.1.
  NP_001074022.1. NM_001080553.1.
  NP_034482.2. NM_010352.2.
UniGene: Mm.272306.

Protein-protein interaction databases

BioGrid: 200083. 1 interaction.
IntAct: Q8R1W2. 3 interactions.
MINT: MINT-6167922.

PTM databases

PhosphoSite: Q8R1W2.

Proteomic databases

PRIDE: Q8R1W2.

Genome annotation databases

Ensembl:
  ENSMUST00000087729; ENSMUSP00000085022; ENSMUSG00000030206. [Q8R1W2-2]
  ENSMUST00000111910; ENSMUSP00000107541; ENSMUSG00000030206. [Q8R1W2-4]
  ENSMUST00000111911; ENSMUSP00000107542; ENSMUSG00000030206. [Q8R1W2-4]
GeneID: 14840.
KEGG: mmu:14840.
UCSC:
  uc009elj.1. mouse. [Q8R1W2-2]
  uc009elk.1. mouse. [Q8R1W2-4]

Organism-specific databases

CTD: 83445.
MGI: MGI:1194499. Gsg1.

Phylogenomic databases

eggNOG: NOG47621.
GeneTree: ENSGT00390000011933.
HOGENOM: HOG000112826.
HOVERGEN: HBG069545.
OMA: WNYGWAF.
OrthoDB: EOG715Q4M.
PhylomeDB: Q8R1W2.
TreeFam: TF331388.

Gene expression databases

Bgee: Q8R1W2.
Genevestigator: Q8R1W2.

Family and domain databases

InterPro: IPR012478. GSG-1.
Pfam: PF07803. GSG-1. 1 hit.

Other

NextBio: 287055.
PRO: Q8R1W2.

Entry information

Entry name: GSG1_MOUSE
Accession
  Primary (citable) accession number: Q8R1W2
  Secondary accession number(s): Q8C2N5, Q9D9Z3, Q9Z1H7
Entry history
  Integrated into UniProtKB/Swiss-Prot: April 29, 2008
  Last sequence update: April 29, 2008
  Last modified: April 16, 2014
  This is version 68 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot