Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Glycinin G2

Gene

Gy2

Organism
Glycine max (Soybean) (Glycine hispida)
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Experimental evidence at protein leveli

Functioni

Glycinin is the major seed storage protein of soybean.

GO - Molecular functioni

Complete GO annotation...

Keywords - Molecular functioni

Seed storage protein, Storage protein

Names & Taxonomyi

Protein namesi
Recommended name:
Glycinin G2
Cleaved into the following 2 chains:
Gene namesi
Name:Gy2
OrganismiGlycine max (Soybean) (Glycine hispida)
Taxonomic identifieri3847 [NCBI]
Taxonomic lineageiEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonsGunneridaePentapetalaerosidsfabidsFabalesFabaceaePapilionoideaePhaseoleaeGlycineSoja
Proteomesi
  • UP000008827 Componenti: Unplaced

Pathology & Biotechi

Protein family/group databases

Allergomei1143. Gly m 6.0201.
5821. Gly m 6.

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 181 PublicationAdd BLAST18
ChainiPRO_000003201319 – 296Glycinin A2 subunitAdd BLAST278
PropeptideiPRO_0000032014297 – 3001 Publication4
ChainiPRO_0000032015301 – 480Glycinin B1a subunitAdd BLAST180
PropeptideiPRO_0000032016481 – 4855

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Disulfide bondi28 ↔ 61By similarity
Disulfide bondi104 ↔ 307Interchain (between A2 and B1a chains)1 Publication

Keywords - PTMi

Disulfide bond

Proteomic databases

PRIDEiP04405.

Interactioni

Subunit structurei

Hexamer; each subunit is composed of an acidic and a basic chain derived from a single precursor and linked by a disulfide bond.

Structurei

3D structure databases

ProteinModelPortaliP04405.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

Keywords - Domaini

Signal

Phylogenomic databases

InParanoidiP04405.

Family and domain databases

Gene3Di2.60.120.10. 3 hits.
InterProiIPR022379. 11S_seedstore_CS.
IPR006044. 11S_seedstore_pln.
IPR006045. Cupin_1.
IPR014710. RmlC-like_jellyroll.
IPR011051. RmlC_Cupin.
[Graphical view]
PfamiPF00190. Cupin_1. 2 hits.
[Graphical view]
PRINTSiPR00439. 11SGLOBULIN.
SMARTiSM00835. Cupin_1. 2 hits.
[Graphical view]
SUPFAMiSSF51182. SSF51182. 3 hits.
PROSITEiPS00305. 11S_SEED_STORAGE. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P04405-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MAKLVLSLCF LLFSGCFALR EQAQQNECQI QKLNALKPDN RIESEGGFIE
60 70 80 90 100
TWNPNNKPFQ CAGVALSRCT LNRNALRRPS YTNGPQEIYI QQGNGIFGMI
110 120 130 140 150
FPGCPSTYQE PQESQQRGRS QRPQDRHQKV HRFREGDLIA VPTGVAWWMY
160 170 180 190 200
NNEDTPVVAV SIIDTNSLEN QLDQMPRRFY LAGNQEQEFL KYQQQQQGGS
210 220 230 240 250
QSQKGKQQEE ENEGSNILSG FAPEFLKEAF GVNMQIVRNL QGENEEEDSG
260 270 280 290 300
AIVTVKGGLR VTAPAMRKPQ QEEDDDDEEE QPQCVETDKG CQRQSKRSRN
310 320 330 340 350
GIDETICTMR LRQNIGQNSS PDIYNPQAGS ITTATSLDFP ALWLLKLSAQ
360 370 380 390 400
YGSLRKNAMF VPHYTLNANS IIYALNGRAL VQVVNCNGER VFDGELQEGG
410 420 430 440 450
VLIVPQNFAV AAKSQSDNFE YVSFKTNDRP SIGNLAGANS LLNALPEEVI
460 470 480
QHTFNLKSQQ ARQVKNNNPF SFLVPPQESQ RRAVA
Length:485
Mass (Da):54,391
Last modified:October 1, 1989 - v2
Checksum:i78BB459837F77AD8
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti39D → G in CAA26575 (Ref. 5) Curated1
Sequence conflicti39D → N AA sequence (PubMed:6541652).Curated1
Sequence conflicti61C → S AA sequence (PubMed:6541652).Curated1
Sequence conflicti117R → C AA sequence (PubMed:6541652).Curated1
Sequence conflicti343W → S AA sequence (PubMed:6541652).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural varianti103G → D.1
Natural varianti318N → T.1
Natural varianti331I → V.1
Natural varianti413K → R.1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
X15122 Genomic DNA. Translation: CAA33216.1.
D00216 mRNA. Translation: BAA00154.1.
Y00398 Genomic DNA. Translation: CAA68460.1.
X02806 mRNA. Translation: CAA26575.1.
K02646 Genomic DNA. Translation: AAA33963.1.
X53404 Genomic DNA. Translation: CAA37480.1.
PIRiA91341. FWSYG1.
S11002.
RefSeqiNP_001235810.1. NM_001248881.1.
UniGeneiGma.1857.

Genome annotation databases

GeneIDi547900.
KEGGigmx:547900.

Keywords - Coding sequence diversityi

Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
X15122 Genomic DNA. Translation: CAA33216.1.
D00216 mRNA. Translation: BAA00154.1.
Y00398 Genomic DNA. Translation: CAA68460.1.
X02806 mRNA. Translation: CAA26575.1.
K02646 Genomic DNA. Translation: AAA33963.1.
X53404 Genomic DNA. Translation: CAA37480.1.
PIRiA91341. FWSYG1.
S11002.
RefSeqiNP_001235810.1. NM_001248881.1.
UniGeneiGma.1857.

3D structure databases

ProteinModelPortaliP04405.
ModBaseiSearch...
MobiDBiSearch...

Protein family/group databases

Allergomei1143. Gly m 6.0201.
5821. Gly m 6.

Proteomic databases

PRIDEiP04405.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

GeneIDi547900.
KEGGigmx:547900.

Phylogenomic databases

InParanoidiP04405.

Family and domain databases

Gene3Di2.60.120.10. 3 hits.
InterProiIPR022379. 11S_seedstore_CS.
IPR006044. 11S_seedstore_pln.
IPR006045. Cupin_1.
IPR014710. RmlC-like_jellyroll.
IPR011051. RmlC_Cupin.
[Graphical view]
PfamiPF00190. Cupin_1. 2 hits.
[Graphical view]
PRINTSiPR00439. 11SGLOBULIN.
SMARTiSM00835. Cupin_1. 2 hits.
[Graphical view]
SUPFAMiSSF51182. SSF51182. 3 hits.
PROSITEiPS00305. 11S_SEED_STORAGE. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiGLYG2_SOYBN
AccessioniPrimary (citable) accession number: P04405
Secondary accession number(s): P04121, P04348, P04349
Entry historyi
Integrated into UniProtKB/Swiss-Prot: March 20, 1987
Last sequence update: October 1, 1989
Last modified: October 5, 2016
This is version 106 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programPlant Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.