UniProt

Q12491 - YB21B_YEAST


Protein: Transposon Ty2-B Gag-Pol polyprotein
Gene: TY2B-B, YBLWTy2-1 POL, YBL100W-B, YBL0822, YBL101W-B
Organism: Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast)
Status: Reviewed - Annotation score: 5 out of 5 - Protein inferred from homology

Function

Capsid protein (CA) is the structural component of the virus-like particle (VLP), forming the shell that encapsulates the retrotransposon's dimeric RNA genome. The particles are assembled from trimer-clustered units, and holes in the capsid shells allow for the diffusion of macromolecules. CA also has nucleocapsid-like chaperone activity, promoting primer tRNA(i)-Met annealing to the multipartite primer-binding site (PBS), dimerization of Ty2 RNA and initiation of reverse transcription (By similarity).
The aspartyl protease (PR) mediates the proteolytic cleavages of the Gag and Gag-Pol polyproteins after assembly of the VLP (By similarity).
Reverse transcriptase/ribonuclease H (RT) is a multifunctional enzyme that catalyzes the conversion of the retro-element's RNA genome into dsDNA within the VLP. The enzyme displays a DNA polymerase activity that can copy either DNA or RNA templates, and a ribonuclease H (RNase H) activity that cleaves the RNA strand of RNA-DNA heteroduplexes during plus-strand synthesis and hydrolyzes RNA primers. The conversion leads to a linear dsDNA copy of the retrotransposon that includes long terminal repeats (LTRs) at both ends (By similarity).
Integrase (IN) targets the VLP to the nucleus, where a subparticle preintegration complex (PIC) containing at least integrase and the newly synthesized dsDNA copy of the retrotransposon must transit the nuclear membrane. Once in the nucleus, integrase performs the integration of the dsDNA into the host genome (By similarity).

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1).
Endonucleolytic cleavage to 5'-phosphomonoester.

Sites

| Feature key | Position(s) | Length | Description |
| --- | --- | --- | --- |
| Site | 397 – 398 | 2 | Cleavage; by Ty2 protease (By similarity) |
| Active site | 457 | 1 | For protease activity; shared with dimeric partner (By similarity) |
| Site | 578 – 579 | 2 | Cleavage; by Ty2 protease (By similarity) |
| Metal binding | 667 | 1 | Magnesium; catalytic; for integrase activity (By similarity) |
| Metal binding | 732 | 1 | Magnesium; catalytic; for integrase activity (By similarity) |
| Site | 1232 – 1233 | 2 | Cleavage; by Ty2 protease (By similarity) |
| Metal binding | 1361 | 1 | Magnesium; catalytic; for reverse transcriptase activity (By similarity) |
| Metal binding | 1442 | 1 | Magnesium; catalytic; for reverse transcriptase activity (By similarity) |
| Metal binding | 1443 | 1 | Magnesium; catalytic; for reverse transcriptase activity (By similarity) |
| Metal binding | 1625 | 1 | Magnesium; catalytic; for RNase H activity (By similarity) |
| Metal binding | 1667 | 1 | Magnesium; catalytic; for RNase H activity (By similarity) |
| Metal binding | 1700 | 1 | Magnesium; catalytic; for RNase H activity (By similarity) |

GO - Molecular function

  1. aspartic-type endopeptidase activity Source: UniProtKB-KW
  2. ATP binding Source: UniProtKB-KW
  3. DNA binding Source: UniProtKB-KW
  4. DNA-directed DNA polymerase activity Source: SGD
  5. metal ion binding Source: UniProtKB-KW
  6. peptidase activity Source: SGD
  7. ribonuclease activity Source: SGD
  8. RNA binding Source: SGD
  9. RNA-directed DNA polymerase activity Source: SGD
  10. RNA-DNA hybrid ribonuclease activity Source: UniProtKB-EC

GO - Biological process

  1. DNA-dependent DNA replication Source: GOC
  2. DNA integration Source: UniProtKB-KW
  3. DNA recombination Source: UniProtKB-KW
  4. RNA-dependent DNA replication Source: GOC
  5. RNA phosphodiester bond hydrolysis Source: GOC
  6. transposition, RNA-mediated Source: SGD
  7. viral release from host cell Source: UniProtKB-KW

Keywords - Molecular function

Aspartyl protease, DNA-directed DNA polymerase, Endonuclease, Hydrolase, Nuclease, Nucleotidyltransferase, Protease, RNA-directed DNA polymerase, Transferase

Keywords - Biological process

DNA integration, DNA recombination, Transposition, Virion maturation, Virus exit from host cell

Keywords - Ligand

ATP-binding, DNA-binding, Magnesium, Metal-binding, Nucleotide-binding, RNA-binding, Zinc

Names & Taxonomy

Protein names
Recommended name: Transposon Ty2-B Gag-Pol polyprotein
Alternative name(s): TY2A-TY2B; Transposon Ty2 TYA-TYB polyprotein
Cleaved into the following 4 chains:
  1. Capsid protein (short name: CA)
  2. Ty2 protease (EC:3.4.23.-) (short name: PR)
  3. Integrase (short name: IN)
  4. Reverse transcriptase/ribonuclease H (EC:2.7.7.49, EC:2.7.7.7, EC:3.1.26.4) (short names: RT, RT-RH)

Gene names
Name: TY2B-B
Synonyms: YBLWTy2-1 POL
Ordered Locus Names: YBL100W-B
ORF Names: YBL0822, YBL101W-B

Organism: Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast)
Taxonomic identifier: 559292 [NCBI]
Taxonomic lineage: Eukaryota > Fungi > Dikarya > Ascomycota > Saccharomycotina > Saccharomycetes > Saccharomycetales > Saccharomycetaceae > Saccharomyces
Proteomes: UP000002311: Chromosome II

Organism-specific databases

CYGD: YBL100w-b.
SGD: S000002149. YBL100W-B.

Subcellular location

Cytoplasm. Nucleus (By similarity).

GO - Cellular component

  1. cytoplasm Source: UniProtKB-SubCell
  2. nucleus Source: SGD
  3. retrotransposon nucleocapsid Source: SGD

Keywords - Cellular component

Cytoplasm, Nucleus

PTM / Processing

Molecule processing

| Feature key | Position(s) | Length | Description | Feature identifier |
| --- | --- | --- | --- | --- |
| Chain | 1 – 1770 | 1770 | Transposon Ty2-B Gag-Pol polyprotein | PRO_0000279270 |
| Chain | 1 – 397 | 397 | Capsid protein (By similarity) | PRO_0000279271 |
| Chain | 398 – 578 | 181 | Ty2 protease (By similarity) | PRO_0000279272 |
| Chain | 579 – 1232 | 654 | Integrase (By similarity) | PRO_0000279273 |
| Chain | 1233 – 1770 | 538 | Reverse transcriptase/ribonuclease H (By similarity) | PRO_0000279274 |
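
The chain boundaries listed here follow directly from the three annotated protease cleavage sites. The sketch below is illustrative only: the function and constant names are hypothetical, and it assumes each site is given as the pair of residues flanking the cleaved bond, numbered on the 1770-residue polyprotein.

```python
# Cleavage sites as annotated in the Sites table: each pair (p, p + 1) gives
# the residues on either side of a cleaved bond in the polyprotein.
POLYPROTEIN_LENGTH = 1770
CLEAVAGE_SITES = [(397, 398), (578, 579), (1232, 1233)]
CHAIN_NAMES = [
    "Capsid protein",
    "Ty2 protease",
    "Integrase",
    "Reverse transcriptase/ribonuclease H",
]


def chains_from_cleavage_sites(length, sites, names):
    """Return (name, start, end, size) tuples; positions are 1-based, inclusive."""
    starts = [1] + [right for _, right in sites]
    ends = [left for left, _ in sites] + [length]
    return [(name, s, e, e - s + 1) for name, s, e in zip(names, starts, ends)]


chains = chains_from_cleavage_sites(POLYPROTEIN_LENGTH, CLEAVAGE_SITES, CHAIN_NAMES)
for name, start, end, size in chains:
    print(f"{name}: {start}-{end} ({size} aa)")
```

The four computed spans and lengths match the Chain rows of the entry (397, 181, 654 and 538 residues).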

Post-translational modification

Initially, virus-like particles (VLPs) are composed of the structural unprocessed proteins Gag and Gag-Pol, and also contain the host initiator methionine tRNA (tRNA(i)-Met), which serves as a primer for minus-strand DNA synthesis, and a dimer of genomic Ty RNA. Processing of the polyproteins occurs within the particle and proceeds by an ordered pathway, called maturation. First, the protease (PR) is released by autocatalytic cleavage of the Gag-Pol polyprotein, and this cleavage is a prerequisite for subsequent processing at the remaining sites to release the mature structural and catalytic proteins. Maturation takes place prior to the RT reaction and is required to produce transposition-competent VLPs (By similarity).

Expression

Gene expression databases

Genevestigator: Q12491.

Interaction

Subunit structure

The capsid protein forms a homotrimer, from which the VLPs are assembled. The protease is a homodimer, whose active site consists of two apposed aspartic acid residues (By similarity).

Protein-protein interaction databases

BioGrid: 32604. 1 interaction.

Structure

3D structure databases

ProteinModelPortal: Q12491.
SMR: Q12491. Positions 632-833.

Family & Domains

Domains and Repeats

| Feature key | Position(s) | Length | Description |
| --- | --- | --- | --- |
| Domain | 656 – 831 | 176 | Integrase catalytic |
| Domain | 1353 – 1491 | 139 | Reverse transcriptase Ty1/copia-type |
| Domain | 1625 – 1767 | 143 | RNase H Ty1/copia-type |

Region

| Feature key | Position(s) | Length | Description |
| --- | --- | --- | --- |
| Region | 295 – 397 | 103 | RNA-binding (By similarity) |
| Region | 579 – 636 | 58 | Integrase-type zinc finger-like |

Motif

| Feature key | Position(s) | Length | Description |
| --- | --- | --- | --- |
| Motif | 1193 – 1227 | 35 | Bipartite nuclear localization signal (By similarity) |
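
Because every position in this entry refers to the unprocessed polyprotein, it can help to see which mature chain each domain, region or motif ends up in. The following is a minimal sketch under that assumption; the dictionary layout and the `containing_chain` helper are hypothetical, and the coordinates are copied from the feature tables of this entry.

```python
# Mature chains (1-based, inclusive) from the Molecule processing table.
CHAINS = {
    "Capsid protein": (1, 397),
    "Ty2 protease": (398, 578),
    "Integrase": (579, 1232),
    "Reverse transcriptase/ribonuclease H": (1233, 1770),
}

# Domains, regions and motifs from the Family & Domains tables.
FEATURES = {
    "RNA-binding region": (295, 397),
    "Integrase-type zinc finger-like region": (579, 636),
    "Integrase catalytic domain": (656, 831),
    "Bipartite nuclear localization signal": (1193, 1227),
    "Reverse transcriptase Ty1/copia-type domain": (1353, 1491),
    "RNase H Ty1/copia-type domain": (1625, 1767),
}


def containing_chain(start, end, chains=CHAINS):
    """Return the chain that fully contains [start, end], or None."""
    for name, (chain_start, chain_end) in chains.items():
        if chain_start <= start and end <= chain_end:
            return name
    return None


assignment = {name: containing_chain(*span) for name, span in FEATURES.items()}
for feature, chain in assignment.items():
    print(f"{feature} -> {chain}")
```

Under these coordinates, the RNA-binding region falls in the capsid protein, the zinc finger-like region, catalytic domain and nuclear localization signal fall in integrase, and the remaining two domains fall in the RT/RNase H chain.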

Domain

The C-terminal RNA-binding region of CA is sufficient for all its nucleocapsid-like chaperone activities (By similarity).
The integrase core domain contains the D-x(n)-D-x(35)-E motif, named for the phylogenetically conserved glutamic acid and aspartic acid residues and the invariant 35-amino-acid spacing between the second and third acidic residues. Each acidic residue of the D,D(35)E motif is independently essential for the 3'-processing and strand transfer activities of purified integrase protein (By similarity).
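
The fixed 35-residue spacing can be checked against the canonical sequence of this entry. The sketch below is an illustration, not an annotation method: `find_d35e_pairs` is a hypothetical helper, and the fragment is residues 651-800 copied from the sequence block of this entry. It only reports Asp/Glu pairs with the right spacing; reading Glu-768 as the third motif residue is an inference from that spacing, since the entry itself annotates only Asp-667 and Asp-732 as integrase metal-binding residues.

```python
# Residues 651-800 of the canonical sequence, copied from the sequence block
# (the spaces are only the 10-residue grouping and are stripped below).
FRAGMENT_START = 651
FRAGMENT = (
    "RLKYQESYEP FQYLHTDIFG PVHHLPKSAP SYFISFTDEK TRFQWVYPLH"
    "DRREESILNV FTSILAFIKN QFNARVLVIQ MDRGSEYTNK TLHKFFTNRG"
    "ITACYTTTAD SRAHGVAERL NRTLLNDCRT LLHCSGLPNH LWFSAVEFST"
).replace(" ", "")


def find_d35e_pairs(seq, offset=1, spacing=35):
    """Return 1-based (Asp, Glu) positions with `spacing` residues in between."""
    step = spacing + 1  # distance between the two positions themselves
    return [
        (i + offset, i + step + offset)
        for i in range(len(seq) - step)
        if seq[i] == "D" and seq[i + step] == "E"
    ]


pairs = find_d35e_pairs(FRAGMENT, offset=FRAGMENT_START)
print(pairs)
```

Within this fragment the only pair with the required spacing is Asp-732 and Glu-768, with Asp-667 lying upstream at the variable x(n) distance.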

Keywords - Domain

Zinc-finger

Phylogenomic databases

GeneTree: ENSGT00730000111602.
HOGENOM: HOG000280731.
OMA: ILVMEIS.
OrthoDB: EOG7TJ3S3.

Family and domain databases

Gene3D: 3.30.420.10. 1 hit.
InterPro:
  IPR025724. GAG-pre-integrase_dom.
  IPR001584. Integrase_cat-core.
  IPR015820. Retrotransposon_Ty1A_N.
  IPR012337. RNaseH-like_dom.
  IPR013103. RVT_2.
Pfam:
  PF13976. gag_pre-integrs. 1 hit.
  PF00665. rve. 1 hit.
  PF07727. RVT_2. 1 hit.
  PF01021. TYA. 1 hit.
SUPFAM: SSF53098. SSF53098. 1 hit.
PROSITE: PS50994. INTEGRASE. 1 hit.

Sequences (2)

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

This entry describes 2 isoforms produced by ribosomal frameshifting.

Note: The Gag-Pol polyprotein is generated by a +1 ribosomal frameshift.

Isoform Transposon Ty2-B Gag-Pol polyprotein (identifier: Q12491-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

MESQQLHQNP HSLHGSAYAS VTSKEVSSNQ DPLAVSASNL PEFDRDSTKV     50
NSQQETTPGT SAVPENHHHV SPQPASVPPP QNGQYQQHGM MTPNKAMASN 100
WAHYQQPSMM TCSHYQTSPA YYQPDPHYPL PQYIPPLSTS SPDPIDSQDQ 150
HSEVPQAKTK VRNNVLPPHT LTSEENFYTW VKFYIRFLKN SNLGDIIPND 200
QGEIKRQMTY EEHAYIYNTF QAFAPFHLLP TWVKQILEIN YADILTVLCK 250
SVSKMQTNNQ ELKDWIALAN LEYDGSTSAD TFEITVSTII QRLKENNINV 300
SDRLACQLIL KGLSGDFKYL RNQYRTKTNM KLSQLFAEIQ LIYDENKIMN 350
LNKPSQYKQH SEYKNVSRTS PNTTNTKVTT RNYHRTNSSK PRAAKAHNIA 400
TSSKFSRVNN DHINESTVSS QYLSDDNELS LGQQQKESKP THTIDSNDEL 450
PDHLLIDSGA SQTLVRSAHY LHHATPNSEI NIVDAQKQDI PINAIGNLHF 500
NFQNGTKTSI KALHTPNIAY DLLSLSELAN QNITACFTRN TLERSDGTVL 550
APIVKHGDFY WLSKKYLIPS HISKLTINNV NKSKSVNKYP YPLIHRMLGH 600
ANFRSIQKSL KKNAVTYLKE SDIEWSNAST YQCPDCLIGK STKHRHVKGS 650
RLKYQESYEP FQYLHTDIFG PVHHLPKSAP SYFISFTDEK TRFQWVYPLH 700
DRREESILNV FTSILAFIKN QFNARVLVIQ MDRGSEYTNK TLHKFFTNRG 750
ITACYTTTAD SRAHGVAERL NRTLLNDCRT LLHCSGLPNH LWFSAVEFST 800
IIRNSLVSPK NDKSARQHAG LAGLDITTIL PFGQPVIVNN HNPDSKIHPR 850
GIPGYALHPS RNSYGYIIYL PSLKKTVDTT NYVILQDNQS KLDQFNYDTL 900
TFDDDLNRLT AHNQSFIEQN ETEQSYDQNT ESDHDYQSEI EINSDPLVND 950
FSSQSMNPLQ LDHEPVQKVR ALKEVDADIS EYNILPSPVR SRTPHIINKE 1000
STEMGGTIES DTTSPRHSST FTARNQKRPG SPNDMIDLTS QDRVNYELEN 1050
IKTTRLGGTE EPYIQRNSDT NIKYRTTNST PSIDDRSPDS DSTTPIISIE 1100
TKAACDNTPS IDTDPPEYRS SDHATPNIMP DKSSKNVTAD SILDDLPLPD 1150
LTNKSPTDTS DVSKDIPHIH SRQTNSSLGG MDDSNVLTTT KSKKRSLEDN 1200
ETEIEVSRDT WNNKNMRSLE PPRSKKRINL IAAIKGVKSI KPVRTTLRYD 1250
EAITYNEDNK EKDRYIEAYH KEINQLLRMN TWDTNKYYDR NDIDPKKVIN 1300
SMFIFNKKRD GTHKARFVAR GDIQHPDTYD SDMQSNTVHH YALMTSLSIA 1350
LDNDYYITQL DISSAYLYAD IKEELYIRPP PHLGLNDKLL RLRKSLYGLK 1400
QSGANWYETI KSYLINCCDM QEVRGWSCVF KNSQVTICLF VDDMILFSKD 1450
LNANEKIITT LKKQYDTKII NLGESDNEIQ YDILGLEIKY QRSKYMKLGM 1500
EKSLTEKLPK LNVPLNPKGK KLSAPGQPGH YIDQDELEID EDEYKEKVHE 1550
MQKLIGLASY VGYKFRFDLL YYINTLAQHI LFPSRQVLDM TYELIQFMWD 1600
TRDKQLIWHK NKPTKPDNKL VAISDASYGN QPYYKSQIGN IFLLNGKVIG 1650
GKSTKASLTC TSTTEAEIHA VSEAIPLLNN LSHLVQELNK KPIIKGLLTD 1700
SRSTISIIKS TNEEKFRNRF FGTKAMRLRD EVSGNNLYVY YIETKKNIAD 1750
VMTKPLPIKT FKLLTNKWIH 1770

Note: Produced by +1 ribosomal frameshifting between codon Leu-431 and Gly-432 of the YBL100W-A ORF.
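
The sequence above is laid out in groups of 10 residues with a running residue count at the end of each line. A minimal sketch of turning that layout into a plain string follows; `parse_sequence_block` is a hypothetical helper, and the two sample lines are the first and last lines of the block (not contiguous), used only to exercise the parser.

```python
def parse_sequence_block(lines):
    """Join grouped sequence lines, dropping spaces and the trailing residue count."""
    groups = []
    for line in lines:
        for token in line.split():
            if token.isdigit():
                continue  # running residue count at the end of the line
            groups.append(token)
    return "".join(groups)


# First and last lines of the displayed block; the 34 lines in between are omitted.
first_and_last_lines = [
    "MESQQLHQNP HSLHGSAYAS VTSKEVSSNQ DPLAVSASNL PEFDRDSTKV     50",
    "VMTKPLPIKT FKLLTNKWIH 1770",
]
sample = parse_sequence_block(first_and_last_lines)
print(len(sample), sample[:10], sample[-4:])
```

Applied to all lines of the block, the same function should give a string whose length can be compared with the stated length of 1,770 residues.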

Length: 1,770
Mass (Da): 202,215
Last modified: November 1, 1996 - v1
Checksum: EAE87178C187B31B

Isoform Transposon Ty2-B Gag polyprotein (identifier: Q12260-1)

The sequence of this isoform can be found in the external entry Q12260.
Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.

Note: Produced by conventional translation.

Length: 438
Mass (Da): 49,898

Sequence caution

The sequence CAA55998.1 differs from that shown. Reason: Erroneous gene model prediction.

Sequence databases

EMBL / GenBank / DDBJ:
  X79489 Genomic DNA. Translation: CAA55998.1. Sequence problems.
  Z35861 Genomic DNA. Translation: CAA84922.1.
  Z35862 Genomic DNA. Translation: CAA84927.2.
  BK006936 Genomic DNA. Translation: DAA07024.1.
PIR: S45842.
RefSeq: NP_009450.1. NM_001180050.1. [Q12491-1]

Genome annotation databases

EnsemblFungi: YBL100W-B; YBL100W-B; YBL100W-B. [Q12491-1]
GeneID: 852175.
KEGG: sce:YBL100W-B.

Keywords - Coding sequence diversity

Ribosomal frameshifting

Cross-references

Miscellaneous databases

NextBio: 970632.

Publications

  1. "Sequence analysis of a 78.6 kb segment of the left end of Saccharomyces cerevisiae chromosome II."
    Obermaier B., Gassenhuber J., Piravandi E., Domdey H.
    Yeast 11:1103-1112(1995)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
    Strain: ATCC 204508 / S288c.
  2. "Complete DNA sequence of yeast chromosome II."
    Feldmann H., Aigle M., Aljinovic G., Andre B., Baclet M.C., Barthe C., Baur A., Becam A.-M., Biteau N., Boles E., Brandt T., Brendel M., Brueckner M., Bussereau F., Christiansen C., Contreras R., Crouzet M., Cziepluch C., Demolis N., Delaveau T., Doignon F., Domdey H., Duesterhus S., Dubois E., Dujon B., El Bakkoury M., Entian K.-D., Feuermann M., Fiers W., Fobo G.M., Fritz C., Gassenhuber J., Glansdorff N., Goffeau A., Grivell L.A., de Haan M., Hein C., Herbert C.J., Hollenberg C.P., Holmstroem K., Jacq C., Jacquet M., Jauniaux J.-C., Jonniaux J.-L., Kallesoee T., Kiesau P., Kirchrath L., Koetter P., Korol S., Liebl S., Logghe M., Lohan A.J.E., Louis E.J., Li Z.Y., Maat M.J., Mallet L., Mannhaupt G., Messenguy F., Miosga T., Molemans F., Mueller S., Nasr F., Obermaier B., Perea J., Pierard A., Piravandi E., Pohl F.M., Pohl T.M., Potier S., Proft M., Purnelle B., Ramezani Rad M., Rieger M., Rose M., Schaaff-Gerstenschlaeger I., Scherens B., Schwarzlose C., Skala J., Slonimski P.P., Smits P.H.M., Souciet J.-L., Steensma H.Y., Stucka R., Urrestarazu L.A., van der Aart Q.J.M., Van Dyck L., Vassarotti A., Vetter I., Vierendeels F., Vissers S., Wagner G., de Wergifosse P., Wolfe K.H., Zagulski M., Zimmermann F.K., Mewes H.-W., Kleine K.
    EMBO J. 13:5795-5809(1994)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ATCC 204508 / S288c.
  3. Cited for: GENOME REANNOTATION.
    Strain: ATCC 204508 / S288c.
  4. "Transposable elements and genome organization: a comprehensive survey of retrotransposons revealed by the complete Saccharomyces cerevisiae genome sequence."
    Kim J.M., Vanguri S., Boeke J.D., Gabriel A., Voytas D.F.
    Genome Res. 8:464-478(1998)
    Cited for: NOMENCLATURE.
  5. "Happy together: the life and times of Ty retrotransposons and their hosts."
    Lesage P., Todeschini A.L.
    Cytogenet. Genome Res. 110:70-90(2005)
    Cited for: REVIEW.

Entry information

Entry name: YB21B_YEAST
Accession: Primary (citable) accession number: Q12491
Secondary accession number(s): D6VPQ4, Q05679
Entry history:
Integrated into UniProtKB/Swiss-Prot: March 6, 2007
Last sequence update: November 1, 1996
Last modified: May 14, 2014
This is version 100 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Fungal Protein Annotation Program

Miscellaneous

Retrotransposons are mobile genetic entities that are able to replicate via an RNA intermediate and a reverse transcription step. In contrast to retroviruses, retrotransposons are non-infectious, lack an envelope and remain intracellular. Ty2 retrotransposons belong to the copia elements (Pseudoviridae).

Keywords - Technical term

Complete proteome, Multifunctional enzyme, Reference proteome, Transposable element

Documents

  1. Peptidase families
    Classification of peptidase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families
  3. Yeast
    Yeast (Saccharomyces cerevisiae): entries, gene names and cross-references to SGD
  4. Yeast chromosome II
    Yeast (Saccharomyces cerevisiae) chromosome II: entries and gene names
