P47024 (YJ41B_YEAST) Reviewed, UniProtKB/Swiss-Prot
Last modified
April 3, 2013.
Version 86.
History...
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Transposon Ty4-J Gag-Pol polyprotein Alternative name(s): TY4A-TY4B Transposon Ty4 TYA-TYB polyprotein | ||||||||
| Gene names |
| ||||||||
| Organism | Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast) [Reference proteome] | ||||||||
| Taxonomic identifier | 559292 [NCBI] | ||||||||
| Taxonomic lineage | Eukaryota › Fungi › Dikarya › Ascomycota › Saccharomycotina › Saccharomycetes › Saccharomycetales › Saccharomycetaceae › Saccharomyces › ![]() |
Protein attributes
| Sequence length | 1803 AA. |
| Sequence status | Complete. |
| Protein existence | Evidence at transcript level |
General annotation (Comments)
| Function | Capsid protein (CA) is the structural component of the virus-like particle (VLP), forming the shell that encapsulates the retrotransposons dimeric RNA genome By similarity. The aspartyl protease (PR) mediates the proteolytic cleavages of the Gag and Gag-Pol polyproteins after assembly of the VLP By similarity. Reverse transcriptase/ribonuclease H (RT) is a multifunctional enzyme that catalyzes the conversion of the retro-elements RNA genome into dsDNA within the VLP. The enzyme displays a DNA polymerase activity that can copy either DNA or RNA templates, and a ribonuclease H (RNase H) activity that cleaves the RNA strand of RNA-DNA heteroduplexes during plus-strand synthesis and hydrolyzes RNA primers. The conversion leads to a linear dsDNA copy of the retrotransposon that includes long terminal repeats (LTRs) at both ends By similarity. Integrase (IN) targets the VLP to the nucleus, where a subparticle preintegration complex (PIC) containing at least integrase and the newly synthesized dsDNA copy of the retrotransposon must transit the nuclear membrane. Once in the nucleus, integrase performs the integration of the dsDNA into the host genome By similarity. |
| Catalytic activity | Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). Endonucleolytic cleavage to 5'-phosphomonoester. |
| Subunit structure | The protease is a homodimer, whose active site consists of two apposed aspartic acid residues By similarity. |
| Subcellular location | |
| Domain | Integrase core domain contains the D-x(n)-D-x(35)-E motif, named for the phylogenetically conserved glutamic acid and aspartic acid residues and the invariant 35 amino acid spacing between the second and third acidic residues. Each acidic residue of the D,D35E motif is independently essential for the 3'-processing and strand transfer activities of purified integrase protein By similarity. |
| Post-translational modification | Proteolytically processed into capsid protein (CA), Ty4 protease (PR), integrase (IN) and reverse transcriptase/ribonuclease H (RT) proteins Probable. Initially, virus-like particles (VLPs) are composed of the structural unprocessed proteins Gag and Gag-Pol, and contain also the host initiator methionine tRNA (tRNA(i)-Met) which serves as a primer for minus-strand DNA synthesis, and a dimer of genomic Ty RNA. Processing of the polyproteins occurs within the particle and proceeds by an ordered pathway, called maturation. First, the protease (PR) is released by autocatalytic cleavage of the Gag-Pol polyprotein, and this cleavage is a prerequisite for subsequent processing at the remaining sites to release the mature structural and catalytic proteins. Maturation takes place prior to the RT reaction and is required to produce transposition-competent VLPs By similarity. |
| Miscellaneous | Retrotransposons are mobile genetic entities that are able to replicate via an RNA intermediate and a reverse transcription step. In contrast to retroviruses, retrotransposons are non-infectious, lack an envelope and remain intracellular. Ty4 retrotransposons belong to the copia elements (pseudoviridae). |
| Sequence similarities | Contains 1 integrase catalytic domain. Contains 1 reverse transcriptase Ty1/copia-type domain. Contains 1 RNase H Ty1/copia-type domain. |
| Sequence caution | The sequence CAA89409.1 differs from that shown. Reason: Erroneous gene model prediction. The sequence DAA08686.2 differs from that shown. Reason: Frameshift at positions 226 and 240. |
Ontologies
Alternative products
| This entry describes 2 isoforms produced by ribosomal frameshifting. [Align] [Select] Note: The Gag-Pol polyprotein is generated by a +1 ribosomal frameshift. | ||||||
| Isoform Transposon Ty4-J Gag-Pol polyprotein (identifier: P47024-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Note: Produced by +1 ribosomal frameshifting between codon Leu-363 and Gly-364 of the YJL114W ORF. | ||||||
| Isoform Transposon Ty4-J Gag polyprotein (identifier: P47023-1) The sequence of this isoform can be found in the external entry P47023. Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly. | ||||||
| Note: Produced by conventional translation. |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||
Molecule processing | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 1803 | 1803 | Transposon Ty4-J Gag-Pol polyprotein | PRO_0000203502 | |||||
Regions | |||||||||
| Domain | 620 – 787 | 168 | Integrase catalytic | ||||||
| Domain | 1376 – 1511 | 136 | Reverse transcriptase Ty1/copia-type | ||||||
| Domain | 1645 – 1791 | 147 | RNase H Ty1/copia-type | ||||||
| Region | 382 – 502 | 121 | Ty4 protease | ||||||
| Region | 540 – 600 | 61 | Integrase-type zinc finger-like | ||||||
| Coiled coil | 39 – 115 | 77 | Potential | ||||||
Sites | |||||||||
| Active site | 415 | 1 | For protease activity; shared with dimeric partner By similarity | ||||||
| Metal binding | 631 | 1 | Magnesium; catalytic; for integrase activity By similarity | ||||||
| Metal binding | 696 | 1 | Magnesium; catalytic; for integrase activity By similarity | ||||||
| Metal binding | 1384 | 1 | Magnesium; catalytic; for reverse transcriptase activity By similarity | ||||||
| Metal binding | 1463 | 1 | Magnesium; catalytic; for reverse transcriptase activity By similarity | ||||||
| Metal binding | 1464 | 1 | Magnesium; catalytic; for reverse transcriptase activity By similarity | ||||||
| Metal binding | 1645 | 1 | Magnesium; catalytic; for RNase H activity By similarity | ||||||
| Metal binding | 1687 | 1 | Magnesium; catalytic; for RNase H activity By similarity | ||||||
| Metal binding | 1721 | 1 | Magnesium; catalytic; for RNase H activity By similarity | ||||||
Experimental info | |||||||||
| Sequence conflict | 452 | 1 | V → L in X67284. Ref.1 | ||||||
| Sequence conflict | 684 | 1 | T → A in X67284. Ref.1 | ||||||
| Sequence conflict | 920 | 1 | A → S in X67284. Ref.1 | ||||||
| Sequence conflict | 1020 | 1 | S → R in X67284. Ref.1 | ||||||
| Sequence conflict | 1803 | 1 | Y → YLINEVLNTQISVEVQ in X67284. Ref.1 | ||||||
Sequences
| ||||||||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "Ty4, a new retrotransposon from Saccharomyces cerevisiae, flanked by tau-elements." Janetzky B., Lehle L. J. Biol. Chem. 267:19798-19805(1992) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA]. Strain: ATCC 204508 / S288c. |
| [2] | "Sequencing analysis of a 40.2 kb fragment of yeast chromosome X reveals 19 open reading frames including URA2 (5' end), TRK1, PBS2, SPT10, GCD14, RPE1, PHO86, NCA3, ASF1, CCT7, GZF3, two tRNA genes, three remnant delta elements and a Ty4 transposon." Cziepluch C., Kordes E., Pujol A., Jauniaux J.-C. Yeast 12:1471-1474(1996) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA]. Strain: ATCC 96604 / S288c / FY1679. |
| [3] | "Complete nucleotide sequence of Saccharomyces cerevisiae chromosome X." Galibert F., Alexandraki D., Baur A., Boles E., Chalwatzis N., Chuat J.-C., Coster F., Cziepluch C., de Haan M., Domdey H., Durand P., Entian K.-D., Gatius M., Goffeau A., Grivell L.A., Hennemann A., Herbert C.J., Heumann K. Karpfinger-Hartl L.EMBO J. 15:2031-2049(1996) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. Strain: ATCC 96604 / S288c / FY1679. |
| [4] | Saccharomyces Genome Database Submitted (MAR-2011) to the EMBL/GenBank/DDBJ databases Cited for: GENOME REANNOTATION, SEQUENCE REVISION TO 226 AND 240. Strain: ATCC 204508 / S288c. |
| [5] | "Transposable elements and genome organization: a comprehensive survey of retrotransposons revealed by the complete Saccharomyces cerevisiae genome sequence." Kim J.M., Vanguri S., Boeke J.D., Gabriel A., Voytas D.F. Genome Res. 8:464-478(1998) [PubMed] [Europe PMC] [Abstract] Cited for: NOMENCLATURE. |
| [6] | "Happy together: the life and times of Ty retrotransposons and their hosts." Lesage P., Todeschini A.L. Cytogenet. Genome Res. 110:70-90(2005) [PubMed] [Europe PMC] [Abstract] Cited for: REVIEW. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | X67284 Genomic DNA. No translation available. Z49389 Genomic DNA. Translation: CAA89409.1. Sequence problems. BK006943 Genomic DNA. Translation: DAA08686.2. Frameshift. |
| PIR | S31262. S56894. |
| RefSeq | NP_012421.2. NM_001181546.2. |
3D structure databases | |
| ProteinModelPortal | P47024. |
| ModBase | Search... |
Protein-protein interaction databases | |
| IntAct | P47024. 6 interactions. |
| MINT | MINT-436711. |
| STRING | 4932.YJL113W. |
Proteomic databases | |
| PaxDb | P47024. |
Protocols and materials databases | |
| StructuralBiologyKnowledgebase | Search... |
Genome annotation databases | |
| GeneID | 853330. |
| KEGG | sce:YJL113W. |
Organism-specific databases | |
| SGD | S000003649. YJL113W. |
Phylogenomic databases | |
| eggNOG | NOG283194. |
| HOGENOM | HOG000155565. |
| OrthoDB | EOG469V3D. |
Gene expression databases | |
| ArrayExpress | P47024. |
| Genevestigator | P47024. |
| GermOnline | YJL113W. Saccharomyces cerevisiae. |
Family and domain databases | |
| InterPro | IPR001584. Integrase_cat-core. IPR012337. RNaseH-like_dom. IPR013103. RVT_2. IPR001878. Znf_CCHC. [Graphical view] |
| Pfam | PF00665. rve. 1 hit. PF07727. RVT_2. 1 hit. [Graphical view] |
| SMART | SM00343. ZnF_C2HC. 1 hit. [Graphical view] |
| SUPFAM | SSF53098. RNaseH_fold. 1 hit. |
| PROSITE | PS50994. INTEGRASE. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Other | |
| NextBio | 973695. |
Entry information
| Entry name | YJ41B_YEAST | ||||||||
| Accession | Primary (citable) accession number: P47024 Secondary accession number(s): D6VW70, P87192 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Fungal Protein Annotation Program | ||||||||
Relevant documents
| Yeast Yeast (Saccharomyces cerevisiae): entries, gene names and cross-references to SGD |
| Yeast chromosome X Yeast (Saccharomyces cerevisiae) chromosome X: entries and gene names |
| SIMILARITY comments Index of protein domains and families |

Clusters with
