Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Enzymatic polyprotein

Gene

ORF V

Organism
Cauliflower mosaic virus (strain Strasbourg) (CaMV)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Protein inferred from homologyi

Functioni

Encodes for at least two polypeptides: protease (PR) and reverse transcriptase (RT). The protease processes the polyprotein in cis. Reverse transcriptase is multifunctional enzyme that converts the viral RNA genome into dsDNA in viral cytoplasmic capsids. This enzyme displays a DNA polymerase activity that can copy either DNA or RNA templates, and a ribonuclease H (RNase H) activity that cleaves the RNA strand of RNA-DNA heteroduplexes in a partially processive 3'- to 5'-endonucleasic mode. Neo-synthesized pregenomic RNA (pgRNA) are encapsidated, and reverse-transcribed inside the nucleocapsid. Partial (+)DNA is synthesized from the (-)DNA template and generates the relaxed circular DNA (RC-DNA) genome. After budding and infection, the RC-DNA migrates in the nucleus, and is converted into a plasmid-like covalently closed circular DNA (cccDNA) (By similarity).By similarity

Catalytic activityi

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1).PROSITE-ProRule annotation

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Active sitei45 – 451

GO - Molecular functioni

  1. aspartic-type endopeptidase activity Source: UniProtKB-KW
  2. endonuclease activity Source: UniProtKB-KW
  3. RNA-directed DNA polymerase activity Source: UniProtKB-KW
Complete GO annotation...

Keywords - Molecular functioni

Aspartyl protease, Endonuclease, Hydrolase, Nuclease, Nucleotidyltransferase, Protease, RNA-directed DNA polymerase, Transferase

Protein family/group databases

MEROPSiA03.001.

Names & Taxonomyi

Protein namesi
Recommended name:
Enzymatic polyprotein
Including the following 3 domains:
Aspartic protease (EC:3.4.23.-)
Endonuclease
Reverse transcriptase (EC:2.7.7.49)
Gene namesi
ORF Names:ORF V
OrganismiCauliflower mosaic virus (strain Strasbourg) (CaMV)
Taxonomic identifieri10648 [NCBI]
Taxonomic lineageiVirusesRetro-transcribing virusesCaulimoviridaeCaulimovirus
Virus hostiArabidopsis thaliana (Mouse-ear cress) [TaxID: 3702]
Brassica [TaxID: 3705]
Raphanus [TaxID: 3725]
ProteomesiUP000002501 Componenti: Genome

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 679679Enzymatic polyproteinPRO_0000222054Add
BLAST

Structurei

3D structure databases

ProteinModelPortaliP03554.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini272 – 452181Reverse transcriptasePROSITE-ProRule annotationAdd
BLAST

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni40 – 13091ProteaseBy similarityAdd
BLAST

Domaini

The polymerase/reverse transcriptase (RT) and ribonuclease H (RH) domains are structured in five subdomains: finger, palm, thumb, connection and RNase H. Within the palm subdomain, the 'primer grip' region is thought to be involved in the positioning of the primer terminus for accommodating the incoming nucleotide. The RH domain stabilizes the association of RT with primer-template (By similarity).By similarity

Sequence similaritiesi

Contains 1 peptidase A3A domain.Curated
Contains 1 reverse transcriptase domain.PROSITE-ProRule annotation

Family and domain databases

InterProiIPR000588. Pept_A3A.
IPR000477. RT_dom.
[Graphical view]
PfamiPF02160. Peptidase_A3. 1 hit.
PF00078. RVT_1. 1 hit.
[Graphical view]
PRINTSiPR00731. CAULIMOPTASE.
PROSITEiPS50878. RT_POL. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

P03554-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MDHLLLKTQT QTEQVMNVTN PNSIYIKGRL YFKGYKKIEL HCFVDTGASL
60 70 80 90 100
CIASKFVIPE EHWVNAERPI MVKIADGSSI TISKVCKDID LIIAGEIFRI
110 120 130 140 150
PTVYQQESGI DFIIGNNFCQ LYEPFIQFTD RVIFTKNKSY PVHIAKLTRA
160 170 180 190 200
VRVGTEGFLE SMKKRSKTQQ PEPVNISTNK IENPLEEIAI LSEGRRLSEE
210 220 230 240 250
KLFITQQRMQ KIEELLEKVC SENPLDPNKT KQWMKASIKL SDPSKAIKVK
260 270 280 290 300
PMKYSPMDRE EFDKQIKELL DLKVIKPSKS PHMAPAFLVN NEAEKRRGKK
310 320 330 340 350
RMVVNYKAMN KATVGDAYNL PNKDELLTLI RGKKIFSSFD CKSGFWQVLL
360 370 380 390 400
DQESRPLTAF TCPQGHYEWN VVPFGLKQAP SIFQRHMDEA FRVFRKFCCV
410 420 430 440 450
YVDDILVFSN NEEDHLLHVA MILQKCNQHG IILSKKKAQL FKKKINFLGL
460 470 480 490 500
EIDEGTHKPQ GHILEHINKF PDTLEDKKQL QRFLGILTYA SDYIPKLAQI
510 520 530 540 550
RKPLQAKLKE NVPWRWTKED TLYMQKVKKN LQGFPPLHHP LPEEKLIIET
560 570 580 590 600
DASDDYWGGM LKAIKINEGT NTELICRYAS GSFKAAEKNY HSNDKETLAV
610 620 630 640 650
INTIKKFSIY LTPVHFLIRT DNTHFKSFVN LNYKGDSKLG RNIRWQAWLS
660 670
HYSFDVEHIK GTDNHFADFL SREFNKVNS
Length:679
Mass (Da):78,629
Last modified:July 20, 1986 - v1
Checksum:i9EE527BCD460B766
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
V00141 Genomic DNA. Translation: CAA23460.1.
PIRiD90799. QQCV5.
RefSeqiNP_056728.1. NC_001497.1.

Genome annotation databases

GeneIDi1489542.
KEGGivg:1489542.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
V00141 Genomic DNA. Translation: CAA23460.1.
PIRiD90799. QQCV5.
RefSeqiNP_056728.1. NC_001497.1.

3D structure databases

ProteinModelPortaliP03554.
ModBaseiSearch...
MobiDBiSearch...

Protein family/group databases

MEROPSiA03.001.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

GeneIDi1489542.
KEGGivg:1489542.

Family and domain databases

InterProiIPR000588. Pept_A3A.
IPR000477. RT_dom.
[Graphical view]
PfamiPF02160. Peptidase_A3. 1 hit.
PF00078. RVT_1. 1 hit.
[Graphical view]
PRINTSiPR00731. CAULIMOPTASE.
PROSITEiPS50878. RT_POL. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

  1. "Nucleotide sequence of cauliflower mosaic virus DNA."
    Franck A., Guilley H., Jonard G., Richards K., Hirth L.
    Cell 21:285-294(1979) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].

Entry informationi

Entry nameiPOL_CAMVS
AccessioniPrimary (citable) accession number: P03554
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 20, 1986
Last sequence update: July 20, 1986
Last modified: March 31, 2015
This is version 81 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Peptidase families
    Classification of peptidase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families

External Data

Dasty 3

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into Uniref entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.