Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q91TW9 (POLG_MRFVC) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 57. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (1) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Genome polyprotein

Cleaved into the following 2 chains:

  1. RNA replication protein
  2. Capsid protein CP1
    Short name=CP1
    Alternative name(s):
    Coat protein

Including the following 3 domains:

  1. RNA-directed RNA polymerase
    EC=2.7.7.48
  2. Helicase
    EC=3.6.4.13
  3. Methyltransferase
    EC=2.1.1.-
Gene names
ORF Names:ORF1
OrganismMaize rayado fino virus (isolate Costa Rica/Guapiles) (MRFV) [Reference proteome]
Taxonomic identifier652669 [NCBI]
Taxonomic lineageVirusesssRNA positive-strand viruses, no DNA stageTymoviralesTymoviridaeMarafivirus
Virus hostZea mays (Maize) [TaxID: 4577]

Protein attributes

Sequence length2027 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existencePredicted

General annotation (Comments)

Function

RNA replication protein replicates the viral genomic RNA. The central part of this protein possibly functions as an ATP-binding helicase and/or methyltransferase Probable. Ref.2

Capsid protein CP1 and CP2 assemble to form an icosahedral capsid, about 30 nm in diameter, and consisting of capsid proteins CP1 and CP2 in a 1:3 ratio. The capsid encapsulates the single-stranded RNA genome. While CP1 is produced as a C-terminal fusion of the replication protein, CP2 may be expressed from a 3'-co-terminal subgenomic RNA. Ref.2

Catalytic activity

Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1).

ATP + H2O = ADP + phosphate.

Subcellular location

Capsid protein CP1: Virion Potential.

Sequence similarities

Contains 1 (+)RNA virus helicase ATP-binding domain.

Contains 1 (+)RNA virus helicase C-terminal domain.

Contains 1 RdRp catalytic domain.

Alternative products

This entry describes 2 isoforms produced by alternative promoter usage. [Align] [Select]
Isoform Genome polyprotein (identifier: Q91TW9-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform Subgenomic capsid protein CP2 (identifier: Q91TW9-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-1827: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 20272027Genome polyprotein
PRO_0000402496
Chain1 – 17991799RNA replication protein
PRO_0000402497
Chain1800 – 2027228Capsid protein CP1
PRO_0000402498

Regions

Domain881 – 1038158(+)RNA virus helicase ATP-binding
Domain1039 – 1171133(+)RNA virus helicase C-terminal
Domain1507 – 1613107RdRp catalytic
Compositional bias546 – 60358Pro-rich
Compositional bias605 – 6128Poly-Arg

Natural variations

Alternative sequence1 – 18271827Missing in isoform Subgenomic capsid protein CP2.
VSP_040286

Sequences

Sequence LengthMass (Da)Tools
Isoform Genome polyprotein [UniParc].

Last modified December 1, 2001. Version 1.
Checksum: 77B46DC9950B0BF7

FASTA2,027223,738
        10         20         30         40         50         60 
MSSFLRGGHL LSGVESLTPT THRDTITAPI VESLATPLRR SLERYPWSIP KEFHSFLHTC 

        70         80         90        100        110        120 
GVDISGFGHA AHPHPVHKTI ETHLLLDVWP NYARGPSDVM FIKPEKFAKL QSRQPNFAHL 

       130        140        150        160        170        180 
INYRLVPKDT TRYPSTSTNL PDCETVFMHD ALMYYTPGQI ADLFFLCPQL QKIYASVVVP 

       190        200        210        220        230        240 
AESSFTHLSL HPEIYRFRFQ GSDLVYEPEG NPAANYTQPR SALDWLQTTG FTVGHEFFSV 

       250        260        270        280        290        300 
TLLDSFGPVH SLLIQRGRPP VFQAEDIASF RVPDAVALPA PASLHQDLRH RLVPRKVYDA 

       310        320        330        340        350        360 
LFNYVRAVRL RVTDPAGFVR TQVGKPEYSW VTSSAWDNLQ HFALQTAAVR PNTSHPLFQS 

       370        380        390        400        410        420 
PFARLSHWLR THTWALWCLA SPSASVSAWA TASALGRLLP LHTDRLRLFG FDIIGRRFWP 

       430        440        450        460        470        480 
RLPFHGPEPR FLWETHPACR PPVLFADSAF ECQILAGLAN RCSPSPFWSR LFPTASPPSW 

       490        500        510        520        530        540 
VAYSALALAA VPLAALALRW FYGPDSPQAL HDQYHATFHP DPWTLDLPRR LRRFERESFM 

       550        560        570        580        590        600 
RTGSAPLPQS LPPPEGSLLP VEPPPVPSDP EPALEPSPPA ASVPAPAPAL ASEPPPSPES 

       610        620        630        640        650        660 
VAPSRRRRRA RRAAARAPSP SPALLGADLR FGDLPPVSAW DSDPEISKLG ESTQGTVFAV 

       670        680        690        700        710        720 
TPGPRAPEPD TARLDADPSA SGPVMEFREL QKGAYIEPTG AFLTRARNSV SSSIPYPTRA 

       730        740        750        760        770        780 
ACLLVAVSQA TGLPTRTLWA ALCANLPDSV LDDGSLATLG LTTDHFAVLA RIFSLRCRFV 

       790        800        810        820        830        840 
SEHGDVELGL HDATSRFTIR HTPGHFELVA DNFSLPALVG ASSVPGADLA EACKRFVAPD 

       850        860        870        880        890        900 
RTVLPFRDVH IHRTDVRRAK NLISNMKNGF DGVMAQANPL DPKSARERFL MLDSCLDIAA 

       910        920        930        940        950        960 
PRRVRLIHIA GFAGCGKSWP ISHLLRTPAF RVFKLAVPTT ELRDEWKALM DPRDQDKWRF 

       970        980        990       1000       1010       1020 
GTWESSLLKT ARVLVIDEVY KMPRGYLDLA IHADAAIQFV ILLGDPIQGE YHSTHPSSSN 

      1030       1040       1050       1060       1070       1080 
ARLSPEHRYL RPYVDFYCFW SRRIPQNVAR VLDVPTTSTE MGFARYSQQF PFSGKILISA 

      1090       1100       1110       1120       1130       1140 
RDSAKSLADC GYHAVTIASS QGSTIAGPAY VHLDNHSRRL SHQHSLVAIT RSKSGIVFTG 

      1150       1160       1170       1180       1190       1200 
DKAAADGTSS ANLLFSAVLL DRRLSVRSLF SALLPCCPFV TEPPTSRAVL LRGAGYGIAR 

      1210       1220       1230       1240       1250       1260 
PLRARDAPPL GPDYVGDVIL DSSAPILGDG SANAPQVSTH FLPETRRPLH FDIPSARHQV 

      1270       1280       1290       1300       1310       1320 
ADHPLAPDHS ACAIEPVYPG ESFESLASLF LPPTDAESKE TYFRGEMSNQ FPHLDKPFEL 

      1330       1340       1350       1360       1370       1380 
GAQTSSLLAP LHNSKHDPTL LPASIGKRLR FRHSEAPYVI APRDEILGSL LYEAWCRAYH 

      1390       1400       1410       1420       1430       1440 
RSPRDVEPFD PDLYAECINL NEFAQLSSKT QATIMANANR SDPDWRWSAV RIFAKTQHKV 

      1450       1460       1470       1480       1490       1500 
NEGSLFGSWK ACQTLALMHD AVVLLLGPVK KYQRFFDQRD RPSTLYVHAG HTPFEMADWC 

      1510       1520       1530       1540       1550       1560 
RAHLTPAVKL ANDYTAFDQS QHGEAVVFER YKMNRLSIPA ELVDLHVYLK TNVSTQFGPL 

      1570       1580       1590       1600       1610       1620 
TCMRLTGEPG TYDDNTDYNI AVLHLEYAVG STPLMVSGDD SLLDSEPPVR DQWSAIAPML 

      1630       1640       1650       1660       1670       1680 
ALTFKKERGR YATFCGYYVG FTGAVRSPPA LFAKLMIAVD DGSISDKLIA YLTEFTVGHS 

      1690       1700       1710       1720       1730       1740 
SGDAFWTILP VEAVPYQSAC FDFFCRRAPA QAKVMLRLGE APESLLSLAF EGLKWASHSV 

      1750       1760       1770       1780       1790       1800 
YALMNSSHRR QLLHSSRRPR SLPEDPEVSQ LQGELLHQFQ SLHLPLRGGH MPNPLAAPFR 

      1810       1820       1830       1840       1850       1860 
LLQQSSSLGP TYAVAPIARA PQVPPPSMAD NATQVGPVPP RDDRVDRQPP LPDPPRVLET 

      1870       1880       1890       1900       1910       1920 
APSHFLDLPF QWKVTDFTGY AAYHGTDDLV ASAVLTTLCA PYRHAELLYV EISVAPCPPS 

      1930       1940       1950       1960       1970       1980 
FSKPIMFTVV WTPATLSPRD GKETDYYGGR QITVGGPVML SSTTAVPADL ARMNPFIKSS 

      1990       2000       2010       2020 
VSYNDTPRWT MSVPAVTGGD TKIPLATAFV RGIVRVRAPS GAATPSA 

« Hide

Isoform Subgenomic capsid protein CP2 [UniParc].

Checksum: C931BF8A03D7DD46
Show »

FASTA20021,500

References

[1]"Molecular characterization of the genome of Maize rayado fino virus, the type member of the genus Marafivirus."
Hammond R.W., Ramirez P.
Virology 282:338-347(2001) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA].
[2]"The two capsid proteins of maize rayado fino virus contain common peptide sequences."
Falk B.W., Tsai J.H.
Intervirology 25:111-116(1986) [PubMed] [Europe PMC] [Abstract]
Cited for: FUNCTION OF CAPSID PROTEINS, IDENTIFICATION OF SUBGENOMIC CAPSID PROTEIN CP2.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AF265566 Genomic RNA. Translation: AAK52838.1.
RefSeqNP_115454.1. NC_002786.1.

3D structure databases

HSSPHSSP built from PDB template 1E57 based on UniProtKB P36351.
ProteinModelPortalQ91TW9.
ModBaseSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Family and domain databases

InterProIPR027351. (+)RNA_virus_helicase_core_dom.
IPR008043. Peptidase_C21.
IPR007094. RNA-dir_pol_PSvirus.
IPR000574. Tymo_coat.
IPR002588. Tymovirus_MeTrfase.
IPR001788. Tymovirus_RNA-dep_RNA_pol.
[Graphical view]
PfamPF05381. Peptidase_C21. 1 hit.
PF00978. RdRP_2. 1 hit.
PF00983. Tymo_coat. 1 hit.
PF01443. Viral_helicase1. 1 hit.
PF01660. Vmethyltransf. 1 hit.
[Graphical view]
ProDomPD003886. Tymo_coat. 1 hit.
[Graphical view] [Entries sharing at least one domain]
PROSITEPS51657. PSRV_HELICASE. 1 hit.
PS50507. RDRP_SSRNA_POS. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry namePOLG_MRFVC
AccessionPrimary (citable) accession number: Q91TW9
Entry history
Integrated into UniProtKB/Swiss-Prot: November 30, 2010
Last sequence update: December 1, 2001
Last modified: May 1, 2013
This is version 57 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programViral Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families