Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein
Submitted name:

Polyprotein

Gene
N/A
Organism
Cacao swollen shoot virus
Status
Unreviewed-Annotation score: Annotation score: 1 out of 5-Protein predictedi

Functioni

Catalytic activityi

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1).SAAS annotation

GO - Molecular functioni

Complete GO annotation...

Keywords - Molecular functioni

EndonucleaseSAAS annotation, Hydrolase, Nuclease, Nucleotidyltransferase, ProteaseSAAS annotation, RNA-directed DNA polymeraseSAAS annotation, Transferase

Keywords - Ligandi

Metal-binding, Zinc

Names & Taxonomyi

Protein namesi
Submitted name:
PolyproteinImported
OrganismiCacao swollen shoot virusImported
Taxonomic identifieri31559 [NCBI]
Taxonomic lineageiVirusesRetro-transcribing virusesCaulimoviridaeBadnavirus

Structurei

3D structure databases

ProteinModelPortaliQ66233.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini787 – 803CCHC-typeInterPro annotationAdd BLAST17
Domaini1314 – 1494Reverse transcriptaseInterPro annotationAdd BLAST181
Domaini1615 – 1720RNase HInterPro annotationAdd BLAST106

Coiled coil

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Coiled coili965 – 999Sequence analysisAdd BLAST35

Sequence similaritiesi

Contains 1 reverse transcriptase domain.UniRule annotation

Keywords - Domaini

Coiled coilSequence analysis, Zinc-fingerSAAS annotation

Family and domain databases

Gene3Di4.10.60.10. 1 hit.
InterProiIPR021109. Peptidase_aspartic_dom.
IPR018061. Retropepsins.
IPR002156. RNaseH_domain.
IPR000477. RT_dom.
IPR001878. Znf_CCHC.
[Graphical view]
PfamiPF00077. RVP. 1 hit.
PF00078. RVT_1. 1 hit.
PF00098. zf-CCHC. 1 hit.
[Graphical view]
SMARTiSM00343. ZnF_C2HC. 1 hit.
[Graphical view]
SUPFAMiSSF50630. SSF50630. 1 hit.
SSF57756. SSF57756. 1 hit.
PROSITEiPS50879. RNASE_H. 1 hit.
PS50878. RT_POL. 1 hit.
PS50158. ZF_CCHC. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Q66233-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MSRARPQHPV PSVTTTTSEQ NREGPLYEDQ IRDYRRGQRR IFNLRRRARR
60 70 80 90 100
LRRSMMGSRY QETLEQEIDP QTTLRLSMQE RARLVPAEVL YRSRRDTVHH
110 120 130 140 150
RVYTHRSEES VLCVGGSQVD RAFIQPESLE QLQRTGMSFI HIGILQVRIQ
160 170 180 190 200
ILHRQEEGTM ALVVFRDNRW SGDQSIFAQM EIDLTKGSQL VFVIPDTMMT
210 220 230 240 250
IGDFARNVQL SILTRGYENW QNGEANLLIT RGMTGRLSNT PNVAFAYQIA
260 270 280 290 300
SATDYLASHG VKAIAGKKMN LQHLRNQQWI LRPPQTDITP MQPRSVETRN
310 320 330 340 350
LVDGSISIRF HDYEAATSAS RPHYNEEDEE VESETESEIR EHTIAVWIGE
360 370 380 390 400
EEIPDQTGRK KVWEESSNGN GRFFRYYTPP PTFEGQIIAT GWGSDDDNEK
410 420 430 440 450
TPPKWDESPD EEGPTEPIWD QEEEEDEYDP NVYRAYLQKE EDEWQEITAS
460 470 480 490 500
LREEMEYPKR RPQTEMAFSE TVDYTPPGDT MMTPVGYPPA SSSRSTVTTP
510 520 530 540 550
SRPPLFEGRT THVPRFLKRD EYTEWWQLPS SQGTTGALFV MPKQMGLFHE
560 570 580 590 600
VFSRWESITK NYVAAQGFTD PTEKMEFMEN LLGETEKLTW IQWRMNYEAE
610 620 630 640 650
YQQLLTQADG RQGTQNILSQ IKRIFSLEDP ASGSTRIQDA AYRDLERLTC
660 670 680 690 700
HNIKDIVQFL NDYGRLAAKS GRLFLGTELS EKLWMKMPPE LGHRMKEAFQ
710 720 730 740 750
KEYSGNEVGV FPRILFAYRY LEQECKDAAF KRSLKSLSFC KDMPLTGYYD
760 770 780 790 800
KTSKYGMRKS RTYKGKPHAS HARVEKRKHL IRNKKCKCYL CGDEGHFARE
810 820 830 840 850
CPNQRRDVKR VAIFEGIDLP EGFDIVSVEE GEEESDAIYS ISENEDGELD
860 870 880 890 900
TEVVHEKVFM MREEDQSYWL GKTNHWTAMV RVSSQQYHCM HQWEHNKEIL
910 920 930 940 950
VVAHINCHFC KQPTQLRSRI HCPTCQLTSC FMCAPIYCNM IVQQQPKPPV
960 970 980 990 1000
PFNTHTLLQQ QAAYIQWLEK ENQRLTEAVE FYKKEAEELR LERDLEQDRR
1010 1020 1030 1040 1050
SLEPTLLDKG KKVQILDPDE DQHTAYLEED TISRVIGHTV EQQEVRKPVK
1060 1070 1080 1090 1100
KGNMLYNLDV VLHIPEVGRP IKVKAILDTG ATTCCININS VPQTAIEQNT
1110 1120 1130 1140 1150
FLVQFRGINS TQSVDKKLKY GRMTISNHQF RIPYCYAFPL SLGDGIEMIL
1160 1170 1180 1190 1200
GCNFIRGMYG GLRIEGHTIT FYKNVTTIQT RLAAVMVGGT TASELGGGEE
1210 1220 1230 1240 1250
SKSDSESMFD LSETEEFDSE THQQIVSHVA AQAQQQKLDP KLQQLMVQLQ
1260 1270 1280 1290 1300
DQGFIGENPM QHWAKNKILC RLDIKNPDLI IEDKPIKHLT PAMEKQFQKH
1310 1320 1330 1340 1350
IKALLDIGVI RPSKSKHRTT AFIVESGTVI DPVTKKTIHG KERLVFNYKR
1360 1370 1380 1390 1400
LNDNTEKDQY SLPGIQTILK RVGNKKVFSK FDLKSGFHQV AMAEESIPWT
1410 1420 1430 1440 1450
AFWVPQGLYE WLVMPFGLKN APAVFQRKMD QCFKGTEEFI AVYIDDILVF
1460 1470 1480 1490 1500
SENMAEHTKH IGIMLKICQE NGLVLSPSKI CLAQREIEFL GTVISQGQMK
1510 1520 1530 1540 1550
LQAHVIKKIV NKANIELETT KGLRSFLGLL NYARIYIPNL GRKLSPLYAK
1560 1570 1580 1590 1600
TSPTGEKRFN RQDWHLIKEI KDMVQKLPNL AIPPARCYII IESDGCMEGW
1610 1620 1630 1640 1650
GAVCKWKLAK EDSRTTEKIC AYASGKFGVV KSTIDAEIYA LIKALESFKI
1660 1670 1680 1690 1700
FYLDKKHLVV RTDCQAIVTF YNKTSTHKPS RIRWITFSDY ITGLGVPVTI
1710 1720 1730 1740 1750
EHIDGKENQL ADTLSRLVYT TWNQSQTHQP EEEELEKSQH LSFAGLAIPI
1760 1770 1780 1790 1800
AWPMMGSYNK RRTPLLTGQS LWQRNKPSQH SSTASKSRQP RKHYWPYVTY
1810 1820 1830
RAYSTSRETI WPLLPLETTG LATDCQLPNK TQPP
Length:1,834
Mass (Da):211,387
Last modified:November 1, 1996 - v1
Checksum:iA8CD742D82099E6E
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L14546 Unassigned DNA. Translation: AAA03171.1.
RefSeqiNP_041734.1. NC_001574.1.

Genome annotation databases

GeneIDi1496970.
KEGGivg:1496970.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L14546 Unassigned DNA. Translation: AAA03171.1.
RefSeqiNP_041734.1. NC_001574.1.

3D structure databases

ProteinModelPortaliQ66233.
ModBaseiSearch...
MobiDBiSearch...

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

GeneIDi1496970.
KEGGivg:1496970.

Family and domain databases

Gene3Di4.10.60.10. 1 hit.
InterProiIPR021109. Peptidase_aspartic_dom.
IPR018061. Retropepsins.
IPR002156. RNaseH_domain.
IPR000477. RT_dom.
IPR001878. Znf_CCHC.
[Graphical view]
PfamiPF00077. RVP. 1 hit.
PF00078. RVT_1. 1 hit.
PF00098. zf-CCHC. 1 hit.
[Graphical view]
SMARTiSM00343. ZnF_C2HC. 1 hit.
[Graphical view]
SUPFAMiSSF50630. SSF50630. 1 hit.
SSF57756. SSF57756. 1 hit.
PROSITEiPS50879. RNASE_H. 1 hit.
PS50878. RT_POL. 1 hit.
PS50158. ZF_CCHC. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiQ66233_9VIRU
AccessioniPrimary (citable) accession number: Q66233
Entry historyi
Integrated into UniProtKB/TrEMBL: November 1, 1996
Last sequence update: November 1, 1996
Last modified: November 30, 2016
This is version 109 of the entry and version 1 of the sequence. [Complete history]
Entry statusiUnreviewed (UniProtKB/TrEMBL)

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.