Protein
Submitted name:

RT-IN

Gene

gag-pro-pol

Organism
Mason-Pfizer monkey virus (MPMV) (Simian Mason-Pfizer virus)
Status
Unreviewed; annotation score: 2 out of 5; protein predicted

Function

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). (SAAS annotation)

Cofactor

Mg2+ (SAAS annotation)

GO - Molecular function

GO - Biological process

Keywords

Molecular function: Aspartyl protease (SAAS annotation), DNA-binding (SAAS annotation), Endonuclease (SAAS annotation), Hydrolase, Nuclease, Nucleotidyltransferase, Protease, RNA-directed DNA polymerase (SAAS annotation), Transferase
Biological process: DNA integration, DNA recombination (SAAS annotation), Viral genome integration (SAAS annotation), Virus entry into host cell
Ligand: Magnesium (SAAS annotation), Metal-binding, Zinc

Names & Taxonomy

Protein names
Submitted name:
RT-IN (Imported)
Gene names
Name: gag-pro-pol (Imported)
Organism: Mason-Pfizer monkey virus (MPMV) (Simian Mason-Pfizer virus) (Imported)
Taxonomic identifier: 11855 [NCBI]
Taxonomic lineage: Viruses > Retro-transcribing viruses > Retroviridae > Orthoretrovirinae > Betaretrovirus
Virus host: Macaca mulatta (Rhesus macaque) [TaxID: 9544]
Proteomes
  • UP000105838 Component: Genome

Subcellular location

  • Virion (SAAS annotation)

GO - Cellular component

Keywords - Cellular component

Capsid protein (SAAS annotation), Virion

Structure

3D structure databases

ProteinModelPortal: O56224.
ModBase: Search...
MobiDB: Search...

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description                                       Length
Domain       549 – 562    CCHC-type (InterPro annotation)                   14
Domain       780 – 856    Peptidase A2 (InterPro annotation)                77
Domain       867 – 913    G-patch (InterPro annotation)                     47
Domain       959 – 1147   Reverse transcriptase (InterPro annotation)       189
Domain       1361 – 1492  RNase H (InterPro annotation)                     132
Domain       1496 – 1537  Integrase-type (InterPro annotation)              42
Domain       1550 – 1719  Integrase catalytic (InterPro annotation)         170
Domain       1716 – 1765  Integrase-type DNA-binding (InterPro annotation)  50
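
The positions in the domain table are 1-based and inclusive, so each listed length should equal end - start + 1. The following is a minimal Python sketch of that check, with the rows transcribed by hand from the table; the names `Domain`, `ROWS`, `parse_positions` and `overlaps` are illustrative and not part of any UniProt tooling.

```python
import re
from typing import NamedTuple


class Domain(NamedTuple):
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    name: str

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count.
        return self.end - self.start + 1


# (positions, description, listed length), copied from the domain table.
ROWS = [
    ("549 - 562", "CCHC-type", 14),
    ("780 - 856", "Peptidase A2", 77),
    ("867 - 913", "G-patch", 47),
    ("959 - 1147", "Reverse transcriptase", 189),
    ("1361 - 1492", "RNase H", 132),
    ("1496 - 1537", "Integrase-type", 42),
    ("1550 - 1719", "Integrase catalytic", 170),
    ("1716 - 1765", "Integrase-type DNA-binding", 50),
]


def parse_positions(text):
    """Extract the two integers from a 'start - end' position string."""
    start, end = (int(n) for n in re.findall(r"\d+", text))
    return start, end


def overlaps(a, b):
    """True if two inclusive ranges share at least one residue."""
    return a.start <= b.end and b.start <= a.end


domains = [Domain(*parse_positions(pos), name) for pos, name, _ in ROWS]

# Pairs of annotated domains whose ranges overlap.
overlapping = [
    (a.name, b.name)
    for i, a in enumerate(domains)
    for b in domains[i + 1:]
    if overlaps(a, b)
]
```

Under these assumptions every computed length agrees with the listed one, and the only overlapping pair is the integrase catalytic domain and the integrase-type DNA-binding domain, which share residues 1716 to 1719.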

Keywords - Domain

Repeat (SAAS annotation), Zinc-finger (SAAS annotation)

Family and domain databases

CDD:
cd05482. HIV_retropepsin_like. 1 hit.
cd07557. trimeric_dUTPase. 1 hit.
Gene3D:
1.10.10.200. 1 hit.
1.10.1200.30. 1 hit.
1.10.375.10. 1 hit.
2.30.30.10. 1 hit.
2.40.70.10. 1 hit.
3.30.420.10. 2 hits.
4.10.60.10. 1 hit.
InterPro:
IPR001969. Aspartic_peptidase_AS.
IPR003322. B_retro_matrix.
IPR029054. dUTPase-like.
IPR033704. dUTPase_trimeric.
IPR000467. G_patch_dom.
IPR000721. Gag_p24.
IPR001037. Integrase_C_retrovir.
IPR001584. Integrase_cat-core.
IPR017856. Integrase_Zn-bd_dom-like_N.
IPR003308. Integrase_Zn-bd_dom_N.
IPR001995. Peptidase_A2_cat.
IPR021109. Peptidase_aspartic_dom.
IPR034170. Retropepsin-like_cat_dom.
IPR018061. Retropepsins.
IPR008916. Retrov_capsid_C.
IPR008919. Retrov_capsid_N.
IPR010999. Retrovr_matrix.
IPR012337. RNaseH-like_dom.
IPR002156. RNaseH_domain.
IPR000477. RT_dom.
IPR010661. RVT_thumb.
IPR001878. Znf_CCHC.
Pfam:
PF00692. dUTPase. 1 hit.
PF01585. G-patch. 1 hit.
PF02337. Gag_p10. 1 hit.
PF00607. Gag_p24. 1 hit.
PF00552. IN_DBD_C. 1 hit.
PF02022. Integrase_Zn. 1 hit.
PF00075. RNase_H. 1 hit.
PF00665. rve. 1 hit.
PF00077. RVP. 1 hit.
PF00078. RVT_1. 1 hit.
PF06817. RVT_thumb. 1 hit.
ProDom:
PD004265. B_retro_matrix_N. 1 hit.
SMART:
SM00443. G_patch. 1 hit.
SM00343. ZnF_C2HC. 2 hits.
SUPFAM:
SSF46919. SSF46919. 1 hit.
SSF47353. SSF47353. 1 hit.
SSF47836. SSF47836. 1 hit.
SSF47943. SSF47943. 1 hit.
SSF50122. SSF50122. 1 hit.
SSF50630. SSF50630. 1 hit.
SSF51283. SSF51283. 1 hit.
SSF53098. SSF53098. 1 hit.
SSF57756. SSF57756. 1 hit.
PROSITE:
PS50175. ASP_PROT_RETROV. 1 hit.
PS00141. ASP_PROTEASE. 1 hit.
PS50174. G_PATCH. 1 hit.
PS50994. INTEGRASE. 1 hit.
PS51027. INTEGRASE_DBD. 1 hit.
PS50879. RNASE_H. 1 hit.
PS50878. RT_POL. 1 hit.
PS50158. ZF_CCHC. 1 hit.
PS50876. ZF_INTEGRASE. 1 hit.

Sequence

Sequence status: Complete.

O56224-1 [UniParc]

        10         20         30         40         50
MGQELSQHER YVEQLKQALK TRGVKVKYAD LLKFFDFVKD TCPWFPQEGT
60 70 80 90 100
IDIKRWRRVG DCFQDYYNTF GPEKVPVTAF SYWNLIKELI DKKEVNPQVM
110 120 130 140 150
AAVAQTEEIL KSNSQTDLTK TSQNPDLDLI SLDSDDEGAK SSSLQDKGLS
160 170 180 190 200
STKKPKRFPV LLTAQTSKDP EDPNPSEVDW DGLEDEAAKY HNPDWPPFLT
210 220 230 240 250
RPPPYNKATP SAPTVMAVVN PKEELKEKIA QLEEQIKLEE LHQALISKLQ
260 270 280 290 300
KLKTGNETVT HPDTAGGLSR TPHWPGQHIP KGKCCASREK EEQIPKDIFP
310 320 330 340 350
VTETVDGQGQ AWRHHNGFDF AVIKELKTAA SQYGATAPYT LAIVESVADN
360 370 380 390 400
WLTPTDWNTL VRAVLSGGDH LLWKSEFFEN CRDTAKRNQQ AGNGWDFDML
410 420 430 440 450
TGSGNYSSTD AQMQYDPGLF AQIQAAATKA WRKLPVKGDP GASLTGVKQG
460 470 480 490 500
PDEPFADFVH RLITTAGRIF GSAEAGVDYV KQLAYENANP ACQAAIRPYR
510 520 530 540 550
KKTDLTGYIR LCSDIGPSYQ QGLAMAAAFS GQTVKDFLNN KNKEKGGCCF
560 570 580 590 600
KCGKKGHFAK NCHEHAHNNA EPKVPGLCPR CKRGKHWANE CKSKTDNQGN
610 620 630 640 650
PIPPHQGNRV EGPAPGPETS LWGSQLCSSQ QKQPISKLTR ATPGSAGLDL
660 670 680 690 700
CSTSHTVLTP EMGPQALSTG IYGPLPPNTF GLILGRSSIT MKGLQVYPGV
710 720 730 740 750
IDNDYTGEIK IMAKAVNNIV TVSQGNRIAQ LILLPLIETD NKVQQPYRGQ
760 770 780 790 800
GSFGSSDIYW VQPITCQKPS LTLWLDDKMF TGLIDTGADV TIIKLEDWPP
810 820 830 840 850
NWPITDTLTN LRGIGQSNNP KQSSKYLTWR DKENNSGLIK PFVIPNLPVN
860 870 880 890 900
LWGRDLLSQM KIMMCSPNDI VTAQMLAQGY SPGKGLGKKE NGILHPIPNQ
910 920 930 940 950
GQSNKKGFGN FLTAAIDILA PQQCAEPITW KSDEPVWVDQ WPLTNDKLAA
960 970 980 990 1000
AQQLVQEQLE AGHITESSSP WNTPIFVIKK KSGKWRLLQD LRAVNATMVL
1010 1020 1030 1040 1050
MGALQPGLPS PVAIPQGYLK IIIDLKDCFF SIPLHPSDQK RFAFSLPSTN
1060 1070 1080 1090 1100
FKEPMQRFQW KVLPQGMANS PTLCQKYVAT AIHKVRHAWK QMYIIHYMDD
1110 1120 1130 1140 1150
ILIAGKDGQQ VLQCFDQLKQ ELTAAGLHIA PEKVQLQDPY TYLGFELNGP
1160 1170 1180 1190 1200
KITNQKAVIR KDKLQTLNDF QKLLGDINWL RPYLKLTTGD LKPLFDTLKG
1210 1220 1230 1240 1250
DSDPNSHRSL SKEALASLEK VETAIAEQFV THINYSLPLI FLIFNTALTP
1260 1270 1280 1290 1300
TGLFWQDNPI MWIHLPASPK KVLLPYYDAI ADLIILGRDH SKKYFGIEPS
1310 1320 1330 1340 1350
TIIQPYSKSQ IDWLMQNTEM WPIACASFVG ILDNHYPPNK LIQFCKLHTF
1360 1370 1380 1390 1400
VFPQIISKTP LNNALLVFTD GSSTGMAAYT LTDTTIKFQT NLNSAQLVEL
1410 1420 1430 1440 1450
QALIAVLSAF PNQPLNIYTD SAYLAHSIPL LETVAQIKHI SETAKLFLQC
1460 1470 1480 1490 1500
QQLIYNRSIP FYIGHVRAHS GLPGPIAQGN QRADLATKIV ASNINTNLES
1510 1520 1530 1540 1550
AQNAHTLHHL NAQTLRLMFN IPREQARQIV KQCPICVTYL PVPHLGVNPR
1560 1570 1580 1590 1600
GLFPNMIWQM DVTHYSEFGN LKYIHVSIDT FSGFLLATLQ TGETTKHVIT
1610 1620 1630 1640 1650
HLLHCFSIIG LPKQIKTDNG PGYTSKNFQE FCSTLQIKHI TGIPYNPQGQ
1660 1670 1680 1690 1700
GIVERAHLSL KTTIEKIKKG EWYPRKGTPR NILNHALFIL NFLNLDDQNK
1710 1720 1730 1740 1750
SAADRFWHNN PKKQFAMVKW KDPLDNTWHG PDPVLIWGRG SVCVYSQTYD
1760 1770
AARWLPERLV RQVSNNNQSR E
Length: 1,771
Mass (Da): 198,014
Last modified: December 1, 2001 - v2
Checksum: BE8BEAB195B4E833
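
The sequence display interleaves position rulers with rows of 10-residue blocks, so it has to be flattened before residues can be counted or sliced. The sketch below shows one way to do that in Python, using only the first two rows of the display as input; the function names `display_to_sequence` and `subsequence` are hypothetical helpers, not part of any UniProt client.

```python
import re

# First two rows of the sequence display, including their ruler lines.
DISPLAY = """
        10         20         30         40         50
MGQELSQHER YVEQLKQALK TRGVKVKYAD LLKFFDFVKD TCPWFPQEGT
60 70 80 90 100
IDIKRWRRVG DCFQDYYNTF GPEKVPVTAF SYWNLIKELI DKKEVNPQVM
"""


def display_to_sequence(text):
    """Drop ruler lines and block spacing, returning one contiguous string."""
    rows = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip blank lines and rulers (digits and whitespace only).
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        rows.append(re.sub(r"\s+", "", stripped))
    return "".join(rows)


def subsequence(seq, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = display_to_sequence(DISPLAY)
first_block = subsequence(seq, 1, 10)    # residues 1 to 10
sixth_block = subsequence(seq, 51, 60)   # residues 51 to 60
```

Applied to the full display, the same function should yield a string whose length matches the stated 1,771 residues, and `subsequence` can then pull out any range from the domain table, since both use 1-based inclusive positions.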

Sequence databases

EMBL/GenBank/DDBJ: AF033815 Genomic RNA. Translation: AAC82576.1.
RefSeq: NP_056891.1. NC_001550.1.

Genome annotation databases

GeneID: 2746973.

Keywords - Coding sequence diversity

Ribosomal frameshifting (SAAS annotation)

Cross-references

Protocols and materials databases

Structural Biology Knowledgebase: Search...

Family and domain databases

ProtoNet: Search...

Entry information

Entry name: O56224_MPMV
Accession: Primary (citable) accession number: O56224
Entry history: Integrated into UniProtKB/TrEMBL: June 1, 1998
Last sequence update: December 1, 2001
Last modified: April 12, 2017
This is version 131 of the entry and version 2 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)
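
The entry name here has the form accession, underscore, organism code, which is the usual shape for unreviewed (TrEMBL) entries. As a small illustration, the Python sketch below splits the entry name and checks the left part against the accession pattern given in the UniProt help pages; the helper names `split_entry_name` and `is_accession` are assumptions made for this example.

```python
import re

# Accession number pattern as documented in the UniProt help pages.
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9]([A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def split_entry_name(entry_name):
    """Split 'X_Y' into (X, Y) at the first underscore."""
    left, _, organism_code = entry_name.partition("_")
    return left, organism_code


def is_accession(text):
    """True if the whole string has the shape of a UniProtKB accession."""
    return ACCESSION_RE.fullmatch(text) is not None


prefix, organism_code = split_entry_name("O56224_MPMV")
```

For this entry the prefix is the primary accession itself. Reviewed (Swiss-Prot) entries generally carry a protein mnemonic before the underscore instead, so the check is only meaningful for names of this form.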

Miscellaneous

Keywords - Technical term

Complete proteome (Imported), Multifunctional enzyme (SAAS annotation)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
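
The seed-based rule described for UniRef90 and UniRef50 can be illustrated with a toy greedy clustering in Python. This is only a sketch under strong assumptions: identity is measured by an ungapped, left-anchored comparison instead of a real alignment, the inputs are made-up short peptides, and `identity_and_overlap` and `greedy_cluster` are hypothetical names. It does not reproduce how UniRef is actually computed.

```python
def identity_and_overlap(seed, other):
    """Compare two sequences position by position from the left.

    Returns (identity over the compared region, fraction of the seed covered).
    A real pipeline would use a proper sequence alignment here.
    """
    compared = min(len(seed), len(other))
    if compared == 0:
        return 0.0, 0.0
    matches = sum(a == b for a, b in zip(seed, other))
    return matches / compared, compared / len(seed)


def greedy_cluster(seqs, min_identity, min_overlap=0.8):
    """Assign each sequence to the first seed it matches, longest first."""
    clusters = []  # list of (seed, members) pairs
    for seq in sorted(seqs, key=len, reverse=True):
        for seed, members in clusters:
            identity, overlap = identity_and_overlap(seed, seq)
            if identity >= min_identity and overlap >= min_overlap:
                members.append(seq)
                break
        else:
            # No existing seed is close enough: this sequence seeds a new cluster.
            clusters.append((seq, [seq]))
    return clusters


# Made-up peptides: a seed, a one-residue variant, a fragment, an unrelated one.
peptides = ["MGQELSQHER", "MGQELSQHEK", "MGQELSQH", "AAAAAAAAAA"]
clusters = greedy_cluster(peptides, min_identity=0.9)
```

Because sequences are processed from longest to shortest, the seed of each cluster is always its longest member, which mirrors the "longest sequence" wording above. Running the same function on the seeds of one level with a lower threshold would give a rough analogue of building the 50% level from the 90% seeds.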