Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein
Submitted name:

Gag, pol and env protein

Gene
N/A
Organism
Caenorhabditis elegans
Status
Unreviewed-Annotation score: Annotation score: 1 out of 5-Protein predictedi

Functioni

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Names & Taxonomyi

Protein namesi
Submitted name:
Gag, pol and env proteinImported
OrganismiCaenorhabditis elegansImported
Taxonomic identifieri6239 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis

PTM / Processingi

Proteomic databases

EPDiQ17329.
PaxDbiQ17329.
PeptideAtlasiQ17329.

Interactioni

Protein-protein interaction databases

STRINGi6239.F44E2.2c.2.

Structurei

3D structure databases

ProteinModelPortaliQ17329.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini677 – 692CCHC-typeInterPro annotationAdd BLAST16
Domaini1052 – 1231Reverse transcriptaseInterPro annotationAdd BLAST180
Domaini1616 – 1774Integrase catalyticInterPro annotationAdd BLAST159

Coiled coil

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Coiled coili197 – 257Sequence analysisAdd BLAST61

Sequence similaritiesi

Contains 1 reverse transcriptase domain.UniRule annotation

Keywords - Domaini

Coiled coilSequence analysis

Family and domain databases

Gene3Di3.30.420.10. 1 hit.
4.10.60.10. 1 hit.
InterProiIPR001969. Aspartic_peptidase_AS.
IPR001584. Integrase_cat-core.
IPR021109. Peptidase_aspartic_dom.
IPR012337. RNaseH-like_dom.
IPR000477. RT_dom.
IPR001878. Znf_CCHC.
[Graphical view]
PfamiPF00665. rve. 1 hit.
PF00078. RVT_1. 1 hit.
[Graphical view]
SMARTiSM00343. ZnF_C2HC. 1 hit.
[Graphical view]
SUPFAMiSSF50630. SSF50630. 1 hit.
SSF53098. SSF53098. 1 hit.
SSF57756. SSF57756. 1 hit.
PROSITEiPS00141. ASP_PROTEASE. 1 hit.
PS50994. INTEGRASE. 1 hit.
PS50878. RT_POL. 1 hit.
PS50158. ZF_CCHC. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Q17329-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MEVNEGQDTE GGSSRAQTLT PPPNPQQQLY DEEDLLRESM DTTEKTFENG
60 70 80 90 100
FQVRKAEHEV KKKDVIKHIQ NYAKANEAQT ALMVEPFIKI IKEEEDIIEI
110 120 130 140 150
REKSIMMLKK VVNEQGITIS DLQIQKEQIR QHLQDSSQRG TAEDAETQKM
160 170 180 190 200
KQFLDTNELH NVSDLEEIIK EYSVLKMKEE KEKQCLQMAS DSWAMMREEI
210 220 230 240 250
MEKRETNRDL NRQLKEKSEE LMQKSQILVE TTLKLKAVEE ERDKRKKEEQ
260 270 280 290 300
FREADARSNN YARKGEISSN IEQKNHQNIQ IMDTRCTTSS SRMNTPAQRI
310 320 330 340 350
GENLSTSNVG NNVVRETVRE YCEETGEILE DFEVNQNDSV LTERNVTGSV
360 370 380 390 400
RNGDSQVQTN SLERMTQMML AQSLPEPAKF TAEEGSISIE AFEKTFKLKF
410 420 430 440 450
GTFSDEQQVA ILESKYLEGR AQKAYRSLTA GEKVKVKVVL NALANRLRLS
460 470 480 490 500
VEDENHRAKQ KWNILSRKPD QSCEDYCLLI DDIARIAFRR VSPEELSSMK
510 520 530 540 550
YVKLLDEVTD MHLRCSIDNK IMDTDEINHY DVCREMIIRH EWNVSKINEK
560 570 580 590 600
QCLNSKLSEK RGKVQNAENF VNQNTQNNFK PFSPNKAADN SRNSWNNNSQ
610 620 630 640 650
NNSAASQNIS REQSWKTISV PQKHQNPSDR CSDCQQRGWH MFWCSKKSKD
660 670 680 690 700
NASQKCDECQ QSGWHMASCF KLKNRACFRC NEMGHIAWNC PKKNENTSEK
710 720 730 740 750
EAPVAKVETI EGVRMKDCLL MVKSEKSESE VTRSLEKGQI GKANVEILLD
760 770 780 790 800
SGASISLMSK NTWEKIVEVN GKSWEQDQIY EELEYKTART ANNQLFTLLR
810 820 830 840 850
AVMVEIKMQT KSEVIKFHIG DMDRENVIIG AGHFEQMGIQ MNMIIEPRIV
860 870 880 890 900
RIDEDVEIPP RSCQLVEVNV TGIIREGAYC LITPTMRHVE NAVVRLNEQG
910 920 930 940 950
KAWVRIVNQF KHMLSLKKGE VIGKGETGGF EVLSNKAEQD ITVEEVLNDP
960 970 980 990 1000
TLFSEIETDT NSCEVVKTAE TYERFTTICE HLKRENGDDR KIWDVIEQFQ
1010 1020 1030 1040 1050
DVFAISDDEL GRNSGTECVI ELKEGAEPIR QKPRPIPLAL KPEIRKMIQK
1060 1070 1080 1090 1100
MLNQKVIRES KSPWSSPVVL VKKKDGSIRM CIDYRKVNKV VKNNAHPLPN
1110 1120 1130 1140 1150
IEATLQSLAG KKLYTVFDMI AGFWQIPLDE KSKEITAFAI GSELFEWNVL
1160 1170 1180 1190 1200
PFGLVISPAL FQGTMEEIIG DLLGVCAFVY VDDLLIASKD MEQHLQDVKE
1210 1220 1230 1240 1250
ALTRIRKSGM KLRASKCHIA KKEVEYLGHK VTLDGVETQE VKTDKMKQFS
1260 1270 1280 1290 1300
RPTNVKELQS FLGLVGYYRK FILNFAQIAS SLTSLISAKV AWIWEKEQEI
1310 1320 1330 1340 1350
AFQELKKLVC QTPVLAQPDV EAALKGDRPF MIYTDASRKG IGAVLAQEGP
1360 1370 1380 1390 1400
DGQQHPIAFA SKALSPAETR YHITDLEALA MMFALRRFKT IIYGTAITVF
1410 1420 1430 1440 1450
TDHKPLISLL KGSPLADRLW RWSIEILEFD VKIVYLAGKA NAVADALSRG
1460 1470 1480 1490 1500
GCPPNELEEE QTKELTSIVN AIQTELPDIL DSSCWLERLK GEDEGWKEVI
1510 1520 1530 1540 1550
AALEGGKTKG TFKIVGIESE ISLEYYKIVG GVLKNTEIEE QSRSVVPEKI
1560 1570 1580 1590 1600
RTPLLKELHE GMLAGHFGIK KMWRMVHRKF YWPQMRVCVE NCVRTCAKCL
1610 1620 1630 1640 1650
CANDHSKLTS SLTPYRMTFP LEIVACDLMD VGLSVQGNRY ILTIIDLFTK
1660 1670 1680 1690 1700
YGTAVPIPDK KAETVLKAFV ERWAIGEGRI PLKLLTDQGK EFVNGLFAQF
1710 1720 1730 1740 1750
THMLKIEHIT TKGYNSRANG AVERFNKTIM HIMKKKTAVP MEWDDQVVYA
1760 1770 1780 1790 1800
VYAYNNCVHE NTGETPMFLM HGRDVMGPLE MSGEDAVGIN YADMDEYKHL
1810 1820 1830 1840 1850
LTQELLKVQK IAKEHAMREQ ESYKSLFDQK YASKKHRFPQ PGSRVLLEIP
1860 1870 1880 1890 1900
SEKLGAQCPK LVNKWSGPYR VISCSENSAE ITPVLGKRKH ILQIPFENLR
1910 1920 1930 1940 1950
VIPEAMPDIL IVTKKGRSKK PEPEIYCDEI TVVSENNENS CFSCRYICRC
1960 1970 1980 1990 2000
ALKPCMFNMT LVPEAHTPSP TQLYRMYCIM EKSGNRKIDP KQLMAMSSRP
2010 2020 2030 2040 2050
LPSPLQITLP DKMMDNLFKD MIGCSSLWTY VAELGWENSY NRYVDKLLNE
2060 2070 2080 2090 2100
NCGDILNGPG TMLILADGLR LEDLPVSTKN CFVCTDYDEE TLIALQKKCC
2110 2120 2130 2140 2150
RERFKMIVLV IPFTIDVELV DCWNRLIAKI SEETKILVVS NMTPDELEDH
2160 2170 2180 2190 2200
ALLVEFTSIL QKCRRVDDGY LEIISLHDRL EAHPRKTLEM TALAGKVEYW
2210 2220 2230 2240 2250
KAVQTRAKEV GMEWKAFELK RCTSDTPVKN SDCEASTSMK SASTVRTFED
2260 2270
RMVKRGNHNR VYHHFTPYGR KK
Length:2,272
Mass (Da):259,697
Last modified:November 1, 1996 - v1
Checksum:i47034F67AC3DA2B0
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U15406 Genomic DNA. Translation: AAA50456.1.
PIRiS44816.
T18572.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U15406 Genomic DNA. Translation: AAA50456.1.
PIRiS44816.
T18572.

3D structure databases

ProteinModelPortaliQ17329.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi6239.F44E2.2c.2.

Proteomic databases

EPDiQ17329.
PaxDbiQ17329.
PeptideAtlasiQ17329.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Family and domain databases

Gene3Di3.30.420.10. 1 hit.
4.10.60.10. 1 hit.
InterProiIPR001969. Aspartic_peptidase_AS.
IPR001584. Integrase_cat-core.
IPR021109. Peptidase_aspartic_dom.
IPR012337. RNaseH-like_dom.
IPR000477. RT_dom.
IPR001878. Znf_CCHC.
[Graphical view]
PfamiPF00665. rve. 1 hit.
PF00078. RVT_1. 1 hit.
[Graphical view]
SMARTiSM00343. ZnF_C2HC. 1 hit.
[Graphical view]
SUPFAMiSSF50630. SSF50630. 1 hit.
SSF53098. SSF53098. 1 hit.
SSF57756. SSF57756. 1 hit.
PROSITEiPS00141. ASP_PROTEASE. 1 hit.
PS50994. INTEGRASE. 1 hit.
PS50878. RT_POL. 1 hit.
PS50158. ZF_CCHC. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiQ17329_CAEEL
AccessioniPrimary (citable) accession number: Q17329
Entry historyi
Integrated into UniProtKB/TrEMBL: November 1, 1996
Last sequence update: November 1, 1996
Last modified: November 2, 2016
This is version 95 of the entry and version 1 of the sequence. [Complete history]
Entry statusiUnreviewed (UniProtKB/TrEMBL)

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.