Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

Q859P9 (Q859P9_BPN4) Unreviewed, UniProtKB/TrEMBL

Last modified June 11, 2014. Version 53. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
OrganismEnterobacteria phage N4 (Bacteriophage N4) [Reference proteome]
Taxonomic identifier10752 [NCBI]
Taxonomic lineageVirusesdsDNA viruses, no RNA stageCaudoviralesPodoviridaeN4likevirus
Virus hostEscherichia coli [TaxID: 562]

Protein attributes

Sequence length3500 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Regions

Nucleotide binding1434 – 14374GTP PDB 3Q0A PDB 3Q22 PDB 3Q24
Nucleotide binding1556 – 15616ATP PDB 4FF2 PDB 4FF3
Nucleotide binding1559 – 15613GTP PDB 3Q22
Nucleotide binding1667 – 16682GTP PDB 3Q22
Region1472 – 14732Phosphate binding PDB 4FF2 PDB 4FF3

Sites

Metal binding15561Manganese PDB 3Q23 PDB 4FF2 PDB 4FF3
Metal binding15571Manganese; via carbonyl oxygen PDB 3Q23 PDB 4FF2 PDB 4FF3
Metal binding19481Manganese PDB 3Q23 PDB 4FF2 PDB 4FF3
Binding site14211AMP
Binding site14211ATP PDB 4FF2 PDB 4FF3
Binding site14211GTP PDB 3Q0A PDB 3Q22 PDB 3Q24
Binding site14661Phosphate PDB 4FF2 PDB 4FF3
Binding site15611AMP; via amide nitrogen
Binding site16091ATP PDB 4FF2 PDB 4FF3
Binding site16091GTP PDB 3Q22
Binding site16631ATP PDB 4FF2 PDB 4FF3
Binding site16631GTP PDB 3Q22
Binding site16671ATP PDB 4FF2 PDB 4FF3
Binding site16751AMP
Binding site16751ATP PDB 4FF2 PDB 4FF3
Binding site16751GTP PDB 3Q22
Binding site19481AMP
Binding site19481ATP PDB 4FF2 PDB 4FF3
Binding site19481GTP PDB 3Q0A PDB 3Q22 PDB 3Q24

Sequences

Sequence LengthMass (Da)Tools
Q859P9 [UniParc].

Last modified December 12, 2006. Version 2.
Checksum: B4A52EAC2E28BBDB

FASTA3,500382,495
        10         20         30         40         50         60 
MSVFDRLAGF ADSVTNAKQV DVSTATAQKK AEQGVTTPLV SPDAAYQMQA ARTGNVGANA 

        70         80         90        100        110        120 
FEPGTVQSDF MNLTPMQIMN KYGVEQGLQL INARADAGNQ VFNDSVTTRT PGEELGDIAT 

       130        140        150        160        170        180 
GVGLGFVNTL GGIGALGAGL LNDDAGAVVA QQLSKFNDAV HATQSQALQD KRKLFAARNL 

       190        200        210        220        230        240 
MNEVESERQY QTDKKEGTND IVASLSKFGR DFVGSIENAA QTDSIISDGL AEGVGSLLGA 

       250        260        270        280        290        300 
GPVLRGASLL GKAVVPANTL RSAALAGAID AGTGTQSLAR IASTVGRAAP GMVGVGAMEA 

       310        320        330        340        350        360 
GGAYQQTADE IMKMSLKDLE KSPVYQQHIK DGMSPEQARR QTASETGLTA AAIQLPIAAA 

       370        380        390        400        410        420 
TGPLVSRFEM APFRAGSLGA VGMNLARETV EEGVQGATGQ LAQNIAQQQN IDKNQDLLKG 

       430        440        450        460        470        480 
VGTQAGLGAL YGFGSAGVVQ APAGAARLAG AATAPVLRTT MAGVKAAGSV AGKVVSPIKN 

       490        500        510        520        530        540 
TLVARGERVM KQNEEASPVA DDYVAQAAQE AMAQAPEAEV TIRDAVEATD ATPEQKVAAH 

       550        560        570        580        590        600 
QYVSDLMNAT RFNPENYQEA PEHIRNAVAG STDQVQVIQK LADLVNTLDE SNPQALMEAA 

       610        620        630        640        650        660 
SYMYDAVSEF EQFINRDPAA LDSIPKDSPA IELLNRYTNL TANIQNTPKV IGALNVINRM 

       670        680        690        700        710        720 
INESAQNGSL NVTEESSPQE MQNVALAAEV APEKLNPESV NVVLKHAADG RIKLNNRQIA 

       730        740        750        760        770        780 
ALQNAAAILK GAREYDAEAA RLGLRPQDIV SKQIKTDESR TQEGQYSALQ HANRIRSAYN 

       790        800        810        820        830        840 
SGNFELASAY LNDFMQFAQH MQNKVGALNE HLVTGNADKN KSVHYQALTA DREWVRSRTG 

       850        860        870        880        890        900 
LGVNPYDTKS VKFAQQVALE AKTVADIANA LASAYPELKV SHIKVTPLDS RLNAPAAEVV 

       910        920        930        940        950        960 
KAFRQGNRDV ASSQPKADSV NQVKETPVTK QEPVTSTVQT KTPVSESVKT EPTTKESSPQ 

       970        980        990       1000       1010       1020 
AIKEPVNQSE KQDVNLTNED NIKQPTESVK ETETSTKEST VTEELKEGID AVYPSLVGTA 

      1030       1040       1050       1060       1070       1080 
DSKAEGIKNY FKLSFTLPEE QKSRTVGSEA PLKDVAQALS SRARYELFTE KETANPAFNG 

      1090       1100       1110       1120       1130       1140 
EVIKRYKELM EHGEGIADIL RSRLAKFLNT KDVGKRFAQG TEANRWVGGK LLNIVEQDGD 

      1150       1160       1170       1180       1190       1200 
TFKYNEQLLQ TAVLAGLQWR LTATSNTAIK DAKDVAAITG IDQALLPEGL VEQFDTGMTL 

      1210       1220       1230       1240       1250       1260 
TEAVSSLAQK IESYWGLSRN PNAPLGYTKG IPTAMAAEIL AAFVESTDVV ENIVDMSEID 

      1270       1280       1290       1300       1310       1320 
PDNKKTIGLY TITELDSFDP INSFPTAIEE AVLVNPTEKM FFGDDIPPVA NTQLRNPAVR 

      1330       1340       1350       1360       1370       1380 
NTPEQKAALK AEQATEFYVH TPMVQFYETL GKDRILELMG AGTLNKELLN DNHAKSLEGK 

      1390       1400       1410       1420       1430       1440 
NRSVEDSYNQ LFSVIEQVRA QSEDISTVPI HYAYNMTRVG RMQMLGKYNP QSAKLVREAI 

      1450       1460       1470       1480       1490       1500 
LPTKATLDLS NQNNEDFSAF QLGLAQALDI KVHTMTREVM SDELTKLLEG NLKPAIDMMV 

      1510       1520       1530       1540       1550       1560 
EFNTTGSLPE NAVDVLNTAL GDRKSFVALM ALMEYSRYLV AEDKSAFVTP LYVEADGVTN 

      1570       1580       1590       1600       1610       1620 
GPINAMMLMT GGLFTPDWIR NIAKGGLFIG SPNKTMNEHR STADNNDLYQ ASTNALMESL 

      1630       1640       1650       1660       1670       1680 
GKLRSNYASN MPIQSQIDSL LSLMDLFLPD INLGENGALE LKRGIAKNPL TITIYGSGAR 

      1690       1700       1710       1720       1730       1740 
GIAGKLVSSV TDAIYERMSD VLKARAKDPN ISAAMAMFGK QAASEAHAEE LLARFLKDME 

      1750       1760       1770       1780       1790       1800 
TLTSTVPVKR KGVLELQSTG TGAKGKINPK TYTIKGEQLK ALQENMLHFF VEPLRNGITQ 

      1810       1820       1830       1840       1850       1860 
TVGESLVYST EQLQKATQIQ SVVLEDMFKQ RVQEKLAEKA KDPTWKKGDF LTQKELNDIQ 

      1870       1880       1890       1900       1910       1920 
ASLNNLAPMI ETGSQTFYIA GSENAEVANQ VLATNLDDRM RVPMSIYAPA QAGVAGIPFM 

      1930       1940       1950       1960       1970       1980 
TIGTGDGMMM QTLSTMKGAP KNTLKIFDGM NIGLNDITDA SRKANEAVYT SWQGNPIKNV 

      1990       2000       2010       2020       2030       2040 
YESYAKFMKN VDFSKLSPEA LEAIGKSALE YDQRENATVD DIANAASLIE RNLRNIALGV 

      2050       2060       2070       2080       2090       2100 
DIRHKVLDKV NLSIDQMAAV GAPYQNNGKI DLSNMTPEQQ ADELNKLFRE ELEARKQKVA 

      2110       2120       2130       2140       2150       2160 
KARAEVKEET VSEKEPVNPD FGMVGREHKA SGVRILSATA IRNLAKISNL PSTQAATLAE 

      2170       2180       2190       2200       2210       2220 
IQKSLAAKDY KIIYGTPTQV AEYARQKNVT ELTSQEMEEA QAGNIYGWTN FDDKTIYLVS 

      2230       2240       2250       2260       2270       2280 
PSMETLIHEL VHASTFEEVY SFYQGNEVSP TSKQAIENLE GLMEQFRSLD ISKDSPEMRE 

      2290       2300       2310       2320       2330       2340 
AYADAIATIE GHLSNGFVDP AISKAAALNE FMAWGLANRA LAAKQKRTSS LVQMVKDVYQ 

      2350       2360       2370       2380       2390       2400 
AIKKLIWGRK QAPALGEDMF SNLLFNSAIL MRSQPTTQAV AKDGTLFHSK AYGNNERLSQ 

      2410       2420       2430       2440       2450       2460 
LNQTFDKLVT DYLRTDPVTE VERRGNVANA LMSATRLVRD VQSHGFNMTA QEQSVFQMVT 

      2470       2480       2490       2500       2510       2520 
AALATEAAID PHAMARAQEL YTHVMKHLTV EHFMADPDST NPADRYYAQQ KYDTISGANL 

      2530       2540       2550       2560       2570       2580 
VEVDAKGRTS LLPTFLGLAM VNEELRSIIK EMPVPKADKK LGNDIDTLLT NAGTQVMESL 

      2590       2600       2610       2620       2630       2640 
NRRMAGDQKA TNVQDSIDAL SETIMAAALK RESFYDAVAT PTGNFIDRAN QYVTDSIERL 

      2650       2660       2670       2680       2690       2700 
SETVIEKADK VIANPSNIAA KGVAHLAKLT AAIASEKQGE IVAQGVMTAM NQGKVWQPFH 

      2710       2720       2730       2740       2750       2760 
DLVNDIVGRT KTNANVYDLI KLVKSQISQD RQQFREHLPT VIAGKFSRKL TDTEWSAMHT 

      2770       2780       2790       2800       2810       2820 
GLGKTDLAVL RETMSMAEIR DLLSSSKKVK DEISTLEKEI QNQAGRNWNL VQKKSKQLAQ 

      2830       2840       2850       2860       2870       2880 
YMIMGEVGNN LLRNAHAISR LLGERITNGP VADVAAIDKL ITLYSLELMN KSDRDLLSEL 

      2890       2900       2910       2920       2930       2940 
AQSEVEGMEF SIAYMVGQRT EEMRKAKGDN RTLLNHFKGY IPVENQQGVN LIIADDKEFA 

      2950       2960       2970       2980       2990       3000 
KLNSQSFTRI GTYQGSTGFR TGSKGYYFSP VAARAPYSQG ILQNVRNTAG GVDIGTGFTL 

      3010       3020       3030       3040       3050       3060 
GTMVAGRITD KPTVERITKA LAKGERGREP LMPIYNSKGQ VVAYEQSVDP NMLKHLNQDN 

      3070       3080       3090       3100       3110       3120 
HFAKMVGVWR GRQVEEAKAQ RFNDILIEQL HAMYEKDIKD SSANKSQYVN LLGKIDDPVL 

      3130       3140       3150       3160       3170       3180 
ADAINLMNIE TRHKAEELFG KDELWVRRDM LNDALGYRAA SIGDVWTGNS RWSPSTLDTV 

      3190       3200       3210       3220       3230       3240 
KKMFLGAFGN KAYHVVMNAE NTIQNLVKDA KTVIVVKSVV VPAVNFLANI YQMIGRGVPV 

      3250       3260       3270       3280       3290       3300 
KDIAVNIPRK TSEINQYIKS RLRQIDAEAE LRAAEGNPNL VRKLKTEIQS ITDSHRRMSI 

      3310       3320       3330       3340       3350       3360 
WPLIEAGEFS SIADAGISRD DLLVAEGKIH EYMEKLANKL PEKVRNAGRY ALIAKDTALF 

      3370       3380       3390       3400       3410       3420 
QGIQKTVEYS DFIAKAIIYD DLVKRKKKSS SEALGQVTEE FINYDRLPGR FRGYMESMGL 

      3430       3440       3450       3460       3470       3480 
MWFYNFKIRS IKVAMSMIRN NPVHSLIATV VPAPTMFGNV GLPIQDNMLT MLAEGRLDYS 

      3490       3500 
LGFGQGLRAP TLNPWFNLTH 

« Hide

References

« Hide 'large scale' references
[1]"Genome sequence and analysis of bacteriophage N4."
Hendrix R.W., Rothman-Denes L., Hatfull G.F., Lawrence J.G., Pedulla M.
Submitted (NOV-2006) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[2]"Structural basis for DNA-hairpin promoter recognition by the bacteriophage N4 virion RNA polymerase."
Gleghorn M.L., Davydova E.K., Rothman-Denes L.B., Murakami K.S.
Mol. Cell 32:707-717(2008) [PubMed] [Europe PMC] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.00 ANGSTROMS) OF 998-2102.
[3]"X-ray crystal structure of the polymerase domain of the bacteriophage N4 virion RNA polymerase."
Murakami K.S., Davydova E.K., Rothman-Denes L.B.
Proc. Natl. Acad. Sci. U.S.A. 105:5046-5051(2008) [PubMed] [Europe PMC] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.00 ANGSTROMS) OF 998-2101.
[4]"X-ray crystal structures elucidate the nucleotidyl transfer reaction of transcript initiation using two nucleotides."
Gleghorn M.L., Davydova E.K., Basu R., Rothman-Denes L.B., Murakami K.S.
Proc. Natl. Acad. Sci. U.S.A. 108:3566-3571(2011) [PubMed] [Europe PMC] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (1.80 ANGSTROMS) OF 998-2103 IN COMPLEX WITH GTP; MANGANESE AND PHOSPHATE.
[5]"Watching the bacteriophage N4 RNA polymerase transcription by time-dependent soak-trigger-freeze X-ray crystallography."
Basu R.S., Murakami K.S.
J. Biol. Chem. 288:3305-3311(2013) [PubMed] [Europe PMC] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.00 ANGSTROMS) OF 998-2103 IN COMPLEX WITH AMP; ATP; MANGANESE AND PHOSPHATE.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
EF056009 Genomic DNA. Translation: AAO24831.2.
RefSeqYP_950528.1. NC_008720.1.

3D structure databases

PDBe
RCSB-PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
2PO4X-ray2.00A998-2101[»]
3C2PX-ray2.00A/B998-2102[»]
3C3LX-ray2.40A/B998-2102[»]
3C46X-ray2.00A/B998-2102[»]
3Q0AX-ray2.69A/B998-2103[»]
3Q22X-ray2.11A/B998-2103[»]
3Q23X-ray1.80A/B998-2103[»]
3Q24X-ray1.81A/B998-2102[»]
4FF1X-ray2.47A/B998-2103[»]
4FF2X-ray2.00A/B998-2103[»]
4FF3X-ray2.00A/B998-2103[»]
4FF4X-ray2.03A/B998-2103[»]
ProteinModelPortalQ859P9.
SMRQ859P9. Positions 1008-2101.
ModBaseSearch...
MobiDBSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID5075715.

Family and domain databases

ProtoNetSearch...

Other

EvolutionaryTraceQ859P9.

Entry information

Entry nameQ859P9_BPN4
AccessionPrimary (citable) accession number: Q859P9
Entry history
Integrated into UniProtKB/TrEMBL: June 1, 2003
Last sequence update: December 12, 2006
Last modified: June 11, 2014
This is version 53 of the entry and version 2 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)