Protein

Ankyrin repeat protein A

Gene

arpA

Organism
Escherichia coli (strain K12)
Status
Reviewed - Annotation score: 2 out of 5 - Protein predicted

Function

Enzyme and pathway databases

BioCyc: EcoCyc:EG11208-MONOMER.
ECOL316407:JW3977-MONOMER.

Names & Taxonomy

Protein names
Recommended name:
Ankyrin repeat protein A
Alternative name(s):
Ankyrin-like regulatory protein
Gene names
Name: arpA
Synonyms: arp, yjaC
Ordered Locus Names: b4017, JW3977
Organism: Escherichia coli (strain K12)
Taxonomic identifier: 83333 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia
Proteomes: UP000000318 (Component: Chromosome); UP000000625 (Component: Chromosome)

Organism-specific databases

EcoGene: EG11208. arpA.

PTM / Processing

Molecule processing

Feature key   Position(s)   Length   Description                Feature identifier
Chain         1 – 728       728      Ankyrin repeat protein A   PRO_0000067228

Proteomic databases

PaxDb: P23325.
PRIDE: P23325.

Expression

Gene expression databases

Genevestigator: P23325.

Interaction

Protein-protein interaction databases

DIP: DIP-9159N.
IntAct: P23325. 7 interactions.
STRING: 511145.b4017.

Structure

3D structure databases

ProteinModelPortal: P23325.
SMR: P23325. Positions 423-451.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Length   Description
Repeat        381 – 410     30       ANK 1 (1 Publication)
Repeat        429 – 458     30       ANK 2 (1 Publication)
Repeat        477 – 506     30       ANK 3 (1 Publication)
Repeat        525 – 554     30       ANK 4 (1 Publication)
Repeat        573 – 602     30       ANK 5 (1 Publication)
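
The repeat coordinates above are 1-based and inclusive, so a feature's length is end - start + 1. The following is a minimal sketch that checks the listed lengths and the spacing between repeat starts. The names `ANK_REPEATS`, `feature_length`, and `to_slice` are hypothetical helpers introduced for illustration and are not part of any UniProt tool.

```python
# ANK repeat coordinates copied from the feature table (1-based, inclusive).
ANK_REPEATS = {
    "ANK 1": (381, 410),
    "ANK 2": (429, 458),
    "ANK 3": (477, 506),
    "ANK 4": (525, 554),
    "ANK 5": (573, 602),
}


def feature_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def to_slice(start, end):
    """Convert 1-based inclusive coordinates to a Python slice."""
    return slice(start - 1, end)


lengths = {name: feature_length(*pos) for name, pos in ANK_REPEATS.items()}
starts = [start for start, _ in ANK_REPEATS.values()]
spacing = [b - a for a, b in zip(starts, starts[1:])]

print(lengths)  # every repeat is 30 residues long
print(spacing)  # [48, 48, 48, 48]
```

Given the full 728-residue sequence as a string `seq` (an assumed variable), `seq[to_slice(381, 410)]` would return the residues of ANK 1.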

Sequence similarities

Contains 5 ANK repeats. Curated

Keywords - Domain

ANK repeat, Repeat

Phylogenomic databases

eggNOG: COG0666.
HOGENOM: HOG000009667.
InParanoid: P23325.
KO: K06867.
OMA: ILEALPC.
OrthoDB: EOG628F1T.

Family and domain databases

Gene3D: 1.25.40.20. 1 hit.
InterPro: IPR002110. Ankyrin_rpt.
IPR020683. Ankyrin_rpt-contain_dom.
IPR012927. Toxin_15_N.
Pfam: PF07906. Toxin_15. 1 hit.
SMART: SM00248. ANK. 6 hits.
SUPFAM: SSF48403. SSF48403. 1 hit.

Sequence

Sequence status: Complete.

P23325-1

        10         20         30         40         50
MITRIPRSSF SANINNTAQT NEHQTLSELF YKELEDKFSG KELATPLLKS
        60         70         80         90        100
FSENCRQNGR HIFSNKDFVI KFSTSVLQAD KKEITIINKN ENTTLTQTIA
       110        120        130        140        150
PIFEKYLMEI LPQRSDTLDK QELNLKSDRK EKEFPRIKLN GQCYFPGRPQ
       160        170        180        190        200
NRIVCRHIAA QYINDIYQNV DYKPHQDDYS SAEKFLTHFN KKCKNQTLAL
       210        220        230        240        250
VSSRPEGRCV AACGDFGLVM KAYFDKMESN GISVMAAILL VDNHALTVRL
       260        270        280        290        300
RIKNTTEGCT HYVVSVYDPN VTNDKIRIMS ESKENIKHYS LMDFMNVDYS
       310        320        330        340        350
LLKWSNDHVI NQSVAIIPAL PKEQLLMLKG SVDEITPPLS PATMNLLMAI
       360        370        380        390        400
GQNHQLTQLM IQLQKMPELH RTEMLTAYNS INLPGLYLAI NYGNADIVET
       410        420        430        440        450
IFNSLSETGY EGLLSKKNLM HILEAKDKNG FSGLFLAISR KDKNVVTSIL
       460        470        480        490        500
NALPKLAATH HLDNEQVYKF LSAKNRTSSH VLYHVMANGD ADMLKIVLNA
       510        520        530        540        550
LPLLIRTCHL TKEQVLDLLK AKDFYGCPGL YLAMQNGHSD IVKVILEALP
       560        570        580        590        600
SLAQEINISA SDIVDLLTAK SLARDTGLFM AMQRGHMNVI NTIFNALPTL
       610        620        630        640        650
FNTFKFDKKN MKPLLLANNS NEYPGLFSAI QHKQQNVVET VYLALSDHAR
       660        670        680        690        700
LFGFTAEDIM DFWQHKAPQK YSAFELAFEF GHRVIAELIL NTLNKMAESF
       710        720
GFTDNPRYIA EKNYMEALLK KASPHTVR
Length: 728
Mass (Da): 82,598
Last modified: July 19, 2003 - v3
Checksum: 19EE240C4561B3F0

Experimental Info

Feature key         Position(s)   Length   Description
Sequence conflict   124 – 124     1        N → D in AAA73004 (PubMed:1995429). Curated
Sequence conflict   282 – 282     1        S → T in AAA73004 (PubMed:1995429). Curated
Sequence conflict   701 – 728     28       GFTDN…PHTVR → TQKSISPYRTLNLCLRRYA in AAA73004 (PubMed:1995429). Curated
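
As an illustration of how a single-residue conflict relates to the canonical sequence, the sketch below applies the N → D difference at position 124 to residues 101-150 copied from the sequence block above. The function `apply_substitution` and the constant `FRAGMENT_101_150` are hypothetical names introduced here. The check against the reference residue guards against off-by-one errors with 1-based positions.

```python
def apply_substitution(fragment, fragment_start, position, ref, alt):
    """Replace `ref` with `alt` at a 1-based protein position.

    `fragment_start` is the 1-based position of the fragment's first residue.
    """
    index = position - fragment_start
    if not 0 <= index < len(fragment):
        raise ValueError("position lies outside the fragment")
    if fragment[index] != ref:
        raise ValueError(
            f"expected {ref} at position {position}, found {fragment[index]}"
        )
    return fragment[:index] + alt + fragment[index + 1:]


# Residues 101-150 of P23325, copied from the sequence block (spaces removed).
FRAGMENT_101_150 = "PIFEKYLMEILPQRSDTLDKQELNLKSDRKEKEFPRIKLNGQCYFPGRPQ"

variant = apply_substitution(FRAGMENT_101_150, 101, 124, "N", "D")
print(variant[20:30])  # residues 121-130 after the change: QELDLKSDRK
```

The same helper would apply to the S → T difference at position 282 given a fragment that covers that position. The replacement at 701 – 728 changes the length of the sequence and would need a separate range-based helper.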

Sequence databases

EMBL / GenBank / DDBJ:
U00006 Genomic DNA. Translation: AAC43111.1.
U00096 Genomic DNA. Translation: AAC76987.1.
AP009048 Genomic DNA. Translation: BAE78019.1.
M63497 Genomic DNA. Translation: AAA73004.1.
PIR: H65208.
RefSeq: NP_418441.1. NC_000913.3.

Genome annotation databases

EnsemblBacteria: AAC76987; AAC76987; b4017.
BAE78019; BAE78019; BAE78019.
GeneID: 944933.
KEGG: ecj:Y75_p3904.
eco:b4017.
PATRIC: 32123563. VBIEscCol129921_4130.

Cross-references

Organism-specific databases

EchoBASE: EB1193.
EcoGene: EG11208. arpA.

Miscellaneous databases

PRO: P23325.

Publications

  1. "Analysis of the Escherichia coli genome. IV. DNA sequence of the region from 89.2 to 92.8 minutes."
    Blattner F.R., Burland V.D., Plunkett G. III, Sofia H.J., Daniels D.L.
    Nucleic Acids Res. 21:5408-5417(1993)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / MG1655 / ATCC 47076.
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], SEQUENCE REVISION TO 282.
    Strain: K12 / MG1655 / ATCC 47076.
  3. "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
    Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
    Mol. Syst. Biol. 2:E1-E5(2006)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
  4. "Primary structure of the intergenic region between aceK and iclR in the Escherichia coli chromosome."
    Galinier A., Bleicher F., Negre D., Perriere G., Duclos B., Cozzone A.J., Cortay J.-C.
    Gene 97:149-150(1991)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 108-728.
  5. "Detecting patterns in protein sequences."
    Neuwald A.F., Green P.
    J. Mol. Biol. 239:698-712(1994)
    Cited for: IDENTIFICATION OF ANKYRIN REPEATS.

Entry information

Entry name: ARPA_ECOLI
Accession: Primary (citable) accession number: P23325
Secondary accession number(s): P76781, Q2M6T7
Entry history
Integrated into UniProtKB/Swiss-Prot: November 1, 1991
Last sequence update: July 19, 2003
Last modified: May 27, 2015
This is version 114 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
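
The membership criteria described above can be written as simple threshold checks. This is only a sketch of the stated thresholds, not UniProt's clustering procedure. It assumes that sequence identity and overlap with the seed have already been computed elsewhere and are given as fractions. The function names `joins_uniref100` and `joins_cluster` are hypothetical.

```python
MIN_OVERLAP = 0.80         # "80% overlap" from the descriptions above
MIN_FRAGMENT_LENGTH = 11   # "11 or more residues"


def joins_uniref100(candidate, representative):
    """Identical sequence, or an exact sub-fragment of 11 or more residues."""
    if candidate == representative:
        return True
    return len(candidate) >= MIN_FRAGMENT_LENGTH and candidate in representative


def joins_cluster(identity, overlap, min_identity):
    """Threshold test for UniRef90 (min_identity=0.90) or UniRef50 (0.50)."""
    return identity >= min_identity and overlap >= MIN_OVERLAP


# Hypothetical identity and overlap values for a sequence compared with a seed.
print(joins_cluster(0.93, 0.85, 0.90))  # True: meets the UniRef90 thresholds
print(joins_cluster(0.60, 0.85, 0.90))  # False: identity below 90%
print(joins_cluster(0.60, 0.85, 0.50))  # True: meets the UniRef50 thresholds
print(joins_cluster(0.93, 0.70, 0.50))  # False: overlap below 80%
```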