Skip Header

Contribute Send feedback
Read comments (?) or add your own

O65543 (PP343_ARATH) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 72. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Web links·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Pentatricopeptide repeat-containing protein At4g31070, mitochondrial
Gene names
Name:PCMP-E7
Ordered Locus Names:At4g31070
ORF Names:F6I18.20
OrganismArabidopsis thaliana (Mouse-ear cress) [Reference proteome]
Taxonomic identifier3702 [NCBI]
Taxonomic lineageEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonscore eudicotyledonsrosidsmalvidsBrassicalesBrassicaceaeCamelineaeArabidopsis

Protein attributes

Sequence length624 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at transcript level

General annotation (Comments)

Subcellular location

Mitochondrion Potential.

Sequence similarities

Belongs to the PPR family. PCMP-E subfamily.

Contains 14 PPR (pentatricopeptide) repeats.

Sequence caution

The sequence AEE85853.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

The sequence CAA18186.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

The sequence CAB79825.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

Ontologies

Keywords
   Cellular componentMitochondrion
   DomainRepeat
Transit peptide
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Cellular_componentmitochondrion

Inferred from electronic annotation. Source: UniProtKB-SubCell

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Transit peptide1 – 1515Mitochondrion Potential
Chain16 – 624609Pentatricopeptide repeat-containing protein At4g31070, mitochondrial
PRO_0000363460

Regions

Repeat56 – 9136PPR 1
Repeat92 – 12231PPR 2
Repeat123 – 15735PPR 3
Repeat158 – 19235PPR 4
Repeat195 – 22531PPR 5
Repeat226 – 26035PPR 6
Repeat261 – 29636PPR 7
Repeat297 – 32731PPR 8
Repeat328 – 36235PPR 9
Repeat363 – 39735PPR 10
Repeat398 – 42831PPR 11
Repeat429 – 46335PPR 12
Repeat464 – 49835PPR 13
Repeat499 – 52931PPR 14
Region534 – 61077Type E motif

Sequences

Sequence LengthMass (Da)Tools
O65543 [UniParc].

Last modified February 10, 2009. Version 2.
Checksum: BC9BD98ED8CCE20D

FASTA62469,841
        10         20         30         40         50         60 
MRWVKLGRRV IMSRALSSRL NLELGNKLKG LVSDQFYDEA LRLYKLKIHS LGTNGFTAIL 

        70         80         90        100        110        120 
PSVIKACAFQ QEPFLLGAQL HCLCLKAGAD CDTVVSNSLI SMYAKFSRKY AVRKVFDEML 

       130        140        150        160        170        180 
HRDTVSYCSI INSCCQDGLL YEAMKLIKEM YFYGFIPKSE LVASLLALCT RMGSSSKVAR 

       190        200        210        220        230        240 
MFHALVLVDE RMQESVLLST ALVDMYLKFD DHAAAFHVFD QMEVKNEVSW TAMISGCVAN 

       250        260        270        280        290        300 
QNYEMGVDLF RAMQRENLRP NRVTLLSVLP ACVELNYGSS LVKEIHGFSF RHGCHADERL 

       310        320        330        340        350        360 
TAAFMTMYCR CGNVSLSRVL FETSKVRDVV MWSSMISGYA ETGDCSEVMN LLNQMRKEGI 

       370        380        390        400        410        420 
EANSVTLLAI VSACTNSTLL SFASTVHSQI LKCGFMSHIL LGNALIDMYA KCGSLSAARE 

       430        440        450        460        470        480 
VFYELTEKDL VSWSSMINAY GLHGHGSEAL EIFKGMIKGG HEVDDMAFLA ILSACNHAGL 

       490        500        510        520        530        540 
VEEAQTIFTQ AGKYHMPVTL EHYACYINLL GRFGKIDDAF EVTINMPMKP SARIWSSLLS 

       550        560        570        580        590        600 
ACETHGRLDV AGKIIANELM KSEPDNPANY VLLSKIHTES GNYHAAEEVR RVMQRRKLNK 

       610        620 
CYGFSKIEPE LQIEDYQGKS WSPI 

« Hide

References

« Hide 'large scale' references
[1]"Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana."
Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M., de Simone V., Obermaier B. expand/collapse author list , Mache R., Mueller M., Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B., Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A., Martienssen R., McCombie W.R.
Nature 402:769-777(1999) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[2]The Arabidopsis Information Resource (TAIR)
Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
Cited for: GENOME REANNOTATION.
Strain: cv. Columbia.
[3]"In Arabidopsis thaliana, 1% of the genome codes for a novel protein family unique to plants."
Aubourg S., Boudet N., Kreis M., Lecharny A.
Plant Mol. Biol. 42:603-613(2000) [PubMed] [Europe PMC] [Abstract]
Cited for: GENE FAMILY.
[4]"Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis."
Lurin C., Andres C., Aubourg S., Bellaoui M., Bitton F., Bruyere C., Caboche M., Debast C., Gualberto J., Hoffmann B., Lecharny A., Le Ret M., Martin-Magniette M.-L., Mireau H., Peeters N., Renou J.-P., Szurek B., Taconnat L., Small I.
Plant Cell 16:2089-2103(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: GENE FAMILY.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AL022198 Genomic DNA. Translation: CAA18186.1. Different initiation.
AL161578 Genomic DNA. Translation: CAB79825.1. Different initiation.
CP002687 Genomic DNA. Translation: AEE85853.1. Different initiation.
IPIIPI00522187.
IPI01019370.
PIRH85363.
RefSeqNP_194836.1. NM_119257.1.
UniGeneAt.65439.

3D structure databases

ProteinModelPortalO65543.
SMRO65543. Positions 535-566.
ModBaseSearch...

Proteomic databases

PaxDbO65543.
PRIDEO65543.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID829234.
KEGGath:AT4G31070.

Organism-specific databases

GeneFarm4022. 367.
TAIRAt4g31070.

Phylogenomic databases

eggNOGNOG305071.
HOGENOMHOG000176676.
InParanoidO65543.
PhylomeDBO65543.
ProtClustDBCLSN2685833.

Gene expression databases

ArrayExpressO65543.
GenevestigatorO65543.

Family and domain databases

Gene3D1.25.40.10. 2 hits.
InterProIPR002885. Pentatricopeptide_repeat.
IPR011990. TPR-like_helical.
[Graphical view]
PfamPF01535. PPR. 6 hits.
PF13812. PPR_3. 1 hit.
[Graphical view]
TIGRFAMsTIGR00756. PPR. 4 hits.
PROSITEPS51375. PPR. 12 hits.
[Graphical view]
ProtoNetSearch...

Entry information

Entry namePP343_ARATH
AccessionPrimary (citable) accession number: O65543
Secondary accession number(s): F4JR67
Entry history
Integrated into UniProtKB/Swiss-Prot: February 10, 2009
Last sequence update: February 10, 2009
Last modified: May 1, 2013
This is version 72 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programPlant Protein Annotation Program

Relevant documents

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names

SIMILARITY comments

Index of protein domains and families