Protein

Pentatricopeptide repeat-containing protein DOT4, chloroplastic

Gene

DOT4

Organism
Arabidopsis thaliana (Mouse-ear cress)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at transcript level

Function

Plays a major role in single RNA editing events in chloroplasts. Acts as a site-recognition trans-acting factor involved in the editing of the unique site (corresponding to cytidine-488) of rpoC1, which is a plastid-encoded subunit of the chloroplast DNA-directed RNA polymerase. May provide the catalytic activity for editing site conversion (PubMed:24194514). Involved in leaf vasculature patterning (PubMed:18643975). 2 Publications
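The coordinate arithmetic behind an editing site such as cytidine-488 can be sketched as follows. This is a minimal illustration, assuming the position is 1-based and counted from the first nucleotide of the coding sequence (the entry does not state the reference frame). The function names and the toy sequence are hypothetical; the toy sequence is not the real rpoC1 transcript.

```python
def codon_coordinates(nt_position):
    """Map a 1-based nucleotide position to (codon number, position in codon), both 1-based."""
    if nt_position < 1:
        raise ValueError("nucleotide positions are 1-based")
    zero_based = nt_position - 1
    return zero_based // 3 + 1, zero_based % 3 + 1


def edit_c_to_u(rna, nt_position):
    """Return a copy of `rna` with the cytidine at the 1-based position converted to uridine."""
    index = nt_position - 1
    if rna[index] != "C":
        raise ValueError(f"expected C at position {nt_position}, found {rna[index]}")
    return rna[:index] + "U" + rna[index + 1:]


# Under the stated assumption, position 488 is the second base of codon 163.
print(codon_coordinates(488))

# Toy example only (hypothetical sequence): edit the second base of the second codon.
print(edit_c_to_u("AUGUCAGGA", 5))
```

Raising an error when the target base is not a cytidine guards against off-by-one mistakes, which are the most common failure when mixing 0-based and 1-based coordinates.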

Cofactor

Zn2+ (By similarity)
Note: Binds 2 zinc ions per subunit. (By similarity)

GO - Molecular function

GO - Biological process

  • chloroplast RNA modification Source: UniProtKB
  • cotyledon vascular tissue pattern formation Source: TAIR
  • leaf development Source: TAIR
  • leaf vascular tissue pattern formation Source: TAIR
  • mRNA processing Source: UniProtKB-KW
  • phloem or xylem histogenesis Source: TAIR

Keywords

Molecular function: RNA-binding
Biological process: mRNA processing
Ligand: Metal-binding, Zinc

Names & Taxonomy

Protein names
Recommended name: Pentatricopeptide repeat-containing protein DOT4, chloroplastic (Curated)
Alternative name(s):
Protein DEFECTIVELY ORGANIZED TRIBUTARIES 4 (1 Publication)
Protein FLAVODENTATA (1 Publication)
Gene names
Name: DOT4 (1 Publication)
Synonyms: FLV (1 Publication), PCMP-H45
Ordered Locus Names: At4g18750
ORF Names: F28A21.160
Organism: Arabidopsis thaliana (Mouse-ear cress)
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > eudicotyledons > Gunneridae > Pentapetalae > rosids > malvids > Brassicales > Brassicaceae > Camelineae > Arabidopsis
Proteomes
  • UP000006548 Component: Chromosome 4

Organism-specific databases

Araport: AT4G18750.
TAIR: locus:2124137. AT4G18750.

Subcellular location

Keywords - Cellular component

Chloroplast, Plastid

Pathology & Biotech

Disruption phenotype

Defects in venation pattern in leaves and cotyledons. (1 Publication)

PTM / Processing

Molecule processing

Feature key             Position(s)  Description                                                      Length
Transit peptide         1 – 28       Chloroplast (Sequence analysis)                                  28
Chain (PRO_0000363437)  29 – 871     Pentatricopeptide repeat-containing protein DOT4, chloroplastic  843
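The lengths in this table follow from 1-based inclusive coordinates. A minimal sketch of that arithmetic, using only the positions listed above (the function and variable names are illustrative, not part of the entry):

```python
def feature_length(start, end):
    """Length of a feature given 1-based inclusive start and end positions."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


PRECURSOR_LENGTH = 871          # full-length sequence, as listed in the entry
transit_peptide = (1, 28)       # removed on import into the chloroplast
chain = (29, 871)               # mature protein

transit_len = feature_length(*transit_peptide)
chain_len = feature_length(*chain)
print(transit_len, chain_len)

# The two features tile the precursor with no gap or overlap.
assert chain[0] == transit_peptide[1] + 1
assert transit_len + chain_len == PRECURSOR_LENGTH
```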

Proteomic databases

PaxDb: Q9SN39.
PRIDE: Q9SN39.

PTM databases

iPTMnet: Q9SN39.

Expression

Tissue specificity

Weakly expressed in leaves. (1 Publication)

Gene expression databases

Genevisible: Q9SN39. AT.

Interaction

Protein-protein interaction databases

STRING: 3702.AT4G18750.1.

Structure

3D structure databases

ProteinModelPortal: Q9SN39.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description  Length
Repeat       60 – 94      PPR 1        35
Repeat       96 – 127     PPR 2        32
Repeat       128 – 158    PPR 3        31
Repeat       159 – 193    PPR 4        35
Repeat       194 – 228    PPR 5        35
Repeat       229 – 259    PPR 6        31
Repeat       260 – 294    PPR 7        35
Repeat       295 – 329    PPR 8        35
Repeat       330 – 360    PPR 9        31
Repeat       361 – 395    PPR 10       35
Repeat       396 – 430    PPR 11       35
Repeat       431 – 465    PPR 12       35
Repeat       466 – 497    PPR 13       32
Repeat       498 – 532    PPR 14       35
Repeat       533 – 563    PPR 15       31
Repeat       564 – 598    PPR 16       35
Repeat       599 – 629    PPR 17       31
Repeat       635 – 665    PPR 18       31

Region

Feature key  Position(s)  Description      Length
Region       670 – 745    Type E motif     76
Region       746 – 776    Type E(+) motif  31
Region       777 – 871    Type DYW motif   95
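The repeat and region tables can be checked for internal consistency with a short script. The sketch below uses only the coordinates listed above; the function names are illustrative, and the check itself is a convenience for readers rather than anything the entry prescribes.

```python
PROTEIN_LENGTH = 871

# (name, start, end, listed length); positions are 1-based and inclusive.
FEATURES = [
    ("PPR 1", 60, 94, 35), ("PPR 2", 96, 127, 32), ("PPR 3", 128, 158, 31),
    ("PPR 4", 159, 193, 35), ("PPR 5", 194, 228, 35), ("PPR 6", 229, 259, 31),
    ("PPR 7", 260, 294, 35), ("PPR 8", 295, 329, 35), ("PPR 9", 330, 360, 31),
    ("PPR 10", 361, 395, 35), ("PPR 11", 396, 430, 35), ("PPR 12", 431, 465, 35),
    ("PPR 13", 466, 497, 32), ("PPR 14", 498, 532, 35), ("PPR 15", 533, 563, 31),
    ("PPR 16", 564, 598, 35), ("PPR 17", 599, 629, 31), ("PPR 18", 635, 665, 31),
    ("Type E motif", 670, 745, 76),
    ("Type E(+) motif", 746, 776, 31),
    ("Type DYW motif", 777, 871, 95),
]


def check_features(features, protein_length):
    """Return a list of problems: length mismatches, overlaps, out-of-range ends."""
    problems = []
    previous_end = 0
    for name, start, end, listed in features:
        if end - start + 1 != listed:
            problems.append(f"{name}: length mismatch")
        if start <= previous_end:
            problems.append(f"{name}: overlaps previous feature")
        if end > protein_length:
            problems.append(f"{name}: extends past the sequence end")
        previous_end = end
    return problems


def gaps(features):
    """Return (start, end) of unannotated stretches between consecutive features."""
    result = []
    for (_, _, prev_end, _), (_, next_start, _, _) in zip(features, features[1:]):
        if next_start > prev_end + 1:
            result.append((prev_end + 1, next_start - 1))
    return result


print(check_features(FEATURES, PROTEIN_LENGTH))  # empty list when the tables are consistent
print(gaps(FEATURES))
```

The gap list makes the few unannotated stretches between consecutive features visible at a glance; most PPR repeats in this entry are directly adjacent.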

Sequence similarities

Belongs to the PPR family. PCMP-H subfamily. (Curated)

Keywords - Domain

Repeat, Transit peptide

Phylogenomic databases

eggNOG: KOG4197. Eukaryota.
ENOG410Z7Z7. LUCA.
HOGENOM: HOG000237570.
InParanoid: Q9SN39.
OMA: CGEERSL.
OrthoDB: EOG093604DP.
PhylomeDB: Q9SN39.

Family and domain databases

Gene3D: 1.25.40.10. 4 hits.
InterPro:
IPR032867. DYW_dom.
IPR002885. Pentatricopeptide_repeat.
IPR011990. TPR-like_helical_dom_sf.
Pfam:
PF14432. DYW_deaminase. 1 hit.
PF01535. PPR. 3 hits.
PF13041. PPR_2. 4 hits.
TIGRFAMs: TIGR00756. PPR. 9 hits.
PROSITE:
PS51375. PPR. 18 hits.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q9SN39-1 [UniParc]

        10         20         30         40         50
MAMLVTNLSS SSFCFFSSPH LQNQKEIRSG VRVRKYVIFN RASLRTVSDC
        60         70         80         90        100
VDSITTFDRS VTDANTQLRR FCESGNLENA VKLLCVSGKW DIDPRTLCSV
       110        120        130        140        150
LQLCADSKSL KDGKEVDNFI RGNGFVIDSN LGSKLSLMYT NCGDLKEASR
       160        170        180        190        200
VFDEVKIEKA LFWNILMNEL AKSGDFSGSI GLFKKMMSSG VEMDSYTFSC
       210        220        230        240        250
VSKSFSSLRS VHGGEQLHGF ILKSGFGERN SVGNSLVAFY LKNQRVDSAR
       260        270        280        290        300
KVFDEMTERD VISWNSIING YVSNGLAEKG LSVFVQMLVS GIEIDLATIV
       310        320        330        340        350
SVFAGCADSR LISLGRAVHS IGVKACFSRE DRFCNTLLDM YSKCGDLDSA
       360        370        380        390        400
KAVFREMSDR SVVSYTSMIA GYAREGLAGE AVKLFEEMEE EGISPDVYTV
       410        420        430        440        450
TAVLNCCARY RLLDEGKRVH EWIKENDLGF DIFVSNALMD MYAKCGSMQE
       460        470        480        490        500
AELVFSEMRV KDIISWNTII GGYSKNCYAN EALSLFNLLL EEKRFSPDER
       510        520        530        540        550
TVACVLPACA SLSAFDKGRE IHGYIMRNGY FSDRHVANSL VDMYAKCGAL
       560        570        580        590        600
LLAHMLFDDI ASKDLVSWTV MIAGYGMHGF GKEAIALFNQ MRQAGIEADE
       610        620        630        640        650
ISFVSLLYAC SHSGLVDEGW RFFNIMRHEC KIEPTVEHYA CIVDMLARTG
       660        670        680        690        700
DLIKAYRFIE NMPIPPDATI WGALLCGCRI HHDVKLAEKV AEKVFELEPE
       710        720        730        740        750
NTGYYVLMAN IYAEAEKWEQ VKRLRKRIGQ RGLRKNPGCS WIEIKGRVNI
       760        770        780        790        800
FVAGDSSNPE TENIEAFLRK VRARMIEEGY SPLTKYALID AEEMEKEEAL
       810        820        830        840        850
CGHSEKLAMA LGIISSGHGK IIRVTKNLRV CGDCHEMAKF MSKLTRREIV
       860        870
LRDSNRFHQF KDGHCSCRGF W
Length: 871
Mass (Da): 97,698
Last modified: May 1, 2000 - v1
Checksum: C8AE13BF2589DF7A
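The sequence display above interleaves position rulers with rows of space-separated 10-residue blocks. A minimal sketch of turning that layout into a plain residue string is given below. The function name is hypothetical, and the sample is limited to the first two rows of the display, so the result covers residues 1 to 100 only.

```python
def parse_sequence_display(text):
    """Extract residues from a ruler-plus-blocks sequence display.

    Lines made up only of numbers are treated as position rulers and skipped;
    every other non-empty line must contain uppercase residue blocks.
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue  # blank line or position ruler
        for token in tokens:
            if not (token.isalpha() and token.isupper()):
                raise ValueError(f"unexpected token in sequence row: {token!r}")
            residues.append(token)
    return "".join(residues)


# First two rows of the display above (residues 1-100).
sample = """\
        10         20         30         40         50
MAMLVTNLSS SSFCFFSSPH LQNQKEIRSG VRVRKYVIFN RASLRTVSDC
        60         70         80         90        100
VDSITTFDRS VTDANTQLRR FCESGNLENA VKLLCVSGKW DIDPRTLCSV
"""

sequence = parse_sequence_display(sample)
print(len(sequence))

# Residues 1-28 are annotated as the chloroplast transit peptide.
transit_peptide = sequence[:28]
print(transit_peptide)
```

Applied to the full display, the same function should yield a string whose length can be compared with the listed length of 871, and slicing from index 28 onward gives the mature chain (positions 29 to 871).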

Sequence databases

EMBL / GenBank / DDBJ:
AL035526 Genomic DNA. Translation: CAB37460.1.
AL161549 Genomic DNA. Translation: CAB78877.1.
CP002687 Genomic DNA. Translation: AEE84086.1.
AK221529 mRNA. Translation: BAD94843.1.
PIR: T04867.
RefSeq: NP_193610.1. NM_117991.3.
UniGene: At.54400.

Genome annotation databases

EnsemblPlants: AT4G18750.1; AT4G18750.1; AT4G18750.
GeneID: 827609.
Gramene: AT4G18750.1; AT4G18750.1; AT4G18750.
KEGG: ath:AT4G18750.

Keywords - Coding sequence diversity

RNA editing

Similar proteins

Entry information

Entry name: PP320_ARATH
Accession: Primary (citable) accession number: Q9SN39
Secondary accession number(s): Q56XZ4
Entry history: Integrated into UniProtKB/Swiss-Prot: February 10, 2009
Last sequence update: May 1, 2000
Last modified: November 22, 2017
This is version 102 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Plant Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Arabidopsis thaliana
    Arabidopsis thaliana: entries and gene names
  2. SIMILARITY comments
    Index of protein domains and families