Protein

Carboxyl-terminal-processing peptidase 1, chloroplastic

Gene

CTPA1

Organism
Arabidopsis thaliana (Mouse-ear cress)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at protein level

Function

Protease involved in the C-terminal processing of the chloroplastic D1 protein of photosystem II. This proteolytic processing is necessary to allow the light-driven assembly of the tetranuclear manganese cluster, which is responsible for photosynthetic water oxidation. (By similarity)

Catalytic activity

The enzyme shows specific recognition of a C-terminal tripeptide, Xaa-Yaa-Zaa, in which Xaa is preferably Ala or Leu, Yaa is preferably Ala or Tyr, and Zaa is preferably Ala, but then cleaves at a variable distance from the C-terminus. A typical cleavage is -Ala-Ala-|-Arg-Ala-Ala-Lys-Glu-Asn-Tyr-Ala-Leu-Ala-Ala.
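The recognition rule above can be illustrated with a minimal sketch. It parses the entry's cleavage notation, reports how far the cut lies from the C-terminus, and checks the last three residues against the stated preferences. The function names and the small three-letter lookup table are hypothetical helpers introduced only for this illustration; the preferences are described in the entry as "preferably", so a mismatch does not rule a substrate out.

```python
# Three-letter to one-letter codes, limited to residues in the entry's example.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Glu": "E",
    "Leu": "L", "Lys": "K", "Tyr": "Y",
}

# Preferred residues for Xaa, Yaa, Zaa as stated in the entry.
PREFERRED = ({"A", "L"}, {"A", "Y"}, {"A"})


def parse_cleavage(notation):
    """Split '-Ala-Ala-|-Arg-...' into (one-letter peptide, cut index)."""
    tokens = [t for t in notation.split("-") if t]
    cut = tokens.index("|")  # number of residues before the scissile bond
    peptide = "".join(THREE_TO_ONE[t] for t in tokens if t != "|")
    return peptide, cut


def tripeptide_preferences(peptide):
    """Return (Xaa ok, Yaa ok, Zaa ok) for the C-terminal tripeptide."""
    tail = peptide[-3:]
    if len(tail) != 3:
        raise ValueError("need at least three residues")
    return tuple(res in allowed for res, allowed in zip(tail, PREFERRED))


TYPICAL = "-Ala-Ala-|-Arg-Ala-Ala-Lys-Glu-Asn-Tyr-Ala-Leu-Ala-Ala"
peptide, cut = parse_cleavage(TYPICAL)

print(peptide)                          # AARAAKENYALAA
print(len(peptide) - cut)               # 11 residues are released from the C-terminus
print(tripeptide_preferences(peptide))  # (True, True, True)
```

In the entry's example the C-terminal tripeptide is Leu-Ala-Ala, which satisfies all three preferences, while the cut itself lies eleven residues upstream of the C-terminus.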

Sites

Feature key  Position(s)  Description                          Length
Active site  403          Charge relay system (By similarity)  1
Active site  428          Charge relay system (By similarity)  1

GO - Molecular function

Keywords

Molecular function: Hydrolase, Protease, Serine protease

Protein family/group databases

MEROPS: S41.A02.

Names & Taxonomy

Protein names
Recommended name:
Carboxyl-terminal-processing peptidase 1, chloroplastic (EC:3.4.21.102)
Alternative name(s):
D1 C-terminal processing protease 1
Photosystem II D1 protein processing peptidase 1
Gene names
Name: CTPA1
Ordered Locus Names: At5g46390
ORF Names: MPL12.19
Organism: Arabidopsis thaliana (Mouse-ear cress)
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > eudicotyledons > Gunneridae > Pentapetalae > rosids > malvids > Brassicales > Brassicaceae > Camelineae > Arabidopsis
Proteomes
  • UP000006548 Component: Chromosome 5

Organism-specific databases

Araport: AT5G46390.
TAIR: locus:2170443. AT5G46390.

Subcellular location

GO - Cellular component

  • chloroplast thylakoid lumen Source: UniProtKB-SubCell
  • thylakoid lumen Source: TAIR

Keywords - Cellular component

Chloroplast, Plastid, Thylakoid

PTM / Processing

Molecule processing

Feature key      Position(s)  Description                                                               Length
Transit peptide  ? – 67       Thylakoid (1 Publication)
Transit peptide  1 – ?        Chloroplast (Sequence analysis)
Chain            68 – 489     Carboxyl-terminal-processing peptidase 1, chloroplastic (PRO_0000429321)  422

Proteomic databases

PaxDb: F4KHG6.

Expression

Gene expression databases

Genevisible: F4KHG6. AT.

Interaction

Protein-protein interaction databases

IntAct: F4KHG6. 1 interactor.
STRING: 3702.AT5G46390.2.

Structure

3D structure databases

ProteinModelPortal: F4KHG6.
SMR: F4KHG6.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description                         Length
Domain       189 – 273    PDZ (PROSITE-ProRule annotation)    85

Sequence similarities

Belongs to the peptidase S41A family. (Curated)

Keywords - Domain

Transit peptide

Phylogenomic databases

eggNOG: ENOG410IHNS. Eukaryota.
COG0793. LUCA.
HOGENOM: HOG000038766.
InParanoid: F4KHG6.
OMA: TFNQVDW.
OrthoDB: EOG09360861.

Family and domain databases

InterPro:
IPR029045. ClpP/crotonase-like_dom.
IPR001478. PDZ.
IPR004447. Peptidase_S41A.
IPR005151. Tail-specific_protease.
Pfam:
PF00595. PDZ. 1 hit.
PF03572. Peptidase_S41. 1 hit.
SMART:
SM00228. PDZ. 1 hit.
SM00245. TSPc. 1 hit.
SUPFAM:
SSF50156. SSF50156. 1 hit.
SSF52096. SSF52096. 2 hits.
TIGRFAMs:
TIGR00225. prc. 1 hit.
PROSITE:
PS50106. PDZ. 1 hit.

Sequences (2)

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

This entry describes 2 isoforms produced by alternative splicing.

Isoform 1 (identifier: F4KHG6-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.


        10         20         30         40         50
MRLLLPFSSP LSATSSPSTP QFIPELPPPS QFDYSGLTKI LKKSVIGTLT
        60         70         80         90        100
GALSLTLVFS SPISSVAATN DPYLSVNPPS SSFESSLNHF DSAPEDCPNE
       110        120        130        140        150
EEADTEIQDD DIEPQLVTNE GIVEEAWEIV NGAFLDTRSH SWTPETWQKQ
       160        170        180        190        200
KDDILASPIK SRSKAHEVIK NMLASLGDQY TRFLSPDEFS RMSKYDITGI
       210        220        230        240        250
GINLREVSDG GGNVKLKVLG LVLDSAADIA GVKQGDEILA VNGMDVSGKS
       260        270        280        290        300
SFEVSSLLQG PSKTFVVLKV KHGKCGPVKS LKIQRQVNAQ TPVSYRLEKV
       310        320        330        340        350
DNGTVSVGYI RLKEFNALAR KDLVIAMKRL LDKGASYFVM DLRDNLGGLV
       360        370        380        390        400
QAGIETAKLF LDEGDTVIYT AGRDPEAQKT VVSDKKPLIT APLIVMVNNR
       410        420        430        440        450
TASASEIVAS ALHDNCKAVL VGERTYGKGL IQSVYELRDG SGVVVTIGKY
       460        470        480
VTPNHMDING GGIEPDFRNL PAWDEVKERL SKCSILQQS
Length: 489
Mass (Da): 53,033
Last modified: June 28, 2011 - v1
Checksum: E421EF7DE41FBAB8
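Because all positional information in the entry refers to the canonical sequence, the annotated features can be cross-checked against it directly. The following is a minimal sketch: the helper names `residue_at` and `span` are hypothetical, and the only inputs are the sequence and the 1-based, inclusive positions listed in this entry.

```python
# Canonical isoform 1 (F4KHG6-1), copied from the entry in blocks of ten.
CANONICAL = (
    "MRLLLPFSSP LSATSSPSTP QFIPELPPPS QFDYSGLTKI LKKSVIGTLT "
    "GALSLTLVFS SPISSVAATN DPYLSVNPPS SSFESSLNHF DSAPEDCPNE "
    "EEADTEIQDD DIEPQLVTNE GIVEEAWEIV NGAFLDTRSH SWTPETWQKQ "
    "KDDILASPIK SRSKAHEVIK NMLASLGDQY TRFLSPDEFS RMSKYDITGI "
    "GINLREVSDG GGNVKLKVLG LVLDSAADIA GVKQGDEILA VNGMDVSGKS "
    "SFEVSSLLQG PSKTFVVLKV KHGKCGPVKS LKIQRQVNAQ TPVSYRLEKV "
    "DNGTVSVGYI RLKEFNALAR KDLVIAMKRL LDKGASYFVM DLRDNLGGLV "
    "QAGIETAKLF LDEGDTVIYT AGRDPEAQKT VVSDKKPLIT APLIVMVNNR "
    "TASASEIVAS ALHDNCKAVL VGERTYGKGL IQSVYELRDG SGVVVTIGKY "
    "VTPNHMDING GGIEPDFRNL PAWDEVKERL SKCSILQQS"
).replace(" ", "")


def residue_at(seq, pos):
    """Residue at a 1-based position, as used in the feature tables."""
    return seq[pos - 1]


def span(seq, start, end):
    """Subsequence for a 1-based, inclusive feature range."""
    return seq[start - 1:end]


active_sites = {pos: residue_at(CANONICAL, pos) for pos in (403, 428)}
mature_chain = span(CANONICAL, 68, 489)   # Chain PRO_0000429321
pdz_domain = span(CANONICAL, 189, 273)    # PDZ domain

print(len(CANONICAL))                      # 489
print(active_sites)                        # {403: 'S', 428: 'K'}
print(len(mature_chain), len(pdz_domain))  # 422 85
```

The serine at position 403 and the lysine at position 428 are consistent with the "Serine protease" keyword and the charge relay system annotation, which the entry assigns by similarity.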
Isoform 2 (identifier: F4KHG6-2)

The sequence of this isoform differs from the canonical sequence as follows:
     396-428: MVNNRTASASEIVASALHDNCKAVLVGERTYGK → CDESCKPVNLSHYYVILHLALIRILIIVAGNGK
     429-489: Missing.
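The two differences listed above are enough to derive the isoform 2 sequence from the canonical one. The sketch below assumes a hypothetical helper, `apply_variants`, that takes 1-based, inclusive ranges and applies them from the C-terminal end first so that earlier coordinates are not shifted; an empty replacement stands for "Missing".

```python
# Canonical isoform 1 (F4KHG6-1), copied from the entry in blocks of ten.
CANONICAL = (
    "MRLLLPFSSP LSATSSPSTP QFIPELPPPS QFDYSGLTKI LKKSVIGTLT "
    "GALSLTLVFS SPISSVAATN DPYLSVNPPS SSFESSLNHF DSAPEDCPNE "
    "EEADTEIQDD DIEPQLVTNE GIVEEAWEIV NGAFLDTRSH SWTPETWQKQ "
    "KDDILASPIK SRSKAHEVIK NMLASLGDQY TRFLSPDEFS RMSKYDITGI "
    "GINLREVSDG GGNVKLKVLG LVLDSAADIA GVKQGDEILA VNGMDVSGKS "
    "SFEVSSLLQG PSKTFVVLKV KHGKCGPVKS LKIQRQVNAQ TPVSYRLEKV "
    "DNGTVSVGYI RLKEFNALAR KDLVIAMKRL LDKGASYFVM DLRDNLGGLV "
    "QAGIETAKLF LDEGDTVIYT AGRDPEAQKT VVSDKKPLIT APLIVMVNNR "
    "TASASEIVAS ALHDNCKAVL VGERTYGKGL IQSVYELRDG SGVVVTIGKY "
    "VTPNHMDING GGIEPDFRNL PAWDEVKERL SKCSILQQS"
).replace(" ", "")

# (start, end, replacement) in 1-based inclusive coordinates; "" means missing.
ISOFORM_2_VARIANTS = [
    (396, 428, "CDESCKPVNLSHYYVILHLALIRILIIVAGNGK"),
    (429, 489, ""),
]


def apply_variants(seq, variants):
    """Apply replacements from the highest start position downwards."""
    out = seq
    for start, end, replacement in sorted(variants, reverse=True):
        out = out[:start - 1] + replacement + out[end:]
    return out


# The replaced segment in the canonical sequence matches the entry.
assert CANONICAL[395:428] == "MVNNRTASASEIVASALHDNCKAVLVGERTYGK"

ISOFORM_2 = apply_variants(CANONICAL, ISOFORM_2_VARIANTS)
print(len(ISOFORM_2))   # 428
print(ISOFORM_2[-5:])   # AGNGK
```

The derived length of 428 agrees with the length reported for isoform 2 in this entry. Note that the replaced segment covers both annotated active-site positions (403 and 428) of the canonical sequence.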

Length: 428
Mass (Da): 46,480
Checksum: 2A88D6B94BC67113

Sequence caution

The sequence BAB11094 differs from that shown. Reason: Erroneous gene model prediction. (Curated)

Alternative sequence

Feature key           Identifier  Position(s)  Description                                                                           Length
Alternative sequence  VSP_054870  396 – 428    MVNNR…RTYGK → CDESCKPVNLSHYYVILHLALIRILIIVAGNGK in isoform 2. (1 Publication)  33
Alternative sequence  VSP_054871  429 – 489    Missing in isoform 2. (1 Publication)                                                 61

Sequence databases

EMBL / GenBank / DDBJ:
AB010698 Genomic DNA. Translation: BAB11094.1. Sequence problems.
CP002688 Genomic DNA. Translation: AED95377.1.
CP002688 Genomic DNA. Translation: AED95378.1.
AY062767 mRNA. Translation: AAL32845.1.
AY081650 mRNA. Translation: AAM10212.1.
RefSeq: NP_199451.2. NM_124009.2. [F4KHG6-2]
NP_974893.1. NM_203164.3. [F4KHG6-1]
UniGene: At.9191.

Genome annotation databases

EnsemblPlants: AT5G46390.1; AT5G46390.1; AT5G46390. [F4KHG6-2]
AT5G46390.2; AT5G46390.2; AT5G46390. [F4KHG6-1]
GeneID: 834682.
Gramene: AT5G46390.1; AT5G46390.1; AT5G46390.
AT5G46390.2; AT5G46390.2; AT5G46390.
KEGG: ath:AT5G46390.

Keywords - Coding sequence diversity

Alternative splicing

Similar proteins

Entry information

Entry name: CTPA1_ARATH
Accession: Primary (citable) accession number: F4KHG6
Secondary accession number(s): Q8W484, Q9FL23
Entry history: Integrated into UniProtKB/Swiss-Prot: June 11, 2014
Last sequence update: June 28, 2011
Last modified: August 30, 2017
This is version 55 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Plant Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Arabidopsis thaliana
    Arabidopsis thaliana: entries and gene names
  2. Peptidase families
    Classification of peptidase families and list of entries
  3. SIMILARITY comments
    Index of protein domains and families