Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot Q9Y4P1 (ATG4B_HUMAN)

Last modified November 4, 2008. Version 68. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (8) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Binary interactions · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Cysteine protease ATG4B
    EC=3.4.22.-
Alternative name(s):
    Autophagy-related protein 4 homolog B
      Short name=hAPG4B
    Autophagin-1
    Autophagy-related cysteine endopeptidase 1
    AUT-like 1 cysteine endopeptidase
Gene names
Name: ATG4B
Synonyms: APG4B, AUTL1, KIAA0943
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length393 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Cysteine protease required for autophagy, which cleaves the C-terminal part of either MAP1LC3, GABARAPL2 or GABARAP, allowing the liberation of form I. A subpopulation of form I is subsequently converted to a smaller form (form II). Form II, with a revealed C-terminal glycine, is considered to be the phosphatidylethanolamine (PE)-conjugated form, and has the capacity for the binding to autophagosomes.

Enzyme regulation

Inhibited by N-ethylmaleimide.

Subcellular location

CytoplasmProbable.

Tissue specificity

Mainly expressed in the skeletal muscle, followed by brain, heart, liver and pancreas.

Sequence similarities

Belongs to the peptidase C54 family.

Binary interactions

With

Entry

#Exp.

IntAct

Notes

GABARAPL2P605201EBI-712014,EBI-720116

Alternative products

This entry describes 5 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q9Y4P1-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q9Y4P1-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-1: M → MAHSVPSDSR...GSVGGRTGKM
     370-393: DSSDVERLERFFDSEDEDFEILSL → GESCQVQILLM
Notes: No experimental confirmation available.
Isoform 3 (identifier: Q9Y4P1-3)

The sequence of this isoform differs from the canonical sequence as follows:
     1-74: Missing.
     321-354: FFCKTEDDFNDWCQQVKKLSLLGGALPMFELVEQ → KQGRLVRSLIPWAPRPSSWCAAVLGAAVVMCGTP
     355-393: Missing.
Notes: No experimental confirmation available.
Isoform 4 (identifier: Q9Y4P1-4)

The sequence of this isoform differs from the canonical sequence as follows:
     1-74: Missing.
     369-369: L → LGESCQVQVGSLG
Isoform 6 (identifier: Q9Y4P1-6)

The sequence of this isoform differs from the canonical sequence as follows:
     370-393: DSSDVERLERFFDSEDEDFEILSL → GESCQVQILLM
Notes: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 393393Cysteine protease ATG4B
PRO_0000215844

Sites

Active site741Nucleophile
Active site2781 Potential
Active site2801

Amino acid modifications

Modified residue341Phosphoserine By similarity
Modified residue3161Phosphoserine
Modified residue3831Phosphoserine
Modified residue3921Phosphoserine

Natural variations

Alternative sequence1 – 7474Missing in isoform 3 and isoform 4.
VSP_013028
Alternative sequence11M → MAHSVPSDSRTSRRPTTRPH AARGAPRGSRRPGRTPKWRL PRISARAPYRLRRLRRHTYW PPRRPVAASRCWPVGATPLG SVGGRTGKM in isoform 2.
VSP_013029
Alternative sequence321 – 35434FFCKT…ELVEQ → KQGRLVRSLIPWAPRPSSWC AAVLGAAVVMCGTP in isoform 3.
VSP_013031
Alternative sequence355 – 39339Missing in isoform 3.
VSP_013032
Alternative sequence3691L → LGESCQVQVGSLG in isoform 4.
VSP_013033
Alternative sequence370 – 39324DSSDV…EILSL → GESCQVQILLM in isoform 2 and isoform 6.
VSP_013034
Natural variant3541Q → L: dbSNP rs7601000.
VAR_021486

Experimental info

Mutagenesis741C → S: Complete loss of protease activity
Mutagenesis1421W → A: Strongly reduced protease activity
Mutagenesis2291R → A: Strongly reduced protease activity
Mutagenesis2781D → A: Complete loss of protease activity
Mutagenesis2801H → A: Complete loss of protease activity
Sequence conflict1361Missing in BAB83890. Ref.2
Sequence conflict1881P → L in BAB55127. Ref.5
Sequence conflict2471F → Y in BAB55353. Ref.5
Sequence conflict2731E → G in BAB55042. Ref.5
Sequence conflict3121E → N in BAB83890. Ref.2

Secondary structure

................................................................ 393
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified November 1, 1999. Version 1.
Checksum: 7E850A192176E0F5

FASTA39344,309
        10         20         30         40         50         60 
MDAATLTYDT LRFAEFEDFP ETSEPVWILG RKYSIFTEKD EILSDVASRL WFTYRKNFPA 

        70         80         90        100        110        120 
IGGTGPTSDT GWGCMLRCGQ MIFAQALVCR HLGRDWRWTQ RKRQPDSYFS VLNAFIDRKD 

       130        140        150        160        170        180 
SYYSIHQIAQ MGVGEGKSIG QWYGPNTVAQ VLKKLAVFDT WSSLAVHIAM DNTVVMEEIR 

       190        200        210        220        230        240 
RLCRTSVPCA GATAFPADSD RHCNGFPAGA EVTNRPSPWR PLVLLIPLRL GLTDINEAYV 

       250        260        270        280        290        300 
ETLKHCFMMP QSLGVIGGKP NSAHYFIGYV GEELIYLDPH TTQPAVEPTD GCFIPDESFH 

       310        320        330        340        350        360 
CQHPPCRMSI AELDPSIAVG FFCKTEDDFN DWCQQVKKLS LLGGALPMFE LVEQQPSHLA 

       370        380        390 
CPDVLNLSLD SSDVERLERF FDSEDEDFEI LSL 

« Hide

Isoform 2 [UniParc].

Checksum: F97302AD8D6960D6
Show »

46852,544
Isoform 3 [UniParc].

Checksum: 0904A795A1B596B3
Show »

28031,042
Isoform 4 [UniParc].

Checksum: 97592E31055D7882
Show »

33137,052
Isoform 6 [UniParc].

Checksum: 481414A271E7533D
Show »

38042,637

References

« Hide 'large scale' references
[1]"Human autophagins, a family of cysteine proteinases potentially implicated in cell degradation by autophagy."
Marino G., Uria J.A., Puente X.S., Quesada V., Bordallo J., Lopez-Otin C.
J. Biol. Chem. 278:3671-3678(2003) [PubMed: 12446702] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), TISSUE SPECIFICITY.
Tissue: Liver.
[2]"LC3, GABARAP and GATE16 localize to autophagosomal membrane depending on form-II formation."
Kabeya Y., Mizushima N., Yamamoto A., Oshitani-Okamoto S., Ohsumi Y., Yoshimori T.
J. Cell Sci. 117:2805-2812(2004) [PubMed: 15169837] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), TISSUE SPECIFICITY, FUNCTION IN GABARAPL2; GABARAP AND MAP1LC3A CLEAVAGE, MUTAGENESIS OF CYS-74.
[3]"Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro."
Nagase T., Ishikawa K., Suyama M., Kikuno R., Hirosawa M., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
DNA Res. 6:63-70(1999) [PubMed: 10231032] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Tissue: Brain.
[4]Ohara O., Nagase T., Kikuno R.
Submitted (AUG-2005) to the EMBL/GenBank/DDBJ databases
Cited for: SEQUENCE REVISION.
[5]"Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K.,