Protein

Cathepsin W

Gene

CTSW

Organism
Homo sapiens (Human)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at protein level

Function

May have a specific function in the mechanism or regulation of T-cell cytolytic activity.

Sites

Feature key    Position(s)    Length    Description
Active site    153 – 153      1         By similarity
Active site    291 – 291      1         By similarity
Active site    331 – 331      1         By similarity

GO - Molecular function

  1. cysteine-type peptidase activity Source: ProtInc

GO - Biological process

  1. immune response Source: ProtInc

Keywords - Molecular function

Hydrolase, Protease, Thiol protease

Protein family/group databases

MEROPS: C01.037.

Names & Taxonomy

Protein names
Recommended name: Cathepsin W (EC:3.4.22.-)
Alternative name(s): Lymphopain
Gene names
Name: CTSW
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes: UP000005640: Chromosome 11

Organism-specific databases

HGNC: HGNC:2546. CTSW.

Subcellular location

GO - Cellular component

  1. membrane Source: UniProtKB

Pathology & Biotech

Organism-specific databases

PharmGKB: PA27042.

PTM / Processing

Molecule processing

Feature key       Position(s)    Length    Description          Feature identifier
Signal peptide    1 – 21         21        1 Publication
Propeptide        22 – 127       106       Sequence Analysis    PRO_0000026327
Chain             128 – 376      249       Cathepsin W          PRO_0000026328

Amino acid modifications

Feature key       Position(s)    Length    Description
Glycosylation     50 – 50        1         N-linked (GlcNAc...). Sequence Analysis
Disulfide bond    150 ↔ 191                By similarity
Disulfide bond    184 ↔ 226                By similarity
Glycosylation     205 – 205      1         N-linked (GlcNAc...). Sequence Analysis
Disulfide bond    284 ↔ 352                By similarity

Keywords - PTM

Disulfide bond, Glycoprotein, Zymogen

Proteomic databases

MaxQB: P56202.
PaxDb: P56202.
PRIDE: P56202.

PTM databases

PhosphoSite: P56202.

Expression

Tissue specificity

Expressed in natural killer and cytotoxic T cells.

Gene expression databases

Bgee: P56202.
CleanEx: HS_CTSW.
ExpressionAtlas: P56202. baseline and differential.
Genevestigator: P56202.

Organism-specific databases

HPA: CAB016345.

Interaction

Protein-protein interaction databases

BioGrid: 107901. 1 interaction.
IntAct: P56202. 1 interaction.
STRING: 9606.ENSP00000311300.

Structure

3D structure databases

ProteinModelPortal: P56202.
SMR: P56202. Positions 49-363.

Family & Domains

Sequence similarities

Belongs to the peptidase C1 family. (PROSITE-ProRule annotation)

Keywords - Domain

Signal

Phylogenomic databases

eggNOG: NOG288820.
GeneTree: ENSGT00760000118871.
HOGENOM: HOG000230774.
HOVERGEN: HBG100117.
InParanoid: P56202.
KO: K08569.
OMA: NCNCCWA.
PhylomeDB: P56202.
TreeFam: TF337736.

Family and domain databases

InterPro: IPR025661. Pept_asp_AS.
          IPR025660. Pept_his_AS.
          IPR013128. Peptidase_C1A.
          IPR000668. Peptidase_C1A_C.
          IPR013201. Prot_inhib_I29.
PANTHER:  PTHR12411. PTHR12411. 1 hit.
Pfam:     PF08246. Inhibitor_I29. 1 hit.
          PF00112. Peptidase_C1. 1 hit.
PRINTS:   PR00705. PAPAIN.
SMART:    SM00848. Inhibitor_I29. 1 hit.
          SM00645. Pept_C1. 1 hit.
PROSITE:  PS00640. THIOL_PROTEASE_ASN. 1 hit.
          PS00639. THIOL_PROTEASE_HIS. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P56202-1 [UniParc]

        10         20         30         40         50
MALTAHPSCL LALLVAGLAQ GIRGPLRAQD LGPQPLELKE AFKLFQIQFN
        60         70         80         90        100
RSYLSPEEHA HRLDIFAHNL AQAQRLQEED LGTAEFGVTP FSDLTEEEFG
       110        120        130        140        150
QLYGYRRAAG GVPSMGREIR SEEPEESVPF SCDWRKVASA ISPIKDQKNC
       160        170        180        190        200
NCCWAMAAAG NIETLWRISF WDFVDVSVQE LLDCGRCGDG CHGGFVWDAF
       210        220        230        240        250
ITVLNNSGLA SEKDYPFQGK VRAHRCHPKK YQKVAWIQDF IMLQNNEHRI
       260        270        280        290        300
AQYLATYGPI TVTINMKPLQ LYRKGVIKAT PTTCDPQLVD HSVLLVGFGS
       310        320        330        340        350
VKSEEGIWAE TVSSQSQPQP PHPTPYWILK NSWGAQWGEK GYFRLHRGSN
       360        370
TCGITKFPLT ARVQKPDMKP RVSCPP
Length: 376
Mass (Da): 42,120
Last modified: September 22, 2009 - v2
Checksum: D2956524D17B593A
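
The feature tables in this entry give positions as 1-based, inclusive ranges on the displayed sequence. As a minimal sketch (the helper name `region` and the variable names are illustrative, not part of any UniProt tooling), the following Python snippet slices the sequence into the annotated signal peptide, propeptide and mature chain, and looks up the residues at the annotated active-site and disulfide positions:

```python
# The 376-residue sequence of P56202-1, copied from the entry in blocks of 50.
SEQUENCE = (
    "MALTAHPSCLLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFN"
    "RSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFG"
    "QLYGYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNC"
    "NCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAF"
    "ITVLNNSGLASEKDYPFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRI"
    "AQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGS"
    "VKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEKGYFRLHRGSN"
    "TCGITKFPLTARVQKPDMKPRVSCPP"
)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive), as in the feature tables."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


# Molecule processing features listed in the entry.
signal_peptide = region(SEQUENCE, 1, 21)
propeptide = region(SEQUENCE, 22, 127)
mature_chain = region(SEQUENCE, 128, 376)

# Residues found at the annotated active sites and disulfide-bonded positions.
active_site_residues = {pos: SEQUENCE[pos - 1] for pos in (153, 291, 331)}
disulfide_pairs = [(150, 191), (184, 226), (284, 352)]
disulfide_residues = [(SEQUENCE[a - 1], SEQUENCE[b - 1]) for a, b in disulfide_pairs]

print(len(SEQUENCE), len(signal_peptide), len(propeptide), len(mature_chain))
print(active_site_residues)
print(disulfide_residues)
```

The three slices have lengths 21, 106 and 249, matching the Length column of the Molecule processing table. The active-site positions hold Cys, His and Asn, and every disulfide position holds Cys.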

Experimental Info

Feature key          Position(s)    Length    Description
Sequence conflict    179 – 179      1         Q → H in AAB82449. (PubMed:9823953) Curated
Sequence conflict    179 – 179      1         Q → H in AAB82457. (PubMed:9823953) Curated
Sequence conflict    179 – 179      1         Q → H in AAC32181. (PubMed:9675123) Curated

Natural variant

Feature key        Position(s)    Length    Description                                                Feature identifier
Natural variant    139 – 139      1         S → G. 4 Publications. Corresponds to variant rs604630.    VAR_058847
Natural variant    218 – 218      1         Q → R. Corresponds to variant rs606830.                    VAR_057041
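
The variant notation above (for example "S → G" at position 139) describes a single-residue substitution at a 1-based position of the displayed sequence. The sketch below applies such a substitution to a short window of the sequence; `apply_variant` and its parameters are hypothetical names chosen for illustration, and the two windows are residues 131-140 and 211-220 copied from the Sequence section.

```python
def apply_variant(window: str, window_start: int, position: int, ref: str, alt: str) -> str:
    """Substitute ref -> alt at a 1-based protein position inside a sequence window.

    window_start is the protein position of the first residue in the window.
    """
    index = position - window_start
    if not 0 <= index < len(window):
        raise ValueError(f"position {position} is outside the window")
    if window[index] != ref:
        raise ValueError(
            f"expected {ref} at position {position}, found {window[index]}"
        )
    return window[:index] + alt + window[index + 1:]


# Residues 131-140 with the S -> G variant at position 139 (VAR_058847).
print(apply_variant("SCDWRKVASA", 131, 139, "S", "G"))
# Residues 211-220 with the Q -> R variant at position 218 (VAR_057041).
print(apply_variant("SEKDYPFQGK", 211, 218, "Q", "R"))
```

Checking the reference residue before substituting guards against off-by-one errors when mixing 1-based feature coordinates with 0-based string indices.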

Sequence databases

EMBL / GenBank / DDBJ
AF013611 mRNA. Translation: AAB82449.1.
AF015954 Genomic DNA. Translation: AAB82457.1.
AF055903 Genomic DNA. Translation: AAC32181.1.
AP001201 Genomic DNA. No translation available.
BC048255 mRNA. Translation: AAH48255.1.
CCDS: CCDS8117.1.
RefSeq: NP_001326.2. NM_001335.3.
UniGene: Hs.416848.

Genome annotation databases

Ensembl: ENST00000307886; ENSP00000311300; ENSG00000172543.
GeneID: 1521.
KEGG: hsa:1521.
UCSC: uc001ogc.1. human.

Polymorphism databases

DMDM: 259016196.

Keywords - Coding sequence diversity

Polymorphism

Cross-references

Organism-specific databases

CTD: 1521.
GeneCards: GC11P065647.
HGNC: HGNC:2546. CTSW.
HPA: CAB016345.
MIM: 602364. gene.
neXtProt: NX_P56202.
PharmGKB: PA27042.

Miscellaneous databases

GeneWiki: Cathepsin_W.
GenomeRNAi: 1521.
NextBio: 6295.
PRO: P56202.

Publications

  1. "Lymphopain, a cytotoxic T and natural killer cell-associated cysteine proteinase."
    Brown J., Matutes E., Singleton A., Price C., Molgaard H., Buttle D., Enver T.
    Leukemia 12:1771-1781(1998)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA], VARIANT GLY-139.
  2. "Human cathepsin W, a putative cysteine protease predominantly expressed in CD8+ T-lymphocytes."
    Linnevers C., Smeekens S.P., Broemme D.
    FEBS Lett. 405:253-259(1997)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA], VARIANT GLY-139.
  3. "Genomic structure, chromosomal localization, and expression of human cathepsin W."
    Wex T., Levy B., Smeekens S.P., Ansorge S., Desnick R.J., Bromme D.
    Biochem. Biophys. Res. Commun. 248:255-261(1998)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANT GLY-139.
  4. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  5. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA], VARIANT GLY-139.
    Tissue: Pancreas.
  6. "Signal peptide prediction based on analysis of experimentally verified cleavage sites."
    Zhang Z., Henzel W.J.
    Protein Sci. 13:2819-2824(2004)
    Cited for: PROTEIN SEQUENCE OF 22-36.

Entry information

Entry name: CATW_HUMAN
Accession
Primary (citable) accession number: P56202
Secondary accession number(s): Q86VT4
Entry history
Integrated into UniProtKB/Swiss-Prot: November 1, 1997
Last sequence update: September 22, 2009
Last modified: January 7, 2015
This is version 116 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Human chromosome 11
    Human chromosome 11: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. Peptidase families
    Classification of peptidase families and list of entries
  6. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
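
To make the seed-based clustering rule concrete, here is a toy Python sketch. It is not the UniRef pipeline: the function names are hypothetical, identity is approximated with `difflib.SequenceMatcher` instead of a real sequence alignment, and overlap is approximated as the length ratio between a sequence and its seed. It only illustrates the idea of assigning each sequence to the longest seed that meets an identity and an overlap threshold.

```python
from difflib import SequenceMatcher


def identity_and_overlap(seed: str, candidate: str) -> tuple[float, float]:
    """Crude stand-ins for alignment identity and overlap (assumes non-empty input).

    identity = matched residues / candidate length
    overlap  = candidate length / seed length
    """
    matcher = SequenceMatcher(None, seed, candidate, autojunk=False)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    return matched / len(candidate), len(candidate) / len(seed)


def greedy_cluster(sequences, min_identity, min_overlap=0.8):
    """Process sequences longest first; each joins the first seed it satisfies."""
    clusters: dict[str, list[str]] = {}
    for seq in sorted(sequences, key=len, reverse=True):
        for seed in clusters:
            identity, overlap = identity_and_overlap(seed, seq)
            if identity >= min_identity and overlap >= min_overlap:
                clusters[seed].append(seq)
                break
        else:
            # No existing seed is similar enough, so this sequence seeds a new cluster.
            clusters[seq] = [seq]
    return clusters


# Made-up example sequences: a seed, a one-residue variant, a short fragment,
# and an unrelated sequence of the same length as the seed.
example = [
    "ACDEFGHIKLMNPQRSTVWY",
    "ACDEFGHIKLMNPQRSTVWA",
    "ACDEFG",
    "W" * 20,
]
for seed, members in greedy_cluster(example, min_identity=0.9).items():
    print(seed, len(members))
```

In this example the one-residue variant joins the first seed, while the short fragment fails the 80% overlap requirement and the unrelated sequence fails the identity requirement, so each of those starts its own cluster. Re-running `greedy_cluster` on the resulting seeds with `min_identity=0.5` mirrors, in spirit, how the 50% level is described as being built from the 90% seeds.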