Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

Q0WNJ6 (CLAH1_ARATH) Reviewed, UniProtKB/Swiss-Prot

Last modified June 11, 2014. Version 72. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Interactions·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Clathrin heavy chain 1
Gene names
Name:CHC1
Ordered Locus Names:At3g11130
ORF Names:F11B9.30, F9F8.6
OrganismArabidopsis thaliana (Mouse-ear cress) [Reference proteome]
Taxonomic identifier3702 [NCBI]
Taxonomic lineageEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonsGunneridaePentapetalaerosidsmalvidsBrassicalesBrassicaceaeCamelineaeArabidopsis

Protein attributes

Sequence length1705 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Clathrin is the major protein of the polyhedral coat of coated pits and vesicles By similarity. Mediates endocytosis and is required for a correct polar distribution of PIN auxin transporters. Ref.4

Subunit structure

Clathrin triskelions, composed of 3 heavy chains and 3 light chains, are the basic subunits of the clathrin coat By similarity.

Subcellular location

Cytoplasmic vesicle membrane; Peripheral membrane protein; Cytoplasmic side By similarity. Membranecoated pit; Peripheral membrane protein; Cytoplasmic side By similarity. Note: Cytoplasmic face of coated pits and vesicles By similarity.

Domain

The C-terminal third of the heavy chains forms the hub of the triskelion. This region contains the trimerization domain and the light-chain binding domain involved in the assembly of the clathrin lattice.

The N-terminal seven-bladed beta-propeller is formed by WD40-like repeats, and projects inward from the polyhedral outer clathrin coat. It consitutes a major protein-protein interaction node By similarity.

Sequence similarities

Belongs to the clathrin heavy chain family.

Contains 7 CHCR (clathrin heavy-chain) repeats.

Sequence caution

The sequence AAF01510.1 differs from that shown. Reason: Erroneous gene model prediction.

The sequence AAG50967.1 differs from that shown. Reason: Erroneous gene model prediction.

Binary interactions

With

Entry

#Exp.

IntAct

Notes

EPSIN1Q8VY073EBI-1162845,EBI-1162785

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Initiator methionine11Removed Ref.5
Chain2 – 17051704Clathrin heavy chain 1
PRO_0000413949

Regions

Repeat551 – 697147CHCR 1
Repeat700 – 842143CHCR 2
Repeat847 – 986140CHCR 3
Repeat993 – 1138146CHCR 4
Repeat1142 – 1283142CHCR 5
Repeat1288 – 1434147CHCR 6
Repeat1437 – 1580144CHCR 7
Region2 – 492491Globular terminal domain By similarity
Region25 – 6743WD40-like repeat 1
Region68 – 11346WD40-like repeat 2
Region114 – 15542WD40-like repeat 3
Region156 – 20550WD40-like repeat 4
Region206 – 27065WD40-like repeat 5
Region271 – 31444WD40-like repeat 6
Region315 – 34329WD40-like repeat 7
Region462 – 47817Binding site for the uncoating ATPase, involved in lattice disassembly By similarity
Region493 – 53644Flexible linker By similarity
Region537 – 17051169Heavy chain arm By similarity
Region537 – 648112Distal segment By similarity
Region653 – 17051053Proximal segment By similarity
Region1227 – 1536310Involved in binding clathrin light chain By similarity
Region1564 – 1705142Trimerization By similarity

Amino acid modifications

Modified residue21N-acetylalanine Ref.5

Sequences

Sequence LengthMass (Da)Tools
Q0WNJ6 [UniParc].

Last modified September 5, 2006. Version 1.
Checksum: 0850CAE77FE17616

FASTA1,705193,245
        10         20         30         40         50         60 
MAAANAPIIM KEVLTLPSVG IGQQFITFTN VTMESDKYIC VRETAPQNSV VIIDMNMPMQ 

        70         80         90        100        110        120 
PLRRPITADS ALMNPNSRIL ALKAQVPGTT QDHLQIFNIE AKAKLKSHQM PEQVAFWKWI 

       130        140        150        160        170        180 
TPKMLGLVTQ TSVYHWSIEG DSEPVKMFDR TANLANNQII NYKCSPNEKW LVLIGIAPGS 

       190        200        210        220        230        240 
PERPQLVKGN MQLFSVDQQR SQALEAHAAS FAQFKVPGNE NPSILISFAS KSFNAGQITS 

       250        260        270        280        290        300 
KLHVIELGAQ PGKPSFTKKQ ADLFFPPDFA DDFPVAMQVS HKFNLIYVIT KLGLLFVYDL 

       310        320        330        340        350        360 
ETASAIYRNR ISPDPIFLTS EASSVGGFYA INRRGQVLLA TVNEATIIPF ISGQLNNLEL 

       370        380        390        400        410        420 
AVNLAKRGNL PGAENLVVQR FQELFAQTKY KEAAELAAES PQGILRTPDT VAKFQSVPVQ 

       430        440        450        460        470        480 
AGQTPPLLQY FGTLLTRGKL NSYESLELSR LVVNQNKKNL LENWLAEDKL ECSEELGDLV 

       490        500        510        520        530        540 
KTVDNDLALK IYIKARATPK VVAAFAERRE FDKILIYSKQ VGYTPDYMFL LQTILRTDPQ 

       550        560        570        580        590        600 
GAVNFALMMS QMEGGCPVDY NTITDLFLQR NLIREATAFL LDVLKPNLPE HAFLQTKVLE 

       610        620        630        640        650        660 
INLVTFPNVA DAILANGMFS HYDRPRVAQL CEKAGLYIQS LKHYSELPDI KRVIVNTHAI 

       670        680        690        700        710        720 
EPQALVEFFG TLSSEWAMEC MKDLLLVNLR GNLQIIVQAC KEYCEQLGVD ACIKLFEQFK 

       730        740        750        760        770        780 
SYEGLYFFLG SYLSMSEDPE IHFKYIEAAA KTGQIKEVER VTRESNFYDA EKTKNFLMEA 

       790        800        810        820        830        840 
KLPDARPLIN VCDRFGFVPD LTHYLYTNNM LRYIEGYVQK VNPGNAPLVV GQLLDDECPE 

       850        860        870        880        890        900 
DFIKGLILSV RSLLPVEPLV AECEKRNRLR LLTQFLEHLV SEGSQDVHVH NALGKIIIDS 

       910        920        930        940        950        960 
NNNPEHFLTT NPYYDSKVVG KYCEKRDPTL AVVAYRRGQC DEELINVTNK NSLFKLQARY 

       970        980        990       1000       1010       1020 
VVERMDGDLW EKVLTEENEY RRQLIDQVVS TALPESKSPE QVSAAVKAFM TADLPHELIE 

      1030       1040       1050       1060       1070       1080 
LLEKIVLQNS AFSGNFNLQN LLILTAIKAD PSRVMDYINR LDNFDGPAVG EVAVDAQLYE 

      1090       1100       1110       1120       1130       1140 
EAFAIFKKFN LNVQAVNVLL DNVRSIERAV EFAFRVEEDA VWSQVAKAQL REGLVSDAIE 

      1150       1160       1170       1180       1190       1200 
SFIRADDTTQ FLEVIRASED TNVYDDLVRY LLMVRQKVKE PKVDSELIYA YAKIERLGEI 

      1210       1220       1230       1240       1250       1260 
EEFILMPNVA NLQHVGDRLY DEALYEAAKI IYAFISNWAK LAVTLVKLQQ FQGAVDAARK 

      1270       1280       1290       1300       1310       1320 
ANSAKTWKEV CFACVDAEEF RLAQICGLNI IIQVDDLEEV SEYYQNRGCF NELISLMESG 

      1330       1340       1350       1360       1370       1380 
LGLERAHMGI FTELGVLYAR YRYEKLMEHI KLFSTRLNIP KLIRACDEQQ HWQELTYLYI 

      1390       1400       1410       1420       1430       1440 
QYDEFDNAAT TVMNHSPEAW EHMQFKDIVA KVANVELYYK AVHFYLQEHP DIINDLLNVL 

      1450       1460       1470       1480       1490       1500 
ALRLDHTRVV DIMRKAGHLR LIKPYMVAVQ SNNVSAVNEA LNEIYAEEED YDRLRESIDL 

      1510       1520       1530       1540       1550       1560 
HDSFDQIGLA QKIEKHELVE MRRVAAYIYK KAGRWKQSIA LSKKDNMYKD CMETASQSGD 

      1570       1580       1590       1600       1610       1620 
HDLAEQLLVY FIEQGKKECF ATCLFVCYDL IRPDVALELA WINNMIDFAF PYLLQFIREY 

      1630       1640       1650       1660       1670       1680 
SGKVDELIKD KLEAQKEVKA KEQEEKDVMS QQNMYAQLLP LALPAPPMPG MGGGGYGPPP 

      1690       1700 
QMGGMPGMSG MPPMPPYGMP PMGGY 

« Hide

References

« Hide 'large scale' references
[1]"Sequence and analysis of chromosome 3 of the plant Arabidopsis thaliana."
Salanoubat M., Lemcke K., Rieger M., Ansorge W., Unseld M., Fartmann B., Valle G., Bloecker H., Perez-Alonso M., Obermaier B., Delseny M., Boutry M., Grivell L.A., Mache R., Puigdomenech P., De Simone V., Choisne N., Artiguenave F. expand/collapse author list , Robert C., Brottier P., Wincker P., Cattolico L., Weissenbach J., Saurin W., Quetier F., Schaefer M., Mueller-Auer S., Gabel C., Fuchs M., Benes V., Wurmbach E., Drzonek H., Erfle H., Jordan N., Bangert S., Wiedelmann R., Kranz H., Voss H., Holland R., Brandt P., Nyakatura G., Vezzi A., D'Angelo M., Pallavicini A., Toppo S., Simionati B., Conrad A., Hornischer K., Kauer G., Loehnert T.-H., Nordsiek G., Reichelt J., Scharfe M., Schoen O., Bargues M., Terol J., Climent J., Navarro P., Collado C., Perez-Perez A., Ottenwaelder B., Duchemin D., Cooke R., Laudie M., Berger-Llauro C., Purnelle B., Masuy D., de Haan M., Maarse A.C., Alcaraz J.-P., Cottet A., Casacuberta E., Monfort A., Argiriou A., Flores M., Liguori R., Vitale D., Mannhaupt G., Haase D., Schoof H., Rudd S., Zaccaria P., Mewes H.-W., Mayer K.F.X., Kaul S., Town C.D., Koo H.L., Tallon L.J., Jenkins J., Rooney T., Rizzo M., Walts A., Utterback T., Fujii C.Y., Shea T.P., Creasy T.H., Haas B., Maiti R., Wu D., Peterson J., Van Aken S., Pai G., Militscher J., Sellers P., Gill J.E., Feldblyum T.V., Preuss D., Lin X., Nierman W.C., Salzberg S.L., White O., Venter J.C., Fraser C.M., Kaneko T., Nakamura Y., Sato S., Kato T., Asamizu E., Sasamoto S., Kimura T., Idesawa K., Kawashima K., Kishida Y., Kiyokawa C., Kohara M., Matsumoto M., Matsuno A., Muraki A., Nakayama S., Nakazaki N., Shinpo S., Takeuchi C., Wada T., Watanabe A., Yamada M., Yasuda M., Tabata S.
Nature 408:820-822(2000) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[2]The Arabidopsis Information Resource (TAIR)
Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
Cited for: GENOME REANNOTATION.
Strain: cv. Columbia.
[3]"Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs."
Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A., Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y., Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K. expand/collapse author list , Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y., Shinozaki K.
Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Strain: cv. Columbia.
[4]"Clathrin mediates endocytosis and polar distribution of PIN auxin transporters in Arabidopsis."
Kitakura S., Vanneste S., Robert S., Loefke C., Teichmann T., Tanaka H., Friml J.
Plant Cell 23:1920-1931(2011) [PubMed] [Europe PMC] [Abstract]
Cited for: FUNCTION.
[5]"Comparative large-scale characterisation of plant vs. mammal proteins reveals similar and idiosyncratic N-alpha acetylation features."
Bienvenut W.V., Sumpton D., Martinez A., Lilla S., Espagne C., Meinnel T., Giglione C.
Mol. Cell. Proteomics 11:M111.015131-M111.015131(2012) [PubMed] [Europe PMC] [Abstract]
Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT ALA-2, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS], CLEAVAGE OF INITIATOR METHIONINE [LARGE SCALE ANALYSIS].
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AC009991 Genomic DNA. Translation: AAF01510.1. Sequence problems.
AC073395 Genomic DNA. Translation: AAG50967.1. Sequence problems.
CP002686 Genomic DNA. Translation: AEE75005.1.
AK229443 mRNA. Translation: BAF01303.1.
AK229949 mRNA. Translation: BAF01775.1.
RefSeqNP_187724.2. NM_111950.2.
UniGeneAt.17332.
At.26828.

3D structure databases

ProteinModelPortalQ0WNJ6.
ModBaseSearch...
MobiDBSearch...

Protein-protein interaction databases

BioGrid5618. 7 interactions.
IntActQ0WNJ6. 3 interactions.
STRING3702.AT3G11130.1-P.

Proteomic databases

PRIDEQ0WNJ6.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblPlantsAT3G11130.1; AT3G11130.1; AT3G11130.
GeneID820284.
KEGGath:AT3G11130.

Organism-specific databases

TAIRAT3G11130.

Phylogenomic databases

eggNOGNOG314149.
HOGENOMHOG000188877.
InParanoidQ0WNJ6.
KOK04646.
OMALAGCQMI.
PhylomeDBQ0WNJ6.

Gene expression databases

GenevestigatorQ0WNJ6.

Family and domain databases

Gene3D1.25.40.10. 3 hits.
2.130.10.110. 1 hit.
InterProIPR016024. ARM-type_fold.
IPR000547. Clathrin_H-chain/VPS_repeat.
IPR016025. Clathrin_H-chain_link/propller.
IPR015348. Clathrin_H-chain_linker_core.
IPR001473. Clathrin_H-chain_propeller_N.
IPR022365. Clathrin_H-chain_propeller_rpt.
IPR016341. Clathrin_heavy_chain.
IPR011990. TPR-like_helical.
[Graphical view]
PfamPF00637. Clathrin. 7 hits.
PF09268. Clathrin-link. 1 hit.
PF01394. Clathrin_propel. 2 hits.
[Graphical view]
PIRSFPIRSF002290. Clathrin_H_chain. 1 hit.
SMARTSM00299. CLH. 7 hits.
[Graphical view]
SUPFAMSSF48371. SSF48371. 5 hits.
SSF50989. SSF50989. 1 hit.
PROSITEPS50236. CHCR. 7 hits.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameCLAH1_ARATH
AccessionPrimary (citable) accession number: Q0WNJ6
Secondary accession number(s): Q0WM81, Q9SRM1
Entry history
Integrated into UniProtKB/Swiss-Prot: November 16, 2011
Last sequence update: September 5, 2006
Last modified: June 11, 2014
This is version 72 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programPlant Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names