Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Endoglucanase

Gene

cel1

Organism
Actinoplanes sp. N902-109
Status
Unreviewed-Annotation score: Annotation score: 2 out of 5-Protein inferred from homologyi

Functioni

Catalytic activityi

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.UniRule annotation

GO - Molecular functioni

GO - Biological processi

Keywordsi

Molecular functionGlycosidaseUniRule annotation, Hydrolase
Biological processCarbohydrate metabolism, Cellulose degradationUniRule annotation, Polysaccharide degradation

Names & Taxonomyi

Protein namesi
Recommended name:
EndoglucanaseUniRule annotation (EC:3.2.1.4UniRule annotation)
Gene namesi
Name:cel1Imported
ORF Names:L083_3175Imported
OrganismiActinoplanes sp. N902-109Imported
Taxonomic identifieri649831 [NCBI]
Taxonomic lineageiBacteriaActinobacteriaMicromonosporalesMicromonosporaceaeActinoplanes
Proteomesi
  • UP000013541 Componenti: Chromosome

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 33UniRule annotationAdd BLAST33
ChainiPRO_500514539834 – 1016EndoglucanaseUniRule annotationAdd BLAST983

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini903 – 1016CBM2InterPro annotationAdd BLAST114

Sequence similaritiesi

Belongs to the glycosyl hydrolase 9 (cellulase E) family.UniRule annotation

Keywords - Domaini

SignalUniRule annotation

Phylogenomic databases

KOiK01179.
OrthoDBiPOG091H04TS.

Family and domain databases

CDDicd02850. E_set_Cellulase_N. 1 hit.
Gene3Di2.60.120.260. 2 hits.
2.60.40.10. 1 hit.
InterProiView protein in InterPro
IPR008928. 6-hairpin_glycosidase-like.
IPR008965. Carb-bd_dom_sf.
IPR001919. CBD2.
IPR004197. Cellulase_Ig-like.
IPR003305. CenC_carb-bd.
IPR008979. Galactose-bd-like_sf.
IPR001701. Glyco_hydro_9.
IPR033126. Glyco_hydro_9_Asp/Glu_AS.
IPR013783. Ig-like_fold.
IPR014756. Ig_E-set.
IPR006311. TAT_signal.
PfamiView protein in Pfam
PF00553. CBM_2. 1 hit.
PF02018. CBM_4_9. 2 hits.
PF02927. CelD_N. 1 hit.
PF00759. Glyco_hydro_9. 1 hit.
SMARTiView protein in SMART
SM00637. CBD_II. 1 hit.
SUPFAMiSSF48208. SSF48208. 1 hit.
SSF49384. SSF49384. 1 hit.
SSF49785. SSF49785. 2 hits.
SSF81296. SSF81296. 1 hit.
PROSITEiView protein in PROSITE
PS51173. CBM2. 1 hit.
PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.
PS51318. TAT. 1 hit.

Sequencei

Sequence statusi: Complete.

R4LN52-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MTPHLRRRVT MLTAAATAAM LALTSLTANP ASAEPAQLIA NGDFATGTTG
60 70 80 90 100
WWSTDNAPIS AAGGELCADV PGGTTNAWDV SLGYNDVPLV AGAAYRLSFR
110 120 130 140 150
AHADAPVTVR ANVQLNEDPY TAALSRSVAL TTAAQTFDSS FTSSLQSANG
160 170 180 190 200
TLTFQLGGSA QAFRFCLDDV SLTSDTAAPP AGAEQLENGD FSDGTAGWYS
210 220 230 240 250
YGTTATGVDD GQLCTTVPGG LANPWDAGIG QNNVTLQAGS AYTLSFDATA
260 270 280 290 300
SPGATVRAAV QLGADPYTSY LSRDVALTPA RQHLEYTFTA SEDTTAGQVA
310 320 330 340 350
FQVGGAAAEY RLCLDNVSLT GGEAEPPYVP DTGPRVRVNQ VGYLPAGPKN
360 370 380 390 400
ATLVTDATTA LDWQLKNAAG SVVRSGRSTP RGVDAASGQN VHTIDFTGYT
410 420 430 440 450
TAGTGYTLVA DGQTSHPFDI SGTVYERLRP DALQFFYIQR SGIAIDGGLV
460 470 480 490 500
GEQYARPAGH LGVAPNKGDT DVPCRANTCD YRLDVRGGWY DAGDQGKYVV
510 520 530 540 550
NGGIAVQQLM SSFERTKTAV TAAHGAGLAD STLRVPERGN KVPDILDEAR
560 570 580 590 600
WELEFLMRMQ VPAGQPLAGM AHHKMHDANW TGLPMQPQDD PEQRELQPPS
610 620 630 640 650
TAATLNLAAT TAQCARLFAP YDATFAAKCL TVARTAYAAA KANPAKLAQD
660 670 680 690 700
LGGGGGSYGD DDVSDEFYWA AAELYLTTGE AAFLTDVTAS RHHTGDVFAA
710 720 730 740 750
TGFGWASTAA LGRLDLATVP SALPAADRER VRQSVLTAAD GYLATVGAQA
760 770 780 790 800
YGLPMPGNAG AYFWGANSNI LNNVQVLATA FDMTGAAKYR DAAVQGVDYI
810 820 830 840 850
FGRNALNQSY VTGWGEKASE NQHTRIYAHE KDAALPHPPA GSLAGGANAG
860 870 880 890 900
LDDPYAKSLL TGCKPMFCYV DDIESYATNE LAINWNSALA WVASFLADQG
910 920 930 940 950
RGEPAAAVSC RATYTNYGDW ADKSGFTAQL AVTNTGTKAI DGWAVRFAFL
960 970 980 990 1000
GGQKVRDAWS AEATQSGATV TAKNLAANQR IQPGATVYFG FNATTPGGPN
1010
PAPELITLNG SACGRS
Length:1,016
Mass (Da):106,083
Last modified:July 24, 2013 - v1
Checksum:i21CF618AB27EE9C0
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CP005929 Genomic DNA. Translation: AGL16685.1.

Genome annotation databases

EnsemblBacteriaiAGL16685; AGL16685; L083_3175.
KEGGiactn:L083_3175.
PATRICifig|649831.3.peg.3127.

Similar proteinsi

Entry informationi

Entry nameiR4LN52_9ACTN
AccessioniPrimary (citable) accession number: R4LN52
Entry historyiIntegrated into UniProtKB/TrEMBL: July 24, 2013
Last sequence update: July 24, 2013
Last modified: November 22, 2017
This is version 25 of the entry and version 1 of the sequence. See complete history.
Entry statusiUnreviewed (UniProtKB/TrEMBL)

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteomeImported