Skip Header

Contribute Send feedback
Read comments (?) or add your own

D9SX09 (D9SX09_CLOC7) Unreviewed, UniProtKB/TrEMBL

Last modified May 1, 2013. Version 14. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
Gene names
Ordered Locus Names:Clocel_1624
OrganismClostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) [Complete proteome] [HAMAP] EMBL ADL51370.1
Taxonomic identifier573061 [NCBI]
Taxonomic lineageBacteriaFirmicutesClostridiaClostridialesClostridiaceaeClostridium

Protein attributes

Sequence length826 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existencePredicted

Ontologies

Keywords
   DomainSignal EMBL ADL51370.1
   Molecular functionHydrolase EMBL ADL51370.1
   Technical termComplete proteome
Gene Ontology (GO)
   Biological_processpolysaccharide catabolic process

Inferred from electronic annotation. Source: InterPro

   Molecular_functioncellulase activity

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 2929 Potential EMBL ADL51370.1
Chain30 – 826797 Potential EMBL ADL51370.1
PRO_5000619674

Sequences

Sequence LengthMass (Da)Tools
D9SX09 [UniParc].

Last modified October 5, 2010. Version 1.
Checksum: 51CB12AE1E8ED7A7

FASTA82690,656
        10         20         30         40         50         60 
MKNKKIMGLV LAASLTASVF TPILSVNADT TVSRKLMDLE VFKSASITGW SGSAGGELEV 

        70         80         90        100        110        120 
ASDSNLPIDT SATYNGLPSL RLNVTKASAQ WWSSLLTLRG WCTQDLTQYL ANGYLEFNVK 

       130        140        150        160        170        180 
GKVGGEDFQI GLQDQTHERA AGDSVTSVKS IKNYVNISTN WQHVKIPLKD IMGPSTGFDP 

       190        200        210        220        230        240 
TTARCINIVK GSSEIFTAWI NDLKITSTDN EKSYAPIKVN QDGYLPSSEK YALVSGFSDE 

       250        260        270        280        290        300 
LNANAGSQFQ VKDATTNAVV YSGTLTLASS FDSDSGEKIL KGDFSSVTTP GTYYISVPDA 

       310        320        330        340        350        360 
GNSNSVKFKI ASDVYKNLLF DSQRYFFYQR QGIELKAPYV TDYPRTDETP NDAIAQFESG 

       370        380        390        400        410        420 
TQPAREITKG WYDAGDKGKY INNGALAVSN MFWAYEMFPE TLKDNQFNIP ESGNGVVDIL 

       430        440        450        460        470        480 
DEARWEVEWI LKMQDSVSGG FYARVQSKDG KDGDSSAPRI IKDGSTNIKS TDDTACAAAI 

       490        500        510        520        530        540 
LAHSYIMFKN IDPTFANKCL EAAKSAWSYL EKNPTNIVSP SGPYNVYNDS SDRLWAAASL 

       550        560        570        580        590        600 
LRATNEDKYN TYFLNNYSKF STYFRDANGY GHNWGDMWTT AFWCYLKADK KDSNAVSWIK 

       610        620        630        640        650        660 
TEFSTWLDNK ISRTSTNPWQ ISNPTGSFFW GINSNILLTW EDAIIGSKLL GTYSDTIAKQ 

       670        680        690        700        710        720 
TQASLNWILG VNPLRKSFVT GHGEDSTKKI YHVTYSADGK AGVPNGYLAG GINASEGKTL 

       730        740        750        760        770        780 
SNFPGKCYID SDGDWVTNEN CLNWNASLVF ISTFVNSSTP TSYKLGDLNN DGKINAIDMA 

       790        800        810        820 
LMKKGILGGF TDSATQLAGD VNKDGKTNAI DLALIKKYIL GQINSF 

« Hide

References

[1]"Complete sequence of Clostridium cellulovorans 743B."
US DOE Joint Genome Institute
Lucas S., Copeland A., Lapidus A., Cheng J.-F., Bruce D., Goodwin L., Pitluck S., Chertkov O., Detter J.C., Han C., Tapia R., Land M., Hauser L., Chang Y.-J., Jeffries C., Kyrpides N., Ivanova N., Mikhailova N., Hemme C.L., Woyke T.
Submitted (AUG-2010) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 35296 / DSM 3052 / OCM 3 / 743B.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP002160 Genomic DNA. Translation: ADL51370.1.
RefSeqYP_003843134.1. NC_014393.1.

3D structure databases

ProteinModelPortalD9SX09.
ModBaseSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaADL51370; ADL51370; Clocel_1624.
GeneID9608491.
KEGGccb:Clocel_1624.
PATRIC41741069. VBICloCel81632203721_4226.

Organism-specific databases

CMRSearch...

Phylogenomic databases

HOGENOMHOG000088267.

Enzyme and pathway databases

BioCycCCEL573061:GIXD-1683-MONOMER.

Family and domain databases

Gene3D1.10.1330.10. 1 hit.
1.50.10.10. 1 hit.
2.60.40.10. 1 hit.
InterProIPR008928. 6-hairpin_glycosidase-like.
IPR012341. 6hp_glycosidase.
IPR016134. Cellulos_enz_dockerin_1.
IPR018242. Dockerin_1.
IPR008979. Galactose-bd-like.
IPR001701. Glyco_hydro_9.
IPR004197. Glyco_hydro_9_Ig-like.
IPR013783. Ig-like_fold.
IPR014756. Ig_E-set.
[Graphical view]
PfamPF02927. CelD_N. 1 hit.
PF00404. Dockerin_1. 2 hits.
PF00759. Glyco_hydro_9. 1 hit.
[Graphical view]
SUPFAMSSF63446. Cellulos_enz_dockerin_1. 1 hit.
SSF49785. Gal_bind_like. 1 hit.
SSF48208. Glyco_trans_6hp. 1 hit.
SSF81296. Ig_E-set. 1 hit.
ProtoNetSearch...

Entry information

Entry nameD9SX09_CLOC7
AccessionPrimary (citable) accession number: D9SX09
Entry history
Integrated into UniProtKB/TrEMBL: October 5, 2010
Last sequence update: October 5, 2010
Last modified: May 1, 2013
This is version 14 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)