A4XIF6 (A4XIF6_CALS8) Unreviewed, UniProtKB/TrEMBL

Last modified January 25, 2012. Version 32.

Names and origin

Protein names
Gene names
Ordered Locus Names: Csac_1077
Organism: Caldicellulosiruptor saccharolyticus (strain ATCC 43494 / DSM 8903) [Complete proteome] [HAMAP]
Taxonomic identifier: 351627 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Clostridia > Thermoanaerobacterales > Thermoanaerobacterales Family III. Incertae Sedis > Caldicellulosiruptor

Protein attributes

Sequence length: 1303 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Predicted

Sequence annotation (Features)

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Signal peptide | 1 – 39 | 39 | Potential (EMBL ABP66691.1) |
Chain | 40 – 1303 | 1264 | Potential (EMBL ABP66691.1) | PRO_5000239797
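
The positions and lengths in the feature table follow the usual 1-based, inclusive convention, so a feature's length is end − start + 1. The sketch below is only an illustration of that arithmetic using the two features listed in this entry; the `Feature` class and the `covers_whole_sequence` helper are hypothetical names introduced here and are not part of any UniProt tooling.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    """A sequence feature with 1-based, inclusive coordinates (hypothetical helper)."""
    key: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: positions 1..39 span 39 residues.
        return self.end - self.start + 1


# Values copied from the feature table of this entry.
SEQUENCE_LENGTH = 1303
features = [
    Feature("Signal peptide", 1, 39),
    Feature("Chain", 40, 1303),
]


def covers_whole_sequence(feats: List[Feature], total: int) -> bool:
    """Return True if the features tile positions 1..total with no gap or overlap."""
    expected_start = 1
    for feat in sorted(feats, key=lambda f: f.start):
        if feat.start != expected_start:
            return False
        expected_start = feat.end + 1
    return expected_start == total + 1


for feat in features:
    print(feat.key, feat.start, feat.end, feat.length)
print(covers_whole_sequence(features, SEQUENCE_LENGTH))
```

Under these assumptions the signal peptide and the chain together account for every residue of the 1303 AA precursor.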

Sequences

Sequence: A4XIF6 [UniParc]
Length: 1,303 AA
Mass: 143,667 Da
Last modified May 29, 2007. Version 1.
Checksum: 78152BD5B92053C0

        10         20         30         40         50         60 
MRLKTKIRKK WLSVLCTVVF LLNILFIANV TILPKVGAAT SNDGVVKIDT STLIGTNHAH 

        70         80         90        100        110        120 
CWYRDRLDTA LRGIRSWGMN SVRVVLSNGY RWTKIPASEV ANIISLSRSL GFKAIILEVH 

       130        140        150        160        170        180 
DTTGYGEDGA ACSLAQAVEY WKEIKSVLDG NEDFVIINIG NEPYGNNNYQ NWVNDTKNAI 

       190        200        210        220        230        240 
KALRDAGFKH TIMVDAPNWG QDWSNTMRDN AQSIMEADPL RNLVFSIHMY GVYNTASKVE 

       250        260        270        280        290        300 
EYIKSFVDKG LPLVIGEFGH QHTDGDPDEE AIVRYAKQYK IGLFSWSWCG NSSYVGYLDM 

       310        320        330        340        350        360 
VNNWDPNNPT PWGQWYKTNA IGTSSTPTPT STVTPTPTPT PTPTPTVTAT PTPTPTPVST 

       370        380        390        400        410        420 
PATSGQIKVL YANKETNSTT NTIRPWLKVV NSGSSSIDLS RVTIRYWYTV DGERAQSAIS 

       430        440        450        460        470        480 
DWAQIGASNV TFKFVKLSSS VSGADYYLEI GFKSGAGQLQ PGKDTGEIQM RFNKDDWSNY 

       490        500        510        520        530        540 
NQGNDWSWIQ SMTSYGENEK VTAYIDGVLV WGQEPSGATP APAPTATPTP TPTVTPTPTV 

       550        560        570        580        590        600 
TPTPTVTATP TPTPTPTPTP VSTPATGGQI KVLYANKETN STTNTIRPWL KVVNSGSSSI 

       610        620        630        640        650        660 
DLSRVTIRYW YTVDGERAQS AISDWAQIGA SNVTFKFVKL SSSVSGADYY LEIGFKSGAG 

       670        680        690        700        710        720 
QLQPGKDTGE IQIRFNKSDW SNYNQGNDWS WIQSMTSYGE NEKVTAYIDG VLVWGQEPSG 

       730        740        750        760        770        780 
TTPAPTSTPT VTVTPTPTPT PTPTPTPTVT PTPTVTPTPT VTATPTPTPT PIPTVTPLPT 

       790        800        810        820        830        840 
ISPSPSVVEI TINTNAGRTQ ISPYIYGANQ DIEGVVHSAR RLGGNRLTGY NWENNFSNAG 

       850        860        870        880        890        900 
NDWYHSSDDY LCWSMGISGE DAKVPAAVVS KFHEYSLKNN AYSAVTLQMA GYVSKDNYGT 

       910        920        930        940        950        960 
VSENETAPSN RWAEVKFKKD APLSLNPDLN DNFVYMDEFI NYLINKYGMA SSPTGIKGYI 

       970        980        990       1000       1010       1020 
LDNEPDLWAS THPRIHPNKV TCKELIEKSV ELAKVIKTLD PSAEVFGYAS YGFMGYYSLQ 

      1030       1040       1050       1060       1070       1080 
DAPDWNQVKG EHRWFISWYL EQMKKASDSF GKRLLDVLDL HWYPEARGGN IRVCFDGEND 

      1090       1100       1110       1120       1130       1140 
TSKEVVIARM QAPRTLWDPT YKTSVKGQIT AGENSWINQW FSDYLPIIPN VKADIEKYYP 

      1150       1160       1170       1180       1190       1200 
GTKLAISEFD YGGRNHISGG IALADVLGIF GKYGVNFAAR WGDSGSYAAA AYNIYLNYDG 

      1210       1220       1230       1240       1250       1260 
KGSKYGNTNV SANTSDVENM PVYASINGQD DSELHIILIN RNYDQKLQVK INITSTTKYT 

      1270       1280       1290       1300 
KAEIYGFDSN SPDIRKMGNI DNIESNVFTL EVPNLTVYHI VLR 
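
The sequence above is shown in a display layout: ruler lines of position numbers alternate with rows of residues in space-separated groups of ten. A minimal sketch of turning that layout back into a plain residue string follows. The function name `parse_numbered_block` is hypothetical, and only the first row of the sequence is embedded to keep the example short.

```python
def parse_numbered_block(text: str) -> str:
    """Strip ruler lines and spacing from a numbered sequence display.

    Lines made up only of integers (the position ruler) and blank lines
    are skipped; the remaining lines are joined with whitespace removed.
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First ruler line and first residue row, copied from the entry.
first_rows = """\
        10         20         30         40         50         60
MRLKTKIRKK WLSVLCTVVF LLNILFIANV TILPKVGAAT SNDGVVKIDT STLIGTNHAH
"""

seq = parse_numbered_block(first_rows)

# Feature positions are 1-based and inclusive, so the annotated
# signal peptide (positions 1 to 39) is the Python slice seq[0:39].
signal_peptide = seq[0:39]

print(len(seq))
print(signal_peptide)
```

Applied to the full block, the same function should yield a string whose length can be compared against the 1,303 residues stated for this entry.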

References

[1] "Genome sequence of the thermophilic hydrogen-producing bacterium Caldicellulosiruptor saccharolyticus DSM 8903."
Copeland A., Lucas S., Lapidus A., Barry K., Detter J.C., Glavina del Rio T., Hammon N., Israni S., Dalin E., Tice H., Pitluck S., Kiss H., Brettin T., Bruce D., Han C., Schmutz J., Larimer F., Land M., Hauser L., Kyrpides N., Lykidis A., van de Werken H.J.G., Verhaart M.R.A., VanFossen A.L., Lewis D.L., Nichols J.D., Goorissen H.P., van Niel E.W.J., Stams F.J.M., Willquist K.U., Ward D.E., van der Oost J., Kelly R.M., Kengen S.M.W., Richardson P.
Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases.
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 43494 / DSM 8903.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: CP000679 Genomic DNA. Translation: ABP66691.1.
RefSeq: YP_001179882.1. NC_009437.1.

3D structure databases

ProteinModelPortal: A4XIF6.
ModBase: Search...

Protein-protein interaction databases

STRING: A4XIF6.

Protein family/group databases

CAZy: CBM3. Carbohydrate-Binding Module Family 3.
    GH44. Glycoside Hydrolase Family 44.
    GH5. Glycoside Hydrolase Family 5.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

GeneID: 5088364.
GenomeReviews: Gene locus Csac_1077 in contig CP000679_GR.
KEGG: csc:Csac_1077.
PATRIC: 21251794. VBICalSac56748_1212.

Organism-specific databases

CMR: Search...

Phylogenomic databases

eggNOG: COG2730.
HOGENOM: HBG312533.
OMA: GENDTSK.
ProtClustDB: CLSK2472936.

Family and domain databases

InterPro: IPR008965. Carb-bd_dom.
    IPR001956. CBD_3.
    IPR001547. Glyco_hydro_5.
    IPR018087. Glyco_hydro_5_CS.
    IPR013781. Glyco_hydro_subgr_catalytic.
    IPR017853. Glycoside_hydrolase_SF.
Gene3D: G3DSA:2.60.40.710. CBD_3. 2 hits.
    G3DSA:3.20.20.80. Glyco_hydro_cat. 1 hit.
Pfam: PF00942. CBM_3. 2 hits.
    PF00150. Cellulase. 1 hit.
SMART: SM01067. CBM_3. 2 hits.
SUPFAM: SSF49384. Cellul_bind. 2 hits.
    SSF51445. Glyco_hydro_cat. 2 hits.
PROSITE: PS51172. CBM3. 2 hits.
    PS00659. GLYCOSYL_HYDROL_F5. 1 hit.
ProtoNet: Search...

Entry information

Entry name: A4XIF6_CALS8
Accession: Primary (citable) accession number: A4XIF6
Entry history:
    Integrated into UniProtKB/TrEMBL: May 29, 2007
    Last sequence update: May 29, 2007
    Last modified: January 25, 2012
This is version 32 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)