Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

E4SGZ0 (E4SGZ0_CALK2) Unreviewed, UniProtKB/TrEMBL

Last modified April 16, 2014. Version 20. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry infoCustomize order

Protein attributes

Sequence length482 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existencePredicted

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 3434 Potential EMBL ADQ47015.1
Chain35 – 482448 Potential EMBL ADQ47015.1
PRO_5000660175

Sequences

Sequence LengthMass (Da)Tools
E4SGZ0 [UniParc].

Last modified February 8, 2011. Version 1.
Checksum: 07DFDD19C5392E72

FASTA48253,810
        10         20         30         40         50         60 
MNGLFKKLSK KLTIISLIVI IVFLLTSSLS IYAGVMMQGF YWDVPAGGTW WNTLASKAYE 

        70         80         90        100        110        120 
LKYMVGGSYG INRIWFPPAY KGQGGAYSMG YDPHDYYDLG QYYQDGTTET RFGSQSELKN 

       130        140        150        160        170        180 
AISKYKSYGI SVTEDIVLNH RSGGKSEYNP KTGTNTWTDF TNTASGMCQW HWDAFHPNNY 

       190        200        210        220        230        240 
CSGDEGTFAG FPDVCYTSGP AYNDMKAWMN WLKSSTNAGF DSWRYDYVKG YGYWVVKDFN 

       250        260        270        280        290        300 
AATSPTFSVG EYWDANTSTL DWWANSSGSS VFDFALYYTL RDICNNTSGS GYLPNVFDYS 

       310        320        330        340        350        360 
KSYAAKNPFK AVTFVANHDT DEIVNDKMMA YAFILTYQGY PCIFWKDYYD YGLATGGGAS 

       370        380        390        400        410        420 
PGGWGNGIKQ LVWCREKLAA GAPNIEILKS NDGDIIIYGS KGYSTSSPGY IVVINDHPSQ 

       430        440        450        460        470        480 
WKGAWVQTSN SYLKGKTLKA YAWSSTVSGQ NVQPQNKYCD ANGWVEVWAP PRGYAVYSVD 


GL 

« Hide

References

« Hide 'large scale' references
[1]"Complete sequence of Caldicellulosiruptor kronotskyensis 2002."
US DOE Joint Genome Institute
Lucas S., Copeland A., Lapidus A., Cheng J.-F., Bruce D., Goodwin L., Pitluck S., Davenport K., Detter J.C., Han C., Tapia R., Land M., Hauser L., Jeffries C., Kyrpides N., Ivanova N., Mikhailova N., Blumer-Schuette S.E., Kelly R.M., Woyke T.
Submitted (NOV-2010) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
Strain: 2002.
[2]"Complete genome sequences for the anaerobic, extremely thermophilic plant biomass-degrading bacteria Caldicellulosiruptor hydrothermalis, Caldicellulosiruptor kristjanssonii, Caldicellulosiruptor kronotskyensis, Caldicellulosiruptor owensensis, and Caldicellulosiruptor lactoaceticus."
Blumer-Schuette S.E., Ozdemir I., Mistry D., Lucas S., Lapidus A., Cheng J.F., Goodwin L.A., Pitluck S., Land M.L., Hauser L.J., Woyke T., Mikhailova N., Pati A., Kyrpides N.C., Ivanova N., Detter J.C., Walston-Davenport K., Han S., Adams M.W., Kelly R.M.
J. Bacteriol. 193:1483-1484(2011) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: DSM 18902 / VKM B-2412 / 2002.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP002330 Genomic DNA. Translation: ADQ47015.1.
RefSeqYP_004024834.1. NC_014720.1.

3D structure databases

ModBaseSearch...
MobiDBSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaADQ47015; ADQ47015; Calkro_2177.
GeneID9982156.
KEGGckn:Calkro_2177.
PATRIC42811421. VBICalKro6863_2292.

Organism-specific databases

CMRSearch...

Phylogenomic databases

HOGENOMHOG000031038.
KOK01176.

Enzyme and pathway databases

BioCycCKRO632348:GI5C-2229-MONOMER.

Family and domain databases

Gene3D2.60.40.1180. 1 hit.
3.20.20.80. 1 hit.
InterProIPR015237. Alpha-amylase_C_pro.
IPR015902. Glyco_hydro_13.
IPR013780. Glyco_hydro_13_b.
IPR006047. Glyco_hydro_13_cat_dom.
IPR013781. Glyco_hydro_catalytic_dom.
IPR017853. Glycoside_hydrolase_SF.
[Graphical view]
PANTHERPTHR10357. PTHR10357. 1 hit.
PfamPF00128. Alpha-amylase. 1 hit.
PF09154. DUF1939. 1 hit.
[Graphical view]
SUPFAMSSF51445. SSF51445. 1 hit.
ProtoNetSearch...

Entry information

Entry nameE4SGZ0_CALK2
AccessionPrimary (citable) accession number: E4SGZ0
Entry history
Integrated into UniProtKB/TrEMBL: February 8, 2011
Last sequence update: February 8, 2011
Last modified: April 16, 2014
This is version 20 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)