Skip Header

Contribute Send feedback
Read comments (?) or add your own

E4QD38 (E4QD38_CALH1) Unreviewed, UniProtKB/TrEMBL

Last modified May 1, 2013. Version 14. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
Gene names
Ordered Locus Names:Calhy_1821
OrganismCaldicellulosiruptor hydrothermalis (strain DSM 18901 / VKM B-2411 / 108) [Complete proteome] [HAMAP] EMBL ADQ07532.1
Taxonomic identifier632292 [NCBI]
Taxonomic lineageBacteriaFirmicutesClostridiaThermoanaerobacteralesThermoanaerobacterales Family III. Incertae SedisCaldicellulosiruptor

Protein attributes

Sequence length433 AA.
Sequence statusComplete.
Protein existencePredicted

Ontologies

Keywords
   Technical termComplete proteome
Gene Ontology (GO)
   Biological_processDNA modification

Inferred from electronic annotation. Source: InterPro

   Molecular_functionDNA binding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequences

Sequence LengthMass (Da)Tools
E4QD38 [UniParc].

Last modified February 8, 2011. Version 1.
Checksum: 81C790D10A3DD740

FASTA43349,775
        10         20         30         40         50         60 
MDGMSRKNYK FKDSPLGRIP EEWEVVRLGD IAKIKTGNSN VQDAAETGDY LFFDRSGEIK 

        70         80         90        100        110        120 
RSNRYLFDKE AVIVPGEGTE FLPKYYCGKF DLHQRAYAIF DFSSVLSGEY LFYAMHKFNR 

       130        140        150        160        170        180 
ILANWAVGTT VKSLRLPMFE NLLLLLPPLP EQRKIAEILE TIDNAIEKTD AIIEKYKRIK 

       190        200        210        220        230        240 
QGLMQDLLTK GVVSEGEGES ERWRLRDENI DKFKDSPLGR IPEEWKICKL DHREITIMIT 

       250        260        270        280        290        300 
DGSHYSPQPV ENSEYYIVNI ENIINGKIEF ETCKKISPKD YKKLVSNKCN PKYGDVLFTK 

       310        320        330        340        350        360 
DGTVGITLVF SGERNVVLLS SIAIIRPSNC LDSYYLKYSL ETEQIKKQID ILIGGSVLKR 

       370        380        390        400        410        420 
IVLKDIKSLV IFIPPIPEQQ RIASILSQID EAIEKERAYK EKLERIKKGL MEDLLTGKVR 

       430 
VNHLIEEENK DGN 

« Hide

References

« Hide 'large scale' references
[1]"Complete sequence of Caldicellulosiruptor hydrothermalis 108."
US DOE Joint Genome Institute
Lucas S., Copeland A., Lapidus A., Cheng J.-F., Bruce D., Goodwin L., Pitluck S., Davenport K., Detter J.C., Han C., Tapia R., Land M., Hauser L., Chang Y.-J., Jeffries C., Kyrpides N., Ivanova N., Mikhailova N. expand/collapse author list , Blumer-Schuette S.E., Kelly R.M., Woyke T.
Submitted (SEP-2010) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
Strain: 108.
[2]"Complete genome sequences for the anaerobic, extremely thermophilic plant biomass-degrading bacteria Caldicellulosiruptor hydrothermalis, Caldicellulosiruptor kristjanssonii, Caldicellulosiruptor kronotskyensis, Caldicellulosiruptor owensensis, and Caldicellulosiruptor lactoaceticus."
Blumer-Schuette S.E., Ozdemir I., Mistry D., Lucas S., Lapidus A., Cheng J.F., Goodwin L.A., Pitluck S., Land M.L., Hauser L.J., Woyke T., Mikhailova N., Pati A., Kyrpides N.C., Ivanova N., Detter J.C., Walston-Davenport K., Han S., Adams M.W., Kelly R.M.
J. Bacteriol. 193:1483-1484(2011) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: DSM 18901 / VKM B-2411 / 108.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP002219 Genomic DNA. Translation: ADQ07532.1.
RefSeqYP_003992901.1. NC_014652.1.

3D structure databases

ModBaseSearch...

Protein family/group databases

REBASE28852. S.Chy108ORF1822P.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaADQ07532; ADQ07532; Calhy_1821.
GeneID9936375.
KEGGchd:Calhy_1821.
PATRIC42587293. VBICalHyd101559_1909.

Organism-specific databases

CMRSearch...

Phylogenomic databases

HOGENOMHOG000218216.
KOK01154.
OMAPLAPYVF.

Enzyme and pathway databases

BioCycCHYD632292:GHA8-1867-MONOMER.

Family and domain databases

InterProIPR000055. Restrct_endonuc_typeI_HsdS.
[Graphical view]
PfamPF01420. Methylase_S. 2 hits.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameE4QD38_CALH1
AccessionPrimary (citable) accession number: E4QD38
Entry history
Integrated into UniProtKB/TrEMBL: February 8, 2011
Last sequence update: February 8, 2011
Last modified: May 1, 2013
This is version 14 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)