Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

A0QRL4 (A0QRL4_MYCS2) Unreviewed, UniProtKB/TrEMBL

Last modified July 9, 2014. Version 61. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names

Aldehyde dehydrogenase EMBL AFP37604.1
EC=1.2.1.- EMBL AFP37604.1
Gene names
Ordered Locus Names:MSMEG_1158 EMBL ABK71275.1, MSMEI_1126 EMBL AFP37604.1
OrganismMycobacterium smegmatis (strain ATCC 700084 / mc(2)155) [Reference proteome] [HAMAP] EMBL ABK71275.1
Taxonomic identifier246196 [NCBI]
Taxonomic lineageBacteriaActinobacteriaActinobacteridaeActinomycetalesCorynebacterineaeMycobacteriaceaeMycobacterium

Protein attributes

Sequence length475 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Sequence similarities

Belongs to the aldehyde dehydrogenase family. RuleBase RU003345

Sequences

Sequence LengthMass (Da)Tools
A0QRL4 [UniParc].

Last modified January 9, 2007. Version 1.
Checksum: EB02CBB8BDEAB5A1

FASTA47549,696
        10         20         30         40         50         60 
MTNAQNLIGG AWVGEPIIER RNPANPDDVV AIAPSASAAD VHDAVAAATD AQPGWAALTA 

        70         80         90        100        110        120 
VQRGAILMDA ADLLRRRHEE IATDLTREEG KTRAEAMGEV RRAIDVLRFF GSAGWRPSGE 

       130        140        150        160        170        180 
TLPSTMPNTS VHTRREPLGV VGLVTPWNFP IAIPAWKMAP ALVSGNAVVI KPAELTPLSI 

       190        200        210        220        230        240 
NHLATALIDA GLPAGVLNVV HGSGSLAGDA LVRHQEVAAV SFTGSTGVGM AIRDVVNARN 

       250        260        270        280        290        300 
ARVQLEMGGK NAYLVLDDAD VEAAAATVAA GAFSLTGQAC TATSRVYVTP GVREVFVKAL 

       310        320        330        340        350        360 
REKAGAVKSG NGLDPGTTMG PVVSDAQLAK DVTAIHAAVE AGFDAGEVTE PKGQFLAPVV 

       370        380        390        400        410        420 
FSGVPHDHPL VTREVFGPVV GVIDVADYQE GLNLVNDSPY GLTAGICTRD LGKAYDFAAR 

       430        440        450        460        470 
VRVGVVKINR PTTGLDLNVP FGGVRDSSTN TFREQGERAV DFYTWTKSVY IGHDL 

« Hide

References

« Hide 'large scale' references
[1]"ICDS database: interrupted CoDing sequences in prokaryotic genomes."
Perrodou E., Deshayes C., Muller J., Schaeffer C., Van Dorsselaer A., Ripp R., Poch O., Reyrat J.M., Lecompte O.
Nucleic Acids Res. 34:D338-D343(2006) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: MC2 155 EMBL AFP37604.1.
[2]Fleischmann R.D., Dodson R.J., Haft D.H., Merkel J.S., Nelson W.C., Fraser C.M.
Submitted (OCT-2006) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 700084 / mc(2)155 and MC2 155 EMBL ABK71275.1.
[3]"Interrupted coding sequences in Mycobacterium smegmatis: authentic mutations or sequencing errors?"
Deshayes C., Perrodou E., Gallien S., Euphrasie D., Schaeffer C., Van-Dorsselaer A., Poch O., Lecompte O., Reyrat J.M.
Genome Biol. 8:R20.1-R20.9(2007) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 700084 / mc(2)155 and MC2 155 EMBL AFP37604.1.
[4]"Ortho-proteogenomics: multiple proteomes investigation through orthology and a new MS-based protocol."
Gallien S., Perrodou E., Carapito C., Deshayes C., Reyrat J.M., Van Dorsselaer A., Poch O., Schaeffer C., Lecompte O.
Genome Res. 19:128-135(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 700084 / mc(2)155 and MC2 155.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP000480 Genomic DNA. Translation: ABK71275.1.
CP001663 Genomic DNA. Translation: AFP37604.1.
RefSeqYP_006565899.1. NC_018289.1.
YP_885552.1. NC_008596.1.

3D structure databases

ModBaseSearch...
MobiDBSearch...

Protein-protein interaction databases

STRING246196.MSMEG_1158.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaABK71275; ABK71275; MSMEG_1158.
AFP37604; AFP37604; MSMEI_1126.
GeneID4537648.
KEGGmsg:MSMEI_1126.
msm:MSMEG_1158.
PATRIC18074781. VBIMycSme59918_1151.

Phylogenomic databases

eggNOGCOG1012.
HOGENOMHOG000271511.
KOK00128.
OMAAVDFYTW.
OrthoDBEOG6BS8QW.

Enzyme and pathway databases

BioCycMSME246196:GJ4Y-1158-MONOMER.

Family and domain databases

Gene3D3.40.309.10. 1 hit.
3.40.605.10. 1 hit.
InterProIPR016161. Ald_DH/histidinol_DH.
IPR016163. Ald_DH_C.
IPR016160. Ald_DH_CS_CYS.
IPR029510. Ald_DH_CS_GLU.
IPR016162. Ald_DH_N.
IPR015590. Aldehyde_DH_dom.
[Graphical view]
PfamPF00171. Aldedh. 1 hit.
[Graphical view]
SUPFAMSSF53720. SSF53720. 1 hit.
PROSITEPS00070. ALDEHYDE_DEHYDR_CYS. 1 hit.
PS00687. ALDEHYDE_DEHYDR_GLU. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameA0QRL4_MYCS2
AccessionPrimary (citable) accession number: A0QRL4
Entry history
Integrated into UniProtKB/TrEMBL: January 9, 2007
Last sequence update: January 9, 2007
Last modified: July 9, 2014
This is version 61 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)