Q49689 (Y593_MYCLE) Reviewed, UniProtKB/Swiss-Prot

Last modified May 29, 2013. Version 91.

Names and origin

Protein names
Recommended name: UPF0051 protein ML0593

Cleaved into the following chain:

  1. Mle pps1 intein
Gene names
Ordered Locus Names: ML0593
ORF Names: B1496_C2_189, MLCL536.28c
Organism: Mycobacterium leprae (strain TN) [Complete proteome] [HAMAP]
Taxonomic identifier: 272631 [NCBI]
Taxonomic lineage: Bacteria > Actinobacteria > Actinobacteridae > Actinomycetales > Corynebacterineae > Mycobacteriaceae > Mycobacterium

Protein attributes

Sequence length: 869 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Inferred from homology

General annotation (Comments)

Post-translational modification

This protein undergoes protein self-splicing: the intervening region (intein) is excised post-translationally and the flanking regions are then joined by peptide ligation (Potential).

Sequence similarities

Belongs to the UPF0051 (ycf24) family.

Contains 1 DOD-type homing endonuclease domain.

Sequence annotation (Features)

Feature key        Position(s)  Length  Description                                   Feature identifier

Molecule processing

Chain              1 – 201      201     UPF0051 protein ML0593, 1st part (Potential)  PRO_0000036187
Chain              202 – 587    386     Mle pps1 intein (Potential)                   PRO_0000036188
Chain              588 – 869    282     UPF0051 protein ML0593, 2nd part (Potential)  PRO_0000036189
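
The chain coordinates above can be turned into a small worked example of what intein excision does to the precursor. This is a minimal sketch, not a model of the splicing chemistry: the positions are copied from this entry (where they are annotated as Potential), while `feature_length` and `excise_intein` are hypothetical helper names and the placeholder precursor is a stand-in for the real sequence.

```python
from typing import List, Tuple

# (description, start, end) with 1-based inclusive positions, copied from
# the "Molecule processing" rows of this entry.
CHAINS: List[Tuple[str, int, int]] = [
    ("UPF0051 protein ML0593, 1st part", 1, 201),
    ("Mle pps1 intein", 202, 587),
    ("UPF0051 protein ML0593, 2nd part", 588, 869),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def excise_intein(precursor: str) -> Tuple[str, str]:
    """Return (spliced_protein, intein) for a precursor sequence.

    Hypothetical helper: it only slices the string at the annotated
    boundaries and joins the two flanking parts.
    """
    (_, n_start, n_end), (_, i_start, i_end), (_, c_start, c_end) = CHAINS
    n_part = precursor[n_start - 1:n_end]   # 1-based inclusive -> Python slice
    intein = precursor[i_start - 1:i_end]
    c_part = precursor[c_start - 1:c_end]
    return n_part + c_part, intein


if __name__ == "__main__":
    for name, start, end in CHAINS:
        print(f"{name}: {feature_length(start, end)} AA")
    # Stand-in precursor of the annotated length; substitute the real sequence.
    placeholder = "N" * 201 + "I" * 386 + "C" * 282
    spliced, intein = excise_intein(placeholder)
    print(len(spliced), len(intein))
```

Under these coordinates the two flanking parts together account for 201 + 282 = 483 of the 869 residues, and the intein for the remaining 386.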

Regions

Domain             344 – 477    134     DOD-type homing endonuclease

Experimental info

Sequence conflict  482          1       A → R in AAA17127. Ref.1

Sequences

Q49689 [UniParc].

Last modified April 27, 2001. Version 2.
Checksum: DB04CF70CB50765A
Length: 869 AA. Mass: 95,573 Da.

        10         20         30         40         50         60 
MTRTSETTKS PAPELLTQQQ AIDSLGKYGY GWADSDVAGA SARRGLSEDV VRDISAKKDE 

        70         80         90        100        110        120 
PEWMLQARLK ALRVFERKPM PRWGSNLDGI DFDNIKYFVR STEKQAASWD ELPEDIRNTY 

       130        140        150        160        170        180 
DRLGIPDAEK QRLVAGVAAQ YESEVVYHQI RADLKDQGVV FLDTETGLRE YPDIFKQYLG 

       190        200        210        220        230        240 
TVIPAGDNKF SALNTAVWSG GCLTADARIN VKGKGLVSIA DVQPGDEVFG VNIGCELERG 

       250        260        270        280        290        300 
KVLAKVASGT KPVYEMHVAG RALEATGNHQ FLVARRVEEG KRTRWTAVWA PLEEIESGEP 

       310        320        330        340        350        360 
IAVARVLPDD SGTIFFSESE LDIKNRTRQC LYFPCQNSVD LLWLLGLWLG DGHTAAPHKH 

       370        380        390        400        410        420 
MRQVAFSVPA GDPVHHTAIR VVSEQFGANV TVVNCGFIVS SKAFETWLAE LGFSGDEKTK 

       430        440        450        460        470        480 
RLPAWIYSLP HEHQLALIGG LVDADGWTES SGATMSIAFA SRELLEDVRQ LAIGCGLYPD 

       490        500        510        520        530        540 
GALVERTRSA TCRDGRIVTS TSWRLRIQGS LDRVGTRTPG KRGKPVSNKG RRQRYVAAAG 

       550        560        570        580        590        600 
LNFSSLSTDT VGFARLKSKT LVGEKPTYDI QVVGLENFVA NGIVAHNSFI YVPPGVHVDI 

       610        620        630        640        650        660 
PLQAYFRINT ENMGQFERTL IIADTGSYVH YVEGCTAPIY KSDSLHSAVV EIIVKPHARV 

       670        680        690        700        710        720 
RYTTIQNWSN NVYNLVTKRA RVETGATMEW IDGNIGSKVT MKYPAVWMTG EHAKGEVLSV 

       730        740        750        760        770        780 
AFAGEGQHQD TGAKMLHLAS NTSSNIVSKS VARGGGRTSY RGLVQVNKGA HGSRSSVKCD 

       790        800        810        820        830        840 
ALLVDTISRS DTYPYVDIRE DDVTMGHEAT VSKVSENQLF YLMSRGLAED EAMAMVVRGF 

       850        860 
VEPIAKELPM EYALELNRLI ELQMEGAVG 
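
The display above interleaves position rulers with residues in blocks of ten, so it has to be flattened before it can be used with most tools. The following is a minimal sketch of one way to do that; `parse_numbered_sequence` is a hypothetical helper name, and the example input is only the first 120 residues copied from this entry.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Strip ruler lines and spacing from a numbered sequence display."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Ruler lines contain only digits and whitespace; skip them and blanks.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        # Remove the spaces that separate the blocks of ten residues.
        residues.append(re.sub(r"\s+", "", stripped))
    return "".join(residues)


# First two rows of the display (positions 1-120), copied from this entry.
EXAMPLE = """\
        10         20         30         40         50         60
MTRTSETTKS PAPELLTQQQ AIDSLGKYGY GWADSDVAGA SARRGLSEDV VRDISAKKDE

        70         80         90        100        110        120
PEWMLQARLK ALRVFERKPM PRWGSNLDGI DFDNIKYFVR STEKQAASWD ELPEDIRNTY
"""

if __name__ == "__main__":
    sequence = parse_numbered_sequence(EXAMPLE)
    print(len(sequence))
    print(sequence[:10])
```

Applied to the complete display, the result should contain 869 residues if the text is intact, which gives a quick check against the stated sequence length.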

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  U00013 Genomic DNA. Translation: AAA17127.1.
  Z99125 Genomic DNA. Translation: CAB16171.1.
  Z99125 Genomic DNA. Translation: CAB16172.1.
  AL583919 Genomic DNA. Translation: CAC30101.1.
PIR: A86983. S72760.
RefSeq: NP_301502.1. NC_002677.1.

3D structure databases

ProteinModelPortal: Q49689.

Protein-protein interaction databases

STRING: 272631.ML0593.

Genome annotation databases

EnsemblBacteria: CAC30101; CAC30101; CAC30101.
GeneID: 909370.
KEGG: mle:ML0593.
PATRIC: 18052168. VBIMycLep78757_1031.

Organism-specific databases

Leproma: ML0593.

Phylogenomic databases

eggNOG: COG1372.
KO: K09014.
ProtClustDB: CLSK2460462.

Family and domain databases

Gene3D: 3.10.28.10. 1 hit.
InterPro:
  IPR003586. Hint_dom_C.
  IPR003587. Hint_dom_N.
  IPR027434. Homing_endonucl.
  IPR006142. INTEIN.
  IPR004042. Intein_endonuc.
  IPR006141. Intein_splice_site.
  IPR010231. SUF_FeS_clus_asmbl_SufB.
  IPR000825. SUF_FeS_clus_asmbl_SufBD.
Pfam: PF01458. UPF0051. 1 hit.
PRINTS: PR00379. INTEIN.
SMART:
  SM00305. HintC. 1 hit.
  SM00306. HintN. 1 hit.
SUPFAM: SSF55608. SSF55608. 1 hit.
TIGRFAMs:
  TIGR01443. intein_Cterm. 1 hit.
  TIGR01980. sufB. 1 hit.
PROSITE:
  PS50818. INTEIN_C_TER. 1 hit.
  PS50819. INTEIN_ENDONUCLEASE. 1 hit.
  PS50817. INTEIN_N_TER. 1 hit.

Entry information

Entry name: Y593_MYCLE
Accession
Primary (citable) accession number: Q49689
Secondary accession number(s): O33141
Entry history
Integrated into UniProtKB/Swiss-Prot: July 15, 1998
Last sequence update: April 27, 2001
Last modified: May 29, 2013
This is version 91 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Relevant documents

Intein-containing proteins

List of intein-containing protein entries

Uncharacterized protein families (UPF)

List of uncharacterized protein family (UPF) entries

SIMILARITY comments

Index of protein domains and families