Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Unreviewed, UniProtKB/TrEMBL Q8XMJ5 (Q8XMJ5_CLOPE)

Last modified May 5, 2009. Version 43. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequences · References · Cross-references · Entry information

Names and origin

Protein namesSubmitted name:
    Putative uncharacterized protein CPE0693 (Endo-alpha-N-acetylgalactosaminidase) EMBL BAB80399.1
    EC=3.2.1.97
Gene names
Name: engCP EMBL BAG69287.1
Ordered Locus Names: CPE0693
OrganismClostridium perfringens [Complete proteome] [HAMAP]
Taxonomic identifier1502 [NCBI]
Taxonomic lineageBacteriaFirmicutesClostridiaClostridialesClostridiaceaeClostridium

Protein attributes

Sequence length1686 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existencePredicted.

General annotation (Comments)

Sequence similarities

Contains 2 fibronectin type-III domains. RuleBase RU000718V1

Ontologies

Keywords
   Molecular functionGlycosidase
Hydrolase
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processmetabolic process

Inferred from electronic annotation. Source: UniProtKB-KW

   Molecular functionglycopeptide alpha-N-acetylgalactosaminidase activity

Inferred from electronic annotation. Source: EC

Complete GO annotation...

Sequences

Sequence LengthMass (Da)Tools
Q8XMJ5-1 [UniParc].

Last modified March 1, 2002. Version 1.
Checksum: 00F276EBE357453B

FASTA1,686187,828
        10         20         30         40         50         60 
MGRKCMNKKI AAIIAAAVIV GQLPISVLAT PVNEAGDEIN SESAEILTNS DEEAEAYIQN 

        70         80         90        100        110        120 
YDRPEGITWT KLAGSGSVEV TDGFLSVTNN GDYRIMEDQS PNIKNGELES KFTVGGSQTG 

       130        140        150        160        170        180 
IIFRATESNY GMINYNSGTG WVIENKNSWE DITGPKLNNG DVVTVKATFV EKHLTVNVSV 

       190        200        210        220        230        240 
NDGEFETIYD KESDLIPLQA GKVGYRGWGN AKTTKFDYIK YAPMTIDKGP IVSINEVNVE 

       250        260        270        280        290        300 
TYPRVKPILP SSVTVNHENG MSSIKDVSWN YIPKESYSKP GTFKVEGTVE GTDVKAIANV 

       310        320        330        340        350        360 
TVSSDLAYYE TNFETEETRG DWQVVQGGGS PSYEEGKVKI PMNGVSIAVD MNSPEVKNFT 

       370        380        390        400        410        420 
YETDFSVDNN GGRIGLLFRY VSETEWGAVC YDNGSWVWKT GDGKYGNFPG TFTPEPGKTY 

       430        440        450        460        470        480 
RIKLKVEDTN ITMWVDGEKI GQVAVSNLPD VRGKVGLTGW FGNKNVTLDN LVVEELGGIM 

       490        500        510        520        530        540 
APEVGPLEEQ SIESDSMKVV LDNRFPTVIR YEWKGTEDVL SGASVDDLEA QYMVEINGEK 

       550        560        570        580        590        600 
RIPKVTSEFA NNEGIYTLNF EDIGMTITLK MTVNENKLRM EVTDIQEGDV KLQTLNFPNH 

       610        620        630        640        650        660 
SLASVSSLNN GKTASVLTTG DWNNINEEFT DVAKAKPGVK GKTYAFINDD KFAVTINNNT 

       670        680        690        700        710        720 
IEGGNRVVLT TENDTLPDNT NYKKVGISNG TWTYKEILQD TTDQGSKLYQ GEKPWSEVII 

       730        740        750        760        770        780 
ARDENEDGQV DWQDGAIQYR KNMKIPVGGE EIKNQMSYID FNIGYTQNPF LRSLDTIKKL 

       790        800        810        820        830        840 
SNYTDGFGQL VLHKGYQGEG HDDSHPDYGG HIGMRQGGKE DFNTLIEQGK EYNAKIGVHI 

       850        860        870        880        890        900 
NATEYTMDAF EYPTELVNEN APGWGWLDQA YYVNQRGDIT SGELFRRLDM LMEDAPELGW 

       910        920        930        940        950        960 
IYVDVYTGNG WNAHQLGEKI NDYGIMIATE MNGPLEQHVP WTHWGGDPAY PNKGNASKIM 

       970        980        990       1000       1010       1020 
RFMKNDTQDS FLADPLVKGN KHLLSGGWGT RHDIEGAYGT EVFYNQVLPT KYLQHFQITK 

      1030       1040       1050       1060       1070       1080 
MSENEVLFEN GVKAVRENSN INYYRNDRLV ATTPENSIGN TGIGDTQLFL PWNPVDEANS 

      1090       1100       1110       1120       1130       1140 
EKIYHWNPLG TTSEWTLPEG WTSNDKVYLY ELSDLGRTLV KEVPVVDGKV NLEVKQDTPY 

      1150       1160       1170       1180       1190       1200 
IVTKEKVEEK RIEDWGYGSE IADPGFDSQT FDKWNKESTA ENTDHITIEN ESVQKRLGND 

      1210       1220       1230       1240       1250       1260 
VLKISGNEGA DAKISQSISG LEEGVTYSVS AWVKNDNNRE VTLGVNVGGK DFTNVITSGG 

      1270       1280       1290       1300       1310       1320 
KVRQGEGVKY IDDTFVRMEV EFTVPKGVNS ADVYLKASEG DADSVVLVDD FRIWDHPGHT 

      1330       1340       1350       1360       1370       1380 
NRDGYVFYED FENVDEGISP FYLSPGRGHS NRSHLAEKDI SIDANQRMNW VLDGRFSLKS 

      1390       1400       1410       1420       1430       1440 
NQQPKEIGEM LTTDVSSFKL EPNKTYEFGF LYSLENAAPG YSVNIKNRDG EKIVSIPLEA 

      1450       1460       1470       1480       1490       1500 
TGSNYAQDIF TKTKSVTHEF TTGDFAGDYY ITLEKGDGFK EVILDNIYVK EIDKSIESPE 

      1510       1520       1530       1540       1550       1560 
LAHVNLNTVE HDLEVGQSVP FAINALMNNG ANVNLEEAEV EYKVSKPEVL TIENGMMTGA 

      1570       1580       1590       1600       1610       1620 
SEGFTDVQVN ITVNGNKVSS NTVRVKVGNP EVEEEEVIVN PVRNFKVTDK TKKNVTVSWE 

      1630       1640       1650       1660       1670       1680 
EPEKTYGLEG YVLYKDGKKV KEIGADKTEF TFKGLNRHTI YNFKIAAKYS NGELSTKESI 


TVRTAR 

« Hide

References

« Hide 'large scale' references
[1]"Complete genome sequence of Clostridium perfringens, an anaerobic flesh-eater."
Shimizu T., Ohtani K., Hirakawa H., Ohshima K., Yamashita A., Shiba T., Ogasawara N., Hattori M., Kuhara S., Hayashi H.
Proc. Natl. Acad. Sci. U.S.A. 99:996-1001(2002) [PubMed: 11792842] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: 13 / Type A.
[2]"Characterization of two different endo-alpha-N-acetylgalactosaminidases from probiotic and pathogenic enterobacteria, Bifidobacterium longum and Clostridium perfringens."
Ashida H., Maki R., Ozawa H., Tani Y., Kiyohara M., Fujita M., Imamura A., Ishida H., Kiso M., Yamamoto K.
Glycobiology 18:727-734(2008) [PubMed: 18559962] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: Strain 13 EMBL BAG69287.1.

Cross-references

Sequence databases

BA000016 Genomic DNA. Translation: BAB80399.1.
AB427163 Genomic DNA. Translation: BAG69287.1.
RefSeqNP_561609.1.

3D structure databases

ModBaseSearch...

Protein family/group databases

CAZyGH101. Glycoside Hydrolase Family 101.

Genome annotation databases

GeneID988952.
GenomeReviewsGene locus CPE0693 in contig BA000016_GR.
KEGGcpe:CPE0693.
NMPDRfig|195102.1.peg.756.

Organism-specific databases

CMRSearch...

Phylogenomic databases

HOGENOMQ8XMJ5.
OMAQ8XMJ5. VHINATE.

Enzyme and pathway databases

BioCycCPER195102:CPE0693-MON.

Family and domain databases

InterProIPR011081. Big_4.
IPR008957. Fibronectin_typ-III-like_fold.
IPR003961. FN_III.
[Graphical view]
Gene3DG3DSA:2.60.40.30. FN_III-like. 1 hit.
PfamPF07532. Big_4. 1 hit.
PF00041. fn3. 1 hit.
[Graphical view]
SMARTSM00060. FN3. 2 hits.
[Graphical view]
PROSITEPS50853. FN3. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameQ8XMJ5_CLOPE
AccessionPrimary (citable) accession number: Q8XMJ5
Entry history
Integrated into UniProtKB/TrEMBL: March 1, 2002
Last sequence update: March 1, 2002
Last modified: May 5, 2009
This is version 43 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequences · References · Cross-references · Entry information