Skip Header

 
Contribute Send feedback
Read comments (1) or add your own

Unreviewed, UniProtKB/TrEMBL Q934I7 (Q934I7_9SPHI)

Last modified April 14, 2009. Version 26. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · Ontologies · Sequences · References · Cross-references · Entry information

Names and origin

Protein namesSubmitted name:
    MS115, putative beta-agarase EMBL AAK62837.1
Encoded onPlasmid pSD15 EMBL AAK62837.1
OrganismMicroscilla sp. PRE1 EMBL AAK62837.1
Taxonomic identifier155537 [NCBI]
Taxonomic lineageBacteriaBacteroidetesSphingobacteriaSphingobacterialesFlexibacteraceaeMicroscilla

Protein attributes

Sequence length1330 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existencePredicted.

Ontologies

Keywords
   Technical termPlasmid
Gene Ontology (GO)
   Molecular functioncarbohydrate binding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequences

Sequence LengthMass (Da)Tools
Q934I7-1 [UniParc].

Last modified December 1, 2001. Version 1.
Checksum: 083073D8EFCEB28F

FASTA1,330143,942
        10         20         30         40         50         60 
MYGKVVLFTV LFLGNIFCLY SQGVQVDVNL NVKHSVGGVS DFGRDRHMTV HSSLTEPDWQ 

        70         80         90        100        110        120 
GEEAKMDYLL TDLDTYLGRD NGSATWKFAS TPQDPNNPHH PSVDSMQAFG DWLKGEYESL 

       130        140        150        160        170        180 
TNRHQYESRA SGMIMGTNAH PTYPTLSWYA NGSTWTDPQW QPKDVQTSAD WVTEYLDKFF 

       190        200        210        220        230        240 
AHSPSVDGEP LPKYWEVVNE PDMEYMTGKF MVTSQEKIWE YHNLVAQGVK ERLGTDAPLI 

       250        260        270        280        290        300 
GGMTWGLHDL FAGDGLSRYQ PDYLDQYLDA ETAEFYRNAA ATQWPGNQNQ PWYQWDVQWK 

       310        320        330        340        350        360 
GFIDAAGANM DFYSVHFYDW PTYNASGGAV RSGGHVEATL DMLEWYDVQK FGVSNRKPVV 

       370        380        390        400        410        420 
ISEYGAVQGS WTYLPHDNRY DWECIKPFNS MLMQFLERPD YIIYTLPFTP IKAQWGDVDQ 

       430        440        450        460        470        480 
NGDGTPEYVY QYKLMRDDDH DGNWEWSDYI KFYELWSEVK GTRIDTKSTD PDIQIDAYVD 

       490        500        510        520        530        540 
GKDVFLILNN LENQATTIHL NLYEDFGNNV QNVNIKHLHL TGTSTVTLDN NDHATAPESV 

       550        560        570        580        590        600 
QLAGDGTMVI KYTYGSAVNI NHNSIEKKFY GESLSGTVPN RVSIPNGEMT MQINGVDVPA 

       610        620        630        640        650        660 
DASKAEAMLR ITCALYNDDD NQVGHLSIDK LTVNGTEIET PLDWRGTNQV RNRYFSTLEI 

       670        680        690        700        710        720 
PVPVGLLQTN NTFTVDFHHV GEVAVVNLQT WEFSKVPGRS TSSDPVAVTG VTVSPGSPTV 

       730        740        750        760        770        780 
AQGSTVQMTA HVQPTNATDQ SVTWSSSNAN VASVNASGEV TGIAQGTATI TATTNDGGFI 

       790        800        810        820        830        840 
ASTQVMVTTG DVDVTGVSVT PTSASLLVGQ SIDLTETVSP TNATDKSVTW TTSNSAVVTV 

       850        860        870        880        890        900 
NGSGLVTAKG NGSATVTVTT NDGGFTAQSA VTVTTGGGGS AIVIEAESFT STGGTTDDSP 

       910        920        930        940        950        960 
YGGPGIGVNN AGTNINYVNS EDWAEYGINV SEAGTYQIEY QISTPSNNAQ VRFELDGNVV 

       970        980        990       1000       1010       1020 
STDNVPNNGQ WDSYTKLIAG STISTLSTGS HTVRLVASGA NAWQWNLDKV TLTKTGSSTV 

      1030       1040       1050       1060       1070       1080 
NVTGVSASPT NVSLSIGGST DLTETVNPGN ATDKSVSWSS NNTAVATVDA NGLVSAVGAG 

      1090       1100       1110       1120       1130       1140 
TAVITVTTSD GGHTATCSVT VSGGNSVELT IQAEDFATTG GTHDGFQVYS VNGVTATNWN 

      1150       1160       1170       1180       1190       1200 
QTGDWADYSV TIPEAGDYSI EYFMGTTVNG AAVTISVDGA VQRTDAVPNN GNWDVFESLK 

      1210       1220       1230       1240       1250       1260 
VGGRISLNAG SHTIRLLGDG TNGWEWNMDR FVLSKGAASS RVESSSSSQI VNQGISIYPV 

      1270       1280       1290       1300       1310       1320 
PADDKITVRG LAPDLYQLTI SNVSGKIVRK MSVEGPNDYI LDVGDLKTGV YFLHFHGSKT 

      1330 
TFNARIIMQH 

« Hide

References

[1]"Sequence analysis of a 101-kilobase plasmid required for agar degradation by a Microscilla isolate."
Zhong Z., Toukdarian A., Helinski D., Knauf V., Sykes S., Wilkinson J.E., O'Bryne C., Shea T., DeLoughery C., Caspi R.
Appl. Environ. Microbiol. 67:5771-5779(2001) [PubMed: 11722934] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: PRE1 EMBL AAK62837.1.

Cross-references

Sequence databases

AF339846 Genomic DNA. Translation: AAK62837.1.
RefSeqNP_116803.1.

3D structure databases

ModBaseSearch...

Protein family/group databases

CAZyCBM6. Carbohydrate-Binding Module Family 6.
GH86. Glycoside Hydrolase Family 86.

Genome annotation databases

GeneID3207704.

Family and domain databases

InterProIPR003343. Big_2.
IPR005084. CBM_6.
IPR006584. Cellulose_bd_IV.
[Graphical view]
PfamPF02368. Big_2. 3 hits.
PF03422. CBM_6. 2 hits.
[Graphical view]
SMARTSM00635. BID_2. 3 hits.
SM00606. CBD_IV. 2 hits.
[Graphical view]
PROSITEPS51175. CBM6. 2 hits.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameQ934I7_9SPHI
AccessionPrimary (citable) accession number: Q934I7
Entry history
Integrated into UniProtKB/TrEMBL: December 1, 2001
Last sequence update: December 1, 2001
Last modified: April 14, 2009
This is version 26 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)
Names and origin · Protein attributes · Ontologies · Sequences · References · Cross-references · Entry information