Q8R874 (Q8R874_THETN) Unreviewed, UniProtKB/TrEMBL

Last modified January 25, 2012. Version 43.

Names and origin

Protein attributes

Sequence length: 2862 AA.
Sequence status: Complete.
Protein existence: Predicted

Ontologies

Keywords
   Technical term: Complete proteome
Gene Ontology (GO)
   Biological process: carbohydrate metabolic process
      Inferred from electronic annotation. Source: InterPro
   Molecular function: carbohydrate binding
      Inferred from electronic annotation. Source: InterPro
   Molecular function: catalytic activity
      Inferred from electronic annotation. Source: InterPro


Sequences

Sequence: Q8R874 [UniParc].
Length: 2,862
Mass (Da): 330,543
Last modified June 1, 2002. Version 1.
Checksum: FE1509962ED2CCEF

        10         20         30         40         50         60 
MWYAVFVLIL LLAGIYLKSK KIEVDDSMRE FDDIILSSEE MEKHAEELAQ NHVIANRNRA 

        70         80         90        100        110        120 
SFLLIPRMNK NYEYIKSVYR SLNNLLKEKD TYISQEEEWL LDNFYIIEEQ VKEIRKSLSK 

       130        140        150        160        170        180 
KYYAGLPVLK NGAFRGYPRV YALAFELVLH TDGKIEEKGI INFIKAYQKK ALLTTSELWA 

       190        200        210        220        230        240 
LSLMIRIALI EKIKKVCEEI VESRLQREKA EKMLSALMEK EMSYEEVKKL IKSNIKVVDR 

       250        260        270        280        290        300 
FPLQFVEYLV SRIKREGSNS SDILKTLEKI LMEYDSSIND IAEKAHYFQA KRQVSIGNAI 

       310        320        330        340        350        360 
VSLKTVSSLD WAEIFETLSP VEQVLKQDPD GTYPKMDFES KDYYRHEIEK LARYYNVSET 

       370        380        390        400        410        420 
YVAKKAVECA REVADQGENL GYINHVGFYL IGKGRSILES KLNNKKRRFF DFYRIRQKNP 

       430        440        450        460        470        480 
ATVYFGLIIL FFALGEIISL GYLRHFTGSF WNLFASSLVL AIPLSEISIQ MTNWVLMHIF 

       490        500        510        520        530        540 
KPVMLPKIEL KDGIPDDAKT FVVISSLLPD EKKAKELVEN LEVYYHANRE RNLYFGILGD 

       550        560        570        580        590        600 
FKDAPLEVMP EDEKIVKATL EEIEKLNEKY AENGEKVFYY FHRKRIYNEM QKSWMGWERK 

       610        620        630        640        650        660 
RGALMEFVDL LRGEKDTTFY IVSDDVSKLG IKYVITLDAD TNLPIDTAKK LVGAMLHPLN 

       670        680        690        700        710        720 
RAIIDRDEGI VVEGYGLLQP RIGVDIESAN ASLFSKIYGG EGGIDPYTTA TSDIYQDLFG 

       730        740        750        760        770        780 
EGIYTGKGIF DVDVFRELLK DTIPDNSILS HDLLEGSFVR TGLVTDIELI DGFPAKYNSY 

       790        800        810        820        830        840 
MMRLHRWVRG DWQLLPYLRS KIRNRRGELI RNPLSLITKW KIMDNLRRSL ISISLIVMLF 

       850        860        870        880        890        900 
LGFSALPASA LFWVAVAALT VFFPVMPALF DLIFRGQLRQ YLEKRHRAVI TGVEVAFYQA 

       910        920        930        940        950        960 
LLNFIFLPYN AYIMADAIIR TISRMYITKR NLLEWVTAAD MEKRLKNDFI SFVKRMWVVL 

       970        980        990       1000       1010       1020 
LKGVVLILLT AYFKPGALIF AVGVFFLWAF SPYVAFYISQ PVLLKIKFIL DEDIEEVRLI 

      1030       1040       1050       1060       1070       1080 
ARKTWKFFED TVTEAQNYLP PDNFQEDPPN GIAERTSPTN IGLYLVSTVG ARDLGYITTS 

      1090       1100       1110       1120       1130       1140 
EMVDRIENTI NTIKKMEKWN GHLFNWYDTR TLKPLRPYYV STVDSGNLVG YLITVKEALE 

      1150       1160       1170       1180       1190       1200 
EFLDKPVIDL EFLRGLKDTV RMLKIERIDK SLFEEFLKKG DIDPLAWKKI LDDLEEVEEE 

      1210       1220       1230       1240       1250       1260 
RLRDIVKKFK NEIREFMPWL EFEDAEGGYG EIFNECNSFE ELKKVYEKYL EETFRAKKEG 

      1270       1280       1290       1300       1310       1320 
LPEFKIKQIQ RAVEKIEELK ERILKLKQEI EDIIEKTEFK HLYDEKRQLF SIGYNVEEEK 

      1330       1340       1350       1360       1370       1380 
LTKSYYDLLA SEARQASFIA IAKREVDKKH WFKLGRMLTR ANRSKGLVSW SGTMFEYFMP 

      1390       1400       1410       1420       1430       1440 
LLIMKNYENT LLDETYSFAA KVQKEYGVKL GIPWGISESG FYAFDMSLNY QYKAFGVPIL 

      1450       1460       1470       1480       1490       1500 
GLKRGLSHDK VVAPYGSILA ISVDPEGVMK NIEFLKKEGA EGEYGLYEAI DYTPERVPFG 

      1510       1520       1530       1540       1550       1560 
KKNAIVKSFM AHHQGMIFVA IDNFIHENIM QKRFHRDPRV KATQILLQEK APIYLDMTRE 

      1570       1580       1590       1600       1610       1620 
EREEPRKIQK IRKEDLDFVR VLGESRSWIP EVHIVSSGKY FVMLTEKGTG YSKNIKGIFL 

      1630       1640       1650       1660       1670       1680 
NRWRKDIAQD YGTFIFIRNV DSNEVWSATF APFYQKGQHY RVVFSADKAE YFKRVGGIDS 

      1690       1700       1710       1720       1730       1740 
YLEITVSPED DVEIRRLTLK NHSKYPQILE ITSFSEISLM DLPSDVAHPA FNKLFVKTEF 

      1750       1760       1770       1780       1790       1800 
LKDEDAIIVC RRPRDPEKSR LWALHKVVVL SGEAMGDTQF ETDRLKFIGR GRSVRKPLAL 

      1810       1820       1830       1840       1850       1860 
EPDQPLSNTE GAVLDPIVSL RKRIRIMPGG VAKIAYISAI TETKEEAVKI VSKYKEENAI 

      1870       1880       1890       1900       1910       1920 
ERAFEMSWTR SRVELEYINL KPRELGLFQR MLPYLIFASP QRKMREEMIL KNTKGQSGLW 

      1930       1940       1950       1960       1970       1980 
AHGISGDLPI VLLEVEKMEE IELVKWFLKA YEYWRMKGIN IDLVIVNKDK SGYLQPLNDK 

      1990       2000       2010       2020       2030       2040 
IKEVINTTFA YDVFGKYGGV YLLQENNLKE DDFYLLNAVA ALKFDGKNES IYDQIMVKVH 

      2050       2060       2070       2080       2090       2100 
KKALKPRSFQ EKVSSCRDDG LEEIELQYYN GFGGFTPDGK EYVIKWEGKS SPAPWINIIS 

      2110       2120       2130       2140       2150       2160 
NPNFGFQVSE VGAGYTWAEN SREYKLTPWY NDPVLDPHGE VIYLIDEITG EKWTITPHPA 

      2170       2180       2190       2200       2210       2220 
GNSGIYYIRH GFGYSTFESA SCELKSRLTM FVPKEDSVKI NLIKLKNTSK NSRKIQIVYY 

      2230       2240       2250       2260       2270       2280 
IRPVLGVTDE ATSQYIASEF DKEERILYIR NVYNEDFVNR IAFLATSEGI NSYESERGEF 

      2290       2300       2310       2320       2330       2340 
IGVGFDLSSP QALSYETLSN SEGLAVDPCS AIEFSVEIGP GEEKEISILL GHAKEKKEAK 

      2350       2360       2370       2380       2390       2400 
DLVLKYLKVE NCKKELEKVK GFWGEILGKL TVNTPDKSLD LLVNGWLPYQ TIACRLWARS 

      2410       2420       2430       2440       2450       2460 
AFYQSGGAYG FRDQLQDAMN MVLLNPEFTK RQIINACEHQ FIEGDVQHWW HPVLNKGIRT 

      2470       2480       2490       2500       2510       2520 
KFSDDLLWLP YVVADYLEKT EDWAILEEKA GYLEDLPLKE EEEERYSVPS ISSHKGTVYE 

      2530       2540       2550       2560       2570       2580 
HCVKAIDYAL KFGEHGLPLI GTGDWNDGMN KVGHRGKGES VWLGWFLYTV LKKFASISEK 

      2590       2600       2610       2620       2630       2640 
MGDIERKEKY IKEAERLLKS IEENAWDGSW YKRAYFDDGT PLGSINNLEC KIDSISQSWA 

      2650       2660       2670       2680       2690       2700 
LISKGGRIER AKEAMKAVVN YLVNEEEGII KLLTPPFDSG DLNPGYIKGY VPGVRENGGQ 

      2710       2720       2730       2740       2750       2760 
YTHAAAWVIL AFTELGDGDT AWKLYNMINP INHTRTPIEC MKYKVEPYVM AADVYAVDPH 

      2770       2780       2790       2800       2810       2820 
AGRGGWTWYT GAAGWMYRVA VEHILGLKKY GDKFTVDPCV PRNWESFVIE YAHGHSKYVI 

      2830       2840       2850       2860 
KVINPDRVNK GVREIYLDGE PVDKFVPLKD ENKVFRVLVV MG 
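
The sequence above is printed in rows of six 10-residue groups under a position ruler. A minimal sketch of how such a block could be collapsed into a plain residue string follows; it assumes ruler lines hold only integers and residue lines hold only letters. The function name `parse_sequence_block` is hypothetical and is not part of any UniProt tool.

```python
def parse_sequence_block(text: str) -> str:
    """Join residue groups into one string, skipping ruler and blank lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines contain only integers (10, 20, ...); skip them and blanks.
        if not tokens or all(t.isdigit() for t in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First ruler/residue row of the entry, copied from the block above.
first_rows = """\
        10         20         30         40         50         60
MWYAVFVLIL LLAGIYLKSK KIEVDDSMRE FDDIILSSEE MEKHAEELAQ NHVIANRNRA
"""

seq = parse_sequence_block(first_rows)
print(len(seq))  # number of residues recovered from the sample row
```

Run on the whole block, the recovered string should be as long as the sequence length stated in the entry (2862 AA), which gives a quick check that no row was lost in copying.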


References

[1] "A complete sequence of the T. tengcongensis genome."
Bao Q., Tian Y., Li W., Xu Z., Xuan Z., Hu S., Dong W., Yang J., Chen Y., Xue Y., Xu Y., Lai X., Huang L., Dong X., Ma Y., Ling L., Tan H., Chen R., Wang J., Yu J., Yang H.
Genome Res. 12:689-700(2002) [PubMed: 11997336]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: DSM 15242 / JCM 11007 / NBRC 100824 / MB4.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: AE008691 Genomic DNA. Translation: AAM25311.1.
RefSeq: NP_623707.1. NC_003869.1.

3D structure databases

ProteinModelPortal: Q8R874.

Genome annotation databases

GeneID: 997466.
GenomeReviews: Gene locus TTE2146 in contig AE008691_GR.
KEGG: tte:TTE2146.
NMPDR: fig|273068.3.peg.2250.
PATRIC: 23899035. VBITheTen82880_2154.

Phylogenomic databases

HOGENOM: HBG294780.
OMA: GGDWNDG.
ProtClustDB: CLSK2461803.

Enzyme and pathway databases

BioCyc: TTEN273068:TTE2146-MONOMER.

Family and domain databases

InterPro
   IPR008928. 6-hairpin_glycosidase-like.
   IPR009342. Carb-bd_put_dom.
   IPR019282. DUF2329.
   IPR011013. Glyco_hydro-type_carb-bd.
   IPR010383. Glyco_transf_36.
   IPR010403. GT36_AF.
Pfam
   PF06204. CBM_X. 2 hits.
   PF10091. DUF2329. 1 hit.
   PF06165. Glyco_transf_36. 2 hits.
   PF06205. GT36_AF. 2 hits.
SMART
   SM01068. CBM_X. 2 hits.
SUPFAM
   SSF74650. Gal_mut_like. 2 hits.
   SSF48208. Glyco_trans_6hp. 1 hit.

Entry information

Entry name: Q8R874_THETN
Accession: Primary (citable) accession number: Q8R874
Entry history
Integrated into UniProtKB/TrEMBL: June 1, 2002
Last sequence update: June 1, 2002
Last modified: January 25, 2012
This is version 43 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)