A1R4S7 (A1R4S7_ARTAT) Unreviewed, UniProtKB/TrEMBL

Last modified December 14, 2011. Version 25.

Names and origin

Protein names
Gene names
Ordered Locus Names: AAur_1465
Organism: Arthrobacter aurescens (strain TC1) [Complete proteome] [HAMAP] EMBL ABM10140.1
Taxonomic identifier: 290340 [NCBI]
Taxonomic lineage: Bacteria > Actinobacteria > Actinobacteridae > Actinomycetales > Micrococcineae > Micrococcaceae > Arthrobacter

Protein attributes

Sequence length: 1465 AA.
Sequence status: Complete.
Protein existence: Predicted

Ontologies

Keywords
   Technical term: Complete proteome
Gene Ontology (GO)
   Molecular function: glycopeptide alpha-N-acetylgalactosaminidase activity
   Inferred from electronic annotation. Source: InterPro

Sequences

Sequence: A1R4S7 [UniParc]
Length: 1,465
Mass (Da): 155,265
Last modified February 6, 2007. Version 1.
Checksum: 7502C82D92C0E095
        10         20         30         40         50         60 
MPRLSSPGRL ASLSLACVVA SSSLGLLAIP PAAAAPSTQP ADIVSAADTA TITSGDLRVD 

        70         80         90        100        110        120 
VGTTFPQVLG YTDAASKARL DGTTTRLSTI TLNGTEYTVS GTSAASGKDA RDYVLTLPDF 

       130        140        150        160        170        180 
GNTVIKARLS VKKNVVSFNI TEIKDSAEHQ VRTLQLPRLN LVTVGSTQPG SQVSTANLSV 

       190        200        210        220        230        240 
DRSVTGDEFT PITASTPLDA AAKSSAYALA NTATLGAAVE SNALYDTSSG PGAKDRGRFW 

       250        260        270        280        290        300 
RQAVSDGAGG VNMGLASGQW LYRAEGSTTT EELPWTRVAI TSDANNDGGV DWQDAAIAMR 

       310        320        330        340        350        360 
SIQVSPNKGE QTPDNVITHI PFNFASQATH PFLRTLDDVK RISLATDGLG QVAMLKGYTS 

       370        380        390        400        410        420 
EGHDSANTDY GNNFNTRAGG LEDLNTLVKE GKEWNASFGV HINATEIYPE AKSFSEDLLR 

       430        440        450        460        470        480 
ADKGLGWNWL DQSYYMNQRE DINSGKLAQR IKELRESTNK NLDFVYVDVY YEFGWLAERL 

       490        500        510        520        530        540 
QQELVKNGFR VGSEWADHLS RNNTWSHWAN DEKYGGSTNK GINSQILRFI NNTQSDVWNP 

       550        560        570        580        590        600 
DPKLGVSHIV EFEGWTGQND FNAFSENVWT ANVPAKFLQH HPITKWTAER IELADGVAVT 

       610        620        630        640        650        660 
GNTAEGRNIT VGGTSVLQGG TYLLPWSSKE NGKVDKLYHY NPTGGASTWT LTQEFAKSSS 

       670        680        690        700        710        720 
LEQFKLTDNG RVKVADVPVV NGQVTVTADA KQPYILAPKN NKAELPKKAD FGEGTAFNDP 

       730        740        750        760        770        780 
GFNGTDLSPW NPAGPVTQVR DDKGRRFAEM GATPSSISQD VQLDAGTQSV SAWIEIQPGK 

       790        800        810        820        830        840 
TRPTTLSVDI DGKTESVTID SSNAENYVAG DEKHGTAFQR IRVLVDVPRN NTKATVTVQA 

       850        860        870        880        890        900 
ADGDATVRVD DFRAVKTTRV PTTGVLSEDF ENVDQGWGPF VKGDAGGSTD PRTHITERHE 

       910        920        930        940        950        960 
PFTQKGWDAN VIDEVLDGTW SLIAHDENRA PNGGPGMVYR TTEASVPFQA GHKYKVSFDY 

       970        980        990       1000       1010       1020 
QNSKAGQYAW VSGYDSQAGP AVTGSQAIEA KTSTTRFEQI LDTGFCGDYF VGLQRTGSSN 

      1030       1040       1050       1060       1070       1080 
GSDFTLDNFL VEDLGASEAV PACAQLSAEL QGDVVQQGKA QDFVTTFVSD EPAAISGLAV 

      1090       1100       1110       1120       1130       1140 
ALELPEGWTA TPSTPATAPT LPAGGTLTTT WKITAPASAD GDYPITAKAS YTVSSSGIDP 

      1150       1160       1170       1180       1190       1200 
AGSRTISTTT TVRTLPKPPQ ATVFASDHPW VSATNGWGPV EKDQSNGGTG AGDGTPLTLN 

      1210       1220       1230       1240       1250       1260 
GTVYAKGLGA HANGTVRYYL GGYCTAFTAT VGIDDAQPTR GSVKFSVVAD GTTKVTTPVL 

      1270       1280       1290       1300       1310       1320 
GATSAPLPLT VDVTGAQYVE LVANDAGDSN GNDHADWADA KFTCSSTSQE PPAPVLSGTV 

      1330       1340       1350       1360       1370       1380 
FASDLPWIGS TNGWGPAERD RANGEQNAGD GPALRLDGVV YSKGIGVHAD SKISIATEAK 

      1390       1400       1410       1420       1430       1440 
CTAFTAVAGV DDAKLNKGLH GSVVFIVKGG GRELLRTPVL SADSAALPLN VDITGVQNVE 

      1450       1460 
LIADKNGDDA GDDWGDWADA KFSCA 
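
The listing above is laid out in blocks of 10 residues under position rulers, so it cannot be used directly as a contiguous sequence. The following is a minimal sketch of one way to join such a listing and check its length. The function name `join_blocked_sequence` is hypothetical, and only the first two rows of the listing (positions 1 to 120) are used as sample input.

```python
def join_blocked_sequence(text: str) -> str:
    """Join a blocked sequence listing into one contiguous residue string.

    Assumes ruler lines contain only position numbers (10, 20, ...) and
    residue lines contain only whitespace-separated blocks of letters.
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines made up purely of numbers.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


# First two rows of the listing above (positions 1-120).
excerpt = """
        10         20         30         40         50         60
MPRLSSPGRL ASLSLACVVA SSSLGLLAIP PAAAAPSTQP ADIVSAADTA TITSGDLRVD

        70         80         90        100        110        120
VGTTFPQVLG YTDAASKARL DGTTTRLSTI TLNGTEYTVS GTSAASGKDA RDYVLTLPDF
"""

seq = join_blocked_sequence(excerpt)
print(len(seq))  # number of residues in the excerpt
```

Applied to the full listing, the joined length can be compared against the stated sequence length of 1,465 as a quick integrity check.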

References

[1] "Secrets of soil survival revealed by the genome sequence of Arthrobacter aurescens TC1."
Mongodin E.F., Shapir N., Daugherty S.C., DeBoy R.T., Emerson J.B., Shvartzbeyn A., Radune D., Vamathevan J., Riggs F., Grinberg V., Khouri H.M., Wackett L.P., Nelson K.E., Sadowsky M.J.
PLoS Genet. 2:2094-2106(2006) [PubMed: 17194220]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: CP000474 Genomic DNA. Translation: ABM10140.1.
RefSeq: YP_947239.1. NC_008711.1.

3D structure databases

ProteinModelPortal: A1R4S7.
ModBase: Search...

Protein-protein interaction databases

STRING: A1R4S7.

Protein family/group databases

CAZy: CBM51. Carbohydrate-Binding Module Family 51.
      GH101. Glycoside Hydrolase Family 101.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

GeneID: 4637872.
GenomeReviews: Gene locus AAur_1465 in contig CP000474_GR.
KEGG: aau:AAur_1465.
PATRIC: 20978310. VBIArtAur67810_1495.
TIGR: AAur_1465.

Phylogenomic databases

eggNOG: NOG10337.
HOGENOM: HBG699060.
OMA: VHINATE.
ProtClustDB: CLSK560393.

Family and domain databases

InterPro: IPR018905. A-galactase_NEW3.
          IPR013222. Glyco_hyd_98_carb-bd.
          IPR024746. Glyco_hydro.
Pfam: PF12899. Glyco_hydro_100. 1 hit.
      PF08305. NPCBM. 2 hits.
      PF10633. NPCBM_assoc. 1 hit.
SMART: SM00776. NPCBM. 2 hits.
ProtoNet: Search...

Entry information

Entry name: A1R4S7_ARTAT
Accession: Primary (citable) accession number: A1R4S7
Entry history
   Integrated into UniProtKB/TrEMBL: February 6, 2007
   Last sequence update: February 6, 2007
   Last modified: December 14, 2011
   This is version 25 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)