Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q1R962 (Q1R962_ECOUT) Unreviewed, UniProtKB/TrEMBL

Last modified December 14, 2011. Version 38. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
Gene names
Name:sia EMBL ABE08102.1
Ordered Locus Names:UTI89_C2636
OrganismEscherichia coli (strain UTI89 / UPEC) [Complete proteome] [HAMAP] EMBL ABE08102.1
Taxonomic identifier364106 [NCBI]
Taxonomic lineageBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia

Protein attributes

Sequence length981 AA.
Sequence statusComplete.
Protein existencePredicted

Ontologies

Keywords
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processpathogenesis

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequences

Sequence LengthMass (Da)Tools
Q1R962 [UniParc].

Last modified May 16, 2006. Version 1.
Checksum: C2D8C4DE6C1BBA40

FASTA981109,378
        10         20         30         40         50         60 
MTDITANVIV SMPSQLFTMA RSFKAVANGK IYIGKIDTDP VNTENQIQVY VENEDGSHVP 

        70         80         90        100        110        120 
VSQPIIINAA GYPVYNGQIA KFVTVQGHSM AVYDAYGAQQ FYFPNVLKYD PDQLRQQLED 

       130        140        150        160        170        180 
TDGANKYPKL QIARWRDSYD VRGWGAIGDG VHDDTSALSE LLSVATGGEK IDGRGLTFKV 

       190        200        210        220        230        240 
STLPDVSRFK NARFLFERIP GQPLFYVSED FIQGELFKIT DTPWYNAWTQ DKTFVYDNVI 

       250        260        270        280        290        300 
YAPFMAGDRH GVNNLHVAWV RSGDDGKTWT TPEWLTDLHE NYPTVNYHCM SMGVVRNRLF 

       310        320        330        340        350        360 
AVIETRTVSG NKLQVAELWD RPMSRSLRVY GGITKAANQQ VAYIRITDHG LFAGDFVNFS 

       370        380        390        400        410        420 
NSGVTGVTGN MTVTTVIDKN TFTVTTQNTQ DVDQNNEGRY WSFGTSFHSS PWRKTSLGTI 

       430        440        450        460        470        480 
PSFVDGSTPV TEIHSFATIS DNSFAVGYHN GDIGPRELGI LYFSDAFGSP GSFVRRRIPA 

       490        500        510        520        530        540 
EYEANASEPC VKYYDGILYL TTRGTLSTQP GSSLHRSSDL GTSWNSLRFP NNVHHSNLPF 

       550        560        570        580        590        600 
AKVGDELIIF GSERAFGEWE GGEPDNRYAG NYPRTFMTRV NVNEWSLDNV EWVNVTDQIY 

       610        620        630        640        650        660 
QGGIVNSAVG VGSVCIKDNW LYYIFGGEDF LNPWSIGDNN RKYPYVHDGH PADLYCFRVK 

       670        680        690        700        710        720 
IKQEEFVSRD FVYGATPNRT LPTFMSTSGV RTVPVPVDFT DDVAVQSLTV HAGTSGQVRA 

       730        740        750        760        770        780 
EVKLEGNYAI IAKKVPSDDV TAQRLIVSGG ETTSSADGAM ITLHGSRSST PRRAVYNALE 

       790        800        810        820        830        840 
HLFENGDVKP YLDNVNALGG PGNRFSIVYL GSNPVVTSDG TLKTEPVSPD ETLLDAWGDV 

       850        860        870        880        890        900 
RYIAYKWLNA VAIKGEEGAR IHHGVIAQQL RDVLISHGLM EEESTTCRYA FLCYDDYPAV 

       910        920        930        940        950        960 
YDDVITGQRE MPLTDNDGSI IVDEDDNPVM VMEDIIERVE ITPAGSRWGV RPDLLFYIEA 

       970        980 
AWQRREIERI KARLDLIEGK H 

« Hide

References

[1]"Identification of genes subject to positive selection in uropathogenic strains of Escherichia coli: a comparative genomics approach."
Chen S.L., Hung C.-S., Xu J., Reigstad C.S., Magrini V., Sabo A., Blasiar D., Bieri T., Meyer R.R., Ozersky P., Armstrong J.R., Fulton R.S., Latreille J.P., Spieth J., Hooton T.M., Mardis E.R., Hultgren S.J., Gordon J.I.
Proc. Natl. Acad. Sci. U.S.A. 103:5977-5982(2006) [PubMed: 16585510] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP000243 Genomic DNA. Translation: ABE08102.1.
RefSeqYP_541633.1. NC_007946.1.

3D structure databases

ProteinModelPortalQ1R962.
SMRQ1R962. Positions 6-109, 146-817.
ModBaseSearch...

Protein-protein interaction databases

STRINGQ1R962.

Protein family/group databases

CAZyGH58. Glycoside Hydrolase Family 58.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaEBESCT00000066166; EBESCP00000063644; EBESCG00000065213.
GeneID3991806.
GenomeReviewsGene locus UTI89_C2636 in contig CP000243_GR.
KEGGeci:UTI89_C2636.
PATRIC18454850. VBIEscCol42261_2661.

Organism-specific databases

CMRSearch...

Phylogenomic databases

eggNOGNOG85669.
GeneTreeEBGT00050000008374.
HOGENOMHBG469174.
OMASTAISAC.
ProtClustDBCLSK773607.

Family and domain databases

InterProIPR023366. ATPase_F1/A1-cplx_a_su_N.
IPR024427. Endosialidase_beta_barrel.
IPR024428. Endosialidase_beta_prop.
IPR024430. Endosialidase_C.
IPR024429. Endosialidase_N-extension.
IPR001724. Glyco_hydro_58.
IPR011040. Neuraminidase.
IPR009093. Phage_P22_Gp9_tailspike_N.
[Graphical view]
Gene3DG3DSA:2.40.30.20. G3DSA:2.40.30.20. 1 hit.
G3DSA:3.30.750.60. G3DSA:3.30.750.60. 1 hit.
G3DSA:4.10.1090.10. G3DSA:4.10.1090.10. 1 hit.
G3DSA:2.120.10.10. Neuraminidase. 2 hits.
G3DSA:2.170.14.10. Tailspk_head_bd. 1 hit.
PfamPF12195. End_beta_barrel. 1 hit.
PF12217. End_beta_propel. 1 hit.
PF12218. End_N_terminal. 1 hit.
PF12219. End_tail_spike. 1 hit.
PF09008. Head_binding. 1 hit.
[Graphical view]
PRINTSPR00849. GLHYDRLASE58.
SUPFAMSSF50939. Sialidase. 1 hit.
SSF51327. Tailspk_head_bd. 1 hit.
ProtoNetSearch...

Entry information

Entry nameQ1R962_ECOUT
AccessionPrimary (citable) accession number: Q1R962
Entry history
Integrated into UniProtKB/TrEMBL: May 16, 2006
Last sequence update: May 16, 2006
Last modified: December 14, 2011
This is version 38 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)