Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P46590 (ALS1_CANAL)

Last modified November 24, 2009. Version 36. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (1) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Agglutinin-like protein 1
Gene names
Name: ALS1
OrganismCandida albicans (Yeast)
Taxonomic identifier5476 [NCBI]
Taxonomic lineageEukaryotaFungiDikaryaAscomycotaSaccharomycotinaSaccharomycetesSaccharomycetalesmitosporic SaccharomycetalesCandida

Protein attributes

Sequence length1260 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existencePredicted.

General annotation (Comments)

Function

May play a role in adhesion and pathogenesis.

Post-translational modification

N-glycosylated and O-glycosylated Potential.

Sequence similarities

To yeast SAG1.

Ontologies

Keywords
   Biological processCell adhesion
   DomainRepeat
Signal
   PTMGlycoprotein
Gene Ontology (GO)
   Biological processcell adhesion

Inferred from electronic annotation. Source: UniProtKB-KW

   Molecular functionprotein binding

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1717 Potential
Chain18 – 12601243Agglutinin-like protein 1
PRO_0000020691

Regions

Repeat433 – 468361-1
Repeat469 – 504361-2
Repeat505 – 540361-3
Repeat541 – 576361-4
Repeat577 – 612361-5
Repeat613 – 648361-6
Repeat649 – 684361-7
Repeat685 – 720361-8
Repeat721 – 756361-9
Repeat757 – 792361-10
Repeat983 – 1043612-1
Repeat1092 – 1152612-2
Region433 – 79236010 X 36 AA tandem repeats
Region983 – 11521702 X 26 AA approximate repeats
Compositional bias399 – 4046Poly-Thr
Compositional bias408 – 41811Poly-Thr
Compositional bias450 – 4556Poly-Thr
Compositional bias486 – 4916Poly-Thr
Compositional bias522 – 5276Poly-Thr
Compositional bias558 – 5636Poly-Thr
Compositional bias594 – 5996Poly-Thr
Compositional bias630 – 6356Poly-Thr
Compositional bias666 – 6716Poly-Thr
Compositional bias702 – 7076Poly-Thr
Compositional bias738 – 7436Poly-Thr
Compositional bias774 – 7796Poly-Thr
Compositional bias874 – 8774Poly-Ser

Amino acid modifications

Glycosylation4711N-linked (GlcNAc...) Potential
Glycosylation5791N-linked (GlcNAc...) Potential
Glycosylation6151N-linked (GlcNAc...) Potential
Glycosylation6871N-linked (GlcNAc...) Potential
Glycosylation7231N-linked (GlcNAc...) Potential
Glycosylation8201N-linked (GlcNAc...) Potential
Glycosylation8861N-linked (GlcNAc...) Potential
Glycosylation9181N-linked (GlcNAc...) Potential
Glycosylation9731N-linked (GlcNAc...) Potential
Glycosylation10451N-linked (GlcNAc...) Potential
Glycosylation10681N-linked (GlcNAc...) Potential

Sequences

Sequence LengthMass (Da)Tools
P46590-1 [UniParc].

Last modified November 1, 1995. Version 1.
Checksum: 763D1063A2354C24

FASTA1,260132,641
        10         20         30         40         50         60 
MLQQFTLLFL YLSIASAKTI TGVFDSFNSL TWSNAANYAF KGPGYPTWNA VLGWSLDGTS 

        70         80         90        100        110        120 
ANPGDTFTLN MPCVFKYTTS QTSVDLTADG VKYATCQFYS GEEFTTFSTL TCTVNDALKS 

       130        140        150        160        170        180 
SIKAFGTVTL PIAFNVGGTG SSTDLEDSKC FTAGTNTVTF NDGDKDISID VEFEKSTVDP 

       190        200        210        220        230        240 
SAYLYASRVM PSLNKVTTLF VAPQCENGYT SGTMGFSSSN GDVAIDCSNI HIGITKGLND 

       250        260        270        280        290        300 
WNYPVSSESF SYTKTCTSNG IQIKYQNVPA GYRPFIDAYI SATDVNQYTL AYTNDYTCAG 

       310        320        330        340        350        360 
SRSQSKPFTL RWTGYKNSDA GSNGIVIVAT TRTVTDSTTA VTTLPFNPSV DKTKTIEILQ 

       370        380        390        400        410        420 
PIPTTTITTS YVGVTTSYST KTAPIGETAT VIVDVPYHTT TTVTSEWTGT ITTTTTRTNP 

       430        440        450        460        470        480 
TDSIDTVVVQ VPSPNPTVST TEYWSQSFAT TTTVTAPPGG TDTVIIREPP NHTVTTTEYW 

       490        500        510        520        530        540 
SQSFATTTTV TAPPGGTDSV IIREPPNPTV TTTEYWSQSF ATTTTVTAPP GGTDSVIIRE 

       550        560        570        580        590        600 
PPNPTVTTTE YWSQSYATTT TVTAPPGGTD SVIIREPPNH TVTTTEYWSQ SYATTTTVTA 

       610        620        630        640        650        660 
PPGGTDTVII REPPNHTVTT TEYWSQSFAT TTTVTGPPSG TDTVIIREPP NPTVTTTEYW 

       670        680        690        700        710        720 
SQSYATTTTI TAPPGETDTV LIREPPNHTV TTTEYWSQSY ATTTTVTAPP GETDTVLIRE 

       730        740        750        760        770        780 
PPNHTVTTTE YWSQSYATTT TVTAPPGGTD TVIIREPPNP TVTTTEYWSQ SFATTTTVTA 

       790        800        810        820        830        840 
PPGGTDTVII YESMSSSKIS TSSNDITSII PSFSRPHYVN STTSDLSTFE SSSMNTPTSI 

       850        860        870        880        890        900 
SSDGMLLSST TLVTESETTT ESICSDGKEC SRLSSSSGIV TNPDSNESSI VTSTVPTAST 

       910        920        930        940        950        960 
MSDSLSSTDG ISATSSDNVS KSGVSVTTET SVTTIQTTPN PLSSSVTSLT QLSSIPSVSE 

       970        980        990       1000       1010       1020 
SESKVTFTSN GDNQSGTHDS QSTSTEIEIV TTSSTKVLPP VVSSNTDLTS EPTNTREQPT 

      1030       1040       1050       1060       1070       1080 
TLSTTSNSIT EDITTSQPTG DNGDNTSSTN PVPTVATSTL ASASEEDNKS GSHESASTSL 

      1090       1100       1110       1120       1130       1140 
KPSMGENSGL TTSTEIEATT TSPTEAPSPA VSSGTDVTTE PTDTREQPTT LSTTSKTNSE 

      1150       1160       1170       1180       1190       1200 
SVATTQATNE NGGKSPSTDL TSSLTTGTSA STSANSELVT SGSVTGGAVA SASNDQSHST 

      1210       1220       1230       1240       1250       1260 
SVTNSNSIVS NTPQTTLSQQ VTSSSPSTNT FIASTYDGSG SIIQHSTWLY GLITLLSLFI 

« Hide

References

[1]"Candida albicans ALS1: domains related to a Saccharomyces cerevisiae sexual agglutinin separated by a repeating motif."
Hoyer L.L., Scherer S., Shatzman A.R., Livi G.P.
Mol. Microbiol. 15:39-54(1995) [PubMed: 7752895] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: ATCC 11651 / B792 / 171D.

Cross-references

Sequence databases

L25902 Genomic DNA. Translation: AAC41649.2.
PIRS60896.

3D structure databases

ModBaseSearch...

Family and domain databases

InterProIPR008966. Adhesion_bac.
IPR008440. Candida_ALS.
[Graphical view]
PfamPF05792. Candida_ALS. 2 hits.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameALS1_CANAL
AccessionPrimary (citable) accession number: P46590
Entry history
Integrated into UniProtKB/Swiss-Prot: November 1, 1995
Last sequence update: November 1, 1995
Last modified: November 24, 2009
This is version 36 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectFPAP (Fungal Proteome Annotation Project)

Relevant documents

Candida albicans

Candida albicans: entries and gene names

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents