Reviewed, UniProtKB/Swiss-Prot O74657 (ALS2_CANAL)

Last modified November 24, 2009. Version 31.

Names and origin

Protein names
    Recommended name: Agglutinin-like protein 2
Gene names
    Name: ALS2
    ORF names: CaO19.1097, CaO19.8699
Organism: Candida albicans (Yeast)
Taxonomic identifier: 5476 [NCBI]
Taxonomic lineage: Eukaryota > Fungi > Dikarya > Ascomycota > Saccharomycotina > Saccharomycetes > Saccharomycetales > mitosporic Saccharomycetales > Candida

Protein attributes

Sequence length: 1756 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Predicted.

General annotation (Comments)

Function

May play a role in adhesion and pathogenesis.

Post-translational modification

N-glycosylated and O-glycosylated (Potential).

Ontologies

Keywords
   Biological process: Cell adhesion
   Domain: Repeat, Signal
   PTM: Glycoprotein
Gene Ontology (GO)
   Biological process: cell adhesion. Inferred from electronic annotation. Source: UniProtKB-KW
   Molecular function: protein binding. Inferred from electronic annotation. Source: UniProtKB-KW

Sequence annotation (Features)

Feature key     Position(s)  Length  Description                       Feature identifier

Molecule processing

Signal peptide  1 – 17       17      Potential
Chain           18 – 1756    1739    Agglutinin-like protein 2         PRO_0000020692
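
The positions in this table are 1-based and inclusive, which is why the chain 18 – 1756 has length 1739 rather than 1738. As a minimal sketch of that arithmetic, the Python below computes feature lengths and slices the signal peptide out of the first 20 residues of the displayed sequence. The helper names `feature_length` and `slice_feature` are illustrative only and do not belong to any UniProt tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive positions."""
    return end - start + 1


def slice_feature(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a sequence."""
    return sequence[start - 1:end]


# Residues 1-20 of O74657, copied from the displayed sequence.
N_TERMINUS = "MLLQFLLLSLCVSVATAKVI"

print(feature_length(1, 17))              # 17   (signal peptide)
print(feature_length(18, 1756))           # 1739 (chain)
print(slice_feature(N_TERMINUS, 1, 17))   # MLLQFLLLSLCVSVATA
```

Under this convention the mature chain begins at residue 18, the lysine immediately after the predicted signal peptide.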

Amino acid modifications

Glycosylation   253          1       N-linked (GlcNAc...) Potential
Glycosylation   315          1       N-linked (GlcNAc...) Potential
Glycosylation   578          1       N-linked (GlcNAc...) Potential
Glycosylation   614          1       N-linked (GlcNAc...) Potential
Glycosylation   686          1       N-linked (GlcNAc...) Potential
Glycosylation   866          1       N-linked (GlcNAc...) Potential
Glycosylation   1010         1       N-linked (GlcNAc...) Potential
Glycosylation   1154         1       N-linked (GlcNAc...) Potential
Glycosylation   1370         1       N-linked (GlcNAc...) Potential
Glycosylation   1406         1       N-linked (GlcNAc...) Potential
Glycosylation   1442         1       N-linked (GlcNAc...) Potential
Glycosylation   1514         1       N-linked (GlcNAc...) Potential
Glycosylation   1550         1       N-linked (GlcNAc...) Potential
Glycosylation   1586         1       N-linked (GlcNAc...) Potential
Glycosylation   1622         1       N-linked (GlcNAc...) Potential
Glycosylation   1658         1       N-linked (GlcNAc...) Potential
Glycosylation   1730         1       N-linked (GlcNAc...) Potential
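
The entry does not say how these potential sites were predicted. N-linked glycosylation is, however, generally associated with the consensus sequon N-X-S/T, where X is any residue except proline. The sketch below scans two fragments of the displayed sequence for that motif and reports 1-based positions. It assumes the plain consensus motif only, so a match is not evidence of an annotated or actual site. `find_sequons` is a hypothetical helper name.

```python
import re
from typing import List

# Consensus sequon N-X-S/T with X != P. The lookahead keeps matches
# zero-width so that overlapping sequons are all reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(fragment: str, first_position: int) -> List[int]:
    """Return 1-based positions of the Asn in each N-X-S/T match.

    first_position is the sequence position of fragment[0].
    """
    return [first_position + m.start() for m in SEQUON.finditer(fragment)]


# Residues 241-260 and 301-320, copied from the displayed sequence.
FRAGMENT_241_260 = "WNFPVSSDSLSYNKTCSSTG"
FRAGMENT_301_320 = "SLQSKPFNLRLRGYNNSEAN"

print(find_sequons(FRAGMENT_241_260, 241))  # [253]
print(find_sequons(FRAGMENT_301_320, 301))  # [315]
```

Both hits coincide with the first two glycosylation positions listed above.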

Sequences

Sequence: O74657-1 [UniParc]
Length: 1,756 AA
Mass: 188,008 Da
Last modified May 5, 2009. Version 2.
Checksum: 10B72B1E480D5EB6

        10         20         30         40         50         60 
MLLQFLLLSL CVSVATAKVI TGVFNSFDSL TWTRAGNYAY KGPNRPTWNA VLGWSLDGTS 

        70         80         90        100        110        120 
ANPGDTFTLN MPCVFKFITD QTSVDLTAEG VKYATCQFYS GEEFTTFSSL KCTVSNTLTS 

       130        140        150        160        170        180 
SIKALGTVTL PISFNVGGTG SSVDLESSQC FKAGTNTVTF NDGDKKISID VDFEKTNEDA 

       190        200        210        220        230        240 
SGYFIASRLI PSINKVSITY VAPQCANGYT SGAMGFIVLT GDTTIDCSNV HVGITKGLND 

       250        260        270        280        290        300 
WNFPVSSDSL SYNKTCSSTG ISITYENVPA GYRPFFDVYT SVSGQNRQLR YTNDYACVGS 

       310        320        330        340        350        360 
SLQSKPFNLR LRGYNNSEAN SNGFVIVATT RTVTDSTTAV TTLPFNPSVD KTKTIEILQP 

       370        380        390        400        410        420 
IPTTTITTSY VGVTTSYSTK TAPIGETATV IVDVPYHTTT TVTSEWTGTI TTTTTRTNPT 

       430        440        450        460        470        480 
DSIDTVVVQV PSPNPTVTTT EYWSQSYATT TTVTAPPGGT DSVIIREPPN PTVTTTEYWS 

       490        500        510        520        530        540 
QSYATSSTVT APPGGTDTVI IREPPNPTVT TTEYWSQSYA TTTTVTAPPG GTDSVIIREP 

       550        560        570        580        590        600 
PNPTVTTTEY WSQSFATTTT ITAPPGETDT VLIREPPNHT VTTTEYWSQS YVTTSTITAP 

       610        620        630        640        650        660 
PGGTDTVIIR EPPNYTVTTT EYWSQSYATT TTVTAPPGGT DTVIIREPPN PTVTTTEYWS 

       670        680        690        700        710        720 
QSYATTTTVT GPPGGTDTVI IREPPNQTVT TTEYWSQSYA TTTTVTAPPG GTDTVIIREP 

       730        740        750        760        770        780 
PNPTVTTTEY WSQSYATTTT VTGPPGGTDT VIIREPPNPT VTTTEYWSQS YATTTTVTAP 

       790        800        810        820        830        840 
PGGTATVIIR EPPNPTVTTT EYWSQSYATT TTVTGPPGGT DTVIIREPPN PTVTTTEYWS 

       850        860        870        880        890        900 
QSYATTTTVT APPGGTATVI IREPPNYTVT TTEYWSQSYA TTTTVTGPPG GTDTVIIREP 

       910        920        930        940        950        960 
PSPTVTTTEY WSQSYATTTT VTAPPGGTAT VIIREPPNPT VTTTEYWSQS YATTTTVTGP 

       970        980        990       1000       1010       1020 
PGGTDTVIIR EPPNPTVTTT EYWSQSYATT TTVTAPPGGT ATVIIREPPN YTVTTTEYWS 

      1030       1040       1050       1060       1070       1080 
QSYATTTTVT GPPGGTDTVI IREPPSPTVT TTEYWSQSYA TTTTVTAPPG GTATVIIWEP 

      1090       1100       1110       1120       1130       1140 
PNPTVTTTEY WSQSYATTTT VTGPPGGTDT VIIREPPSPT VTTTEYWSQS YATTTTVTAP 

      1150       1160       1170       1180       1190       1200 
PGGTATVIIR EPPNYTVTTT EYWSQSYATT TTVTGPPGGT DTVIIREPPN PTVTTTEYWS 

      1210       1220       1230       1240       1250       1260 
QSFATTTTVT APPGGTDSVI IREPPNPTVT TTEYWSQSYA TTTTVTAPPG GTDSVIIREP 

      1270       1280       1290       1300       1310       1320 
PNPTVTTTEY WSQSFATTTT VTAPPGGTDS VIIREPPNPT VTTTEYWSQS YATTTTVTAP 

      1330       1340       1350       1360       1370       1380 
PGGTDSVIIR EPPNPTVTTT EYWSQSYATT TTVTAPPGGT ATVIIREPPN YTVTTTEYWS 

      1390       1400       1410       1420       1430       1440 
QSYATTTTVT APPGGTATVI IREPPNYTVT TTEYWSQSYA TTTTITAPPG DTDTVIIREP 

      1450       1460       1470       1480       1490       1500 
PNYTVTTTEY WSQSFATTTT VTAPPGGTDS VIIREPPNPT VTTTEYWSQS YATTTTVTAP 

      1510       1520       1530       1540       1550       1560 
PGGTATVIIR EPPNYTVTTT EYWSQSYATT TTVTAPPGGT ATVIIREPPN YTVTTTEYWS 

      1570       1580       1590       1600       1610       1620 
QSYATTTTIT APPGDTDTVI IREPPNYTVT TTEYWSQSYA TTTTVTAPPG GTDTVIIREP 

      1630       1640       1650       1660       1670       1680 
PNYTVTTTEY WSQSYATTTT VTAPPGGTAT VIIREPPNYT VTTTEYWSQS YATTTTVTGP 

      1690       1700       1710       1720       1730       1740 
PGSTDTVIIR EPPNPTVTTT EYWSQSYATT TTVTAPPGGT ATVIIREPPN YTVTTTEYWS 

      1750 
QSYATTTTVT APPRWY 
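
The sequence above is displayed in rows of six 10-residue blocks, each preceded by a ruler line of position numbers. A minimal sketch of turning that layout back into one continuous string follows, shown on the first two rows only. `parse_display_block` is an illustrative name, and the parser assumes that ruler lines contain nothing but numbers.

```python
def parse_display_block(text: str) -> str:
    """Join the residue rows of a ruler-annotated sequence display."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines (tokens that are all digits).
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows (residues 1-120) of the displayed sequence.
EXCERPT = """\
        10         20         30         40         50         60
MLLQFLLLSL CVSVATAKVI TGVFNSFDSL TWTRAGNYAY KGPNRPTWNA VLGWSLDGTS

        70         80         90        100        110        120
ANPGDTFTLN MPCVFKFITD QTSVDLTAEG VKYATCQFYS GEEFTTFSSL KCTVSNTLTS
"""

sequence = parse_display_block(EXCERPT)
print(len(sequence))   # 120
print(sequence[:17])   # MLLQFLLLSLCVSVATA
```

Applied to the whole display, the result should contain 1756 residues, the sequence length stated in the entry.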

References

[1] "The diploid genome sequence of Candida albicans."
Jones T., Federspiel N.A., Chibana H., Dungan J., Kalman S., Magee B.B., Newport G., Thorstenson Y.R., Agabian N., Magee P.T., Davis R.W., Scherer S.
Proc. Natl. Acad. Sci. U.S.A. 101:7329-7334(2004) [PubMed: 15123810]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: SC5314.
[2] "Identification of Candida albicans ALS2 and ALS4 and localization of als proteins to the fungal cell surface."
Hoyer L.L., Payne T.L., Hecht J.E.
J. Bacteriol. 180:5334-5343(1998) [PubMed: 9765564]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-468.
Strain: 1161.

Cross-references

Sequence databases

AACQ01000169 Genomic DNA. Translation: EAK92889.1.
AACQ01000168 Genomic DNA. Translation: EAK92914.1.
AF024580 Genomic DNA. Translation: AAC64235.1.
RefSeq: XP_712085.1, XP_712109.1.

Genome annotation databases

GeneID: 3646295, 3646321.
KEGG: cal:CaO19.1097, cal:CaO19.8699.

Organism-specific databases

CGD: CAL0075525. ALS2.

Family and domain databases

InterPro: IPR008966. Adhesion_bac.
          IPR008440. Candida_ALS.
Pfam: PF05792. Candida_ALS. 6 hits.

Entry information

Entry name: ALS2_CANAL
Accession
    Primary (citable) accession number: O74657
    Secondary accession number(s): Q59QW1
Entry history
    Integrated into UniProtKB/Swiss-Prot: July 15, 1999
    Last sequence update: May 5, 2009
    Last modified: November 24, 2009
    This is version 31 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: FPAP (Fungal Proteome Annotation Project)

Relevant documents

Candida albicans

Candida albicans: entries and gene names
