Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Spectrin alpha chain, non-erythrocytic 1

Gene

Sptan1

Organism
Rattus norvegicus (Rat)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Fodrin, which seems to be involved in secretion, interacts with calmodulin in a calcium-dependent manner and is thus candidate for the calcium-dependent movement of the cytoskeleton at the membrane.By similarity

Regions

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Calcium bindingi2336 – 2347121PROSITE-ProRule annotationAdd
BLAST
Calcium bindingi2379 – 2390122PROSITE-ProRule annotationAdd
BLAST

GO - Molecular functioni

  • calcium ion binding Source: InterPro
  • protein complex binding Source: RGD
  • protein N-terminus binding Source: RGD
  • spectrin binding Source: MGI
  • syntaxin binding Source: RGD

GO - Biological processi

  • actin cytoskeleton reorganization Source: RGD
  • actin filament capping Source: UniProtKB-KW
Complete GO annotation...

Keywords - Molecular functioni

Actin capping

Keywords - Ligandi

Actin-binding, Calcium, Calmodulin-binding, Metal-binding

Enzyme and pathway databases

ReactomeiR-RNO-264870. Caspase-mediated cleavage of cytoskeletal proteins.
R-RNO-373753. Nephrin interactions.
R-RNO-375165. NCAM signaling for neurite out-growth.
R-RNO-445095. Interaction between L1 and Ankyrins.
R-RNO-5673001. RAF/MAP kinase cascade.
R-RNO-6807878. COPI-mediated anterograde transport.

Names & Taxonomyi

Protein namesi
Recommended name:
Spectrin alpha chain, non-erythrocytic 1
Alternative name(s):
Alpha-II spectrin
Fodrin alpha chain
Gene namesi
Name:Sptan1
Synonyms:Spna2, Spta2
OrganismiRattus norvegicus (Rat)
Taxonomic identifieri10116 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeRattus
Proteomesi
  • UP000002494 Componenti: Chromosome 3

Organism-specific databases

RGDi621714. Sptan1.

Subcellular locationi

  • Cytoplasmcytoskeleton By similarity
  • Cytoplasmcell cortex 1 Publication

  • Note: Expressed along the cell membrane in podocytes and presumptive tubule cells during glomerulogenesis and is expressed along lateral cell margins in tubule cells.

GO - Cellular componenti

  • cytosol Source: Reactome
  • plasma membrane Source: RGD
  • protein complex Source: RGD
  • spectrin Source: RGD
Complete GO annotation...

Keywords - Cellular componenti

Cytoplasm, Cytoskeleton

Pathology & Biotechi

Mutagenesis

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Mutagenesisi985 – 9851E → A: Abolishes interaction with ACP1. 1 Publication
Mutagenesisi1002 – 10021K → E: No effect on interaction with ACP1. 1 Publication
Mutagenesisi1017 – 10171P → L: Abolishes interaction with ACP1. 1 Publication
Mutagenesisi1176 – 11761Y → E: Abolishes in vitro phosphorylation by Src and Lck. 1 Publication

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 24722472Spectrin alpha chain, non-erythrocytic 1PRO_0000073457Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei1 – 11N-acetylmethionineBy similarity
Modified residuei587 – 5871PhosphoserineBy similarity
Modified residuei637 – 6371N6-acetyllysineBy similarity
Modified residuei803 – 8031N6-acetyllysineBy similarity
Modified residuei924 – 9241PhosphoserineCombined sources
Modified residuei982 – 9821PhosphoserineBy similarity
Modified residuei999 – 9991PhosphoserineBy similarity
Modified residuei1029 – 10291PhosphoserineBy similarity
Modified residuei1031 – 10311PhosphoserineCombined sources
Modified residuei1041 – 10411PhosphoserineBy similarity
Modified residuei1176 – 11761Phosphotyrosine1 Publication
Modified residuei1190 – 11901PhosphoserineBy similarity
Modified residuei1207 – 12071PhosphoserineBy similarity
Modified residuei1217 – 12171PhosphoserineCombined sources
Modified residuei1291 – 12911PhosphoserineBy similarity
Modified residuei1306 – 13061PhosphoserineCombined sources
Modified residuei1323 – 13231PhosphoserineBy similarity
Modified residuei1338 – 13381PhosphoserineBy similarity
Modified residuei1519 – 15191N6-acetyllysineBy similarity
Modified residuei1550 – 15501PhosphoserineBy similarity
Modified residuei1557 – 15571PhosphoserineBy similarity
Modified residuei1578 – 15781PhosphoserineBy similarity
Modified residuei1615 – 16151PhosphoserineBy similarity
Modified residuei1647 – 16471PhosphoserineBy similarity
Modified residuei2020 – 20201PhosphothreonineBy similarity
Modified residuei2052 – 20521N6-acetyllysineBy similarity
Modified residuei2066 – 20661PhosphothreonineBy similarity
Modified residuei2421 – 24211N6-acetyllysineBy similarity

Post-translational modificationi

Phosphorylation of Tyr-1176 decreases sensitivity to cleavage by calpain in vitro.1 Publication

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sitei1176 – 11772Cleavage; by mu-calpainBy similarity

Keywords - PTMi

Acetylation, Phosphoprotein

Proteomic databases

PaxDbiP16086.
PRIDEiP16086.

PTM databases

iPTMnetiP16086.
PhosphoSiteiP16086.

Expressioni

Tissue specificityi

Expressed in the foot process layer of podocytes in the kidney glomerulus and in tubules (at protein level).1 Publication

Developmental stagei

Expressed throughout glomerulogenesis.1 Publication

Gene expression databases

ExpressionAtlasiP16086. baseline and differential.
GenevisibleiP16086. RN.

Interactioni

Subunit structurei

Like erythrocyte spectrin, the spectrin-like proteins are capable of forming dimers which can further associate to tetramers. Interacts (via C-terminal spectrin repeats) with TRPC4. Interacts with CALM and EMD. Interacts with isoform 1 of ACP1. Identified in a complex with ACTN4, CASK, IQGAP1, MAGI2, NPHS1 and SPTBN1. Interacts with SHANK3 (via ANK repeats).3 Publications

GO - Molecular functioni

  • protein complex binding Source: RGD
  • protein N-terminus binding Source: RGD
  • spectrin binding Source: MGI
  • syntaxin binding Source: RGD

Protein-protein interaction databases

BioGridi248986. 13 interactions.
IntActiP16086. 1 interaction.
MINTiMINT-4996484.
STRINGi10116.ENSRNOP00000042382.

Structurei

Secondary structure

1
2472
Legend: HelixTurnBeta strand
Show more details
Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Beta strandi971 – 9744Combined sources
Beta strandi993 – 9986Combined sources
Beta strandi1001 – 10099Combined sources
Beta strandi1012 – 10176Combined sources
Helixi1018 – 10203Combined sources
Beta strandi1021 – 10233Combined sources

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
3THKX-ray1.70A/B967-1035[»]
ProteinModelPortaliP16086.
SMRiP16086. Positions 2-147, 967-1025, 1445-1549, 1657-1974.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Repeati10 – 4233Spectrin 1Add
BLAST
Repeati44 – 147104Spectrin 2Add
BLAST
Repeati149 – 253105Spectrin 3Add
BLAST
Repeati255 – 359105Spectrin 4Add
BLAST
Repeati361 – 465105Spectrin 5Add
BLAST
Repeati467 – 571105Spectrin 6Add
BLAST
Repeati573 – 676104Spectrin 7Add
BLAST
Repeati678 – 782105Spectrin 8Add
BLAST
Repeati784 – 888105Spectrin 9Add
BLAST
Repeati890 – 95566Spectrin 10Add
BLAST
Domaini967 – 102660SH3PROSITE-ProRule annotationAdd
BLAST
Repeati1062 – 108928Spectrin 11Add
BLAST
Repeati1091 – 116171Spectrin 12Add
BLAST
Repeati1208 – 123124Spectrin 13Add
BLAST
Repeati1233 – 1337105Spectrin 14Add
BLAST
Repeati1339 – 1443105Spectrin 15Add
BLAST
Repeati1445 – 1549105Spectrin 16Add
BLAST
Repeati1551 – 1656106Spectrin 17Add
BLAST
Repeati1658 – 1762105Spectrin 18Add
BLAST
Repeati1764 – 1868105Spectrin 19Add
BLAST
Repeati1870 – 1974105Spectrin 20Add
BLAST
Repeati1976 – 2081106Spectrin 21Add
BLAST
Repeati2091 – 2195105Spectrin 22Add
BLAST
Repeati2205 – 2310106Spectrin 23Add
BLAST
Domaini2323 – 235836EF-hand 1PROSITE-ProRule annotationAdd
BLAST
Domaini2366 – 240136EF-hand 2PROSITE-ProRule annotationAdd
BLAST
Domaini2404 – 243936EF-hand 3PROSITE-ProRule annotationAdd
BLAST

Sequence similaritiesi

Belongs to the spectrin family.Curated
Contains 3 EF-hand domains.PROSITE-ProRule annotation
Contains 1 SH3 domain.PROSITE-ProRule annotation
Contains 23 spectrin repeats.Curated

Keywords - Domaini

Repeat, SH3 domain

Phylogenomic databases

eggNOGiKOG0035. Eukaryota.
COG5069. LUCA.
GeneTreeiENSGT00760000118813.
HOGENOMiHOG000246965.
HOVERGENiHBG059266.
InParanoidiP16086.
KOiK06114.
PhylomeDBiP16086.

Family and domain databases

Gene3Di1.10.238.10. 2 hits.
InterProiIPR011992. EF-hand-dom_pair.
IPR014837. EF-hand_Ca_insen.
IPR018247. EF_Hand_1_Ca_BS.
IPR002048. EF_hand_dom.
IPR001452. SH3_domain.
IPR018159. Spectrin/alpha-actinin.
IPR013315. Spectrin_alpha_SH3.
IPR002017. Spectrin_repeat.
[Graphical view]
PfamiPF13499. EF-hand_7. 1 hit.
PF08726. EFhand_Ca_insen. 1 hit.
PF00018. SH3_1. 1 hit.
PF00435. Spectrin. 20 hits.
[Graphical view]
PRINTSiPR00452. SH3DOMAIN.
PR01887. SPECTRNALPHA.
SMARTiSM00054. EFh. 2 hits.
SM00326. SH3. 1 hit.
SM00150. SPEC. 20 hits.
[Graphical view]
SUPFAMiSSF47473. SSF47473. 1 hit.
SSF50044. SSF50044. 1 hit.
PROSITEiPS00018. EF_HAND_1. 2 hits.
PS50222. EF_HAND_2. 3 hits.
PS50002. SH3. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

P16086-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MDPSGVKVLE TAEDIQERRQ QVLDRYHRFK ELSTLRRQKL EDSYRFQFFQ
60 70 80 90 100
RDAEELEKWI QEKLQVASDE NYKDPTNLQG KLQKHQAFEA EVQANSGAIV
110 120 130 140 150
KLDETGNLMI SEGHFASETI RTRLMELHRQ WELLLEKMRE KGIKLLQAQK
160 170 180 190 200
LVQYLRECED VMDWINDKEA IVTSEELGQD LEHVEVLQKK FEEFQTDLAA
210 220 230 240 250
HEERVNEVNQ FAAKLIQEQH PEEELIKTKQ EEVNAAWQRL KGLALQRQGK
260 270 280 290 300
LFGAAEVQRF NRDVDETIGW IKEKEQLMAS DDFGRDLASV QALLRKHEGL
310 320 330 340 350
ERDLAALEDK VKALCAEADR LQQSHPLSAN QIQVKREELI TNWEQIRTLA
360 370 380 390 400
AERHARLDDS YRLQRFLADF RDLTSWVTEM KALINADELA NDVAGAEALL
410 420 430 440 450
DRHQEHKGEI DAHEDSFKSA DESGQALLAA GHYASDEVRE KLSILSEERA
460 470 480 490 500
ALLELWELRR QQYEQCMDLQ LFYRDTEQVD NWMSKQEAFL LNEDLGDSLD
510 520 530 540 550
SVEALLKKHE DFEKSLSAQE EKITALDEFA TKLIQNNHYA MEDVATRRDA
560 570 580 590 600
LLSRRNALHE RAMHRRAQLA DSFHLQQFFR DSDELKSWVN EKMKTATDEA
610 620 630 640 650
YKDPSNLQGK VQKHQAFEAE LSANQSRIDA LEKAGQKLID VNHYAKEEVA
660 670 680 690 700
ARMNEVISLW KKLLEATELK GVKLREANQQ QQFNRNVEDI ELWLYEVEGH
710 720 730 740 750
LASDDYGKDL TNVQNLQKKH ALLEADVAAH QDRIDGITIQ ARQFQDAGHF
760 770 780 790 800
DAENIKKKQE ALVARYEALK EPMVARKQKL ADSLRLQQLF RDVEDEETWI
810 820 830 840 850
REKEPIAAST NRGKDLIGVQ NLLKKHQALQ AEIAGHEPRI KAVTQKGNAM
860 870 880 890 900
VEEGHFAAED VKAKLSELNQ KWEALKAKAS QRRQDLEDSL QAQQYFADAN
910 920 930 940 950
EAESWMREKE PIVGSTDYGK DEDSAEALLK KHEALMSDLS AYGSSIQALR
960 970 980 990 1000
EQAQSCRQQV APMDDETGKE LVLALYDYQE KSPREVTMKK GDILTLLNST
1010 1020 1030 1040 1050
NKDWWKVEVN DRQGFVPAAY VKKLDPAQSA SRENLLEEQG SIALRQGQID
1060 1070 1080 1090 1100
NQTRITKEAG SVSLRMKQVE ELYQSLLELG EKRKGMLEKS CKKFMLFREA
1110 1120 1130 1140 1150
NELQQWINEK EAALTSEEVG ADLEQVEVLQ KKFDDFQKDL KANESRLKDI
1160 1170 1180 1190 1200
NKVAEDLESE GLMAEEVQAV QQQEVYGMMP RDEADSKTAS PWKSARLMVH
1210 1220 1230 1240 1250
TVATFNSIKE LNERWRSLQQ LAEERSQLLG SAHEVQRFHR DADETKEWIE
1260 1270 1280 1290 1300
EKNQALNTDN YGHDLASVQA LQRKHEGFER DLAALGDKVN SLGETAQRLI
1310 1320 1330 1340 1350
QSHPESAEDL KEKCTELNQA WTSLGKRADQ RKAKLGDSHD LQRFLSDFRD
1360 1370 1380 1390 1400
LMSWINGIRG LVSSDELAKD VTGAEALLER HQEHRTEIDA RAGTFQAFEQ
1410 1420 1430 1440 1450
FGQQLLAHGH YASPEIKEKL DILDQERTDL EKAWVQRRMM LDHCLELQLF
1460 1470 1480 1490 1500
HRDCEQAENW MAAREAFLNT EDKGDSLDSV EALIKKHEDF DKAINVQEEK
1510 1520 1530 1540 1550
IAALQAFADQ LIAVDHYAKG DIANRRNEVL DRWRRLKAQM IEKRSKLGES
1560 1570 1580 1590 1600
QTLQQFSRDV DEIEAWISEK LQTASDESYK DPTNIQSKHQ KHQAFEAELH
1610 1620 1630 1640 1650
ANADRIRGVI DMGNSLIERG ACAGSEDAVK ARLAALADQW QFLVQKSAEK
1660 1670 1680 1690 1700
SQKLKEANKQ QNFNTGIKDF DFWLSEVEAL LASEDYGKDL ASVNNLLKKH
1710 1720 1730 1740 1750
QLLEADISAH EDRLKDLNSQ ADSLMTSSAF DTSQVKEKRD TINGRFQKIK
1760 1770 1780 1790 1800
SMATSRRAKL SESHRLHQFF RDMDDEESWI KEKKLLVSSE DYGRDLTGVQ
1810 1820 1830 1840 1850
NLRKKHKRLE AELAAHEPAI QGVLDTGKKL SDDNTIGQEE IQQRLAQFVE
1860 1870 1880 1890 1900
HWKELKQLAA ARGQRLEESL EYQQFVANVE EEEAWINEKM TLVASEDYGD
1910 1920 1930 1940 1950
TLAAIQGLLK KHEAFETDFT VHKDRVNDVC TNGQDLIKKN NHHEENISSK
1960 1970 1980 1990 2000
MKGLNGKVSD LEKAAAQRKA KLDENSAFLQ FNWKADVVES WIGEKENSLK
2010 2020 2030 2040 2050
TDDYGRDLSS VQTLLTKQET FDAGLQAFQQ EGIANITALK DQLLAAKHIQ
2060 2070 2080 2090 2100
SKAIEARHAS LMKRWTQLLA NSATRKKKLL EAQSHFRKVE DLFLTFAKKA
2110 2120 2130 2140 2150
SAFNSWFENA EEDLTDPVRC NSLEEIKALR EAHDAFRSSL SSAQADFNQL
2160 2170 2180 2190 2200
AELDRQIKSF RVASNPYTWF TMEALEETWR NLQKIIKERE LELQKEQRRQ
2210 2220 2230 2240 2250
EENDKLRQEF AQHANAFHQW IQETRTYLLD GSCMVEESGT LESQLEATKR
2260 2270 2280 2290 2300
KHQEIRAMRS QLKKIEDLGA AMEEALILDN KYTEHSTVGL AQQWDQLDQL
2310 2320 2330 2340 2350
GMRMQHNLEQ QIQARNTTGV TEEALKEFSM MFKHFDKDKS GRLNHQEFKS
2360 2370 2380 2390 2400
CLRSLGYDLP MVEEGEPDPE FEAILDTVDP NRDGHVSLQE YMAFMISRET
2410 2420 2430 2440 2450
ENVKSSEEIE SAFRALSSEG KPYVTKEELY QNLTREQADY CVSHMKPYVD
2460 2470
GKGRELPTAF DYVEFTRSLF VN
Length:2,472
Mass (Da):284,637
Last modified:November 16, 2001 - v2
Checksum:i08DDF01A2871278A
GO

Sequence cautioni

The sequence AAA40770 differs from that shown. Reason: Frameshift at position 2310. Curated

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti1329 – 13291D → Y in AAA40770 (PubMed:2753883).Curated
Sequence conflicti1514 – 15141V → L in CAA62350 (Ref. 1) Curated
Sequence conflicti1702 – 17021L → A in AAA40770 (PubMed:2753883).Curated
Sequence conflicti1971 – 19722KL → NV in CAA62350 (Ref. 1) Curated
Sequence conflicti2205 – 22062KL → NV in CAA62350 (Ref. 1) Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
X90845 mRNA. Translation: CAA62350.1.
AF084186 mRNA. Translation: AAC33127.1.
J04828 mRNA. Translation: AAA40770.1. Frameshift.
PIRiA32612.
S61217.
RefSeqiNP_741984.2. NM_171983.2.
UniGeneiRn.5812.

Genome annotation databases

EnsembliENSRNOT00000045827; ENSRNOP00000042382; ENSRNOG00000015396.
GeneIDi64159.
KEGGirno:64159.
UCSCiRGD:621714. rat.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
X90845 mRNA. Translation: CAA62350.1.
AF084186 mRNA. Translation: AAC33127.1.
J04828 mRNA. Translation: AAA40770.1. Frameshift.
PIRiA32612.
S61217.
RefSeqiNP_741984.2. NM_171983.2.
UniGeneiRn.5812.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
3THKX-ray1.70A/B967-1035[»]
ProteinModelPortaliP16086.
SMRiP16086. Positions 2-147, 967-1025, 1445-1549, 1657-1974.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi248986. 13 interactions.
IntActiP16086. 1 interaction.
MINTiMINT-4996484.
STRINGi10116.ENSRNOP00000042382.

PTM databases

iPTMnetiP16086.
PhosphoSiteiP16086.

Proteomic databases

PaxDbiP16086.
PRIDEiP16086.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSRNOT00000045827; ENSRNOP00000042382; ENSRNOG00000015396.
GeneIDi64159.
KEGGirno:64159.
UCSCiRGD:621714. rat.

Organism-specific databases

CTDi6709.
RGDi621714. Sptan1.

Phylogenomic databases

eggNOGiKOG0035. Eukaryota.
COG5069. LUCA.
GeneTreeiENSGT00760000118813.
HOGENOMiHOG000246965.
HOVERGENiHBG059266.
InParanoidiP16086.
KOiK06114.
PhylomeDBiP16086.

Enzyme and pathway databases

ReactomeiR-RNO-264870. Caspase-mediated cleavage of cytoskeletal proteins.
R-RNO-373753. Nephrin interactions.
R-RNO-375165. NCAM signaling for neurite out-growth.
R-RNO-445095. Interaction between L1 and Ankyrins.
R-RNO-5673001. RAF/MAP kinase cascade.
R-RNO-6807878. COPI-mediated anterograde transport.

Miscellaneous databases

PROiP16086.

Gene expression databases

ExpressionAtlasiP16086. baseline and differential.
GenevisibleiP16086. RN.

Family and domain databases

Gene3Di1.10.238.10. 2 hits.
InterProiIPR011992. EF-hand-dom_pair.
IPR014837. EF-hand_Ca_insen.
IPR018247. EF_Hand_1_Ca_BS.
IPR002048. EF_hand_dom.
IPR001452. SH3_domain.
IPR018159. Spectrin/alpha-actinin.
IPR013315. Spectrin_alpha_SH3.
IPR002017. Spectrin_repeat.
[Graphical view]
PfamiPF13499. EF-hand_7. 1 hit.
PF08726. EFhand_Ca_insen. 1 hit.
PF00018. SH3_1. 1 hit.
PF00435. Spectrin. 20 hits.
[Graphical view]
PRINTSiPR00452. SH3DOMAIN.
PR01887. SPECTRNALPHA.
SMARTiSM00054. EFh. 2 hits.
SM00326. SH3. 1 hit.
SM00150. SPEC. 20 hits.
[Graphical view]
SUPFAMiSSF47473. SSF47473. 1 hit.
SSF50044. SSF50044. 1 hit.
PROSITEiPS00018. EF_HAND_1. 2 hits.
PS50222. EF_HAND_2. 3 hits.
PS50002. SH3. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiSPTN1_RAT
AccessioniPrimary (citable) accession number: P16086
Secondary accession number(s): O88663, P70477
Entry historyi
Integrated into UniProtKB/Swiss-Prot: April 1, 1990
Last sequence update: November 16, 2001
Last modified: June 8, 2016
This is version 161 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.