Protein

Complement C4-B

Gene

C4b

Organism
Mus musculus (Mouse)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Non-enzymatic component of C3 and C5 convertases and thus essential for the propagation of the classical complement pathway. Covalently binds to immunoglobulins and immune complexes (IC), enhancing the solubilization of immune aggregates and the clearance of IC through CR1 on erythrocytes. Catalyzes the transacylation of the thioester carbonyl group to form ester bonds with carbohydrate antigens (By similarity).

GO - Biological process

  • complement activation Source: MGI
  • complement activation, classical pathway Source: UniProtKB-KW
  • immunoglobulin mediated immune response Source: MGI
  • inflammatory response Source: UniProtKB-KW
  • innate immune response Source: UniProtKB-KW

Keywords - Biological process

Complement pathway, Immunity, Inflammatory response, Innate immunity

Enzyme and pathway databases

Reactome: R-MMU-166663. Initial triggering of complement.
Reactome: R-MMU-174577. Activation of C3 and C5.
Reactome: R-MMU-977606. Regulation of Complement cascade.

Protein family/group databases

MEROPS: I39.951.

Names & Taxonomy

Protein names
Recommended name: Complement C4-B
Cleaved into the following 4 chains: Complement C4 beta chain, Complement C4 alpha chain, C4a anaphylatoxin, Complement C4 gamma chain
Gene names
Name: C4b
Synonyms: C4
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 17

Organism-specific databases

MGI: MGI:88228. C4b.

Subcellular location

  • Secreted (By similarity)
  • Cell junction, synapse (By similarity)
  • Cell projection, axon (By similarity)
  • Cell projection, dendrite (By similarity)

Keywords - Cellular component

Cell junction, Cell projection, Secreted, Synapse

PTM / Processing

Molecule processing

Feature key      Identifier       Position(s)    Description                 Length
Signal peptide   -                1 – 19         -                           19
Chain            PRO_0000005973   20 – 673       Complement C4 beta chain    654
Propeptide       PRO_0000005974   674 – 677      -                           4
Chain            PRO_0000005975   678 – 1443     Complement C4 alpha chain   766
Chain            PRO_0000005976   678 – 753      C4a anaphylatoxin           76
Propeptide       PRO_0000005977   1444 – 1447    -                           4
Chain            PRO_0000005978   1448 – 1738    Complement C4 gamma chain   291
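
The Length column follows from the 1-based, inclusive coordinates: length = end - start + 1. The sketch below recomputes the lengths from the positions listed above and checks that the processed segments tile the 1,738-residue precursor without gaps. The names FEATURES, feature_length and tiles_precursor are illustrative and are not part of any UniProt tool.

```python
# Features from the molecule-processing table: (name, start, end), 1-based inclusive.
FEATURES = [
    ("Signal peptide", 1, 19),
    ("Complement C4 beta chain", 20, 673),
    ("Propeptide", 674, 677),
    ("Complement C4 alpha chain", 678, 1443),
    ("C4a anaphylatoxin", 678, 753),
    ("Propeptide", 1444, 1447),
    ("Complement C4 gamma chain", 1448, 1738),
]


def feature_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def tiles_precursor(features, total_length):
    """True if the features cover 1..total_length with no gaps or overlaps."""
    expected_start = 1
    for _, start, end in sorted(features, key=lambda f: f[1]):
        if start != expected_start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


LENGTHS = [feature_length(start, end) for _, start, end in FEATURES]

# C4a anaphylatoxin is a fragment of the alpha chain, so it is left out of
# the tiling check; including it would register as an overlap.
NON_NESTED = [f for f in FEATURES if f[0] != "C4a anaphylatoxin"]
PRECURSOR_IS_TILED = tiles_precursor(NON_NESTED, 1738)
```

Keeping the nested C4a fragment out of the tiling check is a deliberate choice: it shares its start with the alpha chain, so it describes a later cleavage product and not a separate stretch of the precursor.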

Amino acid modifications

Feature key        Position(s)    Description                                 Evidence            Length
Glycosylation      224            N-linked (GlcNAc...)                        Sequence analysis   1
Disulfide bond     700 ↔ 726      -                                           By similarity       -
Disulfide bond     701 ↔ 733      -                                           By similarity       -
Disulfide bond     714 ↔ 734      -                                           By similarity       -
Glycosylation      743            N-linked (GlcNAc...)                        -                   1
Cross-link         1006 ↔ 1009    Isoglutamyl cysteine thioester (Cys-Gln)    By similarity       -
Glycosylation      1324           N-linked (GlcNAc...)                        1 Publication       1
Glycosylation      1387           N-linked (GlcNAc...)                        Sequence analysis   1
Modified residue   1413           Sulfotyrosine                               By similarity       1
Modified residue   1416           Sulfotyrosine                               By similarity       1
Modified residue   1417           Sulfotyrosine                               By similarity       1
Disulfide bond     1589 ↔ 1667    -                                           By similarity       -
Disulfide bond     1612 ↔ 1736    -                                           By similarity       -
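
Each annotated position implies a particular residue: Asn for N-linked glycosylation, Cys and Gln for the thioester cross-link, Tyr for sulfotyrosine. The sketch below spot-checks a few of the positions above against 10-residue windows copied from the displayed sequence in this entry. The helper names (WINDOWS, residue_at, is_n_glyc_sequon) are hypothetical, and the N-X-[ST] sequon rule is the general consensus motif for N-glycosylation, not something stated in this entry.

```python
# 10-residue windows copied from the displayed sequence, keyed by the
# 1-based position of their first residue.
WINDOWS = {
    221: "LESNRSTHFE",
    1001: "RLPQGCAEQT",
    1411: "EDYEDYYDMP",
}


def residue_at(position):
    """Return the residue at a 1-based position covered by one of the windows."""
    for start, segment in WINDOWS.items():
        offset = position - start  # 0-based index into this window
        if 0 <= offset < len(segment):
            return segment[offset]
    raise KeyError(f"position {position} is not covered by any window")


def is_n_glyc_sequon(position):
    """Check the N-X-[ST] motif (X is not proline) starting at a 1-based position."""
    n, x, st = (residue_at(position + i) for i in range(3))
    return n == "N" and x != "P" and st in ("S", "T")


# Annotated position -> residue implied by the annotation.
EXPECTED = {224: "N", 1006: "C", 1009: "Q", 1413: "Y", 1416: "Y", 1417: "Y"}
MISMATCHES = {
    pos: residue_at(pos) for pos, aa in EXPECTED.items() if residue_at(pos) != aa
}
```

An empty MISMATCHES dictionary means the sampled annotations agree with the displayed sequence; it says nothing about positions outside the three windows.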

Post-translational modification

Prior to secretion, the single-chain precursor is enzymatically cleaved to yield non-identical chains alpha, beta and gamma. During activation, the alpha chain is cleaved by C1 into C4a and C4b, and C4b stays linked to the beta and gamma chains. Further degradation of C4b by C1 into the inactive fragments C4c and C4d blocks the generation of C3 convertase.

Keywords - PTM

Cleavage on pair of basic residues, Disulfide bond, Glycoprotein, Sulfation, Thioester bond
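
The keyword "Cleavage on pair of basic residues" can be illustrated with the two propeptides from the molecule-processing table (674 – 677 and 1444 – 1447). The sketch below slices them out of two 10-residue windows copied from the displayed sequence and checks that each ends in a Lys/Arg pair. The function names and the "last two residues" test are assumptions made for illustration; the entry does not name the processing protease or define the motif more precisely.

```python
# Windows copied from the displayed sequence: (1-based start, residues).
BETA_ALPHA_JUNCTION = (671, "KKSRQKRNVN")
ALPHA_GAMMA_JUNCTION = (1441, "RRSRRRREAP")


def excise(window, start, end):
    """Return residues start..end (1-based, inclusive) from a (window_start, text) pair."""
    window_start, text = window
    return text[start - window_start : end - window_start + 1]


def ends_in_basic_pair(peptide):
    """True if the last two residues are both Lys (K) or Arg (R)."""
    return len(peptide) >= 2 and all(aa in "KR" for aa in peptide[-2:])


PROPEPTIDE_1 = excise(BETA_ALPHA_JUNCTION, 674, 677)
PROPEPTIDE_2 = excise(ALPHA_GAMMA_JUNCTION, 1444, 1447)
```

Under these assumptions both excised linkers are runs of basic residues, which is consistent with the keyword.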

Proteomic databases

MaxQB: P01029.
PaxDb: P01029.
PRIDE: P01029.

PTM databases

iPTMnet: P01029.
PhosphoSitePlus: P01029.
SwissPalm: P01029.

Expression

Gene expression databases

Bgee: ENSMUSG00000073418.
CleanEx: MM_C4B.
ExpressionAtlas: P01029. baseline and differential.
Genevisible: P01029. MM.

Interaction

Subunit structure

Circulates in blood as a disulfide-linked trimer of an alpha, beta and gamma chain.

Protein-protein interaction databases

IntAct: P01029. 2 interactors.
MINT: MINT-1857263.
STRING: 10090.ENSMUSP00000069418.

Structure

3D structure databases

ProteinModelPortal: P01029.
SMR: P01029.

Family & Domains

Domains and Repeats

Feature key   Position(s)    Description          Evidence                     Length
Domain        700 – 734      Anaphylatoxin-like   PROSITE-ProRule annotation   35
Domain        1589 – 1736    NTR                  PROSITE-ProRule annotation   148

Sequence similarities

Contains 1 anaphylatoxin-like domain (PROSITE-ProRule annotation).
Contains 1 NTR domain (PROSITE-ProRule annotation).

Keywords - Domain

Signal

Phylogenomic databases

eggNOG: KOG1366. Eukaryota.
eggNOG: ENOG410XRED. LUCA.
GeneTree: ENSGT00760000118982.
HOVERGEN: HBG107123.
InParanoid: P01029.
KO: K03989.
OMA: FIQEYST.
OrthoDB: EOG091G00FL.
TreeFam: TF313285.

Family and domain databases

Gene3D: 1.20.91.20. 1 hit.
Gene3D: 1.50.10.20. 1 hit.
Gene3D: 2.60.40.690. 1 hit.
InterPro: IPR009048. A-macroglobulin_rcpt-bd.
InterPro: IPR011626. A2M_comp.
InterPro: IPR002890. A2M_N.
InterPro: IPR011625. A2M_N_2.
InterPro: IPR000020. Anaphylatoxin/fibulin.
InterPro: IPR018081. Anaphylatoxin_comp_syst.
InterPro: IPR001840. Anaphylatoxn_comp_syst_dom.
InterPro: IPR001599. Macroglobln_a2.
InterPro: IPR019742. MacrogloblnA2_CS.
InterPro: IPR019565. MacrogloblnA2_thiol-ester-bond.
InterPro: IPR001134. Netrin_domain.
InterPro: IPR018933. Netrin_module_non-TIMP.
InterPro: IPR008930. Terpenoid_cyclase/PrenylTrfase.
InterPro: IPR008993. TIMP-like_OB-fold.
Pfam: PF00207. A2M. 1 hit.
Pfam: PF07678. A2M_comp. 1 hit.
Pfam: PF01835. A2M_N. 1 hit.
Pfam: PF07703. A2M_N_2. 1 hit.
Pfam: PF07677. A2M_recep. 1 hit.
Pfam: PF01821. ANATO. 1 hit.
Pfam: PF01759. NTR. 1 hit.
Pfam: PF10569. Thiol-ester_cl. 1 hit.
PRINTS: PR00004. ANAPHYLATOXN.
SMART: SM01360. A2M. 1 hit.
SMART: SM01359. A2M_N_2. 1 hit.
SMART: SM01361. A2M_recep. 1 hit.
SMART: SM00104. ANATO. 1 hit.
SMART: SM00643. C345C. 1 hit.
SUPFAM: SSF47686. SSF47686. 1 hit.
SUPFAM: SSF48239. SSF48239. 1 hit.
SUPFAM: SSF49410. SSF49410. 1 hit.
SUPFAM: SSF50242. SSF50242. 1 hit.
PROSITE: PS00477. ALPHA_2_MACROGLOBULIN. 1 hit.
PROSITE: PS01177. ANAPHYLATOXIN_1. 1 hit.
PROSITE: PS01178. ANAPHYLATOXIN_2. 1 hit.
PROSITE: PS50189. NTR. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P01029-1

        10         20         30         40         50
MRLLWGLAWV FSFCASSLQK PRLLLFSPSV VNLGTPLSVG VQLLDAPPGQ
60 70 80 90 100
EVKGSVFLRN PKGGSCSPKK DFKLSSGDDF VLLSLEVPLE DVRSCGLFDL
110 120 130 140 150
RRAPHIQLVA QSPWLRNTAF KATETQGVNL LFSSRRGHIF VQTDQPIYNP
160 170 180 190 200
GQRVRYRVFA LDQKMRPSTD FLTITVENSH GLRVLKKEIF TSTSIFQDAF
210 220 230 240 250
TIPDISEPGT WKISARFSDG LESNRSTHFE VKKYVLPNFE VKITPWKPYI
260 270 280 290 300
LMVPSNSDEI QLDIQARYIY GKPVQGVAYT RFALMDEQGK RTFLRGLETQ
310 320 330 340 350
AKLVEGRTHI SISKDQFQAA LDKINIGVRD LEGLRLYAAT AVIESPGGEM
360 370 380 390 400
EEAELTSWRF VSSAFSLDLS RTKRHLVPGA HFLLQALVQE MSGSEASNVP
410 420 430 440 450
VKVSATLVSG SDSQVLDIQQ STNGIGQVSI SFPIPPTVTE LRLLVSAGSL
460 470 480 490 500
YPAIARLTVQ APPSRGTGFL SIEPLDPRSP SVGDTFILNL QPVGIPAPTF
510 520 530 540 550
SHYYYMIISR GQIMAMGREP RKTVTSVSVL VDHQLAPSFY FVAYFYHQGH
560 570 580 590 600
PVANSLLINI QSRDCEGKLQ LKVDGAKEYR NADMMKLRIQ TDSKALVALG
610 620 630 640 650
AVDMALYAVG GRSHKPLDMS KVFEVINSYN VGCGPGGGDD ALQVFQDAGL
660 670 680 690 700
AFSDGDRLTQ TREDLSCPKE KKSRQKRNVN FQKAVSEKLG QYSSPDAKRC
710 720 730 740 750
CQDGMTKLPM KRTCEQRAAR VPQQACREPF LSCCKFAEDL RRNQTRSQAH
760 770 780 790 800
LARNNHNMLQ EEDLIDEDDI LVRTSFPENW LWRVEPVDSS KLLTVWLPDS
810 820 830 840 850
MTTWEIHGVS LSKSKGLCVA KPTRVRVFRK FHLHLRLPIS IRRFEQFELR
860 870 880 890 900
PVLYNYLNDD VAVSVHVTPV EGLCLAGGGM MAQQVTVPAG SARPVAFSVV
910 920 930 940 950
PTAAANVPLK VVARGVFDLG DAVSKILQIE KEGAIHREEL VYNLDPLNNL
960 970 980 990 1000
GRTLEIPGSS DPNIVPDGDF SSLVRVTASE PLETMGSEGA LSPGGVASLL
1010 1020 1030 1040 1050
RLPQGCAEQT MIYLAPTLTA SNYLDRTEQW SKLSPETKDH AVDLIQKGYM
1060 1070 1080 1090 1100
RIQQFRKNDG SFGAWLHRDS STWLTAFVLK ILSLAQEQVG NSPEKLQETA
1110 1120 1130 1140 1150
SWLLAQQLGD GSFHDPCPVI HRAMQGGLVG SDETVALTAF VVIALHHGLD
1160 1170 1180 1190 1200
VFQDDDAKQL KNRVEASITK ANSFLGQKAS AGLLGAHAAA ITAYALTLTK
1210 1220 1230 1240 1250
ASEDLRNVAH NSLMAMAEET GEHLYWGLVL GSQDKVVLRP TAPRSPTEPV
1260 1270 1280 1290 1300
PQAPALWIET TAYALLHLLL REGKGKMADK AASWLTHQGS FHGAFRSTQD
1310 1320 1330 1340 1350
TVVTLDALSA YWIASHTTEE KALNVTLSSM GRNGLKTHGL HLNNHQVKGL
1360 1370 1380 1390 1400
EEELKFSLGS TISVKVEGNS KGTLKILRTY NVLDMKNTTC QDLQIEVKVT
1410 1420 1430 1440 1450
GAVEYAWDAN EDYEDYYDMP AADDPSVPLQ PVTPLQLFEG RRSRRRREAP
1460 1470 1480 1490 1500
KVVEEQESRV QYTVCIWRNG KLGLSGMAIA DITLLSGFHA LRADLEKLTS
1510 1520 1530 1540 1550
LSDRYVSHFE TDGPHVLLYF DSVPTTRECV GFGASQEVVV GLVQPSSAVL
1560 1570 1580 1590 1600
YDYYSPDHKC SVFYAAPTKS QLLATLCSGD VCQCAEGKCP RLLRSLERRV
1610 1620 1630 1640 1650
EDKDGYRMRF ACYYPRVEYG FTVKVLREDG RAAFRLFESK ITQVLHFRKD
1660 1670 1680 1690 1700
TMASIGQTRN FLSRASCRLR LEPNKEYLIM GMDGETSDNK GDPQYLLDSN
1710 1720 1730
TWIEEMPSEQ MCKSTRHRAA CFQLKDFLME FSSRGCQV
Length: 1,738
Mass (Da): 192,915
Last modified: July 27, 2011 - v3
Checksum: FCC7580209029E88
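
The displayed sequence interleaves position rulers with rows of five 10-residue groups, so it has to be flattened before the stated length can be checked or the sequence reused. The sketch below shows one way to do that on the first two rows of the block above. The function name parse_sequence_block is illustrative, and the rule "a line made only of letters is a residue line" is an assumption about this display format.

```python
def parse_sequence_block(block):
    """Join residue lines of a display block, dropping position rulers."""
    residues = []
    for line in block.splitlines():
        compact = "".join(line.split())  # remove spaces between 10-residue groups
        if compact.isalpha():  # ruler lines hold digits only; blank lines are skipped
            residues.append(compact.upper())
    return "".join(residues)


# First two rows of the displayed sequence (residues 1-100).
SAMPLE = """
        10         20         30         40         50
MRLLWGLAWV FSFCASSLQK PRLLLFSPSV VNLGTPLSVG VQLLDAPPGQ
60 70 80 90 100
EVKGSVFLRN PKGGSCSPKK DFKLSSGDDF VLLSLEVPLE DVRSCGLFDL
"""

SEQUENCE_START = parse_sequence_block(SAMPLE)
SIGNAL_PEPTIDE = SEQUENCE_START[:19]  # positions 1-19 in 1-based coordinates
```

Applied to the whole block, the same function should return a string whose length can be compared with the stated 1,738 residues.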

Experimental Info

Feature key          Position(s)   Description                             Evidence   Length
Sequence conflict    132           F → Y in AAA39557 (PubMed:2993295).     Curated    1
Sequence conflict    177           E → G in BAE34429 (PubMed:16141072).    Curated    1
Sequence conflict    283           A → V in BAE34429 (PubMed:16141072).    Curated    1
Sequence conflict    327           G → E in AAA39557 (PubMed:2993295).     Curated    1
Sequence conflict    440           E → K in AAH67394 (PubMed:15489334).    Curated    1
Sequence conflict    440           E → K in AAH67409 (PubMed:15489334).    Curated    1
Sequence conflict    570           Q → E in AAA39557 (PubMed:2993295).     Curated    1
Sequence conflict    570           Q → E in BAE34280 (PubMed:16141072).    Curated    1
Sequence conflict    604           M → T in AAA39557 (PubMed:2993295).     Curated    1
Sequence conflict    604           M → T in AAA39506 (PubMed:3862104).     Curated    1
Sequence conflict    604           M → T in AAA39561 (PubMed:2777798).     Curated    1
Sequence conflict    604           M → T in BAE34280 (PubMed:16141072).    Curated    1
Sequence conflict    604           M → T in BAE34429 (PubMed:16141072).    Curated    1
Sequence conflict    604           M → T in AAH67394 (PubMed:15489334).    Curated    1
Sequence conflict    604           M → T in AAH67409 (PubMed:15489334).    Curated    1
Sequence conflict    639           D → G in BAE34429 (PubMed:16141072).    Curated    1
Sequence conflict    758           M → I in BAE34280 (PubMed:16141072).    Curated    1
Sequence conflict    838           P → R in AAA39557 (PubMed:2993295).     Curated    1
Sequence conflict    916           V → I in BAE34280 (PubMed:16141072).    Curated    1
Sequence conflict    1077          F → S in AAH67394 (PubMed:15489334).    Curated    1
Sequence conflict    1077          F → S in AAH67409 (PubMed:15489334).    Curated    1
Sequence conflict    1119          V → A in AAC42021 (PubMed:3856857).     Curated    1
Sequence conflict    1190          A → T in AAC42021 (PubMed:3856857).     Curated    1
Sequence conflict    1206          R → Q in BAE34280 (PubMed:16141072).    Curated    1
Sequence conflict    1206          R → Q in BAE34429 (PubMed:16141072).    Curated    1
Sequence conflict    1206          R → Q in AAH67394 (PubMed:15489334).    Curated    1
Sequence conflict    1206          R → Q in AAH67409 (PubMed:15489334).    Curated    1
Sequence conflict    1206          R → Q in AAA40487 (PubMed:2459207).     Curated    1
Sequence conflict    1290          S → N in AAC42022 (PubMed:6149581).     Curated    1
Sequence conflict    1324          N → K in AAA39506 (PubMed:3862104).     Curated    1
Sequence conflict    1324          N → K in AAA39561 (PubMed:2777798).     Curated    1
Sequence conflict    1324          N → K in AAC42021 (PubMed:3856857).     Curated    1
Sequence conflict    1365          K → E in BAE34429 (PubMed:16141072).    Curated    1
Sequence conflict    1401          G → S in AAA39554 (PubMed:6192448).     Curated    1
Sequence conflict    1442          R → K in AAA39557 (PubMed:2993295).     Curated    1
Sequence conflict    1453          V → A in AAA39506 (PubMed:3862104).     Curated    1
Sequence conflict    1453          V → A in AAA39561 (PubMed:2777798).     Curated    1
Sequence conflict    1453          V → A in BAE34429 (PubMed:16141072).    Curated    1
Sequence conflict    1453          V → A in AAA39554 (PubMed:6192448).     Curated    1
Sequence conflict    1456          Q → R in BAE34429 (PubMed:16141072).    Curated    1
Sequence conflict    1586          E → Q in CAA28936 (PubMed:3008092).     Curated    1
Sequence conflict    1611          A → T in AAC05279 (PubMed:14656967).    Curated    1
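
Each conflict row reads "reference residue → alternative residue" at a 1-based position of the displayed sequence. The sketch below applies one such change to a short window and refuses to proceed if the reference residue does not match, which is a cheap guard against off-by-one errors. The function name apply_substitution and the window-based approach are illustrative assumptions, not a UniProt procedure.

```python
def apply_substitution(window_start, text, position, ref, alt):
    """Apply a 'ref -> alt' change at a 1-based position inside a sequence window."""
    index = position - window_start  # 0-based index into the window
    if not 0 <= index < len(text):
        raise ValueError(f"position {position} lies outside the window")
    if text[index] != ref:
        raise ValueError(f"expected {ref} at {position}, found {text[index]}")
    return text[:index] + alt + text[index + 1:]


# Residues 601-610 of the displayed sequence.
WINDOW_START, WINDOW = 601, "AVDMALYAVG"

# Conflict at position 604 (M -> T), reported for seven of the listed translations.
VARIANT_WINDOW = apply_substitution(WINDOW_START, WINDOW, 604, "M", "T")
```

Checking the reference residue before substituting is the design choice that matters here: a mismatch signals that the coordinates and the sequence version do not belong together.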

Sequence databases

EMBL / GenBank / DDBJ:
  • M11789 Genomic DNA. Translation: AAA39557.1.
  • M11729 mRNA. Translation: AAA39506.1.
  • M17440 Genomic DNA. Translation: AAA39561.1.
  • AK157954 mRNA. Translation: BAE34280.1.
  • AK158256 mRNA. Translation: BAE34429.1.
  • AF049850 Genomic DNA. Translation: AAC05279.1.
  • CT573030 Genomic DNA. No translation available.
  • BC067394 mRNA. Translation: AAH67394.1.
  • BC067409 mRNA. Translation: AAH67409.1.
  • M12968 Genomic DNA. Translation: AAA39558.1.
  • M12969 Genomic DNA. Translation: AAA39559.1.
  • M14225 Genomic DNA. Translation: AAA39563.1.
  • X05314 mRNA. Translation: CAA28936.1.
  • M12970 mRNA. Translation: AAA39555.1.
  • M12972 mRNA. Translation: AAA39556.1.
  • M23186 Genomic DNA. Translation: AAA40487.1.
  • X55493 Genomic DNA. Translation: CAA39112.1.
  • X55495 Genomic DNA. Translation: CAA39114.1.
  • K02798 mRNA. Translation: AAC42021.1.
  • K02799 mRNA. Translation: AAC42022.1.
  • K00019 mRNA. Translation: AAA39554.1.
CCDS: CCDS28657.1.
PIR: A24558.
PIR: A29176.
RefSeq: NP_033910.2. NM_009780.2.
UniGene: Mm.439678.
UniGene: Mm.477109.

Genome annotation databases

Ensembl: ENSMUST00000069507; ENSMUSP00000069418; ENSMUSG00000073418.
GeneID: 12268.
KEGG: mmu:12268.
UCSC: uc008cdk.2. mouse.

Cross-references

Organism-specific databases

CTD: 721.

Miscellaneous databases

ChiTaRS: C4b. mouse.
PRO: P01029.

Entry information

Entry name: CO4B_MOUSE
Accession: Primary (citable) accession number: P01029
Secondary accession number(s): E9QKK7, O70346, Q31201, Q3TYY1, Q3TZC9, Q61372, Q61859, Q62353, Q6NWV8
Entry history
Integrated into UniProtKB/Swiss-Prot: July 21, 1986
Last sequence update: July 27, 2011
Last modified: November 2, 2016
This is version 164 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
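
When the primary and secondary accession numbers above are copied into scripts or spreadsheets, a format check helps catch truncated or mistyped identifiers. The sketch below uses the accession pattern given in the UniProt help pages. The names ACCESSION_RE and is_valid_accession are illustrative, and a match only confirms the format, not that the accession exists.

```python
import re

# Accession number pattern as published in the UniProt help pages.
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_valid_accession(text):
    """True if the whole string has the shape of a UniProtKB accession number."""
    return ACCESSION_RE.fullmatch(text) is not None


PRIMARY = "P01029"
SECONDARY = [
    "E9QKK7", "O70346", "Q31201", "Q3TYY1", "Q3TZC9",
    "Q61372", "Q61859", "Q62353", "Q6NWV8",
]
ALL_VALID = all(is_valid_accession(acc) for acc in [PRIMARY, *SECONDARY])
```

Note that the entry name CO4B_MOUSE is a different kind of identifier and does not match this pattern.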

Miscellaneous

C4 is a major histocompatibility complex class-III protein.

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
  • 100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
  • 90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence in the cluster (the seed sequence).
  • 50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.