P0C0L5 (CO4B_HUMAN) Reviewed, UniProtKB/Swiss-Prot
Last modified
January 25, 2012.
Version 75.
History...
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Web links·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Web links·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Complement C4-B Alternative name(s): Basic complement C4 C3 and PZP-like alpha-2-macroglobulin domain-containing protein 3 Cleaved into the following 6 chains: | ||||
| Gene names |
| ||||
| Organism | Homo sapiens (Human) | ||||
| Taxonomic identifier | 9606 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo |
Protein attributes
| Sequence length | 1744 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Function | C4 plays a central role in the activation of the classical pathway of the complement system. It is processed by activated C1 which removes from the alpha chain the C4a anaphylatoxin. The remaining alpha chain fragment C4b is the major activation product and is an essential subunit of the C3 convertase (C4b2a) and the C5 convertase (C3bC4b2a) enzymes of the classical complement pathway. Derived from proteolytic degradation of complement C4, C4a anaphylatoxin is a mediator of local inflammatory process. It induces the contraction of smooth muscle, increases vascular permeability and causes histamine release from mast cells and basophilic leukocytes. |
| Subunit structure | Circulates in blood as a disulfide-linked trimer of an alpha, beta and gamma chain. |
| Subcellular location | |
| Post-translational modification | Prior to secretion, the single-chain precursor is enzymatically cleaved to yield the non-identical chains (alpha, beta and gamma). During activation, the alpha chain is cleaved by C1 into C4a and C4b, and C4b stays linked to the beta and gamma chains. Further degradation of C4b by C1 into the inactive fragments C4c and C4d blocks the generation of C3 convertase. |
| Polymorphism | Human complement component C4 is polymorphic at two loci, C4A and C4B. 13 alleles of C4A and 22 alleles of C4B have been detected. The C4A alleles carry the Rodgers (Rg) while the C4B alleles carry the Chido (Ch) blood group antigens. |
| Involvement in disease | Defects in C4B are a cause of susceptibility to systemic lupus erythematosus (SLE) [MIM:152700]. A chronic, inflammatory and often febrile multisystemic disorder of connective tissue. It affects principally the skin, joints, kidneys and serosal membranes. It is thought to represent a failure of the regulatory mechanisms of the autoimmune system. Note=Interindividual copy-number variation (CNV) of complement component C4 and associated polymorphisms result in different susceptibilities to SLE. The risk of SLE susceptibility has been shown to be significantly increased among subjects with only two copies of total C4. A high copy number is a protective factor against SLE. Ref.16 |
| Miscellaneous | C4A allotypes react more rapidly with the amino group of peptide antigens while C4B allotypes react more rapidly with the hydroxyl group of carbohydrate antigens. |
| Sequence similarities | Contains 1 anaphylatoxin-like domain. Contains 1 NTR domain. |
| Sequence caution | The sequence AAA99717.1 differs from that shown. Reason: Erroneous gene model prediction. |
Ontologies
| Keywords | |
|---|---|
| Biological process | Complement pathway Immunity Inflammatory response Innate immunity |
| Cellular component | Secreted |
| Coding sequence diversity | Polymorphism |
| Disease | Systemic lupus erythematosus |
| Domain | Signal |
| Molecular function | Blood group antigen |
| PTM | Cleavage on pair of basic residues Disulfide bond Glycoprotein Sulfation Thioester bond |
| Technical term | Complete proteome Direct protein sequencing Reference proteome |
| Gene Ontology (GO) | |
| Biological process | complement activation, classical pathway Inferred from electronic annotation. Source: UniProtKB-KW inflammatory responseInferred from electronic annotation. Source: UniProtKB-KW innate immune responseTraceable author statement. Source: Reactome |
| Cellular component | extracellular space Inferred from electronic annotation. Source: InterPro |
| Molecular function | endopeptidase inhibitor activity Inferred from electronic annotation. Source: InterPro |
| Complete GO annotation... | |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 19 | 19 | |||||||||
| Chain | 20 – 675 | 656 | Complement C4 beta chain | PRO_0000042699 | |||||||
| Propeptide | 676 – 679 | 4 | PRO_0000042700 | ||||||||
| Chain | 680 – 1446 | 767 | Complement C4-B alpha chain | PRO_0000042701 | |||||||
| Chain | 680 – 756 | 77 | C4a anaphylatoxin | PRO_0000042702 | |||||||
| Chain | 757 – 1446 | 690 | C4b-B | PRO_0000042703 | |||||||
| Chain | 957 – 1336 | 380 | C4d-B | PRO_0000042704 | |||||||
| Propeptide | 1447 – 1453 | 7 | PRO_0000042705 | ||||||||
| Chain | 1454 – 1744 | 291 | Complement C4 gamma chain | PRO_0000042706 | |||||||
Regions | |||||||||||
| Domain | 702 – 736 | 35 | Anaphylatoxin-like | ||||||||
| Domain | 1595 – 1742 | 148 | NTR | ||||||||
Amino acid modifications | |||||||||||
| Modified residue | 1417 | 1 | Sulfotyrosine | ||||||||
| Modified residue | 1420 | 1 | Sulfotyrosine | ||||||||
| Modified residue | 1422 | 1 | Sulfotyrosine | ||||||||
| Glycosylation | 226 | 1 | N-linked (GlcNAc...) Ref.12 | ||||||||
| Glycosylation | 862 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 1328 | 1 | N-linked (GlcNAc...) Ref.14 | ||||||||
| Glycosylation | 1391 | 1 | N-linked (GlcNAc...) Ref.13 | ||||||||
| Disulfide bond | 702 ↔ 728 | By similarity | |||||||||
| Disulfide bond | 703 ↔ 735 | By similarity | |||||||||
| Disulfide bond | 716 ↔ 736 | By similarity | |||||||||
| Disulfide bond | 1595 ↔ 1673 | By similarity | |||||||||
| Disulfide bond | 1618 ↔ 1742 | By similarity | |||||||||
| Cross-link | 1010 ↔ 1013 | Isoglutamyl cysteine thioester (Cys-Gln) | |||||||||
Natural variations | |||||||||||
| Natural variant | 347 | 1 | S → Y. Ref.2 Corresponds to variant rs392610 [ dbSNP | Ensembl ]. | VAR_023729 | |||||||
| Natural variant | 907 | 1 | A → T. Ref.1 Ref.3 Corresponds to variant rs429329 [ dbSNP | Ensembl ]. | VAR_023730 | |||||||
| Natural variant | 1073 | 1 | D → G in allotype C4B1 and allotype C4B3. Ref.8 Corresponds to variant rs2258218 [ dbSNP | Ensembl ]. | VAR_023731 | |||||||
| Natural variant | 1176 | 1 | N → S in allotype C4B1, allotype C4B3 and allotype C4B5. Ref.8 Corresponds to variant rs2746414 [ dbSNP | Ensembl ]. | VAR_023732 | |||||||
| Natural variant | 1201 | 1 | S → T in allotype C4B. Ref.8 Ref.10 | VAR_023733 | |||||||
| Natural variant | 1207 | 1 | V → A in allotype C4B1, allotype C4B2 and allotype C4B3. Ref.8 Ref.10 Corresponds to variant rs2229403 [ dbSNP | Ensembl ]. | VAR_023734 | |||||||
| Natural variant | 1210 | 1 | L → R in allotype C4B1, allotype C4B2 and allotype C4B3. Ref.8 Ref.10 Corresponds to variant rs2229409 [ dbSNP | Ensembl ]. | VAR_023735 | |||||||
| Natural variant | 1286 | 1 | S → A. Ref.8 Ref.10 Corresponds to variant rs9501603 [ dbSNP | Ensembl ]. | VAR_023736 | |||||||
Experimental info | |||||||||||
| Sequence conflict | 980 – 981 | 2 | VT → LQ in AAA99717. Ref.1 | ||||||||
| Sequence conflict | 1013 | 1 | Q → E AA sequence Ref.7 | ||||||||
| Sequence conflict | 1013 | 1 | Q → E AA sequence Ref.8 | ||||||||
| Sequence conflict | 1013 | 1 | Q → E AA sequence Ref.9 | ||||||||
| Sequence conflict | 1109 – 1110 | 2 | SQ → IA AA sequence Ref.8 | ||||||||
| Sequence conflict | 1271 | 1 | H → V AA sequence Ref.8 | ||||||||
| Sequence conflict | 1271 | 1 | H → V AA sequence Ref.10 | ||||||||
| Sequence conflict | 1300 | 1 | R → V AA sequence Ref.8 | ||||||||
| Sequence conflict | 1300 | 1 | R → V AA sequence Ref.10 | ||||||||
| Sequence conflict | 1317 | 1 | I → F in AAA99717. Ref.1 | ||||||||
| Sequence conflict | 1654 | 1 | T → RA in AAA99717. Ref.1 | ||||||||
| Sequence conflict | 1698 | 1 | H → Q in AAA99717. Ref.1 | ||||||||
Sequences
| ||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "Complete sequence of the complement C4 gene from the HLA-A1, B8, C4AQ0, C4B1, DR3 haplotype." Ulgiati D., Townend D.C., Christiansen F.T., Dawkins R.L., Abraham L.J. Immunogenetics 43:250-252(1996) [PubMed: 8575831] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANT THR-907. Tissue: Blood. |
| [2] | "Sequence determination of 300 kilobases of the human class III MHC locus." Rowen L., Dankers C., Baskin D., Faust J., Loretz C., Ahearn M.E., Banta A., Swartzell S., Smith T.M., Spies T., Hood L. Submitted (OCT-1999) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANT TYR-347. |
| [3] | "The DNA sequence and analysis of human chromosome 6." Mungall A.J., Palmer S.A., Sims S.K., Edwards C.A., Ashurst J.L., Wilming L., Jones M.C., Horton R., Hunt S.E., Scott C.E., Gilbert J.G.R., Clamp M.E., Bethel G., Milne S., Ainscough R., Almeida J.P., Ambrose K.D., Andrews T.D. Beck S.Nature 425:805-811(2003) [PubMed: 14574404] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], VARIANT THR-907. |
| [4] | "Complete primary structure of human C4a anaphylatoxin." Moon K.E., Gorski J.P., Hugli T.E. J. Biol. Chem. 256:8685-8692(1981) [PubMed: 6167582] [Abstract] Cited for: PROTEIN SEQUENCE OF 680-756. |
| [5] | "Importance of the alpha 3-fragment of complement C4 for the binding with C4b-binding protein." Hessing M., van 't Veer C., Hackeng T.M., Bouma B.N., Iwanaga S. FEBS Lett. 271:131-136(1990) [PubMed: 1699796] [Abstract] Cited for: PROTEIN SEQUENCE OF 757-771 AND 980-990. |
| [6] | "The structural basis of the multiple forms of human complement component C4." Belt K.T., Carroll M.C., Porter R.R. Cell 36:907-914(1984) [PubMed: 6546707] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 956-1336. Tissue: Liver. |
| [7] | "Amino acid sequence around the thiol and reactive acyl groups of human complement component C4." Campbell R.D., Gagnon J., Porter R.R. Biochem. J. 199:359-370(1981) [PubMed: 6978711] [Abstract] Cited for: PROTEIN SEQUENCE OF 957-1044. |
| [8] | "The chemical structure of the C4d fragment of the human complement component C4." Chakravarti D.N., Campbell R.D., Porter R.R. Mol. Immunol. 24:1187-1197(1987) [PubMed: 3696167] [Abstract] Cited for: PROTEIN SEQUENCE OF 957-1336, VARIANTS GLY-1073; SER-1176; THR-1201; ALA-1207; ARG-1210 AND ALA-1286. |
| [9] | "Sequence determination of the thiolester site of the fourth component of human complement." Harrison R.A., Thomas M.L., Tack B.F. Proc. Natl. Acad. Sci. U.S.A. 78:7388-7392(1981) [PubMed: 6950384] [Abstract] Cited for: PROTEIN SEQUENCE OF 990-1037. |
| [10] | "Amino acid sequence of a polymorphic segment from fragment C4d of human complement component C4." Chakravarti D.N., Campbell R.D., Gagnon J. FEBS Lett. 154:387-390(1983) [PubMed: 6832377] [Abstract] Cited for: PROTEIN SEQUENCE OF 1199-1304, VARIANTS THR-1201; ALA-1207; ARG-1210 AND ALA-1286. |
| [11] | "Identification of the site of sulfation of the fourth component of human complement." Hortin G., Sims H., Strauss A.W. J. Biol. Chem. 261:1786-1793(1986) [PubMed: 3944109] [Abstract] Cited for: PROTEIN SEQUENCE OF 1405-1431, SULFATION. |
| [12] | "Identification and quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling and mass spectrometry." Zhang H., Li X.-J., Martin D.B., Aebersold R. Nat. Biotechnol. 21:660-666(2003) [PubMed: 12754519] [Abstract] Cited for: GLYCOSYLATION AT ASN-226. |
| [13] | "Screening for N-glycosylated proteins by liquid chromatography mass spectrometry." Bunkenborg J., Pilch B.J., Podtelejnikov A.V., Wisniewski J.R. Proteomics 4:454-465(2004) [PubMed: 14760718] [Abstract] Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-1391, MASS SPECTROMETRY. Tissue: Plasma. |
| [14] | "Identification of N-linked glycoproteins in human saliva by glycoprotein capture and mass spectrometry." Ramachandran P., Boontheung P., Xie Y., Sondej M., Wong D.T., Loo J.A. J. Proteome Res. 5:1493-1503(2006) [PubMed: 16740002] [Abstract] Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-1328, MASS SPECTROMETRY. Tissue: Saliva. |
| [15] | "Structural basis of the polymorphism of human complement components C4A and C4B: gene size, reactivity and antigenicity." Yu C.Y., Belt K.T., Giles C.M., Campbell R.D., Porter R.R. EMBO J. 5:2873-2881(1986) [PubMed: 2431902] [Abstract] Cited for: STRUCTURAL BASIS OF POLYMORPHISM. |
| [16] | "Gene copy-number variation and associated polymorphisms of complement component C4 in human systemic lupus erythematosus (SLE): low copy number is a risk factor for and high copy number is a protective factor against SLE susceptibility in European Americans." Yang Y., Chung E.K., Wu Y.L., Savelli S.L., Nagaraja H.N., Zhou B., Hebert M., Jones K.N., Shu Y., Kitzmiller K., Blanchong C.A., McBride K.L., Higgins G.C., Rennebohm R.M., Rice R.R., Hackshaw K.V., Roubey R.A., Grossman J.M. Yu C.Y.Am. J. Hum. Genet. 80:1037-1054(2007) [PubMed: 17503323] [Abstract] Cited for: INVOLVEMENT IN SLE. |
| + | Additional computationally mapped references. |
Web resources
| dbRBC/BGMUT Blood group antigen gene mutation database |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | U24578 Genomic DNA. Translation: AAA99717.1. Sequence problems. AF019413 Genomic DNA. Translation: AAB67980.1. AL049547 Genomic DNA. Translation: CAB89302.1. K02404 mRNA. Translation: AAA59651.1. |
| IPI | IPI00654875. |
| PIR | B20807. |
| RefSeq | NP_001002029.3. NM_001002029.3. NP_001229752.1. NM_001242823.2. |
| UniGene | Hs.534847. Hs.720022. |
3D structure databases | |
| ProteinModelPortal | P0C0L5. |
| SMR | P0C0L5. Positions 138-239, 683-745, 764-984, 996-1321, 1324-1743. |
| ModBase | Search... |
Protein-protein interaction databases | |
| DIP | DIP-47260N. |
| IntAct | P0C0L5. 1 interaction. |
| STRING | P0C0L5. |
Protein family/group databases | |
| MEROPS | I39.951. |
PTM databases | |
| PhosphoSite | P0C0L5. |
Polymorphism databases | |
| DMDM | 81175167. |
2D gel databases | |
| SWISS-2DPAGE | P0C0L5. |
Proteomic databases | |
| PRIDE | P0C0L5. |
Protocols and materials databases | |
| StructuralBiologyKnowledgebase | Search... |
Genome annotation databases | |
| Ensembl | ENST00000414246; ENSP00000403377; ENSG00000236625. ENST00000449788; ENSP00000414200; ENSG00000236625. |
| GeneID | 100293534. 721. |
| KEGG | hsa:100293534. hsa:721. |
Organism-specific databases | |
| CTD | 721. |
| GeneCards | GC06P031982. |
| HGNC | HGNC:1324. C4B. |
| MIM | 120790. phenotype. 120820. gene. 152700. phenotype. |
| neXtProt | NX_P0C0L5. |
| Orphanet | 169147. Immunodeficiency due to an early component of complement deficiency. |
| GenAtlas | Search... |
Phylogenomic databases | |
| HOVERGEN | HBG107123. |
| InParanoid | P0C0L5. |
| OrthoDB | EOG4JM7NW. |
Enzyme and pathway databases | |
| Reactome | REACT_6900. Immune System. |
Gene expression databases | |
| Genevestigator | P0C0L5. |
| GermOnline | ENSG00000204319. Homo sapiens. |
Family and domain databases | |
| InterPro | IPR009048. A-macroglobulin_rcpt-bd. IPR011626. A2M_comp. IPR002890. A2M_N. IPR011625. A2M_N_2. IPR000020. Anaphylatoxin/fibulin. IPR018081. Anaphylatoxin_. IPR001840. Anaphylatoxn. IPR001599. Macroglobln_a2. IPR019742. MacrogloblnA2_CS. IPR019565. MacrogloblnA2_thiol-ester-bond. IPR001134. Netrin_domain. IPR018933. Netrin_module_non-TIMP. IPR008930. Terpenoid_cyclase/PrenylTrfase. IPR008993. TIMP-like_OB-fold. [Graphical view] |
| Gene3D | G3DSA:2.60.40.690. A-macroglobulin_rcpt-bd. 1 hit. G3DSA:1.20.91.20. Anaphylatoxin. 1 hit. |
| KO | K03989. |
| Pfam | PF00207. A2M. 1 hit. PF07678. A2M_comp. 1 hit. PF01835. A2M_N. 1 hit. PF07703. A2M_N_2. 1 hit. PF07677. A2M_recep. 1 hit. PF01821. ANATO. 1 hit. PF01759. NTR. 1 hit. PF10569. Thiol-ester_cl. 1 hit. [Graphical view] |
| PRINTS | PR00004. ANAPHYLATOXN. |
| SMART | SM00104. ANATO. 1 hit. SM00643. C345C. 1 hit. [Graphical view] |
| SUPFAM | SSF49410. AM_receptor_bind. 1 hit. SSF47686. Anaphylatoxin. 1 hit. SSF48239. Terp_cyc_toroid. 1 hit. SSF50242. TIMP_like. 1 hit. |
| PROSITE | PS00477. ALPHA_2_MACROGLOBULIN. 1 hit. PS01177. ANAPHYLATOXIN_1. 1 hit. PS01178. ANAPHYLATOXIN_2. 1 hit. PS50189. NTR. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Other | |
| SOURCE | Search... |
Entry information
| Entry name | CO4B_HUMAN | ||||||||
| Accession | Primary (citable) accession number: P0C0L5 Secondary accession number(s): P01028 Q9UIP5 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Blood group antigen proteins Nomenclature of blood group antigens and list of entries |
| Human chromosome 6 Human chromosome 6: entries, gene names and cross-references to MIM |
| Human entries with polymorphisms or disease mutations List of human entries with polymorphisms or disease mutations |
| Human polymorphisms and disease mutations Index of human polymorphisms and disease mutations |
| MIM cross-references Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |

Clusters with