Reviewed,
UniProtKB/Swiss-Prot P01029 (CO4B_MOUSE)
Last modified
June 16, 2009.
Version 102.
History...
Clusters with 100%,
90%,
50% identity |
Documents (2) |
Third-party data |
Customize display | text xml rdf/xml gff fasta |
Names and origin
| Protein names | Recommended name: Complement C4-B Cleaved into the following 4 chains: 1- Recommended name: Complement C4 beta chain 2- Recommended name: Complement C4 alpha chain 3- Recommended name: C4a anaphylatoxin 4- Recommended name: Complement C4 gamma chain | ||||
| Gene names |
| ||||
| Organism | Mus musculus (Mouse) | ||||
| Taxonomic identifier | 10090 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus |
Protein attributes
| Sequence length | 1738 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level. |
General annotation (Comments)
| Function | C4 plays a central role in the activation of the classical pathway of the complement system. It is processed by activated C1 which removes from the alpha chain the C4a anaphylatoxin. The remaining alpha chain fragment C4b is the major activation product and is an essential subunit of the C3 convertase (C4b2a) and the C5 convertase (C3bC4b2a) enzymes of the classical complement pathway. |
| Subunit structure | Circulates in blood as a disulfide-linked trimer of an alpha, beta and gamma chain. |
| Subcellular location | |
| Post-translational modification | Prior to secretion, the single-chain precursor is enzymatically cleaved to yield the non-identical chains (alpha, beta and gamma). During activation, the alpha chain is cleaved by C1 into C4a and C4b, and C4b stays linked to the beta and gamma chains. Further degradation of C4b by C1 into the inactive fragments C4c and C4d blocks the generation of C3 convertase. |
| Miscellaneous | C4 is a major histocompatibility complex class-III protein. |
| Sequence similarities | Contains 1 anaphylatoxin-like domain. Contains 1 NTR domain. |
Ontologies
| Keywords | |
|---|---|
| Biological process | Complement pathway Immune response Inflammatory response Innate immunity |
| Cellular component | Secreted |
| Domain | Signal |
| PTM | Cleavage on pair of basic residues Disulfide bond Glycoprotein Sulfation Thioester bond |
| Gene Ontology (GO) | |
| Biological process | complement activation, classical pathway Inferred from electronic annotation. Source: UniProtKB-KW innate immune responseInferred from electronic annotation. Source: UniProtKB-KW |
| Cellular component | extracellular space Inferred from electronic annotation. Source: InterPro |
| Molecular function | endopeptidase inhibitor activity Inferred from electronic annotation. Source: InterPro protein bindingInferred from electronic annotation. Source: InterPro |
| Complete GO annotation... | |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||
Molecule processing | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Signal peptide | 1 – 19 | 19 | |||||||||
| Chain | 20 – 673 | 654 | Complement C4 beta chain | PRO_0000005973 | |||||||
| Propeptide | 674 – 677 | 4 | PRO_0000005974 | ||||||||
| Chain | 678 – 1443 | 766 | Complement C4 alpha chain | PRO_0000005975 | |||||||
| Chain | 678 – 753 | 76 | C4a anaphylatoxin | PRO_0000005976 | |||||||
| Propeptide | 1444 – 1447 | 4 | PRO_0000005977 | ||||||||
| Chain | 1448 – 1738 | 291 | Complement C4 gamma chain | PRO_0000005978 | |||||||
Regions | |||||||||||
| Domain | 700 – 734 | 35 | Anaphylatoxin-like | ||||||||
| Domain | 1589 – 1736 | 148 | NTR | ||||||||
Amino acid modifications | |||||||||||
| Modified residue | 1413 | 1 | Sulfotyrosine | ||||||||
| Modified residue | 1416 | 1 | Sulfotyrosine | ||||||||
| Modified residue | 1417 | 1 | Sulfotyrosine | ||||||||
| Glycosylation | 224 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Glycosylation | 743 | 1 | N-linked (GlcNAc...) | ||||||||
| Glycosylation | 1324 | 1 | N-linked (GlcNAc...) Ref.16 | ||||||||
| Glycosylation | 1387 | 1 | N-linked (GlcNAc...) Potential | ||||||||
| Disulfide bond | 700 ↔ 726 | By similarity | |||||||||
| Disulfide bond | 701 ↔ 733 | By similarity | |||||||||
| Disulfide bond | 714 ↔ 734 | By similarity | |||||||||
| Disulfide bond | 1589 ↔ 1667 | By similarity | |||||||||
| Disulfide bond | 1612 ↔ 1736 | By similarity | |||||||||
| Cross-link | 1006 ↔ 1009 | Isoglutamyl cysteine thioester (Cys-Gln) By similarity | |||||||||
Experimental info | |||||||||||
| Sequence conflict | 132 | 1 | F → Y in AAA39557. Ref.3 | ||||||||
| Sequence conflict | 177 | 1 | E → G in BAE34429. Ref.5 | ||||||||
| Sequence conflict | 283 | 1 | A → V in BAE34429. Ref.5 | ||||||||
| Sequence conflict | 327 | 1 | G → E in AAA39557. Ref.3 | ||||||||
| Sequence conflict | 440 | 1 | E → K in AAH67409. Ref.6 | ||||||||
| Sequence conflict | 440 | 1 | E → K in AAH67394. Ref.6 | ||||||||
| Sequence conflict | 570 | 1 | Q → E in AAA39557. Ref.3 | ||||||||
| Sequence conflict | 570 | 1 | Q → E in BAE34280. Ref.5 | ||||||||
| Sequence conflict | 604 | 1 | T → M in AAC05279. Ref.4 | ||||||||
| Sequence conflict | 604 | 1 | T → M in CAA28936. Ref.9 | ||||||||
| Sequence conflict | 639 | 1 | D → G in BAE34429. Ref.5 | ||||||||
| Sequence conflict | 720 | 1 | R → G Ref.10 | ||||||||
| Sequence conflict | 739 – 740 | 2 | DL → AI Ref.10 | ||||||||
| Sequence conflict | 758 | 1 | M → I in BAE34280. Ref.5 | ||||||||
| Sequence conflict | 838 | 1 | P → R in AAA39557. Ref.3 | ||||||||
| Sequence conflict | 916 | 1 | V → I in BAE34280. Ref.5 | ||||||||
| Sequence conflict | 993 | 1 | P → L Ref.10 | ||||||||
| Sequence conflict | 1043 | 1 | D → E Ref.10 | ||||||||
| Sequence conflict | 1077 | 1 | F → S in AAH67409. Ref.6 | ||||||||
| Sequence conflict | 1077 | 1 | F → S in AAH67394. Ref.6 | ||||||||
| Sequence conflict | 1119 | 1 | V → A in AAC42021. Ref.13 | ||||||||
| Sequence conflict | 1190 | 1 | A → T in AAC42021. Ref.13 | ||||||||
| Sequence conflict | 1206 | 1 | R → Q Ref.11 | ||||||||
| Sequence conflict | 1206 | 1 | R → Q in BAE34429. Ref.5 | ||||||||
| Sequence conflict | 1206 | 1 | R → Q in BAE34280. Ref.5 | ||||||||
| Sequence conflict | 1206 | 1 | R → Q in AAH67409. Ref.6 | ||||||||
| Sequence conflict | 1206 | 1 | R → Q in AAH67394. Ref.6 | ||||||||
| Sequence conflict | 1290 | 1 | S → N in AAC42022. Ref.14 | ||||||||
| Sequence conflict | 1324 | 1 | N → K in AAA39506. Ref.1 | ||||||||
| Sequence conflict | 1324 | 1 | N → K in AAA39561. Ref.2 | ||||||||
| Sequence conflict | 1324 | 1 | N → K in AAC42021. Ref.13 | ||||||||
| Sequence conflict | 1365 | 1 | K → E in BAE34429. Ref.5 | ||||||||
| Sequence conflict | 1401 | 1 | G → S in AAA39554. Ref.15 | ||||||||
| Sequence conflict | 1442 | 1 | R → K in AAA39557. Ref.3 | ||||||||
| Sequence conflict | 1453 | 1 | V → A in AAA39506. Ref.1 | ||||||||
| Sequence conflict | 1453 | 1 | V → A in AAA39561. Ref.2 | ||||||||
| Sequence conflict | 1453 | 1 | V → A in BAE34429. Ref.5 | ||||||||
| Sequence conflict | 1453 | 1 | V → A in AAA39554. Ref.15 | ||||||||
| Sequence conflict | 1456 | 1 | Q → R in BAE34429. Ref.5 | ||||||||
| Sequence conflict | 1586 | 1 | E → Q in CAA28936. Ref.9 | ||||||||
| Sequence conflict | 1611 | 1 | A → T in AAC05279. Ref.4 | ||||||||
Sequences
| ||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "Complete nucleotide and derived amino acid sequences of the fourth component of mouse complement (C4). Evolutionary aspects." Nonaka M., Nakayama K., Yeul Y.D., Takahashi M. J. Biol. Chem. 260:10936-10943(1985) [PubMed: 2993295] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA]. Strain: FM. |
| [2] | "Complete cDNA sequence of the fourth component of murine complement." Sepich D.S., Noonan D.J., Ogata R.T. Proc. Natl. Acad. Sci. U.S.A. 82:5895-5899(1985) [PubMed: 3862104] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA]. Strain: B10.WR. |
| [3] | "Sequence of the gene for murine complement component C4." Ogata R.T., Rosa P.A., Zepf N.E. J. Biol. Chem. 264:16565-16572(1989) [PubMed: 2777798] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA]. Strain: B10.WR. |
| [4] | "Analysis of the gene-dense major histocompatibility complex class III region and its comparison to mouse." Xie T., Rowen L., Aguado B., Ahearn M.E., Madan A., Qin S., Campbell R.D., Hood L. Genome Res. 13:2621-2636(2003) [PubMed: 14656967] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. Strain: 129. |
| [5] | "The transcriptional landscape of the mammalian genome." Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J. Hayashizaki Y.Science 309:1559-1563(2005) [PubMed: 16141072] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. Strain: C57BL/6J. Tissue: Inner ear. |
| [6] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. Strain: C57BL/6J and CD-1. Tissue: Germ cell and Neural stem cell. |
| [7] | "Molecular cloning and characterization of complementary and genomic DNA clones for mouse C4 and Slp." Nonaka M., Nakayama K., Yeul Y.D., Shimizu A., Takahashi M. Immunol. Rev. 87:81-99(1985) [PubMed: 2997024] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-128. Strain: FM. |
| [8] | "Identification of the 5'-flanking regulatory region responsible for the difference in transcriptional control between mouse complement C4 and Slp genes." Nonaka M., Kimura H., Yeul Y.D., Yokoyama S., Nakayama K., Takahashi M. Proc. Natl. Acad. Sci. U.S.A. 83:7883-7887(1986) [PubMed: 3464002] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-21. |
| [9] | "Sequence comparison of alleles of the fourth component of complement (C4) and sex-limited protein (Slp)." Hemenway C., Kalff M., Stavenhagen J., Walthall D., Robins D. Nucleic Acids Res. 14:2539-2554(1986) [PubMed: 3008092] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 591-1738. Strain: C57BL/10 X DBA/2. |
| [10] | "Isolation of cDNA clones specifying the fourth component of mouse complement and its isotype, sex-limited protein." Nonaka M., Takahashi M., Natsuume-Sakai S., Nonaka M., Tanaka S., Shimizu A., Honjo T. Proc. Natl. Acad. Sci. U.S.A. 81:6822-6826(1984) [PubMed: 6208559] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 651-810 AND 924-1083. |
| [11] | "Structural basis for the C4d.1/C4d.2 serologic allotypes of murine complement component C4." Taillon-Miller P.A., Shreffler D.C. J. Immunol. 141:2382-2387(1988) [PubMed: 2459207] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 961-1290. |
| [12] | "C4 from C4-high and C4-low mouse strains have identical sequences in the region corresponding to the isotype-specific segment of human C4." Ogata R.T., Zepf N.E. Eur. J. Immunol. 20:1607-1610(1990) [PubMed: 2387317] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1099-1142. Strain: B10.BR, B10.WR, C3H/He, C57BL/6, CBA/J and DBA/2. |
| [13] | "Multiple duplications of complement C4 gene correlate with H-2-controlled testosterone-independent expression of its sex-limited isoform, C4-Slp." Levi-Strauss M., Tosi M., Steinmetz M., Klein J., Meo T. Proc. Natl. Acad. Sci. U.S.A. 82:1746-1750(1985) [PubMed: 3856857] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1105-1449. |
| [14] | "Sequence heterogeneity of murine complementary DNA clones related to the C4 and C4-Slp isoforms of the fourth complement component." Tosi M., Levi-Strauss M., Duponchel C., Meo T. Philos. Trans. R. Soc. Lond., B, Biol. Sci. 306:389-394(1984) [PubMed: 6149581] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1257-1376. |
| [15] | "cDNA clone spanning the alpha-gamma subunit junction in the precursor of the murine fourth complement component (C4)." Ogata R.T., Shreffler D.C., Sepich D.S., Lilly S.P. Proc. Natl. Acad. Sci. U.S.A. 80:5061-5065(1983) [PubMed: 6192448] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1360-1511. |
| [16] | "Enhanced analysis of the mouse plasma proteome using cysteine-containing tryptic glycopeptides." Bernhard O.K., Kapp E.A., Simpson R.J. J. Proteome Res. 6:987-995(2007) [PubMed: 17330941] [Abstract] Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-1324, MASS SPECTROMETRY. Tissue: Plasma. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| M11789 Genomic DNA. Translation: AAA39557.1. M11729 mRNA. Translation: AAA39506.1. M17440 Genomic DNA. Translation: AAA39561.1. AF049850 Genomic DNA. Translation: AAC05279.1. AK157954 mRNA. Translation: BAE34280.1. AK158256 mRNA. Translation: BAE34429.1. BC067394 mRNA. Translation: AAH67394.1. BC067409 mRNA. Translation: AAH67409.1. M12969 Genomic DNA. Translation: AAA39559.1. M14225 Genomic DNA. Translation: AAA39563.1. X05314 mRNA. Translation: CAA28936.1. M12970 mRNA. Translation: AAA39555.1. M12972 mRNA. Translation: AAA39556.1. M23186 Genomic DNA. Translation: AAA40487.1. X55493 Genomic DNA. Translation: CAA39112.1. X55495 Genomic DNA. Translation: CAA39114.1. K02798 mRNA. Translation: AAC42021.1. K02799 mRNA. Translation: AAC42022.1. K00019 mRNA. Translation: AAA39554.1. M12968 Genomic DNA. Translation: AAA39558.1. | |
| IPI | IPI00131091. |
| PIR | A24558. A29176. |
| RefSeq | NP_033910.2. |
| UniGene | Mm.439678 Mm.472690 |
3D structure databases | |
| HSSP | HSSP built from PDB template 1KJS based on UniProtKB P01031. |
| SMR | P01029. Positions 992-1317. |
| ModBase | Search... |
Proteomic databases | |
| PRIDE | P01029. |
Genome annotation databases | |
| Ensembl | ENSMUSG00000073418. Mus musculus. [Contig view] |
| GeneID | 12268. |
| KEGG | mmu:12268. |
Organism-specific databases | |
| MGI | MGI:88228. C4b. |
Phylogenomic databases | |
| HOVERGEN | P01029. |
Gene expression databases | |
| ArrayExpress | P01029. |
| CleanEx | MM_C4B. |
| GermOnline | ENSMUSG00000073418. Mus musculus. |
Family and domain databases | |
| InterPro | IPR009048. A-macroglobulin_rcpt-bd. IPR011626. A2M_comp. IPR002890. A2M_N. IPR011625. A2M_N_2. IPR000020. Anaphylatoxin/fibulin. IPR018081. Anaphylatoxin_. IPR001840. Anaphylatoxn. IPR001599. MacrogloblnA2. IPR019742. MacrogloblnA2_CS. IPR019565. MacrogloblnA2_thiol-ester-bond. IPR001134. Netrin_domain. IPR018933. Netrin_module_non-TIMP. [Graphical view] |
| Gene3D | G3DSA:2.60.40.690. A-macroglobulin_rcpt-bd. 1 hit. G3DSA:1.20.91.20. Anaphylatoxin. 1 hit. |
| Pfam | PF00207. A2M. 1 hit. PF07678. A2M_comp. 1 hit. PF01835. A2M_N. 1 hit. PF07703. A2M_N_2. 1 hit. PF07677. A2M_recep. 1 hit. PF01821. ANATO. 1 hit. PF01759. NTR. 1 hit. PF10569. Thiol-ester_cl. 1 hit. [Graphical view] |
| PRINTS | PR00004. ANAPHYLATOXN. |
| ProDom | PD003264. Anaphylatoxin. 1 hit. [Graphical view] [Entries sharing at least one domain] |
| SMART | SM00104. ANATO. 1 hit. SM00643. C345C. 1 hit. [Graphical view] |
| PROSITE | PS00477. ALPHA_2_MACROGLOBULIN. 1 hit. PS01177. ANAPHYLATOXIN_1. 1 hit. PS01178. ANAPHYLATOXIN_2. 1 hit. PS50189. NTR. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Other Resources | |
| NextBio | 280722. |
| SOURCE | Search... |
Entry information
| Entry name | CO4B_MOUSE | ||||||||
| Accession | Primary (citable) accession number: P01029 Secondary accession number(s): O70346 Q6NWV8 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation project | HPI (Human Proteome Initiative) | ||||||||
Relevant documents
| MGD cross-references Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |

Clusters with


