Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P06683 (CO9_MOUSE)

Last modified June 16, 2009. Version 91. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Complement component C9
Gene names
Name: C9
OrganismMus musculus (Mouse)
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMus

Protein attributes

Sequence length548 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

C9 is the final component of the complement system to be added in the assembly of the membrane attack complex. It is able to enter lipid bilayers, forming transmembrane channels.

Subcellular location

Membrane; Multi-pass membrane protein Potential. Secreted.

Sequence similarities

Belongs to the complement C6/C7/C8/C9 family.

Contains 1 EGF-like domain.

Contains 1 LDL-receptor class A domain.

Contains 1 MACPF domain.

Contains 1 TSP type-1 domain.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 2020 By similarity
Chain21 – 548528Complement component C9
PRO_0000023605

Regions

Transmembrane312 – 32817 Potential
Transmembrane333 – 35220 Potential
Domain40 – 9354TSP type-1
Domain97 – 13438LDL-receptor class A
Domain136 – 512377MACPF
Domain513 – 54331EGF-like

Amino acid modifications

Glycosylation481N-linked (GlcNAc...) Potential
Glycosylation2631N-linked (GlcNAc...) Ref.3
Glycosylation4171N-linked (GlcNAc...) Potential
Disulfide bond41 ↔ 76 By similarity
Disulfide bond52 ↔ 55 By similarity
Disulfide bond86 ↔ 92 By similarity
Disulfide bond99 ↔ 110 By similarity
Disulfide bond105 ↔ 123 By similarity
Disulfide bond117 ↔ 132 By similarity
Disulfide bond140 ↔ 179 By similarity
Disulfide bond378 ↔ 407 By similarity
Disulfide bond513 ↔ 529 By similarity
Disulfide bond516 ↔ 531 By similarity
Disulfide bond533 ↔ 542 By similarity

Experimental info

Sequence conflict881P → T in CAA29038. Ref.2
Sequence conflict1531T → R in CAA29038. Ref.2
Sequence conflict2411A → P in CAA29038. Ref.2
Sequence conflict2461E → Q in CAA29038. Ref.2
Sequence conflict2611P → A in CAA29038. Ref.2
Sequence conflict269 – 2702KF → TI in CAA29038. Ref.2
Sequence conflict2851F → L in CAA29038. Ref.2
Sequence conflict5471K → T in CAA29038. Ref.2

Sequences

Sequence LengthMass (Da)Tools
P06683-1 [UniParc].

Last modified January 23, 2002. Version 2.
Checksum: 8F1D16184E4781BE

FASTA54862,002
        10         20         30         40         50         60 
MASGMAITLA LAIFALGVNA QMPIPVSREE QEQHYPIPID CRMSPWSNWS ECDPCLKQRF 

        70         80         90        100        110        120 
RSRSILAFGQ FNGKSCVDVL GDRQGCEPTQ ECEEIQENCG NDFQCETGRC IKRRLLCNGD 

       130        140        150        160        170        180 
NDCGDYSDEN DCDDDPRTPC RDRVAEESEL GLTAGYGINI LGMEPLRTPF DNEFYNGLCD 

       190        200        210        220        230        240 
RVRDEKTYYR KPWNVVSLIY ETKADKSFRT ENYDEHLEVF KAINREKTSN FNADFALKFS 

       250        260        270        280        290        300 
ATEVPEKGAG EVSPAEHSSK PTNISAKFKF SYFMGKNFRR LSSYFSQSKK MFVHLRGVVQ 

       310        320        330        340        350        360 
LGRFVMRNRD VVLRSTFLDD VKALPTSYEK GEYFGFLETY GTHYSTSGSL GGQYEIVYVL 

       370        380        390        400        410        420 
DKASMKEKGV DLNDVKHCLG FNMDLRIPLQ DDLKDASVTA SVNADGCIKT DNGKTVNITR 

       430        440        450        460        470        480 
DNIIDDVISF IRGGTREQAI LLKEKILRGD KTFDKTDFAN WASSLANAPA LISQRMSPIY 

       490        500        510        520        530        540 
NLIPLKIKDA YIKKQNLEKA VEDYIDEFST KRCYPCLNGG TIILLDGQCL CSCPMMFRGM 


ACEIHQKI 

« Hide

References

« Hide 'large scale' references
[1]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Liver.
[2]"Topological mapping of complement component C9 by recombinant DNA techniques suggests a novel mechanism for its insertion into target membranes."
Stanley K.K., Herz J.
EMBO J. 6:1951-1957(1987) [PubMed: 2443347] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 21-548.
[3]"Proteome-wide characterization of N-glycosylation events by diagonal chromatography."
Ghesquiere B., Van Damme J., Martens L., Vandekerckhove J., Gevaert K.
J. Proteome Res. 5:2438-2447(2006) [PubMed: 16944957] [Abstract]
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-263, MASS SPECTROMETRY.
Tissue: Plasma.
+Additional computationally mapped references.

Cross-references

Sequence databases

BC011137 mRNA. Translation: AAH11137.1. Different initiation.
X05475 mRNA. Translation: CAA29038.1.
IPIIPI00230718.
PIRA29677.
RefSeqNP_038513.1.
UniGeneMm.29095

3D structure databases

HSSPHSSP built from PDB template 1J8E based on UniProtKB Q07954.
ModBaseSearch...

Proteomic databases

PRIDEP06683.

Genome annotation databases

EnsemblENSMUSG00000022149. Mus musculus. [Contig view]
GeneID12279.
KEGGmmu:12279.

Organism-specific databases

MGIMGI:1098282. C9.

Phylogenomic databases

HOGENOMP06683.
HOVERGENP06683.

Gene expression databases

ArrayExpressP06683.
BgeeP06683.
CleanExMM_C9.
GermOnlineENSMUSG00000022149. Mus musculus.

Family and domain databases

InterProIPR013032. EGF-like_reg_CS.
IPR002172. LDL_rcpt_classA_cys-rich.
IPR001862. MAC_perforin.
IPR000884. Thrombospondin_1_rpt.
[Graphical view]
Gene3DG3DSA:4.10.400.10. LDL_rcpt_classA_cys-rich. 1 hit.
PfamPF00057. Ldl_recept_a. 1 hit.
PF01823. MACPF. 1 hit.
PF00090. TSP_1. 1 hit.
[Graphical view]
PRINTSPR00764. COMPLEMENTC9.
SMARTSM00192. LDLa. 1 hit.
SM00457. MACPF. 1 hit.
SM00209. TSP1. 1 hit.
[Graphical view]
PROSITEPS00022. EGF_1. 1 hit.
PS01186. EGF_2. False negative.
PS50026. EGF_3. False negative.
PS01209. LDLRA_1. 1 hit.
PS50068. LDLRA_2. 1 hit.
PS00279. MACPF_1. 1 hit.
PS51412. MACPF_2. 1 hit.
PS50092. TSP1. 1 hit.
[Graphical view]
ProtoNetSearch...

Other Resources

NextBio280740.
SOURCESearch...

Entry information

Entry nameCO9_MOUSE
AccessionPrimary (citable) accession number: P06683
Secondary accession number(s): Q91XA7
Entry history
Integrated into UniProtKB/Swiss-Prot: January 1, 1988
Last sequence update: January 23, 2002
Last modified: June 16, 2009
This is version 91 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents