
Reviewed, UniProtKB/Swiss-Prot Q9WUW3 (CFAI_RAT)

Last modified June 16, 2009. Version 68.


Names and origin

Protein names
Recommended name:
    Complement factor I
    EC=3.4.21.45
Alternative name(s):
    C3B/C4B inactivator
Cleaved into the following 2 chains:
    1- Recommended name:
            Complement factor I heavy chain
    2- Recommended name:
            Complement factor I light chain
Gene names
Name: Cfi
Synonyms: If
Organism: Rattus norvegicus (Rat)
Taxonomic identifier: 10116 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Rattus

Protein attributes

Sequence length: 604 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Function

Responsible for cleaving the alpha-chains of C4b and C3b in the presence of the cofactors C4-binding protein and factor H, respectively.

Catalytic activity

Inactivates complement subcomponents C3b, iC3b and C4b by proteolytic cleavage.

Subunit structure

Heterodimer of a light chain and a heavy chain linked by disulfide bonds.

Subcellular location

Secreted, extracellular space.

Tissue specificity

Plasma.

Sequence similarities

Belongs to the peptidase S1 family.

Contains 1 Kazal-like domain.

Contains 2 LDL-receptor class A domains.

Contains 1 peptidase S1 domain.

Contains 1 SRCR domain.

Sequence annotation (Features)

Feature key       Position(s)   Length   Description                            Feature identifier

Molecule processing

Signal peptide    1 – 18        18       Potential
Chain             19 – 604      586      Complement factor I                    PRO_0000027574
Chain             19 – 357      339      Complement factor I heavy chain        PRO_0000027575
Chain             362 – 604     243      Complement factor I light chain        PRO_0000027576

Regions

Domain            63 – 109      47       Kazal-like
Domain            117 – 217     101      SRCR
Domain            218 – 262     45       LDL-receptor class A 1
Domain            263 – 299     37       LDL-receptor class A 2
Domain            362 – 595     234      Peptidase S1

Sites

Active site       402           1        Charge relay system (By similarity)
Active site       450           1        Charge relay system (By similarity)
Active site       546           1        Charge relay system (By similarity)

Amino acid modifications

Glycosylation     40            1        N-linked (GlcNAc...) (Potential)
Glycosylation     106           1        N-linked (GlcNAc...) (Potential)
Glycosylation     116           1        N-linked (GlcNAc...) (Potential)
Glycosylation     182           1        N-linked (GlcNAc...) (Potential)
Glycosylation     515           1        N-linked (GlcNAc...) (Potential)
Glycosylation     557           1        N-linked (GlcNAc...) (Potential)
Disulfide bond    157 ↔ 219              By similarity
Disulfide bond    191 ↔ 201              By similarity
Disulfide bond    234 ↔ 252              By similarity
Disulfide bond    246 ↔ 261              By similarity
Disulfide bond    264 ↔ 276              By similarity
Disulfide bond    271 ↔ 289              By similarity
Disulfide bond    283 ↔ 298              By similarity
Disulfide bond    387 ↔ 403              By similarity
Disulfide bond    488 ↔ 552              By similarity
Disulfide bond    516 ↔ 531              By similarity
Disulfide bond    542 ↔ 571              By similarity
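
The positions in the feature table above are 1-based and inclusive, so each listed length should equal end - start + 1. The following is a minimal Python sketch of that arithmetic, not part of the entry itself; the names FEATURES, span_length, mismatches and linker_positions are illustrative assumptions, and the ranges are copied by hand from the table.

```python
# Feature ranges copied from the table above:
# (label, start, end, length as listed in the entry).
# Positions are assumed to be 1-based and inclusive.
FEATURES = [
    ("Signal peptide", 1, 18, 18),
    ("Chain: Complement factor I", 19, 604, 586),
    ("Chain: Complement factor I heavy chain", 19, 357, 339),
    ("Chain: Complement factor I light chain", 362, 604, 243),
    ("Domain: Kazal-like", 63, 109, 47),
    ("Domain: SRCR", 117, 217, 101),
    ("Domain: LDL-receptor class A 1", 218, 262, 45),
    ("Domain: LDL-receptor class A 2", 263, 299, 37),
    ("Domain: Peptidase S1", 362, 595, 234),
]


def span_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1


# Collect any feature whose listed length disagrees with its range.
mismatches = [
    (label, span_length(start, end), listed)
    for label, start, end, listed in FEATURES
    if span_length(start, end) != listed
]

# Residues that lie between the end of the heavy chain (357)
# and the start of the light chain (362).
linker_positions = list(range(357 + 1, 362))

print("length mismatches:", mismatches)
print("positions between the two chains:", linker_positions)
```

The heavy and light chains do not tile the full chain: four positions fall between them, which is consistent with the entry's statement that the displayed sequence is further processed into a mature form.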

Sequences

Sequence                Length    Mass (Da)
Q9WUW3-1 [UniParc]      604       67,298

Last modified November 1, 1999. Version 1.
Checksum: C775AE68D2D52D51
        10         20         30         40         50         60 
MKLALLILLL LNPHLSSSKN TPASGQPQED LVEQKCLLKN YTHHSCDKVF CQPWQKCIEG 

        70         80         90        100        110        120 
TCACKLPYQC PKAGTPVCAT NGRGYPTYCH LKSFECLHPE IKFSNNGTCT AEEKFNVSLI 

       130        140        150        160        170        180 
YGSTDTEGIV QVKLVDQDEK MFICKNSWST VEANVACFDL GFPLGVRDIQ GRFNIPVNHK 

       190        200        210        220        230        240 
INSTECLHVR CQGVETSLAE CTFTKKSSKA PHGLAGVVCY TQDADFPTSQ SFQCVNGKRI 

       250        260        270        280        290        300 
PQEKACDGVN DCGDQSDELC CKGCRGQAFL CKSGVCIPNQ RKCNGEVDCI TGEDESGCEE 

       310        320        330        340        350        360 
DKKNKIHKGL ARSDQGGETE IETEETEMLT PDMDTERKRI KSLLPKLSCG VKRNTHIRRK 

       370        380        390        400        410        420 
RVVGGKPAEM GDYPWQVAIK DGDRITCGGI YIGGCWILTA AHCVRPSRYR NYQVWTSLLD 

       430        440        450        460        470        480 
WLKPNSQLAV QGVSRVVVHE KYNGATYQND IALVEMKKHP GKKECELINS VPACVPWSPY 

       490        500        510        520        530        540 
LFQPNDRCII SGWGREKDNQ KVYSLRWGEV DLIGNCSRFY PGRYYEKEMQ CAGTSDGSID 

       550        560        570        580        590        600 
ACKGDSGGPL VCKDVNNVTY VWGIVSWGEN CGKPEFPGVY TRVASYFDWI SYYVGRPLVS 


QYNV 
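
The annotated positions can be checked against the displayed sequence directly. The sketch below is an illustration rather than a UniProt tool: it copies the sequence blocks from this entry, strips the layout whitespace, and looks up the residues at the annotated active sites, glycosylation sites and disulfide-bonded positions. All helper names (residue, region, is_sequon) are assumptions made for this example, and the N-X-[ST] pattern with X not proline is the commonly used N-glycosylation motif, not something stated in the entry.

```python
import re

# Sequence blocks copied from the entry; spaces and newlines are layout only.
RAW_SEQUENCE = """
MKLALLILLL LNPHLSSSKN TPASGQPQED LVEQKCLLKN YTHHSCDKVF CQPWQKCIEG
TCACKLPYQC PKAGTPVCAT NGRGYPTYCH LKSFECLHPE IKFSNNGTCT AEEKFNVSLI
YGSTDTEGIV QVKLVDQDEK MFICKNSWST VEANVACFDL GFPLGVRDIQ GRFNIPVNHK
INSTECLHVR CQGVETSLAE CTFTKKSSKA PHGLAGVVCY TQDADFPTSQ SFQCVNGKRI
PQEKACDGVN DCGDQSDELC CKGCRGQAFL CKSGVCIPNQ RKCNGEVDCI TGEDESGCEE
DKKNKIHKGL ARSDQGGETE IETEETEMLT PDMDTERKRI KSLLPKLSCG VKRNTHIRRK
RVVGGKPAEM GDYPWQVAIK DGDRITCGGI YIGGCWILTA AHCVRPSRYR NYQVWTSLLD
WLKPNSQLAV QGVSRVVVHE KYNGATYQND IALVEMKKHP GKKECELINS VPACVPWSPY
LFQPNDRCII SGWGREKDNQ KVYSLRWGEV DLIGNCSRFY PGRYYEKEMQ CAGTSDGSID
ACKGDSGGPL VCKDVNNVTY VWGIVSWGEN CGKPEFPGVY TRVASYFDWI SYYVGRPLVS
QYNV
"""
SEQ = "".join(RAW_SEQUENCE.split())

# Positions copied from the feature annotation of this entry (1-based).
ACTIVE_SITES = (402, 450, 546)
GLYCOSYLATION_SITES = (40, 106, 116, 182, 515, 557)
DISULFIDE_BONDS = [
    (157, 219), (191, 201), (234, 252), (246, 261), (264, 276), (271, 289),
    (283, 298), (387, 403), (488, 552), (516, 531), (542, 571),
]


def residue(pos: int) -> str:
    """Residue at a 1-based position."""
    return SEQ[pos - 1]


def region(start: int, end: int) -> str:
    """Subsequence for a 1-based, inclusive range."""
    return SEQ[start - 1:end]


def is_sequon(pos: int) -> bool:
    """True if pos starts an N-X-[ST] motif with X other than proline
    (assumed here as the usual N-glycosylation pattern)."""
    return re.fullmatch(r"N[^P][ST]", region(pos, pos + 2)) is not None


heavy_chain = region(19, 357)
light_chain = region(362, 604)
excised = region(358, 361)  # residues between the two chains

active_site_residues = [residue(p) for p in ACTIVE_SITES]
sequon_ok = all(is_sequon(p) for p in GLYCOSYLATION_SITES)
cysteines_ok = all(
    residue(a) == "C" and residue(b) == "C" for a, b in DISULFIDE_BONDS
)

print("length:", len(SEQ))
print("active-site residues:", active_site_residues)
print("all glycosylation sites match N-X-[ST]:", sequon_ok)
print("all disulfide positions are cysteines:", cysteines_ok)
print("residues between heavy and light chain:", excised)
```

Working from 1-based helper functions keeps the code aligned with the position rulers printed above each sequence row, so a position quoted in the annotation can be passed in unchanged.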


References

[1] "Rat complement factor I: molecular cloning, sequencing and expression in tissues and isolated cells."
Schlaf G., Rothermel E., Oppermann M., Schieferdecker H.L., Jungermann K., Gotze O.
Immunology 98:464-474(1999) [PubMed: 10583609] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
Strain: Sprague-Dawley.
Tissue: Liver.
[2] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Liver.

Cross-references

Sequence databases

Y18965 mRNA. Translation: CAB41688.1.
BC089798 mRNA. Translation: AAH89798.1.
IPI: IPI00204451.
RefSeq: NP_077071.1.
UniGene: Rn.7424.

3D structure databases

HSSP: HSSP built from PDB template 1RTF based on UniProtKB P00750.
ModBase: Search...

Protein family/group databases

MEROPS: S01.199.

Genome annotation databases

GeneID: 79126.
KEGG: rno:79126.

Organism-specific databases

RGD: 620429. Cfi.

Phylogenomic databases

HOVERGEN: Q9WUW3.

Enzyme and pathway databases

BRENDA: 3.4.21.45. 248.

Family and domain databases

InterPro: IPR002172. LDL_rcpt_classA_cys-rich.
    IPR018114. Peptidase_S1/S6_AS.
    IPR001254. Peptidase_S1_S6.
    IPR001314. Peptidase_S1A.
    IPR002350. Prot_inh_Kazal.
    IPR011497. Prot_Inh_Kazal_2.
    IPR001190. Srcr_rcpt.
    IPR017448. Srcr_rcpt-rel.
Gene3D: G3DSA:4.10.400.10. LDL_rcpt_classA_cys-rich. 1 hit.
Pfam: PF07648. Kazal_2. 1 hit.
    PF00057. Ldl_recept_a. 2 hits.
    PF00530. SRCR. 1 hit.
    PF00089. Trypsin. 1 hit.
PRINTS: PR00722. CHYMOTRYPSIN.
    PR00261. LDLRECEPTOR.
SMART: SM00280. KAZAL. 1 hit.
    SM00192. LDLa. 2 hits.
    SM00202. SR. 1 hit.
    SM00020. Tryp_SPc. 1 hit.
PROSITE: PS01209. LDLRA_1. 1 hit.
    PS50068. LDLRA_2. 2 hits.
    PS00420. SRCR_1. False negative.
    PS50287. SRCR_2. 1 hit.
    PS50240. TRYPSIN_DOM. 1 hit.
    PS00134. TRYPSIN_HIS. 1 hit.
    PS00135. TRYPSIN_SER. 1 hit.
ProtoNet: Search...

Other Resources

NextBio: 614562.

Entry information

Entry name: CFAI_RAT
Accession
Primary (citable) accession number: Q9WUW3
Secondary accession number(s): Q5EBC4
Entry history
Integrated into UniProtKB/Swiss-Prot: May 4, 2001
Last sequence update: November 1, 1999
Last modified: June 16, 2009
This is version 68 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: HPI (Human Proteome Initiative)

Relevant documents

Peptidase families

Classification of peptidase families and list of entries

SIMILARITY comments

Index of protein domains and families
