Reviewed, UniProtKB/Swiss-Prot Q5R5A4 (CFAI_PONAB)

Last modified June 16, 2009. Version 30.

Names and origin

Protein names
Recommended name:
    Complement factor I
    EC=3.4.21.45
Alternative name(s):
    C3B/C4B inactivator
Cleaved into the following 2 chains:
    1- Recommended name:
            Complement factor I heavy chain
    2- Recommended name:
            Complement factor I light chain
Gene names
Name: CFI
Organism: Pongo abelii (Sumatran orangutan)
Taxonomic identifier: 9601 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Pongo

Protein attributes

Sequence length: 583 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Function

Responsible for cleaving the alpha-chains of C4b and C3b in the presence of the cofactors C4-binding protein and factor H, respectively (by similarity).

Catalytic activity

Inactivates complement subcomponents C3b, iC3b and C4b by proteolytic cleavage.

Subunit structure

Heterodimer of a light chain and a heavy chain linked by disulfide bonds (by similarity).

Subcellular location

Secreted, extracellular space.

Tissue specificity

Plasma.

Sequence similarities

Belongs to the peptidase S1 family.

Contains 2 LDL-receptor class A domains.

Contains 1 peptidase S1 domain.

Contains 1 SRCR domain.

Sequence annotation (Features)

Feature key      Position(s)   Length   Description                           Feature identifier

Molecule processing

Signal peptide   1 – 18        18       By similarity
Chain            19 – 583      565      Complement factor I                   PRO_0000285861
Chain            19 – 335      317      Complement factor I heavy chain       PRO_0000285862
Chain            340 – 583     244      Complement factor I light chain       PRO_0000285863

Regions

Domain           114 – 212     99       SRCR
Domain           213 – 257     45       LDL-receptor class A 1
Domain           258 – 294     37       LDL-receptor class A 2
Domain           340 – 574     235      Peptidase S1

Sites

Active site      380           1        Charge relay system (by similarity)
Active site      429           1        Charge relay system (by similarity)
Active site      525           1        Charge relay system (by similarity)

Amino acid modifications

Glycosylation    70            1        N-linked (GlcNAc...) (potential)
Glycosylation    103           1        N-linked (GlcNAc...) (potential)
Glycosylation    173           1        N-linked (GlcNAc...) (potential)
Glycosylation    177           1        N-linked (GlcNAc...) (potential)
Glycosylation    464           1        N-linked (GlcNAc...) (potential)
Glycosylation    494           1        N-linked (GlcNAc...) (potential)
Glycosylation    536           1        N-linked (GlcNAc...) (potential)
Disulfide bond   154 ↔ 214              By similarity
Disulfide bond   186 ↔ 196              By similarity
Disulfide bond   229 ↔ 247              By similarity
Disulfide bond   241 ↔ 256              By similarity
Disulfide bond   259 ↔ 271              By similarity
Disulfide bond   266 ↔ 284              By similarity
Disulfide bond   278 ↔ 293              By similarity
Disulfide bond   365 ↔ 381              By similarity
Disulfide bond   467 ↔ 531              By similarity
Disulfide bond   495 ↔ 510              By similarity
Disulfide bond   521 ↔ 550              By similarity
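
The feature coordinates are 1-based and inclusive, so each listed length should equal end - start + 1. The following is a minimal Python sketch of that consistency check, using the range features from this entry. The names FEATURES, feature_length and mismatches are illustrative and not part of any UniProt tool.

```python
# Range features copied from this entry: (key, description, start, end, listed length).
# Coordinates are 1-based and inclusive.
FEATURES = [
    ("Signal peptide", "", 1, 18, 18),
    ("Chain", "Complement factor I", 19, 583, 565),
    ("Chain", "Complement factor I heavy chain", 19, 335, 317),
    ("Chain", "Complement factor I light chain", 340, 583, 244),
    ("Domain", "SRCR", 114, 212, 99),
    ("Domain", "LDL-receptor class A 1", 213, 257, 45),
    ("Domain", "LDL-receptor class A 2", 258, 294, 37),
    ("Domain", "Peptidase S1", 340, 574, 235),
]


def feature_length(start, end):
    """Length of an inclusive, 1-based range."""
    return end - start + 1


def mismatches(features):
    """Return the features whose listed length disagrees with their coordinates."""
    return [f for f in features if feature_length(f[2], f[3]) != f[4]]


# Residues that lie between the heavy chain (ends at 335) and the light chain (starts at 340).
LINKER_RESIDUES = 340 - 335 - 1

if __name__ == "__main__":
    print("inconsistent features:", mismatches(FEATURES))
    print("residues between heavy and light chain:", LINKER_RESIDUES)
```

With the values listed here, every length agrees with its coordinates, and four residues (336 to 339) fall between the two chains.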

Sequences

Q5R5A4-1

Length: 583 AA. Mass: 65,620 Da.
Last modified December 21, 2004. Version 1.
Checksum: 8A390600223498AA
        10         20         30         40         50         60 
MKLLHVFLLF LCFHLSFCKV TYTSQEDLVE KKCLAKKHTH LSCNKVFCQP WQICIEGTCI 

        70         80         90        100        110        120 
CKLPYQCPKN GTTVCATNGR SFPTYCQQKS LECLRPGTKF LNNGTCTAEG KFSVSLKHGN 

       130        140        150        160        170        180 
TDSEGIVEVK LVDQDKTMFI CKSSWSMREA NVACLDLGFQ QGADTQRRFK LSNLSINSTE 

       190        200        210        220        230        240 
CLHVHCRGLE TSLAECTFTK RRTMGYQDLA DVVCYTQKAD SPTNDFFQCV NGKYISQMKA 

       250        260        270        280        290        300 
CDGINDCGDQ SDELCCKACQ GKSFHCKSGV CIPSQYRCNG EVDCITGEDE VGCEGFASVA 

       310        320        330        340        350        360 
QEETEILTAD MDAERRRIKS LLPKLSCGVK NRRHIRRKRI VGGKRAQLGD LPWQVGIKDA 

       370        380        390        400        410        420 
SGITCGGIYI GGCWILTAAH CLRASKTHHY QIWTTVVDWI HPDRKRIVIE YVDRIIFHEN 

       430        440        450        460        470        480 
YNAGTYQNDI ALMEMKKDGN KKDCELPRSI PACVPWSPYL FQPNDTCIVS GWGREKDNEK 

       490        500        510        520        530        540 
VFSLQWGEVK LISNCSKFYG NRFYEKEMEC AGTYDGSIDA CKGDSGGPLV CMDANNVTYV 

       550        560        570        580 
WGVVSWGENC GKPEFPGVYT KVANYFDWIS YHVGRPFISQ YNV 
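
The annotated positions can be cross-checked against the displayed sequence. The sketch below is a minimal Python example, not an official UniProt procedure. It assumes the usual serine-protease charge relay residues (His, Asp, Ser) and the common N-x-[S/T] sequon rule (x not proline) for N-linked glycosylation. Neither is stated in the entry itself, and all function and variable names are illustrative.

```python
# Sequence blocks copied from the entry; whitespace is stripped when loading.
SEQUENCE_BLOCKS = """
MKLLHVFLLF LCFHLSFCKV TYTSQEDLVE KKCLAKKHTH LSCNKVFCQP WQICIEGTCI
CKLPYQCPKN GTTVCATNGR SFPTYCQQKS LECLRPGTKF LNNGTCTAEG KFSVSLKHGN
TDSEGIVEVK LVDQDKTMFI CKSSWSMREA NVACLDLGFQ QGADTQRRFK LSNLSINSTE
CLHVHCRGLE TSLAECTFTK RRTMGYQDLA DVVCYTQKAD SPTNDFFQCV NGKYISQMKA
CDGINDCGDQ SDELCCKACQ GKSFHCKSGV CIPSQYRCNG EVDCITGEDE VGCEGFASVA
QEETEILTAD MDAERRRIKS LLPKLSCGVK NRRHIRRKRI VGGKRAQLGD LPWQVGIKDA
SGITCGGIYI GGCWILTAAH CLRASKTHHY QIWTTVVDWI HPDRKRIVIE YVDRIIFHEN
YNAGTYQNDI ALMEMKKDGN KKDCELPRSI PACVPWSPYL FQPNDTCIVS GWGREKDNEK
VFSLQWGEVK LISNCSKFYG NRFYEKEMEC AGTYDGSIDA CKGDSGGPLV CMDANNVTYV
WGVVSWGENC GKPEFPGVYT KVANYFDWIS YHVGRPFISQ YNV
"""
SEQ = "".join(SEQUENCE_BLOCKS.split())

# 1-based positions from the feature annotation of this entry.
# The expected residues (H, D, S) are an assumption: the canonical serine-protease triad.
CATALYTIC_TRIAD = {380: "H", 429: "D", 525: "S"}
GLYCOSYLATION_SITES = [70, 103, 173, 177, 464, 494, 536]
DISULFIDE_BONDS = [
    (154, 214), (186, 196), (229, 247), (241, 256), (259, 271), (266, 284),
    (278, 293), (365, 381), (467, 531), (495, 510), (521, 550),
]


def residue(seq, pos):
    """Return the residue at a 1-based position."""
    return seq[pos - 1]


def is_sequon(seq, pos):
    """True if the 1-based position starts an N-x-[S/T] motif with x not proline."""
    tri = seq[pos - 1:pos + 2]
    return len(tri) == 3 and tri[0] == "N" and tri[1] != "P" and tri[2] in "ST"


def check_entry(seq):
    """Summarise whether the annotated positions agree with the sequence."""
    return {
        "length": len(seq),
        "triad_ok": all(residue(seq, p) == aa for p, aa in CATALYTIC_TRIAD.items()),
        "sequons_ok": all(is_sequon(seq, p) for p in GLYCOSYLATION_SITES),
        "cysteines_ok": all(
            residue(seq, a) == "C" and residue(seq, b) == "C"
            for a, b in DISULFIDE_BONDS
        ),
    }


if __name__ == "__main__":
    print(check_entry(SEQ))
    # Residues 336-339, which lie between the heavy and light chains.
    print(SEQ[335:339])
```

Positions are converted from the entry's 1-based numbering to Python's 0-based indexing inside residue and is_sequon, so the annotation values can be used unchanged.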

References

[1] The German cDNA consortium
Submitted (NOV-2004) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Liver.

Cross-references

Sequence databases

CR860960 mRNA. Translation: CAH93062.1.
RefSeq: NP_001127624.1.
UniGene: Pab.18587.

Genome annotation databases

GeneID: 100174703.

Phylogenomic databases

HOVERGEN: Q5R5A4.

Enzyme and pathway databases

BRENDA: 3.4.21.45. 269192.

Family and domain databases

InterPro:
    IPR002172. LDL_rcpt_classA_cys-rich.
    IPR018114. Peptidase_S1/S6_AS.
    IPR001254. Peptidase_S1_S6.
    IPR001314. Peptidase_S1A.
    IPR002350. Prot_inh_Kazal.
    IPR011497. Prot_Inh_Kazal_2.
    IPR001190. Srcr_rcpt.
    IPR017448. Srcr_rcpt-rel.
Gene3D:
    G3DSA:4.10.400.10. LDL_rcpt_classA_cys-rich. 1 hit.
Pfam:
    PF07648. Kazal_2. 1 hit.
    PF00057. Ldl_recept_a. 2 hits.
    PF00530. SRCR. 1 hit.
    PF00089. Trypsin. 1 hit.
PRINTS:
    PR00722. CHYMOTRYPSIN.
    PR00261. LDLRECEPTOR.
SMART:
    SM00280. KAZAL. 1 hit.
    SM00192. LDLa. 2 hits.
    SM00202. SR. 1 hit.
    SM00020. Tryp_SPc. 1 hit.
PROSITE:
    PS01209. LDLRA_1. 1 hit.
    PS50068. LDLRA_2. 2 hits.
    PS00420. SRCR_1. False negative.
    PS50287. SRCR_2. 1 hit.
    PS50240. TRYPSIN_DOM. 1 hit.
    PS00134. TRYPSIN_HIS. 1 hit.
    PS00135. TRYPSIN_SER. 1 hit.

Entry information

Entry name: CFAI_PONAB
Accession: Primary (citable) accession number: Q5R5A4
Entry history:
    Integrated into UniProtKB/Swiss-Prot: May 1, 2007
    Last sequence update: December 21, 2004
    Last modified: June 16, 2009
    This is version 30 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: HPI (Human Proteome Initiative)

Relevant documents

Peptidase families

Classification of peptidase families and list of entries

SIMILARITY comments

Index of protein domains and families
