Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot Q04756 (HGFA_HUMAN)

Last modified November 4, 2008. Version 91. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (7) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Binary interactions · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Hepatocyte growth factor activator
      Short name=HGF activator
      Short name=HGFA
    EC=3.4.21.-
Cleaved into the following 2 chains:
    1- Recommended name:
            Hepatocyte growth factor activator short chain
    2- Recommended name:
            Hepatocyte growth factor activator long chain
Gene names
Name: HGFAC
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length655 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Activates hepatocyte growth factor (HGF) by converting it from a single chain to a heterodimeric form.

Subunit structure

Heterodimer of a short chain and a long chain linked by a disulfide bond.

Subcellular location

Secreted. Note= Secreted as an inactive single-chain precursor and is then activated to a heterodimeric form.

Tissue specificity

Liver.

Sequence similarities

Belongs to the peptidase S1 family.

Contains 2 EGF-like domains.

Contains 1 fibronectin type-I domain.

Contains 1 fibronectin type-II domain.

Contains 1 kringle domain.

Contains 1 peptidase S1 domain.

Caution

It is uncertain whether Met-1 is the initiator.

Ontologies

Keywords

   Cellular componentSecreted
   Coding sequence diversityPolymorphism
   DomainEGF-like domain
Kringle
Repeat
Signal
   Molecular functionHydrolase
Protease
Serine protease
   PTMGlycoprotein
Zymogen
   Technical term3D-structure
Direct protein sequencing

Gene Ontology (GO)

   Biological processproteolysis

Traceable author statement. Source: ProtInc

   Cellular componentextracellular region Ref.1

Non-traceable author statement. Source: UniProtKB

   Molecular functionprotein binding

Inferred from physical interaction. Source: IntAct

serine-type endopeptidase activity Ref.1

Traceable author statement. Source: ProtInc

Complete GO annotation...

Binary interactions

With

Entry

#Exp.

IntAct

Notes

SPINT1O432781EBI-1041722,EBI-953990

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 3535
Propeptide36 – 372337Removed in mature form
PRO_0000027911
Chain373 – 40735Hepatocyte growth factor activator short chain
PRO_0000027912
Chain408 – 655248Hepatocyte growth factor activator long chain
PRO_0000027913

Regions

Domain103 – 15048Fibronectin type-II
Domain160 – 19839EGF-like 1
Domain200 – 24041Fibronectin type-I
Domain241 – 27939EGF-like 2
Domain286 – 36782Kringle
Domain408 – 646239Peptidase S1

Sites

Active site4471Charge relay system By similarity
Active site4971Charge relay system By similarity
Active site5981Charge relay system By similarity

Amino acid modifications

Glycosylation481N-linked (GlcNAc...) Potential
Glycosylation2901N-linked (GlcNAc...) Potential
Glycosylation4681N-linked (GlcNAc...)
Glycosylation4921N-linked (GlcNAc...) Potential
Glycosylation5461N-linked (GlcNAc...) Potential
Disulfide bond108 ↔ 133 By similarity
Disulfide bond122 ↔ 148 By similarity
Disulfide bond164 ↔ 175 By similarity
Disulfide bond169 ↔ 186 By similarity
Disulfide bond188 ↔ 197 By similarity
Disulfide bond202 ↔ 230 By similarity
Disulfide bond228 ↔ 237 By similarity
Disulfide bond245 ↔ 256 By similarity
Disulfide bond250 ↔ 267 By similarity
Disulfide bond269 ↔ 278 By similarity
Disulfide bond286 ↔ 367 By similarity
Disulfide bond307 ↔ 349 By similarity
Disulfide bond338 ↔ 362 By similarity
Disulfide bond394 ↔ 521Interchain (between short and long chains) By similarity
Disulfide bond432 ↔ 448 By similarity
Disulfide bond440 ↔ 510 By similarity
Disulfide bond535 ↔ 604 By similarity
Disulfide bond567 ↔ 583 By similarity
Disulfide bond594 ↔ 622 By similarity

Natural variations

Natural variant2251V → M: dbSNP rs16844370.
VAR_033651
Natural variant2311F → L: dbSNP rs1987546.
VAR_033652
Natural variant5091R → H: dbSNP rs16844401.
VAR_024294
Natural variant6441R → Q: dbSNP rs2498323.
VAR_024295

Secondary structure

............................................. 655
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
Q04756-1 [UniParc].

Last modified June 1, 1994. Version 1.
Checksum: 2CF72F1E1B862ED7

FASTA65570,682
        10         20         30         40         50         60 
MGRWAWVPSP WPPPGLGPFL LLLLLLLLLP RGFQPQPGGN RTESPEPNAT ATPAIPTILV 

        70         80         90        100        110        120 
TSVTSETPAT SAPEAEGPQS GGLPPPPRAV PSSSSPQAQA LTEDGRPCRF PFRYGGRMLH 

       130        140        150        160        170        180 
ACTSEGSAHR KWCATTHNYD RDRAWGYCVE ATPPPGGPAA LDPCASGPCL NGGSCSNTQD 

       190        200        210        220        230        240 
PQSYHCSCPR AFTGKDCGTE KCFDETRYEY LEGGDRWARV RQGHVEQCEC FGGRTWCEGT 

       250        260        270        280        290        300 
RHTACLSSPC LNGGTCHLIV ATGTTVCACP PGFAGRLCNI EPDERCFLGN GTGYRGVAST 

       310        320        330        340        350        360 
SASGLSCLAW NSDLLYQELH VDSVGAAALL GLGPHAYCRN PDNDERPWCY VVKDSALSWE 

       370        380        390        400        410        420 
YCRLEACESL TRVQLSPDLL ATLPEPASPG RQACGRRHKK RTFLRPRIIG GSSSLPGSHP 

       430        440        450        460        470        480 
WLAAIYIGDS FCAGSLVHTC WVVSAAHCFS HSPPRDSVSV VLGQHFFNRT TDVTQTFGIE 

       490        500        510        520        530        540 
KYIPYTLYSV FNPSDHDLVL IRLKKKGDRC ATRSQFVQPI CLPEPGSTFP AGHKCQIAGW 

       550        560        570        580        590        600 
GHLDENVSGY SSSLREALVP LVADHKCSSP EVYGADISPN MLCAGYFDCK SDACQGDSGG 

       610        620        630        640        650 
PLACEKNGVA YLYGIISWGD GCGRLHKPGV YTRVANYVDW INDRIRPPRR LVAPS 

« Hide

References

« Hide 'large scale' references
[1]"Molecular cloning and sequence analysis of the cDNA for a human serine protease reponsible for activation of hepatocyte growth factor. Structural similarity of the protease precursor to blood coagulation factor XII."
Miyazawa K., Shimomura T., Kitamura A., Kondo J., Morimoto Y., Kitamura N.
J. Biol. Chem. 268:10024-10028(1993) [PubMed: 7683665] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA], PARTIAL PROTEIN SEQUENCE.
Tissue: Liver and Serum.
[2]Zhao S., Odell C.
Submitted (FEB-1996) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE OF 40-655, VARIANT GLN-644.
[3]"Signal peptide prediction based on analysis of experimentally verified cleavage sites."
Zhang Z., Henzel W.J.
Protein Sci. 13:2819-2824(2004) [PubMed: 15340161] [Abstract]
Cited for: PROTEIN SEQUENCE OF 36-50.
[4]"Human plasma N-glycoproteome analysis by immunoaffinity subtraction, hydrazide chemistry, and mass spectrometry."
Liu T., Qian W.-J., Gritsenko M.A., Camp D.G. II, Monroe M.E., Moore R.J., Smith R.D.
J. Proteome Res. 4:2070-2080(2005) [PubMed: 16335952] [Abstract]
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-468, MASS SPECTROMETRY.
Tissue: Plasma.
+Additional computationally mapped references.

Cross-references

Sequence databases

D14012 mRNA. Translation: BAA03113.1.
Z69923 Genomic DNA. No translation available.
PIRA46688.
RefSeqNP_001519.1.
UniGeneHs.104

3D structure databases

EntryMethodResolution (Å)ChainPositionsPDBsum
1YBWX-ray2.70A/B373-655[»]
1YC0X-ray2.60A373-655[»]
2R0KX-ray3.51A373-655[»]
2R0LX-ray2.20A408-655[»]
B373-407[»]
ModBaseSearch...

Protein-protein interaction databases

DIPDIP:6022N.
IntActQ04756.

Protein family/group databases

MEROPSS01.228.

PTM databases

PhosphoSiteQ04756.

Proteomic databases

PeptideAtlasQ04756.

Genome annotation databases

EnsemblENSG00000109758. Homo sapiens. [Contig view]
GeneID3083.
KEGGhsa:3083.

Organism-specific databases

H-InvDBHIX0031501.
HGNCHGNC:4894. HGFAC.
HPACAB005215.
MIM604552. gene.
PharmGKB