Q57408 (HGP4_HAEIN) Reviewed, UniProtKB/Swiss-Prot

Last modified February 19, 2014. Version 92.

Names and origin

Protein names: Recommended name: Probable hemoglobin and hemoglobin-haptoglobin-binding protein 4
Gene names: Ordered Locus Names: HI_1565/HI_1567
Organism: Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) [Reference proteome] [HAMAP]
Taxonomic identifier: 71421 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Pasteurellales > Pasteurellaceae > Haemophilus

Protein attributes

Sequence length: 999 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Inferred from homology.

General annotation (Comments)

Function

Acts as a receptor for hemoglobin or the hemoglobin/haptoglobin complex of the human host and is required for heme uptake (by similarity).

Subcellular location

Cell outer membrane; Peripheral membrane protein (by similarity).

Miscellaneous

This protein is subject to phase-variable expression associated with changes in the length of the CCAA repeat region. This mechanism is called slipped-strand mispairing. Addition or loss of CCAA repeat units changes the reading frame and introduces stop codons downstream of the repeat region. This may be a mechanism of regulation and a way to avoid the immunological response of the host (by similarity).
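
The frame arithmetic behind this can be sketched in a few lines of Python. Under the standard genetic code, three CCAA units read in frame as CCA-ACC-AAC-CAA, which translates to P-T-N-Q, so 18 units would give the six P-T-N-Q repeats annotated for this entry. Because one unit is 4 bases long, adding or removing a single unit shifts the downstream frame. The sketch assumes the repeat tract is read from its first base; the `DOWNSTREAM` bases and the helper names are hypothetical and are not taken from the genomic record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate from the first base, stopping after the first stop codon (*)."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


# HYPOTHETICAL downstream bases, chosen only to illustrate the effect.
DOWNSTREAM = "AATAGCAACGCA"


def repeat_region_product(n_units):
    """Translate n_units copies of CCAA followed by the hypothetical tail."""
    return translate("CCAA" * n_units + DOWNSTREAM)


in_frame = repeat_region_product(18)   # 72 bases: a multiple of 3
one_extra = repeat_region_product(19)  # 76 bases: downstream frame shifted
one_fewer = repeat_region_product(17)  # 68 bases: downstream frame shifted

print(in_frame)
print(one_extra)
print(one_fewer)
```

With this made-up tail, the 19-unit variant reaches a stop codon two codons after the repeat tract, while the 17-unit variant reads a different, out-of-frame peptide. Only multiples of three units keep the downstream frame intact.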

Sequence similarities

Belongs to the TonB-dependent receptor family. Hemoglobin/haptoglobin binding protein subfamily.

Sequence caution

The sequence AAC23213.1 differs from that shown. Reason: Frameshift at positions 49 and 289. The first frameshift is found in the repeats region.

The sequence AAC23214.1 differs from that shown. Reason: Frameshift at positions 49 and 289. The first frameshift is found in the repeats region.

Ontologies

Keywords
   Biological process: Transport
   Cellular component: Cell outer membrane; Membrane
   Domain: Repeat; Signal; TonB box; Transmembrane; Transmembrane beta strand
   Molecular function: Receptor
   Technical term: Complete proteome; Reference proteome
Gene Ontology (GO)
   Cellular_component: cell outer membrane. Inferred from electronic annotation. Source: UniProtKB-SubCell
   Cellular_component: integral component of membrane. Inferred from electronic annotation. Source: UniProtKB-KW
   Molecular_function: receptor activity. Inferred from electronic annotation. Source: InterPro
   Molecular_function: transporter activity. Inferred from electronic annotation. Source: InterPro

Sequence annotation (Features)

Feature key      Position(s)   Length   Description                                                        Feature identifier

Molecule processing

Signal peptide   1 – 24        24       Potential
Chain            25 – 999      975      Probable hemoglobin and hemoglobin-haptoglobin-binding protein 4   PRO_0000034788

Regions

Repeat           26 – 29       4        1
Repeat           30 – 33       4        2
Repeat           34 – 37       4        3
Repeat           38 – 41       4        4
Repeat           42 – 45       4        5
Repeat           46 – 49       4        6
Region           26 – 49       24       6 X 4 AA tandem repeats of P-T-N-Q
Motif            58 – 65       8        TonB box
Motif            982 – 999     18       TonB C-terminal box
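
The positions in this table can be checked against the sequence with a short Python sketch. It assumes the positions are 1-based and inclusive, which is consistent with the listed lengths (for example, 1 – 24 has length 24). `N_TERMINUS` holds the first 70 residues of the sequence shown in this entry; the helper name `feature` is hypothetical.

```python
# First 70 residues of the sequence displayed in this entry.
N_TERMINUS = (
    "MTNFRLNVLAYSVMLGLTASVAYAEPTNQPTNQPTNQPTNQPTNQPTNQNSNASEQLEQI"
    "NVLGSDNNND"
)


def feature(seq, start, end):
    """Return residues start..end, using 1-based inclusive positions."""
    return seq[start - 1:end]


signal_peptide = feature(N_TERMINUS, 1, 24)
repeat_region = feature(N_TERMINUS, 26, 49)
tonb_box = feature(N_TERMINUS, 58, 65)

# Split the repeat region into its 4-residue units.
repeats = [repeat_region[i:i + 4] for i in range(0, len(repeat_region), 4)]

print(signal_peptide)
print(repeats)
print(tonb_box)
```

Slicing with `start - 1` converts the 1-based start to Python's 0-based index, and leaving `end` unchanged makes the end position inclusive.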

Sequences

Sequence: Q57408 [UniParc]
Length: 999
Mass (Da): 114,315

Last modified August 29, 2001. Version 3.
Checksum: DAFCD4EB7000A876
        10         20         30         40         50         60 
MTNFRLNVLA YSVMLGLTAS VAYAEPTNQP TNQPTNQPTN QPTNQPTNQN SNASEQLEQI 

        70         80         90        100        110        120 
NVLGSDNNND NTPPKIAETV KTASQLKRQQ VQDSRDLVRY ETGVTVVEAG RFGSSGYAIR 

       130        140        150        160        170        180 
GVDENRVAIT VDGLHQAETL SSQGFKELFE GYGNFNNTRN SVEIETLKVA KIAKGADSVK 

       190        200        210        220        230        240 
VGSGSLGGAV LFETKDARDF LTEKDWHIGY KAGYSTADNQ GLNAVTLAGR YQMFDALIMH 

       250        260        270        280        290        300 
SKRHGHELEN YDYKNGRDIQ GKEREKADPY TITKESTLVK FSFSPTENHR FTVASDTYLQ 

       310        320        330        340        350        360 
HSRGHDFSYN LVKTTYINKD EEELRHTNDL TKRKNVSFTY ENYTVTPFWD TLKLSYSQQR 

       370        380        390        400        410        420 
ITTRARTEDY CDGNEKCDSY KNPLGLQLKE GKVVDRNGDP VELKLVEDEQ GQKRHQVVDK 

       430        440        450        460        470        480 
YNNPFSVASG TNNDAFVGKQ LSPSEFWLDC SIFNCDKPVR VYKYQYSNQE PESKEVELNR 

       490        500        510        520        530        540 
TMEINGKKFA TYESNNYRDR YHMILPNSKG YLPLDYKERD LNTKTKQINL DLTKAFTLFE 

       550        560        570        580        590        600 
IENELSYGGV YAKTTKEMVN KAGYYGRNPT WWAERTLGKS LLNGLRTCKE DSSYNGLLCP 

       610        620        630        640        650        660 
RHEPKTSFLI PVETTTKSLY FADNIKLHNM LSVDLGYRYD DIKYQPEYIP GVTPKIADDM 

       670        680        690        700        710        720 
VRELFVPLPP ANGKDWQGNP VYTPEQIRKN AEENIAYIAQ EKRFKKHSYS LGATFDPLNF 

       730        740        750        760        770        780 
LRVQVKYSKG FRTPTSDELY FTFKHPDFTI LPNPNMKPEE AKNQEIALTF HHDWGFFSTN 

       790        800        810        820        830        840 
VFQTKYRQFI DLAYLGSRNL SNSVGGQAQA RDFQVYQNVN VDRAKVKGVE INSRLNIGYF 

       850        860        870        880        890        900 
FEKLDGFNVS YKFTYQRGRL DGNRPMNAIQ PKTSVIGLGY DHKEQRFGAD LYVTHVSAKK 

       910        920        930        940        950        960 
AKDTYNMFYK EQGYKDSAVR WRSDDYTLVD FVTYIKPVKN VTLQFGVYNL TDRKYLTWES 

       970        980        990 
ARSIKPFGTS NLINQGTGAG INRFYSPGRN YKLSAEITF 
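
The sequence above is displayed in blocks of ten residues under position rulers. A minimal sketch of turning that layout back into one contiguous string follows, assuming ruler lines contain only numbers and residue lines contain only letters and spaces. The function name `parse_blocked_sequence` is hypothetical, and the sample uses only the first two rows of the sequence.

```python
def parse_blocked_sequence(text):
    """Join residue blocks into one string, skipping blank and ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip empty lines and ruler lines made up only of position numbers.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


SAMPLE = """
        10         20         30         40         50         60
MTNFRLNVLA YSVMLGLTAS VAYAEPTNQP TNQPTNQPTN QPTNQPTNQN SNASEQLEQI

        70         80         90        100        110        120
NVLGSDNNND NTPPKIAETV KTASQLKRQQ VQDSRDLVRY ETGVTVVEAG RFGSSGYAIR
"""

sequence = parse_blocked_sequence(SAMPLE)
print(len(sequence))
print(sequence[:30])
```

Applied to the full block, the resulting length can be compared with the stated sequence length of 999 AA as a sanity check.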

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: L42023 Genomic DNA. Translation: AAC23213.1. Sequence problems.
                       L42023 Genomic DNA. Translation: AAC23214.1. Sequence problems.
PIR: A64130.

3D structure databases

ProteinModelPortal: Q57408.

Genome annotation databases

EnsemblBacteria: AAC23213; AAC23213; HI_1565.
                 AAC23214; AAC23214; HI_1567.

Family and domain databases

Gene3D: 2.170.130.10. 1 hit.
        2.40.170.20. 3 hits.
InterPro: IPR012910. Plug.
          IPR006970. PT.
          IPR000531. TonB-dep_rcpt_b-brl.
          IPR010949. TonB_Hb/transfer/lactofer_rcpt.
          IPR010917. TonB_rcpt_CS.
Pfam: PF07715. Plug. 1 hit.
      PF04886. PT. 1 hit.
      PF00593. TonB_dep_Rec. 1 hit.
TIGRFAMs: TIGR01786. TonB-hemlactrns. 1 hit.
PROSITE: PS01156. TONB_DEPENDENT_REC_2. 1 hit.

Entry information

Entry name: HGP4_HAEIN
Accession: Primary (citable) accession number: Q57408
           Secondary accession number(s): O86244, P96344
Entry history:
   Integrated into UniProtKB/Swiss-Prot: November 1, 1997
   Last sequence update: August 29, 2001
   Last modified: February 19, 2014
   This is version 92 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

Haemophilus influenzae

Haemophilus influenzae (strain Rd): entries and gene names