Q60715 (P4HA1_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 126.

Names and origin

Protein names
  Recommended name: Prolyl 4-hydroxylase subunit alpha-1
    Short name: 4-PH alpha-1
    EC: 1.14.11.2
  Alternative name(s): Procollagen-proline,2-oxoglutarate-4-dioxygenase subunit alpha-1
Gene names
  Name: P4ha1
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 534 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Function

Catalyzes the post-translational formation of 4-hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other proteins.

Catalytic activity

L-proline-[procollagen] + 2-oxoglutarate + O2 = trans-4-hydroxy-L-proline-[procollagen] + succinate + CO2.
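
As a quick sanity check, the reaction above can be tested for atom balance. The sketch below is illustrative only: it assumes the proline and 4-hydroxyproline are counted as peptide-bound residues (free amino acid minus one water) and that 2-oxoglutarate and succinate are written as their neutral acids. The constant and function names are made up for this example and are not part of the entry.

```python
from collections import Counter

# Element counts per species. Assumptions (not stated in the entry):
# residues are peptide-bound, and the acids are in their neutral forms.
PROLINE_RESIDUE = Counter({"C": 5, "H": 7, "N": 1, "O": 1})
OXOGLUTARATE = Counter({"C": 5, "H": 6, "O": 5})       # 2-oxoglutaric acid
DIOXYGEN = Counter({"O": 2})
HYDROXYPROLINE_RESIDUE = Counter({"C": 5, "H": 7, "N": 1, "O": 2})
SUCCINATE = Counter({"C": 4, "H": 6, "O": 4})          # succinic acid
CARBON_DIOXIDE = Counter({"C": 1, "O": 2})


def total_atoms(species):
    """Sum the element counts over a list of species."""
    total = Counter()
    for formula in species:
        total.update(formula)
    return total


substrates = total_atoms([PROLINE_RESIDUE, OXOGLUTARATE, DIOXYGEN])
products = total_atoms([HYDROXYPROLINE_RESIDUE, SUCCINATE, CARBON_DIOXIDE])
is_balanced = substrates == products

print("substrates:", dict(substrates))
print("products:  ", dict(products))
print("balanced:  ", is_balanced)
```

One oxygen atom of O2 ends up in the hydroxyl group of the product residue and the other in succinate, which is why the reaction consumes 2-oxoglutarate and releases CO2.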

Cofactor

Binds 1 Fe2+ ion per subunit.

Ascorbate.

Subunit structure

Heterotetramer of two alpha-1 chains and two beta chains (the beta chain is the multi-functional PDI).

Subcellular location

Endoplasmic reticulum lumen.

Sequence similarities

Belongs to the P4HA family.

Contains 1 Fe2OG dioxygenase domain.

Contains 1 TPR repeat.

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q60715-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q60715-2)

The sequence of this isoform differs from the canonical sequence as follows:
     361-380: RRATISNPVTGALETVHYRI → SRATVHDPETGKLTTAQYRV
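
The substitution that defines isoform 2 can be applied programmatically. The following is a minimal sketch, assuming 1-based inclusive positions as used throughout the entry. For brevity it works on a window covering residues 351-390 of the canonical sequence; the function name `apply_alternative_sequence` and the `offset` parameter are hypothetical helpers for this illustration.

```python
def apply_alternative_sequence(canonical, start, end, original, replacement, offset=1):
    """Replace residues start..end (1-based, inclusive) with `replacement`.

    `offset` is the position of the first residue in `canonical`, so that a
    window cut from a longer sequence can still use entry coordinates.
    """
    lo = start - offset
    hi = end - offset + 1
    if canonical[lo:hi] != original:
        raise ValueError("canonical sequence does not match the expected segment")
    return canonical[:lo] + replacement + canonical[hi:]


# Residues 351-390 of the canonical (isoform 1) sequence, copied from the entry.
WINDOW_START = 351
canonical_window = "IVKDLAKPRL" "RRATISNPVT" "GALETVHYRI" "SKSAWLSGYE"

isoform2_window = apply_alternative_sequence(
    canonical_window,
    start=361,
    end=380,
    original="RRATISNPVTGALETVHYRI",
    replacement="SRATVHDPETGKLTTAQYRV",
    offset=WINDOW_START,
)

print(canonical_window)
print(isoform2_window)
```

Because the replaced and replacing segments are both 20 residues long, the two isoforms have the same length and all downstream positions keep their numbering.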

Sequence annotation (Features)

Feature key           Position(s)  Length  Description                                       Feature identifier

Molecule processing

Signal peptide        1 – 17       17      By similarity
Chain                 18 – 534     517     Prolyl 4-hydroxylase subunit alpha-1              PRO_0000022724

Regions

Repeat                205 – 238    34      TPR
Domain                411 – 519    109     Fe2OG dioxygenase

Sites

Metal binding         429          1       Iron; by similarity
Metal binding         431          1       Iron; by similarity
Metal binding         500          1       Iron; by similarity
Binding site          510          1       2-oxoglutarate; potential

Amino acid modifications

Glycosylation         113          1       N-linked (GlcNAc...); potential
Glycosylation         259          1       N-linked (GlcNAc...); potential

Natural variations

Alternative sequence  361 – 380    20      RRATI…VHYRI → SRATVHDPETGKLTTAQYRV in isoform 2.  VSP_004505

Experimental info

Sequence conflict     69           1       T → R in AAC52197 (Ref. 3)
Sequence conflict     147          1       T → N in AAC52197 (Ref. 3)
Sequence conflict     354          1       D → Y in AAC52197 (Ref. 3)

Sequences

Isoform 1 [UniParc].

Last modified May 2, 2002. Version 2.
Checksum: 81F6C61019E79460
Length: 534 AA. Mass: 60,910 Da.

        10         20         30         40         50         60 
MIWVVLMMAI LLPQSLAHPG FFTSIGQMTD LIHNEKDLVT SLKDYIKAEE DKLEQIKKWA 

        70         80         90        100        110        120 
EKLDRLTSTA TKDPEGFVGH PVNAFKLMKR LNTEWSELEN LILKDMSDGF ISNLTIQRQY 

       130        140        150        160        170        180 
FPNDEDQVGA AKALFRLQDT YNLDTNTISK GNLPGVQHKS FLTAEDCFEL GKVAYTEADY 

       190        200        210        220        230        240 
YHTELWMEQA LTQLEEGELS TVDKVSVLDY LSYAVYQQGD LDKALLLTKK LLELDPEHQR 

       250        260        270        280        290        300 
ANGNLVYFEY IMSKEKDANK SASGDQSDQK TAPKKKGIAV DYLPERQKYE MLCRGEGIKM 

       310        320        330        340        350        360 
TPRRQKRLFC RYHDGNRNPK FILAPAKQED EWDKPRIIRF HDIISDAEIE IVKDLAKPRL 

       370        380        390        400        410        420 
RRATISNPVT GALETVHYRI SKSAWLSGYE DPVVSRINMR IQDLTGLDVS TAEELQVANY 

       430        440        450        460        470        480 
GVGGQYEPHF DFARKDEPDA FRELGTGNRI ATWLFYMSDV SAGGATVFPE VGASVWPKKG 

       490        500        510        520        530 
TAVFWYNLFA SGEGDYSTRH AACPVLVGNK WVSNKWLHER GQEFRRPCTL SELE 
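
The displayed sequence mixes position rulers with blocks of ten residues. The sketch below shows one way to turn it into a plain string and spot-check it against the annotated positions. It is an illustration under stated assumptions: the helper names (`parse_displayed_sequence`, `residue_at`, `find_sequons`) are hypothetical, and the N-X-[S/T] (X not P) pattern is the commonly used consensus for N-linked glycosylation, not something the entry itself spells out.

```python
import re

# Sequence block copied from the entry, rulers included.
DISPLAYED_SEQUENCE = """
        10         20         30         40         50         60
MIWVVLMMAI LLPQSLAHPG FFTSIGQMTD LIHNEKDLVT SLKDYIKAEE DKLEQIKKWA
        70         80         90        100        110        120
EKLDRLTSTA TKDPEGFVGH PVNAFKLMKR LNTEWSELEN LILKDMSDGF ISNLTIQRQY
       130        140        150        160        170        180
FPNDEDQVGA AKALFRLQDT YNLDTNTISK GNLPGVQHKS FLTAEDCFEL GKVAYTEADY
       190        200        210        220        230        240
YHTELWMEQA LTQLEEGELS TVDKVSVLDY LSYAVYQQGD LDKALLLTKK LLELDPEHQR
       250        260        270        280        290        300
ANGNLVYFEY IMSKEKDANK SASGDQSDQK TAPKKKGIAV DYLPERQKYE MLCRGEGIKM
       310        320        330        340        350        360
TPRRQKRLFC RYHDGNRNPK FILAPAKQED EWDKPRIIRF HDIISDAEIE IVKDLAKPRL
       370        380        390        400        410        420
RRATISNPVT GALETVHYRI SKSAWLSGYE DPVVSRINMR IQDLTGLDVS TAEELQVANY
       430        440        450        460        470        480
GVGGQYEPHF DFARKDEPDA FRELGTGNRI ATWLFYMSDV SAGGATVFPE VGASVWPKKG
       490        500        510        520        530
TAVFWYNLFA SGEGDYSTRH AACPVLVGNK WVSNKWLHER GQEFRRPCTL SELE
"""


def parse_displayed_sequence(text):
    """Drop rulers and whitespace, keeping only one-letter residue codes."""
    return re.sub(r"[^A-Z]", "", text)


def residue_at(sequence, position):
    """Return the residue at a 1-based position, as used in the entry."""
    return sequence[position - 1]


def find_sequons(sequence):
    """1-based positions of N in N-X-[S/T] motifs where X is not proline."""
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", sequence)]


sequence = parse_displayed_sequence(DISPLAYED_SEQUENCE)
mature_chain = sequence[17:]  # residues 18-534, after the annotated signal peptide

print("length:", len(sequence), "mature chain:", len(mature_chain))
for position in (429, 431, 500, 510):
    print(position, residue_at(sequence, position))
print("candidate N-glycosylation sequons:", find_sequons(sequence))
```

With the sequence as shown, the three annotated iron-binding positions (429, 431, 500) are His, Asp and His, and the 2-oxoglutarate binding site at 510 is Lys. The sequon scan only lists candidate motifs and may return more positions than the two annotated in this entry (113 and 259).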

Isoform 2 [UniParc].

Checksum: CBDE1E9D6D261DD3
Length: 534 AA. Mass: 60,886 Da.

References

[1] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
Strain: C57BL/6J and NOD.
Tissue: Embryo, Head and Thymus.
[2] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Tissue: Mammary tumor.
[3] "Cloning, baculovirus expression, and characterization of a second mouse prolyl 4-hydroxylase alpha-subunit isoform: formation of an alpha 2 beta 2 tetramer with the protein disulfide-isomerase/beta subunit."
Helaakoski T., Annunen P., Vuori K., Macneil I.A., Pihlajaniemi T., Kivirikko K.I.
Proc. Natl. Acad. Sci. U.S.A. 92:4427-4431(1995)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 9-534 (ISOFORM 2).

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  AK045008 mRNA. Translation: BAC32183.1.
  AK160798 mRNA. Translation: BAE36020.1.
  AK169726 mRNA. Translation: BAE41331.1.
  BC009654 mRNA. Translation: AAH09654.1.
  U16162 mRNA. Translation: AAC52197.1.
CCDS: CCDS23863.1. [Q60715-1]
PIR: I49134.
RefSeq: NP_035160.1. NM_011030.2. [Q60715-1]
  XP_006513403.1. XM_006513340.1. [Q60715-2]
UniGene: Mm.2212.

3D structure databases

ProteinModelPortal: Q60715.
SMR: Q60715. Positions 18-254, 335-518.

Protein-protein interaction databases

BioGrid: 202006. 2 interactions.
IntAct: Q60715. 4 interactions.
MINT: MINT-4106073.

PTM databases

PhosphoSite: Q60715.

Proteomic databases

MaxQB: Q60715.
PaxDb: Q60715.
PRIDE: Q60715.

Genome annotation databases

Ensembl: ENSMUST00000009789; ENSMUSP00000009789; ENSMUSG00000019916. [Q60715-1]
  ENSMUST00000105466; ENSMUSP00000101106; ENSMUSG00000019916. [Q60715-2]
GeneID: 18451.
KEGG: mmu:18451.
UCSC: uc007fdo.2. mouse. [Q60715-1]
  uc007fdp.2. mouse. [Q60715-2]

Organism-specific databases

CTD: 5033.
MGI: MGI:97463. P4ha1.

Phylogenomic databases

eggNOG: NOG78926.
GeneTree: ENSGT00390000018885.
HOGENOM: HOG000230465.
HOVERGEN: HBG006834.
KO: K00472.
OMA: NDEDQIG.
OrthoDB: EOG7W6WKC.
PhylomeDB: Q60715.
TreeFam: TF313393.

Gene expression databases

ArrayExpress: Q60715.
Bgee: Q60715.
Genevestigator: Q60715.

Family and domain databases

Gene3D: 1.25.40.10. 2 hits.
InterPro: IPR005123. Oxoglu/Fe-dep_dioxygenase.
  IPR006620. Pro_4_hyd_alph.
  IPR013547. Pro_4_hyd_alph_N.
  IPR013026. TPR-contain_dom.
  IPR011990. TPR-like_helical.
  IPR019734. TPR_repeat.
Pfam: PF03171. 2OG-FeII_Oxy. 1 hit.
  PF08336. P4Ha_N. 1 hit.
SMART: SM00702. P4Hc. 1 hit.
PROSITE: PS51471. FE2OG_OXY. 1 hit.
  PS50005. TPR. 1 hit.
  PS50293. TPR_REGION. 1 hit.

Other

ChiTaRS: P4HA1. mouse.
NextBio: 294136.
PRO: Q60715.

Entry information

Entry name: P4HA1_MOUSE
Accession:
  Primary (citable) accession number: Q60715
  Secondary accession number(s): Q3TEB7, Q80T05, Q91VJ7
Entry history:
  Integrated into UniProtKB/Swiss-Prot: November 1, 1997
  Last sequence update: May 2, 2002
  Last modified: July 9, 2014
  This is version 126 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot