Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P15863 (PAX1_HUMAN)

Last modified June 16, 2009. Version 87. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (5) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Web resources · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Paired box protein Pax-1
Alternative name(s):
    HUP48
Gene names
Name: PAX1
Synonyms: HUP48
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length440 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at transcript level.

General annotation (Comments)

Function

This protein is a transcriptional activator. It may play a role in the formation of segmented structures of the embryo. May play an important role in the normal development of the vertebral column By similarity.

Subcellular location

Nucleus.

Sequence similarities

Contains 1 paired domain.

Ontologies

Keywords
   Biological processTranscription
Transcription regulation
   Cellular componentNucleus
   Coding sequence diversityAlternative splicing
Polymorphism
   DomainPaired box
   LigandDNA-binding
   Molecular functionActivator
Developmental protein
Gene Ontology (GO)
   Biological processtranscription from RNA polymerase II promoter

Traceable author statement. Source: ProtInc

   Cellular componentnucleus

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionDNA binding

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: P15863-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: P15863-2)

The sequence of this isoform differs from the canonical sequence as follows:
     330-361: PSREGSLPAPAARPRTPSVAYTDCPSRPRPPR → REGTDRKPPSSGSKAPDALSSLHGLPIPASTS
     362-440: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 440440Paired box protein Pax-1
PRO_0000050172

Regions

Domain4 – 130127Paired

Natural variations

Alternative sequence330 – 36132PSREG…PRPPR → REGTDRKPPSSGSKAPDALS SLHGLPIPASTS in isoform 2.
VSP_012986
Alternative sequence362 – 44079Missing in isoform 2.
VSP_012987
Natural variant451Q → H in a patient with neural tube defects, but not clearly linked to the disease. Ref.5
VAR_003787
Natural variant3451T → R: dbSNP rs17861058.
VAR_055369
Natural variant3591P → L: dbSNP rs17861059.
VAR_055370
Natural variant4101P → L: dbSNP rs17861061.
VAR_055371

Experimental info

Sequence conflict11M → A in CAA33146. Ref.4

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified March 1, 2005. Version 3.
Checksum: 500B7155C81616F9

FASTA44046,295
        10         20         30         40         50         60 
MEQTYGEVNQ LGGVFVNGRP LPNAIRLRIV ELAQLGIRPC DISRQLRVSH GCVSKILARY 

        70         80         90        100        110        120 
NETGSILPGA IGGSKPRVTT PNVVKHIRDY KQGDPGIFAW EIRDRLLADG VCDKYNVPSV 

       130        140        150        160        170        180 
SSISRILRNK IGSLAQPGPY EASKQPPSQP TLPYNHIYQY PYPSPVSPTG AKMGSHPGVP 

       190        200        210        220        230        240 
GTAGHVSIPR SWPSAHSVSN ILGIRTFMEQ TGALAGSEGT AYSPKMEDWA GVNRTAFPAT 

       250        260        270        280        290        300 
PAVNGLEKPA LEADIKYTQS ASTLSAVGGF LPACAYPASN QHGVYSAPGG GYLAPGPPWP 

       310        320        330        340        350        360 
PAQGPPLAPP GAGVAVHGGE LAAAMTFKHP SREGSLPAPA ARPRTPSVAY TDCPSRPRPP 

       370        380        390        400        410        420 
RGSSPRTRAR RERQADPGAQ VCAAAPAIGT GRIGGLAEEE ASAGPRGARP ASPQAQPCLW 

       430        440 
PDPPHFLYWS GFLGFSELGF 

« Hide

Isoform 2.

Checksum: 03B85D83A00D86E1
Show »

FASTA36137,825

References

« Hide 'large scale' references
[1]NIEHS SNPs program
Submitted (SEP-2004) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANTS ARG-345; LEU-359 AND LEU-410.
[2]"The DNA sequence and comparative analysis of human chromosome 20."
Deloukas P., Matthews L.H., Ashurst J.L., Burton J., Gilbert J.G.R., Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L., Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M., Beasley O.P., Bird C.P., Blakey S.E. expand/collapse author list , Bridgeman A.M., Brown A.J., Buck D., Burrill W.D., Butler A.P., Carder C., Carter N.P., Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M., Clegg S., Cobley V.E., Collier R.E., Connor R.E., Corby N.R., Coulson A., Coville G.J., Deadman R., Dhami P.D., Dunn M., Ellington A.G., Frankland J.A., Fraser A., French L., Garner P., Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E., Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J., Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D., Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S., Lehvaeslaiho M.H., Leversha M.A., Lloyd C., Lloyd D.M., Lovell J.D., Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A., Milne S.A., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T., Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I., Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H., Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S., Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E., Swann R.M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A., Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M., Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A., Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S., Rogers J.
Nature 414:865-871(2001) [PubMed: 11780052] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[3]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
[4]"Conservation of the paired domain in metazoans and its structure in three isolated human genes."
Burri M., Tromvoukis Y., Bopp D., Frigerio G., Noll M.
EMBO J. 8:1183-1190(1989) [PubMed: 2501086] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-142.
[5]"PAX genes and human neural tube defects: an amino acid substitution in PAX1 in a patient with spina bifida."
Hol F.A., Geurds M.P.A., Chatkupt S., Shugart Y.Y., Balling R., Schrander-Stumpel C.T.R.M., Johnson W.G., Hamel B.C.J., Mariman E.C.M.
J. Med. Genet. 33:655-660(1996) [PubMed: 8863157] [Abstract]
Cited for: VARIANT HIS-45.
+Additional computationally mapped references.

Web resources

Cross-references

Sequence databases

AY740018 Genomic DNA. Translation: AAU21037.1.
AL035562 Genomic DNA. Translation: CAB46996.1.
BC069134 mRNA. Translation: AAH69134.1.
X15044 Genomic DNA. Translation: CAA33146.1. Different initiation.
IPIIPI00013473.
IPI00419577.
PIRS06961.
UniGeneHs.349082

3D structure databases

HSSPHSSP built from PDB template 1K78 based on UniProtKB Q02548.
SMRP15863. Positions 8-130.
ModBaseSearch...

Proteomic databases

PRIDEP15863.

Genome annotation databases

EnsemblENSG00000125813. Homo sapiens. [Contig view]

Organism-specific databases

GeneCardsGC20P021634.
HGNCHGNC:8615. PAX1.
MIM167411. gene.
PharmGKBPA32955.
GenAtlasSearch...

Phylogenomic databases

HOGENOMP15863.
HOVERGENP15863.

Gene expression databases

ArrayExpressP15863.
BgeeP15863.
CleanExHS_PAX1.
GermOnlineENSG00000125813. Homo sapiens.

Family and domain databases

InterProIPR001523. Paired_box_N.
IPR011991. Wing_hlx_DNA_bd.
[Graphical view]
Gene3DG3DSA:1.10.10.10. Wing_hlx_DNA_bd. 2 hits.
PfamPF00292. PAX. 1 hit.
[Graphical view]
PRINTSPR00027. PAIREDBOX.
SMARTSM00351. PAX. 1 hit.
[Graphical view]
PROSITEPS00034. PAIRED_1. 1 hit.
PS51057. PAIRED_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Other Resources

NextBio19556.
SOURCESearch...

Entry information

Entry namePAX1_HUMAN
AccessionPrimary (citable) accession number: P15863
Secondary accession number(s): Q642X9, Q6NTC0, Q9Y558
Entry history
Integrated into UniProtKB/Swiss-Prot: April 1, 1990
Last sequence update: March 1, 2005
Last modified: June 16, 2009
This is version 87 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

Human chromosome 20

Human chromosome 20: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Web resources · Cross-references · Entry information · Relevant documents