Protein

POU domain, class 5, transcription factor 1

Gene

pou5f1

Organism
Danio rerio (Zebrafish) (Brachydanio rerio)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at transcript level

Function

Involved in early development of embryos, especially in the process of gastrulation. May play an important role in establishing and specifying rhombomeric segments. Seems to be required to maintain the cells in a highly undifferentiated state. In contrast to POU2, T-POU2 lacks DNA-binding activity because of its incomplete POU domain structure. Overexpression of POU2 does not have any effect on development, whereas overexpression of T-POU2 causes developmental retardation or arrest before gastrulation.

Regions

Feature key    Position(s)    Description                              Length
DNA binding    343 – 402      Homeobox (PROSITE-ProRule annotation)    60

GO - Molecular function

  • chromatin binding Source: ZFIN
  • protein heterodimerization activity Source: ZFIN
  • sequence-specific DNA binding Source: ZFIN
  • transcription factor activity, sequence-specific DNA binding Source: ZFIN

GO - Biological process

  • anatomical structure morphogenesis Source: ZFIN
  • brain development Source: ZFIN
  • brain segmentation Source: ZFIN
  • dorsal/ventral pattern formation Source: ZFIN
  • ectoderm development Source: ZFIN
  • embryonic pattern specification Source: ZFIN
  • endoderm formation Source: ZFIN
  • epiboly Source: ZFIN
  • epiboly involved in gastrulation with mouth forming second Source: ZFIN
  • fin regeneration Source: ZFIN
  • hindbrain development Source: ZFIN
  • mesoderm development Source: ZFIN
  • morphogenesis of embryonic epithelium Source: ZFIN
  • positive regulation of transcription, DNA-templated Source: ZFIN
  • regulation of DNA methylation Source: ZFIN
  • regulation of endodermal cell fate specification Source: ZFIN
  • regulation of gene expression Source: ZFIN
  • transcription, DNA-templated Source: UniProtKB-KW

Keywords - Molecular function

Developmental protein

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding

Names & Taxonomy

Protein names
Recommended name: POU domain, class 5, transcription factor 1
Alternative name(s): POU domain protein 2
Gene names
Name: pou5f1
Synonyms: gp-9, pou-2, pou2
Organism: Danio rerio (Zebrafish) (Brachydanio rerio)
Taxonomic identifier: 7955 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Actinopterygii > Neopterygii > Teleostei > Ostariophysi > Cypriniformes > Cyprinidae > Danio
Proteomes
  • UP000000437 Component: Unplaced

Organism-specific databases

ZFIN: ZDB-GENE-980526-485. pou5f3.

Subcellular location

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key    Identifier        Position(s)    Description                                    Length
Chain          PRO_0000100752    1 – 472        POU domain, class 5, transcription factor 1    472

Proteomic databases

PaxDb: Q90270.

Expression

Developmental stage

Maternally expressed. Present from the one-cell stage to the gastrula stage. Present in all blastomeres until the midblastula stage. The expression is restricted to the epiblast during gastrulation, and to the neural plate after gastrulation. In the adult, expression is limited to the ovary.

Gene expression databases

Bgee: ENSDARG00000044774.

Interaction

GO - Molecular function

  • protein heterodimerization activity Source: ZFIN

Protein-protein interaction databases

STRING: 7955.ENSDARP00000065816.

Structure

3D structure databases

ProteinModelPortal: Q90270.

Family & Domains

Domains and Repeats

Feature key    Position(s)    Description                                  Length
Domain         249 – 323      POU-specific (PROSITE-ProRule annotation)    75

Compositional bias

Feature key           Position(s)    Description    Length
Compositional bias    247 – 250      Poly-Glu       4
Compositional bias    419 – 423      Poly-Pro       5

Sequence similarities

Contains 1 homeobox DNA-binding domain (PROSITE-ProRule annotation).
Contains 1 POU-specific domain (PROSITE-ProRule annotation).

Keywords - Domain

Homeobox

Phylogenomic databases

eggNOG: KOG3802. Eukaryota.
eggNOG: ENOG410XQ7X. LUCA.
HOGENOM: HOG000063726.
HOVERGEN: HBG053782.
InParanoid: Q90270.
KO: K09369.
TreeFam: TF316413.

Family and domain databases

Gene3D: 1.10.10.60. 1 hit.
Gene3D: 1.10.260.40. 1 hit.
InterPro: IPR017970. Homeobox_CS.
InterPro: IPR001356. Homeobox_dom.
InterPro: IPR009057. Homeodomain-like.
InterPro: IPR010982. Lambda_DNA-bd_dom.
InterPro: IPR013847. POU.
InterPro: IPR000327. POU_dom.
InterPro: IPR015585. POU_dom_5.
PANTHER: PTHR11636:SF79. PTHR11636:SF79. 1 hit.
Pfam: PF00046. Homeobox. 1 hit.
Pfam: PF00157. Pou. 1 hit.
PRINTS: PR00028. POUDOMAIN.
SMART: SM00389. HOX. 1 hit.
SMART: SM00352. POU. 1 hit.
SUPFAM: SSF46689. SSF46689. 1 hit.
SUPFAM: SSF47413. SSF47413. 1 hit.
PROSITE: PS00027. HOMEOBOX_1. 1 hit.
PROSITE: PS50071. HOMEOBOX_2. 1 hit.
PROSITE: PS00035. POU_1. 1 hit.
PROSITE: PS00465. POU_2. 1 hit.
PROSITE: PS51179. POU_3. 1 hit.

Sequences (2)

Sequence status: Complete.

This entry describes 2 isoforms produced by alternative splicing.

Isoform POU2 (identifier: Q90270-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MTERAQSPTA ADCRPYEVNR AMYPQAAGLD GLGGASLQFA HGMLQDPSLI
        60         70         80         90        100
FNKAHFNGIT PATAQTFFPF SGDFKTNDLQ GGDFTQPKHW YPFAAPEFTG
       110        120        130        140        150
QVAGATAATQ PANISPPIGE TREQIKMPSE VKTEKDVEEY GNEENKPPSQ
       160        170        180        190        200
YHLTAGTSSI PTGVNYYTPW NPNFWPGLSQ ITAQANISQA PPTPSASSPS
       210        220        230        240        250
LSPSPPGNGF GSPGFFSGGT AQNIPSAQAQ SAPRSSGSSS GGCSNSEEEE
       260        270        280        290        300
TLTTEDLEQF AKELKHKRIT LGFTQADVGL ALGNLYGKMF SQTTICRFEA
       310        320        330        340        350
LQLSFKNMCK LKPLLQRWLN EAENSENPQD MYKIERVFVD TRKRKRRTSL
       360        370        380        390        400
EGTVRSALES YFVKCPKPNT LEITHISDDL GLERDVVRVW FCNRRQKGKR
       410        420        430        440        450
LALPFDDECV EAQYYEQSPP PPPHMGGTVL PGQGYPGPAH PGGAPALYMP
       460        470
SLHRPDVFKN GFHPGLVGHL NS
Length: 472
Mass (Da): 51,505
Last modified: November 1, 1996 - v1
Checksum: D11678834FBD031F
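
The feature tables in this entry give positions as 1-based, inclusive ranges on the canonical sequence. As a minimal sketch of how such a range maps onto the sequence string, the Python below copies the canonical sequence from the listing above and slices out a few annotated features. The helper name `feature` and the `FEATURES` dictionary are illustrative choices, not part of UniProt or any library.

```python
# Canonical sequence (isoform POU2, Q90270-1), copied from the entry in blocks of 10.
CANONICAL = "".join("""
MTERAQSPTA ADCRPYEVNR AMYPQAAGLD GLGGASLQFA HGMLQDPSLI
FNKAHFNGIT PATAQTFFPF SGDFKTNDLQ GGDFTQPKHW YPFAAPEFTG
QVAGATAATQ PANISPPIGE TREQIKMPSE VKTEKDVEEY GNEENKPPSQ
YHLTAGTSSI PTGVNYYTPW NPNFWPGLSQ ITAQANISQA PPTPSASSPS
LSPSPPGNGF GSPGFFSGGT AQNIPSAQAQ SAPRSSGSSS GGCSNSEEEE
TLTTEDLEQF AKELKHKRIT LGFTQADVGL ALGNLYGKMF SQTTICRFEA
LQLSFKNMCK LKPLLQRWLN EAENSENPQD MYKIERVFVD TRKRKRRTSL
EGTVRSALES YFVKCPKPNT LEITHISDDL GLERDVVRVW FCNRRQKGKR
LALPFDDECV EAQYYEQSPP PPPHMGGTVL PGQGYPGPAH PGGAPALYMP
SLHRPDVFKN GFHPGLVGHL NS
""".split())


def feature(seq: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive), as in the feature tables."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"range {start}-{end} is outside the sequence")
    # Convert the 1-based inclusive range to Python's 0-based half-open slice.
    return seq[start - 1:end]


# Positions taken from the feature tables of this entry (hypothetical container).
FEATURES = {
    "POU-specific domain": (249, 323),
    "Homeobox": (343, 402),
    "Poly-Glu": (247, 250),
    "Poly-Pro": (419, 423),
}

for name, (start, end) in FEATURES.items():
    fragment = feature(CANONICAL, start, end)
    print(f"{name}: {start}-{end}, length {len(fragment)}")
```

Slicing 247-250 and 419-423 this way returns `EEEE` and `PPPPP`, matching the Poly-Glu and Poly-Pro annotations, which is a quick check that the copied sequence and the coordinate convention agree.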
Isoform T-POU2 (identifier: Q90270-2)

The sequence of this isoform differs from the canonical sequence as follows:
     386-399: VVRVWFCNRRQKGK → CVYGSATVDRRESV
     400-472: Missing.

Length: 399
Mass (Da): 43,502
Checksum: AB3C4C29A7E5C395
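
The two differences listed for isoform T-POU2 can be applied to the canonical sequence mechanically. The sketch below assumes the canonical sequence as printed in this entry and uses a hypothetical helper, `replace_range`, that checks the original residues before substituting them. It illustrates the coordinate arithmetic only and is not UniProt's own tooling.

```python
# Canonical sequence (isoform POU2, Q90270-1), copied from the entry in blocks of 10.
CANONICAL = "".join("""
MTERAQSPTA ADCRPYEVNR AMYPQAAGLD GLGGASLQFA HGMLQDPSLI
FNKAHFNGIT PATAQTFFPF SGDFKTNDLQ GGDFTQPKHW YPFAAPEFTG
QVAGATAATQ PANISPPIGE TREQIKMPSE VKTEKDVEEY GNEENKPPSQ
YHLTAGTSSI PTGVNYYTPW NPNFWPGLSQ ITAQANISQA PPTPSASSPS
LSPSPPGNGF GSPGFFSGGT AQNIPSAQAQ SAPRSSGSSS GGCSNSEEEE
TLTTEDLEQF AKELKHKRIT LGFTQADVGL ALGNLYGKMF SQTTICRFEA
LQLSFKNMCK LKPLLQRWLN EAENSENPQD MYKIERVFVD TRKRKRRTSL
EGTVRSALES YFVKCPKPNT LEITHISDDL GLERDVVRVW FCNRRQKGKR
LALPFDDECV EAQYYEQSPP PPPHMGGTVL PGQGYPGPAH PGGAPALYMP
SLHRPDVFKN GFHPGLVGHL NS
""".split())


def replace_range(seq: str, start: int, end: int, expected: str, replacement: str) -> str:
    """Replace residues start..end (1-based, inclusive) after verifying the original."""
    found = seq[start - 1:end]
    if found != expected:
        raise ValueError(f"expected {expected!r} at {start}-{end}, found {found!r}")
    return seq[:start - 1] + replacement + seq[end:]


# 386-399: VVRVWFCNRRQKGK -> CVYGSATVDRRESV (same length, so later positions do not shift)
t_pou2 = replace_range(CANONICAL, 386, 399, "VVRVWFCNRRQKGK", "CVYGSATVDRRESV")
# 400-472: Missing, so keep only residues 1-399
t_pou2 = t_pou2[:399]

print(len(t_pou2))
print(t_pou2[-14:])
```

The result has 399 residues, in line with the length stated for T-POU2. Because the changes begin at residue 386, they fall inside the homeobox region annotated at 343 – 402, which is consistent with the entry's note that T-POU2 has an incomplete POU domain.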

Experimental Info

Feature key          Position(s)    Description                                          Length
Sequence conflict    160            I → V in CAA59006 (PubMed:7669688) (curated)         1
Sequence conflict    245            N → D in CAA59006 (PubMed:7669688) (curated)         1
Sequence conflict    462            F → L in CAA59006 (PubMed:7669688) (curated)         1
Sequence conflict    471            N → T in CAA59006 (PubMed:7669688) (curated)         1

Alternative sequence

Feature key             Identifier    Position(s)    Description                                                       Length
Alternative sequence    VSP_002337    386 – 399      VVRVW…RQKGK → CVYGSATVDRRESV in isoform T-POU2 (curated)          14
Alternative sequence    VSP_002338    400 – 472      Missing in isoform T-POU2 (curated)                               73

Sequence databases

EMBL/GenBank/DDBJ: D28548 mRNA. Translation: BAA05901.1.
EMBL/GenBank/DDBJ: X84224 mRNA. Translation: CAA59006.1.
PIR: A49836.
PIR: B49836.
RefSeq: NP_571187.1. NM_131112.1.
UniGene: Dr.258.

Genome annotation databases

GeneID: 30333.
KEGG: dre:30333.

Keywords - Coding sequence diversity

Alternative splicing

Cross-references

Organism-specific databases

CTD: 30333.
ZFIN: ZDB-GENE-980526-485. pou5f3.

Miscellaneous databases

PRO: Q90270.

Entry information

Entry name: PO5F1_DANRE
Accession: Primary (citable) accession number: Q90270
Secondary accession number(s): Q90483
Entry history
Integrated into UniProtKB/Swiss-Prot: November 1, 1997
Last sequence update: November 1, 1996
Last modified: October 5, 2016
This is version 127 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
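
The identity and overlap thresholds above can be illustrated with a small check on a pairwise alignment. The entry does not say how UniRef computes these quantities, so the definitions in this sketch are assumptions: identity is the fraction of matching columns among columns where both rows have a residue, and overlap is the number of such columns divided by the ungapped length of the seed. The function names are hypothetical.

```python
from typing import Tuple


def identity_and_overlap(aligned_seed: str, aligned_member: str) -> Tuple[float, float]:
    """Return (identity, overlap) for two equal-length gapped alignment rows.

    Assumed definitions (not taken from UniRef documentation):
      identity = matching columns / columns where both rows have a residue
      overlap  = columns where both rows have a residue / ungapped seed length
    """
    if len(aligned_seed) != len(aligned_member):
        raise ValueError("alignment rows must have the same length")
    paired = [(a, b) for a, b in zip(aligned_seed, aligned_member)
              if a != "-" and b != "-"]
    seed_length = sum(1 for a in aligned_seed if a != "-")
    if not paired or seed_length == 0:
        return 0.0, 0.0
    matches = sum(1 for a, b in paired if a == b)
    return matches / len(paired), len(paired) / seed_length


def meets_threshold(aligned_seed: str, aligned_member: str,
                    min_identity: float, min_overlap: float = 0.80) -> bool:
    """True if the member satisfies both the identity and the overlap cut-off."""
    identity, overlap = identity_and_overlap(aligned_seed, aligned_member)
    return identity >= min_identity and overlap >= min_overlap


# Toy example using the first ten residues of the canonical sequence as the seed.
seed = "MTERAQSPTA"
print(meets_threshold(seed, "MTERAQSPTV", min_identity=0.90))  # one mismatch, full overlap
print(meets_threshold(seed, "MTER------", min_identity=0.90))  # identical but short overlap
```

The second call shows why the overlap condition matters: a short fragment can be fully identical over the aligned columns and still fail the 80% overlap requirement.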