Protein

Forkhead box protein P2

Gene

FOXP2

Organism
Pan troglodytes (Chimpanzee)
Status
Reviewed; annotation score: 4 out of 5; experimental evidence at transcript level

Function

Transcriptional repressor that may play a role in the specification and differentiation of lung epithelium. May also play a role in developing neural, gastrointestinal and cardiovascular tissues. Can act with CTBP1 to synergistically repress transcription, but CTBP1 is not essential. Plays a role in synapse formation by regulating SRPX2 levels (By similarity).

Regions

Feature key   Position(s)   Length   Description
Zinc finger   347 – 372     26       C2H2-type
DNA binding   505 – 595     91       Fork-head (PROSITE-ProRule annotation)
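
The Length column follows from the Position(s) column because feature coordinates are 1-based and inclusive, so a range covers end - start + 1 residues. A minimal sketch of that arithmetic follows; the helper name `feature_length` and the `features` dictionary are hypothetical and only illustrate the convention, they are not part of any UniProt tooling.

```python
import re


def feature_length(position: str) -> int:
    """Return the residue count of a 'start – end' range (1-based, inclusive)."""
    # Accept either an en dash or a plain hyphen between the two coordinates.
    match = re.fullmatch(r"\s*(\d+)\s*[–-]\s*(\d+)\s*", position)
    if match is None:
        raise ValueError(f"unrecognised position string: {position!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    # Both endpoints are counted, hence the +1.
    return end - start + 1


# Position strings copied from the Regions table (hypothetical container).
features = {"Zinc finger": "347 – 372", "DNA binding": "505 – 595"}
lengths = {name: feature_length(pos) for name, pos in features.items()}
```

With the two rows above, `lengths` holds 26 for the zinc finger and 91 for the fork-head DNA-binding domain, matching the table.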

GO - Molecular function

  1. DNA binding Source: UniProtKB
  2. metal ion binding Source: UniProtKB-KW
  3. protein homodimerization activity Source: UniProtKB
  4. RNA polymerase II core promoter proximal region sequence-specific DNA binding Source: Ensembl
  5. RNA polymerase II core promoter proximal region sequence-specific DNA binding transcription factor activity involved in negative regulation of transcription Source: Ensembl

GO - Biological process

  1. camera-type eye development Source: Ensembl
  2. caudate nucleus development Source: UniProtKB
  3. cerebellum development Source: Ensembl
  4. cerebral cortex development Source: Ensembl
  5. growth Source: Ensembl
  6. lung alveolus development Source: Ensembl
  7. positive regulation of epithelial cell proliferation involved in lung morphogenesis Source: Ensembl
  8. positive regulation of mesenchymal cell proliferation Source: Ensembl
  9. post-embryonic development Source: Ensembl
  10. putamen development Source: UniProtKB
  11. righting reflex Source: Ensembl
  12. skeletal muscle tissue development Source: Ensembl
  13. smooth muscle tissue development Source: Ensembl
  14. vocal learning Source: Ensembl

Keywords - Molecular function

Repressor

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding, Metal-binding, Zinc

Names & Taxonomy

Protein names
Recommended name: Forkhead box protein P2
Gene names
Name: FOXP2
Organism: Pan troglodytes (Chimpanzee)
Taxonomic identifier: 9598 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Pan
Proteomes: UP000002277: Chromosome 7

Subcellular location

Nucleus (Curated)

GO - Cellular component

  1. nucleus Source: UniProtKB-SubCell

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key   Position(s)   Length   Description               Feature identifier
Chain         1 – 716       716      Forkhead box protein P2   PRO_0000091884

Interaction

Subunit structure

Forms homodimers and heterodimers with FOXP1 and FOXP4. Dimerization is required for DNA-binding. Interacts with CTBP1 (By similarity).

Structure

3D structure databases

ProteinModelPortal: Q8MJA0.
SMR: Q8MJA0. Positions 504-585.

Family & Domains

Region

Feature key   Position(s)   Length   Description
Region        389 – 410     22       Leucine-zipper
Region        423 – 427     5        CTBP1-binding (By similarity)

Compositional bias

Feature key          Position(s)   Length   Description
Compositional bias   53 – 269      217      Gln-rich

Domain

The leucine-zipper is required for dimerization and transcriptional repression (By similarity).

Sequence similarities

Contains 1 C2H2-type zinc finger (Curated).
Contains 1 fork-head DNA-binding domain (PROSITE-ProRule annotation).

Zinc finger

Feature key   Position(s)   Length   Description
Zinc finger   347 – 372     26       C2H2-type

Keywords - Domain

Zinc-finger

Phylogenomic databases

eggNOG: COG5025.
GeneTree: ENSGT00780000121840.
HOGENOM: HOG000092089.
HOVERGEN: HBG051657.
InParanoid: Q8MJA0.
KO: K09409.
OMA: PETKLCV.
OrthoDB: EOG7M6D7G.
TreeFam: TF326978.

Family and domain databases

Gene3D: 1.10.10.10. 1 hit.
InterPro: IPR001766. TF_fork_head.
    IPR018122. TF_fork_head_CS.
    IPR011991. WHTH_DNA-bd_dom.
    IPR015880. Znf_C2H2-like.
Pfam: PF00250. Fork_head. 1 hit.
PRINTS: PR00053. FORKHEAD.
SMART: SM00339. FH. 1 hit.
    SM00355. ZnF_C2H2. 1 hit.
PROSITE: PS00658. FORK_HEAD_2. 1 hit.
    PS50039. FORK_HEAD_3. 1 hit.
    PS00028. ZINC_FINGER_C2H2_1. 1 hit.

Sequence

Sequence status: Complete.

Q8MJA0-1 [UniParc]

        10         20         30         40         50
MMQESATETI SNSSMNQNGM STLSSQLDAG SRDGRSSGDT SSEVSTVELL
        60         70         80         90        100
HLQQQQALQA ARQLLLQQQT SGLKSPKSSD KQRPLQVPVS VAMMTPQVIT
       110        120        130        140        150
PQQMQQILQQ QVLSPQQLQA LLQQQQAVML QQQQLQEFYK KQQEQLHLQL
       160        170        180        190        200
LQQQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ QQHPGKQAKE
       210        220        230        240        250
QQQQQQQQQQ LAAQQLVFQQ QLLQMQQLQQ QQHLLSLQRQ GLISIPPGQA
       260        270        280        290        300
ALPVQSLPQA GLSPAEIQQL WKEVTGVHSM EDNGIKHGGL DLTTNNSSST
       310        320        330        340        350
TSSTTSKASP PITHHSIVNG QSSVLNARRD SSSHEETGAS HTLYGHGVCK
       360        370        380        390        400
WPGCESICED FGQFLKHLNN EHALDDRSTA QCRVQMQVVQ QLEIQLSKER
       410        420        430        440        450
ERLQAMMTHL HMRPSEPKPS PKPLNLVSSV TMSKNMLETS PQSLPQTPTT
       460        470        480        490        500
PTAPVTPITQ GPSVITPASV PNVGAIRRRH SDKYNIPMSS EIAPNYEFYK
       510        520        530        540        550
NADVRPPFTY ATLIRQAIME SSDRQLTLNE IYSWFTRTFA YFRRNAATWK
       560        570        580        590        600
NAVRHNLSLH KCFVRVENVK GAVWTVDEVE YQKRRSQKIT GSPTLVKNIP
       610        620        630        640        650
TSLGYGAALN ASLQAALAES SLPLLSNPGL INNASSGLLQ AVHEDLNGSL
       660        670        680        690        700
DHIDSNGNSS PGCSPQPHIH SIHVKEEPVI AEDEDCPMSL VTTANHSPEL
       710
EDDREIEEEP LSEDLE
Length: 716
Mass (Da): 80,061
Last modified: October 1, 2002 - v1
Checksum: 3169A2786B42F79F
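
Feature positions in this entry index into the sequence above with 1-based, inclusive coordinates, whereas Python slices are 0-based and end-exclusive. The sketch below shows one way to translate between the two, using only residues 301-400 (the two rows ending at 350 and 400) to recover the annotated zinc finger at 347-372. The names `FRAGMENT`, `FRAGMENT_START`, `subsequence` and `C2H2_LIKE` are hypothetical, and the regular expression is a deliberately simplified Cys/His spacing pattern assumed for illustration, not the PROSITE rule that produced the annotation.

```python
import re

# Residues 301-400 of the sequence above, with the display spaces removed.
FRAGMENT_START = 301
FRAGMENT = (
    "TSSTTSKASPPITHHSIVNGQSSVLNARRDSSSHEETGASHTLYGHGVCK"
    "WPGCESICEDFGQFLKHLNNEHALDDRSTAQCRVQMQVVQQLEIQLSKER"
)


def subsequence(start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from the fragment."""
    lo = start - FRAGMENT_START          # 0-based index of the first residue
    hi = end - FRAGMENT_START + 1        # slice end is exclusive, hence +1
    if lo < 0 or hi > len(FRAGMENT) or end < start:
        raise ValueError("range falls outside the stored fragment")
    return FRAGMENT[lo:hi]


# Zinc finger feature, annotated at positions 347 – 372.
zinc_finger = subsequence(347, 372)

# Simplified C2H2-like spacing (an illustrative assumption):
# Cys, 2-4 residues, Cys, 12 residues, His, 3-5 residues, His.
C2H2_LIKE = re.compile(r"C.{2,4}C.{12}H.{3,5}H")
hit = C2H2_LIKE.search(FRAGMENT)

# Convert the 0-based, end-exclusive match span back to entry coordinates.
hit_span = (FRAGMENT_START + hit.start(), FRAGMENT_START + hit.end() - 1)
```

Under these assumptions the slice is 26 residues long, in agreement with the Length column, and the pattern match starts at the first cysteine (position 349) and ends at the last histidine (position 372) of the annotated finger.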

Sequence databases

EMBL / GenBank / DDBJ:
AF512947 mRNA. Translation: AAN03385.1.
AF515051 Genomic DNA. Translation: AAN03409.1.
AF515052 Genomic DNA. Translation: AAN03410.1.
AY143178 mRNA. Translation: AAN60056.1.
AY064549 mRNA. Translation: AAL57735.1.
AY064565, AY064551, AY064552, AY064553, AY064554, AY064555, AY064556, AY064557, AY064558, AY064559, AY064560, AY064561, AY064562, AY064563, AY064564 Genomic DNA. Translation: AAL57731.1.
RefSeq: NP_001009020.1. NM_001009020.2.
    XP_009452287.1. XM_009454012.1.
UniGene: Ptr.6303.

Genome annotation databases

Ensembl: ENSPTRT00000036314; ENSPTRP00000033573; ENSPTRG00000019608.
GeneID: 449627.
KEGG: ptr:449627.

Cross-references

Organism-specific databases

CTD: 93986.

Miscellaneous databases

NextBio: 20832734.

Publications

  1. "Molecular evolution of FOXP2, a gene involved in speech and language."
    Enard W., Przeworski M., Fisher S.E., Lai C.S.L., Wiebe V., Kitano T., Monaco A.P., Paeaebo S.
    Nature 418:869-872(2002)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
  2. "Accelerated protein evolution and origins of human-specific features: Foxp2 as an example."
    Zhang J., Webb D.M., Podlaha O.
    Genetics 162:1825-1835(2002)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA].
  3. "The FOXP2 gene, implicated in language development, is conserved in mammalian evolution."
    Walter N.A.R., Thompson J., McGoldrick D.J., Messier W.
    Submitted (NOV-2001) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA].
    Tissue: Blood.

Entry information

Entry name: FOXP2_PANTR
Accession: Primary (citable) accession number: Q8MJA0
Secondary accession number(s): Q8MHX3
Entry history
Integrated into UniProtKB/Swiss-Prot: November 21, 2003
Last sequence update: October 1, 2002
Last modified: February 4, 2015
This is version 96 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
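
The identity and overlap cut-offs described above can be written as a simple membership test. This is only a sketch of the stated thresholds, assuming identity and overlap with the seed have already been computed as fractions between 0 and 1; the names `THRESHOLDS` and `joins_cluster` are hypothetical, and the actual UniRef clustering procedure is not reproduced here.

```python
# (minimum identity, minimum overlap with the seed sequence), as fractions,
# taken from the UniRef90 and UniRef50 descriptions above.
THRESHOLDS = {
    "UniRef90": (0.90, 0.80),
    "UniRef50": (0.50, 0.80),
}


def joins_cluster(level: str, identity: float, overlap: float) -> bool:
    """Return True if a sequence meets the stated cut-offs for the given level."""
    if level not in THRESHOLDS:
        raise ValueError(f"unknown UniRef level: {level!r}")
    min_identity, min_overlap = THRESHOLDS[level]
    # Both conditions must hold: enough identity and enough overlap.
    return identity >= min_identity and overlap >= min_overlap


# A sequence 93% identical to the seed over 85% of its length.
example = joins_cluster("UniRef90", identity=0.93, overlap=0.85)
```

A sequence with high identity but short overlap, for instance 93% identity over 70% of the seed, fails the test at both levels, which is why the overlap condition is checked separately from identity.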