Reviewed, UniProtKB/Swiss-Prot Q15911 (ZFHX3_HUMAN)

Last modified November 25, 2008. Version 84.

Names and origin

Protein names
Recommended name:
    Zinc finger homeobox protein 3
Alternative name(s):
    Zinc finger homeodomain protein 3
      Short name=ZFH-3
    Alpha-fetoprotein enhancer-binding protein
    AT motif-binding factor
    AT-binding transcription factor 1
Gene names
Name: ZFHX3
Synonyms: ATBF1
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 3703 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is not processed.
Protein existence: Evidence at protein level.

General annotation (Comments)

Function

Transcriptional activator that binds to the AT-rich core sequence of the enhancer element of the AFP gene.

Subunit structure

Interacts with FNBP3 (by similarity).

Subcellular location

Nucleus.

Post-translational modification

Phosphorylated upon DNA damage, probably by ATM or ATR.

Sequence similarities

Contains 22 C2H2-type zinc fingers.

Contains 4 homeobox DNA-binding domains.

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform A (identifier: Q15911-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform B (identifier: Q15911-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-914: Missing.
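
The "1-914: Missing" rule can be read as a deletion of the first 914 residues of the canonical sequence. The sketch below illustrates that reading; the function name `apply_missing` and the constant names are hypothetical and are not part of any UniProt tool. It also checks the length bookkeeping: 3703 canonical residues minus 914 missing residues gives 2789, which agrees with the isoform B length listed under Sequences.

```python
def apply_missing(canonical: str, start: int, end: int) -> str:
    """Return `canonical` with residues start..end removed.

    Positions are 1-based and inclusive, as in the entry's
    "1-914: Missing." notation. (Hypothetical helper for illustration.)
    """
    if not (1 <= start <= end <= len(canonical)):
        raise ValueError("range lies outside the sequence")
    return canonical[:start - 1] + canonical[end:]


# Length bookkeeping for this entry, using only numbers stated in it.
CANONICAL_LENGTH = 3703          # isoform A (canonical)
MISSING_START, MISSING_END = 1, 914
missing_count = MISSING_END - MISSING_START + 1
isoform_b_length = CANONICAL_LENGTH - missing_count

# Toy demonstration on a short made-up string (not real sequence data).
toy = apply_missing("ABCDEFGHIJ", 1, 4)
```

Applied to the full isoform A sequence, `apply_missing(seq, 1, 914)` would be expected to start at residue 915 of the canonical numbering.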

Sequence annotation (Features)

Feature key           Position(s)   Length  Description                     Feature identifier

Molecule processing

Chain                 1 – 3703      3703    Zinc finger homeobox protein 3  PRO_0000046930

Regions

Zinc finger           282 – 305     24      C2H2-type 1
Zinc finger           640 – 663     24      C2H2-type 2
Zinc finger           671 – 694     24      C2H2-type 3
Zinc finger           726 – 750     25      C2H2-type 4
Zinc finger           804 – 828     25      C2H2-type 5; atypical
Zinc finger           945 – 968     24      C2H2-type 6; degenerate
Zinc finger           984 – 1008    25      C2H2-type 7; atypical
Zinc finger           1040 – 1064   25      C2H2-type 8; atypical
Zinc finger           1088 – 1112   25      C2H2-type 9; atypical
Zinc finger           1223 – 1246   24      C2H2-type 10; atypical
Zinc finger           1252 – 1275   24      C2H2-type 11
Zinc finger           1360 – 1385   26      C2H2-type 12
Zinc finger           1401 – 1423   23      C2H2-type 13
Zinc finger           1429 – 1452   24      C2H2-type 14
Zinc finger           1545 – 1569   25      C2H2-type 15
Zinc finger           1596 – 1620   25      C2H2-type 16
Zinc finger           1983 – 2006   24      C2H2-type 17
DNA binding           2145 – 2204   60      Homeobox 1
DNA binding           2242 – 2301   60      Homeobox 2
Zinc finger           2328 – 2351   24      C2H2-type 18; atypical
Zinc finger           2530 – 2552   23      C2H2-type 19
DNA binding           2641 – 2700   60      Homeobox 3
Zinc finger           2711 – 2734   24      C2H2-type 20
DNA binding           2944 – 3003   60      Homeobox 4
Zinc finger           3024 – 3048   25      C2H2-type 21
Zinc finger           3529 – 3553   25      C2H2-type 22
Compositional bias    104 – 107     4       Poly-Pro
Compositional bias    460 – 489     30      Poly-Glu
Compositional bias    770 – 784     15      Poly-Ala
Compositional bias    1723 – 1743   21      Poly-Gln
Compositional bias    1789 – 1794   6       Poly-Gln
Compositional bias    1852 – 1857   6       Poly-Gln
Compositional bias    2037 – 2052   16      Poly-Pro
Compositional bias    3197 – 3209   13      Poly-Gln
Compositional bias    3210 – 3214   5       Poly-Pro
Compositional bias    3227 – 3231   5       Poly-Gln
Compositional bias    3376 – 3389   14      Poly-Gln
Compositional bias    3392 – 3395   4       Poly-Gln
Compositional bias    3507 – 3527   21      Poly-Gly
Compositional bias    3597 – 3600   4       Poly-Pro
Compositional bias    3636 – 3639   4       Poly-Ser
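
Each region row gives a start position, an end position, and a length, and the three are related by length = end - start + 1 because positions are 1-based and inclusive. As a rough consistency check, the sketch below applies that relation to a few rows copied from the table above; the names `feature_length`, `rows`, and `mismatches` are hypothetical and chosen only for this illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature spanning start..end (1-based, inclusive)."""
    return end - start + 1


# (feature key, start, end, stated length), copied from the Regions rows.
rows = [
    ("Zinc finger", 282, 305, 24),
    ("Zinc finger", 1360, 1385, 26),
    ("DNA binding", 2145, 2204, 60),
    ("Compositional bias", 460, 489, 30),
    ("Compositional bias", 3507, 3527, 21),
]

# Collect any row whose stated length disagrees with its positions.
mismatches = [row for row in rows if feature_length(row[1], row[2]) != row[3]]
```

An empty `mismatches` list means the sampled rows are internally consistent; the same check could be run over the whole table once it is parsed.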

Amino acid modifications

Modified residue      267           1       Phosphoserine
Modified residue      571           1       Phosphoserine
Modified residue      1180          1       Phosphoserine
Modified residue      1197          1       Phosphoserine
Modified residue      1204          1       Phosphoserine
Modified residue      1590          1       Phosphoserine (by similarity)

Natural variations

Alternative sequence  1 – 914       914     Missing in isoform B.           VSP_006825
Natural variant       72            1       S → A: dbSNP rs7193297.         VAR_026663
Natural variant       428           1       T → P: dbSNP rs16971436.        VAR_026664
Natural variant       460           1       E → Q: dbSNP rs2073852.         VAR_019968
Natural variant       777           1       V → A: dbSNP rs4788682.         VAR_026665
Natural variant       3374          1       A → V                           VAR_011694
Natural variant       3377 – 3384   8       Missing                         VAR_011695
Natural variant       3421          1       P → A: dbSNP rs8044440.         VAR_026666
Natural variant       3527          1       G → GGG                         VAR_011696

Experimental info

Sequence conflict     422           1       P → A (Ref.1)
Sequence conflict     579           1       A → T (Ref.1)
Sequence conflict     767           1       S → I (Ref.1)
Sequence conflict     846 – 849     4       RHLG → HHRV (Ref.1)
Sequence conflict     997           1       A → S (Ref.4)
Sequence conflict     1150 – 1190   41      EEAIE…LTDSP → GEWSHRHGRPRLGLGVHLLE TSRGLLFEGDVTDPAGPHVP Y (Ref.4)

Sequences

Isoform A

Last modified June 13, 2006. Version 2.
Checksum: 395F5D14A08112CB
Length: 3,703 AA
Mass (Da): 404,419
        10         20         30         40         50         60 
MEGCDSPVVS GKDNGCGIPQ HQQWTELNST HLPDKPSSME QSTGESHGPL DSLRAPFNER 

        70         80         90        100        110        120 
LAESTASAGP PSEPASKEVT CNECSASFAS LQTYMEHHCP SARPPPPLRE ESASDTGEEG 

       130        140        150        160        170        180 
DEESDVENLA GEIVYQPDGS AYIVESLSQL TQGGGACGSG SGSGPLPSLF LNSLPGAGGK 

       190        200        210        220        230        240 
QGDPSCAAPV YPQIINTFHI ASSFGKWFEG PDQAFPNTSA LAGLSPVLHS FRVFDVRHKS 

       250        260        270        280        290        300 
NKDYLNSDGS AKSSCVSKDV PNNVDLSKFD GFVLYGKRKP ILMCFLCKLS FGYVRSFVTH 

       310        320        330        340        350        360 
AVHDHRMTLS EDERKILSNK NISAIIQGIG KDKEPLVSFL EPKNKNFQHP LVSTANLIGP 

       370        380        390        400        410        420 
GHSFYGKFSG IRMEGEEALP AGSAAGPEQP QAGLLTPSTL LNLGGLTSSV LKTPITSVPL 

       430        440        450        460        470        480 
GPLASSPTKS SEGKDSGAAE GEKQEVGDGD CFSEKVEPAE EEAEEEEEEE EAEEEEEEEE 

       490        500        510        520        530        540 
EEEEEEEDEG CKGLFPSELD EELEDRPHEE PGAAAGSSSK KDLALSNQSI SNSPLMPNVL 

       550        560        570        580        590        600 
QTLSRGTAST SSNSASSFVV FDGANRRNRL SFNSEGVRAN VAEGGRRLDF ADESANKDNA 

       610        620        630        640        650        660 
TAPEPNESTE GDDGGFVPHH QHAGSLCELG VGECPSGSGV ECPKCDTVLG SSRSLGGHMT 

       670        680        690        700        710        720 
MMHSRNSCKT LKCPKCNWHY KYQQTLEAHM KEKHPEPGGS CVYCKSGQPH PRLARGESYT 

       730        740        750        760        770        780 
CGYKPFRCEV CNYSTTTKGN LSIHMQSDKH LNNMQNLQNG GGEQVFSHTA GAAAAAVAAA 

       790        800        810        820        830        840 
AAAANISSSC GAPSPTKPKT KPTWRCEVCD YETNVARNLR IHMTSEKHMH NMMLLQQNMT 

       850        860        870        880        890        900 
QIQHNRHLGL GSLPSPAEAE LYQYYLAQNM NLPNLKMDSA ASDAQFMMSG FQLDPAGPMA 

       910        920        930        940        950        960 
AMTPALVGGE IPLDMRLGGG QLVSEELMNL GESFIQTNDP SLKLFQCAVC NKFTTDNLDM 

       970        980        990       1000       1010       1020 
LGLHMNVERS LSEDEWKAVM GDSYQCKLCR YNTQLKANFQ LHCKTDKHVQ KYQLVAHIKE 

      1030       1040       1050       1060       1070       1080 
GGKANEWRLK CVAIGNPVHL KCNACDYYTN SLEKLRLHTV NSRHEASLKL YKHLQQHESG 

      1090       1100       1110       1120       1130       1140 
VEGESCYYHC VLCNYSTKAK LNLIQHVRSM KHQRSESLRK LQRLQKGLPE EDEDLGQIFT 

      1150       1160       1170       1180       1190       1200 
IRRCPSTDPE EAIEDVEGPS ETAADPEELA KDQEGGASSS QAEKELTDSP ATSKRISFPG 

      1210       1220       1230       1240       1250       1260 
SSESPLSSKR PKTAEEIKPE QMYQCPYCKY SNADVNRLRV HAMTQHSVQP MLRCPLCQDM 

      1270       1280       1290       1300       1310       1320 
LNNKIHLQLH LTHLHSVAPD CVEKLIMTVT TPEMVMPSSM FLPAAVPDRD GNSNLEEAGK 

      1330       1340       1350       1360       1370       1380 
QPETSEDLGK NILPSASTEQ SGDLKPSPAD PGSVREDSGF ICWKKGCNQV FKTSAALQTH 

      1390       1400       1410       1420       1430       1440 
FNEVHAKRPQ LPVSDRHVYK YRCNQCSLAF KTIEKLQLHS QYHVIRAATM CCLCQRSFRT 

      1450       1460       1470       1480       1490       1500 
FQALKKHLET SHLELSEADI QQLYGGLLAN GDLLAMGDPT LAEDHTIIVE EDKEEESDLE 

      1510       1520       1530       1540       1550       1560 
DKQSPTGSDS GSVQEDSGSE PKRALPFRKG PNFTMEKFLD PSRPYKCTVC KESFTQKNIL 

      1570       1580       1590       1600       1610       1620 
LVHYNSVSHL HKLKRALQES ATGQPEPTSS PDNKPFKCNT CNVAYSQSST LEIHMRSVLH 

      1630       1640       1650       1660       1670       1680 
QTKARAAKLE AASGSSNGTG NSSSISLSSS TPSPVSTSGS NTFTTSNPSS AGIAPSSNLL 

      1690       1700       1710       1720       1730       1740 
SQVPTESVGM PPLGNPIGAN IASPSEPKEA NRKKLADMIA SRQQQQQQQQ QQQQQQQQQQ 

      1750       1760       1770       1780       1790       1800 
QAQTLAQAQA QVQAHLQQEL QQQAALIQSQ LFNPTLLPHF PMTTETLLQL QQQQHLLFPF 

      1810       1820       1830       1840       1850       1860 
YIPSAEFQLN PEVSLPVTSG ALTLTGTGPG LLEDLKAQVQ VPQQSHQQIL PQQQQNQLSI 

      1870       1880       1890       1900       1910       1920 
AQSHSALLQP SQHPEKKNKL VIKEKEKESQ RERDSAEGGE GNTGPKETLP DALKAKEKKE 

      1930       1940       1950       1960       1970       1980 
LAPGGGSEPS MLPPRIASDA RGNATKALLE NFGFELVIQY NENKQKVQKK NGKTDQGENL 

      1990       2000       2010       2020       2030       2040 
EKLECDSCGK LFSNILILKS HQEHVHQNYF PFKQLERFAK QYRDHYDKLY PLRPQTPEPP 

      2050       2060       2070       2080       2090       2100 
PPPPPPPPPP LPAAPPQPAS TPAIPASAPP ITSPTIAPAQ PSVPLTQLSM PMELPIFSPL 

      2110       2120       2130       2140       2150       2160 
MMQTMPLQTL PAQLPPQLGP VEPLPADLAQ LYQHQLNPTL LQQQNKRPRT RITDDQLRVL 

      2170       2180       2190       2200       2210       2220 
RQYFDINNSP SEEQIKEMAD KSGLPQKVIK HWFRNTLFKE RQRNKDSPYN FSNPPITSLE 

      2230       2240       2250       2260       2270       2280 
ELKIDSRPPS PEPPKQEYWG SKRSSRTRFT DYQLRVLQDF FDANAYPKDD EFEQLSNLLN 

      2290       2300       2310       2320       2330       2340 
LPTRVIVVWF QNARQKARKN YENQGEGKDG ERRELTNDRY IRTSNLNYQC KKCSLVFQRI 

      2350       2360       2370       2380       2390       2400 
FDLIKHQKKL CYKDEDEEGQ DDSQNEDSMD AMEILTPTSS SCSTPMPSQA YSAPAPSANN 

      2410       2420       2430       2440       2450       2460 
TASSAFLQLT AEAEELATFN SKTEAGDEKP KLAEAPSAQP NQTQEKQGQP KPELQQQEQP 

      2470       2480       2490       2500       2510       2520 
EQKTNTPQQK LPQLVSLPSL PQPPPQAPPP QCPLPQSSPS PSQLSHLPLK PLHTSTPQQL 

      2530       2540       2550       2560       2570       2580 
ANLPPQLIPY QCDQCKLAFP SFEHWQEHQQ LHFLSAQNQF IHPQFLDRSL DMPFMLFDPS 

      2590       2600       2610       2620       2630       2640 
NPLLASQLLS GAIPQIPASS ATSPSTPTST MNTLKRKLEE KASASPGEND SGTGGEEPQR 

      2650       2660       2670       2680       2690       2700 
DKRLRTTITP EQLEILYQKY LLDSNPTRKM LDHIAHEVGL KKRVVQVWFQ NTRARERKGQ 

      2710       2720       2730       2740       2750       2760 
FRAVGPAQAH RRCPFCRALF KAKTALEAHI RSRHWHEAKR AGYNLTLSAM LLDCDGGLQM 

      2770       2780       2790       2800       2810       2820 
KGDIFDGTSF SHLPPSSSDG QGVPLSPVSK TMELSPRTLL SPSSIKVEGI EDFESPSMSS 

      2830       2840       2850       2860       2870       2880 
VNLNFDQTKL DNDDCSSVNT AITDTTTGDE GNADNDSATG IATETKSSSA PNEGLTKAAM 

      2890       2900       2910       2920       2930       2940 
MAMSEYEDRL SSGLVSPAPS FYSKEYDNEG TVDYSETSSL ADPCSPSPGA SGSAGKSGDS 

      2950       2960       2970       2980       2990       3000 
GDRPGQKRFR TQMTNLQLKV LKSCFNDYRT PTMLECEVLG NDIGLPKRVV QVWFQNARAK 

      3010       3020       3030       3040       3050       3060 
EKKSKLSMAK HFGINQTSYE GPKTECTLCG IKYSARLSVR DHIFSQQHIS KVKDTIGSQL 

      3070       3080       3090       3100       3110       3120 
DKEKEYFDPA TVRQLMAQQE LDRIKKANEV LGLAAQQQGM FDNTPLQALN LPTAYPALQG 

      3130       3140       3150       3160       3170       3180 
IPPVLLPGLN SPSLPGFTPS NTALTSPKPN LMGLPSTTVP SPGLPTSGLP NKPSSASLSS 

      3190       3200       3210       3220       3230       3240 
PTPAQATMAM GPQQPPQQQQ QQQQPQVQQP PPPPAAQPPP TPQLPLQQQQ QRKDKDSEKV 

      3250       3260       3270       3280       3290       3300 
KEKEKAHKGK GEPLPVPKKE KGEAPTATAA TISAPLPTME YAVDPAQLQA LQAALTSDPT 

      3310       3320       3330       3340       3350       3360 
ALLTSQFLPY FVPGFSPYYA PQIPGALQSG YLQPMYGMEG LFPYSPALSQ ALMGLSPGSL 

      3370       3380       3390       3400       3410       3420 
LQQYQQYQQS LQEAIQQQQQ RQLQQQQQQK VQQQQPKASQ TPVPPGAPSP DKDPAKESPK 

      3430       3440       3450       3460       3470       3480 
PEEQKNTPRE VSPLLPKLPE EPEAESKSAD SLYDPFIVPK VQYKLVCRKC QAGFSDEEAA 

      3490       3500       3510       3520       3530       3540 
RSHLKSLCFF GQSVVNLQEM VLHVPTGGGG GGSGGGGGGG GGGGGGGSYH CLACESALCG 

      3550       3560       3570       3580       3590       3600 
EEALSQHLES ALHKHRTITR AARNAKEHPS LLPHSACFPD PSTASTSQSA AHSNDSPPPP 

      3610       3620       3630       3640       3650       3660 
SAAAPSSASP HASRKSWPQV VSRASAAKPP SFPPLSSSST VTSSSCSTSG VQPSMPTDDY 

      3670       3680       3690       3700 
SEESDTDLSQ KSDGPASPVE GPKDPSCPKD SGLTSVGTDT FRL 
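
The sequence above is shown in a display layout: ruler lines carrying position numbers alternate with residue lines split into blocks of ten. To work with it programmatically, the rulers and spaces usually need to be stripped first. The following is a minimal sketch of that cleanup, assuming ruler lines contain only digits and whitespace; `parse_sequence_block` is a hypothetical helper name, and the sample input is the last ruler and residue line of the listing above.

```python
def parse_sequence_block(text: str) -> str:
    """Join a ruler-annotated sequence listing into one residue string.

    Assumption: a line made up only of digits and whitespace is a
    position ruler; every other non-empty line holds residues in
    space-separated blocks.
    """
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue  # blank spacer line
        if all(ch.isdigit() or ch.isspace() for ch in stripped):
            continue  # position ruler
        residues.append("".join(stripped.split()))
    return "".join(residues)


# Final two lines of the isoform A listing (positions 3661 to 3703).
tail_block = """
      3670       3680       3690       3700
SEESDTDLSQ KSDGPASPVE GPKDPSCPKD SGLTSVGTDT FRL
"""
tail = parse_sequence_block(tail_block)
```

Here `tail` holds the 43 residues from position 3661 to 3703. Running the same function over the complete listing should give a string whose length matches the stated sequence length of 3,703.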

Isoform B

Checksum: 457BA84E550E9AA4
Length: 2,789 AA
Mass (Da): 306,664