Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Box A-binding factor

Gene

srp

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

May function as a transcriptional activator protein and may play a key role in the organogenesis of the fat body. Binds a sequence element (5'-[TA]GATAA-3') found in the larval promoters of all known alcohol dehydrogenase (ADH) genes. Acts as a homeotic gene downstream of the terminal gap gene HKB to promote morphogenesis and differentiation of anterior and posterior midgut.

Regions

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Zinc fingeri803 – 827GATA-typePROSITE-ProRule annotationAdd BLAST25

GO - Molecular functioni

GO - Biological processi

  • amnioserosa maintenance Source: FlyBase
  • autophagy Source: FlyBase
  • cell fate commitment Source: FlyBase
  • cell fate determination Source: FlyBase
  • crystal cell differentiation Source: FlyBase
  • dorsal closure Source: FlyBase
  • embryonic heart tube development Source: FlyBase
  • embryonic hemopoiesis Source: FlyBase
  • endoderm development Source: FlyBase
  • endoderm formation Source: FlyBase
  • fat body development Source: FlyBase
  • germ-band shortening Source: FlyBase
  • germ cell migration Source: FlyBase
  • head involution Source: FlyBase
  • hemocyte development Source: FlyBase
  • hemopoiesis Source: FlyBase
  • larval lymph gland hemopoiesis Source: FlyBase
  • lymph gland development Source: FlyBase
  • mesodermal cell fate commitment Source: FlyBase
  • midgut development Source: FlyBase
  • negative regulation of crystal cell differentiation Source: FlyBase
  • phagocytosis Source: FlyBase
  • positive regulation of transcription, DNA-templated Source: FlyBase
  • regulation of glucose metabolic process Source: FlyBase
  • salivary gland morphogenesis Source: FlyBase
Complete GO annotation...

Keywords - Molecular functioni

Activator, Developmental protein

Keywords - Biological processi

Transcription, Transcription regulation

Keywords - Ligandi

DNA-binding, Metal-binding, Zinc

Names & Taxonomyi

Protein namesi
Recommended name:
Box A-binding factor
Short name:
ABF
Alternative name(s):
GATA-binding factor B
Protein serpent
Transcription factor GATA-B
dGATA-B
Gene namesi
Name:srp
Synonyms:ABF
ORF Names:CG3992
OrganismiDrosophila melanogaster (Fruit fly)
Taxonomic identifieri7227 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaArthropodaHexapodaInsectaPterygotaNeopteraEndopterygotaDipteraBrachyceraMuscomorphaEphydroideaDrosophilidaeDrosophilaSophophora
Proteomesi
  • UP000000803 Componenti: Chromosome 3R

Organism-specific databases

FlyBaseiFBgn0003507. srp.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000834611 – 1264Box A-binding factorAdd BLAST1264

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei1208Phosphoserine1 Publication1
Modified residuei1210Phosphoserine1 Publication1

Keywords - PTMi

Phosphoprotein

Proteomic databases

PaxDbiP52172.
PRIDEiP52172.

PTM databases

iPTMnetiP52172.

Expressioni

Developmental stagei

Initially observed in the analgen of the anterior and posterior midgut and the cephalic mesoderm. It is found in both the endodermal and mesodermal germ layers and for a brief period during gastrulation it is expressed in the amnioserosa. During germ band retraction it becomes restricted to the fat body.

Gene expression databases

BgeeiFBgn0003507.
ExpressionAtlasiP52172. baseline.
GenevisibleiP52172. DM.

Interactioni

GO - Molecular functioni

Protein-protein interaction databases

BioGridi66998. 8 interactors.
IntActiP52172. 1 interactor.
MINTiMINT-318912.
STRINGi7227.FBpp0082669.

Structurei

3D structure databases

ProteinModelPortaliP52172.
SMRiP52172.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi106 – 113Poly-Ala8
Compositional biasi181 – 188Poly-Asn8
Compositional biasi212 – 221Poly-Gln10
Compositional biasi225 – 228Poly-Gln4
Compositional biasi239 – 253Poly-GlnAdd BLAST15
Compositional biasi273 – 283Poly-AlaAdd BLAST11
Compositional biasi327 – 335Poly-Gly9
Compositional biasi343 – 353Poly-AlaAdd BLAST11
Compositional biasi421 – 426Poly-Gln6
Compositional biasi430 – 439Poly-Gln10
Compositional biasi440 – 444Poly-His5
Compositional biasi535 – 555Poly-GlnAdd BLAST21
Compositional biasi566 – 570Poly-Ser5
Compositional biasi579 – 582Poly-Ser4
Compositional biasi616 – 621Poly-Gly6
Compositional biasi690 – 699Poly-Ala10
Compositional biasi738 – 741Poly-Gly4
Compositional biasi923 – 929Poly-Gln7
Compositional biasi966 – 971Poly-Gln6
Compositional biasi992 – 998Poly-Gln7
Compositional biasi1024 – 1031Poly-Asn8
Compositional biasi1036 – 1042Poly-Asn7
Compositional biasi1063 – 1072Poly-Ser10
Compositional biasi1087 – 1099Poly-AlaAdd BLAST13
Compositional biasi1185 – 1199Poly-GlnAdd BLAST15
Compositional biasi1224 – 1233Poly-Gln10

Sequence similaritiesi

Contains 1 GATA-type zinc finger.PROSITE-ProRule annotation

Zinc finger

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Zinc fingeri803 – 827GATA-typePROSITE-ProRule annotationAdd BLAST25

Keywords - Domaini

Zinc-finger

Phylogenomic databases

eggNOGiKOG1601. Eukaryota.
COG5641. LUCA.
GeneTreeiENSGT00760000119221.
InParanoidiP52172.
OMAiEDGYGSP.
OrthoDBiEOG091G07TZ.
PhylomeDBiP52172.

Family and domain databases

Gene3Di3.30.50.10. 1 hit.
InterProiIPR000679. Znf_GATA.
IPR013088. Znf_NHR/GATA.
[Graphical view]
PfamiPF00320. GATA. 1 hit.
[Graphical view]
PRINTSiPR00619. GATAZNFINGER.
SMARTiSM00401. ZnF_GATA. 1 hit.
[Graphical view]
PROSITEiPS00344. GATA_ZN_FINGER_1. 1 hit.
PS50114. GATA_ZN_FINGER_2. 1 hit.
[Graphical view]

Sequences (3)i

Sequence statusi: Complete.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: P52172-1) [UniParc]FASTAAdd to basket
Also known as: A

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MTKTTKPKEK AAAGGAVIGS GSGLGSVTKA GGGSLLSNAA DSKIRTAKSN
60 70 80 90 100
NNKRQAGRAA TALAATTTAS ALAATTTAGA TGSNAAANET EIAIETENGE
110 120 130 140 150
AATPTAAATA AAANLSSLES ARSQALTSVV SETARQAVTT ANASATSTST
160 170 180 190 200
VTAATEIATA TASDTAATSE AAIDDDPSAI NTNNNNNNSK AQNDASESVK
210 220 230 240 250
TKVISYHQSE DQQQQQQQQA QIYEQQQQFL SQQLISHHQQ EQHQQAQQQQ
260 270 280 290 300
HQQVVQEQHQ ASWLAYDLTS GSAAAAAAAA AAASHPHLFG QFSYPPSHHT
310 320 330 340 350
PTQLYEHYPS TDPIMRNNFA FYSVYTGGGG GVGVGMTSHE HLAAAAAAAA
360 370 380 390 400
AVAQGTTPNI DEVIQDTLKD ECFEDGHSTD YHVLTSVSDM HTLKDSSPYA
410 420 430 440 450
LTHEQLHQQQ HHHQQQLHHH QQQQQQLYHQ QQQQQQQQQH HHHHNNSTSS
460 470 480 490 500
AGGDSPSSSH ALSTLQSFTQ LTSATQRDSL SPENDAYFAA AQLGSSLQNS
510 520 530 540 550
SVYAGSLLTQ TANGIQYGMQ SPNQTQAHLQ QQHHQQQQQQ HQQHQQQQLQ
560 570 580 590 600
QQQQQHHHNQ HQHHNSSSSS PGPAGLHHSS SSAATAAAVA AATAAVNGHN
610 620 630 640 650
SSLEDGYGSP RSSHSGGGGG GTLPAFQRIA YPNSGSVERY APITNYRGQN
660 670 680 690 700
DTWFDPLSYA TSSSGQAQLG VGVGAGVVSN VIRNGRAISA ANAAAAAAAD
710 720 730 740 750
GTTGRVDPGT FLSASASLSA MAAESGGDFY KPNSFNVGGG GRSKANTSGA
760 770 780 790 800
ASSYSCPGSN ATSAATSAVA SGTAATAATT LDEHVSRANS RRLSASKRAG
810 820 830 840 850
LSCSNCHTTH TSLWRRNPAG EPVCNACGLY YKLHSVPRPL TMKKDTIQKR
860 870 880 890 900
KRKPKGTKSE KSKSKSKNAL NAIMESGSLV TNCHNVGVVL DSSQMDVNDD
910 920 930 940 950
MKPQLDLKPY NSYSSQPQQQ LPQYQQQQQL LMADQHSSAA SSPHSMGSTS
960 970 980 990 1000
LSPSAMSHQH QTHPHQQQQQ QLCSGLDMSP NSNYQMSPLN MQQHQQQQSC
1010 1020 1030 1040 1050
SMQHSPSTPT SIFNTPSPTH QLHNNNNNNN NSSIFNNNNN NNSSSNENNN
1060 1070 1080 1090 1100
KLIQKYLQAQ QLSSSSNSGS TSDHQLLAQL LPNSITAAAA AAAAAAAAAI
1110 1120 1130 1140 1150
KTEALSLTSQ ANCSTASAGL MVTSTPTTAS STLSSLSHSN IISLQNPYHQ
1160 1170 1180 1190 1200
AGMTLCKPTR PSPPYYLTPE EDEQPALIKM EEMDQSQQQQ QQQQHQQQQH
1210 1220 1230 1240 1250
GEIMLSRSAS LDEHYELAAF QRHQQQQQQL QQQTAALLGQ HEQHVTNYAM
1260
HKFGVDRETV VKME
Length:1,264
Mass (Da):134,158
Last modified:August 30, 2005 - v2
Checksum:i3E337C34555B23C2
GO
Isoform 2 (identifier: P52172-2) [UniParc]FASTAAdd to basket
Also known as: B

The sequence of this isoform differs from the canonical sequence as follows:
     721-790: MAAESGGDFY...LDEHVSRANS → TLFDADYFTE...GMNRPLIKQP

Show »
Length:1,249
Mass (Da):133,783
Checksum:i83CDAD995A7DB7DB
GO
Isoform 3 (identifier: P52172-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-518: Missing.

Show »
Length:746
Mass (Da):79,790
Checksum:i81BC4DD66E169F1C
GO

Sequence cautioni

The sequence AAL39968 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.Curated
The sequence CAA53807 differs from that shown. Reason: Frameshift at positions 490, 584, 974, 1011 and 1015.Curated
The sequence CAA68943 differs from that shown. Reason: Frameshift at positions 132, 976 and 1015.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti138V → A in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti161T → P in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti181N → I in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti274A → T in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti278A → T in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti282A → T in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti346A → T in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti412 – 413HH → QQHQ in CAA53807 (PubMed:8187633).Curated2
Sequence conflicti931L → V in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti931L → V in CAA53807 (PubMed:8187633).Curated1
Sequence conflicti1031 – 1035NSSIF → SSLFN in CAA68943 (PubMed:9012522).Curated5
Sequence conflicti1031 – 1035NSSIF → SSLFN in CAA53807 (PubMed:8187633).Curated5
Sequence conflicti1069G → D in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti1069G → D in CAA53807 (PubMed:8187633).Curated1
Sequence conflicti1230 – 1231LQ → PE in CAA68943 (PubMed:9012522).Curated2
Sequence conflicti1230 – 1231LQ → PE in CAA53807 (PubMed:8187633).Curated2
Sequence conflicti1236A → S in CAA68943 (PubMed:9012522).Curated1
Sequence conflicti1236A → S in CAA53807 (PubMed:8187633).Curated1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0151861 – 518Missing in isoform 3. 1 PublicationAdd BLAST518
Alternative sequenceiVSP_015187721 – 790MAAES…SRANS → TLFDADYFTEGRECVNCGAI STPLWRRDNTGHYLCNACGL YMKMNGMNRPLIKQP in isoform 2. CuratedAdd BLAST70

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AE014297 Genomic DNA. Translation: AAF55261.2.
AE014297 Genomic DNA. Translation: AAN13691.1.
BT011543 mRNA. Translation: AAS15679.1.
Y07662 mRNA. Translation: CAA68943.1. Frameshift.
AY069823 mRNA. Translation: AAL39968.1. Different initiation.
X76217 mRNA. Translation: CAA53807.1. Frameshift.
PIRiS40382.
RefSeqiNP_001027190.1. NM_001032019.3. [P52172-3]
NP_001262618.1. NM_001275689.1. [P52172-3]
NP_732098.1. NM_169694.3. [P52172-2]
NP_732100.2. NM_169696.3. [P52172-1]
UniGeneiDm.1934.

Genome annotation databases

EnsemblMetazoaiFBtr0083215; FBpp0082669; FBgn0003507. [P52172-1]
GeneIDi41944.
KEGGidme:Dmel_CG3992.

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AE014297 Genomic DNA. Translation: AAF55261.2.
AE014297 Genomic DNA. Translation: AAN13691.1.
BT011543 mRNA. Translation: AAS15679.1.
Y07662 mRNA. Translation: CAA68943.1. Frameshift.
AY069823 mRNA. Translation: AAL39968.1. Different initiation.
X76217 mRNA. Translation: CAA53807.1. Frameshift.
PIRiS40382.
RefSeqiNP_001027190.1. NM_001032019.3. [P52172-3]
NP_001262618.1. NM_001275689.1. [P52172-3]
NP_732098.1. NM_169694.3. [P52172-2]
NP_732100.2. NM_169696.3. [P52172-1]
UniGeneiDm.1934.

3D structure databases

ProteinModelPortaliP52172.
SMRiP52172.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi66998. 8 interactors.
IntActiP52172. 1 interactor.
MINTiMINT-318912.
STRINGi7227.FBpp0082669.

PTM databases

iPTMnetiP52172.

Proteomic databases

PaxDbiP52172.
PRIDEiP52172.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaiFBtr0083215; FBpp0082669; FBgn0003507. [P52172-1]
GeneIDi41944.
KEGGidme:Dmel_CG3992.

Organism-specific databases

CTDi41944.
FlyBaseiFBgn0003507. srp.

Phylogenomic databases

eggNOGiKOG1601. Eukaryota.
COG5641. LUCA.
GeneTreeiENSGT00760000119221.
InParanoidiP52172.
OMAiEDGYGSP.
OrthoDBiEOG091G07TZ.
PhylomeDBiP52172.

Miscellaneous databases

GenomeRNAii41944.
PROiP52172.

Gene expression databases

BgeeiFBgn0003507.
ExpressionAtlasiP52172. baseline.
GenevisibleiP52172. DM.

Family and domain databases

Gene3Di3.30.50.10. 1 hit.
InterProiIPR000679. Znf_GATA.
IPR013088. Znf_NHR/GATA.
[Graphical view]
PfamiPF00320. GATA. 1 hit.
[Graphical view]
PRINTSiPR00619. GATAZNFINGER.
SMARTiSM00401. ZnF_GATA. 1 hit.
[Graphical view]
PROSITEiPS00344. GATA_ZN_FINGER_1. 1 hit.
PS50114. GATA_ZN_FINGER_2. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiSRP_DROME
AccessioniPrimary (citable) accession number: P52172
Secondary accession number(s): Q6NMW1
, Q7K0H5, Q8INC6, Q94884, Q9VF01
Entry historyi
Integrated into UniProtKB/Swiss-Prot: October 1, 1996
Last sequence update: August 30, 2005
Last modified: November 30, 2016
This is version 146 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programDrosophila annotation project

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.