Protein

Transcription factor collier

Gene

kn

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at transcript level

Function

May act as a 'second-level regulator' of head patterning. Required for establishment of the PS(-1)/PS0 parasegmental border and formation of the intercalary segment. Required for expression of the segment polarity genes hedgehog, engrailed and wingless, and the segment-identity genes CAP and collar in the intercalary segment. Required at the onset of gastrulation for the correct formation of the mandibular segment. (4 publications)

Sites

Feature key  Position(s)  Length  Description
Site         179 – 179    1       Interaction with DNA (by similarity)
Site         188 – 188    1       Interaction with DNA (by similarity)

Regions

Feature key  Position(s)  Length  Description
Zinc finger  167 – 186    20      C5-type (sequence analysis)

GO - Molecular function

GO - Biological process

  • anterior head segmentation Source: FlyBase
  • blastoderm segmentation Source: FlyBase
  • dendrite morphogenesis Source: FlyBase
  • determination of muscle attachment site Source: FlyBase
  • embryonic development via the syncytial blastoderm Source: FlyBase
  • head segmentation Source: FlyBase
  • imaginal disc-derived wing morphogenesis Source: FlyBase
  • imaginal disc-derived wing vein specification Source: FlyBase
  • innate immune response Source: FlyBase
  • larval somatic muscle development Source: FlyBase
  • muscle cell fate specification Source: FlyBase
  • pattern specification process Source: FlyBase
  • positive regulation of JAK-STAT cascade Source: FlyBase
  • positive regulation of transcription from RNA polymerase II promoter Source: FlyBase
  • posterior head segmentation Source: FlyBase
  • regulation of dendrite morphogenesis Source: FlyBase
  • regulation of lamellocyte differentiation Source: FlyBase
  • regulation of transcription, DNA-templated Source: FlyBase
  • response to symbiont Source: FlyBase
  • specification of segmental identity, intercalary segment Source: FlyBase
  • transcription, DNA-templated Source: UniProtKB-KW

Keywords - Molecular function

Developmental protein

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding, Metal-binding, Zinc

Enzyme and pathway databases

SignaLink: P56721.

Names & Taxonomy

Protein names
Recommended name: Transcription factor collier
Alternative name(s): Transcription factor knot
Gene names
Name: kn
Synonyms: col
ORF Names: CG10197
Organism: Drosophila melanogaster (Fruit fly)
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora
Proteomes: UP000000803. Component: Chromosome 2R

Organism-specific databases

FlyBase: FBgn0001319. kn.

Subcellular location

GO - Cellular component

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key  Position(s)  Length  Description                   Feature identifier
Chain        1 – 575      575     Transcription factor collier  PRO_0000107823

Proteomic databases

PaxDb: P56721.

Expression

Tissue specificity

Its expression at the blastoderm stage is restricted to a single stripe of cells corresponding to part of the intercalary and mandibular segment primordia, possibly parasegment 0. (1 publication)

Developmental stage

Isoform COL1 is expressed from 3 hours of embryogenesis, with a peak of accumulation between 8 and 16 hours post-fertilization. Expression persists at a very low level in first instar larvae and accumulates again in third instar larvae and pupae. Isoform COL2 is expressed after 8 hours of embryogenesis, peaks in first instar larvae and is present at low levels in third instar larvae and pupae. (1 publication)

Gene expression databases

Bgee: P56721.
ExpressionAtlas: P56721. differential.
Genevisible: P56721. DM.

Interaction

Protein-protein interaction databases

BioGrid: 69588. 37 interactions.
STRING: 7227.FBpp0111722.

Structure

3D structure databases

ProteinModelPortal: P56721.
SMR: P56721. Positions 60-431.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Length  Description
Domain       299 – 382    84      IPT/TIG

Region

Feature key  Position(s)  Length  Description
Region       79 – 82      4       Interaction with DNA (by similarity)
Region       213 – 220    8       Interaction with DNA (by similarity)
Region       252 – 255    4       Interaction with DNA (by similarity)
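
The Length column in the feature tables follows from the inclusive residue coordinates (length = end - start + 1). The following is a minimal Python sketch that recomputes it for features listed in this entry and checks which DNA-contact sites fall inside the zinc finger. The `Feature` class and the helper names are illustrative assumptions, not part of any UniProt tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """Illustrative container for one feature-table row (1-based, inclusive)."""
    key: str
    start: int
    end: int
    description: str


def span_length(feature: Feature) -> int:
    """Number of residues covered by an inclusive start..end range."""
    return feature.end - feature.start + 1


def contains(outer: Feature, inner: Feature) -> bool:
    """True if `inner` lies entirely within `outer`."""
    return outer.start <= inner.start and inner.end <= outer.end


# Rows copied from the feature tables of this entry.
CHAIN = Feature("Chain", 1, 575, "Transcription factor collier")
ZINC_FINGER = Feature("Zinc finger", 167, 186, "C5-type")
IPT_TIG = Feature("Domain", 299, 382, "IPT/TIG")
SITES = [
    Feature("Site", 179, 179, "Interaction with DNA"),
    Feature("Site", 188, 188, "Interaction with DNA"),
]
REGIONS = [
    Feature("Region", 79, 82, "Interaction with DNA"),
    Feature("Region", 213, 220, "Interaction with DNA"),
    Feature("Region", 252, 255, "Interaction with DNA"),
]

# Recomputed lengths, in table order.
region_lengths = [span_length(r) for r in REGIONS]
# Sites that sit inside the annotated zinc finger.
sites_in_zinc_finger = [s.start for s in SITES if contains(ZINC_FINGER, s)]
```

With the coordinates above, only the site at position 179 lies within the zinc finger (167 – 186); the site at 188 is just outside it.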

Sequence similarities

Belongs to the COE family. (Curated)
Contains 1 IPT/TIG domain. (Curated)

Zinc finger

Feature key  Position(s)  Length  Description
Zinc finger  167 – 186    20      C5-type (sequence analysis)

Keywords - Domain

Zinc-finger

Phylogenomic databases

eggNOG: NOG259050.
GeneTree: ENSGT00390000014051.
InParanoid: P56721.
KO: K09103.
OrthoDB: EOG7J446V.
PhylomeDB: P56721.

Family and domain databases

Gene3D: 2.60.40.10. 1 hit.
InterPro: IPR011598. bHLH_dom.
  IPR013783. Ig-like_fold.
  IPR014756. Ig_E-set.
  IPR002909. IPT.
  IPR003523. Transcription_factor_COE.
  IPR018350. Transcription_factor_COE_CS.
PANTHER: PTHR10747. PTHR10747. 1 hit.
Pfam: PF01833. TIG. 1 hit.
SMART: SM00353. HLH. 1 hit.
  SM00429. IPT. 1 hit.
SUPFAM: SSF81296. SSF81296. 1 hit.
PROSITE: PS01345. COE. 1 hit.

Sequences (2)

Sequence status: Complete.

This entry describes 2 isoforms produced by alternative splicing.

Isoform COL1 (identifier: P56721-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MEWGRKLYPS AVSGPRSAGG LMFGLPPTAA VDMNQPRGPM TSLKEEPLGS
        60         70         80         90        100
RWAMQPVVDQ SNLGIGRAHF EKQPPSNLRK SNFFHFVIAL YDRAGQPIEI
       110        120        130        140        150
ERTAFIGFIE KDSESDATKT NNGIQYRLQL LYANGARQEQ DIFVRLIDSV
       160        170        180        190        200
TKQAIIYEGQ DKNPEMCRVL LTHEVMCSRC CDKKSCGNRN ETPSDPVIID
       210        220        230        240        250
RFFLKFFLKC NQNCLKNAGN PRDMRRFQVV ISTQVAVDGP LLAISDNMFV
       260        270        280        290        300
HNNSKHGRRA KRLDTTEGTG NTSLSISGHP LAPDSTYDGL YPPLPVATPC
       310        320        330        340        350
IKAISPSEGW TTGGATVIIV GDNFFDGLQV VFGTMLVWSE LITSHAIRVQ
       360        370        380        390        400
TPPRHIPGVV EVTLSYKSKQ FCKGSPGRFV YVSALNEPTI DYGFQRLQKL
       410        420        430        440        450
IPRHPGDPEK LQKEIILKRA ADLVEALYSM PRSPGGSTGF NSYAGQLAVS
       460        470        480        490        500
VQDGSGQWTE DDYQRAQSSS VSPRGGYCSS ASTPHSSGGS YGATAASAAV
       510        520        530        540        550
AATANGYAPA PNMGTLSSSP GSVFNSTSMS AVSSTWHQAF VQHHHAATAH
       560        570
PHHHYPHPHQ PWHNPAVSAA TAAAV
Length: 575
Mass (Da): 62,494
Last modified: October 11, 2004 - v2
Checksum: D15144D95BDCDFBC
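
The sequence listing above interleaves position rulers with residue lines split into blocks of ten. As a rough sketch of how such a display block could be turned back into a plain residue string, the Python snippet below drops ruler lines and whitespace. The function name `parse_display_block` is an assumption made for this example, and only the first 100 residues are included to keep it short.

```python
import re


def parse_display_block(text: str) -> str:
    """Return the bare residue string from a ruler-annotated display block."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip empty lines and ruler lines, which hold only digits and spaces.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        # Remove the spaces that separate the ten-residue blocks.
        residues.append(re.sub(r"\s+", "", stripped))
    return "".join(residues).upper()


# First 100 residues of isoform COL1, copied from the listing above.
BLOCK = """\
        10         20         30         40         50
MEWGRKLYPS AVSGPRSAGG LMFGLPPTAA VDMNQPRGPM TSLKEEPLGS
        60         70         80         90        100
RWAMQPVVDQ SNLGIGRAHF EKQPPSNLRK SNFFHFVIAL YDRAGQPIEI
"""

first_100 = parse_display_block(BLOCK)
# Residues 79-82 (1-based, inclusive), one of the DNA-interaction regions.
region_79_82 = first_100[78:82]
```

Because positions in the entry are 1-based and inclusive, residues 79 – 82 correspond to the Python slice `[78:82]`.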
Isoform COL2 (identifier: P56721-2)

The sequence of this isoform differs from the canonical sequence as follows:
     529-575: MSAVSSTWHQ...AVSAATAAAV → RVSSLSFNPFALPTCNTQGYSTQLVTSTK

Length: 557
Mass (Da): 60,488
Checksum: 16E4EFC1AC05A3BC
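
The COL2 length can be checked from the substitution itself: residues 529 – 575 (47 residues) are replaced by a 29-residue segment, so 575 - 47 + 29 = 557. The sketch below applies the replacement in Python. The helper `apply_variant` is a hypothetical name, and the first 528 residues are represented by an `X` placeholder (an assumption made purely to keep the example short); only the C-terminal residues are taken from this entry.

```python
def apply_variant(canonical: str, start: int, end: int, replacement: str) -> str:
    """Replace residues start..end (1-based, inclusive) with `replacement`."""
    if not 1 <= start <= end <= len(canonical):
        raise ValueError("variant coordinates fall outside the sequence")
    return canonical[: start - 1] + replacement + canonical[end:]


# Residues 529-575 of isoform COL1, copied from the canonical sequence.
COL1_TAIL = "MSAVSSTWHQAFVQHHHAATAHPHHHYPHPHQPWHNPAVSAATAAAV"
# Placeholder: 'X' stands in for residues 1-528, which both isoforms share.
COL1_STAND_IN = "X" * 528 + COL1_TAIL

# Replacement segment listed for isoform COL2.
COL2_TAIL = "RVSSLSFNPFALPTCNTQGYSTQLVTSTK"

col2_stand_in = apply_variant(COL1_STAND_IN, 529, 575, COL2_TAIL)
```

Substituting the full canonical sequence for the placeholder would give the complete COL2 sequence; the length arithmetic is the same either way.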

Experimental Info

Feature key        Position(s)  Length  Description
Sequence conflict  354 – 355    2       RH → SD in X97803 (PubMed:8793297). (Curated)
Sequence conflict  357 – 357    1       P → R in AAM50962 (PubMed:12537569). (Curated)
Sequence conflict  384 – 384    1       Missing in AAM50962 (PubMed:12537569). (Curated)
Sequence conflict  435 – 435    1       G → D in X97803 (PubMed:8793297). (Curated)
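
Each conflict row names the canonical residue(s) at a position and what the other record reports instead. As a sanity check, the Python sketch below confirms that the canonical residues quoted in the table match the canonical sequence. It uses only residues 351 – 450 copied from the listing above; the names `FRAGMENT`, `residues` and `mismatched_conflicts` are assumptions for this example. For the "Missing" row the table does not name the canonical residue, so the `A` at position 384 is read off the sequence.

```python
# Residues 351-450 of isoform COL1, copied from the canonical sequence.
FRAGMENT_START = 351
FRAGMENT = (
    "TPPRHIPGVVEVTLSYKSKQFCKGSPGRFVYVSALNEPTIDYGFQRLQKL"
    "IPRHPGDPEKLQKEIILKRAADLVEALYSMPRSPGGSTGFNSYAGQLAVS"
)


def residues(start: int, end: int) -> str:
    """Canonical residues start..end (1-based, inclusive) within the fragment."""
    first = start - FRAGMENT_START
    last = end - FRAGMENT_START
    if first < 0 or last >= len(FRAGMENT) or end < start:
        raise ValueError("range falls outside the copied fragment")
    return FRAGMENT[first : last + 1]


# (start, end, canonical residues, residues in the other record, record)
# An empty string means the residue is missing in the other record.
CONFLICTS = [
    (354, 355, "RH", "SD", "X97803"),
    (357, 357, "P", "R", "AAM50962"),
    (384, 384, "A", "", "AAM50962"),  # canonical 'A' read off the sequence
    (435, 435, "G", "D", "X97803"),
]


def mismatched_conflicts():
    """Rows whose stated canonical residues disagree with the fragment."""
    return [c for c in CONFLICTS if residues(c[0], c[1]) != c[2]]
```

An empty result from `mismatched_conflicts()` indicates that the table and the canonical sequence agree at every listed position.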

Alternative sequence

Feature key           Position(s)  Length  Description                                                                    Feature identifier
Alternative sequence  529 – 575    47      MSAVS…TAAAV → RVSSLSFNPFALPTCNTQGYSTQLVTSTK in isoform COL2. (1 publication)  VSP_001111

Sequence databases

EMBL / GenBank / DDBJ:
  X97803 mRNA. No translation available.
  AE013599 Genomic DNA. Translation: AAF58204.2.
  AY119102 mRNA. Translation: AAM50962.1.
RefSeq: NP_524813.2. NM_080074.4. [P56721-2]
  NP_725419.2. NM_166070.3. [P56721-1]
UniGene: Dm.1803.

Genome annotation databases

EnsemblMetazoa: FBtr0112809; FBpp0111721; FBgn0001319. [P56721-1]
GeneID: 45318.
KEGG: dme:Dmel_CG10197.
UCSC: CG10197-RA. d. melanogaster. [P56721-1]

Keywords - Coding sequence diversity

Alternative splicing

Cross-references

Organism-specific databases

CTD: 45318.
FlyBase: FBgn0001319. kn.

Miscellaneous databases

GenomeRNAi: 45318.
NextBio: 838001.
PRO: P56721.

Publications

  1. "Collier, a novel regulator of Drosophila head development, is expressed in a single mitotic domain."
    Crozatier M., Valle D., Dubois L., Ibnsouda S., Vincent A.
    Curr. Biol. 6:707-718(1996)
    Cited for: NUCLEOTIDE SEQUENCE (ISOFORMS COL1 AND COL2), FUNCTION, TISSUE SPECIFICITY, DEVELOPMENTAL STAGE.
    Tissue: Embryo.
  2. "The genome sequence of Drosophila melanogaster."
    Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.
    Science 287:2185-2195(2000)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: Berkeley.
  3. Cited for: GENOME REANNOTATION.
    Strain: Berkeley.
  4. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM COL2).
    Strain: Berkeley.
    Tissue: Embryo.
  5. "The COE transcription factor Collier is a mediator of short-range Hedgehog-induced patterning of the Drosophila wing."
    Vervoort M., Crozatier M., Valle D., Vincent A.
    Curr. Biol. 9:632-639(1999)
    Cited for: FUNCTION.
  6. "Requirement for the Drosophila COE transcription factor Collier in formation of an embryonic muscle: transcriptional response to notch signalling."
    Crozatier M., Vincent A.
    Development 126:1495-1504(1999)
    Cited for: FUNCTION.
  7. "Head versus trunk patterning in the Drosophila embryo; collier requirement for formation of the intercalary segment."
    Crozatier M., Valle D., Dubois L., Ibnsouda S., Vincent A.
    Development 126:4385-4394(1999)
    Cited for: FUNCTION.

Entry information

Entry name: COLL_DROME
Accession: Primary (citable) accession number: P56721
Secondary accession number(s): Q8MS49, Q9V758
Entry history
Integrated into UniProtKB/Swiss-Prot: May 30, 2000
Last sequence update: October 11, 2004
Last modified: June 24, 2015
This is version 118 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence in the cluster (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
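
The UniRef90 and UniRef50 membership rules above reduce to two thresholds: a minimum identity to the seed sequence and a minimum overlap with it. The Python sketch below encodes only those stated thresholds. It is a simplification under stated assumptions: the identity and overlap fractions are taken as already computed by some alignment step (not shown), and the names `THRESHOLDS`, `MIN_OVERLAP` and `joins_cluster` are illustrative, not part of UniRef's actual clustering pipeline.

```python
# Minimum identity to the seed sequence, as stated for each UniRef level.
THRESHOLDS = {"UniRef90": 0.90, "UniRef50": 0.50}
# Minimum overlap with the seed (longest) sequence, shared by both levels.
MIN_OVERLAP = 0.80


def joins_cluster(level: str, identity: float, overlap: float) -> bool:
    """True if a sequence meets the stated identity and overlap thresholds.

    `identity` and `overlap` are fractions in [0, 1] relative to the seed
    sequence; how they are computed is outside the scope of this sketch.
    """
    if level not in THRESHOLDS:
        raise KeyError(f"unknown UniRef level: {level}")
    return identity >= THRESHOLDS[level] and overlap >= MIN_OVERLAP


examples = [
    joins_cluster("UniRef90", 0.93, 0.85),  # meets both thresholds
    joins_cluster("UniRef90", 0.93, 0.70),  # overlap too small
    joins_cluster("UniRef50", 0.55, 0.80),  # meets both thresholds
    joins_cluster("UniRef50", 0.49, 0.90),  # identity too low
]
```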