Q01371 (WC1_NEUCR) Reviewed, UniProtKB/Swiss-Prot

Last modified December 14, 2011. Version 89.

Names and origin

Protein names: Recommended name: White collar 1 protein
               Short name: WC1
Gene names: Name: wc-1
            ORF Names: NCU02356
Organism: Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
Taxonomic identifier: 367110 [NCBI]
Taxonomic lineage: Eukaryota > Fungi > Dikarya > Ascomycota > Pezizomycotina > Sordariomycetes > Sordariomycetidae > Sordariales > Sordariaceae > Neurospora

Protein attributes

Sequence length: 1167 AA.
Sequence status: Complete.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

May function as a transcription factor involved in light regulation. Binds and affects blue-light regulation of the AL-3 gene. The WC1 and WC2 proteins interact via homologous PAS domains, bind to promoters of light-regulated genes such as FRQ, and activate transcription.

Subunit structure

Heterodimer of WC1 and WC2 (Potential).

Subcellular location

Nucleus.

Induction

By blue light.

Domain

The glutamine-rich domain might function in activating gene expression.

Post-translational modification

FMN binds covalently to cysteine after exposure to blue light; the binding is reversed in the dark (By similarity).

Sequence similarities

Contains 1 GATA-type zinc finger.

Contains 2 PAC (PAS-associated C-terminal) domains.

Contains 3 PAS (PER-ARNT-SIM) domains.

Sequence caution

The sequence EAA30541.2 differs from that shown. Reason: Erroneous gene model prediction.

Sequence annotation (Features)

Feature key          Position(s)   Length  Description                             Feature identifier

Molecule processing

Chain                1 – 1167      1167    White collar 1 protein                  PRO_0000083489

Regions

Domain               381 – 452     72      PAS 1
Domain               469 – 508     40      PAC 1
Domain               574 – 644     71      PAS 2
Domain               650 – 691     42      PAC 2
Domain               693 – 763     71      PAS 3
Zinc finger          934 – 959     26      GATA-type
Compositional bias   16 – 61       46      Gln-rich
Compositional bias   21 – 57       37      Poly-Gln
Compositional bias   329 – 333     5       Poly-Pro

Amino acid modifications

Modified residue     428           1       S-4a-FMN cysteine (By similarity)

Experimental info

Sequence conflict    1167          1       V → Q in EAA30541 (Ref. 3)
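
The Length column in the feature table follows from the positions, which are 1-based and inclusive: length = end - start + 1. As a minimal sketch, the rows can be transcribed by hand and checked against that rule. The names `FEATURES`, `span_length` and `mismatches` are illustrative only and are not part of any UniProt tool.

```python
# Feature rows transcribed by hand from the table above:
# (feature key, start, end, listed length, description)
FEATURES = [
    ("Chain", 1, 1167, 1167, "White collar 1 protein"),
    ("Domain", 381, 452, 72, "PAS 1"),
    ("Domain", 469, 508, 40, "PAC 1"),
    ("Domain", 574, 644, 71, "PAS 2"),
    ("Domain", 650, 691, 42, "PAC 2"),
    ("Domain", 693, 763, 71, "PAS 3"),
    ("Zinc finger", 934, 959, 26, "GATA-type"),
    ("Compositional bias", 16, 61, 46, "Gln-rich"),
    ("Compositional bias", 21, 57, 37, "Poly-Gln"),
    ("Compositional bias", 329, 333, 5, "Poly-Pro"),
    ("Modified residue", 428, 428, 1, "S-4a-FMN cysteine"),
    ("Sequence conflict", 1167, 1167, 1, "V -> Q in EAA30541"),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def mismatches(features):
    """Return the rows whose listed length disagrees with their positions."""
    return [row for row in features if span_length(row[1], row[2]) != row[3]]


if __name__ == "__main__":
    for key, start, end, listed, description in FEATURES:
        print(f"{key:<20}{start:>5} - {end:<5}{listed:>5}  {description}")
    print("rows with inconsistent length:", mismatches(FEATURES))
```

Single-residue features (the modified residue and the sequence conflict) are written with the same start and end, so the same rule gives a length of 1.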

Sequences

Q01371 [UniParc].
Length: 1,167 AA. Mass: 127,454 Da.

Last modified July 11, 2001. Version 2.
Checksum: 6489D04DAB50EE38

        10         20         30         40         50         60 
MNNNYYGSPL SPEELQHQMH QHQQQQQQQQ QQQQQQQQQQ QQQQQQQQQQ HQHQQQQKTN 

        70         80         90        100        110        120 
QHRNAGMMNT PPTTNQGNST IHASDVTMSG GSDSLDEIIQ QNLDEMHRRR SVPQPYGGQT 

       130        140        150        160        170        180 
RRLSMFDYAN PNDGFSDYQL DNMSGNYGDM TGGMGMSGHS SPYAGQNIMA MSDHSGGYSH 

       190        200        210        220        230        240 
MSPNVMGNMM TYPNLNMYHS PPIENPYSSA GLDTIRTDFS MDMNMDSGSV SAASVHPTPG 

       250        260        270        280        290        300 
LNKQDDEMMT MEQGFGGGDD ANASHQAQQN MGGLTPAMTP AMTPAMTPGV SNFAQGMATP 

       310        320        330        340        350        360 
VSQDAASTPA TTFQSPSLSA TTQTIRIGPP PPPSVTNAPT PAPFTSTPSG GGASQTKSIY 

       370        380        390        400        410        420 
SKSGFDMLRA LWYVASRKDP KLKLGAVDMS CAFVVCDVTL NDCPIIYVSD NFQNLTGYSR 

       430        440        450        460        470        480 
HEIVGRNCRF LQAPDGNVEA GTKREFVENN AVYTLKKTIA EGQEIQQSLI NYRKGGKPFL 

       490        500        510        520        530        540 
NLLTMIPIPW DTEEIRYFIG FQIDLVECPD AIIGQEGNGP MQVNYTHSDI GQYIWTPPTQ 

       550        560        570        580        590        600 
KQLEPADGQT LGVDDVSTLL QQCNSKGVAS DWHKQSWDKM LLENADDVVH VLSLKGLFLY 

       610        620        630        640        650        660 
LSPACKKVLE YDASDLVGTS LSSICHPSDI VPVTRELKEA QQHTPVNIVF RIRRKNSGYT 

       670        680        690        700        710        720 
WFESHGTLFN EQGKGRKCII LVGRKRPVFA LHRKDLELNG GIGDSEIWTK VSTSGMFLFV 

       730        740        750        760        770        780 
SSNVRSLLDL LPENLQGTSM QDLMRKESRA EFGRTIEKAR KGKIASCKHE VQNKRGQVLQ 

       790        800        810        820        830        840 
AYTTFYPGDG GEGQRPTFLL AQTKLLKASS RTLAPATVTV KNMSPGGVPL SPMKGIQTDS 

       850        860        870        880        890        900 
DSNTLMGGMS KSGSSDSTGA MVSARSSAGP GQDAALDADN IFDELKTTRC TSWQYELRQM 

       910        920        930        940        950        960 
EKVNRMLAEE LAQLLSNKKK RKRRKGGGNM VRDCANCHTR NTPEWRRGPS GNRDLCNSCG 

       970        980        990       1000       1010       1020 
LRWAKQTGRV SPRTSSRGGN GDSMSKKSNS PSHSSPLHRE VGNDSPSTTT ATKNSPSLRG 

      1030       1040       1050       1060       1070       1080 
SSTTAPGTIT TDSGPAVASS ASGTGSTTIA TSANSAASTV NALGPPATGP SGGSPAQHLP 

      1090       1100       1110       1120       1130       1140 
PHLQGTHLNA QAMQRVHQHK QHQQHQQQHQ QQHQQQHQQQ HQQLQQHQFN PPQSQPLLEG 

      1150       1160 
GSGFRGSGME MTSIREEMGE HQQGLSV 
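
The sequence above is laid out in rows of 60 residues, split into groups of 10, with a ruler line giving the position of the last residue in each group. A minimal sketch for turning that layout back into a plain string and looking up a residue by its 1-based position follows. The helper names `strip_ruler_format` and `residue_at` are hypothetical, and the sample is only the row covering positions 421 to 480, copied from the listing.

```python
def strip_ruler_format(text: str) -> str:
    """Drop ruler lines (digits only) and all whitespace from a sequence block."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines such as "430  440  450 ...".
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


def residue_at(seq: str, position: int, first_position: int = 1) -> str:
    """Residue at a 1-based position; first_position is the position of seq[0]."""
    return seq[position - first_position]


# Excerpt: the row covering positions 421-480, with its ruler line.
ROW_421_480 = """
       430        440        450        460        470        480
HEIVGRNCRF LQAPDGNVEA GTKREFVENN AVYTLKKTIA EGQEIQQSLI NYRKGGKPFL
"""

if __name__ == "__main__":
    excerpt = strip_ruler_format(ROW_421_480)
    print(len(excerpt))
    # Position 428 is the residue annotated as S-4a-FMN cysteine.
    print(residue_at(excerpt, 428, first_position=421))
```

On the excerpt this yields a cysteine at position 428, in line with the "Modified residue" row of the feature table. Applied to the full listing, `first_position` can be left at its default of 1.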

References

[1] "White collar-1, a central regulator of blue light responses in Neurospora, is a zinc finger protein."
Ballario P., Vittorioso P., Magrelli A., Talora C., Cabibbo A., Macino G.
EMBO J. 15:1650-1657(1996) [PubMed: 8612589]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987.

[2] Ballario P.
Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases
Cited for: SEQUENCE REVISION TO C-TERMINUS.

[3] "The genome sequence of the filamentous fungus Neurospora crassa."
Galagan J.E., Calvo S.E., Borkovich K.A., Selker E.U., Read N.D., Jaffe D.B., FitzHugh W., Ma L.-J., Smirnov S., Purcell S., Rehman B., Elkins T., Engels R., Wang S., Nielsen C.B., Butler J., Endrizzi M., Qui D., Ianakiev P., Bell-Pedersen D., Nelson M.A., Werner-Washburne M., Selitrennikoff C.P., Kinsey J.A., Braun E.L., Zelter A., Schulte U., Kothe G.O., Jedd G., Mewes H.-W., Staben C., Marcotte E., Greenberg D., Roy A., Foley K., Naylor J., Stange-Thomann N., Barrett R., Gnerre S., Kamal M., Kamvysselis M., Mauceli E.W., Bielke C., Rudd S., Frishman D., Krystofova S., Rasmussen C., Metzenberg R.L., Perkins D.D., Kroken S., Cogoni C., Macino G., Catcheside D.E.A., Li W., Pratt R.J., Osmani S.A., DeSouza C.P.C., Glass N.L., Orbach M.J., Berglund J.A., Voelker R., Yarden O., Plamann M., Seiler S., Dunlap J.C., Radford A., Aramayo R., Natvig D.O., Alex L.A., Mannhaupt G., Ebbole D.J., Freitag M., Paulsen I., Sachs M.S., Lander E.S., Nusbaum C., Birren B.W.
Nature 422:859-868(2003) [PubMed: 12712197]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: X94300 Genomic DNA. Translation: CAA63964.2.
                       AABX02000010 Genomic DNA. Translation: EAA30541.2. Sequence problems.
RefSeq: XP_959777.2. XM_954684.2.
UniGene: Ncr.25268.

3D structure databases

ProteinModelPortal: Q01371.

Protein-protein interaction databases

DIP: DIP-1155N.
IntAct: Q01371. 3 interactions.
MINT: MINT-7033864.

Genome annotation databases

EnsemblFungi: EFNCRT00000003067; EFNCRP00000003067; EFNCRG00000003064.
GeneID: 3875924.
KEGG: ncr:NCU02356.

Phylogenomic databases

eggNOG: fuNOG07259.
GeneTree: EFGT00050000000349.
OrthoDB: EOG47WRWS.

Family and domain databases

InterPro: IPR011989. ARM-like.
          IPR001610. PAC.
          IPR000014. PAS.
          IPR013655. PAS_fold_3.
          IPR000679. Znf_GATA.
          IPR013088. Znf_NHR/GATA.
Gene3D: G3DSA:1.25.10.10. ARM-like. 1 hit.
        G3DSA:3.30.50.10. Znf_NHR/GATA. 1 hit.
Pfam: PF00320. GATA. 1 hit.
      PF08447. PAS_3. 1 hit.
SMART: SM00086. PAC. 2 hits.
       SM00091. PAS. 3 hits.
       SM00401. ZnF_GATA. 1 hit.
TIGRFAMs: TIGR00229. Sensory_box. 2 hits.
PROSITE: PS00344. GATA_ZN_FINGER_1. 1 hit.
         PS50114. GATA_ZN_FINGER_2. 1 hit.
         PS50113. PAC. False negative.
         PS50112. PAS. 3 hits.

Entry information

Entry name: WC1_NEUCR
Accession: Primary (citable) accession number: Q01371
           Secondary accession number(s): Q7RVA7
Entry history: Integrated into UniProtKB/Swiss-Prot: November 1, 1997
               Last sequence update: July 11, 2001
               Last modified: December 14, 2011
               This is version 89 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Fungal Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families