Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Aryl hydrocarbon receptor protein 1

Gene

ahr-1

Organism
Caenorhabditis elegans
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Probable ligand-activated transcriptional activator. Acts as a transcriptional regulator in GABAergic motor neuron cell fate specification and development. Promotes cell-type-specific expression of guanylate cyclase genes that have key roles in aggregation behavior and hyperoxia avoidance. Has no role in carbon dioxide avoidance.5 Publications

GO - Molecular functioni

  • aryl hydrocarbon receptor activity Source: WormBase
  • DNA binding Source: WormBase
  • RNA polymerase II transcription factor binding Source: WormBase
  • signal transducer activity Source: WormBase

GO - Biological processi

  • behavior Source: UniProtKB-KW
  • intracellular receptor signaling pathway Source: WormBase
  • negative regulation of locomotion Source: WormBase
  • nervous system development Source: UniProtKB-KW
  • positive regulation of transcription from RNA polymerase II promoter Source: GO_Central
  • response to xenobiotic stimulus Source: WormBase
  • transcription, DNA-templated Source: UniProtKB-KW
Complete GO annotation...

Keywords - Molecular functioni

Activator, Receptor

Keywords - Biological processi

Behavior, Neurogenesis, Transcription, Transcription regulation

Keywords - Ligandi

DNA-binding

Names & Taxonomyi

Protein namesi
Recommended name:
Aryl hydrocarbon receptor protein 1Imported
Gene namesi
Name:ahr-1Imported
ORF Names:C41G7.5
OrganismiCaenorhabditis elegans
Taxonomic identifieri6239 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis
Proteomesi
  • UP000001940 Componenti: Chromosome I

Organism-specific databases

WormBaseiC41G7.5a; CE20561; WBGene00000096; ahr-1.
C41G7.5b; CE38293; WBGene00000096; ahr-1.

Subcellular locationi

GO - Cellular componenti

  • intracellular Source: WormBase
  • nuclear aryl hydrocarbon receptor complex Source: GO_Central
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Disruption phenotypei

Defects in neuronal differentiation, axon branching, aberrant cell migration and abnormal aggregation behavior on lawns of bacterial food.3 Publications

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
PropeptideiPRO_00003887061 – 2Sequence analysisBy similarity2
ChainiPRO_00003887073 – 602Aryl hydrocarbon receptor protein 1Sequence analysisAdd BLAST600

Proteomic databases

PaxDbiO44712.

Expressioni

Tissue specificityi

Expressed in many distinct neuronal cells including RMED, RMEV, RMEL and RMER. Functions in URX neurons to promote aggregation behavior.3 Publications

Developmental stagei

Expressed during embryonic and larval development.1 Publication

Gene expression databases

BgeeiWBGene00000096.

Interactioni

Subunit structurei

Interacts with daf-21/hsp90. Interacts with aha-1.2 Publications

Binary interactionsi

WithEntry#Exp.IntActNotes
aha-1O022193EBI-2409183,EBI-2408984

GO - Molecular functioni

  • RNA polymerase II transcription factor binding Source: WormBase

Protein-protein interaction databases

IntActiO44712. 2 interactors.
STRINGi6239.C41G7.5a.

Structurei

3D structure databases

ProteinModelPortaliO44712.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini18 – 71bHLHPROSITE-ProRule annotationAdd BLAST54
Domaini126 – 196PASPROSITE-ProRule annotationAdd BLAST71

Sequence similaritiesi

Contains 1 bHLH (basic helix-loop-helix) domain.PROSITE-ProRule annotation
Contains 1 PAS (PER-ARNT-SIM) domain.PROSITE-ProRule annotation

Phylogenomic databases

eggNOGiKOG3560. Eukaryota.
ENOG410XRZH. LUCA.
GeneTreeiENSGT00860000133766.
InParanoidiO44712.
OMAiFLQCKAH.
OrthoDBiEOG091G16AJ.
PhylomeDBiO44712.

Family and domain databases

InterProiIPR011598. bHLH_dom.
IPR000014. PAS.
IPR013767. PAS_fold.
IPR013655. PAS_fold_3.
[Graphical view]
PfamiPF00989. PAS. 1 hit.
PF08447. PAS_3. 1 hit.
[Graphical view]
SMARTiSM00091. PAS. 1 hit.
[Graphical view]
SUPFAMiSSF55785. SSF55785. 2 hits.
PROSITEiPS50888. BHLH. 1 hit.
PS50112. PAS. 1 hit.
[Graphical view]

Sequences (2)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform a1 Publication (identifier: O44712-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MYASKRRQRN FKRVRDPPKQ LTNTNPSKRH RERLNGELET VAMLLPYDSS
60 70 80 90 100
TISRLDKLSV LRLAVSFLQC KAHFQACLHN SQFLSAGFPM STHSYSYQPH
110 120 130 140 150
PPIPFSNKVP TIFDLRIGTP MLDPEESNFE EISLKSLGGF ILVLNDNGEI
160 170 180 190 200
YYASENVENY LGFHQSDVLH QPVYDLIHSE DRDDIRQQLD SNFHIPTSSA
210 220 230 240 250
SNQFDVFAPQ NSKYLERNVN ARFRCLLDNT CGFLRIDMRG KLMSLHGLPS
260 270 280 290 300
SYVMGRTASG PVLGMICVCT PFVPPSTSDL ASEDMILKTK HQLDGALVSM
310 320 330 340 350
DQKVYEMLEI DETDLPMPLY NLVHVEDAVC MAEAHKEAIK NGSSGLLVYR
360 370 380 390 400
LVSTKTRRTY FVQSSCRMFY KNSKPESIGL THRLLNEVEG TMLLEKRSTL
410 420 430 440 450
KAKLLSFDDS FLQSPRNLQS TAALPLPSVL KDDQDCLEPS TSNSLFPSVP
460 470 480 490 500
VPTPTTTKAN RRRKENSHEI VPTIPSIPIP THFDMQMFDP SWNHGVHPPA
510 520 530 540 550
WPHDVYHLTQ YPPTYPHPPG TVGYPDVQIA PVDYPGWHPN DIHMTQLPHG
560 570 580 590 600
FTPDAQKLVP PHPQMSHFTE YPTPSTHHDL HHHPLKQDNF HLISEVTNLL

GT
Length:602
Mass (Da):68,232
Last modified:June 1, 1998 - v1
Checksum:i6A891C2950B3FD36
GO
Isoform b1 Publication (identifier: O44712-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-42: Missing.

Note: No experimental confirmation available.Curated
Show »
Length:560
Mass (Da):63,170
Checksum:i813BD9C8C058EC06
GO

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0531641 – 42Missing in isoform b. 1 PublicationAdd BLAST42

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF039570 mRNA. Translation: AAC00000.1.
Z81048 Genomic DNA. Translation: CAB51463.1.
Z81048 Genomic DNA. Translation: CAI79126.1.
PIRiT19898.
RefSeqiNP_001021036.1. NM_001025865.3. [O44712-1]
NP_001021037.1. NM_001025866.3. [O44712-2]
UniGeneiCel.19551.

Genome annotation databases

EnsemblMetazoaiC41G7.5a; C41G7.5a; WBGene00000096. [O44712-1]
GeneIDi172788.
KEGGicel:CELE_C41G7.5.
UCSCiC41G7.5a. c. elegans.

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF039570 mRNA. Translation: AAC00000.1.
Z81048 Genomic DNA. Translation: CAB51463.1.
Z81048 Genomic DNA. Translation: CAI79126.1.
PIRiT19898.
RefSeqiNP_001021036.1. NM_001025865.3. [O44712-1]
NP_001021037.1. NM_001025866.3. [O44712-2]
UniGeneiCel.19551.

3D structure databases

ProteinModelPortaliO44712.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

IntActiO44712. 2 interactors.
STRINGi6239.C41G7.5a.

Proteomic databases

PaxDbiO44712.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaiC41G7.5a; C41G7.5a; WBGene00000096. [O44712-1]
GeneIDi172788.
KEGGicel:CELE_C41G7.5.
UCSCiC41G7.5a. c. elegans.

Organism-specific databases

CTDi172788.
WormBaseiC41G7.5a; CE20561; WBGene00000096; ahr-1.
C41G7.5b; CE38293; WBGene00000096; ahr-1.

Phylogenomic databases

eggNOGiKOG3560. Eukaryota.
ENOG410XRZH. LUCA.
GeneTreeiENSGT00860000133766.
InParanoidiO44712.
OMAiFLQCKAH.
OrthoDBiEOG091G16AJ.
PhylomeDBiO44712.

Miscellaneous databases

PROiO44712.

Gene expression databases

BgeeiWBGene00000096.

Family and domain databases

InterProiIPR011598. bHLH_dom.
IPR000014. PAS.
IPR013767. PAS_fold.
IPR013655. PAS_fold_3.
[Graphical view]
PfamiPF00989. PAS. 1 hit.
PF08447. PAS_3. 1 hit.
[Graphical view]
SMARTiSM00091. PAS. 1 hit.
[Graphical view]
SUPFAMiSSF55785. SSF55785. 2 hits.
PROSITEiPS50888. BHLH. 1 hit.
PS50112. PAS. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiAHR_CAEEL
AccessioniPrimary (citable) accession number: O44712
Secondary accession number(s): Q564W4, Q93369
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 3, 2009
Last sequence update: June 1, 1998
Last modified: November 30, 2016
This is version 129 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programCaenorhabditis annotation project

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Caenorhabditis elegans
    Caenorhabditis elegans: entries, gene names and cross-references to WormBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.