Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Protein alan shepard

Gene

shep

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Has a role in the perception of gravity.1 Publication

GO - Molecular functioni

  • mRNA binding Source: FlyBase
  • nucleotide binding Source: InterPro

GO - Biological processi

  • adult locomotory behavior Source: FlyBase
  • female courtship behavior Source: FlyBase
  • gravitaxis Source: FlyBase
  • metamorphosis Source: FlyBase
  • neuron remodeling Source: FlyBase
  • response to endoplasmic reticulum stress Source: FlyBase
  • response to gravity Source: UniProtKB
  • sleep Source: FlyBase
Complete GO annotation...

Keywords - Ligandi

RNA-binding

Names & Taxonomyi

Protein namesi
Recommended name:
Protein alan shepard
Gene namesi
Name:shepImported
ORF Names:CG32423
OrganismiDrosophila melanogaster (Fruit fly)
Taxonomic identifieri7227 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaArthropodaHexapodaInsectaPterygotaNeopteraEndopterygotaDipteraBrachyceraMuscomorphaEphydroideaDrosophilidaeDrosophilaSophophora
Proteomesi
  • UP000000803 Componenti: Chromosome 3L

Organism-specific databases

FlyBaseiFBgn0052423. shep.

Subcellular locationi

GO - Cellular componenti

  • cytosol Source: FlyBase
Complete GO annotation...

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00003794991 – 590Protein alan shepardAdd BLAST590

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei5Phosphotyrosine1 Publication1
Modified residuei125Phosphotyrosine1 Publication1
Modified residuei142Phosphotyrosine1 Publication1

Keywords - PTMi

Phosphoprotein

Proteomic databases

PaxDbiQ8MSV2.
PRIDEiQ8MSV2.

PTM databases

iPTMnetiQ8MSV2.

Expressioni

Gene expression databases

BgeeiFBgn0052423.
GenevisibleiQ8MSV2. DM.

Interactioni

Protein-protein interaction databases

BioGridi64073. 1 interactor.
IntActiQ8MSV2. 27 interactors.
MINTiMINT-339377.
STRINGi7227.FBpp0301014.

Structurei

3D structure databases

ProteinModelPortaliQ8MSV2.
SMRiQ8MSV2.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini243 – 316RRM 1PROSITE-ProRule annotationAdd BLAST74
Domaini322 – 401RRM 2PROSITE-ProRule annotationAdd BLAST80

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi62 – 155Ala-richSequence analysisAdd BLAST94

Sequence similaritiesi

Contains 2 RRM (RNA recognition motif) domains.PROSITE-ProRule annotation

Keywords - Domaini

Repeat

Phylogenomic databases

eggNOGiKOG4733. Eukaryota.
ENOG410XRB5. LUCA.
GeneTreeiENSGT00760000118913.
InParanoidiQ8MSV2.
OMAiNTNTNMG.
OrthoDBiEOG091G0FXS.
PhylomeDBiQ8MSV2.

Family and domain databases

Gene3Di3.30.70.330. 2 hits.
InterProiIPR002343. Hud_Sxl_RNA.
IPR012677. Nucleotide-bd_a/b_plait.
IPR000504. RRM_dom.
[Graphical view]
PfamiPF00076. RRM_1. 2 hits.
[Graphical view]
PRINTSiPR00961. HUDSXLRNA.
SMARTiSM00360. RRM. 2 hits.
[Graphical view]
SUPFAMiSSF54928. SSF54928. 2 hits.
PROSITEiPS50102. RRM. 2 hits.
[Graphical view]

Sequences (6)i

Sequence statusi: Complete.

This entry describes 6 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform F (identifier: Q8MSV2-5) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MHPRYSPAPP PQQQQQMGGP PHQQQGGGGG GGVSMRGPSN AQQLPPQIPR
60 70 80 90 100
SQNYSNGSSS SAAAAPLTSR SAFPGAPLTA SAVALKGALP QRPPAMTSPA
110 120 130 140 150
AAAAGAALAA GAPYRGAASW TPQGYAPAAA AAAAAVAQQA AYRYTAPLPQ
160 170 180 190 200
PAYAAYTPHT ATTPATTTVS FLSQPVDYYW YGQRVPTAAS PSNTNSSSSS
210 220 230 240 250
NTGSQSGTLS TSLSNTTNTN TNMGPNGTVQ NQNQQGGEQL SKTNLYIRGL
260 270 280 290 300
QQGTTDKDLV NMCAQYGTII STKAILDKTT NKCKGYGFVD FEQPAFAECA
310 320 330 340 350
VKGLQGKGVQ AQMAKQQEQD PTNLYIANLP PHFKETDLEA MLSKYGQVVS
360 370 380 390 400
TRILRDQQMN SKGVGFARME SREKCEQIIQ MFNGNTIPGA KDPLLVKFAD
410 420 430 440 450
GGPKKKNLFK TPDPNARAWR DVSAEGIPVA YDPTMQQNGV SVNVGTPIGV
460 470 480 490 500
PYSRFSAPQV GGYPVAGSQW IPGYMMTQVD DQTSYSPQYM QMAAAPPLGV
510 520 530 540 550
TSYKPEAVNQ VQPRGISMMV SGDTGVPYGT MMPQLATLQI GNSYISPTYP
560 570 580 590
YYAPPPTIIP TMPMTDSEQA STAASPDEAY TQYPHQAAPK
Note: No experimenal confirmation available.
Length:590
Mass (Da):62,175
Last modified:April 16, 2014 - v2
Checksum:i7700D1E378A5255A
GO
Isoform A1 Publication (identifier: Q8MSV2-1) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     169-180: Missing.

Note: No experimental confirmation available.Curated
Show »
Length:578
Mass (Da):60,689
Checksum:iA834D141737A0A89
GO
Isoform B1 Publication (identifier: Q8MSV2-2) [UniParc]FASTAAdd to basket
Also known as: D1 Publication

The sequence of this isoform differs from the canonical sequence as follows:
     1-95: Missing.
     169-180: Missing.
     284-285: Missing.
     315-315: K → KVGIWVLHRPAI
     543-543: S → SNFSPSLQ

Note: No experimental confirmation available.Curated
Show »
Length:499
Mass (Da):53,021
Checksum:i741195A5D8E51552
GO
Isoform E1 Publication (identifier: Q8MSV2-4) [UniParc]FASTAAdd to basket
Also known as: G

The sequence of this isoform differs from the canonical sequence as follows:
     1-222: Missing.
     315-315: K → KVGIWVLHRPAI

Note: No experimental confirmation available.Curated
Show »
Length:379
Mass (Da):41,266
Checksum:i741C06806CB3B168
GO
Isoform I (identifier: Q8MSV2-6) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-222: Missing.
     316-317: QQ → VGIWVLHRPAI

Note: No experimental confirmation available.
Show »
Length:377
Mass (Da):41,010
Checksum:iB82E7FDFC03FE5CE
GO
Isoform H (identifier: Q8MSV2-7) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-222: Missing.
     316-317: QQ → VGIWVLHRPAI
     483-488: Missing.

Note: No experimental confirmation available.
Show »
Length:371
Mass (Da):40,346
Checksum:iEA8DEB33523808DE
GO

Sequence cautioni

The sequence AAL28502 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.Curated
The sequence AAL28543 differs from that shown. Reason: Frameshift at position 420.Curated

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0542121 – 222Missing in isoform E, isoform H and isoform I. 1 PublicationAdd BLAST222
Alternative sequenceiVSP_0542131 – 95Missing in isoform B. 2 PublicationsAdd BLAST95
Alternative sequenceiVSP_054214169 – 180Missing in isoform A and isoform B. 1 PublicationAdd BLAST12
Alternative sequenceiVSP_054215284 – 285Missing in isoform B. 1 Publication2
Alternative sequenceiVSP_054216315K → KVGIWVLHRPAI in isoform B and isoform E. 1 Publication1
Alternative sequenceiVSP_054217316 – 317QQ → VGIWVLHRPAI in isoform I and isoform H. Curated2
Alternative sequenceiVSP_054218483 – 488Missing in isoform H. Curated6
Alternative sequenceiVSP_054219543S → SNFSPSLQ in isoform B. 1 Publication1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AE014296 Genomic DNA. Translation: AAF50788.5.
AE014296 Genomic DNA. Translation: AAF50790.2.
AE014296 Genomic DNA. Translation: AAF50792.4.
AE014296 Genomic DNA. Translation: AAN12240.1.
AE014296 Genomic DNA. Translation: ACZ94622.1.
AE014296 Genomic DNA. Translation: AFH04307.1.
AE014296 Genomic DNA. Translation: AGB94133.1.
AE014296 Genomic DNA. Translation: AGB94134.1.
AY060954 mRNA. Translation: AAL28502.1. Different initiation.
AY060995 mRNA. Translation: AAL28543.1. Frameshift.
AY094948 mRNA. Translation: AAM11301.1.
AY118564 mRNA. Translation: AAM49933.1.
RefSeqiNP_001163350.1. NM_001169879.2. [Q8MSV2-4]
NP_001246636.1. NM_001259707.1. [Q8MSV2-5]
NP_001261438.1. NM_001274509.1. [Q8MSV2-4]
NP_001261439.1. NM_001274510.1. [Q8MSV2-7]
NP_729054.3. NM_168111.4. [Q8MSV2-1]
NP_729055.1. NM_168112.4. [Q8MSV2-2]
NP_729056.1. NM_168113.3. [Q8MSV2-2]
NP_729057.3. NM_168114.5. [Q8MSV2-6]
UniGeneiDm.20861.

Genome annotation databases

EnsemblMetazoaiFBtr0308859; FBpp0301014; FBgn0052423. [Q8MSV2-5]
GeneIDi38605.
KEGGidme:Dmel_CG32423.
UCSCiCG32423-RA. d. melanogaster. [Q8MSV2-5]
CG32423-RB. d. melanogaster.
CG32423-RC. d. melanogaster.

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AE014296 Genomic DNA. Translation: AAF50788.5.
AE014296 Genomic DNA. Translation: AAF50790.2.
AE014296 Genomic DNA. Translation: AAF50792.4.
AE014296 Genomic DNA. Translation: AAN12240.1.
AE014296 Genomic DNA. Translation: ACZ94622.1.
AE014296 Genomic DNA. Translation: AFH04307.1.
AE014296 Genomic DNA. Translation: AGB94133.1.
AE014296 Genomic DNA. Translation: AGB94134.1.
AY060954 mRNA. Translation: AAL28502.1. Different initiation.
AY060995 mRNA. Translation: AAL28543.1. Frameshift.
AY094948 mRNA. Translation: AAM11301.1.
AY118564 mRNA. Translation: AAM49933.1.
RefSeqiNP_001163350.1. NM_001169879.2. [Q8MSV2-4]
NP_001246636.1. NM_001259707.1. [Q8MSV2-5]
NP_001261438.1. NM_001274509.1. [Q8MSV2-4]
NP_001261439.1. NM_001274510.1. [Q8MSV2-7]
NP_729054.3. NM_168111.4. [Q8MSV2-1]
NP_729055.1. NM_168112.4. [Q8MSV2-2]
NP_729056.1. NM_168113.3. [Q8MSV2-2]
NP_729057.3. NM_168114.5. [Q8MSV2-6]
UniGeneiDm.20861.

3D structure databases

ProteinModelPortaliQ8MSV2.
SMRiQ8MSV2.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi64073. 1 interactor.
IntActiQ8MSV2. 27 interactors.
MINTiMINT-339377.
STRINGi7227.FBpp0301014.

PTM databases

iPTMnetiQ8MSV2.

Proteomic databases

PaxDbiQ8MSV2.
PRIDEiQ8MSV2.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaiFBtr0308859; FBpp0301014; FBgn0052423. [Q8MSV2-5]
GeneIDi38605.
KEGGidme:Dmel_CG32423.
UCSCiCG32423-RA. d. melanogaster. [Q8MSV2-5]
CG32423-RB. d. melanogaster.
CG32423-RC. d. melanogaster.

Organism-specific databases

CTDi38605.
FlyBaseiFBgn0052423. shep.

Phylogenomic databases

eggNOGiKOG4733. Eukaryota.
ENOG410XRB5. LUCA.
GeneTreeiENSGT00760000118913.
InParanoidiQ8MSV2.
OMAiNTNTNMG.
OrthoDBiEOG091G0FXS.
PhylomeDBiQ8MSV2.

Miscellaneous databases

ChiTaRSishep. fly.
GenomeRNAii38605.
PROiQ8MSV2.

Gene expression databases

BgeeiFBgn0052423.
GenevisibleiQ8MSV2. DM.

Family and domain databases

Gene3Di3.30.70.330. 2 hits.
InterProiIPR002343. Hud_Sxl_RNA.
IPR012677. Nucleotide-bd_a/b_plait.
IPR000504. RRM_dom.
[Graphical view]
PfamiPF00076. RRM_1. 2 hits.
[Graphical view]
PRINTSiPR00961. HUDSXLRNA.
SMARTiSM00360. RRM. 2 hits.
[Graphical view]
SUPFAMiSSF54928. SSF54928. 2 hits.
PROSITEiPS50102. RRM. 2 hits.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiSHEP_DROME
AccessioniPrimary (citable) accession number: Q8MSV2
Secondary accession number(s): E1JID7
, M9ND50, M9PEP6, Q7KU60, Q86BS4, Q8SWY6, Q95S20, Q95S52, Q9VRK3, Q9VRK5
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 7, 2009
Last sequence update: April 16, 2014
Last modified: November 2, 2016
This is version 116 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programDrosophila annotation project

Miscellaneousi

Miscellaneous

Named after Alan Bartlett Shepard, Jr. who was the second person and the first American in space and the fifth person to walk on the moon.1 Publication

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.