Protein (submitted name): Uncharacterized protein
Gene: CELE_M02G9.1
Organism: Caenorhabditis elegans
Status: Unreviewed - Annotation score: 1 out of 5 - Experimental evidence at protein level

Names & Taxonomy

Protein names
Submitted name: Uncharacterized protein (Imported)
Gene names
ORF names: CELE_M02G9.1 (Imported), M02G9.1 (Imported)
Organism: Caenorhabditis elegans (Imported)
Taxonomic identifier: 6239 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Nematoda > Chromadorea > Rhabditida > Rhabditoidea > Rhabditidae > Peloderinae > Caenorhabditis
Proteomes
  • UP000001940 Component: Chromosome II

Organism-specific databases

WormBase: M02G9.1a; CE41249; WBGene00010830.

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description         Feature identifier
Signal peptide   1 – 22        22       Sequence analysis
Chain            23 – 909      887      Sequence analysis   PRO_5004157443
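
The positions in the table above are 1-based and inclusive, so each length equals end minus start plus one, and the signal peptide and the chain together should account for the full 909 residues. A minimal sketch of that check follows; `feature_length` is a hypothetical helper written for illustration, not part of any UniProt tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    return end - start + 1


# Positions taken from the feature table in this entry.
signal_peptide = feature_length(1, 22)
chain = feature_length(23, 909)

# The two features are adjacent, so their lengths add up to the sequence length.
print(signal_peptide, chain, signal_peptide + chain)  # 22 887 909
```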

Proteomic databases

PaxDb: O17970.

Expression

Gene expression databases

ExpressionAtlas: O17970. baseline.

Interaction

Protein-protein interaction databases

DIP: DIP-26074N.
IntAct: O17970. 5 interactions.
MINT: MINT-1072669.
STRING: 6239.M02G9.1b.

Family & Domains

Keywords - Domain

Signal (Sequence analysis)

Phylogenomic databases

eggNOG: ENOG410JDP0. Eukaryota.
eggNOG: ENOG411177E. LUCA.
GeneTree: ENSGT00730000112111.
OrthoDB: EOG773XKP.

Sequence

Sequence status: Complete.

O17970-1 [UniParc]

        10         20         30         40         50
MGRSNSLRLL FLFCLLLALS TAASRRLKRQ CGCSNFCNCQ QQPVFFAQIS
        60         70         80         90        100
LPACSCQQAP ICQPQCPRAE INSDCSATCV RACIPSCSKS TGNTFACSTT
       110        120        130        140        150
CESTCDKTCA SAAQQAMSHI QVSPPNPQPL VPIAAAPTVD DSCQNVCQNV
       160        170        180        190        200
CQGACVSQNS PPAVCQQTCR QSCQFGCATN EQLPTTSSTS TNAPTIKITL
       210        220        230        240        250
NINDAYFDSN CAPKCTQSCH SQCISQGNPA ASCSNSCNTE CSDKCSTRPV
       260        270        280        290        300
QAQVQQIQPQ QVQITIPQTC QSRCENRCLS TCTASQPSVC APSCSTACQL
       310        320        330        340        350
SCDSIGAATA PQSTNPTPVE VRALCTPTCM PQCLPSCTAT TTTTTTQTPY
       360        370        380        390        400
IQTLAPAPAP APVQIQIIQK QCVPSCMPAC QPSCTNPVPL TTQAPVPVVN
       410        420        430        440        450
QCIPPCQPQC LQSCLEQHIQ PQVVTQLPQC IPQCQPACEP QCIQETTTTT
       460        470        480        490        500
TTTTTQSPQK PPIIKITVTG QQVGCVEQCQ PACDPKCIIA TIKTPSQQPQ
       510        520        530        540        550
FVVTTTQAPP APAPQQQQQL ASCPQLCQPQ CTSQCVQQQQ CPCQQTCQTG
       560        570        580        590        600
CQQHNPDARV CQNVCVEVCA SECPRTQPTV QQPAPQLVQQ PIYQTVQAPL
       610        620        630        640        650
SYVPVVPVAT SAPSQASGPQ ITINFAVPEC IPVCEQSCNT QCVEKFPQEH
       660        670        680        690        700
CGSVCNSQCQ TACATQTPAV QAAPAPSCQP QCQPACEPVC IAQQAQPVRI
       710        720        730        740        750
QINLATASSV LQASDACQPM CEQSCVQECQ STTLNVQAAT CQPACQSICQ
       760        770        780        790        800
QSCAPLGTSA PVMQTIPVVP VATAPVASTQ LCAPKCISDC QGLCKSNSPQ
       810        820        830        840        850
CIQGCDASCQ QLCGTAPTPA VPLTVNYNCN LPCDQQCTQQ CYHQAPTCAP
       860        870        880        890        900
ACASACEAQC PVVSCEDACQ TVCKGQCVFS GQNSRQCGPA CAQSCSSLCH
KKRVKRGEH
Length: 909
Mass (Da): 95,816
Last modified: July 10, 2007 - v3
Checksum: 12C7EF98D0F13989
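
The sequence listing above interleaves position rulers with rows of ten-residue groups, so it has to be cleaned before any downstream use. The sketch below shows one way to do that, using only the first row of this entry as input; `clean_sequence` is a hypothetical helper, and it assumes ruler lines contain nothing but digits and whitespace.

```python
def clean_sequence(block: str) -> str:
    """Strip ruler lines and whitespace from a ruler-annotated sequence listing."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Ruler lines (and blank lines) contain only numeric tokens; skip them.
        if all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


# First ruler line and first residue row of this entry, copied as an excerpt.
excerpt = """
        10         20         30         40         50
MGRSNSLRLL FLFCLLLALS TAASRRLKRQ CGCSNFCNCQ QQPVFFAQIS
"""

sequence = clean_sequence(excerpt)
print(len(sequence))  # 50
```

Applied to the full listing, the cleaned string should have the length reported for the entry (909).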

Sequence databases

EMBL / GenBank / DDBJ: BX284602 Genomic DNA. Translation: CAB04625.3.
PIR: T23681.
RefSeq: NP_001254245.1. NM_001267316.1.
UniGene: Cel.14728.

Genome annotation databases

EnsemblMetazoa: M02G9.1a; M02G9.1a; WBGene00010830.
GeneID: 174597.
UCSC: M02G9.1. c. elegans.

Cross-references

3D structure databases

ModBase: Search...
MobiDB: Search...

Protocols and materials databases

Structural Biology Knowledgebase: Search...

Organism-specific databases

CTD: 174597.
WormBase: M02G9.1a; CE41249; WBGene00010830.

Family and domain databases

ProtoNet: Search...

Publications

  1. "Genome sequence of the nematode C. elegans: a platform for investigating biology."
    Caenorhabditis elegans Sequencing Consortium
    Sulston J.E., Waterston R.
    Science 282:2012-2018(1998)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: Bristol N2 (Imported).

Entry information

Entry name: O17970_CAEEL
Accession: Primary (citable) accession number: O17970
Entry history
Integrated into UniProtKB/TrEMBL: January 1, 1998
Last sequence update: July 10, 2007
Last modified: June 8, 2016
This is version 100 of the entry and version 3 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Keywords - Technical term

Complete proteome, Proteomics identification (Combined sources), Reference proteome (Imported)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
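
The UniRef90 and UniRef50 criteria above each combine a minimum sequence identity with a minimum 80% overlap against the seed sequence. The sketch below only encodes that membership test, not the clustering procedure that UniRef actually runs. `meets_uniref_threshold` and its parameters are hypothetical names, and identity and overlap are assumed to be precomputed fractions between 0 and 1.

```python
MIN_OVERLAP = 0.80  # overlap with the seed sequence, as stated for UniRef90 and UniRef50


def meets_uniref_threshold(identity: float, overlap: float, min_identity: float) -> bool:
    """Return True if a sequence satisfies both the identity and overlap criteria."""
    return identity >= min_identity and overlap >= MIN_OVERLAP


# Illustrative values only; they are not taken from this entry.
print(meets_uniref_threshold(0.93, 0.85, min_identity=0.90))  # True
print(meets_uniref_threshold(0.93, 0.70, min_identity=0.90))  # False: overlap too low
print(meets_uniref_threshold(0.55, 0.90, min_identity=0.50))  # True
```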