Skip Header

 
Contribute Send feedback

Unreviewed, UniProtKB/TrEMBL Q5U158 (Q5U158_DROME)

Last modified October 14, 2008. Version 19. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequences · References · Cross-references · Entry information

Names and origin

Protein namesSubmitted name:
    RE02581p EMBL AAV36919.1
Gene names
ORF Names: CG2025 EMBL AAV36919.1 FlyBase FBgn0030344
OrganismDrosophila melanogaster (Fruit fly) [Complete proteome] EMBL AAV36919.1
Taxonomic identifier7227 [NCBI]
Taxonomic lineageEukaryotaMetazoaArthropodaHexapodaInsectaPterygotaNeopteraEndopterygotaDipteraBrachyceraMuscomorphaEphydroideaDrosophilidaeDrosophilaSophophora

Protein attributes

Sequence length1147 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at transcript level.

General annotation (Comments)

Sequence similarities

Belongs to the peptidase M16 family. RuleBase RU004447V0

Ontologies

Keywords

   Technical termComplete proteome

Gene Ontology (GO)

   Biological processproteolysis

Inferred from electronic annotation. Source: InterPro

   Molecular functionmetalloendopeptidase activity

Inferred from electronic annotation. Source: InterPro

zinc ion binding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequences

Sequence LengthMass (Da)Tools
Q5U158-1 [UniParc].

Last modified December 7, 2004. Version 1.
Checksum: BAD0E8F9E1ECDAA6

FASTA1,147132,692
        10         20         30         40         50         60 
MLWQRYSAHL FRGFEFHVRT KRRSPITQLA GVRAITYSSL CKMTDQVKYL DIPDKSETDK 

        70         80         90        100        110        120 
KLYKTLLLGN GLHALIVSDP SPMPHDGFTT SESSSSKSTV STSSSIISRS ESTSSTSTDS 

       130        140        150        160        170        180 
ESSEESSSEE GDEKLAACAL LIDYGSFAEP TKYQGLAHFL EHMIFMGSEK YPKENIFDAH 

       190        200        210        220        230        240 
IKKCGGFANA NTDCEDTLFY FEVAEKHLDS SLDYFTALMK APLMKQEAMQ RERSAVDSEF 

       250        260        270        280        290        300 
QQILQDDETR RDQLLASLAT KGFPHGTFAW GNMKSLKENV DDAELHKILH EIRKEHYGAN 

       310        320        330        340        350        360 
RMYVCLQARL PIDELESLVV RHFSGIPHNE VKAPDLSSFN YKDAFKAEFH EQVFFVKPVE 

       370        380        390        400        410        420 
NETKLELTWV LPNVRQYYRS KPDQFLSYLL GYEGRGSLCA YLRRRLWALQ LIAGIDENGF 

       430        440        450        460        470        480 
DMNSMYSLFN ICIYLTDEGF KNLDEVLAAT FAYVKLFANC GSMKDVYEEQ QRNEETGFRF 

       490        500        510        520        530        540 
HAQRPAFDNV QELVLNLKYF PPKDILTGKE LYYEYNEEHL KELISHLNEM KFNLMVTSRR 

       550        560        570        580        590        600 
KYDDISAYDK TEEWFGTEYA TIPMPEKWRK LWEDSVPLPE LFLPESNKYV TDDFTLHWHS 

       610        620        630        640        650        660 
MGRPEVPDSP KLLIKTDTCE LWFRQDDKFD LPEAHMAFYF ISPMQRQNAK NDAMCSLYEE 

       670        680        690        700        710        720 
MVRFHVCEEL YPAISAGLSY SLSTIEKGLL LKVCGYNEKL HLIVEAIAEG MLNVAETLDE 

       730        740        750        760        770        780 
NMLSAFVKNQ RKAFFNALIK PKALNRDIRL CVLERIRWLM INKYKCLSSV ILEDMREFAH 

       790        800        810        820        830        840 
QFPKELYIQS LIQGNYTEES VHNVMNSLLS RLNCKQIRER GRFLEDITVK LPVGTSIIRC 

       850        860        870        880        890        900 
HALNVQDTNT VITNFYQIGP NTVRVESILD LLMMFVDEPL FDQLRTKEQL GYHVGATVRL 

       910        920        930        940        950        960 
NYGIAGYSIM VNSQETKTTA DYVEGRIEVF RAKMLQILRH LPQDEYEHTR DSLIKLKLVA 

       970        980        990       1000       1010       1020 
DLALSTEMSR NWDEIINESY LFDRRRRQIE VLRTLQKDEI INFVISIDGD NMRKLSVQVI 

      1030       1040       1050       1060       1070       1080 
GHRPAGMPEP LCGEDTAKCA SKSDDESESE NDDDDDEDEE EEESSEEEEE EEKEKEGLKG 

      1090       1100       1110       1120       1130       1140 
EDEDDLFYSL ENKLNIVFLP AKFNNAFIIT DIEKFKDDQY VYPQQKTQPK EEDELISAHI 


ADAIRQV 

« Hide

References

[1]Stapleton M., Carlson J., Chavez C., Frise E., George R., Pacleb J., Park S., Wan K., Yu C., Rubin G.M., Celniker S.
Submitted (OCT-2004) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
Strain: Berkeley EMBL AAV36919.1.

Cross-references

Sequence databases

BT016034 mRNA. Translation: AAV36919.1.

3D structure databases

ModBaseSearch...

Organism-specific databases

FlyBaseFBgn0030344. CG2025.

Phylogenomic databases

HOGENOMQ5U158.

Gene expression databases

ArrayExpressQ5U158.

Family and domain databases

InterProIPR011237. Pept_M16_core.
IPR011765. Pept_M16_N.
IPR001431. Pept_M16_Zn_BS.
IPR007863. Peptidase_M16_C.
[Graphical view]
Gene3DG3DSA:3.30.830.10. Pept_M16_core. 1 hit.
PfamPF00675. Peptidase_M16. 1 hit.
PF05193. Peptidase_M16_C. 2 hits.
[Graphical view]
PROSITEPS00143. INSULINASE. 1 hit.
[Graphical view]

Entry information

Entry nameQ5U158_DROME
AccessionPrimary (citable) accession number: Q5U158
Entry history
Integrated into UniProtKB/TrEMBL: December 7, 2004
Last sequence update: December 7, 2004
Last modified: October 14, 2008
This is version 19 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequences · References · Cross-references · Entry information