Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Unreviewed, UniProtKB/TrEMBL A1Z7C4 (A1Z7C4_DROME)

Last modified June 16, 2009. Version 34. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · Ontologies · Sequences · References · Cross-references · Entry information

Names and origin

Protein namesSubmitted name:
    CG33087 EMBL AAF59114.3
    EC=3.6.1.3
Gene names
ORF Names: CG33087 FlyBase FBgn0053087, Dmel_CG33087 EMBL AAF59114.3
OrganismDrosophila melanogaster (Fruit fly) [Complete proteome]
Taxonomic identifier7227 [NCBI]
Taxonomic lineageEukaryotaMetazoaArthropodaHexapodaInsectaPterygotaNeopteraEndopterygotaDipteraBrachyceraMuscomorphaEphydroideaDrosophilidaeDrosophilaSophophora

Protein attributes

Sequence length4699 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

Sequences

Sequence LengthMass (Da)Tools
A1Z7C4-1 [UniParc].

Last modified February 6, 2007. Version 1.
Checksum: 166D46B38C74A6BF

FASTA4,699524,928
        10         20         30         40         50         60 
MQTVAETQMI EWLAACTSSF FLCILSDHYS VIYPIAGPCP ASYFTCNDGF CIPMRWKCDS 

        70         80         90        100        110        120 
KADCPDMSDE GSECAPKCNE GQFRCGVSRH CIPNNWLCDG EFDCGKGDIS DELNCPNGDT 

       130        140        150        160        170        180 
PKCRAFEGQC RNGDCLELSR FCDGRWDCDN DELQCDKQNA ACAALNCSFN CKLTPQGARC 

       190        200        210        220        230        240 
YCPKDQVPES SNSTRCVDYD ECSEPGTCDQ VCRNTPGSYE CSCVSGYAKT KGNRCRAINV 

       250        260        270        280        290        300 
PPTEPTTLIF LSRDGVQSVG TNGTEVIGPP GAKDNDKVTD KDGVGSEEEL LLSQPLRFVH 

       310        320        330        340        350        360 
AFEVWHRNRT LCSLLFSWPE LQMRCQRVDD ARVNWTLPFS SFVSPQQFFT ELRLDWLSGN 

       370        380        390        400        410        420 
WYLVSEDDGL VYLCTNAMTY CRVILQQVDP LSSLDLDPTK GFMFYTDWTP SLSRSLLDGS 

       430        440        450        460        470        480 
NRTVLVTDQV YHPSSVTLDL ANELVYWIDI YKDEVNRVDY EGRNRWTLKR PLDSPVPLKT 

       490        500        510        520        530        540 
IHAVEVFENS IYLAAWMDTA IVALDKFSLK THILQSNVSR GANLRIFHRQ KQPEVAHPCR 

       550        560        570        580        590        600 
DNNAGCNQIC VPQWTKGFAS AKCMCTAGYK LHNQTTCLLS ALDKFLVYSD KHLARISGIP 

       610        620        630        640        650        660 
LDTEQVQQLE QIGEQPDVMV PVYNVSKTLA IDVNVRGKAV FYVVADSGAS PFGNGEPSCS 

       670        680        690        700        710        720 
IRSQSLNGSV SRLLAQGLKR VHAVAFDWIN DHLYWTSHKK MQVAPLRNLS KVLTFNTDCD 

       730        740        750        760        770        780 
AMSLELDPTT GLLYWSQWES QSCEAGIYSS WMDGTHKELL AKGTSSMPMQ WPRSLDVDRR 

       790        800        810        820        830        840 
TKELYWCDIR LSTIELMRLD GTGREVLFKS DQFHPYSIVQ NNGLIYWADN KNSTILRFHA 

       850        860        870        880        890        900 
HQANLSSTFS STVHLQRTGR AADLRIFDIA SQPLPQTPSA CAQSKCPGMC LNTPKGAICR 

       910        920        930        940        950        960 
CPDGFTLNGT GSHCIPQLAP SPIRPNCTSG YMCRSTRQCL DTKDMCDGFE DCEDGIDESS 

       970        980        990       1000       1010       1020 
DPKGPCNVNT CDKTHNFVCN GRCYQRSLLC STIPYCSDGT DQANCHQNTC NSNEFTCHKS 

      1030       1040       1050       1060       1070       1080 
GRCIQLTWVN DGVVDCGPDD DSDETSETIF ASKCPEFDCN NGRCRQFADV CDGIDNCGNN 

      1090       1100       1110       1120       1130       1140 
ADEMECEQEC EHGEKYCRPI GCYGEMHMCD GIHDCLDFSD EANCNQTKSD NHPVTEWKEL 

      1150       1160       1170       1180       1190       1200 
GECAPLEFAC MFPFECIPDF LRCDGISHCF DKTDEFNCTH INTTRFDMNE TVICEHPDRL 

      1210       1220       1230       1240       1250       1260 
CGFSKQCVTV DQLCDGKNDC EDTTDEGFLC ADKLCDRGHE CSHRCHNTPE GYICSCPDHL 

      1270       1280       1290       1300       1310       1320 
YLQPNGKRCS MQHACDHWDT CSQVCESSGK GYDCRCLDGF DLGFDRFTCK STAPDEPYVI 

      1330       1340       1350       1360       1370       1380 
FTNRQDIKGI NLKTLNVGNF YSSLRNIIAL DFLYNNESNV EIYWTDVIDD KIYRGHLVGE 

      1390       1400       1410       1420       1430       1440 
SLRNVEAVIH SGLSTTEGLA VDWVGKNLYW IDSNLDQIEV AKLNGSFRRT LIAGNMESPR 

      1450       1460       1470       1480       1490       1500 
AIALDPREGL LFWTDWDDNS PRIERASMSG DGRRMISTSW QLSAGWPNGL TLDYTQKRVY 

      1510       1520       1530       1540       1550       1560 
WVDAKSDSIS STMYDGSEHH VVLRNKEILS HPFAISVFEN YVYWTDWRTT SVIRANKWNG 

      1570       1580       1590       1600       1610       1620 
SDVQVLQRTQ SQPFGIQVLH SSRQPWDRNP CGENNGGCSH LCLLSGRGTF KCECPHVMRL 

      1630       1640       1650       1660       1670       1680 
DPANERNCVP NEQVLLFVMV DEIRGIDLHQ PNHHTIPTIR QSPRRIDFLV DESRIFWSDI 

      1690       1700       1710       1720       1730       1740 
QQNEISSAGI SNGLIEPIIN TNIEKPYGFA VDWIARNMYF SSGQIKCNIL ASNLKGEFAS 

      1750       1760       1770       1780       1790       1800 
IIHEDLNMVD SIVLDPANGK MYWIHSASDG SMSQLEQSNL DGSSRSLIYQ HENNLQSLTM 

      1810       1820       1830       1840       1850       1860 
DFDSQRLYYA YDNSGIAYYD IPRNETRKVL VASPITSISS LTVYNGTLYF PENIQSVIMQ 

      1870       1880       1890       1900       1910       1920 
CEKEACSNMS YLRVNTKSIQ SMKMFYADAQ TGSNTCAEWA YRGGCQQLCL ATSSIDHVCR 

      1930       1940       1950       1960       1970       1980 
CALGYDVEPN NPTGCVPRAE FIFYSIDVLQ GVEMIDPSEQ FDTPSPALVP ISRVSSASFI 

      1990       2000       2010       2020       2030       2040 
DYLANTDTLY WGDNELGSIS RVKRDGTQRE TILEALNLVG YKQQDWLGGI AIDWVAGNIY 

      2050       2060       2070       2080       2090       2100 
WSDTKRNIIE VARLDGSHRY VVVSNLEKPT ALAVDPLQGL LFYVTQQHIG RVGLDGSQPF 

      2110       2120       2130       2140       2150       2160 
VLVNQTRANW AVGSLVLDIE ATKVYWCERY PDALMKVDYD GNLREQLLNE SLNNPVALAK 

      2170       2180       2190       2200       2210       2220 
MGDYLYWAEN KYNEGIIRVA PLANLSQSKV VLQTEQDAIR DLKIYSKHLQ RGSNPCAHSN 

      2230       2240       2250       2260       2270       2280 
GACEQLCLFN GTSAVCACAH SRLASDGYSC EPYENFLLFS YRSNIESIHM TDHADKNWPV 

      2290       2300       2310       2320       2330       2340 
QMISNTSLMR NVIAITYNYE EQLVYYSDVQ LSTINQVHFN GTGHRVLLEQ QQRVEGLAYD 

      2350       2360       2370       2380       2390       2400 
IVNEQLFWTS NNNATIRSVE LRHLSEHADQ NQVHVKKVLS LREDDKPRGI AVEPCLGMIY 

      2410       2420       2430       2440       2450       2460 
WTNWNEGSPC IQRSYLTGYG TEVIIKTDIK MPNALTLDLE QQKLYWADAR LDKIERTNYD 

      2470       2480       2490       2500       2510       2520 
GSNRVVLAHS TPKHAFAMAV YGDLLFWTDW VLHAVVRANK YTGTDVLFLR EHVTRPMGIV 

      2530       2540       2550       2560       2570       2580 
AVQNTSINCD ANQCKILNGQ CEDVCILNKS GQATCHCTQG VLAPDGRRCI APVNTSCGLS 

      2590       2600       2610       2620       2630       2640 
QYNCHSGECI PLELTCDNVT HCADGSDEFR SYCIFRQCPE THFMCQNHRC IPKEHKCDGE 

      2650       2660       2670       2680       2690       2700 
QQCGDGSDET PLLCKCQSED IDMHPSNNNT KEMPDMFRCG SGECIPRKFL CDSLKDCRDF 

      2710       2720       2730       2740       2750       2760 
SDEKMCAPIP CEKNDMTFVH CGNSTICIMP RWRCDGDPDC PDGTDELDCA NHTSLSCDPG 

      2770       2780       2790       2800       2810       2820 
QFRCASGNCI AGSWHCDGEK DCPDGSDEIN CRTECRHNQF ACDKTCIPAS WQCDGKSDCE 

      2830       2840       2850       2860       2870       2880 
DGSDEGPQCP NRPCRPHLFQ CKSSGRCIPQ KWVCDGEKDC PSGLGDEGSE DEGPQCGGVA 

      2890       2900       2910       2920       2930       2940 
HIPDCPPPAH LCTSGLCIDS HYVCDGDEDC PGGDDEYEGC VPAFQPHSCP GGSLMHQCQD 

      2950       2960       2970       2980       2990       3000 
GLCIFKNQTC DGKPDCGDGS DETSSLCAHT RGCNGTDDFR CKNGACIHAD LLCDRRNDCA 

      3010       3020       3030       3040       3050       3060 
DFSDEELCNV NECLIPDICE HECEDKVVGY QCHCRPGYKV LPKSPHLCTD IDECDEQQPC 

      3070       3080       3090       3100       3110       3120 
SQTCINTYGS YKCLCAKGYA LVDHHTCKAT SNVSMELIFS NRYYIRQVDM TGNGSILINE 

      3130       3140       3150       3160       3170       3180 
LSNAVALDYD WDSQCLYWSD VTSTVGTIKR YCPKENKTQT LHQAMLKNPD GLAVDWVAKN 

      3190       3200       3210       3220       3230       3240 
LYWCDKGLDT IEVSQLDGKY RKVLINEYLR EPRGIALHPY QQHIFWSDWG DSPHIGKAGM 

      3250       3260       3270       3280       3290       3300 
DGSNPKMIIR DGLGWPNALT ISFETQQLFW GDAREDTISV SDLDGNHTRL LLARSINPLL 

      3310       3320       3330       3340       3350       3360 
NLHHIFAIAV WEGHIYWSDW ETKSIEYCSI FNGQNCTTLI TTIHRPMDLR VFHPYRQQQP 

      3370       3380       3390       3400       3410       3420 
MSGNPCLAAN CSTLCVLSPE EPYYKCMCPT NFILADDGRT CRANCTAAHF ECVNTYKCIP 

      3430       3440       3450       3460       3470       3480 
FYWRCDTQDD CGDGSDEPET CPPFHCEPGQ YQCANKKCTH PSNLCDGINQ CGDGSDELNC 

      3490       3500       3510       3520       3530       3540 
DKFTCFDNHM KCGATANSSA FCVDNVKRCD GVKDCPGGED ESACTPLVCK KDQFQCGNNR 

      3550       3560       3570       3580       3590       3600 
CMPFVWVCDG DIDCPDKSDE ANCDNVSCGP NDFQCDSGRC IPLAWRCDDD HDCPNGEDEP 

      3610       3620       3630       3640       3650       3660 
ASCFSSKATC DPTYFKCNNS KCIPGRWRCD YENDCGDGSD ELNCQMRNCS ESEFRCGTGK 

      3670       3680       3690       3700       3710       3720 
CIKHNYRCDG EIHCDDNSDE INCNITCKEN QFKCAAFNTC INKQYKCDGD DDCPDGSDEV 

      3730       3740       3750       3760       3770       3780 
NCTCHSDHFS CGNGKCIMSR WKCDGWDDCL DGSDESLETC AKTHCHANAF KCRNQLCVRN 

      3790       3800       3810       3820       3830       3840 
SALCDGINDC GENEDESDAV CAALPKCRHD QFQCENDDCI SKAFRCDGQY NCVDGSDEMN 

      3850       3860       3870       3880       3890       3900 
CQPPVCGFGS CSQICIEKKA GHYNCKCADG YHKGPEKNAT CLASGPDQIL LLASEQEFRF 

      3910       3920       3930       3940       3950       3960 
ILPAKQEGTT VVGFFQTDSL KIDVFDILIR PKDTLLFWID SHHGKVHTMK IATPHVEGTG 

      3970       3980       3990       4000       4010       4020 
VRVRRDLKEL TAFNIPELDD PKSLAVDWIS QRVYIIDSRH NQILATDIEG KKYISLVSTG 

      4030       4040       4050       4060       4070       4080 
MNPTDIVLEP ESRIMIWSTL ENGILVASLD GSNKKSLVER DVGWPISLSM DYPTGRLYWA 

      4090       4100       4110       4120       4130       4140 
DYRKGTIETC RLNGKDRNVV RRFGNREKPQ KIDVFEDYLY IKLYDQSIIK MNKFGNDNGT 

      4150       4160       4170       4180       4190       4200 
YLLKGYRSSD IGILHPMKQN RNISNPCAKD PCKSSRALCI LSSESSVGYS CKCAEGYVMT 

      4210       4220       4230       4240       4250       4260 
DDGVCKAHAD IPDYCPLQCN LGTCKIVDHV PKCICQPQFE GELCEHYRCS GYCQNYGVCS 

      4270       4280       4290       4300       4310       4320 
VAPALPGSQE PPPLKCTCTA GWSGARCETS MPACQSRCHN GGSCLISETE GMKCSCPKMF 

      4330       4340       4350       4360       4370       4380 
TGEQCEHCRN LTCENGGICR ETLTGTPQCE CPDGFTGKRC EIDECADFCK NGGSCVISTK 

      4390       4400       4410       4420       4430       4440 
GQRQCKCPSG YFGEHCESNS CRDFCRNGGT CSERGGRLSC TCPPRYIGES CESDLCKTSS 

      4450       4460       4470       4480       4490       4500 
PPHFCDNTKV PTRDPCTLMI CQNAGTCHII KGVALCNCTD QWNGDLCTLP VTDDNPCARY 

      4510       4520       4530       4540       4550       4560 
CANGGVCHLD EYRLPHCSCI GEWQGNACEM PPHCVGGECN VCRPGSSINE CLCENNRVVP 

      4570       4580       4590       4600       4610       4620 
CLSDSADALK EEQEPTESGG VFSVVVLVLA VILLVFALFA GAVYFLKKHR IAQPFSHARL 

      4630       4640       4650       4660       4670       4680 
TDNVEIMLTN AMYRGDADEA PTFASEDDKG NFANPVYESM YADAIPEPVS TEITHSTAPD 

      4690 
ERKGLLQHTH DENHTPDIL 

« Hide

References

[1]"Annotation of the Drosophila melanogaster euchromatic genome: a systematic review."
Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., Bettencourt B.R., Celniker S.E., de Grey A.D.N.J. expand/collapse author list , Drysdale R.A., Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.
Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002) [PubMed: 12537572] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Berkeley.
[2]"The genome sequence of Drosophila melanogaster."
Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., Sutton G.G., Wortman J.R., Yandell M.D. expand/collapse author list , Zhang Q., Chen L.X., Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.
Science 287:2185-2195(2000) [PubMed: 10731132] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Berkeley.
[3]"Identification of N-glycosylated proteins from the central nervous system of Drosophila melanogaster."
Koles K., Lim J.-M., Aoki K., Porterfield M., Tiemeyer M., Wells L., Panin V.
Glycobiology 17:1388-1403(2007) [PubMed: 17893096] [Abstract]
Cited for: GLYCOSYLATION [LARGE SCALE ANALYSIS], MASS SPECTROMETRY.
[4]"Phosphoproteome analysis of Drosophila melanogaster embryos."
Zhai B., Villen J., Beausoleil S.A., Mintseris J., Gygi S.P.
J. Proteome Res. 7:1675-1682(2008) [PubMed: 18327897] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS], MASS SPECTROMETRY.

Cross-references

Sequence databases

AE013599 Genomic DNA. Translation: AAF59114.3.
RefSeqNP_788284.1.
UniGeneDm.19876

3D structure databases

ModBaseSearch...

Proteomic databases

PRIDEA1Z7C4.

Genome annotation databases

EnsemblFBgn0053087. Drosophila melanogaster. [Contig view]
GeneID35799.
KEGGdme:Dmel_CG33087.
NMPDRfig|7227.3.peg.3843.

Organism-specific databases

FlyBaseFBgn0053087. CG33087.

Phylogenomic databases

OMAA1Z7C4. RPMGIVA.

Family and domain databases

InterProIPR011042. 6-blade_b-propeller_TolB-like.
IPR017871. ABC_transporter_CS.
IPR006209. EGF.
IPR006210. EGF-like.
IPR013032. EGF-like_reg_CS.
IPR000152. EGF-type_Asp/Asn_hydroxyl_CS.
IPR000742. EGF_3.
IPR001881. EGF_Ca_bd.
IPR013091. EGF_Ca_bd_2.
IPR018097. EGF_Ca_bd_CS.
IPR003645. Fol_N.
IPR002172. LDL_rcpt_classA_cys-rich.
IPR000033. LDLR.
[Graphical view]
Gene3DG3DSA:2.120.10.30. 6-blade_b-propeller_TolB-like. 8 hits.
G3DSA:4.10.400.10. LDL_rcpt_classA_cys-rich. 23 hits.
PfamPF00008. EGF. 7 hits.
PF07645. EGF_CA. 2 hits.
PF00057. Ldl_recept_a. 23 hits.
PF00058. Ldl_recept_b. 27 hits.
[Graphical view]
PRINTSPR00261. LDLRECEPTOR.
SMARTSM00181. EGF. 19 hits.
SM00179. EGF_CA. 3 hits.
SM00274. FOLN. 4 hits.
SM00192. LDLa. 31 hits.
SM00135. LY. 32 hits.
[Graphical view]
PROSITEPS00211. ABC_TRANSPORTER_1. 1 hit. Uncertain.
PS00010. ASX_HYDROXYL. 4 hits.
PS00022. EGF_1. 8 hits.
PS01186. EGF_2. 9 hits.
PS50026. EGF_3. 11 hits.
PS01187. EGF_CA. 3 hits.
PS01209. LDLRA_1. 22 hits.
PS50068. LDLRA_2. 29 hits.
PS51120. LDLRB. 33 hits.
[Graphical view]
ProtoNetSearch...

Other Resources

NextBio795266.

Entry information

Entry nameA1Z7C4_DROME
AccessionPrimary (citable) accession number: A1Z7C4
Entry history
Integrated into UniProtKB/TrEMBL: February 6, 2007
Last sequence update: February 6, 2007
Last modified: June 16, 2009
This is version 34 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)
Names and origin · Protein attributes · Ontologies · Sequences · References · Cross-references · Entry information