ID C3Y5K5_BRAFL Unreviewed; 5576 AA. AC C3Y5K5; DT 28-JUL-2009, integrated into UniProtKB/TrEMBL. DT 28-JUL-2009, sequence version 1. DT 27-MAR-2024, entry version 89. DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EEN64252.1}; GN ORFNames=BRAFLDRAFT_125057 {ECO:0000313|EMBL:EEN64252.1}; OS Branchiostoma floridae (Florida lancelet) (Amphioxus). OC Eukaryota; Metazoa; Chordata; Cephalochordata; Leptocardii; Amphioxiformes; OC Branchiostomatidae; Branchiostoma. OX NCBI_TaxID=7739; RN [1] {ECO:0000313|EMBL:EEN64252.1} RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=S238N-H82 {ECO:0000313|EMBL:EEN64252.1}; RC TISSUE=Testes {ECO:0000313|EMBL:EEN64252.1}; RX PubMed=18563158; DOI=10.1038/nature06967; RG US DOE Joint Genome Institute (JGI-PGF); RA Putnam N.H., Butts T., Ferrier D.E.K., Furlong R.F., Hellsten U., RA Kawashima T., Robinson-Rechavi M., Shoguchi E., Terry A., Yu J.-K., RA Benito-Gutierrez E.L., Dubchak I., Garcia-Fernandez J., Gibson-Brown J.J., RA Grigoriev I.V., Horton A.C., de Jong P.J., Jurka J., Kapitonov V.V., RA Kohara Y., Kuroki Y., Lindquist E., Lucas S., Osoegawa K., Pennacchio L.A., RA Salamov A.A., Satou Y., Sauka-Spengler T., Schmutz J., Shin-I T., RA Toyoda A., Bronner-Fraser M., Fujiyama A., Holland L.Z., Holland P.W.H., RA Satoh N., Rokhsar D.S.; RT "The amphioxus genome and the evolution of the chordate karyotype."; RL Nature 453:1064-1071(2008). CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}. CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; GG666487; EEN64252.1; -; Genomic_DNA. DR RefSeq; XP_002608242.1; XM_002608196.1. DR STRING; 7739.C3Y5K5; -. DR eggNOG; KOG1216; Eukaryota. DR eggNOG; KOG1217; Eukaryota. DR InParanoid; C3Y5K5; -. DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central. DR GO; GO:0005615; C:extracellular space; IBA:GO_Central. DR CDD; cd19941; TIL; 4. DR Gene3D; 2.10.25.10; Laminin; 6. DR InterPro; IPR006207; Cys_knot_C. DR InterPro; IPR000742; EGF-like_dom. DR InterPro; IPR036084; Ser_inhib-like_sf. DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom. DR InterPro; IPR001007; VWF_dom. DR InterPro; IPR001846; VWF_type-D. DR InterPro; IPR025155; WxxW_domain. DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1. DR PANTHER; PTHR11339:SF373; HEMOLECTIN, ISOFORM A; 1. DR Pfam; PF08742; C8; 4. DR Pfam; PF13330; Mucin2_WxxW; 12. DR Pfam; PF00094; VWD; 4. DR SMART; SM00832; C8; 4. DR SMART; SM00041; CT; 1. DR SMART; SM00181; EGF; 5. DR SMART; SM00214; VWC; 7. DR SMART; SM00215; VWC_out; 2. DR SMART; SM00216; VWD; 4. DR SUPFAM; SSF57567; Serine protease inhibitors; 4. DR PROSITE; PS01225; CTCK_2; 1. DR PROSITE; PS00022; EGF_1; 3. DR PROSITE; PS01186; EGF_2; 2. DR PROSITE; PS50026; EGF_3; 3. DR PROSITE; PS01208; VWFC_1; 2. DR PROSITE; PS50184; VWFC_2; 5. DR PROSITE; PS51233; VWFD; 4. PE 4: Predicted; KW Copper {ECO:0000256|ARBA:ARBA00023008}; KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE- KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076}; KW Repeat {ECO:0000256|ARBA:ARBA00022737}; KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}. FT SIGNAL 1..23 FT /evidence="ECO:0000256|SAM:SignalP" FT CHAIN 24..5576 FT /evidence="ECO:0000256|SAM:SignalP" FT /id="PRO_5002935236" FT DOMAIN 181..213 FT /note="EGF-like" FT /evidence="ECO:0000259|PROSITE:PS50026" FT DOMAIN 244..276 FT /note="EGF-like" FT /evidence="ECO:0000259|PROSITE:PS50026" FT DOMAIN 308..340 FT /note="EGF-like" FT /evidence="ECO:0000259|PROSITE:PS50026" FT DOMAIN 363..547 FT /note="VWFD" FT /evidence="ECO:0000259|PROSITE:PS51233" FT DOMAIN 736..935 FT /note="VWFD" FT /evidence="ECO:0000259|PROSITE:PS51233" FT DOMAIN 1168..1342 FT /note="VWFD" FT /evidence="ECO:0000259|PROSITE:PS51233" FT DOMAIN 2991..3064 FT /note="VWFC" FT /evidence="ECO:0000259|PROSITE:PS50184" FT DOMAIN 3372..3443 FT /note="VWFC" FT /evidence="ECO:0000259|PROSITE:PS50184" FT DOMAIN 4622..4797 FT /note="VWFD" FT /evidence="ECO:0000259|PROSITE:PS51233" FT DOMAIN 4945..5018 FT /note="VWFC" FT /evidence="ECO:0000259|PROSITE:PS50184" FT DOMAIN 5325..5396 FT /note="VWFC" FT /evidence="ECO:0000259|PROSITE:PS50184" FT DOMAIN 5399..5476 FT /note="VWFC" FT /evidence="ECO:0000259|PROSITE:PS50184" FT DOMAIN 5483..5570 FT /note="CTCK" FT /evidence="ECO:0000259|PROSITE:PS01225" FT REGION 146..167 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1542..1579 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1668..1699 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 1819..1962 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2082..2145 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2433..2457 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2477..2556 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 2690..2713 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 3762..3797 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 3920..3944 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 3964..4043 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT REGION 4177..4201 FT /note="Disordered" FT /evidence="ECO:0000256|SAM:MobiDB-lite" FT DISULFID 185..195 FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076" FT DISULFID 203..212 FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076" FT DISULFID 248..258 FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076" FT DISULFID 266..275 FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076" FT DISULFID 312..322 FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076" FT DISULFID 330..339 FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076" FT DISULFID 5508..5562 FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039" FT DISULFID 5512..5564 FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039" SQ SEQUENCE 5576 AA; 609789 MW; AE26F0E7433B4612 CRC64; MAIGRAAALS TLLIFAAVLS TEGCGVAVNK NQEASSREIH RVRREAPETI VDDVKAVNED LPHEARKLLW HSEDINDRDG NKKSRTKRDL SWPSSSLGTW SDLLSGLTAE AEAEPTAGDV LSQVQAEAEP SLDDLLSQAQ AQVQAEPSID EFLEQSQSQS QGQSQSEFQG ISQDIQIVNG TIVYCQQDCE NGGVCTGLNV CRCAPGWTGT YCTDHMCTQT CVHGSCSGPN TCLCDEGYQG ETCEQAICNP VCANGGVCTA PNECQCEEGY EGDQCQTAIC HSTCENGGYC SRPDFCTCQI GYAGQQCETP ICTLECQNGG VCTEPGRCSC PSSFGGPQCQ YDLYQDDGIT QVQTTPAPDH SGRFCGAFGH QHYFTFDGHF YTFPGTCQYL LSGDCLQATF QVFVRNDPNC TTSSLLCHRE VRVHMMGLSH DLILHQGHNY EVTKGDVTLS LPNSEEGVKV EKVGDYVRVI LMPEFDGKLV RVAVYWDGVT SVYVEVDEDF AGNLCGLCGN FNGNSSDEYR LRDGTASSSR ITFANSWKMT DAEEHCPNVH VNTPNSCASA TQQQLQEIWG ICNVLIGNAA FAPCHAVVDP NPYLDACVTD LCNCDYAVRG DCQCEALTQY SRACAHRRIV LDWRQPELCY RGCPATMVYT ECASSCPRTC RNPTGDHDCD DHCVDGCSCP EGSLWDEGSL QCVVEQECPC THLGMEYAPG SNYLDDCNKY YCLAGRWIGT NLECNATCSI HGDPHITTFD KRYYQFAGVC QYIVAKDFVN QQFTLLADQQ PCGDDDSSSC IRSVTLIVGG NTAGKIKLHQ HGVLSVGHSD VHLQLPYHNA LKGHTKGLCG TFNDKQNDDF TTRSGVIVAN VAEFGNSWKV SADCEDQPVV TSDLQLGPCA INTQRAEYAR QHCGLMSGGS FEACNRLLEP ATYITACEYD VCQCQDGDEC LCSAIAHYAR ECAKLGAVVN WRTEGTCVEE CPQVGQVWQE CSSACRSSCR YLSEPDSGCT EDCVAGCNCP PGLYQSESGA CVPQEECQCF YQGEVYQTQT STVLGNLICF CEDGIMNCQE RVIMSSEYNC TSGMEYFDCE GAAAGQMGKA CEETCGNIGM ECLAEQCVSG CQCPHGLVRD GSECVLPEEC RCEHNGQMYD PGQTISVDCN SCTCSQGVWS CTTKECAAQC SMFGNSHYTT FDGHSYEFEG TCKYIIAQDY CYNQTGSFRI HAEKTACNGL SGNVCARTVT VTLQQLQIHL EHGKDVHVGP IPGSEVAYTS YKFNIYRSGF FTIIKVENGI DLYWDNATRL YIKVWPNHRG RVCGMCGNFD GNQINDFNTP ELDRATTVQD FANSWKVSSS CPDVEAPLSD YCALHPHREA WARRQCNIIL SDTFSPCHYK VDPEPYYQAC VQDSCSCDSG GDCECFCTAV AAYGDMCNTQ DVHIRWRTHE LCPTQCEDYN WDPEKCEWHY DPCGTSCPAT CEDPHPSSCD LQCMEGCHPK CPNGTVLYNG KCIPPMDCPV TTTTPLPTTT AFIPTTVSTT LLPTTTESIA TTTLEVTTAS ETTPMPTTTR GEGGGKTTTA APTTEVTPTG SPSPPVQTTT EATTIQSTTT FSTTTPEVTT PCEHAEVCYW TSWINSDLPR NYRKILSALI MSGLYCDASQ IPGFFGVCYD YEIRVKCCWE ECVTPTSTEY PPTTTESTTV STTTEPTTTE STTSTVSTTT QPTTTESIVT TTPCEHVEEM CEWSDWMNKD EPSDGNPNQN ETYDNLRDIY EFCEEPMDIE CSVAGMEGTQ MSLPEGVSCD QATGLVCYAS QLPGKGQVCE DYQIRVKCCS EKCVTPVPTT TRPTTTPTTT VSTTTQPTTT ETTTVSTTQP TTTETTTVST TTQPTTTETT SSTTVMTPTG SPSPTQPPTT ESTTVSTTSQ PTTTESTTVS TTSQPTTTES TTVSTTSQPT TTESTTSTVS TTTQSTTTES TTISTTQPTT TETTTTIVST TPCEHVQEMC QWSDWMNKEY PNPNSPSQNE TYANLRDTYD FCDTPMDIQC GISGIPMNTP FESIAQEGVM CDINSGLYCD ASQISGFFGV CYDYEIRVKC CWEECVTPTT TRPTTTESTT VSTEYQPTTT ESTTVSTTTE PTTTESTTVS TEYQPTTTES TTVSTTTKPT TTESTTVSTE YQPTTTESTT VATTTKPTTT ESTTVSTEYQ PTTTESTTVA TTTKPTTTES TTVSTEYQPT TTESTTVTVA TTTKPTTTES TTVSTEYQPT TTESTTVATT TKPTTTESTT VSTTTQPTTT ETTTVFTTQP TTTESTTVST EYQPTTSEST TVSTTTKPTT TESTTVSTTI ITTTPCEHVE EMCEWSDWMN KDEPSGGNPN QNETYDNLRD TYEFCEAPMD IECSVAGMEG TQMSLPEGVS CDQATGLVCY ASQLPGKGQV CEDYQIRVKC CSEKCVTPVP TTTRPTTTPT TTVSTTTQPT TTETTTVSTT TQPTTTEATT SITTEGTTSI ATPTGSPSPH QTTTISTTPI TTSTHSTTQP PTTTAYRPTT TKMTTTQPTT TRTETTHPTT TEEYTTPPVP TTTEMQTTQP TTTMQTTQPT TTKLETTPTS CGYVCNWTDW MNSYNPSTDM EMNDVETLEN LHSRFSFCET PMGIECRLAE NPSLDFIDHH QNGVTCEVDS GLYCNSSLTN MRACMDYEVR FQCCHVPDSC KTTTTMTTGP TTTTGPTTTV TTEKPPPTTA TTTVTTEKPT TTIATTTMTT EKLTTAPTTT VSTTTEGTTT VETTTEYCQE TCVWSEWMNS DYPGXGVQDN HHTNYNNLYN NPANYHTDNN SFYNSPAHHC RIHHRFYKNP AHYHRDNFKY NSYDSNWFSI TNPTTHYRIH HCFHDNPAHH YRIHHCFHDS PVNHPNVLWV TCEVDSGLYC NSSLTNMRAC MDYEVRFQCC HVPDSCKTTT TMTTEVPTTT TVTTKRPTTT INTESTTVSM TTQPTTTEST TVSTTTQPTT TESTTVSTTA LPTTTAFLPP NGCEDANGMP LAIGDTWIPY NDSCQECTCT GHKKTECYPR ACPTYRPPKC GVCESQRVVE GSDSCCPEYE CVCDLNRADC PEVVIPQCEY QYQYVHHTNP GECVPEYECR CNSSKCPAAP QCEPPKVLDM LEGECCLEYE CVCDACPTAP TPYCPPGEGY VLVSSEDICS CVTHSCECRN DTCAQQPECD DNKDLVTTET GCCPHYNCTC NTCPTHGPVI NCDYAEGYVL VSSQDDCGCV TESCECREDS CSPEPECPDN KHVVTVETGC CPHYNCSCNS CPPSPTPGSD CPYGEMPGYV IHETVDDCGC VNSSCTCDRS RCPAPPTCDD NKNLVALYTP CCQNYTCECK ECPVDDTSCG IGETQTTIID DCCRHTNCTP EPVCVYQNTT HQPGTSWTDA NDNCIQCQCL EEIDPATGFH RVSCSDESSQ CEVNCPACHT YQERSGECCG ECVKTSCCVM DDDETLVEHP VSLQMPAYLC DLNRADCPEV VIPQCEYQYQ YVHHTNPGEC VPEYECRCNS SQCPAAPQCE PPKVMDMSEG ECCLEYECVC DACPTASTPY CPPGEGYVLV SSEDICGCVT QSCQCRKDTC AQQPECDDNK HLITTETDCC PHYNCTCDTC PPRPVVDCNY AEGYVLVSSQ DDCGCVTESC ECREDSCSPE PECPDNKHVV TMETGCCPHY NCSCNTCPPS PTPGSDCPYG EVTCEVDSGL YCDASLTNVG ACMDYEVRFQ CCHVPDTCKT TTTVTTEVPT TTTTVTTERP LTTITTESTT VSTTTQPTTT ESTTISTTTQ PTTTESTTVS TTTQPTTTES TTVSTTIIST TPCEHVEEMC EWSDWMNKDE PSGGNPNQNE TYDNLRDIYE FCEAPMDIEC SVAGMEGTQM SLPEGVSCDQ ATGLVCYASQ LPGKGQVCED YQIRVKCCSE KCVTPVPTTT RPTTTPTTTV STTTQPTTTE TTTVSTTTQP TTTEATTSIT TEGTTSIATP TGSPSPHQTT TISTTPITTS THSTTQPPTT TAYRPTTTKM TTTQPTTTRT ETTHPTTTEE YTTPPVPTTT EMQTTQPTTT MQTTQPTTTK LETTPTSCGY VCNWTDWMNS YNPSTDMEMN DVETLENLHS RFSFCETPMG IECRLAENPS LDFIDHHQNG VTCEVDSGLY CNSSLTNMRA CMDYEVRFQC CHVPDSCKTT TTMTTGPTTT TGPTTTVTTE KPPPTTATTT VTTEKPTTTI ATTTMTTEKL TTAPTTTVST TTEGTTTVET TTEYCQETCV WSEWMNSDYP GVGVPNDNET FAHLRESSLE FCEEPDDIEC RLDMTPNVDF NRAKQSGVEC DVSSGLLCLS KLQPSLLHDN SEDNHHTNYN NLYNNPANYH TDNNSFYNSP AHHCRIHHRF YKNPAHYHRD NFKYNSYDSN WFSITNPTTH YRIHHCFHDN PAHHYRIHHC FHDSPVNHPN VLWVTCEVDS GLYCNSSLTN MRACMDYEVR FQCCHVPDSC KTTTTMTTEV PTTTTVTTKR PTTTINTEST TVSMTTQPTT TESTTVSTTT QPTTTESTTV STTALPTTTA FLPPSTTECP DVCIDGSGLT RQVGDTWFEA GDQCQRALYL CRPCGVISIN RKVCDVVEPP SCANGLQPID VGVCCPEYQC PCECKGYGDP HYFSFDGEYF YFQGEGEFIL ARDTHVPHDF EVRGFNVQCT VAPITTCTKE IKVIYKGHTI ELKTGHQVLV NGTSWTPPFK LDGCKVTTMG FPLKLEIQSL DVTVVYDFLS SGFYISVPPT MYAGKTEGLC GPCNNNKTDD CQDREGTIMN DYNNCSCDWK VDNPDSPSNS CIPEPKPTAT PSTPCDQSPC DIITDPEGPF GACHDVVDYE FFLQSCQYDR GACYSECQAL GAYAYVCQRM GVCVDWRGKG NNCSFECEAG LVYKACSCVK TCENMDVFNA SQCALAYMET EGCFCPDDMV LHETSEQCIE GDTCNGCEDA NGMPLAIGDT WIPYNDSCQE CTCTGHKKIE CYPRACPTYQ PPKCGVCESQ RVVEGSDSCC PEYECVCDLN RADCPEVVIP QCEYQYQYVH HTNPGECVPE YECRCNSSQC PAAPQCEPPK VMDMSEGECC LEYECVCDAC PTASTPYCPP GEGYVLVSSE DICGCVTQSC QCRKDTCAQQ PECDDNKHLI TTETDCCPHY NCTCDTCPPR PVVDCNYAEG YVLVSSQDDC GCVTESCECR EDSCSPEPEC PDNKHVVTME TGCCPHYNCS CNTCPPSPTP GSDCPYGEMP GYVIHETVDD CGCVNSSCTC DRSRCPAPPT CDDNKNLVAL YTPCCQNYTC ECKECPVDDT SCGIGETQTT VIDDCCRHTN CTPEPVCVYQ NTTHQPGTSW TDANDNCIRC QCLEEIDPAT GFHRVSCSDE SSQCEVNCPA CHTYQERSGE CCGECVKTSC CVMDDDETLV EHPAGSNWTS PGDSCSRCHC LEAGEHGQVV EYCSTLLCPE PPTCSSDEVM KNTSSADGCC TIYECVPPGQ ASCSLFTKQD YLRVDDCVSL QPVNVTWCEG RCGSSSMYEG VDLKHTCECC HEVSMSEVSV PMECEGTTTS SMSYAYRQIT ACDCDTTECE VPTAQP //