
O75095 (MEGF6_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 112.


Names and origin

Protein names
Recommended name: Multiple epidermal growth factor-like domains protein 6
Short name: Multiple EGF-like domains protein 6
Alternative name(s): Epidermal growth factor-like protein 3
Short name: EGF-like protein 3
Gene names
Name: MEGF6
Synonyms: EGFL3, KIAA0815
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo

Protein attributes

Sequence length: 1541 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level

General annotation (Comments)

Subcellular location

Secreted (Potential).

Sequence similarities

Contains 27 EGF-like domains.

Contains 1 EMI domain.

Sequence caution

The sequence BAA32467.2 differs from that shown. Reason: Erroneous initiation.

The sequence BAE19678.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened.

The sequence BAE19678.1 differs from that shown. Reason: Frameshift at position 1522.

Ontologies

Keywords
   Cellular component: Secreted
   Coding sequence diversity: Alternative splicing, Polymorphism
   Domain: EGF-like domain, Repeat, Signal
   Ligand: Calcium
   PTM: Disulfide bond, Glycoprotein
   Technical term: Complete proteome, Reference proteome
Gene Ontology (GO)
   Cellular component: extracellular region
      Inferred from electronic annotation. Source: UniProtKB-SubCell
   Molecular function: calcium ion binding
      Inferred from electronic annotation. Source: InterPro
   Molecular function: protein binding
      Inferred from physical interaction PubMed 21078624. Source: IntAct

Binary interactions

With      Entry    #Exp.  IntAct                  Notes
ATXN7     O15265   2      EBI-947597,EBI-708350
CACNA1A   O00555   2      EBI-947597,EBI-766279

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: O75095-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: O75095-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-159: Missing.
     160-160: Y → MGASRDRGLAALWCLGLLGGLARVAGTHYRYLWRGCYPCHLGQAGYPVSAGDQRP
     989-1075: TCPAHTYGHN...GWAGLACEKE → K
     1161-1205: ACPPGSFGEDCAQMCQCPGENPACHPATGTCSCAAGYHGPSCQQR → G
     1377-1454: PCPPGFHGAG...VTGLCLCPPG → R
Note: No experimental confirmation available.
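
As a quick consistency check, the length of isoform 2 can be derived from the differences listed above without touching the full sequence: each difference changes the length by the replacement length minus the length of the replaced span. The following is a minimal Python sketch; the names CANONICAL_LENGTH, ISOFORM_2_EDITS and edited_length are illustrative and not part of UniProt.

```python
# Length of the canonical isoform (O75095-1), from "Sequence length" in this entry.
CANONICAL_LENGTH = 1541

# (start, end, replacement) for each difference listed for isoform 2.
# Positions are 1-based and inclusive; an empty replacement means "Missing".
ISOFORM_2_EDITS = [
    (1, 159, ""),
    (160, 160, "MGASRDRGLAALWCLGLLGGLARVAGTHYRYLWRGCYPCHLGQAGYPVSAGDQRP"),
    (989, 1075, "K"),
    (1161, 1205, "G"),
    (1377, 1454, "R"),
]


def edited_length(canonical_length, edits):
    """Return the sequence length after replacing each listed span."""
    length = canonical_length
    for start, end, replacement in edits:
        span = end - start + 1  # number of canonical residues replaced
        length += len(replacement) - span
    return length


isoform_2_length = edited_length(CANONICAL_LENGTH, ISOFORM_2_EDITS)
print(isoform_2_length)
```

The result is 1229, which agrees with the length this entry reports for isoform 2.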

Sequence annotation (Features)

Feature key     Position(s)    Length  Description  Feature identifier

Molecule processing

Signal peptide  1 – 30         30      Potential
Chain           31 – 1541      1511    Multiple epidermal growth factor-like domains protein 6  PRO_0000007524

Regions

Domain          44 – 125       82      EMI
Domain          124 – 159      36      EGF-like 1
Domain          161 – 201      41      EGF-like 2; calcium-binding (Potential)
Domain          206 – 242      37      EGF-like 3
Domain          238 – 284      47      EGF-like 4
Domain          285 – 325      41      EGF-like 5; calcium-binding (Potential)
Domain          335 – 370      36      EGF-like 6
Domain          375 – 411      37      EGF-like 7
Domain          412 – 452      41      EGF-like 8; calcium-binding (Potential)
Domain          516 – 552      37      EGF-like 9
Domain          560 – 595      36      EGF-like 10
Domain          603 – 638      36      EGF-like 11
Domain          736 – 770      35      EGF-like 12
Domain          783 – 814      32      EGF-like 13
Domain          822 – 857      36      EGF-like 14
Domain          865 – 901      37      EGF-like 15
Domain          909 – 944      36      EGF-like 16
Domain          955 – 987      33      EGF-like 17
Domain          995 – 1030     36      EGF-like 18
Domain          1038 – 1073    36      EGF-like 19
Domain          1081 – 1116    36      EGF-like 20
Domain          1124 – 1159    36      EGF-like 21
Domain          1211 – 1246    36      EGF-like 22
Domain          1254 – 1289    36      EGF-like 23
Domain          1297 – 1332    36      EGF-like 24
Domain          1345 – 1375    31      EGF-like 25
Domain          1383 – 1418    36      EGF-like 26
Domain          1469 – 1504    36      EGF-like 27
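
The Length column follows from the positions as end - start + 1, and the domain rows can be tallied against the "Sequence similarities" comments (27 EGF-like domains, 1 EMI domain). A minimal Python sketch under those assumptions; DOMAINS is transcribed from the rows above, and the helper names are hypothetical.

```python
# (description, start, end) for each Domain row; positions are 1-based, inclusive.
DOMAINS = [
    ("EMI", 44, 125), ("EGF-like 1", 124, 159), ("EGF-like 2", 161, 201),
    ("EGF-like 3", 206, 242), ("EGF-like 4", 238, 284), ("EGF-like 5", 285, 325),
    ("EGF-like 6", 335, 370), ("EGF-like 7", 375, 411), ("EGF-like 8", 412, 452),
    ("EGF-like 9", 516, 552), ("EGF-like 10", 560, 595), ("EGF-like 11", 603, 638),
    ("EGF-like 12", 736, 770), ("EGF-like 13", 783, 814), ("EGF-like 14", 822, 857),
    ("EGF-like 15", 865, 901), ("EGF-like 16", 909, 944), ("EGF-like 17", 955, 987),
    ("EGF-like 18", 995, 1030), ("EGF-like 19", 1038, 1073), ("EGF-like 20", 1081, 1116),
    ("EGF-like 21", 1124, 1159), ("EGF-like 22", 1211, 1246), ("EGF-like 23", 1254, 1289),
    ("EGF-like 24", 1297, 1332), ("EGF-like 25", 1345, 1375), ("EGF-like 26", 1383, 1418),
    ("EGF-like 27", 1469, 1504),
]


def feature_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


egf_domains = [d for d in DOMAINS if d[0].startswith("EGF-like")]
emi_domains = [d for d in DOMAINS if d[0] == "EMI"]

# Consecutive rows whose coordinate ranges share at least one residue.
overlaps = [
    (a[0], b[0])
    for a, b in zip(DOMAINS, DOMAINS[1:])
    if b[1] <= a[2]
]

print(len(egf_domains), len(emi_domains))
print(overlaps)
```

With the coordinates as listed, two pairs of neighbouring domains overlap: EMI with EGF-like 1, and EGF-like 3 with EGF-like 4.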

Amino acid modifications

Glycosylation   252            1       N-linked (GlcNAc...) (Potential)
Glycosylation   739            1       N-linked (GlcNAc...) (Potential)
Disulfide bond  48 ↔ 111               Potential
Disulfide bond  77 ↔ 83                Potential
Disulfide bond  110 ↔ 123              Potential
Disulfide bond  128 ↔ 139              By similarity
Disulfide bond  133 ↔ 147              By similarity
Disulfide bond  149 ↔ 158              By similarity
Disulfide bond  165 ↔ 176              By similarity
Disulfide bond  172 ↔ 185              By similarity
Disulfide bond  187 ↔ 200              By similarity
Disulfide bond  242 ↔ 255              By similarity
Disulfide bond  248 ↔ 268              By similarity
Disulfide bond  270 ↔ 283              By similarity
Disulfide bond  289 ↔ 300              By similarity
Disulfide bond  296 ↔ 309              By similarity
Disulfide bond  311 ↔ 324              By similarity
Disulfide bond  416 ↔ 427              By similarity
Disulfide bond  423 ↔ 436              By similarity
Disulfide bond  438 ↔ 451              By similarity
Disulfide bond  520 ↔ 533              By similarity
Disulfide bond  527 ↔ 540              By similarity
Disulfide bond  542 ↔ 551              By similarity
Disulfide bond  564 ↔ 576              By similarity
Disulfide bond  570 ↔ 583              By similarity
Disulfide bond  585 ↔ 594              By similarity
Disulfide bond  607 ↔ 619              By similarity
Disulfide bond  613 ↔ 626              By similarity
Disulfide bond  628 ↔ 637              By similarity
Disulfide bond  740 ↔ 751              By similarity
Disulfide bond  744 ↔ 758              By similarity
Disulfide bond  760 ↔ 769              By similarity
Disulfide bond  786 ↔ 795              By similarity
Disulfide bond  789 ↔ 802              By similarity
Disulfide bond  804 ↔ 813              By similarity
Disulfide bond  826 ↔ 838              By similarity
Disulfide bond  832 ↔ 845              By similarity
Disulfide bond  847 ↔ 856              By similarity
Disulfide bond  869 ↔ 882              By similarity
Disulfide bond  873 ↔ 889              By similarity
Disulfide bond  891 ↔ 900              By similarity
Disulfide bond  913 ↔ 925              By similarity
Disulfide bond  919 ↔ 932              By similarity
Disulfide bond  934 ↔ 943              By similarity
Disulfide bond  999 ↔ 1011             By similarity
Disulfide bond  1005 ↔ 1018            By similarity
Disulfide bond  1020 ↔ 1029            By similarity
Disulfide bond  1042 ↔ 1054            By similarity
Disulfide bond  1048 ↔ 1061            By similarity
Disulfide bond  1063 ↔ 1072            By similarity
Disulfide bond  1085 ↔ 1097            By similarity
Disulfide bond  1091 ↔ 1104            By similarity
Disulfide bond  1106 ↔ 1115            By similarity
Disulfide bond  1128 ↔ 1140            By similarity
Disulfide bond  1134 ↔ 1147            By similarity
Disulfide bond  1149 ↔ 1158            By similarity
Disulfide bond  1215 ↔ 1227            By similarity
Disulfide bond  1221 ↔ 1234            By similarity
Disulfide bond  1236 ↔ 1245            By similarity
Disulfide bond  1258 ↔ 1270            By similarity
Disulfide bond  1264 ↔ 1277            By similarity
Disulfide bond  1279 ↔ 1288            By similarity
Disulfide bond  1301 ↔ 1313            By similarity
Disulfide bond  1307 ↔ 1320            By similarity
Disulfide bond  1322 ↔ 1331            By similarity
Disulfide bond  1348 ↔ 1356            By similarity
Disulfide bond  1350 ↔ 1363            By similarity
Disulfide bond  1365 ↔ 1374            By similarity
Disulfide bond  1387 ↔ 1399            By similarity
Disulfide bond  1393 ↔ 1406            By similarity
Disulfide bond  1408 ↔ 1417            By similarity
Disulfide bond  1473 ↔ 1485            By similarity
Disulfide bond  1479 ↔ 1492            By similarity
Disulfide bond  1494 ↔ 1503            By similarity

Natural variations

Alternative sequence  1 – 159        159     Missing in isoform 2.  VSP_037740
Alternative sequence  160            1       Y → MGASRDRGLAALWCLGLLGGLARVAGTHYRYLWRGCYPCHLGQAGYPVSAGDQRP in isoform 2.  VSP_037741
Alternative sequence  989 – 1075     87      TCPAH…ACEKE → K in isoform 2.  VSP_037742
Alternative sequence  1161 – 1205    45      ACPPG…SCQQR → G in isoform 2.  VSP_037743
Alternative sequence  1377 – 1454    78      PCPPG…LCPPG → R in isoform 2.  VSP_037744
Natural variant       115            1       M → T. Corresponds to variant rs7513275.  VAR_059258
Natural variant       131            1       S → G. Ref.1. Corresponds to variant rs2794340.  VAR_058361
Natural variant       313            1       A → V. Corresponds to variant rs11585362.  VAR_059259
Natural variant       587            1       P → L. Corresponds to variant rs947345.  VAR_061155
Natural variant       688            1       L → P. Corresponds to variant rs2821008.  VAR_059260
Natural variant       916            1       R → L. Ref.1. Corresponds to variant rs7553399.  VAR_058362
Natural variant       1137           1       G → A. Ref.1. Corresponds to variant rs4648506.  VAR_058363
Natural variant       1287           1       R → H. Corresponds to variant rs57804877.  VAR_061156
Natural variant       1536           1       G → S. Corresponds to variant rs57484147.  VAR_061157
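
Each natural variant gives a 1-based position in the canonical sequence, the reference residue, and the substituted residue. The following is a minimal Python sketch that checks and applies the two variants falling within the first 180 residues of isoform 1 (copied from the sequence in this entry); the helper name apply_variant is hypothetical and not part of any UniProt tool.

```python
# First 180 residues of isoform 1, in blocks of ten as displayed in this entry.
N_TERMINAL = (
    "MSFLEEARAA GRAVVLALVL LLLPAVPVGA SVPPRPLLPL QPGMPHVCAE QELTLVGRRQ "
    "PCVQALSHTV PVWKAGCGWQ AWCVGHERRT VYYMGYRQVY TTEARTVLRC CRGWMQQPDE "
    "EGCLSAECSA SLCFHGGRCV PGSAQPCHCP PGFQGPRCQY DVDECRTHNG GCQHRCVNTP"
).replace(" ", "")


def apply_variant(sequence, position, reference, alternate):
    """Substitute one residue at a 1-based position, checking the reference."""
    observed = sequence[position - 1]
    if observed != reference:
        raise ValueError(
            f"expected {reference} at position {position}, found {observed}"
        )
    return sequence[:position - 1] + alternate + sequence[position:]


# (position, reference residue, variant residue) for M115T and S131G.
VARIANTS = [(115, "M", "T"), (131, "S", "G")]

for position, reference, alternate in VARIANTS:
    mutated = apply_variant(N_TERMINAL, position, reference, alternate)
    print(position, reference, "->", mutated[position - 1])
```

The reference-residue check is the useful part: it fails loudly if positions are counted against the wrong isoform or a 0-based index.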

Sequences

Isoform 1

Last modified July 28, 2009. Version 4.
Checksum: EF2A7CB65140A7BB
Length: 1,541 AA
Mass: 161,185 Da

        10         20         30         40         50         60 
MSFLEEARAA GRAVVLALVL LLLPAVPVGA SVPPRPLLPL QPGMPHVCAE QELTLVGRRQ 

        70         80         90        100        110        120 
PCVQALSHTV PVWKAGCGWQ AWCVGHERRT VYYMGYRQVY TTEARTVLRC CRGWMQQPDE 

       130        140        150        160        170        180 
EGCLSAECSA SLCFHGGRCV PGSAQPCHCP PGFQGPRCQY DVDECRTHNG GCQHRCVNTP 

       190        200        210        220        230        240 
GSYLCECKPG FRLHTDSRTC LAINSCALGN GGCQHHCVQL TITRHRCQCR PGFQLQEDGR 

       250        260        270        280        290        300 
HCVRRSPCAN RNGSCMHRCQ VVRGLARCEC HVGYQLAADG KACEDVDECA AGLAQCAHGC 

       310        320        330        340        350        360 
LNTQGSFKCV CHAGYELGAD GRQCYRIEME IVNSCEANNG GCSHGCSHTS AGPLCTCPRG 

       370        380        390        400        410        420 
YELDTDQRTC IDVDDCADSP CCQQVCTNNP GGYECGCYAG YRLSADGCGC EDVDECASSR 

       430        440        450        460        470        480 
GGCEHHCTNL AGSFQCSCEA GYRLHEDRRG CSPLEEPMVD LDGELPFVRP LPHIAVLQDE 

       490        500        510        520        530        540 
LPQLFQDDDV GADEEEAELR GEHTLTEKFV CLDDSFGHDC SLTCDDCRNG GTCLLGLDGC 

       550        560        570        580        590        600 
DCPEGWTGLI CNETCPPDTF GKNCSFSCSC QNGGTCDSVT GACRCPPGVS GTNCEDGCPK 

       610        620        630        640        650        660 
GYYGKHCRKK CNCANRGRCH RLYGACLCDP GLYGRFCHLT CPPWAFGPGC SEECQCVQPH 

       670        680        690        700        710        720 
TQSCDKRDGS CSCKAGFRGE RCQAECELGY FGPGCWQACT CPVGVACDSV SGECGKRCPA 

       730        740        750        760        770        780 
GFQGEDCGQE CPVGTFGVNC SSSCSCGGAP CHGVTGQCRC PPGRTGEDCE ADCPEGRWGL 

       790        800        810        820        830        840 
GCQEICPACQ HAARCDPETG ACLCLPGFVG SRCQDVCPAG WYGPSCQTRC SCANDGHCHP 

       850        860        870        880        890        900 
ATGHCSCAPG WTGFSCQRAC DTGHWGPDCS HPCNCSAGHG SCDAISGLCL CEAGYVGPRC 

       910        920        930        940        950        960 
EQQCPQGHFG PGCEQRCQCQ HGAACDHVSG ACTCPAGWRG TFCEHACPAG FFGLDCRSAC 

       970        980        990       1000       1010       1020 
NCTAGAACDA VNGSCLCPAG RRGPRCAETC PAHTYGHNCS QACACFNGAS CDPVHGQCHC 

      1030       1040       1050       1060       1070       1080 
APGWMGPSCL QACPAGLYGD NCRHSCLCQN GGTCDPVSGH CACPEGWAGL ACEKECLPRD 

      1090       1100       1110       1120       1130       1140 
VRAGCRHSGG CLNGGLCDPH TGRCLCPAGW TGDKCQSPCL RGWFGEACAQ RCSCPPGAAC 

      1150       1160       1170       1180       1190       1200 
HHVTGACRCP PGFTGSGCEQ ACPPGSFGED CAQMCQCPGE NPACHPATGT CSCAAGYHGP 

      1210       1220       1230       1240       1250       1260 
SCQQRCPPGR YGPGCEQLCG CLNGGSCDAA TGACRCPTGF LGTDCNLTCP QGRFGPNCTH 

      1270       1280       1290       1300       1310       1320 
VCGCGQGAAC DPVTGTCLCP PGRAGVRCER GCPQNRFGVG CEHTCSCRNG GLCHASNGSC 

      1330       1340       1350       1360       1370       1380 
SCGLGWTGRH CELACPPGRY GAACHLECSC HNNSTCEPAT GTCRCGPGFY GQACEHPCPP 

      1390       1400       1410       1420       1430       1440 
GFHGAGCQGL CWCQHGAPCD PISGRCLCPA GFHGHFCERG CEPGSFGEGC HQRCDCDGGA 

      1450       1460       1470       1480       1490       1500 
PCDPVTGLCL CPPGRSGATC NLDCRRGQFG PSCTLHCDCG GGADCDPVSG QCHCVDGYMG 

      1510       1520       1530       1540 
PTCREGGPLR LPENPSLAQG SAGTLPASSR PTSRSGGPAR H 
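
The display above groups residues in tens, sixty per row, with a ruler line giving the position of the last residue of each group. A minimal Python sketch of turning such a block back into a contiguous string, shown on the last two rows only; the function name parse_numbered_block is an assumption for illustration.

```python
def parse_numbered_block(text):
    """Join residue groups from a numbered display, skipping ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines (every token is a number).
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# Last two rows of the isoform 1 display (positions 1441 to 1541).
TAIL_BLOCK = """
      1450       1460       1470       1480       1490       1500
PCDPVTGLCL CPPGRSGATC NLDCRRGQFG PSCTLHCDCG GGADCDPVSG QCHCVDGYMG

      1510       1520       1530       1540
PTCREGGPLR LPENPSLAQG SAGTLPASSR PTSRSGGPAR H
"""

tail = parse_numbered_block(TAIL_BLOCK)
print(len(tail), tail[-11:])
```

These two rows hold 101 residues, so together with the 1440 residues before them they account for the stated length of 1541.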


Isoform 2

Checksum: C01A97596FFC0A8F
Length: 1,229 AA
Mass: 128,614 Da

References

[1] "Identification of high-molecular-weight proteins with multiple EGF-like motifs by motif-trap screening."
Nakayama M., Nakajima D., Nagase T., Nomura N., Seki N., Ohara O.
Genomics 51:27-34(1998)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1 AND 2), VARIANTS GLY-131; LEU-916 AND ALA-1137.
Tissue: Brain and Spleen.
[2] "Construction of expression-ready cDNA clones for KIAA genes: manual curation of 330 KIAA cDNA clones."
Nakajima D., Okazaki N., Yamakawa H., Kikuno R., Ohara O., Nagase T.
DNA Res. 9:99-106(2002)
Cited for: SEQUENCE REVISION.
[3] "The DNA sequence and biological annotation of human chromosome 1."
Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A., Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C., Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K., Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C., Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W., Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J., Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J., Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y., Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J., Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H., Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L., Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J., Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S., Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K., Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R., Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M., Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S., Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J., Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W., McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N., Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V., Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J., Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E., Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S., Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M., White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H., Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E., Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G., Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.
Nature 441:315-321(2006)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
   AB011539 mRNA. Translation: BAA32467.2. Different initiation.
   AB231860 mRNA. Translation: BAE19678.1. Sequence problems.
   AL512413, AL513320 Genomic DNA. Translation: CAH70834.1.
   AL513320, AL512413 Genomic DNA. Translation: CAI14334.1.
CCDS: CCDS41237.1. [O75095-1]
RefSeq: NP_001400.3. NM_001409.3. [O75095-1]
UniGene: Hs.593645.

3D structure databases

ProteinModelPortal: O75095.
SMR: O75095. Positions 119-473, 511-1513.

Protein-protein interaction databases

BioGrid: 108273. 6 interactions.
IntAct: O75095. 7 interactions.
MINT: MINT-2797336.
STRING: 9606.ENSP00000348982.

PTM databases

PhosphoSite: O75095.

Proteomic databases

PaxDb: O75095.
PRIDE: O75095.

Protocols and materials databases

DNASU: 1953.

Genome annotation databases

Ensembl:
   ENST00000294599; ENSP00000294599; ENSG00000162591. [O75095-2]
   ENST00000356575; ENSP00000348982; ENSG00000162591. [O75095-1]
GeneID: 1953.
KEGG: hsa:1953.
UCSC:
   uc001akk.3. human. [O75095-2]
   uc001akl.3. human. [O75095-1]

Organism-specific databases

CTD: 1953.
GeneCards: GC01M003396.
H-InvDB: HIX0000066.
HGNC: HGNC:3232. MEGF6.
HPA: HPA052129.
MIM: 604266. gene.
neXtProt: NX_O75095.
PharmGKB: PA27665.

Phylogenomic databases

eggNOG: NOG12793.
HOGENOM: HOG000097840.
HOVERGEN: HBG079790.
OMA: CRKKCHC.
OrthoDB: EOG72C50D.
PhylomeDB: O75095.
TreeFam: TF332598.

Gene expression databases

Bgee: O75095.
CleanEx: HS_MEGF6.
Genevestigator: O75095.

Family and domain databases

InterPro:
   IPR000742. EG-like_dom.
   IPR001881. EGF-like_Ca-bd_dom.
   IPR013032. EGF-like_CS.
   IPR000152. EGF-type_Asp/Asn_hydroxyl_site.
   IPR018097. EGF_Ca-bd_CS.
   IPR002049. EGF_laminin.
   IPR011489. EMI_domain.
   IPR009030. Growth_fac_rcpt_N_dom.
Pfam:
   PF07645. EGF_CA. 1 hit.
   PF00053. Laminin_EGF. 5 hits.
SMART:
   SM00181. EGF. 15 hits.
   SM00179. EGF_CA. 3 hits.
   SM00180. EGF_Lam. 10 hits.
SUPFAM:
   SSF57184. SSF57184. 4 hits.
PROSITE:
   PS00010. ASX_HYDROXYL. 4 hits.
   PS00022. EGF_1. 23 hits.
   PS01186. EGF_2. 24 hits.
   PS50026. EGF_3. 23 hits.
   PS01187. EGF_CA. 4 hits.
   PS51041. EMI. 1 hit.

Other

GenomeRNAi: 1953.
NextBio: 7915.
PRO: O75095.

Entry information

Entry name: MEGF6_HUMAN
Accession: Primary (citable) accession number: O75095
Secondary accession number(s): Q4AC86, Q5VV39
Entry history:
Integrated into UniProtKB/Swiss-Prot: April 13, 2004
Last sequence update: July 28, 2009
Last modified: July 9, 2014
This is version 112 of the entry and version 4 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human chromosome 1

Human chromosome 1: entries, gene names and cross-references to MIM