Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Collagen and calcium-binding EGF domain-containing protein 1

Gene

Ccbe1

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at transcript leveli

Functioni

Required for lymphangioblast budding and angiogenic sprouting from venous endothelium during embryogenesis.By similarity

GO - Molecular functioni

GO - Biological processi

  • lung development Source: MGI
  • lymphangiogenesis Source: UniProtKB
  • lymph vessel development Source: MGI
  • positive regulation of angiogenesis Source: BHF-UCL
  • positive regulation of endothelial cell migration Source: MGI
  • positive regulation of lymphangiogenesis Source: BHF-UCL
  • positive regulation of protein processing Source: MGI
  • positive regulation of vascular endothelial growth factor production Source: MGI
  • positive regulation of vascular endothelial growth factor signaling pathway Source: MGI
  • respiratory gaseous exchange Source: MGI
  • respiratory system process Source: MGI
  • sprouting angiogenesis Source: UniProtKB
  • venous blood vessel morphogenesis Source: UniProtKB
Complete GO annotation...

Keywords - Molecular functioni

Developmental protein

Keywords - Biological processi

Angiogenesis

Keywords - Ligandi

Calcium

Names & Taxonomyi

Protein namesi
Recommended name:
Collagen and calcium-binding EGF domain-containing protein 1
Alternative name(s):
Full of fluid protein homolog
Gene namesi
Name:Ccbe1
Synonyms:Kiaa1983
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 18

Organism-specific databases

MGIiMGI:2445053. Ccbe1.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Secreted

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Signal peptidei1 – 3535Sequence analysisAdd
BLAST
Chaini36 – 408373Collagen and calcium-binding EGF domain-containing protein 1PRO_0000279517Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Disulfide bondi139 ↔ 151PROSITE-ProRule annotation
Glycosylationi143 – 1431N-linked (GlcNAc...)Sequence analysis
Disulfide bondi147 ↔ 160PROSITE-ProRule annotation
Disulfide bondi162 ↔ 175PROSITE-ProRule annotation
Glycosylationi183 – 1831N-linked (GlcNAc...)Sequence analysis

Keywords - PTMi

Disulfide bond, Glycoprotein

Proteomic databases

MaxQBiQ3MI99.
PaxDbiQ3MI99.
PRIDEiQ3MI99.

PTM databases

iPTMnetiQ3MI99.
PhosphoSiteiQ3MI99.

Expressioni

Gene expression databases

BgeeiQ3MI99.
CleanExiMM_CCBE1.
GenevisibleiQ3MI99. MM.

Interactioni

GO - Molecular functioni

Protein-protein interaction databases

STRINGi10090.ENSMUSP00000052011.

Structurei

3D structure databases

ProteinModelPortaliQ3MI99.
SMRiQ3MI99. Positions 94-180, 313-338.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini135 – 17642EGF-like; calcium-bindingPROSITE-ProRule annotationAdd
BLAST
Domaini247 – 29246Collagen-like 1Add
BLAST
Domaini302 – 33534Collagen-like 2Add
BLAST

Sequence similaritiesi

Belongs to the CCBE1 family.Curated
Contains 2 collagen-like domains.Curated
Contains 1 EGF-like domain.PROSITE-ProRule annotation

Keywords - Domaini

Collagen, EGF-like domain, Repeat, Signal

Phylogenomic databases

eggNOGiENOG410IRWI. Eukaryota.
ENOG4111SKA. LUCA.
GeneTreeiENSGT00390000014907.
HOGENOMiHOG000111400.
HOVERGENiHBG081035.
InParanoidiQ3MI99.
KOiK19638.
OMAiTSNETLC.
OrthoDBiEOG73BVDG.
PhylomeDBiQ3MI99.
TreeFamiTF333138.

Family and domain databases

InterProiIPR008160. Collagen.
IPR001881. EGF-like_Ca-bd_dom.
IPR013032. EGF-like_CS.
IPR000742. EGF-like_dom.
IPR000152. EGF-type_Asp/Asn_hydroxyl_site.
IPR018097. EGF_Ca-bd_CS.
[Graphical view]
PfamiPF01391. Collagen. 1 hit.
[Graphical view]
SMARTiSM00181. EGF. 2 hits.
SM00179. EGF_CA. 2 hits.
[Graphical view]
PROSITEiPS00010. ASX_HYDROXYL. 1 hit.
PS01186. EGF_2. 1 hit.
PS50026. EGF_3. 1 hit.
PS01187. EGF_CA. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

Q3MI99-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MVPPPLPSRG GAAKRQLGKS LGPLLLLLAL GHTWTYREEP EDRDREVCSE
60 70 80 90 100
NKITTTKYPC LKSSGELTTC FRKKCCKGYK FVLGQCIPED YDICAQAPCE
110 120 130 140 150
QQCTDNFGRV LCTCYPGYRY DRERHQKRER PYCLDIDECA TSNTTLCAHI
160 170 180 190 200
CINTMGSYHC ECREGYILED DGRTCTRGDK YPNDTGHEEK SENEVKAGTC
210 220 230 240 250
CATCKEFSQM KQTVLQLKQK MALLPNNAAE LGKYVNGDKV LASNAYLPGP
260 270 280 290 300
PGLPGGQGPP GSPGPKGSPG FPGMPGPPGQ PGPRGSMGPM GPSPDLSHIK
310 320 330 340 350
QGRRGPVGPP GAPGRHGSKG ERGAPGPPGS PGPPGSFDFL LLVLADIRND
360 370 380 390 400
IAELQEKVFG HRTHSSAEDF PLPQEFSSYP ETLDFGSGDD YSRRTEARDP

EAPRNFYP
Length:408
Mass (Da):44,357
Last modified:May 5, 2009 - v2
Checksum:iE600004B6445EE88
GO

Sequence cautioni

The sequence BAD90476.1 differs from that shown. Reason: Erroneous initiation. Curated

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti77 – 771K → E in BAD90476 (Ref. 2) Curated
Sequence conflicti277 – 2771P → L in AAI03804 (PubMed:15489334).Curated
Sequence conflicti284 – 2841R → Q in BAC25916 (PubMed:16141072).Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK028377 mRNA. Translation: BAC25916.1.
AK035153 mRNA. Translation: BAC28962.1.
AK039742 mRNA. Translation: BAC30435.1.
AK220435 mRNA. Translation: BAD90476.1. Different initiation.
BC103803 mRNA. Translation: AAI03804.1.
BC152322 mRNA. Translation: AAI52323.1.
CCDSiCCDS29314.1.
RefSeqiNP_848908.1. NM_178793.4.
UniGeneiMm.442049.

Genome annotation databases

EnsembliENSMUST00000061103; ENSMUSP00000052011; ENSMUSG00000046318.
ENSMUST00000130300; ENSMUSP00000117636; ENSMUSG00000046318.
GeneIDi320924.
KEGGimmu:320924.
UCSCiuc008ffr.1. mouse.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK028377 mRNA. Translation: BAC25916.1.
AK035153 mRNA. Translation: BAC28962.1.
AK039742 mRNA. Translation: BAC30435.1.
AK220435 mRNA. Translation: BAD90476.1. Different initiation.
BC103803 mRNA. Translation: AAI03804.1.
BC152322 mRNA. Translation: AAI52323.1.
CCDSiCCDS29314.1.
RefSeqiNP_848908.1. NM_178793.4.
UniGeneiMm.442049.

3D structure databases

ProteinModelPortaliQ3MI99.
SMRiQ3MI99. Positions 94-180, 313-338.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi10090.ENSMUSP00000052011.

PTM databases

iPTMnetiQ3MI99.
PhosphoSiteiQ3MI99.

Proteomic databases

MaxQBiQ3MI99.
PaxDbiQ3MI99.
PRIDEiQ3MI99.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000061103; ENSMUSP00000052011; ENSMUSG00000046318.
ENSMUST00000130300; ENSMUSP00000117636; ENSMUSG00000046318.
GeneIDi320924.
KEGGimmu:320924.
UCSCiuc008ffr.1. mouse.

Organism-specific databases

CTDi147372.
MGIiMGI:2445053. Ccbe1.
RougeiSearch...

Phylogenomic databases

eggNOGiENOG410IRWI. Eukaryota.
ENOG4111SKA. LUCA.
GeneTreeiENSGT00390000014907.
HOGENOMiHOG000111400.
HOVERGENiHBG081035.
InParanoidiQ3MI99.
KOiK19638.
OMAiTSNETLC.
OrthoDBiEOG73BVDG.
PhylomeDBiQ3MI99.
TreeFamiTF333138.

Miscellaneous databases

PROiQ3MI99.
SOURCEiSearch...

Gene expression databases

BgeeiQ3MI99.
CleanExiMM_CCBE1.
GenevisibleiQ3MI99. MM.

Family and domain databases

InterProiIPR008160. Collagen.
IPR001881. EGF-like_Ca-bd_dom.
IPR013032. EGF-like_CS.
IPR000742. EGF-like_dom.
IPR000152. EGF-type_Asp/Asn_hydroxyl_site.
IPR018097. EGF_Ca-bd_CS.
[Graphical view]
PfamiPF01391. Collagen. 1 hit.
[Graphical view]
SMARTiSM00181. EGF. 2 hits.
SM00179. EGF_CA. 2 hits.
[Graphical view]
PROSITEiPS00010. ASX_HYDROXYL. 1 hit.
PS01186. EGF_2. 1 hit.
PS50026. EGF_3. 1 hit.
PS01187. EGF_CA. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

  1. "The transcriptional landscape of the mammalian genome."
    Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J.
    , Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
    Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Strain: C57BL/6J.
    Tissue: Embryo, Placenta and Spinal cord.
  2. "Prediction of the coding sequences of mouse homologues of KIAA gene. The complete nucleotide sequences of mouse KIAA-homologous cDNAs identified by screening of terminal sequences of cDNA clones randomly sampled from size-fractionated libraries."
    Okazaki N., Kikuno R.F., Ohara R., Inamoto S., Nagase T., Ohara O., Koga H.
    Submitted (FEB-2005) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Tissue: Brain.
  3. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Strain: CD-1.
    Tissue: Neural stem cell.

Entry informationi

Entry nameiCCBE1_MOUSE
AccessioniPrimary (citable) accession number: Q3MI99
Secondary accession number(s): A7MCU5
, Q5DTT5, Q8BFW1, Q8BMT1
Entry historyi
Integrated into UniProtKB/Swiss-Prot: March 6, 2007
Last sequence update: May 5, 2009
Last modified: June 8, 2016
This is version 94 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.