Skip Header

Contribute Send feedback
Read comments (?) or add your own

P55068 (PGCB_RAT) Reviewed, UniProtKB/Swiss-Prot

Last modified December 14, 2011. Version 109. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (1) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Brevican core protein
Alternative name(s):
Brain-enriched hyaluronan-binding protein
Short name=BEHAB
Gene names
Name:Bcan
Synonyms:Behab
OrganismRattus norvegicus (Rat)
Taxonomic identifier10116 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeRattus

Protein attributes

Sequence length883 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

May play a role in the terminally differentiating and the adult nervous system during postnatal development. Could stabilize interactions between hyaluronan (HA) and brain proteoglycans. Isoform 2 may function as a chondroitin sulfate-bearing cell surface receptor.

Subunit structure

Interacts with TNR. Ref.3

Subcellular location

Isoform 1: Secretedextracellular spaceextracellular matrix.

Isoform 2: Membrane; Lipid-anchorGPI-anchor.

Tissue specificity

Brain.

Developmental stage

Isoform 1 increases from day P4 to P64. Isoform 2 increases after day P8.

Post-translational modification

Contains mostly chondroitin sulfate.

The GPI-anchor may be located on Ser-622 of isoform 2 Potential.

Sequence similarities

Belongs to the aggrecan/versican proteoglycan family.

Contains 1 C-type lectin domain.

Contains 1 EGF-like domain.

Contains 1 Ig-like V-type (immunoglobulin-like) domain.

Contains 2 Link domains.

Contains 1 Sushi (CCP/SCR) domain.

Sequence caution

The sequence CAA82215.1 differs from that shown. Reason: Frameshift at position 364.

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: P55068-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: P55068-2)

The sequence of this isoform differs from the canonical sequence as follows:
     625-645: DCIPSPCHNGGTCLEEKEGFR → NSAEGSMPAFLLFLLLQLWDT
     646-883: Missing.
Note: GPI-anchor amidated serine on Ser-622 (Potential).

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 2222 Potential
Chain23 – 883861Brevican core protein
PRO_0000017513

Regions

Domain35 – 154120Ig-like V-type
Domain156 – 25196Link 1
Domain256 – 35398Link 2
Domain622 – 65837EGF-like
Domain658 – 786129C-type lectin
Domain789 – 84961Sushi

Amino acid modifications

Modified residue5461Phosphoserine By similarity
Glycosylation1291N-linked (GlcNAc...) Potential
Glycosylation3361N-linked (GlcNAc...) Potential
Disulfide bond56 ↔ 136 By similarity
Disulfide bond178 ↔ 249 By similarity
Disulfide bond202 ↔ 223 By similarity
Disulfide bond276 ↔ 351 By similarity
Disulfide bond300 ↔ 321 By similarity
Disulfide bond626 ↔ 637 By similarity
Disulfide bond631 ↔ 646 By similarity
Disulfide bond648 ↔ 657 By similarity
Disulfide bond692 ↔ 784 By similarity
Disulfide bond760 ↔ 776 By similarity
Disulfide bond791 ↔ 834 By similarity
Disulfide bond820 ↔ 847 By similarity

Natural variations

Alternative sequence625 – 64521DCIPS…KEGFR → NSAEGSMPAFLLFLLLQLWD T in isoform 2.
VSP_003076
Alternative sequence646 – 883238Missing in isoform 2.
VSP_003077

Experimental info

Sequence conflict51 – 522AL → WV in CAA82215. Ref.4
Sequence conflict5031V → L in AAA87847. Ref.2
Sequence conflict518 – 5192TV → PA in AAA87847. Ref.2
Sequence conflict5261G → R in AAA87847. Ref.2
Sequence conflict5411G → A in AAA87847. Ref.2
Sequence conflict5561R → S in AAA87847. Ref.2
Sequence conflict5731E → A in AAA87847. Ref.2
Sequence conflict5831V → L in AAA87847. Ref.2
Sequence conflict6491V → L in AAA87847. Ref.2
Sequence conflict6701P → A in AAA87847. Ref.2
Sequence conflict7381P → A in AAA87847. Ref.2
Sequence conflict8091R → A in AAA87847. Ref.2

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified November 1, 1997. Version 2.
Checksum: AC7ACC40CB53ED37

FASTA88396,057
        10         20         30         40         50         60 
MIPLLLSLLA ALVLTQAPAA LADDLKEDSS EDRAFRVRIG AAQLRGVLGG ALAIPCHVHH 

        70         80         90        100        110        120 
LRPPPSRRAA PGFPRVKWTF LSGDREVEVL VARGLRVKVN EAYRFRVALP AYPASLTDVS 

       130        140        150        160        170        180 
LVLSELRPND SGVYRCEVQH GIDDSSDAVE VKVKGVVFLY REGSARYAFS FAGAQEACAR 

       190        200        210        220        230        240 
IGARIATPEQ LYAAYLGGYE QCDAGWLSDQ TVRYPIQNPR EACYGDMDGY PGVRNYGVVG 

       250        260        270        280        290        300 
PDDLYDVYCY AEDLNGELFL GAPPGKLTWE EARDYCLERG AQIASTGQLY AAWNGGLDRC 

       310        320        330        340        350        360 
SPGWLADGSV RYPIITPSQR CGGGLPGVKT LFLFPNQTGF PSKQNRFNVY CFRDSAHPSA 

       370        380        390        400        410        420 
FSEASSPASD GLEAIVTVTE KLEELQLPQE AVESESRGAI YSIPITEDGG GGSSTPEDPA 

       430        440        450        460        470        480 
EAPRTPLESE TQSVAPPTGS SEEEGEALEE EERFKDTETP KEEKEQENLW VWPTELSSPL 

       490        500        510        520        530        540 
PTGLETEHSL SQVSPPAQAV LQVGASPSPR PPRVHGPTVE TLQPPGEGSL TSTPDGAREV 

       550        560        570        580        590        600 
GGETGSPELS GVPREREEAG SSSLEDGPSL LPETWAPVGT REVETPSEEK SGRTVLTGTS 

       610        620        630        640        650        660 
VQAQPVLPTD SASRGGVAVA PSSGDCIPSP CHNGGTCLEE KEGFRCLCVP GYGGDLCDVG 

       670        680        690        700        710        720 
LHFCSPGWEP FQGACYKHFS TRRSWEEAES QCRALGAHLT SICTPEEQDF VNDRYREYQW 

       730        740        750        760        770        780 
IGLNDRTIEG DFLWSDGPPL LYENWNPGQP DSYFLSGENC VVMVWHDQGQ WSDVPCNYHL 

       790        800        810        820        830        840 
SYTCKMGLVS CGPPPQLPLA QIFGRPRLRY AVDTVLRYRC RDGLAQRNLP LIRCQENGLW 

       850        860        870        880 
EAPQISCVPR RPARALRSMT APEGPRGQLP RQRKALLTPP SSL 

« Hide

Isoform 2 [UniParc].

Checksum: 1BC9B7FF08970B89
Show »

FASTA64569,085

References

[1]"Brevican, a chondroitin sulfate proteoglycan of rat brain, occurs as secreted and cell surface glycosylphosphatidylinositol-anchored isoforms."
Seidenbecher C.I., Richter K., Rauch U., Faessler R., Garner C.C., Gundelfinger E.D.
J. Biol. Chem. 270:27206-27212(1995) [PubMed: 7592978] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
Strain: Sprague-Dawley.
Tissue: Brain.
[2]"cDNA cloning and the identification of an aggrecanase-like cleavage site in rat brevican."
Yamada H., Watanabe K., Shimonaka M., Yamasaki M., Yamaguchi Y.
Biochem. Biophys. Res. Commun. 216:957-963(1995) [PubMed: 7488217] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA], PROTEIN SEQUENCE OF 396-407.
Tissue: Brain.
[3]"The C-type lectin domains of lecticans, a family of aggregating chondroitin sulfate proteoglycans, bind tenascin-R by protein-protein interactions independent of carbohydrate moiety."
Aspberg A., Miura R., Bourdoulous S., Shimonaka M., Heinegard D., Schachner M., Ruoslahti E., Yamaguchi Y.
Proc. Natl. Acad. Sci. U.S.A. 94:10116-10121(1997) [PubMed: 9294172] [Abstract]
Cited for: INTERACTION WITH TNR.
[4]"BEHAB, a new member of the proteoglycan tandem repeat family of hyaluronan-binding proteins that is restricted to the brain."
Jaworski D.M., Kelly G.M., Hockfield S.
J. Cell Biol. 125:495-509(1994) [PubMed: 7512973] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1-423.
Strain: Sprague-Dawley.
Tissue: Brain.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
X79881 mRNA. Translation: CAA56255.1.
X86406 mRNA. Translation: CAA60160.1.
U37142 mRNA. Translation: AAA87847.1.
Z28366 mRNA. Translation: CAA82215.1. Frameshift.
IPIIPI00213551.
IPI00326616.
PIRA53908.
S49126.
RefSeqNP_001028837.1. NM_001033665.1.
NP_037048.2. NM_012916.2.
UniGeneRn.10315.
Rn.168035.

3D structure databases

ProteinModelPortalP55068.
SMRP55068. Positions 664-787.
ModBaseSearch...

Protein-protein interaction databases

STRINGP55068.

Proteomic databases

PRIDEP55068.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID25393.
KEGGrno:25393.
UCSCNM_001033665. rat.

Organism-specific databases

CTD63827.
RGD2194. Bcan.

Phylogenomic databases

eggNOGroNOG05437.
GeneTreeENSGT00550000074236.
HOVERGENHBG008175.
InParanoidP55068.
OrthoDBEOG4XGZZW.

Gene expression databases

ArrayExpressP55068.
GenevestigatorP55068.
GermOnlineENSRNOG00000018798. Rattus norvegicus.

Family and domain databases

InterProIPR002353. AntifreezeII.
IPR001304. C-type_lectin.
IPR016186. C-type_lectin-like.
IPR018378. C-type_lectin_CS.
IPR016187. C-type_lectin_fold.
IPR016060. Complement_control_module.
IPR006209. EGF.
IPR006210. EGF-like.
IPR013032. EGF-like_reg_CS.
IPR000742. EGF_3.
IPR007110. Ig-like.
IPR013783. Ig-like_fold.
IPR003006. Ig/MHC_CS.
IPR013106. Ig_V-set.
IPR003596. Ig_V-set_subgr.
IPR000538. Link.
IPR000436. Sushi_SCR_CCP.
[Graphical view]
Gene3DG3DSA:3.10.100.10. C-type_lectin-like. 3 hits.
G3DSA:2.10.70.10. Complement_control_module. 1 hit.
G3DSA:2.60.40.10. Ig-like_fold. 1 hit.
KOK06795.
PfamPF00008. EGF. 1 hit.
PF00059. Lectin_C. 1 hit.
PF00084. Sushi. 1 hit.
PF07686. V-set. 1 hit.
PF00193. Xlink. 2 hits.
[Graphical view]
PRINTSPR00356. ANTIFREEZEII.
PR01265. LINKMODULE.
SMARTSM00032. CCP. 1 hit.
SM00034. CLECT. 1 hit.
SM00181. EGF. 1 hit.
SM00406. IGv. 1 hit.
SM00445. LINK. 2 hits.
[Graphical view]
SUPFAMSSF56436. C-type_lectin_fold. 3 hits.
SSF57535. Complement_control_module. 1 hit.
PROSITEPS00615. C_TYPE_LECTIN_1. 1 hit.
PS50041. C_TYPE_LECTIN_2. 1 hit.
PS00022. EGF_1. 1 hit.
PS01186. EGF_2. 1 hit.
PS50026. EGF_3. 1 hit.
PS50835. IG_LIKE. 1 hit.
PS00290. IG_MHC. 1 hit.
PS01241. LINK_1. 2 hits.
PS50963. LINK_2. 2 hits.
PS50923. SUSHI. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio606469.
PMAP-CutDBP55068.

Entry information

Entry namePGCB_RAT
AccessionPrimary (citable) accession number: P55068
Secondary accession number(s): Q62860, Q63040, Q63513
Entry history
Integrated into UniProtKB/Swiss-Prot: October 1, 1996
Last sequence update: November 1, 1997
Last modified: December 14, 2011
This is version 109 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families