P55066 (NCAN_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 134.

Names and origin

Protein names
Recommended name: Neurocan core protein
Alternative name(s): Chondroitin sulfate proteoglycan 3
Gene names
Name: Ncan
Synonyms: Cspg3
Organism: Mus musculus (Mouse) [Reference proteome]
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus

Protein attributes

Sequence length: 1268 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level

General annotation (Comments)

Function

May modulate neuronal adhesion and neurite growth during development by binding to neural cell adhesion molecules (NG-CAM and N-CAM). Chondroitin sulfate proteoglycan; binds to hyaluronic acid.

Subcellular location

Secreted (by similarity).

Tissue specificity

Brain.

Sequence similarities

Belongs to the aggrecan/versican proteoglycan family.

Contains 1 C-type lectin domain.

Contains 2 EGF-like domains.

Contains 1 Ig-like V-type (immunoglobulin-like) domain.

Contains 2 Link domains.

Contains 1 Sushi (CCP/SCR) domain.

Sequence annotation (Features)

Feature key         Position(s)     Length  Description             Feature identifier

Molecule processing

Signal peptide      1 – 22          22      Potential
Chain               23 – 1268       1246    Neurocan core protein   PRO_0000017517

Regions

Domain              37 – 157        121     Ig-like V-type
Domain              159 – 254       96      Link 1
Domain              258 – 356       99      Link 2
Domain              960 – 996       37      EGF-like 1
Domain              998 – 1034      37      EGF-like 2; calcium-binding (Potential)
Domain              1036 – 1165     130     C-type lectin
Domain              1165 – 1225     61      Sushi
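Each listed length follows from the positions as end minus start plus one, because the ranges are 1-based and inclusive. The following is a minimal Python sketch of that consistency check, assuming the coordinates are transcribed by hand from the table above. The names `REGIONS`, `span_length`, and `mismatches` are illustrative and are not part of any UniProt tooling.

```python
# Region features copied from the table above:
# (description, start, end, listed length).
REGIONS = [
    ("Ig-like V-type", 37, 157, 121),
    ("Link 1", 159, 254, 96),
    ("Link 2", 258, 356, 99),
    ("EGF-like 1", 960, 996, 37),
    ("EGF-like 2", 998, 1034, 37),
    ("C-type lectin", 1036, 1165, 130),
    ("Sushi", 1165, 1225, 61),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def mismatches(regions):
    """Names of regions whose listed length disagrees with their positions."""
    return [
        name
        for name, start, end, listed in regions
        if span_length(start, end) != listed
    ]


if __name__ == "__main__":
    print(mismatches(REGIONS))  # prints []
```

The same helper reproduces the molecule-processing figures: `span_length(23, 1268)` gives the 1246-residue chain that remains after the 22-residue signal peptide.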

Amino acid modifications

Glycosylation       121             1       N-linked (GlcNAc...) (Potential)
Glycosylation       339             1       N-linked (GlcNAc...) (Potential)
Glycosylation       742             1       N-linked (GlcNAc...) (Potential)
Glycosylation       978             1       N-linked (GlcNAc...) (Potential)
Glycosylation       1175            1       N-linked (GlcNAc...) (Potential)
Disulfide bond      58 ↔ 139                By similarity
Disulfide bond      181 ↔ 252               By similarity
Disulfide bond      205 ↔ 226               By similarity
Disulfide bond      279 ↔ 354               By similarity
Disulfide bond      303 ↔ 324               By similarity
Disulfide bond      964 ↔ 975               By similarity
Disulfide bond      969 ↔ 984               By similarity
Disulfide bond      986 ↔ 995               By similarity
Disulfide bond      1002 ↔ 1013             By similarity
Disulfide bond      1007 ↔ 1022             By similarity
Disulfide bond      1024 ↔ 1033             By similarity
Disulfide bond      1040 ↔ 1051             By similarity
Disulfide bond      1068 ↔ 1160             By similarity
Disulfide bond      1136 ↔ 1152             By similarity
Disulfide bond      1167 ↔ 1210             By similarity
Disulfide bond      1196 ↔ 1223             By similarity
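The annotated positions can be checked against the displayed sequence. The sketch below assumes the common N-X-S/T sequon rule for potential N-linked glycosylation (X being any residue except proline); the entry itself does not state which rule was used. It also confirms that the disulfide positions falling inside the copied rows are cysteines. The rows are copied from the sequence shown in this entry, and `ROWS`, `residue_at`, and `is_sequon` are illustrative names.

```python
# Rows of the displayed sequence that contain the annotated sites,
# keyed by the 1-based position of the first residue in each row.
ROWS = {
    121: "NATLLLGPLR ASDSGLYRCQ VVKGIEDEQD LVTLEVTGVV FHYRAARDRY ALTFAEAQEA",
    301: "DQCDPGWLAD GSVRYPIQTP RRRCGGPAPG VRTVYRFANR TGFPAPGARF DAYCFRAHHH",
    721: "TSSMRATKHP ISGPWASLDS SNVTMNPVPS DAGILGTESG VLDLPGSPTS GGQATVEKVL",
    961: "TDPCENNPCL HGGTCHTNGT VYGCSCDQGY AGENCEIDID DCLCSPCENG GTCIDEVNGF",
    1141: "AHESGRWNDV PCNYNLPYVC KKGTVLCGPP PAVENASLVG VRKIKYNVHA TVRYQCDEGF",
}

GLYCOSYLATION_SITES = [121, 339, 742, 978, 1175]

# Disulfide bonds whose two cysteines both fall inside the row starting at 961.
DISULFIDES_IN_ROWS = [(964, 975), (969, 984), (986, 995), (1002, 1013)]


def residue_at(pos: int) -> str:
    """Residue at a 1-based position, looked up in the copied rows."""
    for start, row in ROWS.items():
        seq = row.replace(" ", "")
        if start <= pos < start + len(seq):
            return seq[pos - start]
    raise KeyError(f"position {pos} is not covered by the copied rows")


def is_sequon(pos: int) -> bool:
    """True if pos starts an N-X-S/T motif with X not proline (assumed rule)."""
    return (
        residue_at(pos) == "N"
        and residue_at(pos + 1) != "P"
        and residue_at(pos + 2) in ("S", "T")
    )


if __name__ == "__main__":
    for site in GLYCOSYLATION_SITES:
        motif = "".join(residue_at(site + i) for i in range(3))
        print(site, motif, is_sequon(site))
    for a, b in DISULFIDES_IN_ROWS:
        print(a, b, residue_at(a) == "C" and residue_at(b) == "C")
```

Only five of the sequence rows are copied here to keep the example short. Checking the remaining disulfide bonds would need the rows that contain their positions.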

Experimental info

Sequence conflict   582             1       E → D in AAH65118. Ref.2
Sequence conflict   587             1       P → A in AAH65118. Ref.2
Sequence conflict   936             1       V → D in AAH65118. Ref.2
Sequence conflict   947             1       E → K in AAH65118. Ref.2
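Each conflict is a single-residue substitution relative to the displayed sequence. As a rough illustration, the Python sketch below applies the four substitutions to the two sequence rows that contain them, first verifying that the displayed residue matches the one named in the table. The result only illustrates the substitutions and is not claimed to be the actual AAH65118 translation. `ROWS`, `CONFLICTS`, and `apply_conflicts` are hypothetical names.

```python
# The two displayed rows that contain the conflict positions,
# keyed by the 1-based position of their first residue.
ROWS = {
    541: "SPWAILTNEV DEPGAGSLGS RSLPESLMWS PSLISPSVPS TESTPSPKPG AAEAPSVKSA",
    901: "VVPMDEVASV SSGEPTGLWD IPSTLIPVSL GLDESVLNVV AESPSVEGFW EEVASGQEDP",
}

# (position, residue in this entry, residue reported in AAH65118)
CONFLICTS = [(582, "E", "D"), (587, "P", "A"), (936, "V", "D"), (947, "E", "K")]


def apply_conflicts(rows, conflicts):
    """Return the rows (spaces removed) with each substitution applied.

    Raises ValueError if the displayed residue differs from the expected one,
    and KeyError if a position is not covered by the supplied rows.
    """
    edited = {start: list(row.replace(" ", "")) for start, row in rows.items()}
    for pos, ref, alt in conflicts:
        for start, residues in edited.items():
            if start <= pos < start + len(residues):
                found = residues[pos - start]
                if found != ref:
                    raise ValueError(f"expected {ref} at {pos}, found {found}")
                residues[pos - start] = alt
                break
        else:
            raise KeyError(f"position {pos} is not covered by the supplied rows")
    return {start: "".join(residues) for start, residues in edited.items()}


if __name__ == "__main__":
    variant = apply_conflicts(ROWS, CONFLICTS)
    for start, seq in variant.items():
        print(start, seq)
```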

Sequences

P55066 [UniParc].
Length: 1,268 AA. Mass: 137,200 Da.
Last modified October 1, 1996. Version 1.
Checksum: 3014E8E202A2FAEC
        10         20         30         40         50         60 
MGAGSVWASG LLLLWLLLLV AGDQDTQDTT ATEKGLRMLK SGSGPVRAAL AELVALPCFF 

        70         80         90        100        110        120 
TLQPRLSSLR DIPRIKWTKV QTASGQRQDL PILVAKDNVV RVAKGWQGRV SLPAYPRHRA 

       130        140        150        160        170        180 
NATLLLGPLR ASDSGLYRCQ VVKGIEDEQD LVTLEVTGVV FHYRAARDRY ALTFAEAQEA 

       190        200        210        220        230        240 
CRLSSATIAA PRHLQAAFED GFDNCDAGWL SDRTVRYPIT QSRPGCYGDR SSLPGVRSYG 

       250        260        270        280        290        300 
RRDPQELYDV YCFARELGGE VFYVGPARRL TLAGARAQCQ RQGAALASVG QLHLAWHEGL 

       310        320        330        340        350        360 
DQCDPGWLAD GSVRYPIQTP RRRCGGPAPG VRTVYRFANR TGFPAPGARF DAYCFRAHHH 

       370        380        390        400        410        420 
TAQHGDSEIP SSGDEGEIVS AEGPPGRELK PSLGEQEVIA PDFQEPLMSS GEGEPPDLTW 

       430        440        450        460        470        480 
TQAPEETLGS TPGGPTLASW PSSEKWLFTG APSSMGVSSP SDMGVDMEAT TPLGTQVAPT 

       490        500        510        520        530        540 
PTMRRGRFKG LNGRHFQQQG PEDQLPEVAE PSAQPPTLGA TANHMRPSAA TEASESDQSH 

       550        560        570        580        590        600 
SPWAILTNEV DEPGAGSLGS RSLPESLMWS PSLISPSVPS TESTPSPKPG AAEAPSVKSA 

       610        620        630        640        650        660 
IPHLPRLPSE PPAPSPGPSE ALSAVSLQAS SADGSPDFPI VAMLRAPKLW LLPRSTLVPN 

       670        680        690        700        710        720 
MTPVPLSPAS PLPSWVPEEQ AVRPVSLGAE DLETPFQTTI AAPVEASHRS PDADSIEIEG 

       730        740        750        760        770        780 
TSSMRATKHP ISGPWASLDS SNVTMNPVPS DAGILGTESG VLDLPGSPTS GGQATVEKVL 

       790        800        810        820        830        840 
ATWLPLPGQG LDPGSQSTPM EAHGVAVSME PTVALEGGAT EGPMEATREV VPSTADATWE 

       850        860        870        880        890        900 
SESRSAISST HIAVTMARAQ GMPTLTSTSS EGHPEPKGQM VAQESLEPLN TLPSHPWSSL 

       910        920        930        940        950        960 
VVPMDEVASV SSGEPTGLWD IPSTLIPVSL GLDESVLNVV AESPSVEGFW EEVASGQEDP 

       970        980        990       1000       1010       1020 
TDPCENNPCL HGGTCHTNGT VYGCSCDQGY AGENCEIDID DCLCSPCENG GTCIDEVNGF 

      1030       1040       1050       1060       1070       1080 
ICLCLPSYGG SLCEKDTEGC DRGWHKFQGH CYRYFAHRRA WEDAERDCRR RAGHLTSVHS 

      1090       1100       1110       1120       1130       1140 
PEEHKFINSF GHENSWIGLN DRTVERDFQW TDNTGLQYEN WREKQPDNFF AGGEDCVVMV 

      1150       1160       1170       1180       1190       1200 
AHESGRWNDV PCNYNLPYVC KKGTVLCGPP PAVENASLVG VRKIKYNVHA TVRYQCDEGF 

      1210       1220       1230       1240       1250       1260 
SQHRVATIRC RNNGKWDRPQ IMCIKPRRSH RMRRHHHHPH RHHKPRKEHR KHKRHPAEDW 


EKDEGDFC 
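The display above interleaves position rulers with blocks of ten residues. To recover a plain sequence string, one possible approach is to drop the ruler lines and the spaces. The sketch below demonstrates this on the first two rows only. `parse_display_sequence` and `EXCERPT` are illustrative names, and the full sequence can be handled the same way.

```python
def parse_display_sequence(text: str) -> str:
    """Join residue blocks into one string, skipping blank and ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines consist only of numbers (10, 20, 30, ...).
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the displayed sequence (residues 1-120).
EXCERPT = """
        10         20         30         40         50         60
MGAGSVWASG LLLLWLLLLV AGDQDTQDTT ATEKGLRMLK SGSGPVRAAL AELVALPCFF

        70         80         90        100        110        120
TLQPRLSSLR DIPRIKWTKV QTASGQRQDL PILVAKDNVV RVAKGWQGRV SLPAYPRHRA
"""

if __name__ == "__main__":
    seq = parse_display_sequence(EXCERPT)
    print(len(seq))   # prints 120
    print(seq[:22])   # signal peptide, positions 1-22
    print(seq[22:])   # start of the mature chain, from position 23
```

Positions in the feature tables are 1-based, so residue `n` is `seq[n - 1]`. For example, `seq[57]` is the cysteine at position 58 listed in the first disulfide bond.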

References

[1] "Structure and chromosomal localization of the mouse neurocan gene."
Rauch U., Grimpe B., Kulbe G., Arnold-Ammer I., Beier D., Faessler R.
Genomics 28:405-410(1995) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
Strain: BALB/c.
Tissue: Brain.
[2] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Strain: C57BL/6.
Tissue: Brain.
[3] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1052-1268.
Strain: C57BL/6J.
Tissue: Cerebellum.

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: X84727 mRNA. Translation: CAA59216.1.
    BC065118 mRNA. Translation: AAH65118.1.
    AK082298 mRNA. Translation: BAC38458.1.
CCDS: CCDS22358.1.
PIR: S52781.
RefSeq: NP_031815.2. NM_007789.3.
UniGene: Mm.268079.

3D structure databases

ProteinModelPortal: P55066.
SMR: P55066. Positions 40-146, 160-253, 269-358, 949-1163.
ModBase: Search...
MobiDB: Search...

Protein-protein interaction databases

IntAct: P55066. 2 interactions.
MINT: MINT-4092285.

PTM databases

PhosphoSite: P55066.

Proteomic databases

PaxDb: P55066.
PRIDE: P55066.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

Ensembl: ENSMUST00000002412; ENSMUSP00000002412; ENSMUSG00000002341.
GeneID: 13004.
KEGG: mmu:13004.
UCSC: uc009lys.2. mouse.

Organism-specific databases

CTD: 1463.
MGI: MGI:104694. Ncan.

Phylogenomic databases

eggNOG: NOG147231.
GeneTree: ENSGT00750000117329.
HOGENOM: HOG000170487.
HOVERGEN: HBG078994.
InParanoid: P55066.
KO: K06794.
OrthoDB: EOG7FFMQR.
PhylomeDB: P55066.
TreeFam: TF332134.

Enzyme and pathway databases

Reactome: REACT_188576. Developmental Biology.

Gene expression databases

Bgee: P55066.
CleanEx: MM_NCAN.
Genevestigator: P55066.

Family and domain databases

Gene3D: 2.60.40.10. 1 hit.
    3.10.100.10. 3 hits.
InterPro: IPR001304. C-type_lectin.
    IPR016186. C-type_lectin-like.
    IPR018378. C-type_lectin_CS.
    IPR016187. C-type_lectin_fold.
    IPR000742. EG-like_dom.
    IPR001881. EGF-like_Ca-bd_dom.
    IPR013032. EGF-like_CS.
    IPR000152. EGF-type_Asp/Asn_hydroxyl_site.
    IPR018097. EGF_Ca-bd_CS.
    IPR007110. Ig-like_dom.
    IPR013783. Ig-like_fold.
    IPR003599. Ig_sub.
    IPR013106. Ig_V-set.
    IPR000538. Link.
    IPR000436. Sushi_SCR_CCP.
Pfam: PF00059. Lectin_C. 1 hit.
    PF00084. Sushi. 1 hit.
    PF07686. V-set. 1 hit.
    PF00193. Xlink. 2 hits.
PRINTS: PR01265. LINKMODULE.
SMART: SM00032. CCP. 1 hit.
    SM00034. CLECT. 1 hit.
    SM00181. EGF. 1 hit.
    SM00179. EGF_CA. 1 hit.
    SM00409. IG. 1 hit.
    SM00445. LINK. 2 hits.
SUPFAM: SSF56436. SSF56436. 3 hits.
    SSF57535. SSF57535. 1 hit.
PROSITE: PS00010. ASX_HYDROXYL. 1 hit.
    PS00615. C_TYPE_LECTIN_1. 1 hit.
    PS50041. C_TYPE_LECTIN_2. 1 hit.
    PS00022. EGF_1. 3 hits.
    PS01186. EGF_2. 1 hit.
    PS50026. EGF_3. 2 hits.
    PS01187. EGF_CA. 1 hit.
    PS50835. IG_LIKE. 1 hit.
    PS01241. LINK_1. 2 hits.
    PS50963. LINK_2. 2 hits.
    PS50923. SUSHI. 1 hit.
ProtoNet: Search...

Other

NextBio: 282828.
PRO: P55066.
SOURCE: Search...

Entry information

Entry name: NCAN_MOUSE
Accession
Primary (citable) accession number: P55066
Secondary accession number(s): Q6P1E3, Q8C4F8
Entry history
Integrated into UniProtKB/Swiss-Prot: October 1, 1996
Last sequence update: October 1, 1996
Last modified: July 9, 2014
This is version 134 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

SIMILARITY comments: Index of protein domains and families

MGD cross-references: Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot