Q9CS84 (NRX1A_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified November 16, 2011. Version 107.

Names and origin

Protein names
   Recommended name: Neurexin-1-alpha
   Alternative name(s): Neurexin I-alpha
Gene names
   Name: Nrxn1
   Synonyms: Kiaa0578
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus

Protein attributes

Sequence length: 1514 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Neuronal cell surface protein that may be involved in cell recognition and cell adhesion. May mediate intracellular signaling.

Subunit structure

The cytoplasmic C-terminal region binds to CASK, CASKIN1 and APBA1. The laminin G-like domain 2 binds to NXPH1 (By similarity). Interacts with SYT13 and SYTL1 (Ref.7, Ref.8).

Subcellular location

Membrane; Single-pass type I membrane protein (Potential).

Sequence similarities

Belongs to the neurexin family.

Contains 3 EGF-like domains.

Contains 6 laminin G-like domains.

Sequence caution

The sequence BAC41433.2 differs from that shown. Reason: Erroneous initiation.

Ontologies

Keywords
   Biological process: Cell adhesion
   Cellular component: Membrane
   Coding sequence diversity: Alternative splicing
   Domain: EGF-like domain, Repeat, Signal, Transmembrane, Transmembrane helix
   Ligand: Calcium, Metal-binding
   PTM: Disulfide bond, Glycoprotein
   Technical term: 3D-structure, Complete proteome, Reference proteome
Gene Ontology (GO)
   Biological process
      adult behavior: Inferred from mutant phenotype. Source: BHF-UCL
      cell adhesion: Inferred from electronic annotation. Source: UniProtKB-KW
      gephyrin clustering: Inferred from direct assay. Source: BHF-UCL
      learning: Inferred from mutant phenotype. Source: BHF-UCL
      neuroligin clustering: Inferred from direct assay. Source: BHF-UCL
      neuromuscular process controlling balance: Inferred from mutant phenotype. Source: BHF-UCL
      neurotransmitter secretion: Traceable author statement. Source: MGI
      positive regulation of excitatory postsynaptic membrane potential: Inferred from mutant phenotype. Source: BHF-UCL
      positive regulation of synapse maturation: Inferred from direct assay. Source: MGI
      positive regulation of synaptic transmission, glutamatergic: Inferred from mutant phenotype. Source: BHF-UCL
      postsynaptic density protein 95 clustering: Inferred from direct assay. Source: BHF-UCL
      postsynaptic membrane assembly: Inferred from direct assay. Source: BHF-UCL
      prepulse inhibition: Inferred from mutant phenotype. Source: BHF-UCL
      regulation of grooming behavior: Inferred from mutant phenotype. Source: BHF-UCL

   Cellular component
      cell surface: Inferred from direct assay. Source: BHF-UCL
      integral to membrane: Inferred from electronic annotation. Source: UniProtKB-KW
      membrane fraction: Traceable author statement. Source: MGI
      plasma membrane: Inferred by curator. Source: BHF-UCL
      presynaptic membrane: Inferred from direct assay. Source: MGI

   Molecular function
      acetylcholine receptor binding: Inferred from direct assay. Source: MGI
      cell adhesion molecule binding: Inferred from physical interaction. Source: BHF-UCL
      metal ion binding: Inferred from electronic annotation. Source: UniProtKB-KW
      neuroligin family protein binding: Inferred from physical interaction. Source: BHF-UCL

Alternative products

This entry describes 5 isoforms produced by alternative splicing.

Note: Additional isoforms seem to exist.
Isoform 1a (identifier: Q9CS84-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2a (identifier: Q9CS84-2)

Also known as: Alpha-2B;

The sequence of this isoform differs from the canonical sequence as follows:
     387-393: Missing.
Isoform 3a (identifier: Q9CS84-3)

Also known as: Alpha-2C;

The sequence of this isoform differs from the canonical sequence as follows:
     379-393: Missing.
Isoform 4a (identifier: Q9CS84-4)

The sequence of this isoform differs from the canonical sequence as follows:
     1-320: Missing.
     379-393: Missing.
     1410-1412: Missing.
Note: No experimental confirmation available.
Isoform 1b (identifier: P0DI97-1)

The sequence of this isoform can be found in the external entry P0DI97.
Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.
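
The "Missing" ranges listed for each isoform can be turned into isoform lengths by subtracting the deleted residues from the canonical length. The following is a minimal sketch of that arithmetic, assuming the ranges are 1-based, inclusive, and non-overlapping within one isoform (true for the ranges in this entry). The names `MISSING_RANGES` and `isoform_length` are illustrative only and are not part of any UniProt tool.

```python
# Canonical length and "Missing" ranges copied from this entry.
CANONICAL_LENGTH = 1514

MISSING_RANGES = {
    "2a": [(387, 393)],
    "3a": [(379, 393)],
    "4a": [(1, 320), (379, 393), (1410, 1412)],
}


def isoform_length(canonical_length, missing):
    """Subtract each 1-based inclusive deleted range from the canonical length.

    Assumes the ranges of a single isoform do not overlap each other.
    """
    removed = sum(end - start + 1 for start, end in missing)
    return canonical_length - removed


isoform_lengths = {
    name: isoform_length(CANONICAL_LENGTH, ranges)
    for name, ranges in MISSING_RANGES.items()
}

for name, length in isoform_lengths.items():
    print(f"Isoform {name}: {length} AA")
```

The results (1507, 1499 and 1176 residues) agree with the isoform lengths given in the Sequences section of this entry.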

Sequence annotation (Features)

Feature key          Position(s)    Length  Description                          Feature identifier

Molecule processing

Signal peptide       1 – 30         30      (Potential)
Chain                31 – 1514      1484    Neurexin-1-alpha                     PRO_0000043164

Regions

Topological domain   31 – 1438      1408    Extracellular (Potential)
Transmembrane        1439 – 1459    21      Helical (Potential)
Topological domain   1460 – 1514    55      Cytoplasmic (Potential)
Domain               31 – 217       187     Laminin G-like 1
Domain               219 – 256      38      EGF-like 1
Domain               283 – 480      198     Laminin G-like 2
Domain               487 – 679      193     Laminin G-like 3
Domain               683 – 720      38      EGF-like 2
Domain               725 – 898      174     Laminin G-like 4
Domain               912 – 1087     176     Laminin G-like 5
Domain               1090 – 1127    38      EGF-like 3
Domain               1133 – 1331    199     Laminin G-like 6
Compositional bias   13 – 21        9       Poly-Leu
Compositional bias   1361 – 1364    4       Poly-Thr
Compositional bias   1446 – 1449    4       Poly-Ala
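
Feature positions in the Regions listing are 1-based and inclusive, so each listed length should equal end minus start plus one. The sketch below re-checks that for every region and counts the domain types, which can be compared with the "Contains 3 EGF-like domains" and "Contains 6 laminin G-like domains" statements in the general annotation. The tuple layout and the helper name `inclusive_length` are assumptions made for this illustration.

```python
from collections import Counter

# (feature key, start, end, listed length, description), copied from the Regions listing.
REGIONS = [
    ("Topological domain", 31, 1438, 1408, "Extracellular"),
    ("Transmembrane", 1439, 1459, 21, "Helical"),
    ("Topological domain", 1460, 1514, 55, "Cytoplasmic"),
    ("Domain", 31, 217, 187, "Laminin G-like 1"),
    ("Domain", 219, 256, 38, "EGF-like 1"),
    ("Domain", 283, 480, 198, "Laminin G-like 2"),
    ("Domain", 487, 679, 193, "Laminin G-like 3"),
    ("Domain", 683, 720, 38, "EGF-like 2"),
    ("Domain", 725, 898, 174, "Laminin G-like 4"),
    ("Domain", 912, 1087, 176, "Laminin G-like 5"),
    ("Domain", 1090, 1127, 38, "EGF-like 3"),
    ("Domain", 1133, 1331, 199, "Laminin G-like 6"),
    ("Compositional bias", 13, 21, 9, "Poly-Leu"),
    ("Compositional bias", 1361, 1364, 4, "Poly-Thr"),
    ("Compositional bias", 1446, 1449, 4, "Poly-Ala"),
]


def inclusive_length(start, end):
    """Length of a 1-based inclusive residue range."""
    return end - start + 1


# Rows whose listed length disagrees with their coordinates.
mismatches = [row for row in REGIONS if inclusive_length(row[1], row[2]) != row[3]]

# Strip the trailing ordinal ("Laminin G-like 4" -> "Laminin G-like") and count.
domain_counts = Counter(
    description.rsplit(" ", 1)[0]
    for key, _, _, _, description in REGIONS
    if key == "Domain"
)

print("Length mismatches:", mismatches)
print("Domain counts:", dict(domain_counts))
```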

Sites

Metal binding        329            1       Calcium (By similarity)
Metal binding        346            1       Calcium; via carbonyl oxygen (By similarity)
Metal binding        414            1       Calcium; via carbonyl oxygen (By similarity)

Amino acid modifications

Glycosylation        125            1       N-linked (GlcNAc...) (Potential)
Glycosylation        190            1       N-linked (GlcNAc...) (Potential)
Glycosylation        797            1       N-linked (GlcNAc...) (Potential)
Glycosylation        1230           1       N-linked (GlcNAc...) (Potential)
Disulfide bond       228 ↔ 243              (By similarity)
Disulfide bond       245 ↔ 255              (By similarity)
Disulfide bond       444 ↔ 480              (By similarity)
Disulfide bond       650 ↔ 679              (By similarity)
Disulfide bond       687 ↔ 698              (By similarity)
Disulfide bond       692 ↔ 707              (By similarity)
Disulfide bond       709 ↔ 719              (By similarity)
Disulfide bond       1059 ↔ 1087            (By similarity)
Disulfide bond       1094 ↔ 1105            (By similarity)
Disulfide bond       1099 ↔ 1114            (By similarity)
Disulfide bond       1116 ↔ 1126            (By similarity)
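
The four glycosylation sites above are marked "Potential", meaning they are predicted rather than experimentally shown. A common basis for such predictions is the N-linked glycosylation consensus motif N-X-S/T, where X is any residue except proline. The sketch below scans four short fragments of the canonical sequence for that motif. It is only an illustration: the fragment boundaries were chosen by hand, the function name `find_sequons` is hypothetical, and this is not a reproduction of the annotation procedure used for this entry.

```python
# (1-based position of the first residue, residues), copied from the canonical sequence.
FRAGMENTS = [
    (121, "RQFRNTTLYIDRAEAKWVEV"),
    (181, "KGWIRDVRVNSSQALPVDGG"),
    (781, "GRVKLTVNLDCIRINCNSSK"),
    (1221, "VVRFTRSGGNATLQVDSWPV"),
]


def find_sequons(first_position, residues):
    """Return 1-based positions of Asn residues in an N-X-S/T motif (X is not Pro)."""
    hits = []
    for i in range(len(residues) - 2):
        if (
            residues[i] == "N"
            and residues[i + 1] != "P"
            and residues[i + 2] in ("S", "T")
        ):
            hits.append(first_position + i)
    return hits


sequon_positions = [
    position
    for first_position, residues in FRAGMENTS
    for position in find_sequons(first_position, residues)
]

print(sequon_positions)  # [125, 190, 797, 1230]
```

Each fragment yields exactly the annotated position, so all four listed sites sit on a consensus motif. Asparagines at 788 and 795 in the third fragment are not followed by S or T two residues later and are correctly skipped.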

Natural variations

Alternative sequence 1 – 320        320     Missing in isoform 4a.               VSP_016400
Alternative sequence 379 – 393      15      Missing in isoform 3a and isoform 4a. VSP_003485
Alternative sequence 387 – 393      7       Missing in isoform 2a.               VSP_003484
Alternative sequence 1410 – 1412    3       Missing in isoform 4a.               VSP_016401

Secondary structure

[Graphical view not reproduced: helix, strand and turn elements plotted along the 1514-residue sequence.]

Sequences

Isoform 1a

Last modified December 6, 2005. Version 3.
Checksum: 412281FE441F0EFC
Length: 1,514 AA. Mass: 166,169 Da.
        10         20         30         40         50         60 
MGTALVQRGG CCLLCLSLLL LGCWAELGSG LEFPGAEGQW TRFPKWNACC ESEMSFQLKT 

        70         80         90        100        110        120 
RSARGLVLYF DDEGFCDFLE LILTRGGRLQ LSFSIFCAEP ATLLADTPVN DGAWHSVRIR 

       130        140        150        160        170        180 
RQFRNTTLYI DRAEAKWVEV KSKRRDMTVF SGLFVGGLPP ELRAAALKLT LASVREREPF 

       190        200        210        220        230        240 
KGWIRDVRVN SSQALPVDGG EVKLDDEPPN SGGGSPCEAG EEGEGGVCLN GGVCSVVDDQ 

       250        260        270        280        290        300 
AVCDCSRTGF RGKDCSQEDN NVEGLAHLMM GDQGKSKGKE EYIATFKGSE YFCYDLSQNP 

       310        320        330        340        350        360 
IQSSSDEITL SFKTLQRNGL MLHTGKSADY VNLALKNGAV SLVINLGSGA FEALVEPVNG 

       370        380        390        400        410        420 
KFNDNAWHDV KVTRNLRQHS GIGHAMVNKL HCSVTISVDG ILTTTGYTQE DYTMLGSDDF 

       430        440        450        460        470        480 
FYVGGSPSTA DLPGSPVSNN FMGCLKEVVY KNNDVRLELS RLAKQGDPKM KIHGVVAFKC 

       490        500        510        520        530        540 
ENVATLDPIT FETPESFISL PKWNAKKTGS ISFDFRTTEP NGLILFSHGK PRHQKDAKHP 

       550        560        570        580        590        600 
QMIKVDFFAI EMLDGHLYLL LDMGSGTIKI KALQKKVNDG EWYHVDFQRD GRSGTISVNT 

       610        620        630        640        650        660 
LRTPYTAPGE SEILDLDDEL YLGGLPENKA GLVFPTEVWT ALLNYGYVGC IRDLFIDGQS 

       670        680        690        700        710        720 
KDIRQMAEIQ STAGVKPSCS KETAKPCLSN PCKNNGMCRD GWNRYVCDCS GTGYLGRSCE 

       730        740        750        760        770        780 
REATVLSYDG SMFMKIQLPV VMHTEAEDVS LRFRSQRAYG ILMATTSRDS ADTLRLELDA 

       790        800        810        820        830        840 
GRVKLTVNLD CIRINCNSSK GPETLFAGYN LNDNEWHTVR VVRRGKSLKL TVDDQQAMTG 

       850        860        870        880        890        900 
QMAGDHTRLE FHNIETGIIT ERRYLSSVPS NFIGHLQSLT FNGMAYIDLC KNGDIDYCEL 

       910        920        930        940        950        960 
NARFGFRNII ADPVTFKTKS SYVALATLQA YTSMHLFFQF KTTSLDGLIL YNSGDGNDFI 

       970        980        990       1000       1010       1020 
VVELVKGYLH YVFDLGNGAN LIKGSSNKPL NDNQWHNVMI SRDTSNLHTV KIDTKITTQI 

      1030       1040       1050       1060       1070       1080 
TAGARNLDLK SDLYIGGVAK ETYKSLPKLV HAKEGFQGCL ASVDLNGRLP DLISDALFCN 

      1090       1100       1110       1120       1130       1140 
GQIERGCEGP STTCQEDSCS NQGVCLQQWD GFSCDCSMTS FSGPLCNDPG TTYIFSKGGG 

      1150       1160       1170       1180       1190       1200 
QITYKWPPND RPSTRADRLA IGFSTVQKEA VLVRVDSSSG LGDYLELHIH QGKIGVKFNV 

      1210       1220       1230       1240       1250       1260 
GTDDIAIEES NAIINDGKYH VVRFTRSGGN ATLQVDSWPV IERYPAGNND NERLAIARQR 

      1270       1280       1290       1300       1310       1320 
IPYRLGRVVD EWLLDKGRQL TIFNSQATII IGGKEQGQPF QGQLSGLYYN GLKVLNMAAE 

      1330       1340       1350       1360       1370       1380 
NDANIAIVGN VRLVGEVPSS MTTESTATAM QSEMSTSIME TTTTLATSTA RRGKPPTKEP 

      1390       1400       1410       1420       1430       1440 
ISQTTDDILV ASAECPSDDE DIDPCEPSSG GLANPTRVGG REPYPGSAEV IRESSSTTGM 

      1450       1460       1470       1480       1490       1500 
VVGIVAAAAL CILILLYAMY KYRNRDEGSY HVDESRNYIS NSAQSNGAVV KEKQPSSAKS 

      1510 
ANKNKKNKDK EYYV 
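
The sequence above is laid out in rows of 60 residues, split into groups of ten, with a ruler line of position numbers above each row. A minimal sketch for turning that layout back into one contiguous string is shown below, using only the first two rows as sample input. The function name `parse_numbered_block` is an assumption for illustration. Because feature coordinates in this entry are 1-based and inclusive, a feature spanning start to end maps to the Python slice `[start - 1:end]`.

```python
# First two rows of the canonical sequence, in the layout used above.
SEQUENCE_BLOCK = """
        10         20         30         40         50         60
MGTALVQRGG CCLLCLSLLL LGCWAELGSG LEFPGAEGQW TRFPKWNACC ESEMSFQLKT

        70         80         90        100        110        120
RSARGLVLYF DDEGFCDFLE LILTRGGRLQ LSFSIFCAEP ATLLADTPVN DGAWHSVRIR
"""


def parse_numbered_block(text):
    """Join residue rows into one string, skipping ruler and blank lines."""
    rows = []
    for line in text.splitlines():
        compact = "".join(line.split())  # drop the spaces between groups of ten
        if compact.isalpha():  # ruler lines are all digits; blank lines are empty
            rows.append(compact)
    return "".join(rows)


sequence = parse_numbered_block(SEQUENCE_BLOCK)

# 1-based inclusive feature coordinates map to the slice [start - 1:end].
signal_peptide = sequence[1 - 1:30]   # Signal peptide, 1 to 30
poly_leu = sequence[13 - 1:21]        # Compositional bias, 13 to 21

print(len(sequence))
print(signal_peptide)
print(poly_leu)
```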

Isoform 2a (Alpha-2B)

Checksum: B3372630D1BD0606
Length: 1,507 AA. Mass: 165,387 Da.

Isoform 3a (Alpha-2C)

Checksum: 5574E00C647EAD9E
Length: 1,499 AA. Mass: 164,596 Da.

Isoform 4a

Checksum: 9AA6E9F48CF994CE
Length: 1,176 AA. Mass: 129,340 Da.

Isoform 1b

See P0DI97.

References

[1] "Prediction of the coding sequences of mouse homologues of KIAA gene: I. The complete nucleotide sequences of 100 mouse KIAA-homologous cDNAs identified by screening of terminal sequences of cDNA clones randomly sampled from size-fractionated libraries."
Okazaki N., Kikuno R., Ohara R., Inamoto S., Hara Y., Nagase T., Ohara O., Koga H.
DNA Res. 9:179-188(2002) [PubMed: 12465718]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2A).
Tissue: Brain.
[2] Okazaki N., Kikuno R., Nagase T., Ohara O., Koga H.
Submitted (FEB-2003) to the EMBL/GenBank/DDBJ databases
Cited for: SEQUENCE REVISION.
[3] "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4A).
Tissue: Eye.
[4] "Sequencing of the neurexin genes."
Graveley B.R., Philipps D.L.
Submitted (MAY-2001) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 298-437 (ISOFORMS 1A; 2A AND 3A/4A).
Strain: CD-1.
Tissue: Brain.
[5] "The transcriptional landscape of the mammalian genome."
Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
Science 309:1559-1563(2005) [PubMed: 16141072]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1126-1514 (ISOFORMS 1A/2A/3A).
Strain: C57BL/6J.
Tissue: Embryo.
[6] "Differential seizure-induced and developmental changes of neurexin expression."
Gorecki D.C., Szklarczyk A., Lukasiuk K., Kaczmarek L., Simons J.P.
Mol. Cell. Neurosci. 13:218-227(1999) [PubMed: 10408888]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1463-1505 (ISOFORMS 1A/2A/3A/4A).
Strain: C57BL/10.
Tissue: Brain.
[7] "Synaptotagmin-like protein 1-3: a novel family of C-terminal-type tandem C2 proteins."
Fukuda M., Mikoshiba K.
Biochem. Biophys. Res. Commun. 281:1226-1233(2001) [PubMed: 11243866]
Cited for: INTERACTION WITH SYTL1.
[8] "Characterization of KIAA1427 protein as an atypical synaptotagmin (Syt XIII)."
Fukuda M., Mikoshiba K.
Biochem. J. 354:249-257(2001) [PubMed: 11171101]
Cited for: INTERACTION WITH SYT13.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
   AB093249 mRNA. Translation: BAC41433.2. Different initiation.
   BC047146 mRNA. Translation: AAH47146.1.
   AF387674 Genomic DNA. Translation: AAK70469.1.
   AF387674 Genomic DNA. Translation: AAK70470.1.
   AF387674 Genomic DNA. Translation: AAK70471.1.
   AK017578 mRNA. Translation: BAB30815.1.
   AJ006802 mRNA. Translation: CAA07257.1.
IPI:
   IPI00230050.
   IPI00230051.
   IPI00468539.
   IPI00970455.
RefSeq:
   NP_064648.3. NM_020252.3.
   NP_796258.2. NM_177284.2.
UniGene: Mm.312068.

3D structure databases

PDBe / RCSB PDB / PDBj:
   Entry   Method   Resolution (Å)   Chain   Positions
   3BOD    X-ray    1.70             A       1132-1334
   3MW2    X-ray    2.69             A/B     1132-1334
ProteinModelPortal: Q9CS84.
SMR: Q9CS84. Positions 221-258, 279-475, 489-906, 911-1335.
ModBase: Search...

Protein-protein interaction databases

IntAct: Q9CS84. 5 interactions.
STRING: Q9CS84.

PTM databases

PhosphoSite: Q9CS84.

Proteomic databases

PRIDE: Q9CS84.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

Ensembl:
   ENSMUST00000160844; ENSMUSP00000125407; ENSMUSG00000024109.
   ENSMUST00000161402; ENSMUSP00000124116; ENSMUSG00000024109.
GeneID: 18189.
KEGG: mmu:18189.
UCSC:
   uc008dvz.2. mouse.
   uc008dwa.2. mouse.
   uc012ayq.1. mouse.

Organism-specific databases

CTD: 9378.
MGI: MGI:1096391. Nrxn1.
Rouge: Search...

Phylogenomic databases

eggNOG: roNOG14987.
GeneTree: ENSGT00560000076996.
HOGENOM: HBG358378.
HOVERGEN: HBG052670.
InParanoid: Q9CS84.
OMA: MGDQGKS.
OrthoDB: EOG41G339.

Gene expression databases

ArrayExpress: Q9CS84.
Bgee: Q9CS84.
CleanEx: MM_NRXN1.
Genevestigator: Q9CS84.
GermOnline: ENSMUSG00000024109. Mus musculus.

Family and domain databases

InterPro:
   IPR008985. ConA-like_lec_gl.
   IPR013320. ConA-like_subgrp.
   IPR006210. EGF-like.
   IPR000152. EGF-type_Asp/Asn_hydroxyl_site.
   IPR000742. EGF_3.
   IPR001791. Laminin_G.
   IPR012680. Laminin_G_2.
   IPR003585. Neurexin-like.
Gene3D: G3DSA:2.60.120.200. ConA_like_subgrp. 6 hits.
KO: K07377.
Pfam: PF02210. Laminin_G_2. 6 hits.
SMART:
   SM00294. 4.1m. 1 hit.
   SM00181. EGF. 3 hits.
   SM00282. LamG. 6 hits.
SUPFAM: SSF49899. ConA_like_lec_gl. 6 hits.
PROSITE:
   PS00010. ASX_HYDROXYL. 1 hit.
   PS00022. EGF_1. False negative.
   PS01186. EGF_2. False negative.
   PS50026. EGF_3. 3 hits.
   PS50025. LAM_G_DOMAIN. 6 hits.
ProtoNet: Search...

Other

NextBio: 293528.
SOURCE: Search...

Entry information

Entry name: NRX1A_MOUSE
Accession:
   Primary (citable) accession number: Q9CS84
   Secondary accession number(s): O88722, Q80Y87, Q8CHE6
Entry history:
   Integrated into UniProtKB/Swiss-Prot: November 16, 2001
   Last sequence update: December 6, 2005
   Last modified: November 16, 2011
   This is version 107 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families