Skip Header

 
Contribute Send feedback

Reviewed, UniProtKB/Swiss-Prot Q92859 (NEO1_HUMAN)

Last modified November 25, 2008. Version 90. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (6) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Neogenin
Gene names
Name: NEO1
Synonyms: NGN
OrganismHomo sapiens (Human)
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length1461 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

May be involved as a regulatory protein in the transition of undifferentiated proliferating cells to their differentiated state. May also function as a cell adhesion molecule in a broad spectrum of embryonic and adult tissues.

Subcellular location

Cell membrane; Single-pass type I membrane protein.

Tissue specificity

Widely expressed and also in cancer cell lines.

Sequence similarities

Belongs to the immunoglobulin superfamily. DCC family.

Contains 6 fibronectin type-III domains.

Contains 4 Ig-like C2-type (immunoglobulin-like) domains.

Ontologies

Keywords

   Biological processCell adhesion
   Cellular componentCell membrane
Membrane
   Coding sequence diversityAlternative splicing
Polymorphism
   DomainImmunoglobulin domain
Repeat
Signal
Transmembrane
   PTMGlycoprotein
   Technical term3D-structure

Gene Ontology (GO)

   Biological processcell adhesion Ref.2

Non-traceable author statement. Source: ProtInc

   Cellular componentintegral to plasma membrane Ref.2

Traceable author statement. Source: ProtInc

   Molecular functionprotein binding

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]

Notes: Additional isoforms seem to exist.
Isoform 1 (identifier: Q92859-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q92859-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1248-1300: Missing.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 3333 Potential
Chain34 – 14611428Neogenin
PRO_0000015043

Regions

Topological domain34 – 11051072Extracellular Potential
Transmembrane1106 – 112621 Potential
Topological domain1127 – 1461335Cytoplasmic Potential
Domain52 – 14190Ig-like C2-type 1
Domain152 – 23887Ig-like C2-type 2
Domain243 – 33694Ig-like C2-type 3
Domain341 – 42686Ig-like C2-type 4
Domain439 – 53294Fibronectin type-III 1
Domain539 – 62789Fibronectin type-III 2
Domain633 – 72896Fibronectin type-III 3
Domain735 – 82793Fibronectin type-III 4
Domain853 – 94997Fibronectin type-III 5
Domain954 – 105198Fibronectin type-III 6
Compositional bias1118 – 11214Poly-Val

Amino acid modifications

Glycosylation731N-linked (GlcNAc...) Potential
Glycosylation2101N-linked (GlcNAc...)
Glycosylation3261N-linked (GlcNAc...) Potential
Glycosylation4701N-linked (GlcNAc...) Potential
Glycosylation4891N-linked (GlcNAc...)
Glycosylation6391N-linked (GlcNAc...) Potential
Glycosylation7151N-linked (GlcNAc...) Potential
Glycosylation9091N-linked (GlcNAc...) Potential
Disulfide bond74 ↔ 129 By similarity
Disulfide bond173 ↔ 221 By similarity
Disulfide bond270 ↔ 320 By similarity
Disulfide bond362 ↔ 410 By similarity

Natural variations

Alternative sequence1248 – 130053Missing in isoform 2.
VSP_002593
Natural variant5341P → L: dbSNP rs4467039.
VAR_027954

Experimental info

Sequence conflict1681N → G in AAB17263. Ref.1

Secondary structure

............................................................................................... 1461
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified October 17, 2006. Version 2.
Checksum: 4AADF1EEBCAFD82C

FASTA1,461160,017
        10         20         30         40         50         60 
MAAERGARRL LSTPSFWLYC LLLLGRRAPG AAAARSGSAP QSPGASIRTF TPFYFLVEPV 

        70         80         90        100        110        120 
DTLSVRGSSV ILNCSAYSEP SPKIEWKKDG TFLNLVSDDR RQLLPDGSLF ISNVVHSKHN 

       130        140        150        160        170        180 
KPDEGYYQCV ATVESLGTII SRTAKLIVAG LPRFTSQPEP SSVYAGNNAI LNCEVNADLV 

       190        200        210        220        230        240 
PFVRWEQNRQ PLLLDDRVIK LPSGMLVISN ATEGDGGLYR CVVESGGPPK YSDEVELKVL 

       250        260        270        280        290        300 
PDPEVISDLV FLKQPSPLVR VIGQDVVLPC VASGLPTPTI KWMKNEEALD TESSERLVLL 

       310        320        330        340        350        360 
AGGSLEISDV TEDDAGTYFC IADNGNETIE AQAELTVQAQ PEFLKQPTNI YAHESMDIVF 

       370        380        390        400        410        420 
ECEVTGKPTP TVKWVKNGDM VIPSDYFKIV KEHNLQVLGL VKSDEGFYQC IAENDVGNAQ 

       430        440        450        460        470        480 
AGAQLIILEH APATTGPLPS APRDVVASLV STRFIKLTWR TPASDPHGDN LTYSVFYTKE 

       490        500        510        520        530        540 
GIARERVENT SHPGEMQVTI QNLMPATVYI FRVMAQNKHG SGESSAPLRV ETQPEVQLPG 

       550        560        570        580        590        600 
PAPNLRAYAA SPTSITVTWE TPVSGNGEIQ NYKLYYMEKG TDKEQDVDVS SHSYTINGLK 

       610        620        630        640        650        660 
KYTEYSFRVV AYNKHGPGVS TPDVAVRTLS DVPSAAPQNL SLEVRNSKSI MIHWQPPAPA 

       670        680        690        700        710        720 
TQNGQITGYK IRYRKASRKS DVTETLVSGT QLSQLIEGLD RGTEYNFRVA ALTINGTGPA 

       730        740        750        760        770        780 
TDWLSAETFE SDLDETRVPE VPSSLHVR