Reviewed, UniProtKB/Swiss-Prot Q96PQ0 (SORC2_HUMAN)

Last modified January 19, 2010. Version 70.

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein names
    Recommended name: VPS10 domain-containing receptor SorCS2
Gene names
    Name: SORCS2
    Synonyms: KIAA1329
Organism: Homo sapiens (Human) [Complete proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo

Protein attributes

Sequence length: 1159 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at protein level.

General annotation (Comments)

Subcellular location

Membrane; Single-pass type I membrane protein.

Tissue specificity

Highly expressed in brain and kidney. Detected at low levels in heart, liver, small intestine, skeletal muscle and thymus.

Sequence similarities

Contains 6 BNR repeats.

Contains 1 PKD domain.

Caution

The N-terminus of the protein was constructed in analogy to that of the mouse ortholog using the sequence of chromosome 4.

Ontologies

Keywords
   Cellular component: Membrane
   Coding sequence diversity: Polymorphism
   Domain: Repeat, Signal, Transmembrane
   PTM: Glycoprotein
   Technical term: 3D-structure, Complete proteome
Gene Ontology (GO)
   Biological process: neuropeptide signaling pathway (Ref.1). Non-traceable author statement. Source: UniProtKB.
   Cellular component: integral to membrane. Inferred from electronic annotation. Source: UniProtKB-SubCell.
   Molecular function: neuropeptide receptor activity (Ref.1). Non-traceable author statement. Source: UniProtKB.
   Molecular function: protein binding (Ref.1). Non-traceable author statement. Source: UniProtKB.

Sequence annotation (Features)

Feature key          Position(s)    Length  Description                                Feature identifier

Molecule processing

Signal peptide       1 – 50         50      Potential
Chain                51 – 1159      1109    VPS10 domain-containing receptor SorCS2    PRO_0000033172

Regions

Topological domain   51 – 1078      1028    Lumenal (Potential)
Transmembrane        1079 – 1099    21      Potential
Topological domain   1100 – 1159    60      Cytoplasmic (Potential)
Repeat               182 – 193      12      BNR 1
Repeat               232 – 243      12      BNR 2
Repeat               273 – 284      12      BNR 3
Repeat               468 – 479      12      BNR 4
Repeat               545 – 556      12      BNR 5
Repeat               587 – 598      12      BNR 6
Domain               786 – 876      91      PKD
Compositional bias   13 – 104       92      Pro-rich
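
The region coordinates above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the entry: it assumes 1-based inclusive positions (so length = end - start + 1) and uses hypothetical helper names `length_mismatches` and `tiles_range`. It checks that each stated length matches its coordinates and that the three topology segments cover the mature chain (51 to 1159) without gaps or overlaps.

```python
# Region rows as listed in the feature table: (label, start, end, stated length).
REGIONS = [
    ("Topological domain (lumenal)", 51, 1078, 1028),
    ("Transmembrane", 1079, 1099, 21),
    ("Topological domain (cytoplasmic)", 1100, 1159, 60),
    ("Repeat BNR 1", 182, 193, 12),
    ("Repeat BNR 2", 232, 243, 12),
    ("Repeat BNR 3", 273, 284, 12),
    ("Repeat BNR 4", 468, 479, 12),
    ("Repeat BNR 5", 545, 556, 12),
    ("Repeat BNR 6", 587, 598, 12),
    ("Domain PKD", 786, 876, 91),
    ("Compositional bias Pro-rich", 13, 104, 92),
]


def length_mismatches(features):
    """Return rows whose stated length differs from end - start + 1."""
    return [f for f in features if f[2] - f[1] + 1 != f[3]]


def tiles_range(segments, first, last):
    """True if the (start, end) segments cover first..last with no gap or overlap."""
    expected = first
    for start, end in sorted(segments):
        if start != expected:
            return False
        expected = end + 1
    return expected == last + 1


# The first three rows are the lumenal, transmembrane and cytoplasmic segments.
TOPOLOGY = [(start, end) for _, start, end, _ in REGIONS[:3]]

if __name__ == "__main__":
    print(length_mismatches(REGIONS))       # rows with inconsistent lengths
    print(tiles_range(TOPOLOGY, 51, 1159))  # do the segments tile the chain?
```

With the values listed in this entry, the first call returns an empty list and the second returns True.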

Amino acid modifications

Glycosylation        158            1       N-linked (GlcNAc...) (Potential)
Glycosylation        328            1       N-linked (GlcNAc...) (Potential)
Glycosylation        362            1       N-linked (GlcNAc...) (Potential)
Glycosylation        600            1       N-linked (GlcNAc...) (Potential)
Glycosylation        830            1       N-linked (GlcNAc...) (Potential)
Glycosylation        891            1       N-linked (GlcNAc...) (Potential)
Glycosylation        902            1       N-linked (GlcNAc...) (Potential)
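
The entry does not say how these potential sites were predicted. As an illustration only, the sketch below tests a position against the commonly used N-linked glycosylation sequon N-X-S/T (X is any residue except proline). The function name `is_sequon` and the `first_position` offset parameter are hypothetical. The fragment is residues 121 to 180, copied from the sequence shown in this entry.

```python
def is_sequon(seq: str, position: int, first_position: int = 1) -> bool:
    """Check for an N-X-[S/T] motif (X != P) starting at a 1-based position.

    `first_position` is the sequence coordinate of seq[0], so that a
    fragment can be queried with full-length numbering.
    """
    i = position - first_position
    if i < 0 or i + 2 >= len(seq):
        return False
    return seq[i] == "N" and seq[i + 1] != "P" and seq[i + 2] in "ST"


# Residues 121-180 of the displayed sequence, with the display spacing removed.
FRAGMENT_121_180 = "".join(
    "APLAGVASRA QVSLISTSFV LKGDATHNQA MVHWTGENSS VILILTKYYH ADMGKVLESS".split()
)

if __name__ == "__main__":
    print(is_sequon(FRAGMENT_121_180, 158, first_position=121))  # annotated site
    print(is_sequon(FRAGMENT_121_180, 148, first_position=121))  # Asn, not annotated
```

Asn-158 is followed by Ser-Ser and satisfies the motif, whereas Asn-148 is followed by Gln-Ala and does not. The same check can be run on the other listed positions once the full sequence is loaded.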

Natural variations

Natural variant      345            1       G → R: dbSNP rs34058821.                   VAR_060109
Natural variant      595            1       T → I: dbSNP rs16840899.                   VAR_057726
Natural variant      695            1       T → M: dbSNP rs16840892.                   VAR_060110
Natural variant      745            1       T → I: dbSNP rs16840899.                   VAR_060111

Sequences

Sequence: Q96PQ0-1 [UniParc]
Length: 1,159
Mass (Da): 128,162
Last modified December 13, 2002. Version 2.
Checksum: 5C4FC4608C5BF872

        10         20         30         40         50         60 
MAHRGPSRAS KGPGPTARAP SPGAPPPPRS PRSRPLLLLL LLLGACGAAG RSPEPGRLGP 

        70         80         90        100        110        120 
HAQLTRVPRS PPAGRAEPGG GEDRQARGTE PGAPGPSPGP APGPGEDGAP AAGYRRWERA 

       130        140        150        160        170        180 
APLAGVASRA QVSLISTSFV LKGDATHNQA MVHWTGENSS VILILTKYYH ADMGKVLESS 

       190        200        210        220        230        240 
LWRSSDFGTS YTKLTLQPGV TTVIDNFYIC PTNKRKVILV SSSLSDRDQS LFLSADEGAT 

       250        260        270        280        290        300 
FQKQPIPFFV ETLIFHPKEE DKVLAYTKES KLYVSSDLGK KWTLLQERVT KDHVFWSVSG 

       310        320        330        340        350        360 
VDADPDLVHV EAQDLGGDFR YVTCAIHNCS EKMLTAPFAG PIDHGSLTVQ DDYIFFKATS 

       370        380        390        400        410        420 
ANQTKYYVSY RRNEFVLMKL PKYALPKDLQ IISTDESQVF VAVQEWYQMD TYNLYQSDPR 

       430        440        450        460        470        480 
GVRYALVLQD VRSSRQAEES VLIDILEVRG VKGVFLANQK IDGKVMTLIT YNKGRDWDYL 

       490        500        510        520        530        540 
RPPSMDMNGK PTNCKPPDCH LHLHLRWADN PYVSGTVHTK DTAPGLIMGA GNLGSQLVEY 

       550        560        570        580        590        600 
KEEMYITSDC GHTWRQVFEE EHHILYLDHG GVIVAIKDTS IPLKILKFSV DEGLTWSTHN 

       610        620        630        640        650        660 
FTSTSVFVDG LLSEPGDETL VMTVFGHISF RSDWELVKVD FRPSFSRQCG EEDYSSWELS 

       670        680        690        700        710        720 
NLQGDRCIMG QQRSFRKRKS TSWCIKGRSF TSALTSRVCE CRDSDFLCDY GFERSPSSES 

       730        740        750        760        770        780 
STNKCSANFW FNPLSPPDDC ALGQTYTSSL GYRKVVSNVC EGGVDMQQSQ VQLQCPLTPP 

       790        800        810        820        830        840 
RGLQVSIQGE AVAVRPGEDV LFVVRQEQGD VLTTKYQVDL GDGFKAMYVN LTLTGEPIRH 

       850        860        870        880        890        900 
RYESPGIYRV SVRAENTAGH DEAVLFVQVN SPLQALYLEV VPVIGLNQEV NLTAVLLPLN 

       910        920        930        940        950        960 
PNLTVFYWWI GHSLQPLLSL DNSVTTRFSD TGDVRVTVQA ACGNSVLQDS RVLRVLDQFQ 

       970        980        990       1000       1010       1020 
VMPLQFSKEL DAYNPNTPEW REDVGLVVTR LLSKETSVPQ ELLVTVVKPG LPTLADLYVL 

      1030       1040       1050       1060       1070       1080 
LPPPRPTRKR SLSSDKRLAA IQQVLNAQKI SFLLRGGVRV LVALRDTGTG AEQLGGGGGY 

      1090       1100       1110       1120       1130       1140 
WAVVVLFVIG LFAAGAFILY KFKRKRPGRT VYAQMHNEKE QEMTSPVSHS EDVQGAVQGN 

      1150 
HSGVVLSINS REMHSYLVS 
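
The sequence above is shown in display layout: ruler lines with position numbers alternate with rows of 10-residue groups. A minimal Python sketch for turning such a block into a plain residue string follows. The function name `clean_sequence_block` is hypothetical, and only the first two rows of the entry are embedded to keep the example short. The sketch assumes ruler lines contain nothing but integers.

```python
def clean_sequence_block(block: str) -> str:
    """Drop ruler lines (integers only) and all whitespace from a display block."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines such as "10 20 30 40 50 60".
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two display rows (residues 1-120), copied from the entry.
FIRST_ROWS = """\
        10         20         30         40         50         60
MAHRGPSRAS KGPGPTARAP SPGAPPPPRS PRSRPLLLLL LLLGACGAAG RSPEPGRLGP

        70         80         90        100        110        120
HAQLTRVPRS PPAGRAEPGG GEDRQARGTE PGAPGPSPGP APGPGEDGAP AAGYRRWERA
"""

if __name__ == "__main__":
    seq = clean_sequence_block(FIRST_ROWS)
    signal_peptide = seq[:50]   # positions 1-50, annotated as signal peptide
    chain_start = seq[50]       # position 51, first residue of the mature chain
    print(len(seq), signal_peptide[-10:], chain_start)
```

Applied to the full block, the cleaned string should have the 1,159 residues stated for this sequence.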


References

[1] "The genes for the human VPS10 domain-containing receptors are large and contain many small exons."
    Hampe W., Rezgaoui M., Hermans-Borgmeyer I., Schaller H.C.
    Hum. Genet. 108:529-536(2001) [PubMed: 11499680]
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 35-451.
[2] "Prediction of the coding sequences of unidentified human genes. XVI. The complete sequences of 150 new cDNA clones from brain which code for large proteins in vitro."
    Nagase T., Kikuno R., Ishikawa K., Hirosawa M., Ohara O.
    DNA Res. 7:65-73(2000) [PubMed: 10718198]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 253-1159.
    Tissue: Brain.
[3] "Solution structure of the PKD domain from human VPS10 domain-containing receptor SORCS2."
    RIKEN structural genomics initiative (RSGI)
    Submitted (NOV-2004) to the PDB data bank
    Cited for: STRUCTURE BY NMR OF 760-869.

Cross-references

Sequence databases

EMBL/GenBank/DDBJ:
    AF286190 mRNA. Translation: AAL04014.1.
    AB037750 mRNA. Translation: BAA92567.1.
    AC097382 Genomic DNA. No translation available.
IPI: IPI00044600.
RefSeq: NP_065828.2.
UniGene: Hs.479099.

3D structure databases

PDBe/RCSB PDB/PDBj:
    Entry  Method  Resolution (Å)  Chain  Positions
    1WGO   NMR     -               A      760-869
SMR: Q96PQ0. Positions 168-285.

Protein-protein interaction databases

STRING: Q96PQ0.

Proteomic databases

PRIDE: Q96PQ0.

Genome annotation databases

Ensembl: ENST00000403286; ENSP00000384595; ENSG00000184985; Homo sapiens.
GeneID: 57537.
KEGG: hsa:57537.
UCSC: uc003gkb.2. human.

Organism-specific databases

CTD: 57537.
GeneCards: GC04P007312.
H-InvDB: HIX0004076.
HGNC: HGNC:16698. SORCS2.
MIM: 606284. gene.
PharmGKB: PA134902026.

Phylogenomic databases

eggNOG: prNOG16195.
HOGENOM: HBG445080.
HOVERGEN: Q96PQ0.

Gene expression databases

ArrayExpress: Q96PQ0.
Bgee: Q96PQ0.
CleanEx: HS_SORCS2.
Genevestigator: Q96PQ0.

Family and domain databases

InterPro: IPR000601. PKD.
    IPR006581. VPS10.
Pfam: PF00801. PKD. 1 hit.
SMART: SM00089. PKD. 1 hit.
    SM00602. VPS10. 1 hit.
PROSITE: PS50093. PKD. 1 hit.

Other Resources

NextBio: 63960.

Entry information

Entry name: SORC2_HUMAN
Accession
    Primary (citable) accession number: Q96PQ0
    Secondary accession number(s): Q9P2L7
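
When handling identifiers like these programmatically, it can help to tell accession numbers apart from entry names. The sketch below is an illustration, not part of the entry: it uses the regular expression given in the UniProt help pages for accession numbers, and the function name `is_uniprot_accession` is hypothetical.

```python
import re

# Accession pattern as published in the UniProt help on accession numbers.
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9]([A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_uniprot_accession(identifier: str) -> bool:
    """True if the whole string has the shape of a UniProtKB accession number."""
    return ACCESSION_RE.fullmatch(identifier) is not None


if __name__ == "__main__":
    for identifier in ("Q96PQ0", "Q9P2L7", "SORC2_HUMAN"):
        print(identifier, is_uniprot_accession(identifier))
```

Both accession numbers of this entry match the pattern, while the entry name SORC2_HUMAN does not. The check is purely syntactic and says nothing about whether an accession exists.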
Entry history
    Integrated into UniProtKB/Swiss-Prot: December 13, 2002
    Last sequence update: December 13, 2002
    Last modified: January 19, 2010
    This is version 70 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: HPI (Human Proteome Initiative)
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 4

Human chromosome 4: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families
