Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q8R116 (NOTUM_MOUSE) Reviewed, UniProtKB/Swiss-Prot

Last modified September 21, 2011. Version 57. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Protein notum homolog

EC=3.-.-.-
Gene names
Name:Notum
OrganismMus musculus (Mouse)
Taxonomic identifier10090 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus

Protein attributes

Sequence length503 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at transcript level

General annotation (Comments)

Function

May deacetylate GlcNAc residues on cell surface glycans Potential.

Subcellular location

Secreted By similarity.

Sequence similarities

Belongs to the pectinacetylesterase family.

Ontologies

Keywords
   Cellular componentSecreted
   Coding sequence diversityAlternative splicing
   DomainSignal
   Molecular functionHydrolase
   PTMGlycoprotein
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Cellular componentextracellular region

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionhydrolase activity

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q8R116-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q8R116-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-175: Missing.
     176-185: NPHWWNANMV → MVGSPDPCYS
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 1919 Potential
Chain20 – 503484Protein notum homolog
PRO_0000318756

Regions

Compositional bias33 – 6028Pro-rich

Sites

Active site2391Charge relay system By similarity
Active site3471Charge relay system By similarity
Active site3961Charge relay system By similarity

Amino acid modifications

Glycosylation1031N-linked (GlcNAc...) Potential

Natural variations

Alternative sequence1 – 175175Missing in isoform 2.
VSP_031295
Alternative sequence176 – 18510NPHWWNANMV → MVGSPDPCYS in isoform 2.
VSP_031296

Experimental info

Sequence conflict4831T → M in AAH25832. Ref.2

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified February 26, 2008. Version 2.
Checksum: 9C92182D2F9DEDAF

FASTA50356,584
        10         20         30         40         50         60 
MGGEVRVLLL LGLLHWVGGS EGRKTWRRRG QQPPQPPPPP PLPQRAEVEP GAGQPVESFP 

        70         80         90        100        110        120 
LDFTAVEGNM DSFMAQVKSL AQSLYPCSAQ QLNEDLRLHL LLNTSVTCND GSPAGYYLKE 

       130        140        150        160        170        180 
SKGSRRWLLF LEGGWYCFNR ENCDSRYSTM RRLMSSKDWP HTRTGTGILS SQPEENPHWW 

       190        200        210        220        230        240 
NANMVFIPYC SSDVWSGASP KSDKNEYAFM GSLIIQEVVR ELLGKGLSGA KVLLLAGSSA 

       250        260        270        280        290        300 
GGTGVLLNVD RVAELLEELG YPSIQVRGLA DSGWFLDNKQ YRRSDCIDTI NCAPTDAIRR 

       310        320        330        340        350        360 
GIRYWSGMVP ERCQRQFKEG EEWNCFFGYK VYPTLRCPVF VVQWLFDEAQ LTVDNVHLTG 

       370        380        390        400        410        420 
QPVQEGQWLY IQNLGRELRG TLKDVQASFA PACLSHEIII RSYWTDVQVK GTSLPRALHC 

       430        440        450        460        470        480 
WDRSFHDSHK ASKTPMKGCP FHLVDSCPWP HCNPSCPTIR DQFTGQEMNV AQFLMHMGFD 

       490        500 
VQTVAQQQGM EPSKLLGMLS NGN 

« Hide

Isoform 2 [UniParc].

Checksum: 9D57068FB45E5F83
Show »

FASTA32836,756

References

[1]"Lineage-specific biology revealed by a finished genome assembly of the mouse."
Church D.M., Goodstadt L., Hillier L.W., Zody M.C., Goldstein S., She X., Bult C.J., Agarwala R., Cherry J.L., DiCuccio M., Hlavina W., Kapustin Y., Meric P., Maglott D., Birtle Z., Marques A.C., Graves T., Zhou S. expand/collapse author list , Teague B., Potamousis K., Churas C., Place M., Herschleb J., Runnheim R., Forrest D., Amos-Landgraf J., Schwartz D.C., Cheng Z., Lindblad-Toh K., Eichler E.E., Ponting C.P.
PLoS Biol. 7:E1000112-E1000112(2009) [PubMed: 19468303] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: C57BL/6J.
[2]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Strain: FVB/N.
Tissue: Liver.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AL663030 Genomic DNA. Translation: CAM27104.1.
BC025832 mRNA. Translation: AAH25832.1.
IPIIPI00648961.
IPI00652187.
RefSeqNP_780472.3. NM_175263.4.
UniGeneMm.32839.

3D structure databases

ModBaseSearch...

Proteomic databases

PRIDEQ8R116.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENSMUST00000106177; ENSMUSP00000101783; ENSMUSG00000042988.
ENSMUST00000106178; ENSMUSP00000101784; ENSMUSG00000042988.
GeneID77583.
KEGGmmu:77583.
UCSCuc007mty.2. mouse.
uc007mtz.2. mouse.

Organism-specific databases

CTD147111.
MGIMGI:1924833. Notum.

Phylogenomic databases

eggNOGroNOG04832.
GeneTreeENSGT00390000015892.
HOGENOMHBG446237.
HOVERGENHBG061551.
InParanoidQ8R116.
OMAKDVPASF.

Gene expression databases

ArrayExpressQ8R116.
BgeeQ8R116.
CleanExMM_NOTUM.
GenevestigatorQ8R116.

Family and domain databases

ProtoNetSearch...

Other

SOURCESearch...

Entry information

Entry nameNOTUM_MOUSE
AccessionPrimary (citable) accession number: Q8R116
Secondary accession number(s): A2ABZ5, A2ABZ6
Entry history
Integrated into UniProtKB/Swiss-Prot: February 26, 2008
Last sequence update: February 26, 2008
Last modified: September 21, 2011
This is version 57 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Relevant documents

MGD cross-references

Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families