Q6UY09 (CEA20_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified May 1, 2013. Version 72.

Names and origin

Protein names
   Recommended name: Carcinoembryonic antigen-related cell adhesion molecule 20
Gene names
   Name: CEACAM20
   ORF Names: UNQ9366/PRO34155
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 585 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Subcellular location

Membrane; single-pass type I membrane protein (Potential).

Sequence similarities

Belongs to the immunoglobulin superfamily. CEA family.

Contains 4 Ig-like C2-type (immunoglobulin-like) domains.

Ontologies

Keywords
   Cellular component: Membrane
   Coding sequence diversity: Polymorphism
   Domain: Immunoglobulin domain, Repeat, Signal, Transmembrane, Transmembrane helix
   PTM: Disulfide bond, Glycoprotein
   Technical term: Complete proteome, Reference proteome
Gene Ontology (GO)
   Cellular component: integral to membrane
      Inferred from electronic annotation. Source: UniProtKB-KW

Sequence annotation (Features)

Feature key         Position(s)  Length  Description  Feature identifier

Molecule processing

Signal peptide      1 – 30       30      Potential
Chain               31 – 585     555     Carcinoembryonic antigen-related cell adhesion molecule 20  PRO_0000014577

Regions

Topological domain  31 – 450     420     Extracellular; Potential
Transmembrane       451 – 471    21      Helical; Potential
Topological domain  472 – 585    114     Cytoplasmic; Potential
Domain              58 – 154     97      Ig-like C2-type 1
Domain              160 – 246    87      Ig-like C2-type 2
Domain              256 – 341    86      Ig-like C2-type 3
Domain              346 – 432    87      Ig-like C2-type 4
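
The Length value of each region follows from its inclusive coordinates (length = end - start + 1). The Python sketch below, with coordinates transcribed from the Regions listing above, checks that arithmetic and confirms that the three topological segments cover the mature chain 31 – 585 without gaps. The names `REGIONS`, `span_length`, `mismatched_lengths` and `is_contiguous` are illustrative only and are not part of any UniProt tool.

```python
# Region coordinates transcribed from the Regions listing (1-based, inclusive).
# All names here are hypothetical helpers for illustration.
REGIONS = [
    ("Topological domain (extracellular)", 31, 450, 420),
    ("Transmembrane", 451, 471, 21),
    ("Topological domain (cytoplasmic)", 472, 585, 114),
    ("Ig-like C2-type 1", 58, 154, 97),
    ("Ig-like C2-type 2", 160, 246, 87),
    ("Ig-like C2-type 3", 256, 341, 86),
    ("Ig-like C2-type 4", 346, 432, 87),
]


def span_length(start: int, end: int) -> int:
    """Length of an inclusive 1-based interval."""
    return end - start + 1


def mismatched_lengths(regions):
    """Names of regions whose listed length disagrees with their coordinates."""
    return [name for name, start, end, listed in regions
            if span_length(start, end) != listed]


def is_contiguous(segments):
    """True if each segment starts immediately after the previous one ends."""
    return all(nxt[1] == cur[2] + 1 for cur, nxt in zip(segments, segments[1:]))


if __name__ == "__main__":
    print(mismatched_lengths(REGIONS))  # regions with inconsistent lengths, if any
    print(is_contiguous(REGIONS[:3]))   # do the topology segments tile the chain?
```

The same check applied to the chain itself gives 585 - 31 + 1 = 555, matching the Chain entry under Molecule processing.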

Amino acid modifications

Glycosylation       96           1       N-linked (GlcNAc...); Potential
Glycosylation       105          1       N-linked (GlcNAc...); Potential
Glycosylation       280          1       N-linked (GlcNAc...); Potential
Glycosylation       306          1       N-linked (GlcNAc...); Potential
Glycosylation       317          1       N-linked (GlcNAc...); Potential
Glycosylation       368          1       N-linked (GlcNAc...); Potential
Glycosylation       415          1       N-linked (GlcNAc...); Potential
Disulfide bond      90 ↔ 138             By similarity
Disulfide bond      276 ↔ 324            By similarity
Disulfide bond      375 ↔ 416            By similarity
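
Potential N-linked glycosylation sites are conventionally predicted from the N-X-S/T sequon, where X is any residue except proline. As a hedged illustration, the Python sketch below scans the first 120 residues of the sequence given in the Sequences section. `find_sequons` is a hypothetical helper, and the pattern is the general consensus rule, not necessarily the exact procedure used to annotate this entry.

```python
import re

# First 120 residues of Q6UY09, copied from the Sequences section.
FRAGMENT = (
    "MGPADSWGHHWMGILLSASLCTVWSPPAAAQLTLNANPLDATQSEDVVLPVFGTPRTPQI"
    "HGRSRELAKPSIAVSPGTAIEQKDMVTFYCTTKDVNITIHWVSNNLSIVFHERMQLSKDG"
)


def find_sequons(seq: str) -> list[int]:
    """Return 1-based positions of Asn in N-X-[ST] motifs, where X is not Pro."""
    # A lookahead is used so that overlapping motifs are all reported.
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", seq)]


if __name__ == "__main__":
    print(find_sequons(FRAGMENT))  # → [96, 105]
```

Within this window the scan reports positions 96 and 105, the two glycosylation sites listed above that fall in the first 120 residues.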

Natural variations

Natural variant     41           1       A → V. Corresponds to variant rs10408247 (dbSNP, Ensembl).  VAR_056030
Natural variant     87           1       T → I. Corresponds to variant rs36053277 (dbSNP, Ensembl).  VAR_061312
Natural variant     113          1       R → H. Corresponds to variant rs13345196 (dbSNP, Ensembl).  VAR_056031
Natural variant     127          1       I → V. Corresponds to variant rs35443082 (dbSNP, Ensembl).  VAR_056032
Natural variant     355          1       S → L. Corresponds to variant rs16959164 (dbSNP, Ensembl).  VAR_056033
Natural variant     369          1       S → F. Corresponds to variant rs10414398 (dbSNP, Ensembl).  VAR_056034
Natural variant     512          1       C → R. Corresponds to variant rs8100718 (dbSNP, Ensembl).  VAR_059385
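
Each natural variant gives a 1-based position, the reference residue and the substituted residue. The Python sketch below uses the first 120 residues from the Sequences section and the three variants that fall inside that window to show how such an annotation can be checked against the sequence and then applied. `apply_variant` is a hypothetical helper written for this illustration.

```python
# First 120 residues of Q6UY09, copied from the Sequences section.
FRAGMENT = (
    "MGPADSWGHHWMGILLSASLCTVWSPPAAAQLTLNANPLDATQSEDVVLPVFGTPRTPQI"
    "HGRSRELAKPSIAVSPGTAIEQKDMVTFYCTTKDVNITIHWVSNNLSIVFHERMQLSKDG"
)

# (position, reference residue, variant residue) for variants within residues 1-120.
VARIANTS = [(41, "A", "V"), (87, "T", "I"), (113, "R", "H")]


def apply_variant(seq: str, pos: int, ref: str, alt: str) -> str:
    """Return seq with the residue at 1-based pos changed from ref to alt."""
    if seq[pos - 1] != ref:
        raise ValueError(f"expected {ref} at position {pos}, found {seq[pos - 1]}")
    return seq[:pos - 1] + alt + seq[pos:]


if __name__ == "__main__":
    for pos, ref, alt in VARIANTS:
        mutated = apply_variant(FRAGMENT, pos, ref, alt)
        # Show a short window around the changed residue, before and after.
        print(f"{ref}{pos}{alt}: {FRAGMENT[pos - 6:pos + 5]} -> {mutated[pos - 6:pos + 5]}")
```

Raising an error when the reference residue does not match is a deliberate choice: it catches off-by-one mistakes between 1-based annotation positions and 0-based string indices.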

Sequences

Sequence: Q6UY09 [UniParc]
Length: 585
Mass (Da): 64,502
Last modified July 5, 2004. Version 1.
Checksum: 6E8D27BABB07E0F7

        10         20         30         40         50         60 
MGPADSWGHH WMGILLSASL CTVWSPPAAA QLTLNANPLD ATQSEDVVLP VFGTPRTPQI 

        70         80         90        100        110        120 
HGRSRELAKP SIAVSPGTAI EQKDMVTFYC TTKDVNITIH WVSNNLSIVF HERMQLSKDG 

       130        140        150        160        170        180 
KILTILIVQR EDSGTYQCEA RDALLSQRSD PIFLDVKYGP DPVEIKLESG VASGEVVEVM 

       190        200        210        220        230        240 
EGSSMTFLAE TKSHPPCAYT WFLLDSILSH TTRTFTIHAV SREHEGLYRC LVSNSATHLS 

       250        260        270        280        290        300 
SLGTLKVRVL ETLTMPQVVP SSLNLVENAR SVDLTCQTVN QSVNVQWFLS GQPLLPSEHL 

       310        320        330        340        350        360 
QLSADNRTLI IHGLQRNDTG PYACEVWNWG SRARSEPLEL TINYGPDQVH ITRESASEMI 

       370        380        390        400        410        420 
STIEAELNSS LTLQCWAESK PGAEYRWTLE HSTGEHLGEQ LIIRALTWEH DGIYNCTASN 

       430        440        450        460        470        480 
SLTGLARSTS VLVKVVGPQS SSLSSGAIAG IVIGILAVIA VASELGYFLC IRNARRPSRK 

       490        500        510        520        530        540 
TTEDPSHETS QPIPKEEHPT EPSSESLSPE YCNISQLQGR IRVELMQPPD LPEETYETKL 

       550        560        570        580 
PSASRRGNSF SPWKPPPKPL MPPLRLVSTV PKNMESIYEV LGMQQ 
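
The display above interleaves position rulers with 60-residue rows split into blocks of ten. A minimal Python sketch for turning such a block back into a plain residue string is given below. `parse_sequence_block` is a hypothetical helper, and the sample holds only the first two rows of the display.

```python
# First two rows of the sequence display, including their ruler lines.
SAMPLE = """
        10         20         30         40         50         60
MGPADSWGHH WMGILLSASL CTVWSPPAAA QLTLNANPLD ATQSEDVVLP VFGTPRTPQI

        70         80         90        100        110        120
HGRSRELAKP SIAVSPGTAI EQKDMVTFYC TTKDVNITIH WVSNNLSIVF HERMQLSKDG
"""


def parse_sequence_block(text: str) -> str:
    """Drop ruler lines and whitespace, keeping only residue letters."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines consist solely of numbers; residue lines solely of letters.
        if tokens and all(tok.isalpha() for tok in tokens):
            residues.append("".join(tokens))
    return "".join(residues)


if __name__ == "__main__":
    seq = parse_sequence_block(SAMPLE)
    print(len(seq), seq[:10])
```

Applied to the full display, the resulting string should have the 585 residues stated for this entry.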

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: AY358129 mRNA. Translation: AAQ88496.1.
IPI: IPI00789178.
RefSeq: NP_001096067.1. NM_001102597.1.
   NP_001096068.1. NM_001102598.1.
   NP_001096069.1. NM_001102599.1.
   NP_001096070.1. NM_001102600.1.
UniGene: Hs.689632.

3D structure databases

ProteinModelPortal: Q6UY09.
ModBase: Search...

PTM databases

PhosphoSite: Q6UY09.

Polymorphism databases

DMDM: 73619948.

Proteomic databases

PaxDb: Q6UY09.
PRIDE: Q6UY09.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

GeneID: 125931.
KEGG: hsa:125931.
UCSC: uc010ejn.1. human.

Organism-specific databases

CTD: 125931.
GeneCards: GC19M045005.
H-InvDB: HIX0040059.
HGNC: HGNC:24879. CEACAM20.
neXtProt: NX_Q6UY09.
PharmGKB: PA142672134.
GenAtlas: Search...

Phylogenomic databases

eggNOG: NOG146066.
HOVERGEN: HBG063623.
InParanoid: Q6UY09.
KO: K06499.

Gene expression databases

CleanEx: HS_CEACAM20.
Genevestigator: Q6UY09.

Family and domain databases

Gene3D: 2.60.40.10. 4 hits.
InterPro: IPR007110. Ig-like_dom.
   IPR013783. Ig-like_fold.
   IPR013098. Ig_I-set.
   IPR003599. Ig_sub.
   IPR003598. Ig_sub2.
Pfam: PF07679. I-set. 2 hits.
SMART: SM00409. IG. 2 hits.
   SM00408. IGc2. 2 hits.
PROSITE: PS50835. IG_LIKE. 4 hits.
ProtoNet: Search...

Other

GenomeRNAi: 125931.
NextBio: 81568.

Entry information

Entry name: CEA20_HUMAN
Accession: Primary (citable) accession number: Q6UY09
Entry history
   Integrated into UniProtKB/Swiss-Prot: August 16, 2005
   Last sequence update: July 5, 2004
   Last modified: May 1, 2013
   This is version 72 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 19

Human chromosome 19: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

SIMILARITY comments

Index of protein domains and families