
Q8WXI7 (MUC16_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified June 11, 2014. Version 88.


Names and origin

Protein names
Recommended name: Mucin-16
Short name: MUC-16
Alternative name(s):
Ovarian cancer-related tumor marker CA125 (short name: CA-125)
Ovarian carcinoma antigen CA125

Gene names
Name: MUC16
Synonyms: CA125

Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 22152 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level.

General annotation (Comments)

Function

Thought to provide a protective, lubricating barrier against particles and infectious agents at mucosal surfaces (by similarity).

Subunit structure

Binds to MSLN. Binding to MSLN mediates heterotypic cell adhesion. This may contribute to the metastasis of ovarian cancer to the peritoneum by initiating cell attachment to the mesothelial epithelium via binding to MSLN. Ref.8

Subcellular location

Cell membrane; single-pass type I membrane protein. Secreted, extracellular space. Note: May be liberated into the extracellular space following phosphorylation of the intracellular C-terminus, which induces proteolytic cleavage and liberation of the extracellular domain.

Tissue specificity

Expressed in corneal and conjunctival epithelia (at protein level). Overexpressed in ovarian carcinomas and ovarian low malignant potential (LMP) tumors as compared to the expression in normal ovarian tissue and ovarian adenomas. Ref.1 Ref.3 Ref.10

Induction

Up-regulated in ovarian cancer cells. Ref.1

Domain

Composed of three domains: a Ser- and Thr-rich N-terminal domain; a repeated domain containing more than 60 partially conserved tandem repeats of 156 amino acids each (AAs 12061-21862); and a C-terminal transmembrane-containing domain with a short cytoplasmic tail.
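
The repeat figures quoted in the domain description can be cross-checked with simple arithmetic. The sketch below assumes the quoted coordinates are inclusive and that every repeat has the nominal length of 156 residues; the constant names are illustrative only and do not come from the entry.

```python
# Cross-check of the repeat figures quoted in the Domain paragraph.
REPEAT_START = 12061  # first residue of the repeated domain (from the text)
REPEAT_END = 21862    # last residue of the repeated domain (from the text)
REPEAT_UNIT = 156     # nominal length of one tandem repeat (from the text)

# Inclusive residue count of the repeated domain.
span = REPEAT_END - REPEAT_START + 1

# Nominal number of repeat units, assuming every unit is exactly 156 residues.
n_repeats = span / REPEAT_UNIT

print(span)                 # 9802
print(round(n_repeats, 1))  # 62.8
```

A region of 9802 residues holds roughly 62.8 nominal units, which agrees with "more than 60" repeats. Because the repeats are only partially conserved, the real units need not all be exactly 156 residues long.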

Post-translational modification

Heavily O-glycosylated; expresses both type 1 and type 2 core glycans. Ref.7

Heavily N-glycosylated; expresses primarily high mannose and complex bisecting type N-linked glycans. Ref.7

May be phosphorylated. Phosphorylation of the intracellular C-terminal domain may induce proteolytic cleavage and the liberation of the extracellular domain into the extracellular space. Ref.6

May contain numerous disulfide bridges. Association of several molecules of the secreted form may occur through interchain disulfide bridges providing an extraordinarily large gel-like matrix in the extracellular space or in the lumen of secretory ducts.

Polymorphism

The number of repeats is highly polymorphic.

Miscellaneous

Antigen that is the basis for a widely used serum assay for the monitoring of patients with ovarian epithelial cancer. Due to lack of sensitivity for stage I disease and lack of specificity, it is of little value in the detection of early ovarian cancer. Due to its similarly elevated levels in some nonmalignant conditions, it is not specific enough to be used for population screening.

Sequence similarities

Contains 2 ANK repeats.

Contains 56 SEA domains.

Sequence annotation (Features)

Feature key | Position(s) | Length | Description | Feature identifier

Molecule processing

Chain | 1 – 22152 | 22152 | Mucin-16 | PRO_0000259595

Regions

Topological domain | 1 – 22096 | 22096 | Extracellular Potential
Transmembrane | 22097 – 22117 | 21 | Helical; Potential
Topological domain | 22118 – 22152 | 35 | Cytoplasmic Potential
Domain | 12070 – 12190 | 121 | SEA 1
Domain | 12229 – 12339 | 111 | SEA 2
Domain | 12386 – 12504 | 119 | SEA 3
Domain | 12539 – 12660 | 122 | SEA 4
Domain | 12697 – 12815 | 119 | SEA 5
Domain | 12853 – 12971 | 119 | SEA 6
Domain | 13009 – 13127 | 119 | SEA 7
Domain | 13165 – 13283 | 119 | SEA 8
Domain | 13315 – 13439 | 125 | SEA 9
Domain | 13477 – 13595 | 119 | SEA 10
Domain | 13633 – 13751 | 119 | SEA 11
Repeat | 13767 – 13801 | 35 | ANK 1
Domain | 13789 – 13907 | 119 | SEA 12
Domain | 13939 – 14063 | 125 | SEA 13
Domain | 14101 – 14219 | 119 | SEA 14
Domain | 14251 – 14375 | 125 | SEA 15
Domain | 14410 – 14531 | 122 | SEA 16
Domain | 14567 – 14689 | 123 | SEA 17
Domain | 14723 – 14845 | 123 | SEA 18
Domain | 14877 – 15001 | 125 | SEA 19
Domain | 15039 – 15157 | 119 | SEA 20
Domain | 15195 – 15313 | 119 | SEA 21
Domain | 15348 – 15469 | 122 | SEA 22
Domain | 15507 – 15625 | 119 | SEA 23
Domain | 15664 – 15774 | 111 | SEA 24
Domain | 15812 – 15936 | 125 | SEA 25
Domain | 16123 – 16247 | 125 | SEA 26
Domain | 16279 – 16403 | 125 | SEA 27
Domain | 16437 – 16559 | 123 | SEA 28
Domain | 16591 – 16715 | 125 | SEA 29
Domain | 16750 – 16871 | 122 | SEA 30
Domain | 16905 – 17027 | 123 | SEA 31
Domain | 17065 – 17183 | 119 | SEA 32
Domain | 17221 – 17339 | 119 | SEA 33
Domain | 17378 – 17488 | 111 | SEA 34
Domain | 17534 – 17644 | 111 | SEA 35
Domain | 17689 – 17807 | 119 | SEA 36
Domain | 18001 – 18119 | 119 | SEA 37
Domain | 18313 – 18431 | 119 | SEA 38
Domain | 18625 – 18743 | 119 | SEA 39
Repeat | 18915 – 18949 | 35 | ANK 2
Domain | 18937 – 19055 | 119 | SEA 40
Domain | 19243 – 19367 | 125 | SEA 41
Domain | 19555 – 19679 | 125 | SEA 42
Domain | 19866 – 19990 | 125 | SEA 43
Domain | 20184 – 20302 | 119 | SEA 44
Domain | 20340 – 20458 | 119 | SEA 45
Domain | 20496 – 20614 | 119 | SEA 46
Domain | 20652 – 20770 | 119 | SEA 47
Domain | 20804 – 20926 | 123 | SEA 48
Domain | 20964 – 21082 | 119 | SEA 49
Domain | 21117 – 21238 | 122 | SEA 50
Domain | 21273 – 21394 | 122 | SEA 51
Domain | 21433 – 21542 | 110 | SEA 52
Domain | 21565 – 21683 | 119 | SEA 53
Domain | 21717 – 21826 | 110 | SEA 54
Domain | 21836 – 21957 | 122 | SEA 55
Domain | 21959 – 22080 | 122 | SEA 56
Compositional bias | 14 – 12085 | 12072 | Thr-rich
Compositional bias | 1638 – 3055 | 1418 | Ser-rich
Compositional bias | 3961 – 4385 | 425 | Ser-rich
Compositional bias | 7085 – 10395 | 3311 | Ser-rich
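
The Length value of each region is the inclusive residue count, end - start + 1. The sketch below checks that relation on a handful of rows copied by hand from the regions above, and confirms that the three topological segments tile the full 22152-residue chain. The tuple layout and function name are illustrative assumptions, not part of the entry.

```python
# (description, start, end, reported_length), copied by hand from the
# region rows of this entry.
ROWS = [
    ("Extracellular", 1, 22096, 22096),
    ("Transmembrane", 22097, 22117, 21),
    ("Cytoplasmic", 22118, 22152, 35),
    ("SEA 1", 12070, 12190, 121),
    ("ANK 1", 13767, 13801, 35),
    ("Ser-rich", 7085, 10395, 3311),
]

CHAIN_LENGTH = 22152  # sequence length stated in the entry


def inclusive_length(start: int, end: int) -> int:
    """Number of residues in the closed interval [start, end]."""
    return end - start + 1


# Rows whose reported length disagrees with their coordinates.
mismatches = [row for row in ROWS if inclusive_length(row[1], row[2]) != row[3]]

# The first three rows are the topological segments; they should cover
# the chain exactly once.
topology_total = sum(row[3] for row in ROWS[:3])

print(mismatches)                      # []
print(topology_total == CHAIN_LENGTH)  # True
```

The same check is a quick way to confirm that a fused value such as "12070 – 12190121" has been split into end position and length correctly.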

Amino acid modifications

Glycosylation | 139 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 434 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 787 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 930 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 957 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 1375 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 1633 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 1840 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 1877 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 1890 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 2345 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 2375 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 2737 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 3086 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 3179 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 3502 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 4222 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 4500 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 4608 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 4615 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 4626 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 4863 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 5098 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 5133 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 5230 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 5322 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 5396 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 5472 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 5691 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 5865 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 6090 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 6734 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 6861 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 6963 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 8031 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 8057 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 8326 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 8620 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 8686 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 8915 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 9204 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 9495 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 9787 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 9920 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 10077 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 10175 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 10512 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 10702 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 10751 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 11055 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 11226 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 11265 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 11369 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 11596 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12081 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12102 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12118 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12170 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12237 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12274 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12395 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12416 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12432 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12551 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12572 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12588 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12706 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12727 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12743 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12826 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12862 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12883 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 12982 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13018 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13039 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13055 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13174 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13195 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13211 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13330 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13351 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13367 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13423 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13486 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13507 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13523 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13579 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13642 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13663 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13679 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13798 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13819 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13835 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13954 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13975 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 13991 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14110 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14131 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14147 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14266 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14287 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14303 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14359 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14422 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14459 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14580 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14601 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14617 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14736 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14757 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14773 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14825 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14892 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14913 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 14929 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15048 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15069 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15204 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15241 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15360 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15381 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15397 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15516 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15537 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15672 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15693 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15709 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15827 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15848 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15864 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15947 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 15983 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16004 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16020 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16138 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16159 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16175 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16258 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16294 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16315 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16331 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16450 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16471 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16487 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16539 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16606 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16627 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16643 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16762 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16783 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16799 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16918 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16939 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 16955 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17074 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17095 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17111 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17230 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17251 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17386 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17407 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17423 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17542 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17563 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17579 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17698 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17719 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17854 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17875 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 17891 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18010 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18031 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18047 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18166 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18187 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18203 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18322 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18343 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18359 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18478 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18499 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18515 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18634 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18655 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18790 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18811 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18827 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18946 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18967 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 18983 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19039 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19102 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19123 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19139 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19258 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19279 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19295 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19414 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19435 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19451 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19570 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19591 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19607 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19726 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19747 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19763 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19881 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19902 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 19918 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20037 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20058 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20074 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20157 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20193 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20214 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20230 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20349 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20370 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20505 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20526 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20661 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20682 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20698 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20817 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20838 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20973 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 20994 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21010 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21129 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21150 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21166 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21285 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21306 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21378 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21389 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21441 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21461 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21477 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21574 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21595 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21725 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21745 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21840 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21857 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21899 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21932 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 21971 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 22008 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 22062 | 1 | N-linked (GlcNAc...) Potential
Glycosylation | 22068 | 1 | N-linked (GlcNAc...) Potential
Disulfide bond | 12128 ↔ 12148 | | Potential
Disulfide bond | 12284 ↔ 12304 | | Potential
Disulfide bond | 12442 ↔ 12462 | | Potential
Disulfide bond | 12598 ↔ 12618 | | Potential
Disulfide bond | 12753 ↔ 12773 | | Potential
Disulfide bond | 12909 ↔ 12929 | | Potential
Disulfide bond | 13065 ↔ 13085 | | Potential
Disulfide bond | 13221 ↔ 13241 | | Potential
Disulfide bond | 13377 ↔ 13397 | | Potential
Disulfide bond | 13533 ↔ 13553 | | Potential
Disulfide bond | 13689 ↔ 13709 | | Potential
Disulfide bond | 13845 ↔ 13865 | | Potential
Disulfide bond | 14001 ↔ 14021 | | Potential
Disulfide bond | 14157 ↔ 14177 | | Potential
Disulfide bond | 14313 ↔ 14333 | | Potential
Disulfide bond | 14469 ↔ 14489 | | Potential
Disulfide bond | 14627 ↔ 14647 | | Potential
Disulfide bond | 14783 ↔ 14803 | | Potential
Disulfide bond | 14939 ↔ 14959 | | Potential
Disulfide bond | 15095 ↔ 15115 | | Potential
Disulfide bond | 15251 ↔ 15271 | | Potential
Disulfide bond | 15407 ↔ 15427 | | Potential
Disulfide bond | 15563 ↔ 15583 | | Potential
Disulfide bond | 15719 ↔ 15739 | | Potential
Disulfide bond | 15874 ↔ 15894 | | Potential
Disulfide bond | 16185 ↔ 16205 | | Potential
Disulfide bond | 16341 ↔ 16361 | | Potential
Disulfide bond | 16497 ↔ 16517 | | Potential
Disulfide bond | 16653 ↔ 16673 | | Potential
Disulfide bond | 16809 ↔ 16829 | | Potential
Disulfide bond | 16965 ↔ 16985 | | Potential
Disulfide bond | 17121 ↔ 17141 | | Potential
Disulfide bond | 17277 ↔ 17297 | | Potential
Disulfide bond | 17433 ↔ 17453 | | Potential
Disulfide bond | 17589 ↔ 17609 | | Potential
Disulfide bond | 17745 ↔ 17765 | | Potential
Disulfide bond | 18057 ↔ 18077 | | Potential
Disulfide bond | 18369 ↔ 18389 | | Potential
Disulfide bond | 18681 ↔ 18701 | | Potential
Disulfide bond | 18993 ↔ 19013 | | Potential
Disulfide bond | 19305 ↔ 19325 | | Potential
Disulfide bond | 19617 ↔ 19637 | | Potential
Disulfide bond | 19928 ↔ 19948 | | Potential
Disulfide bond | 20396 ↔ 20416 | | Potential
Disulfide bond | 20552 ↔ 20572 | | Potential
Disulfide bond | 20708 ↔ 20728 | | Potential
Disulfide bond | 20864 ↔ 20884 | | Potential
Disulfide bond | 21020 ↔ 21040 | | Potential
Disulfide bond | 21176 ↔ 21196 | | Potential
Disulfide bond | 21332 ↔ 21352 | | Potential
Disulfide bond | 21621 ↔ 21641 | | Potential
Disulfide bond | 21771 ↔ 21791 | | Potential
Disulfide bond | 22018 ↔ 22038 | | Potential
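
Potential N-linked glycosylation sites such as those listed above are commonly nominated by scanning for the sequon Asn-X-Ser/Thr, where X is any residue except Pro. The entry does not state how its sites were predicted, so the following is only a minimal sketch of that general motif scan. It is applied to residues 121-180 as displayed in the Sequences section of this entry; the function and constant names are hypothetical.

```python
import re

# Residues 121-180 as displayed in the Sequences section of this entry.
SEGMENT = "SMVSGLSSPRTRTSSTEGNFTKEASTYTLTVETTSGPVTEKYTVPTETSTTEGDSTETPW"
SEGMENT_START = 121  # sequence position of SEGMENT[0]

# Asn, then any residue except Pro, then Ser or Thr. The lookahead keeps
# the match one residue wide so that overlapping sequons are all reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def sequon_positions(segment: str, first_residue: int) -> list[int]:
    """Return 1-based sequence positions of Asn residues inside a sequon."""
    return [first_residue + m.start() for m in SEQUON.finditer(segment)]


print(sequon_positions(SEGMENT, SEGMENT_START))  # [139]
```

The scan recovers position 139, the first glycosylation site in the list. A motif match only nominates a candidate, so a scan of the full sequence need not reproduce the annotated list exactly.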

Natural variations

Natural variant | 545 | 1 | T → A. Corresponds to variant rs17000957. | VAR_056592
Natural variant | 1015 | 1 | R → G. Corresponds to variant rs17000950. | VAR_056593
Natural variant | 1032 | 1 | S → T. Corresponds to variant rs10411228. | VAR_056594
Natural variant | 1041 | 1 | P → S. Corresponds to variant rs10406202. | VAR_056595
Natural variant | 1162 | 1 | T → I. Corresponds to variant rs17000947. | VAR_056596
Natural variant | 1266 | 1 | K → N. Corresponds to variant rs1596797. | VAR_056597
Natural variant | 1353 | 1 | H → Y. Corresponds to variant rs12611293. | VAR_056598
Natural variant | 1400 | 1 | K → N. Corresponds to variant rs1596798. | VAR_056599
Natural variant | 1833 | 1 | L → F. Corresponds to variant rs4520945. | VAR_056600
Natural variant | 1953 | 1 | S → P. Corresponds to variant rs1108380. | VAR_056601
Natural variant | 2058 | 1 | S → P. Corresponds to variant rs1574479. | VAR_056602
Natural variant | 2150 | 1 | I → V. Corresponds to variant rs10407633. | VAR_056603
Natural variant | 2271 | 1 | T → A. Corresponds to variant rs11085805. | VAR_056604
Natural variant | 2288 | 1 | V → L. Corresponds to variant rs10410933. | VAR_056605
Natural variant | 2356 | 1 | D → E. Corresponds to variant rs10416013. | VAR_056606
Natural variant | 2747 | 1 | A → T. Corresponds to variant rs10402538. | VAR_056607
Natural variant | 2786 | 1 | M → I. Corresponds to variant rs17000886. | VAR_056608
Natural variant | 2834 | 1 | T → M. Corresponds to variant rs10407623. | VAR_056609
Natural variant | 3574 | 1 | R → H. Corresponds to variant rs2591594. | VAR_056610
Natural variant | 5743 | 1 | H → D. Corresponds to variant rs1559172. | VAR_056611
Natural variant | 5756 | 1 | A → T. Corresponds to variant rs1559171. | VAR_056612
Natural variant | 5854 | 1 | F → V. Corresponds to variant rs1862460. | VAR_056613
Natural variant | 7065 | 1 | T → A. Corresponds to variant rs17000770. | VAR_056614
Natural variant | 7274 | 1 | I → V. Corresponds to variant rs1867691. | VAR_056615
Natural variant | 10509 | 1 | T → N. Corresponds to variant rs11670461. | VAR_056616

Experimental info

Sequence conflict | 8346 | 1 | D → A in AAK74120. Ref.3
Sequence conflict | 8580 | 1 | A → T in AAK74120. Ref.3
Sequence conflict | 9061 | 1 | S → T in AAK74120. Ref.3
Sequence conflict | 9074 | 1 | A → T in AAK74120. Ref.3
Sequence conflict | 9131 | 1 | D → E in AAK74120. Ref.3
Sequence conflict | 9407 | 1 | Q → R in AAK74120. Ref.3
Sequence conflict | 9504 | 1 | T → A in AAK74120. Ref.3
Sequence conflict | 9610 | 1 | S → P in AAK74120. Ref.3
Sequence conflict | 9852 | 1 | D → E in AAK74120. Ref.3
Sequence conflict | 9901 | 1 | Y → F in AAK74120. Ref.3
Sequence conflict | 10171 | 1 | F → L in AAK74120. Ref.3
Sequence conflict | 10396 | 1 | T → S in AAK74120. Ref.3
Sequence conflict | 10515 | 1 | R → M in AAK74120. Ref.3
Sequence conflict | 10522 | 1 | L → P in AAK74120. Ref.3
Sequence conflict | 10563 | 1 | L → F in AAK74120. Ref.3
Sequence conflict | 10830 | 1 | P → S in AAK74120. Ref.3
Sequence conflict | 11156 | 1 | S → F in AAK74120. Ref.3
Sequence conflict | 11201 | 1 | A → G in AAK74120. Ref.3
Sequence conflict | 11457 | 1 | G → D in AAK74120. Ref.3
Sequence conflict | 11506 | 1 | K → R in AAK74120. Ref.3
Sequence conflict | 11577 | 1 | V → A in AAK74120. Ref.3
Sequence conflict | 11783 | 1 | G → E in AAK74120. Ref.3
Sequence conflict | 12142 | 1 | M → T in AAK74120. Ref.3
Sequence conflict | 12226 | 1 | A → T in AAK74120. Ref.3
Sequence conflict | 12582 | 1 | L → V in AAK74120. Ref.3
Sequence conflict | 12629 | 1 | L → V in AAK74120. Ref.3
Sequence conflict | 12695 | 1 | A → SAT in AAK74120. Ref.3
Sequence conflict | 12713 | 1 | Q → K in AAK74120. Ref.3
Sequence conflict | 12720 | 1 | H → C in AAK74120. Ref.3
Sequence conflict | 12735 | 1 | G → S in AAK74120. Ref.3
Sequence conflict | 12744 – 12899 | 156 | Missing in AAK74120. Ref.3
Sequence conflict | 13352 | 1 | T → A in AAK74120. Ref.3
Sequence conflict | 13519 – 14924 | 1406 | Missing in AAK74120. Ref.3
Sequence conflict | 15053 | 1 | D → N in AAK74120. Ref.3
Sequence conflict | 15215 – 20515 | 5301 | Missing in AAK74120. Ref.3
Sequence conflict | 20515 | 1 | V → E in BAC87568. Ref.5
Sequence conflict | 20760 – 21071 | 312 | Missing in BAC87568. Ref.5
Sequence conflict | 20839 | 1 | T → I in AAK74120. Ref.3
Sequence conflict | 20842 | 1 | R → S in AAK74120. Ref.3
Sequence conflict | 20977 | 1 | T → I in AAK74120. Ref.3
Sequence conflict | 21384 | 1 | H → P in AAK74120. Ref.3
Sequence conflict | 21384 | 1 | H → P in BAC87568. Ref.5
Sequence conflict | 21487 | 1 | S → C in AAK74120. Ref.3
Sequence conflict | 21487 | 1 | S → C in BAC87568. Ref.5
Sequence conflict | 21602 | 1 | K → Q in AAK74120. Ref.3
Sequence conflict | 21602 | 1 | K → Q in BAC87568. Ref.5
Sequence conflict | 21686 – 21695 | 10 | YNEPGLDEPP → HHTLQRQSTT in AAK74120. Ref.3
Sequence conflict | 21691 | 1 | L → P in AAK74120. Ref.3

Sequences

Sequence: Q8WXI7 [UniParc].
Length: 22,152 AA. Mass: 2,353,428 Da.

Last modified March 1, 2003. Version 2.
Checksum: B3E7BDF19997A440

        10         20         30         40         50         60 
MLKPSGLPGS SSPTRSLMTG SRSTKATPEM DSGLTGATLS PKTSTGAIVV TEHTLPFTSP 

        70         80         90        100        110        120 
DKTLASPTSS VVGRTTQSLG VMSSALPEST SRGMTHSEQR TSPSLSPQVN GTPSRNYPAT 

       130        140        150        160        170        180 
SMVSGLSSPR TRTSSTEGNF TKEASTYTLT VETTSGPVTE KYTVPTETST TEGDSTETPW 

       190        200        210        220        230        240 
DTRYIPVKIT SPMKTFADST ASKENAPVSM TPAETTVTDS HTPGRTNPSF GTLYSSFLDL 

       250        260        270        280        290        300 
SPKGTPNSRG ETSLELILST TGYPFSSPEP GSAGHSRIST SAPLSSSASV LDNKISETSI 

       310        320        330        340        350        360 
FSGQSLTSPL SPGVPEARAS TMPNSAIPFS MTLSNAETSA ERVRSTISSL GTPSISTKQT 

       370        380        390        400        410        420 
AETILTFHAF AETMDIPSTH IAKTLASEWL GSPGTLGGTS TSALTTTSPS TTLVSEETNT 

       430        440        450        460        470        480 
HHSTSGKETE GTLNTSMTPL ETSAPGEESE MTATLVPTLG FTTLDSKIRS PSQVSSSHPT 

       490        500        510        520        530        540 
RELRTTGSTS GRQSSSTAAH GSSDILRATT SSTSKASSWT SESTAQQFSE PQHTQWVETS 

       550        560        570        580        590        600 
PSMKTERPPA STSVAAPITT SVPSVVSGFT TLKTSSTKGI WLEETSADTL IGESTAGPTT 

       610        620        630        640        650        660 
HQFAVPTGIS MTGGSSTRGS QGTTHLLTRA TASSETSADL TLATNGVPVS VSPAVSKTAA 

       670        680        690        700        710        720 
GSSPPGGTKP SYTMVSSVIP ETSSLQSSAF REGTSLGLTP LNTRHPFSSP EPDSAGHTKI 

       730        740        750        760        770        780 
STSIPLLSSA SVLEDKVSAT STFSHHKATS SITTGTPEIS TKTKPSSAVL SSMTLSNAAT 

       790        800        810        820        830        840 
SPERVRNATS PLTHPSPSGE ETAGSVLTLS TSAETTDSPN IHPTGTLTSE SSESPSTLSL 

       850        860        870        880        890        900 
PSVSGVKTTF SSSTPSTHLF TSGEETEETS NPSVSQPETS VSRVRTTLAS TSVPTPVFPT 

       910        920        930        940        950        960 
MDTWPTRSAQ FSSSHLVSEL RATSSTSVTN STGSALPKIS HLTGTATMSQ TNRDTFNDSA 

       970        980        990       1000       1010       1020 
APQSTTWPET SPRFKTGLPS ATTTVSTSAT SLSATVMVSK FTSPATSSME ATSIREPSTT 

      1030       1040       1050       1060       1070       1080 
ILTTETTNGP GSMAVASTNI PIGKGYITEG RLDTSHLPIG TTASSETSMD FTMAKESVSM 

      1090       1100       1110       1120       1130       1140 
SVSPSQSMDA AGSSTPGRTS QFVDTFSDDV YHLTSREITI PRDGTSSALT PQMTATHPPS 

      1150       1160       1170       1180       1190       1200 
PDPGSARSTW LGILSSSPSS PTPKVTMSST FSTQRVTTSM IMDTVETSRW NMPNLPSTTS 

      1210       1220       1230       1240       1250       1260 
LTPSNIPTSG AIGKSTLVPL DTPSPATSLE ASEGGLPTLS TYPESTNTPS IHLGAHASSE 

      1270       1280       1290       1300       1310       1320 
SPSTIKLTMA SVVKPGSYTP LTFPSIETHI HVSTARMAYS SGSSPEMTAP GETNTGSTWD 

      1330       1340       1350       1360       1370       1380 
PTTYITTTDP KDTSSAQVST PHSVRTLRTT ENHPKTESAT PAAYSGSPKI SSSPNLTSPA 

      1390       1400       1410       1420       1430       1440 
TKAWTITDTT EHSTQLHYTK LAEKSSGFET QSAPGPVSVV IPTSPTIGSS TLELTSDVPG 

      1450       1460       1470       1480       1490       1500 
EPLVLAPSEQ TTITLPMATW LSTSLTEEMA STDLDISSPS SPMSTFAIFP PMSTPSHELS 

      1510       1520       1530       1540       1550       1560 
KSEADTSAIR NTDSTTLDQH LGIRSLGRTG DLTTVPITPL TTTWTSVIEH STQAQDTLSA 

      1570       1580       1590       1600       1610       1620 
TMSPTHVTQS LKDQTSIPAS ASPSHLTEVY PELGTQGRSS SEATTFWKPS TDTLSREIET 

      1630       1640       1650       1660       1670       1680 
GPTNIQSTPP MDNTTTGSSS SGVTLGIAHL PIGTSSPAET STNMALERRS STATVSMAGT 

      1690       1700       1710       1720       1730       1740 
MGLLVTSAPG RSISQSLGRV SSVLSESTTE GVTDSSKGSS PRLNTQGNTA LSSSLEPSYA 

      1750       1760       1770       1780       1790       1800 
EGSQMSTSIP LTSSPTTPDV EFIGGSTFWT KEVTTVMTSD ISKSSARTES SSATLMSTAL 

      1810       1820       1830       1840       1850       1860 
GSTENTGKEK LRTASMDLPS PTPSMEVTPW ISLTLSNAPN TTDSLDLSHG VHTSSAGTLA 

      1870       1880       1890       1900       1910       1920 
TDRSLNTGVT RASRLENGSD TSSKSLSMGN STHTSMTDTE KSEVSSSIHP RPETSAPGAE 

      1930       1940       1950       1960       1970       1980 
TTLTSTPGNR AISLTLPFSS IPVEEVISTG ITSGPDINSA PMTHSPITPP TIVWTSTGTI 

      1990       2000       2010       2020       2030       2040 
EQSTQPLHAV SSEKVSVQTQ STPYVNSVAV SASPTHENSV SSGSSTSSPY SSASLESLDS 

      2050       2060       2070       2080       2090       2100 
TISRRNAITS WLWDLTTSLP TTTWPSTSLS EALSSGHSGV SNPSSTTTEF PLFSAASTSA 

      2110       2120       2130       2140       2150       2160 
AKQRNPETET HGPQNTAAST LNTDASSVTG LSETPVGASI SSEVPLPMAI TSRSDVSGLT 

      2170       2180       2190       2200       2210       2220 
SESTANPSLG TASSAGTKLT RTISLPTSES LVSFRMNKDP WTVSIPLGSH PTTNTETSIP 

      2230       2240       2250       2260       2270       2280 
VNSAGPPGLS TVASDVIDTP SDGAESIPTV SFSPSPDTEV TTISHFPEKT THSFRTISSL 

      2290       2300       2310       2320       2330       2340 
THELTSRVTP IPGDWMSSAM STKPTGASPS ITLGERRTIT SAAPTTSPIV LTASFTETST 

      2350       2360       2370       2380       2390       2400 
VSLDNETTVK TSDILDARKT NELPSDSSSS SDLINTSIAS STMDVTKTAS ISPTSISGMT 

      2410       2420       2430       2440       2450       2460 
ASSSPSLFSS DRPQVPTSTT ETNTATSPSV SSNTYSLDGG SNVGGTPSTL PPFTITHPVE 

      2470       2480       2490       2500       2510       2520 
TSSALLAWSR PVRTFSTMVS TDTASGENPT SSNSVVTSVP APGTWASVGS TTDLPAMGFL 

      2530       2540       2550       2560       2570       2580 
KTSPAGEAHS LLASTIEPAT AFTPHLSAAV VTGSSATSEA SLLTTSESKA IHSSPQTPTT 

      2590       2600       2610       2620       2630       2640 
PTSGANWETS ATPESLLVVT ETSDTTLTSK ILVTDTILFS TVSTPPSKFP STGTLSGASF 

      2650       2660       2670       2680       2690       2700 
PTLLPDTPAI PLTATEPTSS LATSFDSTPL VTIASDSLGT VPETTLTMSE TSNGDALVLK 

      2710       2720       2730       2740       2750       2760 
TVSNPDRSIP GITIQGVTES PLHPSSTSPS KIVAPRNTTY EGSITVALST LPAGTTGSLV 

      2770       2780       2790       2800       2810       2820 
FSQSSENSET TALVDSSAGL ERASVMPLTT GSQGMASSGG IRSGSTHSTG TKTFSSLPLT 

      2830       2840       2850       2860       2870       2880 
MNPGEVTAMS EITTNRLTAT QSTAPKGIPV KPTSAESGLL TPVSASSSPS KAFASLTTAP 

      2890       2900       2910       2920       2930       2940 
PSTWGIPQST LTFEFSEVPS LDTKSASLPT PGQSLNTIPD SDASTASSSL SKSPEKNPRA 

      2950       2960       2970       2980       2990       3000 
RMMTSTKAIS ASSFQSTGFT ETPEGSASPS MAGHEPRVPT SGTGDPRYAS ESMSYPDPSK 

      3010       3020       3030       3040       3050       3060 
ASSAMTSTSL ASKLTTLFST GQAARSGSSS SPISLSTEKE TSFLSPTAST SRKTSLFLGP 

      3070       3080       3090       3100       3110       3120 
SMARQPNILV HLQTSALTLS PTSTLNMSQE EPPELTSSQT IAEEEGTTAE TQTLTFTPSE 

      3130       3140       3150       3160       3170       3180 
TPTSLLPVSS PTEPTARRKS SPETWASSIS VPAKTSLVET TDGTLVTTIK MSSQAAQGNS 

      3190       3200       3210       3220       3230       3240 
TWPAPAEETG TSPAGTSPGS PEVSTTLKIM SSKEPSISPE IRSTVRNSPW KTPETTVPME 

      3250       3260       3270       3280       3290       3300 
TTVEPVTLQS TALGSGSTSI SHLPTGTTSP TKSPTENMLA TERVSLSPSP PEAWTNLYSG 

      3310       3320       3330       3340       3350       3360 
TPGGTRQSLA TMSSVSLESP TARSITGTGQ QSSPELVSKT TGMEFSMWHG STGGTTGDTH 

      3370       3380       3390       3400       3410       3420 
VSLSTSSNIL EDPVTSPNSV SSLTDKSKHK TETWVSTTAI PSTVLNNKIM AAEQQTSRSV 

      3430       3440       3450       3460       3470       3480 
DEAYSSTSSW SDQTSGSDIT LGASPDVTNT LYITSTAQTT SLVSLPSGDQ GITSLTNPSG 

      3490       3500       3510       3520       3530       3540 
GKTSSASSVT SPSIGLETLR ANVSAVKSDI APTAGHLSQT SSPAEVSILD VTTAPTPGIS 

      3550       3560       3570       3580       3590       3600 
TTITTMGTNS ISTTTPNPEV GMSTMDSTPA TERRTTSTEH PSTWSSTAAS DSWTVTDMTS 

      3610       3620       3630       3640       3650       3660 
NLKVARSPGT ISTMHTTSFL ASSTELDSMS TPHGRITVIG TSLVTPSSDA SAVKTETSTS 

      3670       3680       3690       3700       3710       3720 
ERTLSPSDTT ASTPISTFSR VQRMSISVPD ILSTSWTPSS TEAEDVPVSM VSTDHASTKT 

      3730       3740       3750       3760       3770       3780 
DPNTPLSTFL FDSLSTLDWD TGRSLSSATA TTSAPQGATT PQELTLETMI SPATSQLPFS 

      3790       3800       3810       3820       3830       3840 
IGHITSAVTP AAMARSSGVT FSRPDPTSKK AEQTSTQLPT TTSAHPGQVP RSAATTLDVI 

      3850       3860       3870       3880       3890       3900 
PHTAKTPDAT FQRQGQTALT TEARATSDSW NEKEKSTPSA PWITEMMNSV SEDTIKEVTS 

      3910       3920       3930       3940       3950       3960 
SSSVLKDPEY AGHKLGIWDD FIPKFGKAAH MRELPLLSPP QDKEAIHPST NTVETTGWVT 

      3970       3980       3990       4000       4010       4020 
SSEHASHSTI PAHSASSKLT SPVVTTSTRE QAIVSMSTTT WPESTRARTE PNSFLTIELR 

      4030       4040       4050       4060       4070       4080 
DVSPYMDTSS TTQTSIISSP GSTAITKGPR TEITSSKRIS SSFLAQSMRS SDSPSEAITR 

      4090       4100       4110       4120       4130       4140 
LSNFPAMTES GGMILAMQTS PPGATSLSAP TLDTSATASW TGTPLATTQR FTYSEKTTLF 

      4150       4160       4170       4180       4190       4200 
SKGPEDTSQP SPPSVEETSS SSSLVPIHAT TSPSNILLTS QGHSPSSTPP VTSVFLSETS 

      4210       4220       4230       4240       4250       4260 
GLGKTTDMSR ISLEPGTSLP PNLSSTAGEA LSTYEASRDT KAIHHSADTA VTNMEATSSE 

      4270       4280       4290       4300       4310       4320 
YSPIPGHTKP SKATSPLVTS HIMGDITSST SVFGSSETTE IETVSSVNQG LQERSTSQVA 

      4330       4340       4350       4360       4370       4380 
SSATETSTVI THVSSGDATT HVTKTQATFS SGTSISSPHQ FITSTNTFTD VSTNPSTSLI 

      4390       4400       4410       4420       4430       4440 
MTESSGVTIT TQTGPTGAAT QGPYLLDTST MPYLTETPLA VTPDFMQSEK TTLISKGPKD 

      4450       4460       4470       4480       4490       4500 
VTWTSPPSVA ETSYPSSLTP FLVTTIPPAT STLQGQHTSS PVSATSVLTS GLVKTTDMLN 

      4510       4520       4530       4540       4550       4560 
TSMEPVTNSP QNLNNPSNEI LATLAATTDI ETIHPSINKA VTNMGTASSA HVLHSTLPVS 

      4570       4580       4590       4600       4610       4620 
SEPSTATSPM VPASSMGDAL ASISIPGSET TDIEGEPTSS LTAGRKENST LQEMNSTTES 

      4630       4640       4650       4660       4670       4680 
NIILSNVSVG AITEATKMEV PSFDATFIPT PAQSTKFPDI FSVASSRLSN SPPMTISTHM 

      4690       4700       4710       4720       4730       4740 
TTTQTGSSGA TSKIPLALDT STLETSAGTP SVVTEGFAHS KITTAMNNDV KDVSQTNPPF 

      4750       4760       4770       4780       4790       4800 
QDEASSPSSQ APVLVTTLPS SVAFTPQWHS TSSPVSMSSV LTSSLVKTAG KVDTSLETVT 

      4810       4820       4830       4840       4850       4860 
SSPQSMSNTL DDISVTSAAT TDIETTHPSI NTVVTNVGTT GSAFESHSTV SAYPEPSKVT 

      4870       4880       4890       4900       4910       4920 
SPNVTTSTME DTTISRSIPK SSKTTRTETE TTSSLTPKLR ETSISQEITS STETSTVPYK 

      4930       4940       4950       4960       4970       4980 
ELTGATTEVS RTDVTSSSST SFPGPDQSTV SLDISTETNT RLSTSPIMTE SAEITITTQT 

      4990       5000       5010       5020       5030       5040 
GPHGATSQDT FTMDPSNTTP QAGIHSAMTH GFSQLDVTTL MSRIPQDVSW TSPPSVDKTS 

      5050       5060       5070       5080       5090       5100 
SPSSFLSSPA MTTPSLISST LPEDKLSSPM TSLLTSGLVK ITDILRTRLE PVTSSLPNFS 

      5110       5120       5130       5140       5150       5160 
STSDKILATS KDSKDTKEIF PSINTEETNV KANNSGHESH SPALADSETP KATTQMVITT 

      5170       5180       5190       5200       5210       5220 
TVGDPAPSTS MPVHGSSETT NIKREPTYFL TPRLRETSTS QESSFPTDTS FLLSKVPTGT 

      5230       5240       5250       5260       5270       5280 
ITEVSSTGVN SSSKISTPDH DKSTVPPDTF TGEIPRVFTS SIKTKSAEMT ITTQASPPES 

      5290       5300       5310       5320       5330       5340 
ASHSTLPLDT STTLSQGGTH STVTQGFPYS EVTTLMGMGP GNVSWMTTPP VEETSSVSSL 

      5350       5360       5370       5380       5390       5400 
MSSPAMTSPS PVSSTSPQSI PSSPLPVTAL PTSVLVTTTD VLGTTSPESV TSSPPNLSSI 

      5410       5420       5430       5440       5450       5460 
THERPATYKD TAHTEAAMHH STNTAVTNVG TSGSGHKSQS SVLADSETSK ATPLMSTTST 

      5470       5480       5490       5500       5510       5520 
LGDTSVSTST PNISQTNQIQ TEPTASLSPR LRESSTSEKT SSTTETNTAF SYVPTGAITQ 

      5530       5540       5550       5560       5570       5580 
ASRTEISSSR TSISDLDRPT IAPDISTGMI TRLFTSPIMT KSAEMTVTTQ TTTPGATSQG 

      5590       5600       5610       5620       5630       5640 
ILPWDTSTTL FQGGTHSTVS QGFPHSEITT LRSRTPGDVS WMTTPPVEET SSGFSLMSPS 

      5650       5660       5670       5680       5690       5700 
MTSPSPVSST SPESIPSSPL PVTALLTSVL VTTTNVLGTT SPETVTSSPP NLSSPTQERL 

      5710       5720       5730       5740       5750       5760 
TTYKDTAHTE AMHASMHTNT AVANVGTSIS GHESQSSVPA DSHTSKATSP MGITFAMGDT 

      5770       5780       5790       5800       5810       5820 
SVSTSTPAFF ETRIQTESTS SLIPGLRDTR TSEEINTVTE TSTVLSEVPT TTTTEVSRTE 

      5830       5840       5850       5860       5870       5880 
VITSSRTTIS GPDHSKMSPY ISTETITRLS TFPFVTGSTE MAITNQTGPI GTISQATLTL 

      5890       5900       5910       5920       5930       5940 
DTSSTASWEG THSPVTQRFP HSEETTTMSR STKGVSWQSP PSVEETSSPS SPVPLPAITS 

      5950       5960       5970       5980       5990       6000 
HSSLYSAVSG SSPTSALPVT SLLTSGRRKT IDMLDTHSEL VTSSLPSASS FSGEILTSEA 

      6010       6020       6030       6040       6050       6060 
STNTETIHFS ENTAETNMGT TNSMHKLHSS VSIHSQPSGH TPPKVTGSMM EDAIVSTSTP 

      6070       6080       6090       6100       6110       6120 
GSPETKNVDR DSTSPLTPEL KEDSTALVMN STTESNTVFS SVSLDAATEV SRAEVTYYDP 

      6130       6140       6150       6160       6170       6180 
TFMPASAQST KSPDISPEAS SSHSNSPPLT ISTHKTIATQ TGPSGVTSLG QLTLDTSTIA 

      6190       6200       6210       6220       6230       6240 
TSAGTPSART QDFVDSETTS VMNNDLNDVL KTSPFSAEEA NSLSSQAPLL VTTSPSPVTS 

      6250       6260       6270       6280       6290       6300 
TLQEHSTSSL VSVTSVPTPT LAKITDMDTN LEPVTRSPQN LRNTLATSEA TTDTHTMHPS 

      6310       6320       6330       6340       6350       6360 
INTAMANVGT TSSPNEFYFT VSPDSDPYKA TSAVVITSTS GDSIVSTSMP RSSAMKKIES 

      6370       6380       6390       6400       6410       6420 
ETTFSLIFRL RETSTSQKIG SSSDTSTVFD KAFTAATTEV SRTELTSSSR TSIQGTEKPT 

      6430       6440       6450       6460       6470       6480 
MSPDTSTRSV TMLSTFAGLT KSEERTIATQ TGPHRATSQG TLTWDTSITT SQAGTHSAMT 

      6490       6500       6510       6520       6530       6540 
HGFSQLDLST LTSRVPEYIS GTSPPSVEKT SSSSSLLSLP AITSPSPVPT TLPESRPSSP 

      6550       6560       6570       6580       6590       6600 
VHLTSLPTSG LVKTTDMLAS VASLPPNLGS TSHKIPTTSE DIKDTEKMYP STNIAVTNVG 

      6610       6620       6630       6640       6650       6660 
TTTSEKESYS SVPAYSEPPK VTSPMVTSFN IRDTIVSTSM PGSSEITRIE MESTFSVAHG 

      6670       6680       6690       6700       6710       6720 
LKGTSTSQDP IVSTEKSAVL HKLTTGATET SRTEVASSRR TSIPGPDHST ESPDISTEVI 

      6730       6740       6750       6760       6770       6780 
PSLPISLGIT ESSNMTIITR TGPPLGSTSQ GTFTLDTPTT SSRAGTHSMA TQEFPHSEMT 

      6790       6800       6810       6820       6830       6840 
TVMNKDPEIL SWTIPPSIEK TSFSSSLMPS PAMTSPPVSS TLPKTIHTTP SPMTSLLTPS 

      6850       6860       6870       6880       6890       6900 
LVMTTDTLGT SPEPTTSSPP NLSSTSHVIL TTDEDTTAIE AMHPSTSTAA TNVETTCSGH 

      6910       6920       6930       6940       6950       6960 
GSQSSVLTDS EKTKATAPMD TTSTMGHTTV STSMSVSSET TKIKRESTYS LTPGLRETSI 

      6970       6980       6990       7000       7010       7020 
SQNASFSTDT SIVLSEVPTG TTAEVSRTEV TSSGRTSIPG PSQSTVLPEI STRTMTRLFA 

      7030       7040       7050       7060       7070       7080 
SPTMTESAEM TIPTQTGPSG STSQDTLTLD TSTTKSQAKT HSTLTQRFPH SEMTTLMSRG 

      7090       7100       7110       7120       7130       7140 
PGDMSWQSSP SLENPSSLPS LLSLPATTSP PPISSTLPVT ISSSPLPVTS LLTSSPVTTT 

      7150       7160       7170       7180       7190       7200 
DMLHTSPELV TSSPPKLSHT SDERLTTGKD TTNTEAVHPS TNTAASNVEI PSFGHESPSS 

      7210       7220       7230       7240       7250       7260 
ALADSETSKA TSPMFITSTQ EDTTVAISTP HFLETSRIQK ESISSLSPKL RETGSSVETS 

      7270       7280       7290       7300       7310       7320 
SAIETSAVLS EVSIGATTEI SRTEVTSSSR TSISGSAEST MLPEISTTRK IIKFPTSPIL 

      7330       7340       7350       7360       7370       7380 
AESSEMTIKT QTSPPGSTSE STFTLDTSTT PSLVITHSTM TQRLPHSEIT TLVSRGAGDV 

      7390       7400       7410       7420       7430       7440 
PRPSSLPVEE TSPPSSQLSL SAMISPSPVS STLPASSHSS SASVTSPLTP GQVKTTEVLD 

      7450       7460       7470       7480       7490       7500 
ASAEPETSSP PSLSSTSVEI LATSEVTTDT EKIHPFPNTA VTKVGTSSSG HESPSSVLPD 

      7510       7520       7530       7540       7550       7560 
SETTKATSAM GTISIMGDTS VSTLTPALSN TRKIQSEPAS SLTTRLRETS TSEETSLATE 

      7570       7580       7590       7600       7610       7620 
ANTVLSKVST GATTEVSRTE AISFSRTSMS GPEQSTMSQD ISIGTIPRIS ASSVLTESAK 

      7630       7640       7650       7660       7670       7680 
MTITTQTGPS ESTLESTLNL NTATTPSWVE THSIVIQGFP HPEMTTSMGR GPGGVSWPSP 

      7690       7700       7710       7720       7730       7740 
PFVKETSPPS SPLSLPAVTS PHPVSTTFLA HIPPSPLPVT SLLTSGPATT TDILGTSTEP 

      7750       7760       7770       7780       7790       7800 
GTSSSSSLST TSHERLTTYK DTAHTEAVHP STNTGGTNVA TTSSGYKSQS SVLADSSPMC 

      7810       7820       7830       7840       7850       7860 
TTSTMGDTSV LTSTPAFLET RRIQTELASS LTPGLRESSG SEGTSSGTKM STVLSKVPTG 

      7870       7880       7890       7900       7910       7920 
ATTEISKEDV TSIPGPAQST ISPDISTRTV SWFSTSPVMT ESAEITMNTH TSPLGATTQG 

      7930       7940       7950       7960       7970       7980 
TSTLATSSTT SLTMTHSTIS QGFSHSQMST LMRRGPEDVS WMSPPLLEKT RPSFSLMSSP 

      7990       8000       8010       8020       8030       8040 
ATTSPSPVSS TLPESISSSP LPVTSLLTSG LAKTTDMLHK SSEPVTNSPA NLSSTSVEIL 

      8050       8060       8070       8080       8090       8100 
ATSEVTTDTE KTHPSSNRTV TDVGTSSSGH ESTSFVLADS QTSKVTSPMV ITSTMEDTSV 

      8110       8120       8130       8140       8150       8160 
STSTPGFFET SRIQTEPTSS LTLGLRKTSS SEGTSLATEM STVLSGVPTG ATAEVSRTEV 

      8170       8180       8190       8200       8210       8220 
TSSSRTSISG FAQLTVSPET STETITRLPT SSIMTESAEM MIKTQTDPPG STPESTHTVD 

      8230       8240       8250       8260       8270       8280 
ISTTPNWVET HSTVTQRFSH SEMTTLVSRS PGDMLWPSQS SVEETSSASS LLSLPATTSP 

      8290       8300       8310       8320       8330       8340 
SPVSSTLVED FPSASLPVTS LLTPGLVITT DRMGISREPG TSSTSNLSST SHERLTTLED 

      8350       8360       8370       8380       8390       8400 
TVDTEDMQPS THTAVTNVRT SISGHESQSS VLSDSETPKA TSPMGTTYTM GETSVSISTS 

      8410       8420       8430       8440       8450       8460 
DFFETSRIQI EPTSSLTSGL RETSSSERIS SATEGSTVLS EVPSGATTEV SRTEVISSRG 

      8470       8480       8490       8500       8510       8520 
TSMSGPDQFT ISPDISTEAI TRLSTSPIMT ESAESAITIE TGSPGATSEG TLTLDTSTTT 

      8530       8540       8550       8560       8570       8580 
FWSGTHSTAS PGFSHSEMTT LMSRTPGDVP WPSLPSVEEA SSVSSSLSSP AMTSTSFFSA 

      8590       8600       8610       8620       8630       8640 
LPESISSSPH PVTALLTLGP VKTTDMLRTS SEPETSSPPN LSSTSAEILA TSEVTKDREK 

      8650       8660       8670       8680       8690       8700 
IHPSSNTPVV NVGTVIYKHL SPSSVLADLV TTKPTSPMAT TSTLGNTSVS TSTPAFPETM 

      8710       8720       8730       8740       8750       8760 
MTQPTSSLTS GLREISTSQE TSSATERSAS LSGMPTGATT KVSRTEALSL GRTSTPGPAQ 

      8770       8780       8790       8800       8810       8820 
STISPEISTE TITRISTPLT TTGSAEMTIT PKTGHSGASS QGTFTLDTSS RASWPGTHSA 

      8830       8840       8850       8860       8870       8880 
ATHRSPHSGM TTPMSRGPED VSWPSRPSVE KTSPPSSLVS LSAVTSPSPL YSTPSESSHS 

      8890       8900       8910       8920       8930       8940 
SPLRVTSLFT PVMMKTTDML DTSLEPVTTS PPSMNITSDE SLATSKATME TEAIQLSENT 

      8950       8960       8970       8980       8990       9000 
AVTQMGTISA RQEFYSSYPG LPEPSKVTSP VVTSSTIKDI VSTTIPASSE ITRIEMESTS 

      9010       9020       9030       9040       9050       9060 
TLTPTPRETS TSQEIHSATK PSTVPYKALT SATIEDSMTQ VMSSSRGPSP DQSTMSQDIS 

      9070       9080       9090       9100       9110       9120 
SEVITRLSTS PIKAESTEMT ITTQTGSPGA TSRGTLTLDT STTFMSGTHS TASQGFSHSQ 

      9130       9140       9150       9160       9170       9180 
MTALMSRTPG DVPWLSHPSV EEASSASFSL SSPVMTSSSP VSSTLPDSIH SSSLPVTSLL 

      9190       9200       9210       9220       9230       9240 
TSGLVKTTEL LGTSSEPETS SPPNLSSTSA EILATTEVTT DTEKLEMTNV VTSGYTHESP 

      9250       9260       9270       9280       9290       9300 
SSVLADSVTT KATSSMGITY PTGDTNVLTS TPAFSDTSRI QTKSKLSLTP GLMETSISEE 

      9310       9320       9330       9340       9350       9360 
TSSATEKSTV LSSVPTGATT EVSRTEAISS SRTSIPGPAQ STMSSDTSME TITRISTPLT 

      9370       9380       9390       9400       9410       9420 
RKESTDMAIT PKTGPSGATS QGTFTLDSSS TASWPGTHSA TTQRFPQSVV TTPMSRGPED 

      9430       9440       9450       9460       9470       9480 
VSWPSPLSVE KNSPPSSLVS SSSVTSPSPL YSTPSGSSHS SPVPVTSLFT SIMMKATDML 

      9490       9500       9510       9520       9530       9540 
DASLEPETTS APNMNITSDE SLATSKATTE TEAIHVFENT AASHVETTSA TEELYSSSPG 

      9550       9560       9570       9580       9590       9600 
FSEPTKVISP VVTSSSIRDN MVSTTMPGSS GITRIEIESM SSLTPGLRET RTSQDITSST 

      9610       9620       9630       9640       9650       9660 
ETSTVLYKMS SGATPEVSRT EVMPSSRTSI PGPAQSTMSL DISDEVVTRL STSPIMTESA 

      9670       9680       9690       9700       9710       9720 
EITITTQTGY SLATSQVTLP LGTSMTFLSG THSTMSQGLS HSEMTNLMSR GPESLSWTSP 

      9730       9740       9750       9760       9770       9780 
RFVETTRSSS SLTSLPLTTS LSPVSSTLLD SSPSSPLPVT SLILPGLVKT TEVLDTSSEP 

      9790       9800       9810       9820       9830       9840 
KTSSSPNLSS TSVEIPATSE IMTDTEKIHP SSNTAVAKVR TSSSVHESHS SVLADSETTI 

      9850       9860       9870       9880       9890       9900 
TIPSMGITSA VDDTTVFTSN PAFSETRRIP TEPTFSLTPG FRETSTSEET TSITETSAVL 

      9910       9920       9930       9940       9950       9960 
YGVPTSATTE VSMTEIMSSN RTHIPDSDQS TMSPDIITEV ITRLSSSSMM SESTQMTITT 

      9970       9980       9990      10000      10010      10020 
QKSSPGATAQ STLTLATTTA PLARTHSTVP PRFLHSEMTT LMSRSPENPS WKSSPFVEKT 

     10030      10040      10050      10060      10070      10080 
SSSSSLLSLP VTTSPSVSST LPQSIPSSSF SVTSLLTPGM VKTTDTSTEP GTSLSPNLSG 

     10090      10100      10110      10120      10130      10140 
TSVEILAASE VTTDTEKIHP SSSMAVTNVG TTSSGHELYS SVSIHSEPSK ATYPVGTPSS 

     10150      10160      10170      10180      10190      10200 
MAETSISTSM PANFETTGFE AEPFSHLTSG FRKTNMSLDT SSVTPTNTPS SPGSTHLLQS 

     10210      10220      10230      10240      10250      10260 
SKTDFTSSAK TSSPDWPPAS QYTEIPVDII TPFNASPSIT ESTGITSFPE SRFTMSVTES 

     10270      10280      10290      10300      10310      10320 
THHLSTDLLP SAETISTGTV MPSLSEAMTS FATTGVPRAI SGSGSPFSRT ESGPGDATLS 

     10330      10340      10350      10360      10370      10380 
TIAESLPSST PVPFSSSTFT TTDSSTIPAL HEITSSSATP YRVDTSLGTE SSTTEGRLVM 

     10390      10400      10410      10420      10430      10440 
VSTLDTSSQP GRTSSTPILD TRMTESVELG TVTSAYQVPS LSTRLTRTDG IMEHITKIPN 

     10450      10460      10470      10480      10490      10500 
EAAHRGTIRP VKGPQTSTSP ASPKGLHTGG TKRMETTTTA LKTTTTALKT TSRATLTTSV 

     10510      10520      10530      10540      10550      10560 
YTPTLGTLTP LNASRQMAST ILTEMMITTP YVFPDVPETT SSLATSLGAE TSTALPRTTP 

     10570      10580      10590      10600      10610      10620 
SVLNRESETT ASLVSRSGAE RSPVIQTLDV SSSEPDTTAS WVIHPAETIP TVSKTTPNFF 

     10630      10640      10650      10660      10670      10680 
HSELDTVSST ATSHGADVSS AIPTNISPSE LDALTPLVTI SGTDTSTTFP TLTKSPHETE 

     10690      10700      10710      10720      10730      10740 
TRTTWLTHPA ETSSTIPRTI PNFSHHESDA TPSIATSPGA ETSSAIPIMT VSPGAEDLVT 

     10750      10760      10770      10780      10790      10800 
SQVTSSGTDR NMTIPTLTLS PGEPKTIASL VTHPEAQTSS AIPTSTISPA VSRLVTSMVT 

     10810      10820      10830      10840      10850      10860 
SLAAKTSTTN RALTNSPGEP ATTVSLVTHP AQTSPTVPWT TSIFFHSKSD TTPSMTTSHG 

     10870      10880      10890      10900      10910      10920 
AESSSAVPTP TVSTEVPGVV TPLVTSSRAV ISTTIPILTL SPGEPETTPS MATSHGEEAS 

     10930      10940      10950      10960      10970      10980 
SAIPTPTVSP GVPGVVTSLV TSSRAVTSTT IPILTFSLGE PETTPSMATS HGTEAGSAVP 

     10990      11000      11010      11020      11030      11040 
TVLPEVPGMV TSLVASSRAV TSTTLPTLTL SPGEPETTPS MATSHGAEAS STVPTVSPEV 

     11050      11060      11070      11080      11090      11100 
PGVVTSLVTS SSGVNSTSIP TLILSPGELE TTPSMATSHG AEASSAVPTP TVSPGVSGVV 

     11110      11120      11130      11140      11150      11160 
TPLVTSSRAV TSTTIPILTL SSSEPETTPS MATSHGVEAS SAVLTVSPEV PGMVTSLVTS 

     11170      11180      11190      11200      11210      11220 
SRAVTSTTIP TLTISSDEPE TTTSLVTHSE AKMISAIPTL AVSPTVQGLV TSLVTSSGSE 

     11230      11240      11250      11260      11270      11280 
TSAFSNLTVA SSQPETIDSW VAHPGTEASS VVPTLTVSTG EPFTNISLVT HPAESSSTLP 

     11290      11300      11310      11320      11330      11340 
RTTSRFSHSE LDTMPSTVTS PEAESSSAIS TTISPGIPGV LTSLVTSSGR DISATFPTVP 

     11350      11360      11370      11380      11390      11400 
ESPHESEATA SWVTHPAVTS TTVPRTTPNY SHSEPDTTPS IATSPGAEAT SDFPTITVSP 

     11410      11420      11430      11440      11450      11460 
DVPDMVTSQV TSSGTDTSIT IPTLTLSSGE PETTTSFITY SETHTSSAIP TLPVSPGASK 

     11470      11480      11490      11500      11510      11520 
MLTSLVISSG TDSTTTFPTL TETPYEPETT AIQLIHPAET NTMVPKTTPK FSHSKSDTTL 

     11530      11540      11550      11560      11570      11580 
PVAITSPGPE ASSAVSTTTI SPDMSDLVTS LVPSSGTDTS TTFPTLSETP YEPETTVTWL 

     11590      11600      11610      11620      11630      11640 
THPAETSTTV SGTIPNFSHR GSDTAPSMVT SPGVDTRSGV PTTTIPPSIP GVVTSQVTSS 

     11650      11660      11670      11680      11690      11700 
ATDTSTAIPT LTPSPGEPET TASSATHPGT QTGFTVPIRT VPSSEPDTMA SWVTHPPQTS 

     11710      11720      11730      11740      11750      11760 
TPVSRTTSSF SHSSPDATPV MATSPRTEAS SAVLTTISPG APEMVTSQIT SSGAATSTTV 

     11770      11780      11790      11800      11810      11820 
PTLTHSPGMP ETTALLSTHP RTGTSKTFPA STVFPQVSET TASLTIRPGA ETSTALPTQT 

     11830      11840      11850      11860      11870      11880 
TSSLFTLLVT GTSRVDLSPT ASPGVSAKTA PLSTHPGTET STMIPTSTLS LGLLETTGLL 

     11890      11900      11910      11920      11930      11940 
ATSSSAETST STLTLTVSPA VSGLSSASIT TDKPQTVTSW NTETSPSVTS VGPPEFSRTV 

     11950      11960      11970      11980      11990      12000 
TGTTMTLIPS EMPTPPKTSH GEGVSPTTIL RTTMVEATNL ATTGSSPTVA KTTTTFNTLA 

     12010      12020      12030      12040      12050      12060 
GSLFTPLTTP GMSTLASESV TSRTSYNHRS WISTTSSYNR RYWTPATSTP VTSTFSPGIS 

     12070      12080      12090      12100      12110      12120 
TSSIPSSTAA TVPFMVPFTL NFTITNLQYE EDMRHPGSRK FNATERELQG LLKPLFRNSS 

     12130      12140      12150      12160      12170      12180 
LEYLYSGCRL ASLRPEKDSS AMAVDAICTH RPDPEDLGLD RERLYWELSN LTNGIQELGP 

     12190      12200      12210      12220      12230      12240 
YTLDRNSLYV NGFTHRSSMP TTSTPGTSTV DVGTSGTPSS SPSPTAAGPL LMPFTLNFTI 

     12250      12260      12270      12280      12290      12300 
TNLQYEEDMR RTGSRKFNTM ESVLQGLLKP LFKNTSVGPL YSGCRLTLLR PEKDGAATGV 

     12310      12320      12330      12340      12350      12360 
DAICTHRLDP KSPGLNREQL YWELSKLTND IEELGPYTLD RNSLYVNGFT HQSSVSTTST 

     12370      12380      12390      12400      12410      12420 
PGTSTVDLRT SGTPSSLSSP TIMAAGPLLV PFTLNFTITN LQYGEDMGHP GSRKFNTTER 

     12430      12440      12450      12460      12470      12480 
VLQGLLGPIF KNTSVGPLYS GCRLTSLRSE KDGAATGVDA ICIHHLDPKS PGLNRERLYW 

     12490      12500      12510      12520      12530      12540 
ELSQLTNGIK ELGPYTLDRN SLYVNGFTHR TSVPTTSTPG TSTVDLGTSG TPFSLPSPAT 

     12550      12560      12570      12580      12590      12600 
AGPLLVLFTL NFTITNLKYE EDMHRPGSRK FNTTERVLQT LLGPMFKNTS VGLLYSGCRL 

     12610      12620      12630      12640      12650      12660 
TLLRSEKDGA ATGVDAICTH RLDPKSPGLD REQLYWELSQ LTNGIKELGP YTLDRNSLYV 

     12670      12680      12690      12700      12710      12720 
NGFTHWIPVP TSSTPGTSTV DLGSGTPSSL PSPTAAGPLL VPFTLNFTIT NLQYEEDMHH 

     12730      12740      12750      12760      12770      12780 
PGSRKFNTTE RVLQGLLGPM FKNTSVGLLY SGCRLTLLRS EKDGAATGVD AICTHRLDPK 

     12790      12800      12810      12820      12830      12840 
SPGVDREQLY WELSQLTNGI KELGPYTLDR NSLYVNGFTH QTSAPNTSTP GTSTVDLGTS 

     12850      12860      12870      12880      12890      12900 
GTPSSLPSPT SAGPLLVPFT LNFTITNLQY EEDMRHPGSR KFNTTERVLQ GLLKPLFKST 

     12910      12920      12930      12940      12950      12960 
SVGPLYSGCR LTLLRSEKDG AATGVDAICT HRLDPKSPGV DREQLYWELS QLTNGIKELG 

     12970      12980      12990      13000      13010      13020 
PYTLDRNSLY VNGFTHQTSA PNTSTPGTST VDLGTSGTPS SLPSPTSAGP LLVPFTLNFT 

     13030      13040      13050      13060      13070      13080 
ITNLQYEEDM HHPGSRKFNT TERVLQGLLG PMFKNTSVGL LYSGCRLTLL RPEKNGAATG 

     13090      13100      13110      13120      13130      13140 
MDAICSHRLD PKSPGLNREQ LYWELSQLTH GIKELGPYTL DRNSLYVNGF THRSSVAPTS 

     13150      13160      13170      13180      13190      13200 
TPGTSTVDLG TSGTPSSLPS PTTAVPLLVP FTLNFTITNL QYGEDMRHPG SRKFNTTERV 

     13210      13220      13230      13240      13250      13260 
LQGLLGPLFK NSSVGPLYSG CRLISLRSEK DGAATGVDAI CTHHLNPQSP GLDREQLYWQ 

     13270      13280      13290      13300      13310      13320 
LSQMTNGIKE LGPYTLDRNS LYVNGFTHRS SGLTTSTPWT STVDLGTSGT PSPVPSPTTA 

     13330      13340      13350      13360      13370      13380 
GPLLVPFTLN FTITNLQYEE DMHRPGSRKF NTTERVLQGL LSPIFKNSSV GPLYSGCRLT 

     13390      13400      13410      13420      13430      13440 
SLRPEKDGAA TGMDAVCLYH PNPKRPGLDR EQLYWELSQL THNITELGPY SLDRDSLYVN 

     13450      13460      13470      13480      13490      13500 
GFTHQNSVPT TSTPGTSTVY WATTGTPSSF PGHTEPGPLL IPFTFNFTIT NLHYEENMQH 

     13510      13520      13530      13540      13550      13560 
PGSRKFNTTE RVLQGLLKPL FKNTSVGPLY SGCRLTSLRP EKDGAATGMD AVCLYHPNPK 

     13570      13580      13590      13600      13610      13620 
RPGLDREQLY WELSQLTHNI TELGPYSLDR DSLYVNGFTH QNSVPTTSTP GTSTVYWATT 

     13630      13640      13650      13660      13670      13680 
GTPSSFPGHT EPGPLLIPFT FNFTITNLHY EENMQHPGSR KFNTTERVLQ GLLKPLFKNT 

     13690      13700      13710      13720      13730      13740 
SVGPLYSGCR LTLLRPEKHE AATGVDTICT HRVDPIGPGL DRERLYWELS QLTNSITELG 

     13750      13760      13770      13780      13790      13800 
PYTLDRDSLY VNGFNPRSSV PTTSTPGTST VHLATSGTPS SLPGHTAPVP LLIPFTLNFT 

     13810      13820      13830      13840      13850      13860 
ITNLHYEENM QHPGSRKFNT TERVLQGLLK PLFKNTSVGP LYSGCRLTLL RPEKHEAATG 

     13870      13880      13890      13900      13910      13920 
VDTICTHRVD PIGPGLXXEX LYWELSXLTX XIXELGPYTL DRXSLYVNGF THXXSXPTTS 

     13930      13940      13950      13960      13970      13980 
TPGTSTVXXG TSGTPSSXPX XTSAGPLLVP FTLNFTITNL QYEEDMHHPG SRKFNTTERV 

     13990      14000      14010      14020      14030      14040 
LQGLLGPMFK NTSVGLLYSG CRLTLLRPEK NGAATGMDAI CSHRLDPKSP GLDREQLYWE 

     14050      14060      14070      14080      14090      14100 
LSQLTHGIKE LGPYTLDRNS LYVNGFTHRS SVAPTSTPGT STVDLGTSGT PSSLPSPTTA 

     14110      14120      14130      14140      14150      14160 
VPLLVPFTLN FTITNLQYGE DMRHPGSRKF NTTERVLQGL LGPLFKNSSV GPLYSGCRLI 

     14170      14180      14190      14200      14210      14220 
SLRSEKDGAA TGVDAICTHH LNPQSPGLDR EQLYWQLSQM TNGIKELGPY TLDRNSLYVN 

     14230      14240      14250      14260      14270      14280 
GFTHRSSGLT TSTPWTSTVD LGTSGTPSPV PSPTTAGPLL VPFTLNFTIT NLQYEEDMHR 

     14290      14300      14310      14320      14330      14340 
PGSRKFNATE RVLQGLLSPI FKNSSVGPLY SGCRLTSLRP EKDGAATGMD AVCLYHPNPK 

     14350      14360      14370      14380      14390      14400 
RPGLDREQLY WELSQLTHNI TELGPYSLDR DSLYVNGFTH QSSMTTTRTP DTSTMHLATS 

     14410      14420      14430      14440      14450      14460 
RTPASLSGPT TASPLLVLFT INCTITNLQY EEDMRRTGSR KFNTMESVLQ GLLKPLFKNT 

     14470      14480      14490      14500      14510      14520 
SVGPLYSGCR LTLLRPKKDG AATGVDAICT HRLDPKSPGL NREQLYWELS KLTNDIEELG 

     14530      14540      14550      14560      14570      14580 
PYTLDRNSLY VNGFTHQSSV STTSTPGTST VDLRTSGTPS SLSSPTIMXX XPLLXPFTXN 

     14590      14600      14610      14620      14630      14640 
XTITNLXXXX XMXXPGSRKF NTTERVLQGL LRPLFKNTSV SSLYSGCRLT LLRPEKDGAA 

     14650      14660      14670      14680      14690      14700 
TRVDAACTYR PDPKSPGLDR EQLYWELSQL THSITELGPY TLDRVSLYVN GFNPRSSVPT 

     14710      14720      14730      14740      14750      14760 
TSTPGTSTVH LATSGTPSSL PGHTXXXPLL XPFTXNXTIT NLXXXXXMXX PGSRKFNTTE 

     14770      14780      14790      14800      14810      14820 
RVLQGLLKPL FRNSSLEYLY SGCRLASLRP EKDSSAMAVD AICTHRPDPE DLGLDRERLY 

     14830      14840      14850      14860      14870      14880 
WELSNLTNGI QELGPYTLDR NSLYVNGFTH RSSGLTTSTP WTSTVDLGTS GTPSPVPSPT 

     14890      14900      14910      14920      14930      14940 
TAGPLLVPFT LNFTITNLQY EEDMHRPGSR RFNTTERVLQ GLLTPLFKNT SVGPLYSGCR 

     14950      14960      14970      14980      14990      15000 
LTLLRPEKQE AATGVDTICT HRVDPIGPGL DRERLYWELS QLTNSITELG PYTLDRDSLY 

     15010      15020      15030      15040      15050      15060 
VNGFNPWSSV PTTSTPGTST VHLATSGTPS SLPGHTAPVP LLIPFTLNFT ITDLHYEENM 

     15070      15080      15090      15100      15110      15120 
QHPGSRKFNT TERVLQGLLK PLFKSTSVGP LYSGCRLTLL RPEKHGAATG VDAICTLRLD 

     15130      15140      15150      15160      15170      15180 
PTGPGLDRER LYWELSQLTN SVTELGPYTL DRDSLYVNGF THRSSVPTTS IPGTSAVHLE 

     15190      15200      15210      15220      15230      15240 
TSGTPASLPG HTAPGPLLVP FTLNFTITNL QYEEDMRHPG SRKFSTTERV LQGLLKPLFK 

     15250      15260      15270      15280      15290      15300 
NTSVSSLYSG CRLTLLRPEK DGAATRVDAV CTHRPDPKSP GLDRERLYWK LSQLTHGITE 

     15310      15320      15330      15340      15350      15360 
LGPYTLDRHS LYVNGFTHQS SMTTTRTPDT STMHLATSRT PASLSGPTTA SPLLVLFTIN 

     15370      15380      15390      15400      15410      15420 
FTITNLRYEE NMHHPGSRKF NTTERVLQGL LRPVFKNTSV GPLYSGCRLT TLRPKKDGAA 

     15430      15440      15450      15460      15470      15480 
TKVDAICTYR PDPKSPGLDR EQLYWELSQL THSITELGPY TQDRDSLYVN GFTHRSSVPT 

     15490      15500      15510      15520      15530      15540 
TSIPGTSAVH LETSGTPASL PGHTAPGPLL VPFTLNFTIT NLQYEEDMRH PGSRKFNTTE 

     15550      15560      15570      15580      15590      15600 
RVLQGLLKPL FKSTSVGPLY SGCRLTLLRP EKRGAATGVD TICTHRLDPL NPGLDREQLY 

     15610      15620      15630      15640      15650      15660 
WELSKLTRGI IELGPYLLDR GSLYVNGFTH RTSVPTTSTP GTSTVDLGTS GTPFSLPSPA 

     15670      15680      15690      15700      15710      15720 
XXXPLLXPFT XNXTITNLXX XXXMXXPGSR KFNTTERVLQ TLLGPMFKNT SVGLLYSGCR 

     15730      15740      15750      15760      15770      15780 
LTLLRSEKDG AATGVDAICT HRLDPKSPGV DREQLYWELS QLTNGIKELG PYTLDRNSLY 

     15790      15800      15810      15820      15830      15840 
VNGFTHWIPV PTSSTPGTST VDLGSGTPSS LPSPTTAGPL LVPFTLNFTI TNLKYEEDMH 

     15850      15860      15870      15880      15890      15900 
CPGSRKFNTT ERVLQSLLGP MFKNTSVGPL YSGCRLTLLR SEKDGAATGV DAICTHRLDP 

     15910      15920      15930      15940      15950      15960 
KSPGVDREQL YWELSQLTNG IKELGPYTLD RNSLYVNGFT HQTSAPNTST PGTSTVDLGT 

     15970      15980      15990      16000      16010      16020 
SGTPSSLPSP TXXXPLLXPF TXNXTITNLX XXXXMXXPGS RKFNTTEXVL QGLLXPXFKN 

     16030      16040      16050      16060      16070      16080 
XSVGXLYSGC RLTXLRXEKX GAATGXDAIC XHXXXPKXPG LXXEXLYWEL SXLTXXIXEL 

     16090      16100      16110      16120      16130      16140 
GPYTLDRXSL YVNGFTHWIP VPTSSTPGTS TVDLGSGTPS SLPSPTTAGP LLVPFTLNFT 

     16150      16160      16170      16180      16190      16200 
ITNLKYEEDM HCPGSRKFNT TERVLQSLLG PMFKNTSVGP LYSGCRLTSL RSEKDGAATG 

     16210      16220      16230      16240      16250      16260 
VDAICTHRVD PKSPGVDREQ LYWELSQLTN GIKELGPYTL DRNSLYVNGF THQTSAPNTS 

     16270      16280      16290      16300      16310      16320 
TPGTSTVXXG TSGTPSSXPX XTSAGPLLVP FTLNFTITNL QYEEDMHHPG SRKFNTTERV 

     16330      16340      16350      16360      16370      16380 
LQGLLGPMFK NTSVGLLYSG CRLTLLRPEK NGATTGMDAI CTHRLDPKSP GLXXEXLYWE 

     16390      16400      16410      16420      16430      16440 
LSXLTXXIXE LGPYTLDRXS LYVNGFTHXX SXPTTSTPGT STVXXGTSGT PSSXPXXTXX 

     16450      16460      16470      16480      16490      16500 
XPLLXPFTXN XTITNLXXXX XMXXPGSRKF NTTERVLQGL LKPLFRNSSL EYLYSGCRLA 

     16510      16520      16530      16540      16550      16560 
SLRPEKDSSA MAVDAICTHR PDPEDLGLDR ERLYWELSNL TNGIQELGPY TLDRNSLYVN 

     16570      16580      16590      16600      16610      16620 
GFTHRSSMPT TSTPGTSTVD VGTSGTPSSS PSPTTAGPLL IPFTLNFTIT NLQYGEDMGH 

     16630      16640      16650      16660      16670      16680 
PGSRKFNTTE RVLQGLLGPI FKNTSVGPLY SGCRLTSLRS EKDGAATGVD AICIHHLDPK 

     16690      16700      16710      16720      16730      16740 
SPGLNRERLY WELSQLTNGI KELGPYTLDR NSLYVNGFTH RTSVPTTSTP GTSTVDLGTS 

     16750      16760      16770      16780      16790      16800 
GTPFSLPSPA TAGPLLVLFT LNFTITNLKY EEDMHRPGSR KFNTTERVLQ TLLGPMFKNT 

     16810      16820      16830      16840      16850      16860 
SVGLLYSGCR LTLLRSEKDG AATGVDAICT HRLDPKSPGL XXEXLYWELS XLTXXIXELG 

     16870      16880      16890      16900      16910      16920 
PYTLDRXSLY VNGFTHXXSX PTTSTPGTST VXXGTSGTPS SXPXXTXXXP LLXPFTXNXT 

     16930      16940      16950      16960      16970      16980 
ITNLXXXXXM XXPGSRKFNT TERVLQGLLR PVFKNTSVGP LYSGCRLTLL RPKKDGAATK 

     16990      17000      17010      17020      17030      17040 
VDAICTYRPD PKSPGLDREQ LYWELSQLTH SITELGPYTQ DRDSLYVNGF THRSSVPTTS 

     17050      17060      17070      17080      17090      17100 
IPGTSAVHLE TTGTPSSFPG HTEPGPLLIP FTFNFTITNL RYEENMQHPG SRKFNTTERV 

     17110      17120      17130      17140      17150      17160 
LQGLLTPLFK NTSVGPLYSG CRLTLLRPEK QEAATGVDTI CTHRVDPIGP GLDRERLYWE 

     17170      17180      17190      17200      17210      17220 
LSQLTNSITE LGPYTLDRDS LYVDGFNPWS SVPTTSTPGT STVHLATSGT PSPLPGHTAP 

     17230      17240      17250      17260      17270      17280 
VPLLIPFTLN FTITDLHYEE NMQHPGSRKF NTTERVLQGL LKPLFKSTSV GPLYSGCRLT 

     17290      17300      17310      17320      17330      17340 
LLRPEKHGAA TGVDAICTLR LDPTGPGLDR ERLYWELSQL TNSITELGPY TLDRDSLYVN 

     17350      17360      17370      17380      17390      17400 
GFNPWSSVPT TSTPGTSTVH LATSGTPSSL PGHTTAGPLL VPFTLNFTIT NLKYEEDMHC 

     17410      17420      17430      17440      17450      17460 
PGSRKFNTTE RVLQSLHGPM FKNTSVGPLY SGCRLTLLRS EKDGAATGVD AICTHRLDPK 

     17470      17480      17490      17500      17510      17520 
SPGLXXEXLY WELSXLTXXI XELGPYTLDR XSLYVNGFTH XXSXPTTSTP GTSTVXXGTS 

     17530      17540      17550      17560      17570      17580 
GTPSSXPXXT XXXPLLXPFT XNXTITNLXX XXXMXXPGSR KFNTTEXVLQ GLLXPXFKNX 

     17590      17600      17610      17620      17630      17640 
SVGXLYSGCR LTXLRXEKXG AATGXDAICX HXXXPKXPGL XXEXLYWELS XLTNSITELG 

     17650      17660      17670      17680      17690      17700 
PYTLDRDSLY VNGFTHRSSM PTTSIPGTSA VHLETSGTPA SLPGHTAPGP LLVPFTLNFT 

     17710      17720      17730      17740      17750      17760 
ITNLQYEEDM RHPGSRKFNT TERVLQGLLK PLFKSTSVGP LYSGCRLTLL RPEKRGAATG 

     17770      17780      17790      17800      17810      17820 
VDTICTHRLD PLNPGLXXEX LYWELSXLTX XIXELGPYTL DRXSLYVNGF THXXSXPTTS 

     17830      17840      17850      17860      17870      17880 
TPGTSTVXXG TSGTPSSXPX XTXXXPLLXP FTXNXTITNL XXXXXMXXPG SRKFNTTEXV 

     17890      17900      17910      17920      17930      17940 
LQGLLXPXFK NXSVGXLYSG CRLTXLRXEK XGAATGXDAI CXHXXXPKXP GLXXEXLYWE 

     17950      17960      17970      17980      17990      18000 
LSXLTXXIXE LGPYTLDRXS LYVNGFHPRS SVPTTSTPGT STVHLATSGT PSSLPGHTAP 

     18010      18020      18030      18040      18050      18060 
VPLLIPFTLN FTITNLHYEE NMQHPGSRKF NTTERVLQGL LGPMFKNTSV GLLYSGCRLT 

     18070      18080      18090      18100      18110      18120 
LLRPEKNGAA TGMDAICSHR LDPKSPGLXX EXLYWELSXL TXXIXELGPY TLDRXSLYVN 

     18130      18140      18150      18160      18170      18180 
GFTHXXSXPT TSTPGTSTVX XGTSGTPSSX PXXTXXXPLL XPFTXNXTIT NLXXXXXMXX 

     18190      18200      18210      18220      18230      18240 
PGSRKFNTTE XVLQGLLXPX FKNXSVGXLY SGCRLTXLRX EKXGAATGXD AICXHXXXPK 

     18250      18260      18270      18280      18290      18300 
XPGLXXEXLY WELSXLTXXI XELGPYTLDR XSLYVNGFTH QNSVPTTSTP GTSTVYWATT 

     18310      18320      18330      18340      18350      18360 
GTPSSFPGHT EPGPLLIPFT FNFTITNLHY EENMQHPGSR KFNTTERVLQ GLLTPLFKNT 

     18370      18380      18390      18400      18410      18420 
SVGPLYSGCR LTLLRPEKQE AATGVDTICT HRVDPIGPGL XXEXLYWELS XLTXXIXELG 

     18430      18440      18450      18460      18470      18480 
PYTLDRXSLY VNGFTHXXSX PTTSTPGTST VXXGTSGTPS SXPXXTXXXP LLXPFTXNXT 

     18490      18500      18510      18520      18530      18540 
ITNLXXXXXM XXPGSRKFNT TEXVLQGLLX PXFKNXSVGX LYSGCRLTXL RXEKXGAATG 

     18550      18560      18570      18580      18590      18600 
XDAICXHXXX PKXPGLXXEX LYWELSXLTX XIXELGPYTL DRXSLYVNGF THRSSVPTTS 

     18610      18620      18630      18640      18650      18660 
SPGTSTVHLA TSGTPSSLPG HTAPVPLLIP FTLNFTITNL HYEENMQHPG SRKFNTTERV 

     18670      18680      18690      18700      18710      18720 
LQGLLKPLFK STSVGPLYSG CRLTLLRPEK HGAATGVDAI CTLRLDPTGP GLXXEXLYWE 

     18730      18740      18750      18760      18770      18780 
LSXLTXXIXE LGPYTLDRXS LYVNGFTHXX SXPTTSTPGT STVXXGTSGT PSSXPXXTXX 

     18790      18800      18810      18820      18830      18840 
XPLLXPFTXN XTITNLXXXX XMXXPGSRKF NTTEXVLQGL LXPXFKNXSV GXLYSGCRLT 

     18850      18860      18870      18880      18890      18900 
XLRXEKXGAA TGXDAICXHX XXPKXPGLXX EXLYWELSXL TXXIXELGPY TLDRXSLYVN 

     18910      18920      18930      18940      18950      18960 
GFTHRTSVPT TSTPGTSTVH LATSGTPSSL PGHTAPVPLL IPFTLNFTIT NLQYEEDMHR 

     18970      18980      18990      19000      19010      19020 
PGSRKFNTTE RVLQGLLSPI FKNSSVGPLY SGCRLTSLRP EKDGAATGMD AVCLYHPNPK 

     19030      19040      19050      19060      19070      19080 
RPGLDREQLY CELSQLTHNI TELGPYSLDR DSLYVNGFTH QNSVPTTSTP GTSTVYWATT 

     19090      19100      19110      19120      19130      19140 
GTPSSFPGHT XXXPLLXPFT XNXTITNLXX XXXMXXPGSR KFNTTEXVLQ GLLXPXFKNX 

     19150      19160      19170      19180      19190      19200 
SVGXLYSGCR LTXLRXEKXG AATGXDAICX HXXXPKXPGL XXEXLYWELS XLTXXIXELG 

     19210      19220      19230      19240      19250      19260 
PYTLDRXSLY VNGFTHWSSG LTTSTPWTST VDLGTSGTPS PVPSPTTAGP LLVPFTLNFT 

     19270      19280      19290      19300      19310      19320 
ITNLQYEEDM HRPGSRKFNA TERVLQGLLS PIFKNTSVGP LYSGCRLTLL RPEKQEAATG 

     19330      19340      19350      19360      19370      19380 
VDTICTHRVD PIGPGLXXEX LYWELSXLTX XIXELGPYTL DRXSLYVNGF THXXSXPTTS 

     19390      19400      19410      19420      19430      19440 
TPGTSTVXXG TSGTPSSXPX XTXXXPLLXP FTXNXTITNL XXXXXMXXPG SRKFNTTEXV 

     19450      19460      19470      19480      19490      19500 
LQGLLXPXFK NXSVGXLYSG CRLTXLRXEK XGAATGXDAI CXHXXXPKXP GLXXEXLYWE 

     19510      19520      19530      19540      19550      19560 
LSXLTXXIXE LGPYTLDRXS LYVNGFTHRS FGLTTSTPWT STVDLGTSGT PSPVPSPTTA 

     19570      19580      19590      19600      19610      19620 
GPLLVPFTLN FTITNLQYEE DMHRPGSRKF NTTERVLQGL LTPLFRNTSV SSLYSGCRLT 

     19630      19640      19650      19660      19670      19680 
LLRPEKDGAA TRVDAVCTHR PDPKSPGLXX EXLYWELSXL TXXIXELGPY TLDRXSLYVN 

     19690      19700      19710      19720      19730      19740 
GFTHXXSXPT TSTPGTSTVX XGTSGTPSSX PXXTXXXPLL XPFTXNXTIT NLXXXXXMXX 

     19750      19760      19770      19780      19790      19800 
PGSRKFNTTE XVLQGLLXPX FKNXSVGXLY SGCRLTXLRX EKXGAATGXD AICXHXXXPK 

     19810      19820      19830      19840      19850      19860 
XPGLXXEXLY WELSXLTXXI XELGPYTLDR XSLYVNGFTH WIPVPTSSTP GTSTVDLGSG 

     19870      19880      19890      19900      19910      19920 
TPSSLPSPTT AGPLLVPFTL NFTITNLQYG EDMGHPGSRK FNTTERVLQG LLGPIFKNTS 

     19930      19940      19950      19960      19970      19980 
VGPLYSGCRL TSLRSEKDGA ATGVDAICIH HLDPKSPGLX XEXLYWELSX LTXXIXELGP 

     19990      20000      20010      20020      20030      20040 
YTLDRXSLYV NGFTHXXSXP TTSTPGTSTV XXGTSGTPSS XPXXTXXXPL LXPFTXNXTI 

     20050      20060      20070      20080      20090      20100 
TNLXXXXXMX XPGSRKFNTT EXVLQGLLXP XFKNXSVGXL YSGCRLTXLR XEKXGAATGX 

     20110      20120      20130      20140      20150      20160 
DAICXHXXXP KXPGLXXEXL YWELSXLTXX IXELGPYTLD RXSLYVNGFT HQTFAPNTST 

     20170      20180      20190      20200      20210      20220 
PGTSTVDLGT SGTPSSLPSP TSAGPLLVPF TLNFTITNLQ YEEDMHHPGS RKFNTTERVL 

     20230      20240      20250      20260      20270      20280 
QGLLGPMFKN TSVGLLYSGC RLTLLRPEKN GAATRVDAVC THRPDPKSPG LXXEXLYWEL 

     20290      20300      20310      20320      20330      20340 
SXLTXXIXEL GPYTLDRXSL YVNGFTHXXS XPTTSTPGTS TVXXGTSGTP SSXPXXTAPV 

     20350      20360      20370      20380      20390      20400 
PLLIPFTLNF TITNLHYEEN MQHPGSRKFN TTERVLQGLL KPLFKSTSVG PLYSGCRLTL 

     20410      20420      20430      20440      20450      20460 
LRPEKHGAAT GVDAICTLRL DPTGPGLDRE RLYWELSQLT NSVTELGPYT LDRDSLYVNG 

     20470      20480      20490      20500      20510      20520 
FTQRSSVPTT SIPGTSAVHL ETSGTPASLP GHTAPGPLLV PFTLNFTITN LQYEVDMRHP 

     20530      20540      20550      20560      20570      20580 
GSRKFNTTER VLQGLLKPLF KSTSVGPLYS GCRLTLLRPE KRGAATGVDT ICTHRLDPLN 

     20590      20600      20610      20620      20630      20640 
PGLDREQLYW ELSKLTRGII ELGPYLLDRG SLYVNGFTHR NFVPITSTPG TSTVHLGTSE 

     20650      20660      20670      20680      20690      20700 
TPSSLPRPIV PGPLLVPFTL NFTITNLQYE EAMRHPGSRK FNTTERVLQG LLRPLFKNTS 

     20710      20720      20730      20740      20750      20760 
IGPLYSSCRL TLLRPEKDKA ATRVDAICTH HPDPQSPGLN REQLYWELSQ LTHGITELGP 

     20770      20780      20790      20800      20810      20820 
YTLDRDSLYV DGFTHWSPIP TTSTPGTSIV NLGTSGIPPS LPETTXXXPL LXPFTXNXTI 

     20830      20840      20850      20860      20870      20880 
TNLXXXXXMX XPGSRKFNTT ERVLQGLLKP LFKSTSVGPL YSGCRLTLLR PEKDGVATRV 

     20890      20900      20910      20920      20930      20940 
DAICTHRPDP KIPGLDRQQL YWELSQLTHS ITELGPYTLD RDSLYVNGFT QRSSVPTTST 

     20950      20960      20970      20980      20990      21000 
PGTFTVQPET SETPSSLPGP TATGPVLLPF TLNFTITNLQ YEEDMHRPGS RKFNTTERVL 

     21010      21020      21030      21040      21050      21060 
QGLLMPLFKN TSVSSLYSGC RLTLLRPEKD GAATRVDAVC THRPDPKSPG LDRERLYWKL 

     21070      21080      21090      21100      21110      21120 
SQLTHGITEL GPYTLDRHSL YVNGFTHQSS MTTTRTPDTS TMHLATSRTP ASLSGPTTAS 

     21130      21140      21150      21160      21170      21180 
PLLVLFTINF TITNLRYEEN MHHPGSRKFN TTERVLQGLL RPVFKNTSVG PLYSGCRLTL 

     21190      21200      21210      21220      21230      21240 
LRPKKDGAAT KVDAICTYRP DPKSPGLDRE QLYWELSQLT HSITELGPYT LDRDSLYVNG 

     21250      21260      21270      21280      21290      21300 
FTQRSSVPTT SIPGTPTVDL GTSGTPVSKP GPSAASPLLV LFTLNFTITN LRYEENMQHP 

     21310      21320      21330      21340      21350      21360 
GSRKFNTTER VLQGLLRSLF KSTSVGPLYS GCRLTLLRPE KDGTATGVDA ICTHHPDPKS 

     21370      21380      21390      21400      21410      21420 
PRLDREQLYW ELSQLTHNIT ELGHYALDND SLFVNGFTHR SSVSTTSTPG TPTVYLGASK 

     21430      21440      21450      21460      21470      21480 
TPASIFGPSA ASHLLILFTL NFTITNLRYE ENMWPGSRKF NTTERVLQGL LRPLFKNTSV 

     21490      21500      21510      21520      21530      21540 
GPLYSGSRLT LLRPEKDGEA TGVDAICTHR PDPTGPGLDR EQLYLELSQL THSITELGPY 

     21550      21560      21570      21580      21590      21600 
TLDRDSLYVN GFTHRSSVPT TSTGVVSEEP FTLNFTINNL RYMADMGQPG SLKFNITDNV 

     21610      21620      21630      21640      21650      21660 
MKHLLSPLFQ RSSLGARYTG CRVIALRSVK NGAETRVDLL CTYLQPLSGP GLPIKQVFHE 

     21670      21680      21690      21700      21710      21720 
LSQQTHGITR LGPYSLDKDS LYLNGYNEPG LDEPPTTPKP ATTFLPPLSE ATTAMGYHLK 

     21730      21740      21750      21760      21770      21780 
TLTLNFTISN LQYSPDMGKG SATFNSTEGV LQHLLRPLFQ KSSMGPFYLG CQLISLRPEK 

     21790      21800      21810      21820      21830      21840 
DGAATGVDTT CTYHPDPVGP GLDIQQLYWE LSQLTHGVTQ LGFYVLDRDS LFINGYAPQN 

     21850      21860      21870      21880      21890      21900 
LSIRGEYQIN FHIVNWNLSN PDPTSSEYIT LLRDIQDKVT TLYKGSQLHD TFRFCLVTNL 

     21910      21920      21930      21940      21950      21960 
TMDSVLVTVK ALFSSNLDPS LVEQVFLDKT LNASFHWLGS TYQLVDIHVT EMESSVYQPT 

     21970      21980      21990      22000      22010      22020 
SSSSTQHFYL NFTITNLPYS QDKAQPGTTN YQRNKRNIED ALNQLFRNSS IKSYFSDCQV 

     22030      22040      22050      22060      22070      22080 
STFRSVPNRH HTGVDSLCNF SPLARRVDRV AIYEEFLRMT RNGTQLQNFT LDRSSVLVDG 

     22090      22100      22110      22120      22130      22140 
YSPNRNEPLT GNSDLPFWAV ILIGLAGLLG LITCLICGVL VTTRRRKKEG EYNVQQQCPG 

     22150 
YYQSHLDLED LQ 
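
The listing above alternates position-ruler lines (numbers marking every tenth residue) with rows of residues in space-separated groups of ten. As a minimal sketch, assuming the listing has been saved as plain text in exactly that layout, the following Python recovers the contiguous sequence and counts residues reported as X. The function names `parse_ruled_sequence` and `count_unknown` are hypothetical helpers for illustration, not part of any UniProt tool.

```python
import re

# Ruler lines contain only digits and whitespace (e.g. "     22150 ").
RULER = re.compile(r"^[\s\d]+$")


def parse_ruled_sequence(text: str) -> str:
    """Join residue rows into one string, skipping ruler and blank lines."""
    residues = []
    for line in text.splitlines():
        if not line.strip() or RULER.match(line):
            continue
        # Drop the spaces that separate the groups of ten residues.
        residues.append("".join(line.split()))
    return "".join(residues)


def count_unknown(seq: str) -> int:
    """Count positions given as X (residue not determined)."""
    return seq.count("X")


# The last two rows of the listing, copied from the entry.
sample = """
     22090      22100      22110      22120      22130      22140
YSPNRNEPLT GNSDLPFWAV ILIGLAGLLG LITCLICGVL VTTRRRKKEG EYNVQQQCPG

     22150
YYQSHLDLED LQ
"""

tail = parse_ruled_sequence(sample)
print(len(tail), count_unknown(tail))
```

Applied to the complete listing, the length of the parsed string can be checked against the final ruler value and the stated sequence length as a guard against truncated copies.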


References

[1] "The CA 125 gene: a newly discovered extension of the glycosylated N-terminal domain doubles the size of this extracellular superstructure."
O'Brien T.J., Beard J.B., Underwood L.J., Shigemasa K.
Tumor Biol. 23:154-169(2002)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1-10431, SEQUENCE REVISION TO N-TERMINUS, TISSUE SPECIFICITY, INDUCTION.
[2] "The CA 125 gene: an extracellular superstructure dominated by repeat sequences."
O'Brien T.J., Beard J.B., Underwood L.J., Dennis R.A., Santin A.D., York L.
Tumor Biol. 22:348-366(2001)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 10432-22152.
[3] "Molecular cloning of the CA125 ovarian cancer antigen: identification as a new mucin, MUC16."
Yin B.W.T., Lloyd K.O.
J. Biol. Chem. 276:27371-27375(2001)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 8297-22152, PROTEIN SEQUENCE OF 21360-21365 AND 21983-21995, TISSUE SPECIFICITY.
[4] Lloyd K.O., Yin B.W.T.
Submitted (SEP-2003) to the EMBL/GenBank/DDBJ databases
Cited for: SEQUENCE REVISION.
[5] "Complete sequencing and characterization of 21,243 full-length human cDNAs."
Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
Nat. Genet. 36:40-45(2004)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 20473-21695.
Tissue: Uterus.
[6] "CA125 phosphorylation is associated with its secretion from the WISH human amnion cell line."
Fendrick J.L., Konishi I., Geary S.M., Parmley T.H., Quirk J.G. Jr., O'Brien T.J.
Tumor Biol. 18:278-289(1997)
Cited for: PHOSPHORYLATION.
[7] "Characterization of the oligosaccharides associated with the human ovarian tumor marker CA125."
Kui Wong N., Easton R.L., Panico M., Sutton-Smith M., Morrison J.C., Lattanzio F.A., Morris H.R., Clark G.F., Dell A., Patankar M.S.
J. Biol. Chem. 278:28619-28634(2003)
Cited for: GLYCOSYLATION.
[8] "Binding of ovarian cancer antigen CA125/MUC16 to mesothelin mediates cell adhesion."
Rump A., Morikawa Y., Tanaka M., Minami S., Umesaki N., Takeuchi M., Miyajima A.
J. Biol. Chem. 279:9190-9198(2004)
Cited for: INTERACTION WITH MSLN.
[9] "Introducing the MUC16 gene: implications for prevention and early detection in epithelial ovarian cancer."
McLemore M.R., Aouizerat B.
Biol. Res. Nurs. 6:262-267(2005)
Cited for: REVIEW, POLYMORPHISM.
[10] "Mucin characteristics of human corneal-limbal epithelial cells that exclude the rose bengal anionic dye."
Argueso P., Tisdale A., Spurr-Michaud S., Sumiyoshi M., Gipson I.K.
Invest. Ophthalmol. Vis. Sci. 47:113-119(2006)
Cited for: TISSUE SPECIFICITY.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
AF414442 mRNA. Translation: AAL65133.2.
AF361486 mRNA. Translation: AAK74120.3.
AK128681 mRNA. Translation: BAC87568.1.
RefSeq: NP_078966.2. NM_024690.2.
UniGene: Hs.432676.

3D structure databases

ProteinModelPortal: Q8WXI7.

Protein-protein interaction databases

BioGrid: 125094. 3 interactions.
STRING: 9606.ENSP00000381008.

PTM databases

PhosphoSite: Q8WXI7.

Proteomic databases

MaxQB: Q8WXI7.
PaxDb: Q8WXI7.
PRIDE: Q8WXI7.

Genome annotation databases

GeneID: 94025.
KEGG: hsa:94025.

Organism-specific databases

CTD: 94025.
GeneCards: GC19M008960.
H-InvDB: HIX0021715.
HGNC: HGNC:15582. MUC16.
HPA: CAB055172.
MIM: 606154. gene.
neXtProt: NX_Q8WXI7.
PharmGKB: PA31314.

Phylogenomic databases

eggNOG: NOG12793.
KO: K16145.

Enzyme and pathway databases

Reactome: REACT_17015. Metabolism of proteins.

Gene expression databases

CleanEx: HS_MUC16.
Genevestigator: Q8WXI7.

Family and domain databases

Gene3D: 3.30.70.960. 65 hits.
InterPro: IPR028850. MUC16.
IPR000082. SEA_dom.
PANTHER: PTHR14672. PTHR14672. 1 hit.
Pfam: PF01390. SEA. 55 hits.
SMART: SM00200. SEA. 23 hits.
SUPFAM: SSF82671. SSF82671. 65 hits.
PROSITE: PS50024. SEA. 65 hits.

Other

ChiTaRS: MUC16. human.
GeneWiki: CA-125.
GenomeRNAi: 94025.
NextBio: 78314.
PRO: Q8WXI7.

Entry information

Entry name: MUC16_HUMAN
Accession: Primary (citable) accession number: Q8WXI7
Secondary accession number(s): Q6ZQW5, Q96RK2
Entry history:
Integrated into UniProtKB/Swiss-Prot: October 31, 2006
Last sequence update: March 1, 2003
Last modified: June 11, 2014
This is version 88 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human chromosome 19

Human chromosome 19: entries, gene names and cross-references to MIM