Q8WZ42 (TITIN_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 101.

Names and origin

Protein names
Recommended name: Titin
EC=2.7.11.1
Alternative name(s): Connectin; Rhabdomyosarcoma antigen MU-RMS-40.14
Gene names
Name: TTN
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 34350 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Key component in the assembly and functioning of vertebrate striated muscles. By providing connections at the level of individual microfilaments, it contributes to the fine balance of forces between the two halves of the sarcomere. The size and extensibility of the cross-links are the main determinants of sarcomere extensibility properties of muscle. In non-muscle cells, seems to play a role in chromosome condensation and chromosome segregation during mitosis. Might link the lamina network to chromatin or nuclear actin, or both during interphase. Ref.31

Catalytic activity

ATP + a protein = ADP + a phosphoprotein.

Cofactor

Magnesium.

Enzyme regulation

Full activation of the protein kinase domain requires both phosphorylation of Tyr-32341, preventing it from blocking the catalytic aspartate residue, and binding of Ca/CALM to the C-terminal regulatory tail of the molecule which results in ATP binding to the kinase. Ref.31

Subunit structure

Interacts with MYOM1, MYOM2, tropomyosin and myosin. Interacts with actin, primarily via the PEVK domains, and with MYPN (By similarity). Interacts with FHL2, NEB, CRYAB, LMNA/lamin-A and LMNB/lamin-B. Interacts with TCAP/telethonin and/or ANK1 isoform Mu17/ank1.5, via the first two N-terminal immunoglobulin domains. Interacts with TRIM63 and TRIM55, through several domains including immunoglobulin domains 141 and 142. Interacts with ANKRD1, ANKRD2 and ANKRD23, via the region between immunoglobulin domains 77 and 78. Interacts with CAPN3, via immunoglobulin domain 79. Interacts with NBR1 through the protein kinase domain. Interacts with CALM/calmodulin. Isoform 8 interacts with OBSCN isoform 3. Ref.3 Ref.16 Ref.17 Ref.18 Ref.19 Ref.20 Ref.21 Ref.22 Ref.24 Ref.31 Ref.41

Subcellular location

Cytoplasm (Probable). Nucleus (Ref.24).

Tissue specificity

Isoform 3, isoform 7 and isoform 8 are expressed in cardiac muscle. Isoform 4 is expressed in vertebrate skeletal muscle. Isoform 6 is expressed in cardiac tissues. Ref.3 Ref.7

Domain

ZIS1 and ZIS5 regions contain multiple SPXR consensus sites for ERK- and CDK-like protein kinases as well as multiple SP motifs. ZIS1 could adopt a closed conformation which would block the TCAP-binding site.

The PEVK region may serve as an entropic spring of a chain of structural folds and may also be an interaction site to other myofilament proteins to form interfilament connectivity in the sarcomere.

Post-translational modification

Autophosphorylated (By similarity). Phosphorylated upon DNA damage, probably by ATM or ATR. Ref.14 Ref.23 Ref.26 Ref.27 Ref.28 Ref.29 Ref.31

Involvement in disease

Defects in TTN are the cause of hereditary myopathy with early respiratory failure (HMERF) [MIM:603689]; also known as Edstrom myopathy. HMERF is an autosomal dominant, adult-onset myopathy with early respiratory muscle involvement. Ref.41

Defects in TTN are the cause of familial hypertrophic cardiomyopathy type 9 (CMH9) [MIM:613765]. Familial hypertrophic cardiomyopathy is a hereditary heart disorder characterized by ventricular hypertrophy, which is usually asymmetric and often involves the interventricular septum. The symptoms include dyspnea, syncope, collapse, palpitations, and chest pain. They can be readily provoked by exercise. The disorder has inter- and intrafamilial variability ranging from benign to malignant forms with high risk of cardiac failure and sudden cardiac death. Ref.35

Defects in TTN are the cause of cardiomyopathy dilated type 1G (CMD1G) [MIM:604145]. Dilated cardiomyopathy is a disorder characterized by ventricular dilation and impaired systolic function, resulting in congestive heart failure and arrhythmia. Patients are at risk of premature death. Ref.37 Ref.38 Ref.40

Defects in TTN are the cause of tardive tibial muscular dystrophy (TMD) [MIM:600334]; also known as Udd myopathy. TMD is an autosomal dominant, late-onset distal myopathy. Muscle weakness and atrophy are usually confined to the anterior compartment of the lower leg, in particular the tibialis anterior muscle. Clinical symptoms usually occur at age 35-45 years or much later. Ref.36 Ref.39

Defects in TTN are the cause of limb-girdle muscular dystrophy type 2J (LGMD2J) [MIM:608807]. LGMD2J is an autosomal recessive degenerative myopathy characterized by progressive weakness of the pelvic and shoulder girdle muscles. Severe disability is observed within 20 years of onset.

Defects in TTN are the cause of early-onset myopathy with fatal cardiomyopathy (EOMFC) [MIM:611705]. Early-onset myopathies are inherited muscle disorders that manifest typically from birth or infancy with hypotonia, muscle weakness, and delayed motor development. EOMFC is a titinopathy that, in contrast with the previously described examples, involves both heart and skeletal muscle, has a congenital onset, and is purely recessive. This phenotype is due to homozygous out-of-frame TTN deletions, which lead to a total absence of titin's C-terminal end from striated muscles and to secondary CAPN3 depletion. Ref.42

Miscellaneous

In some isoforms, after the PEVK repeat region there is a long PEVK duplicated region. On account of this region, it has been very difficult to sequence the whole protein. This region, which ranges from 183 to 2174 residues in length, may be a key elastic element of titin.

Sequence similarities

Belongs to the protein kinase superfamily. CAMK Ser/Thr protein kinase family.

Contains 132 fibronectin type-III domains.

Contains 152 Ig-like (immunoglobulin-like) domains.

Contains 19 Kelch repeats.

Contains 1 protein kinase domain.

Contains 17 RCC1 repeats.

Contains 14 TPR repeats.

Contains 15 WD repeats.

Sequence caution

The sequence AAH58824.1 differs from that shown. Reason: Contaminating sequence. Potential poly-A sequence starting in position 553.

The sequence AAH70170.1 differs from that shown. Reason: Contaminating sequence. Potential poly-A sequence starting in position 627.

Ontologies

Keywords
Cellular component: Cytoplasm; Nucleus
Coding sequence diversity: Alternative splicing; Polymorphism
Disease: Cardiomyopathy; Disease mutation; Limb-girdle muscular dystrophy
Domain: Coiled coil; Immunoglobulin domain; Kelch repeat; Repeat; TPR repeat; WD repeat
Ligand: ATP-binding; Calcium; Calmodulin-binding; Magnesium; Metal-binding; Nucleotide-binding
Molecular function: Kinase; Serine/threonine-protein kinase; Transferase
PTM: Disulfide bond; Isopeptide bond; Phosphoprotein; Ubl conjugation
Technical term: 3D-structure; Complete proteome; Direct protein sequencing; Reference proteome
Gene Ontology (GO)
Biological process

cardiac muscle fiber development

Inferred from mutant phenotype. Source: BHF-UCL

cardiac muscle tissue morphogenesis

Inferred from mutant phenotype. Source: BHF-UCL

cardiac myofibril assembly

Inferred from mutant phenotype Ref.37. Source: BHF-UCL

mitotic chromosome condensation

Inferred from expression pattern. Source: BHF-UCL

muscle filament sliding

Traceable author statement. Source: Reactome

platelet activation

Traceable author statement. Source: Reactome

platelet degranulation

Traceable author statement. Source: Reactome

regulation of protein kinase activity

Inferred from mutant phenotype Ref.31. Source: BHF-UCL

response to calcium ion

Inferred from direct assay. Source: BHF-UCL

sarcomere organization

Inferred from mutant phenotype. Source: BHF-UCL

sarcomerogenesis

Inferred from mutant phenotype. Source: BHF-UCL

skeletal muscle myosin thick filament assembly

Inferred from mutant phenotype. Source: BHF-UCL

skeletal muscle thin filament assembly

Inferred from mutant phenotype. Source: BHF-UCL

striated muscle contraction

Traceable author statement Ref.1. Source: UniProtKB

Cellular component

M band

Inferred from direct assay Ref.3. Source: BHF-UCL

Z disc

Inferred from direct assay. Source: UniProtKB

condensed nuclear chromosome

Inferred from direct assay. Source: BHF-UCL

cytosol

Traceable author statement. Source: Reactome

extracellular region

Traceable author statement. Source: Reactome

striated muscle thin filament

Inferred from direct assay Ref.3. Source: BHF-UCL

Molecular function

ATP binding

Inferred from electronic annotation. Source: UniProtKB-KW

actin filament binding

Inferred from direct assay. Source: BHF-UCL

calcium ion binding

Inferred from direct assay. Source: BHF-UCL

calmodulin binding

Traceable author statement. Source: UniProtKB

identical protein binding

Inferred from physical interaction Ref.31. Source: IntAct

muscle alpha-actinin binding

Inferred from physical interaction. Source: BHF-UCL

nucleic acid binding

Inferred from electronic annotation. Source: InterPro

protein binding

Inferred from physical interaction Ref.3 Ref.20. Source: BHF-UCL

protein self-association

Inferred from direct assay Ref.31. Source: BHF-UCL

protein serine/threonine kinase activity

Inferred from direct assay Ref.31. Source: UniProtKB

protein tyrosine kinase activity

Inferred from electronic annotation. Source: InterPro

structural constituent of muscle

Traceable author statement Ref.1. Source: UniProtKB

telethonin binding

Inferred from sequence or structural similarity. Source: BHF-UCL

Binary interactions

Alternative products

This entry describes 8 isoforms produced by alternative splicing.

Note: A number of isoforms may be produced, ranging from 27000 to 33000 residues in different striated muscle tissues. The full-length protein may be up to 38138 residues long.
Isoform 1 (identifier: Q8WZ42-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: No experimental confirmation available.
Isoform 2 (identifier: Q8WZ42-2)

The sequence of this isoform differs from the canonical sequence as follows:
     555-646: Missing.
Note: No experimental confirmation available.
Isoform 3 (identifier: Q8WZ42-3)

Also known as: Small cardiac N2-B;

The sequence of this isoform differs from the canonical sequence as follows:
     556-601: Missing.
     4474-11851: Missing.
Isoform 4 (identifier: Q8WZ42-4)

Also known as: Soleus;

The sequence of this isoform differs from the canonical sequence as follows:
     3454-4380: Missing.
     11507-11507: E → EVFEEPEESPSAPPKKPEVPPVR
Note: No experimental confirmation available.
Isoform 5 (identifier: Q8WZ42-5)

The sequence of this isoform differs from the canonical sequence as follows:
     10382-10645: Missing.
     10742-10931: Missing.
     11015-11163: Missing.
     11223-11852: Missing.
     11985-12201: Missing.
Note: No experimental confirmation available.
Isoform 6 (identifier: Q8WZ42-6)

Also known as: Small cardiac novex-3;

The sequence of this isoform differs from the canonical sequence as follows:
     3455-5604: FSSSFLSAEE...VLDLIIPPSF → LFSEGESEHS...AESFAALTLT
     5605-34350: Missing.
Note: Phosphorylated on Thr-5304 and Ser-5306. Ref.2 (CAD12457) sequence is in conflict in positions: 3732:L->F and 5139:R->M.
Isoform 7 (identifier: Q8WZ42-7)

Also known as: Cardiac novex-2;

The sequence of this isoform differs from the canonical sequence as follows:
     3435-3645: APESILHERI...LPAIFEYTVV → VQALDRQSSG...IEQEIEMEMK
     3646-4380: Missing.
Isoform 8 (identifier: Q8WZ42-8)

Also known as: Cardiac novex-1;

The sequence of this isoform differs from the canonical sequence as follows:
     3434-3434: E → GFSKFEENTS...CAATLTVTPK
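
The isoform descriptions above amount to a list of edits against the canonical sequence: a "Missing" range removes residues, and a replacement swaps a range for new residues. As a minimal sketch (the function and variable names here are illustrative and not part of the entry), the length of an isoform can be derived from the canonical length of 34350 residues. Only isoforms whose edits are given in full in this section are included.

```python
from typing import Dict, List, Tuple

CANONICAL_LENGTH = 34350  # length of isoform 1 as stated in this entry

# One edit is (start, end, replacement); "Missing" is an empty replacement.
Edit = Tuple[int, int, str]

# Edits copied from the isoform descriptions above (hypothetical container name).
ISOFORM_EDITS: Dict[str, List[Edit]] = {
    "Q8WZ42-2": [(555, 646, "")],
    "Q8WZ42-3": [(556, 601, ""), (4474, 11851, "")],
    "Q8WZ42-4": [(3454, 4380, ""), (11507, 11507, "EVFEEPEESPSAPPKKPEVPPVR")],
    "Q8WZ42-5": [
        (10382, 10645, ""),
        (10742, 10931, ""),
        (11015, 11163, ""),
        (11223, 11852, ""),
        (11985, 12201, ""),
    ],
}


def isoform_length(canonical_length: int, edits: List[Edit]) -> int:
    """Return the isoform length after applying non-overlapping edits."""
    length = canonical_length
    for start, end, replacement in edits:
        removed = end - start + 1  # positions are 1-based and inclusive
        length += len(replacement) - removed
    return length


if __name__ == "__main__":
    for identifier, edits in ISOFORM_EDITS.items():
        print(identifier, isoform_length(CANONICAL_LENGTH, edits))
```

Under these assumptions, isoform 2 comes out at 34258 residues and isoform 3 at 26926 residues.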

Sequence annotation (Features)

Feature key    Position(s)    Length    Description    Feature identifier

Molecule processing

Chain    1 – 34350    34350    Titin    PRO_0000239311

Regions

Domain    6 – 96    91    Ig-like 1
Domain    104 – 192    89    Ig-like 2
Repeat    417 – 462    46    Z-repeat 1
Repeat    466 – 511    46    Z-repeat 2
Repeat    512 – 554    43    Z-repeat 3
Repeat    555 – 600    46    Z-repeat 4
Repeat    601 – 646    46    Z-repeat 5
Repeat    647 – 691    45    Z-repeat 6
Repeat    692 – 740    49    Z-repeat 7
Domain    943 – 1031    89    Ig-like 3
Domain    1082 – 1172    91    Ig-like 4
Domain    1291 – 1382    92    Ig-like 5
Domain    1457 – 1546    90    Ig-like 6
Domain    1556 – 1646    91    Ig-like 7
Domain    1703 – 1793    91    Ig-like 8
Domain    1841 – 1928    88    Ig-like 9
Domain    2078 – 2167    90    Ig-like 10
Repeat    2089 – 2122    34    TPR 1
Domain    2171 – 2262    92    Ig-like 11
Domain    2264 – 2354    91    Ig-like 12
Domain    2353 – 2443    91    Ig-like 13
Domain    2430 – 2529    100    Ig-like 14
Domain    2620 – 2703    84    Ig-like 15
Repeat    2804 – 2838    35    TPR 2
Domain    2880 – 2965    86    Ig-like 16
Domain    2968 – 3050    83    Ig-like 17
Repeat    3022 – 3062    41    WD 1
Domain    3058 – 3141    84    Ig-like 18
Domain    3239 – 3327    89    Ig-like 19
Domain    3344 – 3432    89    Ig-like 20
Domain    3503 – 3586    84    Ig-like 21
Domain    3621 – 3712    92    Ig-like 22
Repeat    4168 – 4203    36    TPR 3
Domain    4289 – 4376    88    Ig-like 23
Domain    4383 – 4471    89    Ig-like 24
Domain    4478 – 4566    89    Ig-like 25
Domain    4571 – 4659    89    Ig-like 26
Domain    4664 – 4753    90    Ig-like 27
Domain    4758 – 4846    89    Ig-like 28
Domain    4851 – 4936    86    Ig-like 29
Repeat    4860 – 4904    45    Kelch 1
Domain    4943 – 5032    90    Ig-like 30
Domain    5040 – 5128    89    Ig-like 31
Domain    5133 – 5221    89    Ig-like 32
Repeat    5170 – 5203    34    TPR 4
Domain    5225 – 5314    90    Ig-like 33
Domain    5320 – 5408    89    Ig-like 34
Domain    5413 – 5501    89    Ig-like 35
Domain    5505 – 5594    90    Ig-like 36
Domain    5602 – 5690    89    Ig-like 37
Domain    5695 – 5783    89    Ig-like 38
Domain    5788 – 5877    90    Ig-like 39
Domain    5882 – 5970    89    Ig-like 40
Domain    5975 – 6063    89    Ig-like 41
Domain    6067 – 6156    90    Ig-like 42
Domain    6164 – 6252    89    Ig-like 43
Domain    6257 – 6347    91    Ig-like 44
Domain    6350 – 6440    91    Ig-like 45
Domain    6444 – 6534    91    Ig-like 46
Repeat    6474 – 6507    34    TPR 5
Domain    6537 – 6626    90    Ig-like 47
Domain    6630 – 6721    92    Ig-like 48
Repeat    6654 – 6692    39    WD 2
Domain    6727 – 6815    89    Ig-like 49
Domain    6820 – 6908    89    Ig-like 50
Domain    6912 – 7001    90    Ig-like 51
Domain    7005 – 7093    89    Ig-like 52
Domain    7102 – 7190    89    Ig-like 53
Domain    7198 – 7286    89    Ig-like 54
Domain    7291 – 7380    90    Ig-like 55
Domain    7385 – 7473    89    Ig-like 56
Repeat    7415 – 7448    34    TPR 6
Domain    7478 – 7567    90    Ig-like 57
Domain    7571 – 7662    92    Ig-like 58
Domain    7668 – 7756    89    Ig-like 59
Domain    7761 – 7849    89    Ig-like 60
Domain    7853 – 7942    90    Ig-like 61
Domain    7946 – 8035    90    Ig-like 62
Domain    8042 – 8133    92    Ig-like 63
Domain    8138 – 8229    92    Ig-like 64
Domain    8232 – 8321    90    Ig-like 65
Domain    8326 – 8414    89    Ig-like 66
Domain    8419 – 8508    90    Ig-like 67
Domain    8512 – 8603    92    Ig-like 68
Domain    8609 – 8697    89    Ig-like 69
Domain    8702 – 8790    89    Ig-like 70
Domain    8794 – 8883    90    Ig-like 71
Domain    8888 – 8976    89    Ig-like 72
Domain    8984 – 9074    91    Ig-like 73
Domain    9079 – 9168    90    Ig-like 74
Domain    9176 – 9265    90    Ig-like 75
Repeat    9184 – 9221    38    TPR 7
Domain    9272 – 9361    90    Ig-like 76
Domain    9366 – 9470    105    Ig-like 77
Domain    9660 – 9755    96    Ig-like 78
Repeat    9701 – 9734    34    TPR 8
Domain    9760 – 9851    92    Ig-like 79
Repeat    10031 – 10064    34    TPR 9
Repeat    10041 – 10087    47    Kelch 2
Repeat    10216 – 10242    27    PEVK 1
Repeat    10244 – 10270    27    PEVK 2
Repeat    10272 – 10298    27    PEVK 3
Repeat    10300 – 10326    27    PEVK 4
Repeat    10327 – 10353    27    PEVK 5
Repeat    10355 – 10381    27    PEVK 6
Repeat    10508 – 10534    27    PEVK 7
Repeat    10536 – 10562    27    PEVK 8
Repeat    10592 – 10618    27    PEVK 9
Repeat    10878 – 10904    27    PEVK 10
Repeat    10906 – 10930    25    PEVK 11
Repeat    10932 – 10958    27    PEVK 12
Repeat    10960 – 10986    27    PEVK 13
Repeat    10987 – 11014    28    PEVK 14
Repeat    11363 – 11396    34    PEVK 15
Repeat    11397 – 11421    25    PEVK 16
Repeat    11453 – 11479    27    PEVK 17
Repeat    11481 – 11507    27    PEVK 18
Repeat    11509 – 11535    27    PEVK 19
Repeat    11537 – 11563    27    PEVK 20
Repeat    11565 – 11591    27    PEVK 21
Repeat    11657 – 11683    27    PEVK 22
Repeat    11703 – 11729    27    PEVK 23
Repeat    11745 – 11771    27    PEVK 24
Repeat    11775 – 11801    27    PEVK 25
Repeat    11836 – 11862    27    PEVK 26
Repeat    11864 – 11890    27    PEVK 27
Repeat    11893 – 11919    27    PEVK 28
Repeat    11929 – 11955    27    PEVK 29
Repeat    11966 – 11992    27    PEVK 30
Repeat    11996 – 12022    27    PEVK 31
Domain    12041 – 12133    93    Ig-like 80
Domain    12138 – 12222    85    Ig-like 81
Domain    12233 – 12318    86    Ig-like 82
Domain    12499 – 12584    86    Ig-like 83
Domain    12590 – 12672    83    Ig-like 84
Domain    12766 – 12850    85    Ig-like 85
Domain    12945 – 13032    88    Ig-like 86
Repeat    12955 – 12988    34    TPR 10
Domain    13120 – 13206    87    Ig-like 87
Domain    13210 – 13295    86    Ig-like 88
Domain    13299 – 13384    86    Ig-like 89
Domain    13388 – 13478    91    Ig-like 90
Repeat    13391 – 13432    42    WD 3
Repeat    13443 – 13485    43    WD 4
Domain    13479 – 13562    84    Ig-like 91
Domain    13565 – 13655    91    Ig-like 92
Domain    13659 – 13748    90    Ig-like 93
Repeat    13714 – 13753    40    WD 5
Domain    13749 – 13833    85    Ig-like 94
Domain    13927 – 14012    86    Ig-like 95
Domain    14017 – 14109    93    Fibronectin type-III 1
Repeat    14084 – 14136    53    RCC1 1
Domain    14118 – 14210    93    Fibronectin type-III 2
Repeat    14185 – 14238    54    RCC1 2
Domain    14219 – 14311    93    Fibronectin type-III 3
Domain    14415 – 14506    92    Fibronectin type-III 4
Domain    14515 – 14606    92    Fibronectin type-III 5
Domain    14615 – 14708    94    Ig-like 96
Domain    14711 – 14802    92    Fibronectin type-III 6
Domain    14809 – 14902    94    Fibronectin type-III 7
Repeat    14828 – 14876    49    Kelch 3
Domain    14910 – 15002    93    Fibronectin type-III 8
Repeat    14986 – 15036    51    Kelch 4
Domain    15010 – 15104    95    Fibronectin type-III 9
Repeat    15077 – 15130    54    RCC1 3
Domain    15111 – 15203    93    Fibronectin type-III 10
Domain    15211 – 15305    95    Fibronectin type-III 11
Domain    15314 – 15402    89    Ig-like 97
Domain    15407 – 15500    94    Fibronectin type-III 12
Domain    15506 – 15599    94    Fibronectin type-III 13
Repeat    15574 – 15630    57    RCC1 4
Domain    15608 – 15724    117    Ig-like 98
Domain    15729 – 15822    94    Fibronectin type-III 14
Domain    15829 – 15923    95    Fibronectin type-III 15
Domain    15929 – 16021    93    Fibronectin type-III 16
Domain    16029 – 16119    91    Ig-like 99
Domain    16124 – 16214    91    Fibronectin type-III 17
Repeat    16134 – 16180    47    WD 6
Domain    16221 – 16314    94    Fibronectin type-III 18
Domain    16322 – 16420    99    Ig-like 100
Domain    16425 – 16518    94    Fibronectin type-III 19
Domain    16526 – 16623    98    Fibronectin type-III 20
Domain    16631 – 16726    96    Fibronectin type-III 21
Domain    16727 – 16834    108    Ig-like 101
Domain    16839 – 16930    92    Fibronectin type-III 22
Domain    16938 – 17036    99    Fibronectin type-III 23
Domain    17044 – 17139    96    Ig-like 102
Domain    17144 – 17236    93    Fibronectin type-III 24
Domain    17244 – 17343    100    Fibronectin type-III 25
Domain    17348 – 17440    93    Fibronectin type-III 26
Domain    17449 – 17536    88    Ig-like 103
Domain    17543 – 17637    95    Fibronectin type-III 27
Domain    17644 – 17736    93    Fibronectin type-III 28
Repeat    17711 – 17769    59    RCC1 5
Domain    17745 – 17834    90    Ig-like 104
Domain    17839 – 17931    93    Fibronectin type-III 29
Repeat    17930 – 17969    40    WD 7
Domain    17939 – 18031    93    Fibronectin type-III 30
Repeat    18006 – 18055    50    RCC1 6
Domain    18040 – 18134    95    Fibronectin type-III 31
Domain    18143 – 18228    86    Ig-like 105
Domain    18237 – 18330    94    Fibronectin type-III 32
Repeat    18258 – 18303    46    Kelch 5
Repeat    18303 – 18358    56    RCC1 7
Domain    18336 – 18427    92    Fibronectin type-III 33
Domain    18435 – 18526    92    Ig-like 106
Domain    18531 – 18623    93    Fibronectin type-III 34
Repeat    18553 – 18598    46    Kelch 6
Domain    18630 – 18722    93    Fibronectin type-III 35
Domain    18730 – 18824    95    Fibronectin type-III 36
Domain    18833 – 18924    92    Ig-like 107
Domain    18929 – 19019    91    Fibronectin type-III 37
Domain    19028 – 19121    94    Fibronectin type-III 38
Domain    19128 – 19219    92    Ig-like 108
Domain    19224 – 19315    92    Fibronectin type-III 39
Repeat    19290 – 19346    57    RCC1 8
Domain    19323 – 19415    93    Fibronectin type-III 40
Repeat    19389 – 19452    64    RCC1 9
Domain    19423 – 19522    100    Fibronectin type-III 41
Domain    19531 – 19617    87    Ig-like 109
Domain    19626 – 19717    92    Fibronectin type-III 42
Repeat    19647 – 19692    46    Kelch 7
Domain    19726 – 19818    93    Fibronectin type-III 43
Domain    19826 – 19914    89    Ig-like 110
Domain    19919 – 20011    93    Fibronectin type-III 44
Domain    20019 – 20111    93    Fibronectin type-III 45
Domain    20116 – 20211    96    Fibronectin type-III 46
Domain    20220 – 20311    92    Ig-like 111
Domain    20316 – 20407    92    Fibronectin type-III 47
Domain    20414 – 20507    94    Fibronectin type-III 48
Domain    20515 – 20609    95    Fibronectin type-III 49
Domain    20714 – 20805    92    Fibronectin type-III 50
Domain    20811 – 20900    90    Fibronectin type-III 51
Repeat    20833 – 20876    44    Kelch 8
Domain    20893 – 20996    104    Ig-like 112
Domain    21003 – 21097    95    Fibronectin type-III 52
Repeat    21069 – 21125    57    RCC1 10
Domain    21103 – 21195    93    Fibronectin type-III 53
Domain    21200 – 21294    95    Fibronectin type-III 54
Repeat    21222 – 21267    46    Kelch 9
Domain    21303 – 21395    93    Ig-like 113
Domain    21400 – 21491    92    Fibronectin type-III 55
Domain    21498 – 21591    94    Fibronectin type-III 56
Repeat    21565 – 21620    56    RCC1 11
Domain    21599 – 21693    95    Fibronectin type-III 57
Domain    21701 – 21793    93    Ig-like 114
Domain    21795 – 21886    92    Fibronectin type-III 58
Repeat    21860 – 21910    51    RCC1 12
Domain    21892 – 21982    91    Fibronectin type-III 59
Domain    21990 – 22083    94    Ig-like 115
Domain    22086 – 22178    93    Fibronectin type-III 60
Domain    22186 – 22278    93    Fibronectin type-III 61
Domain    22283 – 22377    95    Fibronectin type-III 62
Repeat    22306 – 22350    45    Kelch 10
Domain    22386 – 22477    92    Ig-like 116
Domain    22482 – 22574    93    Fibronectin type-III 63
Domain    22581 – 22674    94    Fibronectin type-III 64
Domain    22683 – 22776    94    Fibronectin type-III 65
Domain    22785 – 22874    90    Ig-like 117
Domain    22879 – 22970    92    Fibronectin type-III 66
Domain    22976 – 23066    91    Fibronectin type-III 67
Repeat    23041 – 23091    51    RCC1 13
Domain    23075 – 23163    89    Ig-like 118
Domain    23168 – 23262    95    Fibronectin type-III 68
Domain    23268 – 23360    93    Fibronectin type-III 69
Domain    23365 – 23459    95    Fibronectin type-III 70
Domain    23468 – 23555    88    Ig-like 119
Domain    23564 – 23656    93    Fibronectin type-III 71
Repeat    23651 – 23694    44    WD 8
Domain    23664 – 23756    93    Fibronectin type-III 72
Domain    23765 – 23858    94    Fibronectin type-III 73
Domain    23867 – 23954    88    Ig-like 120
Domain    23961 – 24052    92    Fibronectin type-III 74
Repeat    24027 – 24076    50    RCC1 14
Domain    24058 – 24148    91    Fibronectin type-III 75
Repeat    24079 – 24124    46    Kelch 11
Domain    24157 – 24241    85    Ig-like 121
Domain    24250 – 24344    95    Fibronectin type-III 76
Repeat    24261 – 24307    47    WD 9
Domain    24350 – 24442    93    Fibronectin type-III 77
Domain    24447 – 24541    95    Fibronectin type-III 78
Domain    24550 – 24641    92    Ig-like 122
Domain    24646 – 24738    93    Fibronectin type-III 79
Domain    24746 – 24838    93    Fibronectin type-III 80
Domain    24847 – 24940    94    Fibronectin type-III 81
Repeat    24868 – 24916    49    Kelch 12
Domain    24949 – 25038    90    Ig-like 123
Domain    25043 – 25134    92    Fibronectin type-III 82
Domain    25139 – 25230    92    Fibronectin type-III 83
Domain    25239 – 25325    87    Ig-like 124
Domain    25332 – 25424    93    Fibronectin type-III 84
Repeat    25343 – 25389    47    WD 10
Repeat    25419 – 25462    44    WD 11
Domain    25432 – 25524    93    Fibronectin type-III 85
Domain    25529 – 25623    95    Fibronectin type-III 86
Domain    25632 – 25722    91    Ig-like 125
Domain    25729 – 25821    93    Fibronectin type-III 87
Domain    25828 – 25921    94    Fibronectin type-III 88
Domain    25929 – 26023    95    Fibronectin type-III 89
Repeat    25951 – 25997    47    Kelch 13
Domain    26032 – 26121    90    Ig-like 126
Domain    26126 – 26217    92    Fibronectin type-III 90
Domain    26222 – 26313    92    Fibronectin type-III 91
Repeat    26244 – 26289    46    Kelch 14
Domain    26322 – 26410    89    Ig-like 127
Domain    26415 – 26506    92    Fibronectin type-III 92
Repeat    26501 – 26544    44    WD 12
Domain    26513 – 26607    95    Fibronectin type-III 93
Domain    26611 – 26705    95    Fibronectin type-III 94
Domain    26714 – 26801    88    Ig-like 128
Domain    26810 – 26902    93    Fibronectin type-III 95
Domain    26910 – 27002    93    Fibronectin type-III 96
Domain    27011 – 27102    92    Fibronectin type-III 97
Repeat    27077 – 27127    51    RCC1 15
Domain    27101 – 27196    96    Ig-like 129
Domain    27205 – 27296    92    Fibronectin type-III 98
Repeat    27271 – 27320    50    RCC1 16
Domain    27302 – 27392    91    Fibronectin type-III 99
Repeat    27323 – 27368    46    Kelch 15
Domain    27497 – 27589    93    Fibronectin type-III 100
Domain    27597 – 27689    93    Fibronectin type-III 101
Domain    27694 – 27788    95    Fibronectin type-III 102
Domain    27797 – 27888    92    Ig-like 130
Domain    27893 – 27985    93    Fibronectin type-III 103
Domain    27993 – 28088    96    Fibronectin type-III 104
Repeat    28062 – 28095    34    TPR 11
Domain    28093 – 28187    95    Fibronectin type-III 105
Domain    28196 – 28286    91    Ig-like 131
Domain    28293 – 28384    92    Fibronectin type-III 106
Domain    28390 – 28480    91    Fibronectin type-III 107
Domain    28488 – 28577    90    Ig-like 132
Domain    28584 – 28676    93    Fibronectin type-III 108
Repeat    28606 – 28651    46    Kelch 16
Repeat    28671 – 28714    44    WD 13
Domain    28684 – 28776    93    Fibronectin type-III 109
Domain    28781 – 28874    94    Fibronectin type-III 110
Domain    28882 – 28974    93    Ig-like 133
Domain    28979 – 29071    93    Fibronectin type-III 111
Repeat    29046 – 29101    56    RCC1 17
Domain    29079 – 29171    93    Fibronectin type-III 112
Domain    29180 – 29273    94    Fibronectin type-III 113
Domain    29282 – 29367    86    Ig-like 134
Domain    29376 – 29467    92    Fibronectin type-III 114
Domain    29473 – 29563    91    Fibronectin type-III 115
Domain    29568 – 29663    96    Ig-like 135
Domain    29668 – 29760    93    Fibronectin type-III 116
Domain    29767 – 29860    94    Fibronectin type-III 117
Domain    29865 – 29962    98    Fibronectin type-III 118
Domain    29971 – 30059    89    Ig-like 136
Domain    30068 – 30159    92    Fibronectin type-III 119
Domain    30167 – 30260    94    Fibronectin type-III 120
Domain    30269 – 30365    97    Fibronectin type-III 121
Domain    30371 – 30460    90    Ig-like 137
Domain    30465 – 30556    92    Fibronectin type-III 122
Domain    30562 – 30654    93    Fibronectin type-III 123
Domain    30663 – 30754    92    Ig-like 138
Domain    30759 – 30853    95    Fibronectin type-III 124
Domain    30859 – 30951    93    Fibronectin type-III 125
Repeat    30880 – 30925    46    Kelch 17
Domain    30959 – 31052    94    Fibronectin type-III 126
Domain    31061 – 31150    90    Ig-like 139
Domain    31155 – 31247    93    Fibronectin type-III 127
Domain    31256 – 31351    96    Fibronectin type-III 128
Domain    31358 – 31450    93    Fibronectin type-III 129
Domain    31460 – 31548    89    Ig-like 140
Domain    31650 – 31743    94    Fibronectin type-III 130
Repeat    31739 – 31782    44    WD 14
Domain    31752 – 31846    95    Fibronectin type-III 131
Domain    31855 – 31945    91    Ig-like 141
Repeat    31892 – 31937    46    WD 15
Domain    31955 – 32046    92    Ig-like 142
Domain    32049 – 32139    91    Fibronectin type-III 132
Repeat    32070 – 32115    46    Kelch 18
Domain    32178 – 32432    255    Protein kinase
Domain    32496 – 32584    89    Ig-like 143
Repeat    32503 – 32549    47    Kelch 19
Domain    32617 – 32710    94    Ig-like 144
Domain    32722 – 32811    90    Ig-like 145
Repeat    32927 – 32960    34    TPR 12
Repeat    33235 – 33268    34    TPR 13
Domain    33301 – 33391    91    Ig-like 146
Domain    33488 – 33576    89    Ig-like 147
Repeat    33518 – 33551    34    TPR 14
Domain    33645 – 33732    88    Ig-like 148
Domain    33779 – 33867    89    Ig-like 149
Domain    33963 – 34052    90    Ig-like 150
Domain    34061 – 34149    89    Ig-like 151
Domain    34256 – 34344    89    Ig-like 152
Nucleotide binding    32184 – 32192    9    ATP By similarity
Region    253 – 341    89    ZIS1
Region    1410 – 1440    31    ZIS5
Coiled coil    529 – 561    33    Potential
Coiled coil    2025 – 2052    28    Potential
Coiled coil    3462 – 3487    26    Potential
Coiled coil    9534 – 9577    44    Potential
Compositional bias    391 – 436    46    Ala-rich
Compositional bias    453 – 456    4    Poly-Thr
Compositional bias    9500 – 9503    4    Poly-Glu
Compositional bias    9861 – 9952    92    Pro-rich
Compositional bias    9974 – 11917    1944    Glu-rich
Compositional bias    9974 – 10089    116    Glu-rich
Compositional bias    10102 – 10105    4    Poly-Pro
Compositional bias    10211 – 12032    1822    Pro-rich
Compositional bias    33188 – 33193    6    Poly-Ser
Compositional bias    33197 – 33200    4    Poly-Arg
Compositional bias    34102 – 34244    143    Ser-rich
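
In the Regions table, the Length value of each row is consistent with its inclusive 1-based positions, that is, length = end - start + 1. The following is a minimal sketch of that check on a handful of rows copied from the table; the helper names are illustrative and not part of the entry.

```python
from typing import List, Tuple

# (description, start, end, stated length), copied from the Regions table.
FeatureRow = Tuple[str, int, int, int]

SAMPLE_ROWS: List[FeatureRow] = [
    ("Ig-like 1", 6, 96, 91),
    ("Z-repeat 3", 512, 554, 43),
    ("Ig-like 3", 943, 1031, 89),
    ("PEVK 1", 10216, 10242, 27),
    ("Protein kinase", 32178, 32432, 255),
    ("Ser-rich", 34102, 34244, 143),
]


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based residue range."""
    return end - start + 1


def inconsistent_rows(rows: List[FeatureRow]) -> List[str]:
    """Return descriptions of rows whose stated length disagrees with the span."""
    return [
        description
        for description, start, end, stated in rows
        if span_length(start, end) != stated
    ]


if __name__ == "__main__":
    # An empty list means every sampled row is self-consistent.
    print(inconsistent_rows(SAMPLE_ROWS))
```

The same relation is what lets a fused row be split unambiguously into end position and length when the column boundary is unclear.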

Sites

Active site    32298    1    Proton acceptor By similarity
Binding site    32207    1    ATP By similarity

Amino acid modifications

Modified residue    263    1    Phosphoserine By similarity
Modified residue    265    1    Phosphoserine By similarity
Modified residue    267    1    Phosphothreonine By similarity
Modified residue    4065    1    Phosphoserine Ref.29
Modified residue    4068    1    Phosphoserine Ref.29
Modified residue    6920    1    Phosphoserine By similarity
Modified residue    8490    1    Phosphotyrosine Ref.23
Modified residue    9122    1    Phosphoserine By similarity
Modified residue    9203    1    Phosphoserine Ref.28
Modified residue    9207    1    Phosphothreonine Ref.28
Modified residue    11503    1    Phosphoserine By similarity
Modified residue    12007    1    Phosphothreonine By similarity
Modified residue    12009    1    Phosphoserine By similarity
Modified residue    12022    1    Phosphoserine By similarity
Modified residue    22525    1    Phosphoserine Ref.27
Modified residue    22534    1    Phosphoserine Ref.27
Modified residue    30443    1    Phosphothreonine By similarity
Modified residue    32341    1    Phosphotyrosine Ref.31
Modified residue    33245    1    Phosphoserine By similarity
Modified residue    33247    1    Phosphoserine By similarity
Modified residue    33602    1    Phosphoserine By similarity
Modified residue    33614    1    Phosphoserine By similarity
Modified residue    33938    1    Phosphoserine Ref.26
Modified residue    33942    1    Phosphoserine Ref.26
Disulfide bond    964 ↔ 1015    By similarity
Disulfide bond    1724 ↔ 1777    By similarity
Disulfide bond    2109 ↔ 2134    Ref.33
Disulfide bond    2196 ↔ 2246    By similarity
Disulfide bond    3259 ↔ 3311    By similarity
Disulfide bond    4404 ↔ 4455    By similarity
Disulfide bond    4499 ↔ 4550    By similarity
Disulfide bond    4592 ↔ 4643    By similarity
Disulfide bond    4686 ↔ 4737    By similarity
Disulfide bond    4779 ↔ 4830    By similarity
Disulfide bond    5061 ↔ 5112    By similarity
Disulfide bond    5248 ↔ 5299    By similarity
Disulfide bond    5623 ↔ 5674    By similarity
Disulfide bond    5810 ↔ 5861    By similarity
Disulfide bond    5903 ↔ 5954    By similarity
Disulfide bond    6185 ↔ 6236    By similarity
Disulfide bond    6372 ↔ 6423    By similarity
Disulfide bond    6465 ↔ 6516    By similarity
Disulfide bond    6748 ↔ 6799    By similarity
Disulfide bond    7027 ↔ 7078    By similarity
Disulfide bond    7123 ↔ 7174    By similarity
Disulfide bond    7219 ↔ 7270    By similarity
Disulfide bond    7313 ↔ 7364    By similarity
Disulfide bond    7406 ↔ 7457    By similarity
Disulfide bond    7689 ↔ 7740    By similarity
Disulfide bond    7968 ↔ 8019    By similarity
Disulfide bond    8064 ↔ 8115    By similarity
Disulfide bond    8160 ↔ 8211    By similarity
Disulfide bond    8254 ↔ 8305    By similarity
Disulfide bond    8347 ↔ 8398    By similarity
Disulfide bond    8630 ↔ 8681    By similarity
Disulfide bond    8909 ↔ 8960    By similarity
Disulfide bond    9005 ↔ 9056    By similarity
Disulfide bond    9101 ↔ 9152    By similarity
Disulfide bond    9294 ↔ 9345    By similarity
Disulfide bond    9693 ↔ 9743    By similarity
Disulfide bond    12067 ↔ 12117    By similarity
Disulfide bond    12611 ↔ 12660    By similarity
Disulfide bond    12966 ↔ 13016    By similarity
Disulfide bond    13233 ↔ 13283    By similarity
Disulfide bond    13322 ↔ 13372    By similarity
Disulfide bond    13411 ↔ 13461    By similarity
Disulfide bond    13771 ↔ 13821    By similarity
Disulfide bond    31481 ↔ 31532    By similarity
Disulfide bond    32516 ↔ 32568    By similarity
Disulfide bond    33664 ↔ 33718    By similarity
Cross-link    10718    Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity
Cross-link    10733    Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity
Cross-link    10740    Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity
Cross-link    29566    Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity
Cross-link    30146    Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) By similarity
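
The first few disulfide bonds listed above each appear to join two cysteines that lie within a single Ig-like domain of the Regions table. The following is a minimal sketch of how that containment can be checked for a few rows; the data is copied from this entry, while the function and variable names are hypothetical.

```python
from typing import List, Optional, Tuple

# (name, start, end) for a few Ig-like domains from the Regions table.
Domain = Tuple[str, int, int]

DOMAINS: List[Domain] = [
    ("Ig-like 3", 943, 1031),
    ("Ig-like 8", 1703, 1793),
    ("Ig-like 10", 2078, 2167),
    ("Ig-like 11", 2171, 2262),
]

# The first four disulfide bonds from the table, as (first, second) positions.
BONDS: List[Tuple[int, int]] = [
    (964, 1015),
    (1724, 1777),
    (2109, 2134),
    (2196, 2246),
]


def enclosing_domain(bond: Tuple[int, int], domains: List[Domain]) -> Optional[str]:
    """Return the first domain that contains both cysteines, or None."""
    first, second = sorted(bond)
    for name, start, end in domains:
        if start <= first and second <= end:
            return name
    return None


if __name__ == "__main__":
    for bond in BONDS:
        print(bond, enclosing_domain(bond, DOMAINS))
```

A bond that spans two domains, or falls outside every listed domain, yields None under this sketch.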

Natural variations

Alternative sequence    555 – 646    92    Missing in isoform 2.    VSP_019138
Alternative sequence    556 – 601    46    Missing in isoform 3.    VSP_019139
Alternative sequence    3434    1    E → GFSKFEENTSNSQWHVSLSV SFKKEPLGQKPSFIQPLSSL RVHNGETVRFHARVSGIPKP EIQWFHNQQLILPTKDVVFH FEESTGMALMLIVDAYSEHA GQYSCKAANSAGEATCAATL TVTPK in isoform 8.    VSP_019140
Alternative sequence    3435 – 3645    211    APESI…EYTVV → VQALDRQSSGKDVRESTKSQ AVADSSFTKEESKISQKEIK SFQGSSYEYEVQVFESVSQS SIHTAASVQDTQLCHTASLS QIAESTELSKECAKESTGEA PKIFLHLQDVTVKCGDTAQF LCVLKDDSFIDVTWTHEGAK IEESERLKQSQNGNIQFLTI CNVQLVDQGLYSCIVHNDCG ERTTSAVLSVEGAPESILHE RIEQEIEMEMK in isoform 7.    VSP_019141
Alternative sequence    3454 – 4380    927    Missing in isoform 4.    VSP_019142
Alternative sequence3455 – 56042150FSSSF…IPPSF → LFSEGESEHSERDTRDAFSD SEDIDHKSMAAKRYASRISS TSSWPEYFKPSFTQKLTFKY VLEGEPVVFTCRLIACPTPE MTWFHNNRPIPTGLRRIIKA ESDLHHHSSSLEIKRVQDRD SGSYRLLAINSEGSAESTAS LLVIQKGQDEKYLEFLKRAE RTHENVEALVERGEDRIKVD LRFTGSPFNKKQDVEQKGMM RTIHFKTMSSAKKTDYMYDE EYLESKSDIRGWLNVGESFL DKETKVKLQRLREARKTLME KKKLSLLDTSSEISSRTLRS EASDKDILFSREDMKIRSMS DLAESYKVDHSAESIVQNPH ALSNQMDQNIESEELPTSFQ TIVDEEIFQTEIRMSQEALV KESLPKDHLYGEILVNENTQ ARGQLEEIMANTTIGESSTY ITNVCEKEEVYETPENVSQA ITPHASESFGTLVNVEESEE IASERIKKDDLRELQLSAST RIDEFKTEQKEENMRFFENS FRKRPQRCPPSFLQEIESQE VYEGDSCNFVCHFQGYPQPI VTWYNNDMPIPRNQNFIIHS LENYSILTLSSVHHQNEGSI TCVLFNQYGTVKTTSMLKVK AKQKHDVKAHKVPVFHDYLD EEEELALVFDQAKGAHPSMS QEGQTNLHLLKTNPPVPPSG DTELLSFPVEIQVTAATPIP EQDKESKEVFQTEELEPKAM PQDQVTQSPKHRFVFLSDIT NEPPKMLQEMPKHARCREGD SIILECLISGEPQPVVTWFQ NGVLLKQNQKFQFEEVNCSH QLYIKDVNSQDSGKYKCVAE NNSGAVESVSDLTVEPVTYR ENSQFENIGEIYGKYSRDQQ LQDQGESVRAHFYDYPAGPF TPWTNVKEYSVRDYFQSLET IEQIDQKEQVRCIPSREKIP RFVHGASRTIKISKPIRAEF IQCQAEGKERHVSEKSKLHQ AEGTVYPFVDDFSDVTIKKE IRNNFGKLGRSEKENVQECA QSDYLPNIHSERISDSYNTK DSSAIVYEESLGEEIHYPGK KVKHRIIEFEKLHVEKGVLE KRPTRTSIVNPPQKKIDDKA FSLKQRESRSSNLNANMYQA EKMSPNTESDSSNIAINLKL LSSQTHKEFDAQEREQQEKI SLIDKPAISKRAEHESPITF DLKQFHTQIKHTDVKFQELD SGQPEEAYFKIQHPADTENI VFDLKQMYSHIGDPALEFQG QETREQQEIHYKEKIPSPET LQPDTHNISKSVQNNVFASQ EISSSQELSNRTMVEKSSID ENSISLEKEVRHVQEQNLDI LKTDLSLKSFSEEIYSESCA LLPTSSADIEETDLSEKSCP LENGGRSSISHLKKAASEEK PLGVGEMEEECTLEPELAAF PKQDGGTQEYTDATLEDHRG DVQEADTLHRQLSLSQCFPL LMTEEQQNPGEQISTNIHAS GEEKCYEEVQVQNEASFSTL EGEMIETSFSQNIPKLDEAH TTEAAESETSLTQYLLAAGK REVPETKDTRDQAKLVQSES ITSMEVEEVTFNTVYEYYNQ KQESLGRPLSPESDISIGVG STTSEEISELDQFYTPPSSV EYFESPKSPDLYFNPSDITK QSSIHSGGETVERYSTPLGE VAERYSTPSEGEVGERYSTP PGETLERYSTPPGETLERYS TPPGETLERYSTPPGETLER YSTPPGETLERYSTPPGEAL ERYSIPTGGPNPTGTFKTYP SKIEREDGTPNEHFYTPTEE RGSAYEIWRSDSFGTPNEAI EPKDNEMPPSFIEPLTKRKV YENTTLGFIVEVEGLPVPGV KWYRNKSLLEPDERIKMERV GNVCSLEISNIQKGEGGEYM CHAVNIIGEAKSFANVDIMP QEERVVALPPPVTHQHVMEF 
DLEHTTSSRTPSPQEIVLEV ELSEKDVKEFEKQVKIVTVP EFTPDHKSMIVSLDVLPFNF VDPNMDSREGEDKELKIDLE VFEMPPRFIMPICDFKIPEN SDAVFKCSVIGIPTPEVKWY KEYMCIEPDNIKYVISEEKG SHTLKIRNVCLSDSATYRCR AVNCVGEAICRGFLTMGDSE IFAVIAKKSKVTLSSLMEEL VLKSNYTDSFFEFQVVEGPP RFIKGISDCYAPIGTAAYFQ CLVRGSPRPTVYWYKDGKLV QGRRFTVEESGTGFHNLFIT SLVKSDEGEYRCVATNKSGM AESFAALTLT in isoform 6.
VSP_019143
Alternative sequence 3646 – 4380 (length 735): Missing in isoform 7.
VSP_019144
Alternative sequence 4474 – 11851 (length 7378): Missing in isoform 3.
VSP_019145
Alternative sequence 5605 – 34350 (length 28746): Missing in isoform 6.
VSP_019146
Alternative sequence 10382 – 10645 (length 264): Missing in isoform 5.
VSP_019147
Alternative sequence 10742 – 10931 (length 190): Missing in isoform 5.
VSP_019148
Alternative sequence 11015 – 11163 (length 149): Missing in isoform 5.
VSP_019149
Alternative sequence 11223 – 11852 (length 630): Missing in isoform 5.
VSP_019150
Alternative sequence 11507 (length 1): E → EVFEEPEESPSAPPKKPEVP PVR in isoform 4.
VSP_019151
Alternative sequence 11985 – 12201 (length 217): Missing in isoform 5.
VSP_019152
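The length given for each alternative-sequence row is the inclusive span of its position range, so it can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming positions are 1-based and inclusive; the helper name `span_length` is hypothetical and not part of any UniProt tooling.

```python
def span_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive position range."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1

# (start, end, listed length) copied from a few of the rows above
rows = [
    (3454, 4380, 927),
    (3646, 4380, 735),
    (4474, 11851, 7378),
    (5605, 34350, 28746),
    (10382, 10645, 264),
]

# Collect any row whose listed length disagrees with its range
mismatches = [r for r in rows if span_length(r[0], r[1]) != r[2]]
print(mismatches)
```

An empty list means every listed length agrees with its range.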
Natural variant 54 (length 1): V → M in CMD1G; affects interaction with TCAP/telethonin. Ref.37
VAR_026685
Natural variant 60 (length 1): D → Y. Ref.43
Corresponds to variant rs35683768 [ dbSNP | Ensembl ].
VAR_040078
Natural variant 115 (length 1): V → M in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040079
Natural variant 279 (length 1): R → W in HMERF; disrupts NBR1-binding. Ref.41
VAR_026634
Natural variant 328 (length 1): R → C. Ref.37 Ref.43
Corresponds to variant rs16866538 [ dbSNP | Ensembl ].
VAR_026686
Natural variant 360 (length 1): R → T. Ref.43
Corresponds to variant rs56128843 [ dbSNP | Ensembl ].
VAR_040080
Natural variant 498 (length 1): V → I. Ref.1 Ref.43
VAR_040081
Natural variant 740 (length 1): R → L in CMH9. Ref.35
Corresponds to variant rs28933405 [ dbSNP | Ensembl ].
VAR_026687
Natural variant 743 (length 1): A → V in CMD1G; affects interaction with TCAP/telethonin. Ref.37
VAR_026688
Natural variant 799 (length 1): T → M in a colorectal adenocarcinoma sample; somatic mutation. Ref.43
VAR_040082
Natural variant 811 (length 1): T → I. Ref.43
Corresponds to variant rs35813871 [ dbSNP | Ensembl ].
VAR_040083
Natural variant 922 (length 1): R → H. Ref.43
Corresponds to variant rs56046320 [ dbSNP | Ensembl ].
VAR_040084
Natural variant 937 (length 1): E → D in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040085
Natural variant 976 (length 1): W → R in CMD1G. Ref.38
VAR_026689
Natural variant 1081 (length 1): A → T. Ref.43
Corresponds to variant rs55914517 [ dbSNP | Ensembl ].
VAR_040086
Natural variant 1137 (length 1): G → R. Ref.43
VAR_040087
Natural variant 1201 (length 1): K → E. Ref.1 Ref.43
Corresponds to variant rs10497520 [ dbSNP | Ensembl ].
VAR_040088
Natural variant 1202 (length 1): V → A. Ref.43
VAR_040089
Natural variant 1249 (length 1): S → L.
Corresponds to variant rs1552280 [ dbSNP | Ensembl ].
VAR_056081
Natural variant 1295 (length 1): S → L. Ref.2 Ref.3 Ref.43
Corresponds to variant rs1552280 [ dbSNP | Ensembl ].
VAR_040090
Natural variant 1345 (length 1): G → D. Ref.43
Corresponds to variant rs36021856 [ dbSNP | Ensembl ].
VAR_040091
Natural variant 1347 (length 1): A → T in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040092
Natural variant 1350 (length 1): R → H in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040093
Natural variant 1353 (length 1): V → L. Ref.43
Corresponds to variant rs36062108 [ dbSNP | Ensembl ].
VAR_040094
Natural variant 1393 (length 1): I → V. Ref.43
Corresponds to variant rs16866531 [ dbSNP | Ensembl ].
VAR_040095
Natural variant 1416 (length 1): R → C. Ref.43
VAR_040096
Natural variant 1441 (length 1): R → P. Ref.43
VAR_040097
Natural variant 1544 (length 1): I → V. Ref.43
VAR_040098
Natural variant 1572 (length 1): R → Q. Ref.2 Ref.3 Ref.43
Corresponds to variant rs12476289 [ dbSNP | Ensembl ].
VAR_040099
Natural variant 1658 (length 1): R → G. Ref.43
Corresponds to variant rs56270960 [ dbSNP | Ensembl ].
VAR_040100
Natural variant 1664 (length 1): R → Q in an ovarian mucinous carcinoma sample; somatic mutation. Ref.43
VAR_040101
Natural variant 1692 (length 1): G → D in a lung squamous cell carcinoma sample; somatic mutation. Ref.43
VAR_040102
Natural variant 1744 (length 1): P → L. Ref.43
VAR_040103
Natural variant 1772 (length 1): S → G. Ref.43
VAR_040104
Natural variant 1907 (length 1): T → I in a colorectal adenocarcinoma sample; somatic mutation. Ref.43
VAR_040105
Natural variant 1998 (length 1): R → H. Ref.43
VAR_040106
Natural variant 2107 (length 1): P → L in a colorectal adenocarcinoma sample; somatic mutation. Ref.43
VAR_040107
Natural variant 2118 (length 1): I → T. Ref.43
Corresponds to variant rs56404770 [ dbSNP | Ensembl ].
VAR_040108
Natural variant 2164 (length 1): A → T. Ref.43
Corresponds to variant rs56285559 [ dbSNP | Ensembl ].
VAR_040109
Natural variant 2240 (length 1): D → Y. Ref.43
VAR_040110
Natural variant 2392 (length 1): G → S. Ref.43
Corresponds to variant rs4894048 [ dbSNP | Ensembl ].
VAR_040111
Natural variant 2432 (length 1): L → F in a lung neuroendocrine carcinoma sample; somatic mutation. Ref.43
VAR_040112
Natural variant 2610 (length 1): M → I. Ref.2 Ref.3 Ref.43
Corresponds to variant rs56142888 [ dbSNP | Ensembl ].
VAR_040113
Natural variant 2771 (length 1): I → M in a breast infiltrating ductal carcinoma sample; somatic mutation. Ref.43
VAR_040114
Natural variant 2823 (length 1): V → F. Ref.43
Corresponds to variant rs33917087 [ dbSNP | Ensembl ].
VAR_040115
Natural variant 2831 (length 1): S → N. Ref.2 Ref.3 Ref.43
Corresponds to variant rs2306636 [ dbSNP | Ensembl ].
VAR_040116
Natural variant 2930 (length 1): V → I. Ref.43
Corresponds to variant rs56373393 [ dbSNP | Ensembl ].
VAR_040117
Natural variant 3026 (length 1): N → I.
Corresponds to variant rs11900987 [ dbSNP | Ensembl ].
VAR_056082
Natural variant 3154 (length 1): K → R. Ref.43
VAR_040118
Natural variant 3191 (length 1): Q → E. Ref.43
VAR_040119
Natural variant 3238 (length 1): P → L in a bladder carcinoma sample; somatic mutation. Ref.43
VAR_040120
Natural variant 3250 (length 1): V → G. Ref.43
VAR_040121
Natural variant 3261 (length 1): V → M. Ref.1 Ref.43
VAR_040122
Natural variant 3367 (length 1): R → Q. Ref.43
VAR_040123
Natural variant 3419 (length 1): S → N. Ref.1
Corresponds to variant rs2291310 [ dbSNP | Ensembl ].
VAR_056083
Natural variant 3482 (length 1): E → K in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040124
Natural variant 3491 (length 1): S → P. Ref.8 Ref.43
VAR_040125
Natural variant 3570 (length 1): E → K in a breast pleomorphic lobular carcinoma sample; somatic mutation. Ref.43
VAR_040126
Natural variant 3590 (length 1): L → V. Ref.43
VAR_040127
Natural variant 3637 (length 1): P → S.
Corresponds to variant rs2627037 [ dbSNP | Ensembl ].
VAR_056084
Natural variant 3762 (length 1): I → V. Ref.43
VAR_040128
Natural variant 3799 (length 1): S → Y in CMD1G. Ref.37
VAR_026690
Natural variant 3877 (length 1): I → F. Ref.43
VAR_040129
Natural variant 3965 (length 1): I → L. Ref.43
VAR_040130
Natural variant 4084 (length 1): R → Q. Ref.37
VAR_026691
Natural variant 4215 (length 1): T → P. Ref.2 Ref.3 Ref.8 Ref.37 Ref.43
VAR_026635
Natural variant 4238 (length 1): G → W. Ref.43
VAR_040131
Natural variant 4283 (length 1): L → F. Ref.2 Ref.3 Ref.8 Ref.43
VAR_026636
Natural variant 4291 (length 1): I → T in a colorectal adenocarcinoma sample; somatic mutation. Ref.43
VAR_040132
Natural variant 4303 (length 1): G → D. Ref.43
VAR_040133
Natural variant 4427 (length 1): D → E. Ref.43
VAR_040134
Natural variant 4465 (length 1): S → N in CMD1G. Ref.37
VAR_026692
Natural variant 8288 (length 1): A → V.
Corresponds to variant rs16866412 [ dbSNP | Ensembl ].
VAR_056085
Natural variant 8474 (length 1): I → T.
Corresponds to variant rs4893852 [ dbSNP | Ensembl ].
VAR_056086
Natural variant 12310 (length 1): G → E. Ref.43
VAR_040135
Natural variant 12383 (length 1): R → H. Ref.1 Ref.43
VAR_040136
Natural variant 12469 (length 1): V → A. Ref.43
VAR_040137
Natural variant 12642 (length 1): R → C in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040138
Natural variant 12657 (length 1): E → K in a Wilms tumor; somatic mutation. Ref.43
VAR_040139
Natural variant 12679 (length 1): K → E. Ref.1 Ref.43
VAR_040140
Natural variant 12720 (length 1): S → F in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040141
Natural variant 12798 (length 1): R → C. Ref.43
VAR_040142
Natural variant 13049 (length 1): E → G. Ref.43
VAR_040143
Natural variant 13083 (length 1): E → K in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040144
Natural variant 13096 (length 1): R → L. Ref.43
VAR_040145
Natural variant 13099 (length 1): Q → R in a lung small cell carcinoma sample; somatic mutation. Ref.43
VAR_040146
Natural variant 13297 (length 1): V → A. Ref.43
VAR_040147
Natural variant 13399 (length 1): I → M. Ref.43
VAR_040148
Natural variant 13418 (length 1): A → T. Ref.43
VAR_040149
Natural variant 13428 (length 1): E → V. Ref.43
VAR_040150
Natural variant 13430 (length 1): I → T. Ref.43
VAR_040151
Natural variant 13434 (length 1): R → K in a breast pleomorphic lobular carcinoma sample; somatic mutation. Ref.43
VAR_040152
Natural variant 13469 (length 1): D → N. Ref.43
VAR_040153
Natural variant 13495 (length 1): K → N. Ref.43
VAR_040154
Natural variant 13785 (length 1): N → S in a breast pleomorphic lobular carcinoma sample; somatic mutation. Ref.43
VAR_040155
Natural variant 13870 (length 1): Q → H in a lung small cell carcinoma sample; somatic mutation. Ref.43
VAR_040156
Natural variant 14109 (length 1): V → I. Ref.43
VAR_040157
Natural variant 14131 (length 1): R → Q. Ref.43
VAR_040158
Natural variant 14208 (length 1): P → T. Ref.43
VAR_040159
Natural variant 14728 (length 1): L → V in a lung adenocarcinoma sample; somatic mutation. Ref.43
VAR_040160
Natural variant 14999 (length 1): S → T. Ref.43
VAR_040161
Natural variant 15021 (length 1): N → T. Ref.43
VAR_040162
Natural variant 15520 (length 1): A → V. Ref.43
VAR_040163
Natural variant 15555 (length 1): R → I in a colorectal adenocarcinoma sample; somatic mutation. Ref.43
VAR_040164
Natural variant 15620 (length 1): R → Q. Ref.43
VAR_040165
Natural variant 15629 (length 1): S → I. Ref.43
VAR_040166
Natural variant 15635 (length 1): Y → C in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040167
Natural variant 15700 (length 1): R → Q. Ref.43
VAR_040168
Natural variant 15705 (length 1): L → P. Ref.43
VAR_040169
Natural variant 15837 (length 1): I → M. Ref.43
VAR_040170
Natural variant 16058 (length 1): R → H. Ref.43
VAR_040171
Natural variant 16067 (length 1): K → I. Ref.43
VAR_040172
Natural variant 16090 (length 1): I → T. Ref.43
VAR_040173
Natural variant 16195 (length 1): R → H in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040174
Natural variant 16409 (length 1): R → C. Ref.43
VAR_040175
Natural variant 16424 (length 1): R → P. Ref.43
VAR_040176
Natural variant 16575 (length 1): V → M.
Corresponds to variant rs3813243 [ dbSNP | Ensembl ].
VAR_056087
Natural variant 16629 (length 1): I → M. Ref.43
VAR_040177
Natural variant 16877 (length 1): K → R. Ref.43
VAR_040178
Natural variant 17060 (length 1): N → D. Ref.43
VAR_040179
Natural variant 17637 (length 1): I → V. Ref.43
VAR_040180
Natural variant 17838 (length 1): R → H. Ref.43
VAR_040181
Natural variant 17866 (length 1): D → N. Ref.43
VAR_040182
Natural variant 17906 (length 1): G → E in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040183
Natural variant 18094 (length 1): E → A. Ref.43
VAR_040184
Natural variant 18109 (length 1): G → S. Ref.43
VAR_040185
Natural variant 18164 (length 1): R → T in an ovarian serous carcinoma sample; somatic mutation. Ref.43
VAR_040186
Natural variant 18221 (length 1): P → L. Ref.43
VAR_040187
Natural variant 18222 (length 1): A → T. Ref.43
VAR_040188
Natural variant 18726 (length 1): R → Q. Ref.43
VAR_040189
Natural variant 18835 (length 1): V → A in a breast infiltrating ductal carcinoma sample; somatic mutation. Ref.43
VAR_040190
Natural variant 18881 (length 1): R → K in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040191
Natural variant 18939 (length 1): N → S. Ref.43
VAR_040192
Natural variant 19000 (length 1): R → Q. Ref.43
VAR_040193
Natural variant 19060 (length 1): L → Q in a lung large cell carcinoma sample; somatic mutation. Ref.43
VAR_040194
Natural variant 19091 (length 1): R → K in a lung large cell carcinoma sample; somatic mutation. Ref.43
VAR_040195
Natural variant 19224 (length 1): P → S in a colorectal adenocarcinoma sample; somatic mutation. Ref.43
VAR_040196
Natural variant 19367 (length 1): T → I. Ref.43
VAR_040197
Natural variant 19392 (length 1): E → K in a lung neuroendocrine carcinoma sample; somatic mutation. Ref.43
VAR_040198
Natural variant 19480 (length 1): A → S in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040199
Natural variant 19495 (length 1): D → G. Ref.43
VAR_040200
Natural variant 19665 (length 1): R → H. Ref.43
VAR_040201
Natural variant 19762 (length 1): T → I. Ref.1 Ref.43
VAR_040202
Natural variant 19947 (length 1): G → R. Ref.43
VAR_040203
Natural variant 19956 (length 1): V → M. Ref.43
VAR_040204
Natural variant 19992 (length 1): R → Q. Ref.43
VAR_040205
Natural variant 20057 (length 1): R → C. Ref.43
VAR_040206
Natural variant 20075 (length 1): S → L. Ref.43
VAR_040207
Natural variant 20179 (length 1): T → K. Ref.43
VAR_040208
Natural variant 20198 (length 1): A → T. Ref.43
VAR_040209
Natural variant 20198 (length 1): A → V. Ref.43
VAR_040210
Natural variant 20331 (length 1): R → H. Ref.43
VAR_040211
Natural variant 20359 (length 1): R → K.
Corresponds to variant rs9808036 [ dbSNP | Ensembl ].
VAR_056088
Natural variant 20408 (length 1): A → T in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040212
Natural variant 20564 (length 1): R → K. Ref.43
VAR_040213
Natural variant 20718 (length 1): V → I. Ref.1 Ref.43
VAR_040214
Natural variant 20726 (length 1): S → P. Ref.43
VAR_040215
Natural variant 20892 (length 1): T → N. Ref.43
VAR_040216
Natural variant 20894 (length 1): S → R. Ref.43
VAR_040217
Natural variant 21125 (length 1): D → E. Ref.43
VAR_040218
Natural variant 21403 (length 1): P → S. Ref.43
VAR_040219
Natural variant 21730 (length 1): R → C. Ref.43
VAR_040220
Natural variant 21747 (length 1): R → Q. Ref.43
VAR_040221
Natural variant 21851 (length 1): C → R in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040222
Natural variant 21925 (length 1): G → R. Ref.43
VAR_040223
Natural variant 21995 (length 1): R → H. Ref.43
VAR_040224
Natural variant 22045 (length 1): A → V. Ref.43
VAR_040225
Natural variant 22149 (length 1): R → H. Ref.43
VAR_040226
Natural variant 22160 (length 1): V → I. Ref.43
VAR_040227
Natural variant 22261 (length 1): I → T. Ref.43
VAR_040228
Natural variant 22306 (length 1): K → N. Ref.43
VAR_040229
Natural variant 22357 (length 1): R → H. Ref.43
VAR_040230
Natural variant 22408 (length 1): L → P. Ref.43
VAR_040231
Natural variant 22537 (length 1): Q → H in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040232
Natural variant 22584 (length 1): P → L. Ref.43
VAR_040233
Natural variant 22646 (length 1): L → P in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040234
Natural variant 22670 (length 1): T → A. Ref.43
VAR_040235
Natural variant 22770 (length 1): A → D. Ref.43
VAR_040236
Natural variant 22801 (length 1): A → T in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040237
Natural variant 22823 (length 1): R → W in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040238
Natural variant 22968 (length 1): E → Q. Ref.43
VAR_040239
Natural variant 23074 (length 1): P → L. Ref.43
VAR_040240
Natural variant 23079 (length 1): L → F in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040241
Natural variant 23282 (length 1): D → N in a breast infiltrating ductal carcinoma sample; somatic mutation. Ref.43
VAR_040242
Natural variant 23303 (length 1): H → Y in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040243
Natural variant 23306 (length 1): R → C. Ref.43
VAR_040244
Natural variant 23515 (length 1): A → S in a lung squamous cell carcinoma sample; somatic mutation. Ref.43
VAR_040245
Natural variant 23551 (length 1): E → Q in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040246
Natural variant 23807 (length 1): S → N. Ref.1 Ref.43
VAR_040247
Natural variant 23872 (length 1): D → N in an ovarian serous carcinoma sample; somatic mutation. Ref.43
VAR_040248
Natural variant 23891 (length 1): V → A. Ref.43
VAR_040249
Natural variant 23933 (length 1): Y → H. Ref.43
VAR_040250
Natural variant 23939 (length 1): T → M. Ref.43
VAR_040251
Natural variant 23952 (length 1): F → L in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040252
Natural variant 24098 (length 1): A → G. Ref.43
VAR_040253
Natural variant 24098 (length 1): A → T.
Corresponds to variant rs4894028 [ dbSNP | Ensembl ].
VAR_056089
Natural variant 24119 (length 1): N → S. Ref.43
VAR_040254
Natural variant 24133 (length 1): V → I. Ref.43
VAR_040255
Natural variant 24159 (length 1): V → A in a head and neck squamous cell carcinoma sample; somatic mutation. Ref.43
VAR_040256
Natural variant 24239 (length 1): T → A. Ref.43
VAR_040257
Natural variant 24265 (length 1): E → K. Ref.43
VAR_040258
Natural variant 24584 (length 1): I → T. Ref.43
VAR_040259
Natural variant 24781 (length 1): I → T. Ref.43
VAR_040260
Natural variant 24799 (length 1): R → H. Ref.43
VAR_040261
Natural variant 24954 (length 1): D → H. Ref.43
VAR_040262
Natural variant 24980 (length 1): T → M. Ref.1 Ref.43
VAR_040263
Natural variant 25659 (length 1): R → H. Ref.43
VAR_040264
Natural variant 25679 (length 1): A → T. Ref.43
VAR_040265
Natural variant 25720 (length 1): P → A. Ref.43
VAR_040266
Natural variant 25821 (length 1): T → K. Ref.43
VAR_040267
Natural variant 25859 (length 1): E → K in a metastatic melanoma sample; somatic mutation. Ref.43
VAR_040268
Natural variant 25879 (length 1): N → K. Ref.43
VAR_040269
Natural variant 25923 (length 1): A → V. Ref.43
VAR_040270
Natural variant 26045 (length 1): V → I. Ref.43
VAR_040271
Natural variant 26059 (length 1): K → E in a lung small cell carcinoma sample; somatic mutation. Ref.43
VAR_040272
Natural variant 26134 (length 1): I → V. Ref.43
VAR_040273
Natural variant 26477 (length 1): R → C. Ref.43
VAR_040274
Natural variant 26843 (length 1): D → Y. Ref.43
VAR_040275
Natural variant 27346 (length 1): K → R. Ref.43
VAR_040276
Natural variant 27652 (length 1): R → C. Ref.43
VAR_040277
Natural variant 27728 (length 1): G → V. Ref.43
VAR_040278
Natural variant 27754 (length 1): F → L. Ref.43
VAR_040279
Natural variant 27755 (length 1): I → T. Ref.1 Ref.43
VAR_040280
Natural variant 27929 (length 1): I → V. Ref.43
VAR_040281
Natural variant 28132 (length 1): I → L. Ref.43
VAR_040282
Natural variant 28168 (length 1): R → Q. Ref.43
VAR_040283
Natural variant 28538 (length 1): R → H. Ref.43
VAR_040284
Natural variant 28572 (length 1): I → T. Ref.43
VAR_040285
Natural variant 28948 (length 1): A → T. Ref.43
VAR_040286
Natural variant 28986 (length 1): I → V. Ref.43
VAR_040287
Natural variant 28993 (length 1): G → E. Ref.43
VAR_040288
Natural variant 28998 (length 1): L → V. Ref.43
VAR_040289
Natural variant 29070 (length 1): V → M. Ref.43
VAR_040290
Natural variant 29090 (length 1): I → V. Ref.43
VAR_040291
Natural variant 29419 (length 1): R → C. Ref.43
VAR_040292
Natural variant 29479 (length 1): L → P. Ref.43
VAR_040293
Natural variant 29880 (length 1): S → L in a colorectal adenocarcinoma sample; somatic mutation. Ref.43
VAR_040294
Natural variant 29976 (length 1): D → E. Ref.43
VAR_040295
Natural variant 30042 (length 1): S → G. Ref.43
VAR_040296
Natural variant 30107 (length 1): R → C. Ref.43
VAR_040297
Natural variant 30125 (length 1): S → F. Ref.43
VAR_040298
Natural variant 30211 (length 1): L → P. Ref.43
VAR_040299
Natural variant 30412 (length 1): I → T. Ref.43
VAR_040300
Natural variant 30617 (length 1): T → S in a renal chromophobe cancer sample; somatic mutation. Ref.43
VAR_040301
Natural variant 30674 (length 1): T → I. Ref.43
VAR_040302
Natural variant 30809 (length 1): V → I in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040303
Natural variant 30818 (length 1): F → I. Ref.43
VAR_040304
Natural variant 30825 (length 1): E → K. Ref.43
VAR_040305
Natural variant 30856 (length 1): I → T. Ref.43
VAR_040306
Natural variant 30887 (length 1): G → D. Ref.43
VAR_040307
Natural variant 30887 (length 1): G → S. Ref.43
VAR_040308
Natural variant 30897 (length 1): R → H. Ref.43
VAR_040309
Natural variant 30907 (length 1): R → H. Ref.43
VAR_040310
Natural variant 30946 (length 1): R → H. Ref.43
VAR_040311
Natural variant 31081 (length 1): I → F. Ref.43
VAR_040312
Natural variant 31107 (length 1): R → C. Ref.43
VAR_040313
Natural variant 31124 (length 1): A → G. Ref.43
VAR_040314
Natural variant 31156 (length 1): N → S. Ref.43
VAR_040315
Natural variant 31246 (length 1): P → T. Ref.43
VAR_040316
Natural variant 31330 (length 1): R → H. Ref.43
VAR_040317
Natural variant 31690 (length 1): C → R. Ref.43
VAR_040318
Natural variant 31724 (length 1): R → Q. Ref.43
VAR_040319
Natural variant 31725 (length 1): V → I. Ref.43
VAR_040320
Natural variant 31732 (length 1): G → S. Ref.43
VAR_040321
Natural variant 31886 (length 1): V → I. Ref.43
VAR_040322
Natural variant 32097 (length 1): R → C. Ref.43
VAR_040323
Natural variant 32171 (length 1): T → N in a lung large cell carcinoma sample; somatic mutation. Ref.43
VAR_040324
Natural variant 32248 (length 1): V → I. Ref.43
VAR_040325
Natural variant 32281 (length 1): Q → H. Ref.43
VAR_040326
Natural variant 32323 (length 1): R → H. Ref.43
VAR_040327
Natural variant 32411 (length 1): R → W in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040328
Natural variant 32558 (length 1): I → V. Ref.43
VAR_040329
Natural variant 32610 (length 1): M → V. Ref.43
VAR_040330
Natural variant 32637 (length 1): G → V. Ref.43
VAR_040331
Natural variant 32922 (length 1): V → A. Ref.43
VAR_040332
Natural variant 32943 (length 1): L → R in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040333
Natural variant 32953 (length 1): R → H. Ref.43
VAR_040334
Natural variant 32996 (length 1): R → Q in CMD1G. Ref.40
VAR_026693
Natural variant 33213 (length 1): V → L. Ref.43
VAR_040335
Natural variant 33242 (length 1): R → C in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040336
Natural variant 33387 (length 1): T → M. Ref.43
VAR_040337
Natural variant 33419 (length 1): E → D. Ref.43
VAR_040338
Natural variant 33536 (length 1): V → M. Ref.43
VAR_040339
Natural variant 33568 (length 1): K → Q. Ref.43
VAR_040340
Natural variant 33616 (length 1): E → K. Ref.43
VAR_040341
Natural variant 33620 (length 1): P → L. Ref.43
VAR_040342
Natural variant 33886 (length 1): E → V. Ref.43
VAR_040343
Natural variant 33899 (length 1): I → T. Ref.43
VAR_040344
Natural variant 33904 (length 1): L → P in a gastric adenocarcinoma sample; somatic mutation. Ref.43
VAR_040345
Natural variant 33955 (length 1): T → I. Ref.43
VAR_040346
Natural variant 34115 (length 1): V → A. Ref.43
VAR_040347
Natural variant 34306 (length 1): I → N in TMD. Ref.39
VAR_026694
Natural variant 34315 (length 1): L → P in TMD. Ref.36
VAR_026695
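
Each natural variant above is a single-residue substitution at a 1-based position in the isoform 1 sequence. The following is a minimal Python sketch of how one such row could be applied to a sequence, checking the expected reference residue first. The function name `apply_variant` is hypothetical, and only the first 60 residues of isoform 1 (as listed in the Sequences section) are used for illustration.

```python
def apply_variant(seq: str, pos: int, ref: str, alt: str) -> str:
    """Return seq with the residue at 1-based position pos changed from ref to alt."""
    found = seq[pos - 1]
    if found != ref:
        raise ValueError(f"expected {ref} at position {pos}, found {found}")
    return seq[:pos - 1] + alt + seq[pos:]

# First 60 residues of isoform 1, copied from the Sequences section
head = ("MTTQAPTFTQPLQSVVVLEGSTATFEAHISGFPVPEVSWF"
        "RDGQVISTSTLPGVQISFSD")

# V -> M at position 54 (VAR_026685)
mutant = apply_variant(head, 54, "V", "M")
print(mutant[50:60])
```

The reference-residue check guards against off-by-one errors when positions are read from a flattened table.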

Experimental info

Mutagenesis 32207 (length 1): K → A: Disrupts catalytic activity. Ref.31
Mutagenesis 32341 (length 1): Y → E: No phosphorylation on tyrosine. Ref.31
Sequence conflict 132 (length 1): T → N in CAA62188. Ref.1
Sequence conflict 255 (length 1): P → H in CAD12455. Ref.2
Sequence conflict 255 (length 1): P → H in CAD12456. Ref.2
Sequence conflict 255 (length 1): P → H in CAD12457. Ref.2
Sequence conflict 1472 – 1474 (length 3): QTA → ANC in CAA62188. Ref.1
Sequence conflict 1730 (length 1): G → S in CAA62188. Ref.1
Sequence conflict 3919 (length 1): P → L in CAA62188. Ref.1
Sequence conflict 4525 (length 1): K → R in CAA62189. Ref.1
Sequence conflict 6619 (length 1): E → R in CAA62189. Ref.1
Sequence conflict 7145 (length 1): D → H in CAA62189. Ref.1
Sequence conflict 7441 (length 1): S → N in CAA62189. Ref.1
Sequence conflict 8038 (length 1): E → A in CAA62189. Ref.1
Sequence conflict 8225 (length 1): H → Q in CAA62189. Ref.1
Sequence conflict 8850 (length 1): A → P in CAA62189. Ref.1
Sequence conflict 9693 (length 1): C → V in CAA62189. Ref.1
Sequence conflict 9892 (length 1): I → V in CAA62189. Ref.1
Sequence conflict 10305 (length 1): R → G in CAA62189. Ref.1
Sequence conflict 10305 (length 1): R → G Ref.9
Sequence conflict 10566 (length 1): R → K in CAA62189. Ref.1
Sequence conflict 10779 (length 1): R → H in CAA62189. Ref.1
Sequence conflict 11424 (length 1): D → E in CAA62189. Ref.1
Sequence conflict 11511 (length 1): A → V in CAA62189. Ref.1
Sequence conflict 11626 (length 1): K → N in CAA62189. Ref.1
Sequence conflict 11838 (length 1): K → N in CAA62189. Ref.1
Sequence conflict 12994 (length 1): A → T in CAA62188. Ref.1
Sequence conflict 13795 – 13797 (length 3): YKF → IQI Ref.11
Sequence conflict 14590 (length 1): A → T in CAA62188. Ref.1
Sequence conflict 14792 (length 1): V → A in CAA62188. Ref.1
Sequence conflict 15974 (length 1): E → K in CAA62188. Ref.1
Sequence conflict 16234 – 16235 (length 2): TK → QR in CAA62188. Ref.1
Sequence conflict 17238 (length 1): R → H in CAA62188. Ref.1
Sequence conflict 19734 (length 1): V → A in CAA62188. Ref.1
Sequence conflict 20622 (length 1): L → H in CAA62188. Ref.1
Sequence conflict 20775 (length 1): A → P in CAA62188. Ref.1
Sequence conflict 21625 (length 1): Y → I in CAA45939. Ref.13
Sequence conflict 21795 – 21800 (length 6): PGPVLN → ARPSPQ in CAA45939. Ref.13
Sequence conflict 22176 (length 1): C → W in CAA62188. Ref.1
Sequence conflict 22816 (length 1): A → D in CAA62188. Ref.1
Sequence conflict 22837 (length 1): T → A in CAA62188. Ref.1
Sequence conflict 24176 (length 1): I → M in CAA62188. Ref.1
Sequence conflict 24176 (length 1): I → M in CAA45940. Ref.13
Sequence conflict 24181 (length 1): S → F in CAA45940. Ref.13
Sequence conflict 25731 (length 1): P → S in CAA62188. Ref.1
Sequence conflict 26573 (length 1): M → V in CAA62188. Ref.1
Sequence conflict 26846 – 26848 (length 3): IVE → HRK in CAA62188. Ref.1
Sequence conflict 27879 (length 1): G → A in CAA62188. Ref.1
Sequence conflict 28936 (length 1): D → N in CAA62188. Ref.1
Sequence conflict 29222 (length 1): A → T in CAA62188. Ref.1
Sequence conflict 29518 (length 1): F → L in CAA62188. Ref.1
Sequence conflict 29701 (length 1): T → P in CAA45938. Ref.14
Sequence conflict 29701 (length 1): T → P in CAA49245. Ref.14
Sequence conflict 29873 (length 1): E → G in CAA45938. Ref.14
Sequence conflict 29873 (length 1): E → G in CAA49245. Ref.14
Sequence conflict 29878 (length 1): T → Q in CAA45938. Ref.14
Sequence conflict 29878 (length 1): T → Q in CAA49245. Ref.14
Sequence conflict 30209 (length 1): K → E in CAA62188. Ref.1
Sequence conflict 30209 (length 1): K → E in CAA45938. Ref.14
Sequence conflict 30209 (length 1): K → E in CAA49245. Ref.14
Sequence conflict 30256 (length 1): A → R in CAA62188. Ref.1
Sequence conflict 30256 (length 1): A → R in CAA45938. Ref.14
Sequence conflict 30256 (length 1): A → R in CAA49245. Ref.14
Sequence conflict 30496 (length 1): V → I in CAA62188. Ref.1
Sequence conflict 30496 (length 1): V → I in CAA45938. Ref.14
Sequence conflict 30496 (length 1): V → I in CAA49245. Ref.14
Sequence conflict 30534 (length 1): Y → H in CAA62188. Ref.1
Sequence conflict 30534 (length 1): Y → H in CAA45938. Ref.14
Sequence conflict 30534 (length 1): Y → H in CAA49245. Ref.14
Sequence conflict 30748 (length 1): S → L in CAA45938. Ref.14
Sequence conflict 30748 (length 1): S → L in CAA49245. Ref.14
Sequence conflict 31303 (length 1): W → V in CAA62188. Ref.1
Sequence conflict 31303 (length 1): W → V in CAA45938. Ref.14
Sequence conflict 31303 (length 1): W → V in CAA49245. Ref.14
Sequence conflict 32060 (length 1): V → A in CAA62188. Ref.1
Sequence conflict 32060 (length 1): V → A in CAA45938. Ref.14
Sequence conflict 32060 (length 1): V → A in CAA49245. Ref.14
Sequence conflict 33084 (length 1): D → N in CAA62188. Ref.1
Sequence conflict 33084 (length 1): D → N in CAA49245. Ref.14
Sequence conflict 33120 (length 1): R → W in CAD28458. Ref.15
Sequence conflict 33382 (length 1): A → R in CAA49245. Ref.14
Sequence conflict 33424 (length 1): S → P in CAA62188. Ref.1
Sequence conflict 33424 (length 1): S → P in CAA49245. Ref.14
Sequence conflict 33526 (length 1): Q → R in CAA49245. Ref.14
Sequence conflict 33633 (length 1): T → I in CAA62188. Ref.1
Sequence conflict 33633 (length 1): T → I in CAA49245. Ref.14
Sequence conflict 33716 (length 1): Y → V in CAD28458. Ref.15
Sequence conflict 33995 (length 1): V → A in CAD28458. Ref.15
Sequence conflict 34071 (length 1): N → G in AAI07798. Ref.5
Sequence conflict 34082 (length 1): C → R in CAA49245. Ref.14
Sequence conflict 34253 (length 1): G → R in AAI07798. Ref.5

Secondary structure

[Secondary structure diagram spanning residues 1 to 34350; legend: Helix, Strand, Turn]

Sequences

Isoform 1 [UniParc].

Last modified October 19, 2011. Version 3.
Checksum: 084F69B34544CE7F

Length: 34,350
Mass (Da): 3,816,141
        10         20         30         40         50         60 
MTTQAPTFTQ PLQSVVVLEG STATFEAHIS GFPVPEVSWF RDGQVISTST LPGVQISFSD 

        70         80         90        100        110        120 
GRAKLTIPAV TKANSGRYSL KATNGSGQAT STAELLVKAE TAPPNFVQRL QSMTVRQGSQ 

       130        140        150        160        170        180 
VRLQVRVTGI PTPVVKFYRD GAEIQSSLDF QISQEGDLYS LLIAEAYPED SGTYSVNATN 

       190        200        210        220        230        240 
SVGRATSTAE LLVQGEEEVP AKKTKTIVST AQISESRQTR IEKKIEAHFD ARSIATVEMV 

       250        260        270        280        290        300 
IDGAAGQQLP HKTPPRIPPK PKSRSPTPPS IAAKAQLARQ QSPSPIRHSP SPVRHVRAPT 

       310        320        330        340        350        360 
PSPVRSVSPA ARISTSPIRS VRSPLLMRKT QASTVATGPE VPPPWKQEGY VASSSEAEMR 

       370        380        390        400        410        420 
ETTLTTSTQI RTEERWEGRY GVQEQVTISG AAGAAASVSA SASYAAEAVA TGAKEVKQDA 

       430        440        450        460        470        480 
DKSAAVATVV AAVDMARVRE PVISAVEQTA QRTTTTAVHI QPAQEQVRKE AEKTAVTKVV 

       490        500        510        520        530        540 
VAADKAKEQE LKSRTKEVIT TKQEQMHVTH EQIRKETEKT FVPKVVISAA KAKEQETRIS 

       550        560        570        580        590        600 
EEITKKQKQV TQEAIRQETE ITAASMVVVA TAKSTKLETV PGAQEETTTQ QDQMHLSYEK 

       610        620        630        640        650        660 
IMKETRKTVV PKVIVATPKV KEQDLVSRGR EGITTKREQV QITQEKMRKE AEKTALSTIA 

       670        680        690        700        710        720 
VATAKAKEQE TILRTRETMA TRQEQIQVTH GKVDVGKKAE AVATVVAAVD QARVREPREP 

       730        740        750        760        770        780 
GHLEESYAQQ TTLEYGYKER ISAAKVAEPP QRPASEPHVV PKAVKPRVIQ APSETHIKTT 

       790        800        810        820        830        840 
DQKGMHISSQ IKKTTDLTTE RLVHVDKRPR TASPHFTVSK ISVPKTEHGY EASIAGSAIA 

       850        860        870        880        890        900 
TLQKELSATS SAQKITKSVK APTVKPSETR VRAEPTPLPQ FPFADTPDTY KSEAGVEVKK 

       910        920        930        940        950        960 
EVGVSITGTT VREERFEVLH GREAKVTETA RVPAPVEIPV TPPTLVSGLK NVTVIEGESV 

       970        980        990       1000       1010       1020 
TLECHISGYP SPTVTWYRED YQIESSIDFQ ITFQSGIARL MIREAFAEDS GRFTCSAVNE 

      1030       1040       1050       1060       1070       1080 
AGTVSTSCYL AVQVSEEFEK ETTAVTEKFT TEEKRFVESR DVVMTDTSLT EEQAGPGEPA 

      1090       1100       1110       1120       1130       1140 
APYFITKPVV QKLVEGGSVV FGCQVGGNPK PHVYWKKSGV PLTTGYRYKV SYNKQTGECK 

      1150       1160       1170       1180       1190       1200 
LVISMTFADD AGEYTIVVRN KHGETSASAS LLEEADYELL MKSQQEMLYQ TQVTAFVQEP 

      1210       1220       1230       1240       1250       1260 
KVGETAPGFV YSEYEKEYEK EQALIRKKMA KDTVVVRTYV EDQEFHISSF EERLIKEIEY 

      1270       1280       1290       1300       1310       1320 
RIIKTTLEEL LEEDGEEKMA VDISESEAVE SGFDSRIKNY RILEGMGVTF HCKMSGYPLP 

      1330       1340       1350       1360       1370       1380 
KIAWYKDGKR IKHGERYQMD FLQDGRASLR IPVVLPEDEG IYTAFASNIK GNAICSGKLY 

      1390       1400       1410       1420       1430       1440 
VEPAAPLGAP TYIPTLEPVS RIRSLSPRSV SRSPIRMSPA RMSPARMSPA RMSPARMSPG 

      1450       1460       1470       1480       1490       1500 
RRLEETDESQ LERLYKPVFV LKPVSFKCLE GQTARFDLKV VGRPMPETFW FHDGQQIVND 

      1510       1520       1530       1540       1550       1560 
YTHKVVIKED GTQSLIIVPA TPSDSGEWTV VAQNRAGRSS ISVILTVEAV EHQVKPMFVE 

      1570       1580       1590       1600       1610       1620 
KLKNVNIKEG SRLEMKVRAT GNPNPDIVWL KNSDIIVPHK YPKIRIEGTK GEAALKIDST 

      1630       1640       1650       1660       1670       1680 
VSQDSAWYTA TAINKAGRDT TRCKVNVEVE FAEPEPERKL IIPRGTYRAK EIAAPELEPL 

      1690       1700       1710       1720       1730       1740 
HLRYGQEQWE EGDLYDKEKQ QKPFFKKKLT SLRLKRFGPA HFECRLTPIG DPTMVVEWLH 

      1750       1760       1770       1780       1790       1800 
DGKPLEAANR LRMINEFGYC SLDYGVAYSR DSGIITCRAT NKYGTDHTSA TLIVKDEKSL 

      1810       1820       1830       1840       1850       1860 
VEESQLPEGR KGLQRIEELE RMAHEGALTG VTTDQKEKQK PDIVLYPEPV RVLEGETARF 

      1870       1880       1890       1900       1910       1920 
RCRVTGYPQP KVNWYLNGQL IRKSKRFRVR YDGIHYLDIV DCKSYDTGEV KVTAENPEGV 

      1930       1940       1950       1960       1970       1980 
IEHKVKLEIQ QREDFRSVLR RAPEPRPEFH VHEPGKLQFE VQKVDRPVDT TETKEVVKLK 

      1990       2000       2010       2020       2030       2040 
RAERITHEKV PEESEELRSK FKRRTEEGYY EAITAVELKS RKKDESYEEL LRKTKDELLH 

      2050       2060       2070       2080       2090       2100 
WTKELTEEEK KALAEEGKIT IPTFKPDKIE LSPSMEAPKI FERIQSQTVG QGSDAHFRVR 

      2110       2120       2130       2140       2150       2160 
VVGKPDPECE WYKNGVKIER SDRIYWYWPE DNVCELVIRD VTAEDSASIM VKAINIAGET 

      2170       2180       2190       2200       2210       2220 
SSHAFLLVQA KQLITFTQEL QDVVAKEKDT MATFECETSE PFVKVKWYKD GMEVHEGDKY 

      2230       2240       2250       2260       2270       2280 
RMHSDRKVHF LSILTIDTSD AEDYSCVLVE DENVKTTAKL IVEGAVVEFV KELQDIEVPE 

      2290       2300       2310       2320       2330       2340 
SYSGELECIV SPENIEGKWY HNDVELKSNG KYTITSRRGR QNLTVKDVTK EDQGEYSFVI 

      2350       2360       2370       2380       2390       2400 
DGKKTTCKLK MKPRPIAILQ GLSDQKVCEG DIVQLEVKVS LESVEGVWMK DGQEVQPSDR 

      2410       2420       2430       2440       2450       2460 
VHIVIDKQSH MLLIEDMTKE DAGNYSFTIP ALGLSTSGRV SVYSVDVITP LKDVNVIEGT 

      2470       2480       2490       2500       2510       2520 
KAVLECKVSV PDVTSVKWYL NDEQIKPDDR VQAIVKGTKQ RLVINRTHAS DEGPYKLIVG 

      2530       2540       2550       2560       2570       2580 
RVETNCNLSV EKIKIIRGLR DLTCTETQNV VFEVELSHSG IDVLWNFKDK EIKPSSKYKI 

      2590       2600       2610       2620       2630       2640 
EAHGKIYKLT VLNMMKDDEG KYTFYAGENM TSGKLTVAGG AISKPLTDQT VAESQEAVFE 

      2650       2660       2670       2680       2690       2700 
CEVANPDSKG EWLRDGKHLP LTNNIRSESD GHKRRLIIAA TKLDDIGEYT YKVATSKTSA 

      2710       2720       2730       2740       2750       2760 
KLKVEAVKIK KTLKNLTVTE TQDAVFTVEL THPNVKGVQW IKNGVVLESN EKYAISVKGT 

      2770       2780       2790       2800       2810       2820 
IYSLRIKNCA IVDESVYGFR LGRLGASARL HVETVKIIKK PKDVTALENA TVAFEVSVSH 

      2830       2840       2850       2860       2870       2880 
DTVPVKWFHK SVEIKPSDKH RLVSERKVHK LMLQNISPSD AGEYTAVVGQ LECKAKLFVE 

      2890       2900       2910       2920       2930       2940 
TLHITKTMKN IEVPETKTAS FECEVSHFNV PSMWLKNGVE IEMSEKFKIV VQGKLHQLII 

      2950       2960       2970       2980       2990       3000 
MNTSTEDSAE YTFVCGNDQV SATLTVTPIM ITSMLKDINA EEKDTITFEV TVNYEGISYK 

      3010       3020       3030       3040       3050       3060 
WLKNGVEIKS TDKCQMRTKK LTHSLNIRNV HFGDAADYTF VAGKATSTAT LYVEARHIEF 

      3070       3080       3090       3100       3110       3120 
RKHIKDIKVL EKKRAMFECE VSEPDITVQW MKDDQELQIT DRIKIQKEKY VHRLLIPSTR 

      3130       3140       3150       3160       3170       3180 
MSDAGKYTVV AGGNVSTAKL FVEGRDVRIR SIKKEVQVIE KQRAVVEFEV NEDDVDAHWY 

      3190       3200       3210       3220       3230       3240 
KDGIEINFQV QERHKYVVER RIHRMFISET RQSDAGEYTF VAGRNRSSVT LYVNAPEPPQ 

      3250       3260       3270       3280       3290       3300 
VLQELQPVTV QSGKPARFCA VISGRPQPKI SWYKEEQLLS TGFKCKFLHD GQEYTLLLIE 

      3310       3320       3330       3340       3350       3360 
AFPEDAAVYT CEAKNDYGVA TTSASLSVEV PEVVSPDQEM PVYPPAIITP LQDTVTSEGQ 

      3370       3380       3390       3400       3410       3420 
PARFQCRVSG TDLKVSWYSK DKKIKPSRFF RMTQFEDTYQ LEIAEAYPED EGTYTFVASN 

      3430       3440       3450       3460       3470       3480 
AVGQVSSTAN LSLEAPESIL HERIEQEIEM EMKEFSSSFL SAEEEGLHSA ELQLSKINET 

      3490       3500       3510       3520       3530       3540 
LELLSESPVY STKFDSEKEG TGPIFIKEVS NADISMGDVA TLSVTVIGIP KPKIQWFFNG 

      3550       3560       3570       3580       3590       3600 
VLLTPSADYK FVFDGDDHSL IILFTKLEDE GEYTCMASND YGKTICSAYL KINSKGEGHK 

      3610       3620       3630       3640       3650       3660 
DTETESAVAK SLEKLGGPCP PHFLKELKPI RCAQGLPAIF EYTVVGEPAP TVTWFKENKQ 

      3670       3680       3690       3700       3710       3720 
LCTSVYYTII HNPNGSGTFI VNDPQREDSG LYICKAENML GESTCAAELL VLLEDTDMTD 

      3730       3740       3750       3760       3770       3780 
TPCKAKSTPE APEDFPQTPL KGPAVEALDS EQEIATFVKD TILKAALITE ENQQLSYEHI 

      3790       3800       3810       3820       3830       3840 
AKANELSSQL PLGAQELQSI LEQDKLTPES TREFLCINGS IHFQPLKEPS PNLQLQIVQS 

      3850       3860       3870       3880       3890       3900 
QKTFSKEGIL MPEEPETQAV LSDTEKIFPS AMSIEQINSL TVEPLKTLLA EPEGNYPQSS 

      3910       3920       3930       3940       3950       3960 
IEPPMHSYLT SVAEEVLSPK EKTVSDTNRE QRVTLQKQEA QSALILSQSL AEGHVESLQS 

      3970       3980       3990       4000       4010       4020 
PDVMISQVNY EPLVPSEHSC TEGGKILIES ANPLENAGQD SAVRIEEGKS LRFPLALEEK 

      4030       4040       4050       4060       4070       4080 
QVLLKEEHSD NVVMPPDQII ESKREPVAIK KVQEVQGRDL LSKESLLSGI PEEQRLNLKI 

      4090       4100       4110       4120       4130       4140 
QICRALQAAV ASEQPGLFSE WLRNIEKVEV EAVNITQEPR HIMCMYLVTS AKSVTEEVTI 

      4150       4160       4170       4180       4190       4200 
IIEDVDPQMA NLKMELRDAL CAIIYEEIDI LTAEGPRIQQ GAKTSLQEEM DSFSGSQKVE 

      4210       4220       4230       4240       4250       4260 
PITEPEVESK YLISTEEVSY FNVQSRVKYL DATPVTKGVA SAVVSDEKQD ESLKPSEEKE 

      4270       4280       4290       4300       4310       4320 
ESSSESGTEE VATVKIQEAE GGLIKEDGPM IHTPLVDTVS EEGDIVHLTT SITNAKEVNW 

      4330       4340       4350       4360       4370       4380 
YFENKLVPSD EKFKCLQDQN TYTLVIDKVN TEDHQGEYVC EALNDSGKTA TSAKLTVVKR 

      4390       4400       4410       4420       4430       4440 
AAPVIKRKIE PLEVALGHLA KFTCEIQSAP NVRFQWFKAG REIYESDKCS IRSSKYISSL 

      4450       4460       4470       4480       4490       4500 
EILRTQVVDC GEYTCKASNE YGSVSCTATL TVTEAYPPTF LSRPKSLTTF VGKAAKFICT 

      4510       4520       4530       4540       4550       4560 
VTGTPVIETI WQKDGAALSP SPNWKISDAE NKHILELSNL TIQDRGVYSC KASNKFGADI 

      4570       4580       4590       4600       4610       4620 
CQAELIIIDK PHFIKELEPV QSAINKKVHL ECQVDEDRKV TVTWSKDGQK LPPGKDYKIC 

      4630       4640       4650       4660       4670       4680 
FEDKIATLEI PLAKLKDSGT YVCTASNEAG SSSCSATVTV REPPSFVKKV DPSYLMLPGE 

      4690       4700       4710       4720       4730       4740 
SARLHCKLKG SPVIQVTWFK NNKELSESNT VRMYFVNSEA ILDITDVKVE DSGSYSCEAV 

      4750       4760       4770       4780       4790       4800 
NDVGSDSCST EIVIKEPPSF IKTLEPADIV RGTNALLQCE VSGTGPFEIS WFKDKKQIRS 

      4810       4820       4830       4840       4850       4860 
SKKYRLFSQK SLVCLEIFSF NSADVGEYEC VVANEVGKCG CMATHLLKEP PTFVKKVDDL 

      4870       4880       4890       4900       4910       4920 
IALGGQTVTL QAAVRGSEPI SVTWMKGQEV IREDGKIKMS FSNGVAVLII PDVQISFGGK 

      4930       4940       4950       4960       4970       4980 
YTCLAENEAG SQTSVGELIV KEPAKIIERA ELIQVTAGDP ATLEYTVAGT PELKPKWYKD 

      4990       5000       5010       5020       5030       5040 
GRPLVASKKY RISFKNNVAQ LKFYSAELHD SGQYTFEISN EVGSSSCETT FTVLDRDIAP 

      5050       5060       5070       5080       5090       5100 
FFTKPLRNVD SVVNGTCRLD CKIAGSLPMR VSWFKDGKEI AASDRYRIAF VEGTASLEII 

      5110       5120       5130       5140       5150       5160 
RVDMNDAGNF TCRATNSVGS KDSSGALIVQ EPPSFVTKPG SKDVLPGSAV CLKSTFQGST 

      5170       5180       5190       5200       5210       5220 
PLTIRWFKGN KELVSGGSCY ITKEALESSL ELYLVKTSDS GTYTCKVSNV AGGVECSANL 

      5230       5240       5250       5260       5270       5280 
FVKEPATFVE KLEPSQLLKK GDATQLACKV TGTPPIKITW FANDREIKES SKHRMSFVES 

      5290       5300       5310       5320       5330       5340 
TAVLRLTDVG IEDSGEYMCE AQNEAGSDHC SSIVIVKESP YFTKEFKPIE VLKEYDVMLL 

      5350       5360       5370       5380       5390       5400 
AEVAGTPPFE ITWFKDNTIL RSGRKYKTFI QDHLVSLQIL KFVAADAGEY QCRVTNEVGS 

      5410       5420       5430       5440       5450       5460 
SICSARVTLR EPPSFIKKIE STSSLRGGTA AFQATLKGSL PITVTWLKDS DEITEDDNIR 

      5470       5480       5490       5500       5510       5520 
MTFENNVASL YLSGIEVKHD GKYVCQAKND AGIQRCSALL SVKEPATITE EAVSIDVTQG 

      5530       5540       5550       5560       5570       5580 
DPATLQVKFS GTKEITAKWF KDGQELTLGS KYKISVTDTV SILKIISTEK KDSGEYTFEV 

      5590       5600       5610       5620       5630       5640 
QNDVGRSSCK ARINVLDLII PPSFTKKLKK MDSIKGSFID LECIVAGSHP ISIQWFKDDQ 

      5650       5660       5670       5680       5690       5700 
EISASEKYKF SFHDNTAFLE ISQLEGTDSG TYTCSATNKA GHNQCSGHLT VKEPPYFVEK 

      5710       5720       5730       5740       5750       5760 
PQSQDVNPNT RVQLKALVGG TAPMTIKWFK DNKELHSGAA RSVWKDDTST SLELFAAKAT 

      5770       5780       5790       5800       5810       5820 
DSGTYICQLS NDVGTATSKA TLFVKEPPQF IKKPSPVLVL RNGQSTTFEC QITGTPKIRV 

      5830       5840       5850       5860       5870       5880 
SWYLDGNEIT AIQKHGISFI DGLATFQISG ARVENSGTYV CEARNDAGTA SCSIELKVKE 

      5890       5900       5910       5920       5930       5940 
PPTFIRELKP VEVVKYSDVE LECEVTGTPP FEVTWLKNNR EIRSSKKYTL TDRVSVFNLH 

      5950       5960       5970       5980       5990       6000 
ITKCDPSDTG EYQCIVSNEG GSCSCSTRVA LKEPPSFIKK IENTTTVLKS SATFQSTVAG 

      6010       6020       6030       6040       6050       6060 
SPPISITWLK DDQILDEDDN VYISFVDSVA TLQIRSVDNG HSGRYTCQAK NESGVERCYA 

      6070       6080       6090       6100       6110       6120 
FLLVQEPAQI VEKAKSVDVT EKDPMTLECV VAGTPELKVK WLKDGKQIVP SRYFSMSFEN 

      6130       6140       6150       6160       6170       6180 
NVASFRIQSV MKQDSGQYTF KVENDFGSSS CDAYLRVLDQ NIPPSFTKKL TKMDKVLGSS 

      6190       6200       6210       6220       6230       6240 
IHMECKVSGS LPISAQWFKD GKEISTSAKY RLVCHERSVS LEVNNLELED TANYTCKVSN 

      6250       6260       6270       6280       6290       6300 
VAGDDACSGI LTVKEPPSFL VKPGRQQAIP DSTVEFKAIL KGTPPFKIKW FKDDVELVSG 

      6310       6320       6330       6340       6350       6360 
PKCFIGLEGS TSFLNLYSVD ASKTGQYTCH VTNDVGSDSC TTMLLVTEPP KFVKKLEASK 

      6370       6380       6390       6400       6410       6420 
IVKAGDSSRL ECKIAGSPEI RVVWFRNEHE LPASDKYRMT FIDSVAVIQM NNLSTEDSGD 

      6430       6440       6450       6460       6470       6480 
FICEAQNPAG STSCSTKVIV KEPPVFSSFP PIVETLKNAE VSLECELSGT PPFEVVWYKD 

      6490       6500       6510       6520       6530       6540 
KRQLRSSKKY KIASKNFHTS IHILNVDTSD IGEYHCKAQN EVGSDTCVCT VKLKEPPRFV 

      6550       6560       6570       6580       6590       6600 
SKLNSLTVVA GEPAELQASI EGAQPIFVQW LKEKEEVIRE SENIRITFVE NVATLQFAKA 

      6610       6620       6630       6640       6650       6660 
EPANAGKYIC QIKNDGGMEE NMATLMVLEP AVIVEKAGPM TVTVGETCTL ECKVAGTPEL 

      6670       6680       6690       6700       6710       6720 
SVEWYKDGKL LTSSQKHKFS FYNKISSLRI LSVERQDAGT YTFQVQNNVG KSSCTAVVDV 

      6730       6740       6750       6760       6770       6780 
SDRAVPPSFT RRLKNTGGVL GASCILECKV AGSSPISVAW FHEKTKIVSG AKYQTTFSDN 

      6790       6800       6810       6820       6830       6840 
VCTLQLNSLD SSDMGNYTCV AANVAGSDEC RAVLTVQEPP SFVKEPEPLE VLPGKNVTFT 

      6850       6860       6870       6880       6890       6900 
SVIRGTPPFK VNWFRGAREL VKGDRCNIYF EDTVAELELF NIDISQSGEY TCVVSNNAGQ 

      6910       6920       6930       6940       6950       6960 
ASCTTRLFVK EPAAFLKRLS DHSVEPGKSI ILESTYTGTL PISVTWKKDG FNITTSEKCN 

      6970       6980       6990       7000       7010       7020 
IVTTEKTCIL EILNSTKRDA GQYSCEIENE AGRDVCGALV STLEPPYFVT ELEPLEAAVG 

      7030       7040       7050       7060       7070       7080 
DSVSLQCQVA GTPEITVSWY KGDTKLRPTP EYRTYFTNNV ATLVFNKVNI NDSGEYTCKA 

      7090       7100       7110       7120       7130       7140 
ENSIGTASSK TVFRIQERQL PPSFARQLKD IEQTVGLPVT LTCRLNGSAP IQVCWYRDGV 

      7150       7160       7170       7180       7190       7200 
LLRDDENLQT SFVDNVATLK ILQTDLSHSG QYSCSASNPL GTASSSARLT AREPKKSPFF 

      7210       7220       7230       7240       7250       7260 
DIKPVSIDVI AGESADFECH VTGAQPMRIT WSKDNKEIRP GGNYTITCVG NTPHLRILKV 

      7270       7280       7290       7300       7310       7320 
GKGDSGQYTC QATNDVGKDM CSAQLSVKEP PKFVKKLEAS KVAKQGESIQ LECKISGSPE 

      7330       7340       7350       7360       7370       7380 
IKVSWFRNDS ELHESWKYNM SFINSVALLT INEASAEDSG DYICEAHNGV GDASCSTALT 

      7390       7400       7410       7420       7430       7440 
VKAPPVFTQK PSPVGALKGS DVILQCEISG TPPFEVVWVK DRKQVRNSKK FKITSKHFDT 

      7450       7460       7470       7480       7490       7500 
SLHILNLEAS DVGEYHCKAT NEVGSDTCSC SVKFKEPPRF VKKLSDTSTL IGDAVELRAI 

      7510       7520       7530       7540       7550       7560 
VEGFQPISVV WLKDRGEVIR ESENTRISFI DNIATLQLGS PEASNSGKYI CQIKNDAGMR 

      7570       7580       7590       7600       7610       7620 
ECSAVLTVLE PARIIEKPEP MTVTTGNPFA LECVVTGTPE LSAKWFKDGR ELSADSKHHI 

      7630       7640       7650       7660       7670       7680 
TFINKVASLK IPCAEMSDKG LYSFEVKNSV GKSNCTVSVH VSDRIVPPSF IRKLKDVNAI 

      7690       7700       7710       7720       7730       7740 
LGASVVLECR VSGSAPISVG WFQDGNEIVS GPKCQSSFSE NVCTLNLSLL EPSDTGIYTC 

      7750       7760       7770       7780       7790       7800 
VAANVAGSDE CSAVLTVQEP PSFEQTPDSV EVLPGMSLTF TSVIRGTPPF KVKWFKGSRE 

      7810       7820       7830       7840       7850       7860 
LVPGESCNIS LEDFVTELEL FEVQPLESGD YSCLVTNDAG SASCTTHLFV KEPATFVKRL 

      7870       7880       7890       7900       7910       7920 
ADFSVETGSP IVLEATYTGT PPISVSWIKD EYLISQSERC SITMTEKSTI LEILESTIED 

      7930       7940       7950       7960       7970       7980 
YAQYSCLIEN EAGQDICEAL VSVLEPPYFI EPLEHVEAVI GEPATLQCKV DGTPEIRISW 

      7990       8000       8010       8020       8030       8040 
YKEHTKLRSA PAYKMQFKNN VASLVINKVD HSDVGEYSCK ADNSVGAVAS SAVLVIKERK 

      8050       8060       8070       8080       8090       8100 
LPPFFARKLK DVHETLGFPV AFECRINGSE PLQVSWYKDG VLLKDDANLQ TSFVHNVATL 

      8110       8120       8130       8140       8150       8160 
QILQTDQSHI GQYNCSASNP LGTASSSAKL ILSEHEVPPF FDLKPVSVDL ALGESGTFKC 

      8170       8180       8190       8200       8210       8220 
HVTGTAPIKI TWAKDNREIR PGGNYKMTLV ENTATLTVLK VGKGDAGQYT CYASNIAGKD 

      8230       8240       8250       8260       8270       8280 
SCSAHLGVQE PPRFIKKLEP SRIVKQDEFT RYECKIGGSP EIKVLWYKDE TEIQESSKFR 

      8290       8300       8310       8320       8330       8340 
MSFVDSVAVL EMHNLSVEDS GDYTCEAHNA AGSASSSTSL KVKEPPIFRK KPHPIETLKG 

      8350       8360       8370       8380       8390       8400 
ADVHLECELQ GTPPFHVSWY KDKRELRSGK KYKIMSENFL TSIHILNVDA ADIGEYQCKA 

      8410       8420       8430       8440       8450       8460 
TNDVGSDTCV GSIALKAPPR FVKKLSDIST VVGKEVQLQT TIEGAEPISV VWFKDKGEIV 

      8470       8480       8490       8500       8510       8520 
RESDNIWISY SENIATLQFS RVEPANAGKY TCQIKNDAGM QECFATLSVL EPATIVEKPE 

      8530       8540       8550       8560       8570       8580 
SIKVTTGDTC TLECTVAGTP ELSTKWFKDG KELTSDNKYK ISFFNKVSGL KIINVAPSDS 

      8590       8600       8610       8620       8630       8640 
GVYSFEVQNP VGKDSCTASL QVSDRTVPPS FTRKLKETNG LSGSSVVMEC KVYGSPPISV 

      8650       8660       8670       8680       8690       8700 
SWFHEGNEIS SGRKYQTTLT DNTCALTVNM LEESDSGDYT CIATNMAGSD ECSAPLTVRE 

      8710       8720       8730       8740       8750       8760 
PPSFVQKPDP MDVLTGTNVT FTSIVKGTPP FSVSWFKGSS ELVPGDRCNV SLEDSVAELE 

      8770       8780       8790       8800       8810       8820 
LFDVDTSQSG EYTCIVSNEA GKASCTTHLY IKAPAKFVKR LNDYSIEKGK PLILEGTFTG 

      8830       8840       8850       8860       8870       8880 
TPPISVTWKK NGINVTPSQR CNITTTEKSA ILEIPSSTVE DAGQYNCYIE NASGKDSCSA 

      8890       8900       8910       8920       8930       8940 
QILILEPPYF VKQLEPVKVS VGDSASLQCQ LAGTPEIGVS WYKGDTKLRP TTTYKMHFRN 

      8950       8960       8970       8980       8990       9000 
NVATLVFNQV DINDSGEYIC KAENSVGEVS ASTFLTVQEQ KLPPSFSRQL RDVQETVGLP 

      9010       9020       9030       9040       9050       9060 
VVFDCAISGS EPISVSWYKD GKPLKDSPNV QTSFLDNTAT LNIFKTDRSL AGQYSCTATN 

      9070       9080       9090       9100       9110       9120 
PIGSASSSAR LILTEGKNPP FFDIRLAPVD AVVGESADFE CHVTGTQPIK VSWAKDSREI 

      9130       9140       9150       9160       9170       9180 
RSGGKYQISY LENSAHLTVL KVDKGDSGQY TCYAVNEVGK DSCTAQLNIK ERLIPPSFTK 

      9190       9200       9210       9220       9230       9240 
RLSETVEETE GNSFKLEGRV AGSQPITVAW YKNNIEIQPT SNCEITFKNN TLVLQVRKAG 

      9250       9260       9270       9280       9290       9300 
MNDAGLYTCK VSNDAGSALC TSSIVIKEPK KPPVFDQHLT PVTVSEGEYV QLSCHVQGSE 

      9310       9320       9330       9340       9350       9360 
PIRIQWLKAG REIKPSDRCS FSFASGTAVL ELRDVAKADS GDYVCKASNV AGSDTTKSKV 

      9370       9380       9390       9400       9410       9420 
TIKDKPAVAP ATKKAAVDGR LFFVSEPQSI RVVEKTTATF IAKVGGDPIP NVKWTKGKWR 

      9430       9440       9450       9460       9470       9480 
QLNQGGRVFI HQKGDEAKLE IRDTTKTDSG LYRCVAFNEH GEIESNVNLQ VDERKKQEKI 

      9490       9500       9510       9520       9530       9540 
EGDLRAMLKK TPILKKGAGE EEEIDIMELL KNVDPKEYEK YARMYGITDF RGLLQAFELL 

      9550       9560       9570       9580       9590       9600 
KQSQEEETHR LEIEEIERSE RDEKEFEELV SFIQQRLSQT EPVTLIKDIE NQTVLKDNDA 

      9610       9620       9630       9640       9650       9660 
VFEIDIKINY PEIKLSWYKG TEKLEPSDKF EISIDGDRHT LRVKNCQLKD QGNYRLVCGP 

      9670       9680       9690       9700       9710       9720 
HIASAKLTVI EPAWERHLQD VTLKEGQTCT MTCQFSVPNV KSEWFRNGRI LKPQGRHKTE 

      9730       9740       9750       9760       9770       9780 
VEHKVHKLTI ADVRAEDQGQ YTCKYEDLET SAELRIEAEP IQFTKRIQNI VVSEHQSATF 

      9790       9800       9810       9820       9830       9840 
ECEVSFDDAI VTWYKGPTEL TESQKYNFRN DGRCHYMTIH NVTPDDEGVY SVIARLEPRG 

      9850       9860       9870       9880       9890       9900 
EARSTAELYL TTKEIKLELK PPDIPDSRVP IPTMPIRAVP PEEIPPVVAP PIPLLLPTPE 

      9910       9920       9930       9940       9950       9960 
EKKPPPKRIE VTKKAVKKDA KKVVAKPKEM TPREEIVKKP PPPTTLIPAK APEIIDVSSK 

      9970       9980       9990      10000      10010      10020 
AEEVKIMTIT RKKEVQKEKE AVYEKKQAVH KEKRVFIESF EEPYDELEVE PYTEPFEQPY 

     10030      10040      10050      10060      10070      10080 
YEEPDEDYEE IKVEAKKEVH EEWEEDFEEG QEYYEREEGY DEGEEEWEEA YQEREVIQVQ 

     10090      10100      10110      10120      10130      10140 
KEVYEESHER KVPAKVPEKK APPPPKVIKK PVIEKIEKTS RRMEEEKVQV TKVPEVSKKI 

     10150      10160      10170      10180      10190      10200 
VPQKPSRTPV QEEVIEVKVP AVHTKKMVIS EEKMFFASHT EEEVSVTVPE VQKEIVTEEK 

     10210      10220      10230      10240      10250      10260 
IHVAVSKRVE PPPKVPELPE KPAPEEVAPV PIPKKVEPPA PKVPEVPKKP VPEEKKPVPV 

     10270      10280      10290      10300      10310      10320 
PKKEPAAPPK VPEVPKKPVP EEKIPVPVAK KKEAPPAKVP EVQKRVVTEE KITIVTQREE 

     10330      10340      10350      10360      10370      10380 
SPPPAVPEIP KKKVPEERKP VPRKEEEVPP PPKVPALPKK PVPEEKVAVP VPVAKKAPPP 

     10390      10400      10410      10420      10430      10440 
RAEVSKKTVV EEKRFVAEEK LSFAVPQRVE VTRHEVSAEE EWSYSEEEEG VSISVYREEE 

     10450      10460      10470      10480      10490      10500 
REEEEEAEVT EYEVMEEPEE YVVEEKLHII SKRVEAEPAE VTERQEKKIV LKPKIPAKIE 

     10510      10520      10530      10540      10550      10560 
EPPPAKVPEA PKKIVPEKKV PAPVPKKEKV PPPKVPEEPK KPVPEKKVPP KVIKMEEPLP 

     10570      10580      10590      10600      10610      10620 
AKVTERHMQI TQEEKVLVAV TKKEAPPKAR VPEEPKRAVP EEKVLKLKPK REEEPPAKVT 

     10630      10640      10650      10660      10670      10680 
EFRKRVVKEE KVSIEAPKRE PQPIKEVTIM EEKERAYTLE EEAVSVQREE EYEEYEEYDY 

     10690      10700      10710      10720      10730      10740 
KEFEEYEPTE EYDQYEEYEE REYERYEEHE EYITEPEKPI PVKPVPEEPV PTKPKAPPAK 

     10750      10760      10770      10780      10790      10800 
VLKKAVPEEK VPVPIPKKLK PPPPKVPEEP KKVFEEKIRI SITKREKEQV TEPAAKVPMK 

     10810      10820      10830      10840      10850      10860 
PKRVVAEEKV PVPRKEVAPP VRVPEVPKEL EPEEVAFEEE VVTHVEEYLV EEEEEYIHEE 

     10870      10880      10890      10900      10910      10920 
EEFITEEEVV PVIPVKVPEV PRKPVPEEKK PVPVPKKKEA PPAKVPEVPK KPEEKVPVLI 

     10930      10940      10950      10960      10970      10980 
PKKEKPPPAK VPEVPKKPVP EEKVPVPVPK KVEAPPAKVP EVPKKPVPEK KVPVPAPKKV 

     10990      11000      11010      11020      11030      11040 
EAPPAKVPEV PKKLIPEEKK PTPVPKKVEA PPPKVPKKRE PVPVPVALPQ EEEVLFEEEI 

     11050      11060      11070      11080      11090      11100 
VPEEEVLPEE EEVLPEEEEV LPEEEEVLPE EEEIPPEEEE VPPEEEYVPE EEEFVPEEEV 

     11110      11120      11130      11140      11150      11160 
LPEVKPKVPV PAPVPEIKKK VTEKKVVIPK KEEAPPAKVP EVPKKVEEKR IILPKEEEVL 

     11170      11180      11190      11200      11210      11220 
PVEVTEEPEE EPISEEEIPE EPPSIEEVEE VAPPRVPEVI KKAVPEAPTP VPKKVEAPPA 

     11230      11240      11250      11260      11270      11280 
KVSKKIPEEK VPVPVQKKEA PPAKVPEVPK KVPEKKVLVP KKEAVPPAKG RTVLEEKVSV 

     11290      11300      11310      11320      11330      11340 
AFRQEVVVKE RLELEVVEAE VEEIPEEEEF HEVEEYFEEG EFHEVEEFIK LEQHRVEEEH 

     11350      11360      11370      11380      11390      11400 
RVEKVHRVIE VFEAEEVEVF EKPKAPPKGP EISEKIIPPK KPPTKVVPRK EPPAKVPEVP 

     11410      11420      11430      11440      11450      11460 
KKIVVEEKVR VPEEPRVPPT KVPDVLPPKE VVPEKKVPVP PAKKPEAPPP KVPEAPKEVV 

     11470      11480      11490      11500      11510      11520 
PEKKVPVPPP KKPEVPPTKV PEVPKAAVPE KKVPEAIPPK PESPPPEVPE APKEVVPEKK 

     11530      11540      11550      11560      11570      11580 
VPAAPPKKPE VTPVKVPEAP KEVVPEKKVP VPPPKKPEVP PTKVPEVPKV AVPEKKVPEA 

     11590      11600      11610      11620      11630      11640 
IPPKPESPPP EVFEEPEEVA LEEPPAEVVE EPEPAAPPQV TVPPKKPVPE KKAPAVVAKK 

     11650      11660      11670      11680      11690      11700 
PELPPVKVPE VPKEVVPEKK VPLVVPKKPE APPAKVPEVP KEVVPEKKVA VPKKPEVPPA 

     11710      11720      11730      11740      11750      11760 
KVPEVPKKPV LEEKPAVPVP ERAESPPPEV YEEPEEIAPE EEIAPEEEKP VPVAEEEEPE 

     11770      11780      11790      11800      11810      11820 
VPPPAVPEEP KKIIPEKKVP VIKKPEAPPP KEPEPEKVIE KPKLKPRPPP PPPAPPKEDV 

     11830      11840      11850      11860      11870      11880 
KEKIFQLKAI PKKKVPEKPQ VPEKVELTPL KVPGGEKKVR KLLPERKPEP KEEVVLKSVL 

     11890      11900      11910      11920      11930      11940 
RKRPEEEEPK VEPKKLEKVK KPAVPEPPPP KPVEEVEVPT VTKRERKIPE PTKVPEIKPA 

     11950      11960      11970      11980      11990      12000 
IPLPAPEPKP KPEAEVKTIK PPPVEPEPTP IAAPVTVPVV GKKAEAKAPK EEAAKPKGPI 

     12010      12020      12030      12040      12050      12060 
KGVPKKTPSP IEAERRKLRP GSGGEKPPDE APFTYQLKAV PLKFVKEIKD IILTESEFVG 

     12070      12080      12090      12100      12110      12120 
SSAIFECLVS PSTAITTWMK DGSNIRESPK HRFIADGKDR KLHIIDVQLS DAGEYTCVLR 

     12130      12140      12150      12160      12170      12180 
LGNKEKTSTA KLVVEELPVR FVKTLEEEVT VVKGQPLYLS CELNKERDVV WRKDGKIVVE 

     12190      12200      12210      12220      12230      12240 
KPGRIVPGVI GLMRALTIND ADDTDAGTYT VTVENANNLE CSSCVKVVEV IRDWLVKPIR 

     12250      12260      12270      12280      12290      12300 
DQHVKPKGTA IFACDIAKDT PNIKWFKGYD EIPAEPNDKT EILRDGNHLY LKIKNAMPED 

     12310      12320      12330      12340      12350      12360 
IAEYAVEIEG KRYPAKLTLG EREVELLKPI EDVTIYEKES ASFDAEISEA DIPGQWKLKG 

     12370      12380      12390      12400      12410      12420 
ELLRPSPTCE IKAEGGKRFL TLRKVKLDQA GEVLYQALNA ITTAILTVKE IELDFAVPLK 

     12430      12440      12450      12460      12470      12480 
DVTVPERRQA RFECVLTREA NVIWSKGPDI IKSSDKFDII ADGKKHILVI NDSQFDDEGV 

     12490      12500      12510      12520      12530      12540 
YTAEVEGKKT SARLFVTGIR LKFMSPLEDQ TVKEGETATF VCELSHEKMH VVWFKNDAKL 

     12550      12560      12570      12580      12590      12600 
HTSRTVLISS EGKTHKLEMK EVTLDDISQI KAQVKELSST AQLKVLEADP YFTVKLHDKT 

     12610      12620      12630      12640      12650      12660 
AVEKDEITLK CEVSKDVPVK WFKDGEEIVP SPKYSIKADG LRRILKIKKA DLKDKGEYVC 

     12670      12680      12690      12700      12710      12720 
DCGTDKTKAN VTVEARLIKV EKPLYGVEVF VGETAHFEIE LSEPDVHGQW KLKGQPLTAS 

     12730      12740      12750      12760      12770      12780 
PDCEIIEDGK KHILILHNCQ LGMTGEVSFQ AANAKSAANL KVKELPLIFI TPLSDVKVFE 

     12790      12800      12810      12820      12830      12840 
KDEAKFECEV SREPKTFRWL KGTQEITGDD RFELIKDGTK HSMVIKSAAF EDEAKYMFEA 

     12850      12860      12870      12880      12890      12900 
EDKHTSGKLI IEGIRLKFLT PLKDVTAKEK ESAVFTVELS HDNIRVKWFK NDQRLHTTRS 

     12910      12920      12930      12940      12950      12960 
VSMQDEGKTH SITFKDLSID DTSQIRVEAM GMSSEAKLTV LEGDPYFTGK LQDYTGVEKD 

     12970      12980      12990      13000      13010      13020 
EVILQCEISK ADAPVKWFKD GKEIKPSKNA VIKADGKKRM LILKKALKSD IGQYTCDCGT 

     13030      13040      13050      13060      13070      13080 
DKTSGKLDIE DREIKLVRPL HSVEVMETET ARFETEISED DIHANWKLKG EALLQTPDCE 

     13090      13100      13110      13120      13130      13140 
IKEEGKIHSL VLHNCRLDQT GGVDFQAANV KSSAHLRVKP RVIGLLRPLK DVTVTAGETA 

     13150      13160      13170      13180      13190      13200 
TFDCELSYED IPVEWYLKGK KLEPSDKVVP RSEGKVHTLT LRDVKLEDAG EVQLTAKDFK 

     13210      13220      13230      13240      13250      13260 
THANLFVKEP PVEFTKPLED QTVEEGATAV LECEVSRENA KVKWFKNGTE ILKSKKYEIV 

     13270      13280      13290      13300      13310      13320 
ADGRVRKLVI HDCTPEDIKT YTCDAKDFKT SCNLNVVPPH VEFLRPLTDL QVREKEMARF 

     13330      13340      13350      13360      13370      13380 
ECELSRENAK VKWFKDGAEI KKGKKYDIIS KGAVRILVIN KCLLDDEAEY SCEVRTARTS 

     13390      13400      13410      13420      13430      13440 
GMLTVLEEEA VFTKNLANIE VSETDTIKLV CEVSKPGAEV IWYKGDEEII ETGRYEILTE 

     13450      13460      13470      13480      13490      13500 
GRKRILVIQN AHLEDAGNYN CRLPSSRTDG KVKVHELAAE FISKPQNLEI LEGEKAEFVC 

     13510      13520      13530      13540      13550      13560 
SISKESFPVQ WKRDDKTLES GDKYDVIADG KKRVLVVKDA TLQDMGTYVV MVGAARAAAH 

     13570      13580      13590      13600      13610      13620 
LTVIEKLRIV VPLKDTRVKE QQEVVFNCEV NTEGAKAKWF RNEEAIFDSS KYIILQKDLV 

     13630      13640      13650      13660      13670      13680 
YTLRIRDAHL DDQANYNVSL TNHRGENVKS AANLIVEEED LRIVEPLKDI ETMEKKSVTF 

     13690      13700      13710      13720      13730      13740 
WCKVNRLNVT LKWTKNGEEV PFDNRVSYRV DKYKHMLTIK DCGFPDEGEY IVTAGQDKSV 

     13750      13760      13770      13780      13790      13800 
AELLIIEAPT EFVEHLEDQT VTEFDDAVFS CQLSREKANV KWYRNGREIK EGKKYKFEKD 

     13810      13820      13830      13840      13850      13860 
GSIHRLIIKD CRLDDECEYA CGVEDRKSRA RLFVEEIPVE IIRPPQDILE APGADVVFLA 

     13870      13880      13890      13900      13910      13920 
ELNKDKVEVQ WLRNNMVVVQ GDKHQMMSEG KIHRLQICDI KPRDQGEYRF IAKDKEARAK 

     13930      13940      13950      13960      13970      13980 
LELAAAPKIK TADQDLVVDV GKPLTMVVPY DAYPKAEAEW FKENEPLSTK TIDTTAEQTS 

     13990      14000      14010      14020      14030      14040 
FRILEAKKGD KGRYKIVLQN KHGKAEGFIN LKVIDVPGPV RNLEVTETFD GEVSLAWEEP 

     14050      14060      14070      14080      14090      14100 
LTDGGSKIIG YVVERRDIKR KTWVLATDRA ESCEFTVTGL QKGGVEYLFR VSARNRVGTG 

     14110      14120      14130      14140      14150      14160 
EPVETDNPVE ARSKYDVPGP PLNVTITDVN RFGVSLTWEP PEYDGGAEIT NYVIELRDKT 

     14170      14180      14190      14200      14210      14220 
SIRWDTAMTV RAEDLSATVT DVVEGQEYSF RVRAQNRIGV GKPSAATPFV KVADPIERPS 

     14230      14240      14250      14260      14270      14280 
PPVNLTSSDQ TQSSVQLKWE PPLKDGGSPI LGYIIERCEE GKDNWIRCNM KLVPELTYKV 

     14290      14300      14310      14320      14330      14340 
TGLEKGNKYL YRVSAENKAG VSDPSEILGP LTADDAFVEP TMDLSAFKDG LEVIVPNPIT 

     14350      14360      14370      14380      14390      14400 
ILVPSTGYPR PTATWCFGDK VLETGDRVKM KTLSAYAELV ISPSERSDKG IYTLKLENRV 

     14410      14420      14430      14440      14450      14460 
KTISGEIDVN VIARPSAPKE LKFGDITKDS VHLTWEPPDD DGGSPLTGYV VEKREVSRKT 

     14470      14480      14490      14500      14510      14520 
WTKVMDFVTD LEFTVPDLVQ GKEYLFKVCA RNKCGPGEPA YVDEPVNMST PATVPDPPEN 

     14530      14540      14550      14560      14570      14580 
VKWRDRTANS IFLTWDPPKN DGGSRIKGYI VERCPRGSDK WVACGEPVAE TKMEVTGLEE 

     14590      14600      14610      14620      14630      14640 
GKWYAYRVKA LNRQGASKPS RPTEEIQAVD TQEAPEIFLD VKLLAGLTVK AGTKIELPAT 

     14650      14660      14670      14680      14690      14700 
VTGKPEPKIT WTKADMILKQ DKRITIENVP KKSTVTIVDS KRSDTGTYII EAVNVCGRAT 

     14710      14720      14730      14740      14750      14760 
AVVEVNVLDK PGPPAAFDIT DVTNESCLLT WNPPRDDGGS KITNYVVERR ATDSEVWHKL 

     14770      14780      14790      14800      14810      14820 
SSTVKDTNFK ATKLIPNKEY IFRVAAENMY GVGEPVQASP ITAKYQFDPP GPPTRLEPSD 

     14830      14840      14850      14860      14870      14880 
ITKDAVTLTW CEPDDDGGSP ITGYWVERLD PDTDKWVRCN KMPVKDTTYR VKGLTNKKKY 

     14890      14900      14910      14920      14930      14940 
RFRVLAENLA GPGKPSKSTE PILIKDPIDP PWPPGKPTVK DVGKTSVRLN WTKPEHDGGA 

     14950      14960      14970      14980      14990      15000 
KIESYVIEML KTGTDEWVRV AEGVPTTQHL LPGLMEGQEY SFRVRAVNKA GESEPSEPSD 

     15010      15020      15030      15040      15050      15060 
PVLCREKLYP PSPPRWLEVI NITKNTADLK WTVPEKDGGS PITNYIVEKR DVRRKGWQTV 

     15070      15080      15090      15100      15110      15120 
DTTVKDTKCT VTPLTEGSLY VFRVAAENAI GQSDYTEIED SVLAKDTFTT PGPPYALAVV 

     15130      15140      15150      15160      15170      15180 
DVTKRHVDLK WEPPKNDGGR PIQRYVIEKK ERLGTRWVKA GKTAGPDCNF RVTDVIEGTE 

     15190      15200      15210      15220      15230      15240 
VQFQVRAENE AGVGHPSEPT EILSIEDPTS PPSPPLDLHV TDAGRKHIAI AWKPPEKNGG 

     15250      15260      15270      15280      15290      15300 
SPIIGYHVEM CPVGTEKWMR VNSRPIKDLK FKVEEGVVPD KEYVLRVRAV NAIGVSEPSE 

     15310      15320      15330      15340      15350      15360 
ISENVVAKDP DCKPTIDLET HDIIVIEGEK LSIPVPFRAV PVPTVSWHKD GKEVKASDRL 

     15370      15380      15390      15400      15410      15420 
TMKNDHISAH LEVPKSVRAD AGIYTITLEN KLGSATASIN VKVIGLPGPC KDIKASDITK 

     15430      15440      15450      15460      15470      15480 
SSCKLTWEPP EFDGGTPILH YVLERREAGR RTYIPVMSGE NKLSWTVKDL IPNGEYFFRV 

     15490      15500      15510      15520      15530      15540 
KAVNKVGGGE YIELKNPVIA QDPKQPPDPP VDVEVHNPTA EAMTITWKPP LYDGGSKIMG 

     15550      15560      15570      15580      15590      15600 
YIIEKIAKGE ERWKRCNEHL VPILTYTAKG LEEGKEYQFR VRAENAAGIS EPSRATPPTK 

     15610      15620      15630      15640      15650      15660 
AVDPIDAPKV ILRTSLEVKR GDEIALDASI SGSPYPTITW IKDENVIVPE EIKKRAAPLV 

     15670      15680      15690      15700      15710      15720 
RRRKGEVQEE EPFVLPLTQR LSIDNSKKGE SQLRVRDSLR PDHGLYMIKV ENDHGIAKAP 

     15730      15740      15750      15760      15770      15780 
CTVSVLDTPG PPINFVFEDI RKTSVLCKWE PPLDDGGSEI INYTLEKKDK TKPDSEWIVV 

     15790      15800      15810      15820      15830      15840 
TSTLRHCKYS VTKLIEGKEY LFRVRAENRF GPGPPCVSKP LVAKDPFGPP DAPDKPIVED 

     15850      15860      15870      15880      15890      15900 
VTSNSMLVKW NEPKDNGSPI LGYWLEKREV NSTHWSRVNK SLLNALKANV DGLLEGLTYV 

     15910      15920      15930      15940      15950      15960 
FRVCAENAAG PGKFSPPSDP KTAHDPISPP GPPIPRVTDT SSTTIELEWE PPAFNGGGEI 

     15970      15980      15990      16000      16010      16020 
VGYFVDKQLV GTNEWSRCTE KMIKVRQYTV KEIREGADYK LRVSAVNAAG EGPPGETQPV 

     16030      16040      16050      16060      16070      16080 
TVAEPQEPPA VELDVSVKGG IQIMAGKTLR IPAVVTGRPV PTKVWTKEEG ELDKDRVVID 

     16090      16100      16110      16120      16130      16140 
NVGTKSELII KDALRKDHGR YVITATNSCG SKFAAARVEV FDVPGPVLDL KPVVTNRKMC 

     16150      16160      16170      16180      16190      16200 
LLNWSDPEDD GGSEITGFII ERKDAKMHTW RQPIETERSK CDITGLLEGQ EYKFRVIAKN 

     16210      16220      16230      16240      16250      16260 
KFGCGPPVEI GPILAVDPLG PPTSPERLTY TERTKSTITL DWKEPRSNGG SPIQGYIIEK 

     16270      16280      16290      16300      16310      16320 
RRHDKPDFER VNKRLCPTTS FLVENLDEHQ MYEFRVKAVN EIGESEPSLP LNVVIQDDEV 

     16330      16340      16350      16360      16370      16380 
PPTIKLRLSV RGDTIKVKAG EPVHIPADVT GLPMPKIEWS KNETVIEKPT DALQITKEEV 

     16390      16400      16410      16420      16430      16440 
SRSEAKTELS IPKAVREDKG TYTVTASNRL GSVFRNVHVE VYDRPSPPRN LAVTDIKAES 

     16450      16460      16470      16480      16490      16500 
CYLTWDAPLD NGGSEITHYV IDKRDASRKK AEWEEVTNTA VEKRYGIWKL IPNGQYEFRV 

     16510      16520      16530      16540      16550      16560 
RAVNKYGISD ECKSDKVVIQ DPYRLPGPPG KPKVLARTKG SMLVSWTPPL DNGGSPITGY 

     16570      16580      16590      16600      16610      16620 
WLEKREEGSP YWSRVSRAPI TKVGLKGVEF NVPRLLEGVK YQFRAMAINA AGIGPPSEPS 

     16630      16640      16650      16660      16670      16680 
DPEVAGDPIF PPGPPSCPEV KDKTKSSISL GWKPPAKDGG SPIKGYIVEM QEEGTTDWKR 

     16690      16700      16710      16720      16730      16740 
VNEPDKLITT CECVVPNLKE LRKYRFRVKA VNEAGESEPS DTTGEIPATD IQEEPEVFID 

     16750      16760      16770      16780      16790      16800 
IGAQDCLVCK AGSQIRIPAV IKGRPTPKSS WEFDGKAKKA MKDGVHDIPE DAQLETAENS 

     16810      16820      16830      16840      16850      16860 
SVIIIPECKR SHTGKYSITA KNKAGQKTAN CRVKVMDVPG PPKDLKVSDI TRGSCRLSWK 

     16870      16880      16890      16900      16910      16920 
MPDDDGGDRI KGYVIEKRTI DGKAWTKVNP DCGSTTFVVP DLLSEQQYFF RVRAENRFGI 

     16930      16940      16950      16960      16970      16980 
GPPVETIQRT TARDPIYPPD PPIKLKIGLI TKNTVHLSWK PPKNDGGSPV THYIVECLAW 

     16990      17000      17010      17020      17030      17040 
DPTGTKKEAW RQCNKRDVEE LQFTVEDLVE GGEYEFRVKA VNAAGVSKPS ATVGPCDCQR 

     17050      17060      17070      17080      17090      17100 
PDMPPSIDLK EFMEVEEGTN VNIVAKIKGV PFPTLTWFKA PPKKPDNKEP VLYDTHVNKL 

     17110      17120      17130      17140      17150      17160 
VVDDTCTLVI PQSRRSDTGL YTITAVNNLG TASKEMRLNV LGRPGPPVGP IKFESVSADQ 

     17170      17180      17190      17200      17210      17220 
MTLSWFPPKD DGGSKITNYV IEKREANRKT WVHVSSEPKE CTYTIPKLLE GHEYVFRIMA 

     17230      17240      17250      17260      17270      17280 
QNKYGIGEPL DSEPETARNL FSVPGAPDKP TVSSVTRNSM TVNWEEPEYD GGSPVTGYWL 

     17290      17300      17310      17320      17330      17340 
EMKDTTSKRW KRVNRDPIKA MTLGVSYKVT GLIEGSDYQF RVYAINAAGV GPASLPSDPA 

     17350      17360      17370      17380      17390      17400 
TARDPIAPPG PPFPKVTDWT KSSADLEWSP PLKDGGSKVT GYIVEYKEEG KEEWEKGKDK 

     17410      17420      17430      17440      17450      17460 
EVRGTKLVVT GLKEGAFYKF RVSAVNIAGI GEPGEVTDVI EMKDRLVSPD LQLDASVRDR 

     17470      17480      17490      17500      17510      17520 
IVVHAGGVIR IIAYVSGKPP PTVTWNMNER TLPQEATIET TAISSSMVIK NCQRSHQGVY 

     17530      17540      17550      17560      17570      17580 
SLLAKNEAGE RKKTIIVDVL DVPGPVGTPF LAHNLTNESC KLTWFSPEDD GGSPITNYVI 

     17590      17600      17610      17620      17630      17640 
EKRESDRRAW TPVTYTVTRQ NATVQGLIQG KAYFFRIAAE NSIGMGPFVE TSEALVIREP 

     17650      17660      17670      17680      17690      17700 
ITVPERPEDL EVKEVTKNTV TLTWNPPKYD GGSEIINYVL ESRLIGTEKF HKVTNDNLLS 

     17710      17720      17730      17740      17750      17760 
RKYTVKGLKE GDTYEYRVSA VNIVGQGKPS FCTKPITCKD ELAPPTLHLD FRDKLTIRVG 

     17770      17780      17790      17800      17810      17820 
EAFALTGRYS GKPKPKVSWF KDEADVLEDD RTHIKTTPAT LALEKIKAKR SDSGKYCVVV 

     17830      17840      17850      17860      17870      17880 
ENSTGSRKGF CQVNVVDRPG PPVGPVSFDE VTKDYMVISW KPPLDDGGSK ITNYIIEKKE 

     17890      17900      17910      17920      17930      17940 
VGKDVWMPVT SASAKTTCKV SKLLEGKDYI FRIHAENLYG ISDPLVSDSM KAKDRFRVPD 

     17950      17960      17970      17980      17990      18000 
APDQPIVTEV TKDSALVTWN KPHDGGKPIT NYILEKRETM SKRWARVTKD PIHPYTKFRV 

     18010      18020      18030      18040      18050      18060 
PDLLEGCQYE FRVSAENEIG IGDPSPPSKP VFAKDPIAKP SPPVNPEAID TTCNSVDLTW 

     18070      18080      18090      18100      18110      18120 
QPPRHDGGSK ILGYIVEYQK VGDEEWRRAN HTPESCPETK YKVTGLRDGQ TYKFRVLAVN 

     18130      18140      18150      18160      18170      18180 
AAGESDPAHV PEPVLVKDRL EPPELILDAN MAREQHIKVG DTLRLSAIIK GVPFPKVTWK 

     18190      18200      18210      18220      18230      18240 
KEDRDAPTKA RIDVTPVGSK LEIRNAAHED GGIYSLTVEN PAGSKTVSVK VLVLDKPGPP 

     18250      18260      18270      18280      18290      18300 
RDLEVSEIRK DSCYLTWKEP LDDGGSVITN YVVERRDVAS AQWSPLSATS KKKSHFAKHL 

     18310      18320      18330      18340      18350      18360 
NEGNQYLFRV AAENQYGRGP FVETPKPIKA LDPLHPPGPP KDLHHVDVDK TEVSLVWNKP 

     18370      18380      18390      18400      18410      18420 
DRDGGSPITG YLVEYQEEGT QDWIKFKTVT NLECVVTGLQ QGKTYRFRVK AENIVGLGLP 

     18430      18440      18450      18460      18470      18480 
DTTIPIECQE KLVPPSVELD VKLIEGLVVK AGTTVRFPAI IRGVPVPTAK WTTDGSEIKT 

     18490      18500      18510      18520      18530      18540 
DEHYTVETDN FSSVLTIKNC LRRDTGEYQI TVSNAAGSKT VAVHLTVLDV PGPPTGPINI 

     18550      18560      18570      18580      18590      18600 
LDVTPEHMTI SWQPPKDDGG SPVINYIVEK QDTRKDTWGV VSSGSSKTKL KIPHLQKGCE 

     18610      18620      18630      18640      18650      18660 
YVFRVRAENK IGVGPPLDST PTVAKHKFSP PSPPGKPVVT DITENAATVS WTLPKSDGGS 

     18670      18680      18690      18700      18710      18720 
PITGYYMERR EVTGKWVRVN KTPIADLKFR VTGLYEGNTY EFRVFAENLA GLSKPSPSSD 

     18730      18740      18750      18760      18770      18780 
PIKACRPIKP PGPPINPKLK DKSRETADLV WTKPLSDGGS PILGYVVECQ KPGTAQWNRI 

     18790      18800      18810      18820      18830      18840 
NKDELIRQCA FRVPGLIEGN EYRFRIKAAN IVGEGEPREL AESVIAKDIL HPPEVELDVT 

     18850      18860      18870      18880      18890      18900 
CRDVITVRVG QTIRILARVK GRPEPDITWT KEGKVLVREK RVDLIQDLPR VELQIKEAVR 

     18910      18920      18930      18940      18950      18960 
ADHGKYIISA KNSSGHAQGS AIVNVLDRPG PCQNLKVTNV TKENCTISWE NPLDNGGSEI 

     18970      18980      18990      19000      19010      19020 
TNFIVEYRKP NQKGWSIVAS DVTKRLIKAN LLANNEYYFR VCAENKVGVG PTIETKTPIL 

     19030      19040      19050      19060      19070      19080 
AINPIDRPGE PENLHIADKG KTFVYLKWRR PDYDGGSPNL SYHVERRLKG SDDWERVHKG 

     19090      19100      19110      19120      19130      19140 
SIKETHYMVD RCVENQIYEF RVQTKNEGGE SDWVKTEEVV VKEDLQKPVL DLKLSGVLTV 

     19150      19160      19170      19180      19190      19200 
KAGDTIRLEA GVRGKPFPEV AWTKDKDATD LTRSPRVKID TRADSSKFSL TKAKRSDGGK 

     19210      19220      19230      19240      19250      19260 
YVVTATNTAG SFVAYATVNV LDKPGPVRNL KIVDVSSDRC TVCWDPPEDD GGCEIQNYIL 

     19270      19280      19290      19300      19310      19320 
EKCETKRMVW STYSATVLTP GTTVTRLIEG NEYIFRVRAE NKIGTGPPTE SKPVIAKTKY 

     19330      19340      19350      19360      19370      19380 
DKPGRPDPPE VTKVSKEEMT VVWNPPEYDG GKSITGYFLE KKEKHSTRWV PVNKSAIPER 

     19390      19400      19410      19420      19430      19440 
RMKVQNLLPD HEYQFRVKAE NEIGIGEPSL PSRPVVAKDP IEPPGPPTNF RVVDTTKHSI 

     19450      19460      19470      19480      19490      19500 
TLGWGKPVYD GGAPIIGYVV EMRPKIADAS PDEGWKRCNA AAQLVRKEFT VTSLDENQEY 

     19510      19520      19530      19540      19550      19560 
EFRVCAQNQV GIGRPAELKE AIKPKEILEP PEIDLDASMR KLVIVRAGCP IRLFAIVRGR 

     19570      19580      19590      19600      19610      19620 
PAPKVTWRKV GIDNVVRKGQ VDLVDTMAFL VIPNSTRDDS GKYSLTLVNP AGEKAVFVNV 

     19630      19640      19650      19660      19670      19680 
RVLDTPGPVS DLKVSDVTKT SCHVSWAPPE NDGGSQVTHY IVEKREADRK TWSTVTPEVK 

     19690      19700      19710      19720      19730      19740 
KTSFHVTNLV PGNEYYFRVT AVNEYGPGVP TDVPKPVLAS DPLSEPDPPR KLEVTEMTKN 

     19750      19760      19770      19780      19790      19800 
SATLAWLPPL RDGGAKIDGY ITSYREEEQP ADRWTEYSVV KDLSLVVTGL KEGKKYKFRV 

     19810      19820      19830      19840      19850      19860 
AARNAVGVSL PREAEGVYEA KEQLLPPKIL MPEQITIKAG KKLRIEAHVY GKPHPTCKWK 

     19870      19880      19890      19900      19910      19920 
KGEDEVVTSS HLAVHKADSS SILIIKDVTR KDSGYYSLTA ENSSGTDTQK IKVVVMDAPG 

     19930      19940      19950      19960      19970      19980 
PPQPPFDISD IDADACSLSW HIPLEDGGSN ITNYIVEKCD VSRGDWVTAL ASVTKTSCRV 

     19990      20000      20010      20020      20030      20040 
GKLIPGQEYI FRVRAENRFG ISEPLTSPKM VAQFPFGVPS EPKNARVTKV NKDCIFVAWD 

     20050      20060      20070      20080      20090      20100 
RPDSDGGSPI IGYLIERKER NSLLWVKAND TLVRSTEYPC AGLVEGLEYS FRIYALNKAG 

     20110      20120      20130      20140      20150      20160 
SSPPSKPTEY VTARMPVDPP GKPEVIDVTK STVSLIWARP KHDGGSKIIG YFVEACKLPG 

     20170      20180      20190      20200      20210      20220 
DKWVRCNTAP HQIPQEEYTA TGLEEKAQYQ FRAIARTAVN ISPPSEPSDP VTILAENVPP 

     20230      20240      20250      20260      20270      20280 
RIDLSVAMKS LLTVKAGTNV CLDATVFGKP MPTVSWKKDG TLLKPAEGIK MAMQRNLCTL 

     20290      20300      20310      20320      20330      20340 
ELFSVNRKDS GDYTITAENS SGSKSATIKL KVLDKPGPPA SVKINKMYSD RAMLSWEPPL 

     20350      20360      20370      20380      20390      20400 
EDGGSEITNY IVDKRETSRP NWAQVSATVP ITSCSVEKLI EGHEYQFRIC AENKYGVGDP 

     20410      20420      20430      20440      20450      20460 
VFTEPAIAKN PYDPPGRCDP PVISNITKDH MTVSWKPPAD DGGSPITGYL LEKRETQAVN 

     20470      20480      20490      20500      20510      20520 
WTKVNRKPII ERTLKATGLQ EGTEYEFRVT AINKAGPGKP SDASKAAYAR DPQYPPAPPA 

     20530      20540      20550      20560      20570      20580 
FPKVYDTTRS SVSLSWGKPA YDGGSPIIGY LVEVKRADSD NWVRCNLPQN LQKTRFEVTG 

     20590      20600      20610      20620      20630      20640 
LMEDTQYQFR VYAVNKIGYS DPSDVPDKHY PKDILIPPEG ELDADLRKTL ILRAGVTMRL 

     20650      20660      20670      20680      20690      20700 
YVPVKGRPPP KITWSKPNVN LRDRIGLDIK STDFDTFLRC ENVNKYDAGK YILTLENSCG 

     20710      20720      20730      20740      20750      20760 
KKEYTIVVKV LDTPGPPVNV TVKEISKDSA YVTWEPPIID GGSPIINYVV QKRDAERKSW 

     20770      20780      20790      20800      20810      20820 
STVTTECSKT SFRVANLEEG KSYFFRVFAE NEYGIGDPGE TRDAVKASQT PGPVVDLKVR 

     20830      20840      20850      20860      20870      20880 
SVSKSSCSIG WKKPHSDGGS RIIGYVVDFL TEENKWQRVM KSLSLQYSAK DLTEGKEYTF 

     20890      20900      20910      20920      20930      20940 
RVSAENENGE GTPSEITVVA RDDVVAPDLD LKGLPDLCYL AKENSNFRLK IPIKGKPAPS 

     20950      20960      20970      20980      20990      21000 
VSWKKGEDPL ATDTRVSVES SAVNTTLIVY DCQKSDAGKY TITLKNVAGT KEGTISIKVV 

     21010      21020      21030      21040      21050      21060 
GKPGIPTGPI KFDEVTAEAM TLKWAPPKDD GGSEITNYIL EKRDSVNNKW VTCASAVQKT 

     21070      21080      21090      21100      21110      21120 
TFRVTRLHEG MEYTFRVSAE NKYGVGEGLK SEPIVARHPF DVPDAPPPPN IVDVRHDSVS 

     21130      21140      21150      21160      21170      21180 
LTWTDPKKTG GSPITGYHLE FKERNSLLWK RANKTPIRMR DFKVTGLTEG LEYEFRVMAI 

     21190      21200      21210      21220      21230      21240 
NLAGVGKPSL PSEPVVALDP IDPPGKPEVI NITRNSVTLI WTEPKYDGGH KLTGYIVEKR 

     21250      21260      21270      21280      21290      21300 
DLPSKSWMKA NHVNVPECAF TVTDLVEGGK YEFRIRAKNT AGAISAPSES TETIICKDEY 

     21310      21320      21330      21340      21350      21360 
EAPTIVLDPT IKDGLTIKAG DTIVLNAISI LGKPLPKSSW SKAGKDIRPS DITQITSTPT 

     21370      21380      21390      21400      21410      21420 
SSMLTIKYAT RKDAGEYTIT ATNPFGTKVE HVKVTVLDVP GPPGPVEISN VSAEKATLTW 

     21430      21440      21450      21460      21470      21480 
TPPLEDGGSP IKSYILEKRE TSRLLWTVVS EDIQSCRHVA TKLIQGNEYI FRVSAVNHYG 

     21490      21500      21510      21520      21530      21540 
KGEPVQSEPV KMVDRFGPPG PPEKPEVSNV TKNTATVSWK RPVDDGGSEI TGYHVERREK 

     21550      21560      21570      21580      21590      21600 
KSLRWVRAIK TPVSDLRCKV TGLQEGSTYE FRVSAENRAG IGPPSEASDS VLMKDAAYPP 

     21610      21620      21630      21640      21650      21660 
GPPSNPHVTD TTKKSASLAW GKPHYDGGLE ITGYVVEHQK VGDEAWIKDT TGTALRITQF 

     21670      21680      21690      21700      21710      21720 
VVPDLQTKEK YNFRISAIND AGVGEPAVIP DVEIVEREMA PDFELDAELR RTLVVRAGLS 

     21730      21740      21750      21760      21770      21780 
IRIFVPIKGR PAPEVTWTKD NINLKNRANI ENTESFTLLI IPECNRYDTG KFVMTIENPA 

     21790      21800      21810      21820      21830      21840 
GKKSGFVNVR VLDTPGPVLN LRPTDITKDS VTLHWDLPLI DGGSRITNYI VEKREATRKS 

     21850      21860      21870      21880      21890      21900 
YSTATTKCHK CTYKVTGLSE GCEYFFRVMA ENEYGIGEPT ETTEPVKASE APSPPDSLNI 

     21910      21920      21930      21940      21950      21960 
MDITKSTVSL AWPKPKHDGG SKITGYVIEA QRKGSDQWTH ITTVKGLECV VRNLTEGEEY 

     21970      21980      21990      22000      22010      22020 
TFQVMAVNSA GRSAPRESRP VIVKEQTMLP ELDLRGIYQK LVIAKAGDNI KVEIPVLGRP 

     22030      22040      22050      22060      22070      22080 
KPTVTWKKGD QILKQTQRVN FETTATSTIL NINECVRSDS GPYPLTARNI VGEVGDVITI 

     22090      22100      22110      22120      22130      22140 
QVHDIPGPPT GPIKFDEVSS DFVTFSWDPP ENDGGVPISN YVVEMRQTDS TTWVELATTV 

     22150      22160      22170      22180      22190      22200 
IRTTYKATRL TTGLEYQFRV KAQNRYGVGP GITSACIVAN YPFKVPGPPG TPQVTAVTKD 

     22210      22220      22230      22240      22250      22260 
SMTISWHEPL SDGGSPILGY HVERKERNGI LWQTVSKALV PGNIFKSSGL TDGIAYEFRV 

     22270      22280      22290      22300      22310      22320 
IAENMAGKSK PSKPSEPMLA LDPIDPPGKP VPLNITRHTV TLKWAKPEYT GGFKITSYIV 

     22330      22340      22350      22360      22370      22380 
EKRDLPNGRW LKANFSNILE NEFTVSGLTE DAAYEFRVIA KNAAGAISPP SEPSDAITCR 

     22390      22400      22410      22420      22430      22440 
DDVEAPKIKV DVKFKDTVIL KAGEAFRLEA DVSGRPPPTM EWSKDGKELE GTAKLEIKIA 

     22450      22460      22470      22480      22490      22500 
DFSTNLVNKD STRRDSGAYT LTATNPGGFA KHIFNVKVLD RPGPPEGPLA VTEVTSEKCV 

     22510      22520      22530      22540      22550      22560 
LSWFPPLDDG GAKIDHYIVQ KRETSRLAWT NVASEVQVTK LKVTKLLKGN EYIFRVMAVN 

     22570      22580      22590      22600      22610      22620 
KYGVGEPLES EPVLAVNPYG PPDPPKNPEV TTITKDSMVV CWGHPDSDGG SEIINYIVER 

     22630      22640      22650      22660      22670      22680 
RDKAGQRWIK CNKKTLTDLR YKVSGLTEGH EYEFRIMAEN AAGISAPSPT SPFYKACDTV 

     22690      22700      22710      22720      22730      22740 
FKPGPPGNPR VLDTSRSSIS IAWNKPIYDG GSEITGYMVE IALPEEDEWQ IVTPPAGLKA 

     22750      22760      22770      22780      22790      22800 
TSYTITGLTE NQEYKIRIYA MNSEGLGEPA LVPGTPKAED RMLPPEIELD ADLRKVVTIR 

     22810      22820      22830      22840      22850      22860 
ACCTLRLFVP IKGRPAPEVK WARDHGESLD KASIESTSSY TLLIVGNVNR FDSGKYILTV 

     22870      22880      22890      22900      22910      22920 
ENSSGSKSAF VNVRVLDTPG PPQDLKVKEV TKTSVTLTWD PPLLDGGSKI KNYIVEKRES 

     22930      22940      22950      22960      22970      22980 
TRKAYSTVAT NCHKTSWKVD QLQEGCSYYF RVLAENEYGI GLPAETAESV KASERPLPPG 

     22990      23000      23010      23020      23030      23040 
KITLMDVTRN SVSLSWEKPE HDGGSRILGY IVEMQTKGSD KWATCATVKV TEATITGLIQ 

     23050      23060      23070      23080      23090      23100 
GEEYSFRVSA QNEKGISDPR QLSVPVIAKD LVIPPAFKLL FNTFTVLAGE DLKVDVPFIG 

     23110      23120      23130      23140      23150      23160 
RPTPAVTWHK DNVPLKQTTR VNAESTENNS LLTIKDACRE DVGHYVVKLT NSAGEAIETL 

     23170      23180      23190      23200      23210      23220 
NVIVLDKPGP PTGPVKMDEV TADSITLSWG PPKYDGGSSI NNYIVEKRDT STTTWQIVSA 

     23230      23240      23250      23260      23270      23280 
TVARTTIKAC RLKTGCEYQF RIAAENRYGK STYLNSEPTV AQYPFKVPGP PGTPVVTLSS 

     23290      23300      23310      23320      23330      23340 
RDSMEVQWNE PISDGGSRVI GYHLERKERN SILWVKLNKT PIPQTKFKTT GLEEGVEYEF 

     23350      23360      23370      23380      23390      23400 
RVSAENIVGI GKPSKVSECY VARDPCDPPG RPEAIIVTRN SVTLQWKKPT YDGGSKITGY 

     23410      23420      23430      23440      23450      23460 
IVEKKELPEG RWMKASFTNI IDTHFEVTGL VEDHRYEFRV IARNAAGVFS EPSESTGAIT 

     23470      23480      23490      23500      23510      23520 
ARDEVDPPRI SMDPKYKDTI VVHAGESFKV DADIYGKPIP TIQWIKGDQE LSNTARLEIK 

     23530      23540      23550      23560      23570      23580 
STDFATSLSV KDAVRVDSGN YILKAKNVAG ERSVTVNVKV LDRPGPPEGP VVISGVTAEK 

     23590      23600      23610      23620      23630      23640 
CTLAWKPPLQ DGGSDIINYI VERRETSRLV WTVVDANVQT LSCKVTKLLE GNEYTFRIMA 

     23650      23660      23670      23680      23690      23700 
VNKYGVGEPL ESEPVVAKNP FVVPDAPKAP EVTTVTKDSM IVVWERPASD GGSEILGYVL 

     23710      23720      23730      23740      23750      23760 
EKRDKEGIRW TRCHKRLIGE LRLRVTGLIE NHDYEFRVSA ENAAGLSEPS PPSAYQKACD 

     23770      23780      23790      23800      23810      23820 
PIYKPGPPNN PKVIDITRSS VFLSWSKPIY DGGCEIQGYI VEKCDVSVGE WTMCTPPTGI 

     23830      23840      23850      23860      23870      23880 
NKTNIEVEKL LEKHEYNFRI CAINKAGVGE HADVPGPIIV EEKLEAPDID LDLELRKIIN 

     23890      23900      23910      23920      23930      23940 
IRAGGSLRLF VPIKGRPTPE VKWGKVDGEI RDAAIIDVTS SFTSLVLDNV NRYDSGKYTL 

     23950      23960      23970      23980      23990      24000 
TLENSSGTKS AFVTVRVLDT PSPPVNLKVT EITKDSVSIT WEPPLLDGGS KIKNYIVEKR 

     24010      24020      24030      24040      24050      24060 
EATRKSYAAV VTNCHKNSWK IDQLQEGCSY YFRVTAENEY GIGLPAQTAD PIKVAEVPQP 

     24070      24080      24090      24100      24110      24120 
PGKITVDDVT RNSVSLSWTK PEHDGGSKII QYIVEMQAKH SEKWSECARV KSLQAVITNL 

     24130      24140      24150      24160      24170      24180 
TQGEEYLFRV VAVNEKGRSD PRSLAVPIVA KDLVIEPDVK PAFSSYSVQV GQDLKIEVPI 

     24190      24200      24210      24220      24230      24240 
SGRPKPTITW TKDGLPLKQT TRINVTDSLD LTTLSIKETH KDDGGQYGIT VANVVGQKTA 

     24250      24260      24270      24280      24290      24300 
SIEIVTLDKP DPPKGPVKFD DVSAESITLS WNPPLYTGGC QITNYIVQKR DTTTTVWDVV 

     24310      24320      24330      24340      24350      24360 
SATVARTTLK VTKLKTGTEY QFRIFAENRY GQSFALESDP IVAQYPYKEP GPPGTPFATA 

     24370      24380      24390      24400      24410      24420 
ISKDSMVIQW HEPVNNGGSP VIGYHLERKE RNSILWTKVN KTIIHDTQFK AQNLEEGIEY 

     24430      24440      24450      24460      24470      24480 
EFRVYAENIV GVGKASKNSE CYVARDPCDP PGTPEPIMVK RNEITLQWTK PVYDGGSMIT 

     24490      24500      24510      24520      24530      24540 
GYIVEKRDLP DGRWMKASFT NVIETQFTVS GLTEDQRYEF RVIAKNAAGA ISKPSDSTGP 

     24550      24560      24570      24580      24590      24600 
ITAKDEVELP RISMDPKFRD TIVVNAGETF RLEADVHGKP LPTIEWLRGD KEIEESARCE 

     24610      24620      24630      24640      24650      24660 
IKNTDFKALL IVKDAIRIDG GQYILRASNV AGSKSFPVNV KVLDRPGPPE GPVQVTGVTS 

     24670      24680      24690      24700      24710      24720 
EKCSLTWSPP LQDGGSDISH YVVEKRETSR LAWTVVASEV VTNSLKVTKL LEGNEYVFRI 

     24730      24740      24750      24760      24770      24780 
MAVNKYGVGE PLESAPVLMK NPFVLPGPPK SLEVTNIAKD SMTVCWNRPD SDGGSEIIGY 

     24790      24800      24810      24820      24830      24840 
IVEKRDRSGI RWIKCNKRRI TDLRLRVTGL TEDHEYEFRV SAENAAGVGE PSPATVYYKA 

     24850      24860      24870      24880      24890      24900 
CDPVFKPGPP TNAHIVDTTK NSITLAWGKP IYDGGSEILG YVVEICKADE EEWQIVTPQT 

     24910      24920      24930      24940      24950      24960 
GLRVTRFEIS KLTEHQEYKI RVCALNKVGL GEATSVPGTV KPEDKLEAPE LDLDSELRKG 

     24970      24980      24990      25000      25010      25020 
IVVRAGGSAR IHIPFKGRPT PEITWSREEG EFTDKVQIEK GVNYTQLSID NCDRNDAGKY 

     25030      25040      25050      25060      25070      25080 
ILKLENSSGS KSAFVTVKVL DTPGPPQNLA VKEVRKDSAF LVWEPPIIDG GAKVKNYVID 

     25090      25100      25110      25120      25130      25140 
KRESTRKAYA NVSSKCSKTS FKVENLTEGA IYYFRVMAEN EFGVGVPVET VDAVKAAEPP 

     25150      25160      25170      25180      25190      25200 
SPPGKVTLTD VSQTSASLMW EKPEHDGGSR VLGYVVEMQP KGTEKWSIVA ESKVCNAVVT 

     25210      25220      25230      25240      25250      25260 
GLSSGQEYQF RVKAYNEKGK SDPRVLGVPV IAKDLTIQPS LKLPFNTYSI QAGEDLKIEI 

     25270      25280      25290      25300      25310      25320 
PVIGRPRPNI SWVKDGEPLK QTTRVNVEET ATSTVLHIKE GNKDDFGKYT VTATNSAGTA 

     25330      25340      25350      25360      25370      25380 
TENLSVIVLE KPGPPVGPVR FDEVSADFVV ISWEPPAYTG GCQISNYIVE KRDTTTTTWH 

     25390      25400      25410      25420      25430      25440 
MVSATVARTT IKITKLKTGT EYQFRIFAEN RYGKSAPLDS KAVIVQYPFK EPGPPGTPFV 

     25450      25460      25470      25480      25490      25500 
TSISKDQMLV QWHEPVNDGG TKIIGYHLEQ KEKNSILWVK LNKTPIQDTK FKTTGLDEGL 

     25510      25520      25530      25540      25550      25560 
EYEFKVSAEN IVGIGKPSKV SECFVARDPC DPPGRPEAIV ITRNNVTLKW KKPAYDGGSK 

     25570      25580      25590      25600      25610      25620 
ITGYIVEKKD LPDGRWMKAS FTNVLETEFT VSGLVEDQRY EFRVIARNAA GNFSEPSDSS 

     25630      25640      25650      25660      25670      25680 
GAITARDEID APNASLDPKY KDVIVVHAGE TFVLEADIRG KPIPDVVWSK DGKELEETAA 

     25690      25700      25710      25720      25730      25740 
RMEIKSTIQK TTLVVKDCIR TDGGQYILKL SNVGGTKSIP ITVKVLDRPG PPEGPLKVTG 

     25750      25760      25770      25780      25790      25800 
VTAEKCYLAW NPPLQDGGAN ISHYIIEKRE TSRLSWTQVS TEVQALNYKV TKLLPGNEYI 

     25810      25820      25830      25840      25850      25860 
FRVMAVNKYG IGEPLESGPV TACNPYKPPG PPSTPEVSAI TKDSMVVTWA RPVDDGGTEI 

     25870      25880      25890      25900      25910      25920 
EGYILEKRDK EGVRWTKCNK KTLTDLRLRV TGLTEGHSYE FRVAAENAAG VGEPSEPSVF 

     25930      25940      25950      25960      25970      25980 
YRACDALYPP GPPSNPKVTD TSRSSVSLAW SKPIYDGGAP VKGYVVEVKE AAADEWTTCT 

     25990      26000      26010      26020      26030      26040 
PPTGLQGKQF TVTKLKENTE YNFRICAINS EGVGEPATLP GSVVAQERIE PPEIELDADL 

     26050      26060      26070      26080      26090      26100 
RKVVVLRASA TLRLFVTIKG RPEPEVKWEK AEGILTDRAQ IEVTSSFTML VIDNVTRFDS 

     26110      26120      26130      26140      26150      26160 
GRYNLTLENN SGSKTAFVNV RVLDSPSAPV NLTIREVKKD SVTLSWEPPL IDGGAKITNY 

     26170      26180      26190      26200      26210      26220 
IVEKRETTRK AYATITNNCT KTTFRIENLQ EGCSYYFRVL ASNEYGIGLP AETTEPVKVS 

     26230      26240      26250      26260      26270      26280 
EPPLPPGRVT LVDVTRNTAT IKWEKPESDG GSKITGYVVE MQTKGSEKWS TCTQVKTLEA 

     26290      26300      26310      26320      26330      26340 
TISGLTAGEE YVFRVAAVNE KGRSDPRQLG VPVIARDIEI KPSVELPFHT FNVKAREQLK 

     26350      26360      26370      26380      26390      26400 
IDVPFKGRPQ ATVNWRKDGQ TLKETTRVNV SSSKTVTSLS IKEASKEDVG TYELCVSNSA 

     26410      26420      26430      26440      26450      26460 
GSITVPITII VLDRPGPPGP IRIDEVSCDS ITISWNPPEY DGGCQISNYI VEKKETTSTT 

     26470      26480      26490      26500      26510      26520 
WHIVSQAVAR TSIKIVRLTT GSEYQFRVCA ENRYGKSSYS ESSAVVAEYP FSPPGPPGTP 

     26530      26540      26550      26560      26570      26580 
KVVHATKSTM LVTWQVPVND GGSRVIGYHL EYKERSSILW SKANKILIAD TQMKVSGLDE 

     26590      26600      26610      26620      26630      26640 
GLMYEYRVYA ENIAGIGKCS KSCEPVPARD PCDPPGQPEV TNITRKSVSL KWSKPHYDGG 

     26650      26660      26670      26680      26690      26700 
AKITGYIVER RELPDGRWLK CNYTNIQETY FEVTELTEDQ RYEFRVFARN AADSVSEPSE 

     26710      26720      26730      26740      26750      26760 
STGPIIVKDD VEPPRVMMDV KFRDVIVVKA GEVLKINADI AGRPLPVISW AKDGIEIEER 

     26770      26780      26790      26800      26810      26820 
ARTEIISTDN HTLLTVKDCI RRDTGQYVLT LKNVAGTRSV AVNCKVLDKP GPPAGPLEIN 

     26830      26840      26850      26860      26870      26880 
GLTAEKCSLS WGRPQEDGGA DIDYYIVEKR ETSHLAWTIC EGELQMTSCK VTKLLKGNEY 

     26890      26900      26910      26920      26930      26940 
IFRVTGVNKY GVGEPLESVA IKALDPFTVP SPPTSLEITS VTKESMTLCW SRPESDGGSE 

     26950      26960      26970      26980      26990      27000 
ISGYIIERRE KNSLRWVRVN KKPVYDLRVK STGLREGCEY EYRVYAENAA GLSLPSETSP 

     27010      27020      27030      27040      27050      27060 
LIRAEDPVFL PSPPSKPKIV DSGKTTITIA WVKPLFDGGA PITGYTVEYK KSDDTDWKTS 

     27070      27080      27090      27100      27110      27120 
IQSLRGTEYT ISGLTTGAEY VFRVKSVNKV GASDPSDSSD PQIAKEREEE PLFDIDSEMR 

     27130      27140      27150      27160      27170      27180 
KTLIVKAGAS FTMTVPFRGR PVPNVLWSKP DTDLRTRAYV DTTDSRTSLT IENANRNDSG 

     27190      27200      27210      27220      27230      27240 
KYTLTIQNVL SAASLTLVVK VLDTPGPPTN ITVQDVTKES AVLSWDVPEN DGGAPVKNYH 

     27250      27260      27270      27280      27290      27300 
IEKREASKKA WVSVTNNCNR LSYKVTNLQE GAIYYFRVSG ENEFGVGIPA ETKEGVKITE 

     27310      27320      27330      27340      27350      27360 
KPSPPEKLGV TSISKDSVSL TWLKPEHDGG SRIVHYVVEA LEKGQKNWVK CAVAKSTHHV 

     27370      27380      27390      27400      27410      27420 
VSGLRENSEY FFRVFAENQA GLSDPRELLL PVLIKEQLEP PEIDMKNFPS HTVYVRAGSN 

     27430      27440      27450      27460      27470      27480 
LKVDIPISGK PLPKVTLSRD GVPLKATMRF NTEITAENLT INLKESVTAD AGRYEITAAN 

     27490      27500      27510      27520      27530      27540 
SSGTTKAFIN IVVLDRPGPP TGPVVISDIT EESVTLKWEP PKYDGGSQVT NYILLKRETS 

     27550      27560      27570      27580      27590      27600 
TAVWTEVSAT VARTMMKVMK LTTGEEYQFR IKAENRFGIS DHIDSACVTV KLPYTTPGPP 

     27610      27620      27630      27640      27650      27660 
STPWVTNVTR ESITVGWHEP VSNGGSAVVG YHLEMKDRNS ILWQKANKLV IRTTHFKVTT 

     27670      27680      27690      27700      27710      27720 
ISAGLIYEFR VYAENAAGVG KPSHPSEPVL AIDACEPPRN VRITDISKNS VSLSWQQPAF 

     27730      27740      27750      27760      27770      27780 
DGGSKITGYI VERRDLPDGR WTKASFTNVT ETQFIISGLT QNSQYEFRVF ARNAVGSISN 

     27790      27800      27810      27820      27830      27840 
PSEVVGPITC IDSYGGPVID LPLEYTEVVK YRAGTSVKLR AGISGKPAPT IEWYKDDKEL 

     27850      27860      27870      27880      27890      27900 
QTNALVCVEN TTDLASILIK DADRLNSGCY ELKLRNAMGS ASATIRVQIL DKPGPPGGPI 

     27910      27920      27930      27940      27950      27960 
EFKTVTAEKI TLLWRPPADD GGAKITHYIV EKRETSRVVW SMVSEHLEEC IITTTKIIKG 

     27970      27980      27990      28000      28010      28020 
NEYIFRVRAV NKYGIGEPLE SDSVVAKNAF VTPGPPGIPE VTKITKNSMT VVWSRPIADG 

     28030      28040      28050      28060      28070      28080 
GSDISGYFLE KRDKKSLGWF KVLKETIRDT RQKVTGLTEN SDYQYRVCAV NAAGQGPFSE 

     28090      28100      28110      28120      28130      28140 
PSEFYKAADP IDPPGPPAKI RIADSTKSSI TLGWSKPVYD GGSAVTGYVV EIRQGEEEEW 

     28150      28160      28170      28180      28190      28200 
TTVSTKGEVR TTEYVVSNLK PGVNYYFRVS AVNCAGQGEP IEMNEPVQAK DILEAPEIDL 

     28210      28220      28230      28240      28250      28260 
DVALRTSVIA KAGEDVQVLI PFKGRPPPTV TWRKDEKNLG SDARYSIENT DSSSLLTIPQ 

     28270      28280      28290      28300      28310      28320 
VTRNDTGKYI LTIENGVGEP KSSTVSVKVL DTPAACQKLQ VKHVSRGTVT LLWDPPLIDG 

     28330      28340      28350      28360      28370      28380 
GSPIINYVIE KRDATKRTWS VVSHKCSSTS FKLIDLSEKT PFFFRVLAEN EIGIGEPCET 

     28390      28400      28410      28420      28430      28440 
TEPVKAAEVP APIRDLSMKD STKTSVILSW TKPDFDGGSV ITEYVVERKG KGEQTWSHAG 

     28450      28460      28470      28480      28490      28500 
ISKTCEIEVS QLKEQSVLEF RVFAKNEKGL SDPVTIGPIT VKELIITPEV DLSDIPGAQV 

     28510      28520      28530      28540      28550      28560 
TVRIGHNVHL ELPYKGKPKP SISWLKDGLP LKESEFVRFS KTENKITLSI KNAKKEHGGK 

     28570      28580      28590      28600      28610      28620 
YTVILDNAVC RIAVPITVIT LGPPSKPKGP IRFDEIKADS VILSWDVPED NGGGEITCYS 

     28630      28640      28650      28660      28670      28680 
IEKRETSQTN WKMVCSSVAR TTFKVPNLVK DAEYQFRVRA ENRYGVSQPL VSSIIVAKHQ 

     28690      28700      28710      28720      28730      28740 
FRIPGPPGKP VIYNVTSDGM SLTWDAPVYD GGSEVTGFHV EKKERNSILW QKVNTSPISG 

     28750      28760      28770      28780      28790      28800 
REYRATGLVE GLDYQFRVYA ENSAGLSSPS DPSKFTLAVS PVDPPGTPDY IDVTRETITL 

     28810      28820      28830      28840      28850      28860 
KWNPPLRDGG SKIVGYSIEK RQGNERWVRC NFTDVSECQY TVTGLSPGDR YEFRIIARNA 

     28870      28880      28890      28900      28910      28920 
VGTISPPSQS SGIIMTRDEN VPPIVEFGPE YFDGLIIKSG ESLRIKALVQ GRPVPRVTWF 

     28930      28940      28950      28960      28970      28980 
KDGVEIEKRM NMEITDVLGS TSLFVRDATR DHRGVYTVEA KNASGSAKAE IKVKVQDTPG 

     28990      29000      29010      29020      29030      29040 
KVVGPIRFTN ITGEKMTLWW DAPLNDGCAP ITHYIIEKRE TSRLAWALIE DKCEAQSYTA 

     29050      29060      29070      29080      29090      29100 
IKLINGNEYQ FRVSAVNKFG VGRPLDSDPV VAQIQYTVPD APGIPEPSNI TGNSITLTWA 

     29110      29120      29130      29140      29150      29160 
RPESDGGSEI QQYILERREK KSTRWVKVIS KRPISETRFK VTGLTEGNEY EFHVMAENAA 

     29170      29180      29190      29200      29210      29220 
GVGPASGISR LIKCREPVNP PGPPTVVKVT DTSKTTVSLE WSKPVFDGGM EIIGYIIEMC 

     29230      29240      29250      29260      29270      29280 
KADLGDWHKV NAEACVKTRY TVTDLQAGEE YKFRVSAING AGKGDSCEVT GTIKAVDRLT 

     29290      29300      29310      29320      29330      29340 
APELDIDANF KQTHVVRAGA SIRLFIAYQG RPTPTAVWSK PDSNLSLRAD IHTTDSFSTL 

     29350      29360      29370      29380      29390      29400 
TVENCNRNDA GKYTLTVENN SGSKSITFTV KVLDTPGPPG PITFKDVTRG SATLMWDAPL 

     29410      29420      29430      29440      29450      29460 
LDGGARIHHY VVEKREASRR SWQVISEKCT RQIFKVNDLA EGVPYYFRVS AVNEYGVGEP 

     29470      29480      29490      29500      29510      29520 
YEMPEPIVAT EQPAPPRRLD VVDTSKSSAV LAWLKPDHDG GSRITGYLLE MRQKGSDFWV 

     29530      29540      29550      29560      29570      29580 
EAGHTKQLTF TVERLVEKTE YEFRVKAKND AGYSEPREAF SSVIIKEPQI EPTADLTGIT 

     29590      29600      29610      29620      29630      29640 
NQLITCKAGS PFTIDVPISG RPAPKVTWKL EEMRLKETDR VSITTTKDRT TLTVKDSMRG 

     29650      29660      29670      29680      29690      29700 
DSGRYFLTLE NTAGVKTFSV TVVVIGRPGP VTGPIEVSSV SAESCVLSWG EPKDGGGTEI 

     29710      29720      29730      29740      29750      29760 
TNYIVEKRES GTTAWQLVNS SVKRTQIKVT HLTKYMEYSF RVSSENRFGV SKPLESAPII 

     29770      29780      29790      29800      29810      29820 
AEHPFVPPSA PTRPEVYHVS ANAMSIRWEE PYHDGGSKII GYWVEKKERN TILWVKENKV 

     29830      29840      29850      29860      29870      29880 
PCLECNYKVT GLVEGLEYQF RTYALNAAGV SKASEASRPI MAQNPVDAPG RPEVTDVTRS 

     29890      29900      29910      29920      29930      29940 
TVSLIWSAPA YDGGSKVVGY IIERKPVSEV GDGRWLKCNY TIVSDNFFTV TALSEGDTYE 

     29950      29960      29970      29980      29990      30000 
FRVLAKNAAG VISKGSESTG PVTCRDEYAP PKAELDARLH GDLVTIRAGS DLVLDAAVGG 

     30010      30020      30030      30040      30050      30060 
KPEPKIIWTK GDKELDLCEK VSLQYTGKRA TAVIKFCDRS DSGKYTLTVK NASGTKAVSV 

     30070      30080      30090      30100      30110      30120 
MVKVLDSPGP CGKLTVSRVT QEKCTLAWSL PQEDGGAEIT HYIVERRETS RLNWVIVEGE 

     30130      30140      30150      30160      30170      30180 
CPTLSYVVTR LIKNNEYIFR VRAVNKYGPG VPVESEPIVA RNSFTIPSPP GIPEEVGTGK 

     30190      30200      30210      30220      30230      30240 
EHIIIQWTKP ESDGGNEISN YLVDKREKKS LRWTRVNKDY VVYDTRLKVT SLMEGCDYQF 

     30250      30260      30270      30280      30290      30300 
RVTAVNAAGN SEPSEASNFI SCREPSYTPG PPSAPRVVDT TKHSISLAWT KPMYDGGTDI 

     30310      30320      30330      30340      30350      30360 
VGYVLEMQEK DTDQWYRVHT NATIRNTEFT VPDLKMGQKY SFRVAAVNVK GMSEYSESIA 

     30370      30380      30390      30400      30410      30420 
EIEPVERIEI PDLELADDLK KTVTIRAGAS LRLMVSVSGR PPPVITWSKQ GIDLASRAII 

     30430      30440      30450      30460      30470      30480 
DTTESYSLLI VDKVNRYDAG KYTIEAENQS GKKSATVLVK VYDTPGPCPS VKVKEVSRDS 

     30490      30500      30510      30520      30530      30540 
VTITWEIPTI DGGAPVNNYI VEKREAAMRA FKTVTTKCSK TLYRISGLVE GTMYYFRVLP 

     30550      30560      30570      30580      30590      30600 
ENIYGIGEPC ETSDAVLVSE VPLVPAKLEV VDVTKSTVTL AWEKPLYDGG SRLTGYVLEA 

     30610      30620      30630      30640      30650      30660 
CKAGTERWMK VVTLKPTVLE HTVTSLNEGE QYLFRIRAQN EKGVSEPRET VTAVTVQDLR 

     30670      30680      30690      30700      30710      30720 
VLPTIDLSTM PQKTIHVPAG RPVELVIPIA GRPPPAASWF FAGSKLRESE RVTVETHTKV 

     30730      30740      30750      30760      30770      30780 
AKLTIRETTI RDTGEYTLEL KNVTGTTSET IKVIILDKPG PPTGPIKIDE IDATSITISW 

     30790      30800      30810      30820      30830      30840 
EPPELDGGAP LSGYVVEQRD AHRPGWLPVS ESVTRSTFKF TRLTEGNEYV FRVAATNRFG 

     30850      30860      30870      30880      30890      30900 
IGSYLQSEVI ECRSSIRIPG PPETLQIFDV SRDGMTLTWY PPEDDGGSQV TGYIVERKEV 

     30910      30920      30930      30940      30950      30960 
RADRWVRVNK VPVTMTRYRS TGLTEGLEYE HRVTAINARG SGKPSRPSKP IVAMDPIAPP 

     30970      30980      30990      31000      31010      31020 
GKPQNPRVTD TTRTSVSLAW SVPEDEGGSK VTGYLIEMQK VDQHEWTKCN TTPTKIREYT 

     31030      31040      31050      31060      31070      31080 
LTHLPQGAEY RFRVLACNAG GPGEPAEVPG TVKVTEMLEY PDYELDERYQ EGIFVRQGGV 

     31090      31100      31110      31120      31130      31140 
IRLTIPIKGK PFPICKWTKE GQDISKRAMI ATSETHTELV IKEADRGDSG TYDLVLENKC 

     31150      31160      31170      31180      31190      31200 
GKKAVYIKVR VIGSPNSPEG PLEYDDIQVR SVRVSWRPPA DDGGADILGY ILERREVPKA 

     31210      31220      31230      31240      31250      31260 
AWYTIDSRVR GTSLVVKGLK ENVEYHFRVS AENQFGISKP LKSEEPVTPK TPLNPPEPPS 

     31270      31280      31290      31300      31310      31320 
NPPEVLDVTK SSVSLSWSRP KDDGGSRVTG YYIERKETST DKWVRHNKTQ ITTTMYTVTG 

     31330      31340      31350      31360      31370      31380 
LVPDAEYQFR IIAQNDVGLS ETSPASEPVV CKDPFDKPSQ PGELEILSIS KDSVTLQWEK 

     31390      31400      31410      31420      31430      31440 
PECDGGKEIL GYWVEYRQSG DSAWKKSNKE RIKDKQFTIG GLLEATEYEF RVFAENETGL 

     31450      31460      31470      31480      31490      31500 
SRPRRTAMSI KTKLTSGEAP GIRKEMKDVT TKLGEAAQLS CQIVGRPLPD IKWYRFGKEL 

     31510      31520      31530      31540      31550      31560 
IQSRKYKMSS DGRTHTLTVM TEEQEDEGVY TCIATNEVGE VETSSKLLLQ ATPQFHPGYP 

     31570      31580      31590      31600      31610      31620 
LKEKYYGAVG STLRLHVMYI GRPVPAMTWF HGQKLLQNSE NITIENTEHY THLVMKNVQR 

     31630      31640      31650      31660      31670      31680 
KTHAGKYKVQ LSNVFGTVDA ILDVEIQDKP DKPTGPIVIE ALLKNSAVIS WKPPADDGGS 

     31690      31700      31710      31720      31730      31740 
WITNYVVEKC EAKEGAEWQL VSSAISVTTC RIVNLTENAG YYFRVSAQNT FGISDPLEVS 

     31750      31760      31770      31780      31790      31800 
SVVIIKSPFE KPGAPGKPTI TAVTKDSCVV AWKPPASDGG AKIRNYYLEK REKKQNKWIS 

     31810      31820      31830      31840      31850      31860 
VTTEEIRETV FSVKNLIEGL EYEFRVKCEN LGGESEWSEI SEPITPKSDV PIQAPHFKEE 

     31870      31880      31890      31900      31910      31920 
LRNLNVRYQS NATLVCKVTG HPKPIVKWYR QGKEIIADGL KYRIQEFKGG YHQLIIASVT 

     31930      31940      31950      31960      31970      31980 
DDDATVYQVR ATNQGGSVSG TASLEVEVPA KIHLPKTLEG MGAVHALRGE VVSIKIPFSG 

     31990      32000      32010      32020      32030      32040 
KPDPVITWQK GQDLIDNNGH YQVIVTRSFT SLVFPNGVER KDAGFYVVCA KNRFGIDQKT 

     32050      32060      32070      32080      32090      32100 
VELDVADVPD PPRGVKVSDV SRDSVNLTWT EPASDGGSKI TNYIVEKCAT TAERWLRVGQ 

     32110      32120      32130      32140      32150      32160 
ARETRYTVIN LFGKTSYQFR VIAENKFGLS KPSEPSEPTI TKEDKTRAMN YDEEVDETRE 

     32170      32180      32190      32200      32210      32220 
VSMTKASHSS TKELYEKYMI AEDLGRGEFG IVHRCVETSS KKTYMAKFVK VKGTDQVLVK 

     32230      32240      32250      32260      32270      32280 
KEISILNIAR HRNILHLHES FESMEELVMI FEFISGLDIF ERINTSAFEL NEREIVSYVH 

     32290      32300      32310      32320      32330      32340 
QVCEALQFLH SHNIGHFDIR PENIIYQTRR SSTIKIIEFG QARQLKPGDN FRLLFTAPEY 

     32350      32360      32370      32380      32390      32400 
YAPEVHQHDV VSTATDMWSL GTLVYVLLSG INPFLAETNQ QIIENIMNAE YTFDEEAFKE 

     32410      32420      32430      32440      32450      32460 
ISIEAMDFVD RLLVKERKSR MTASEALQHP WLKQKIERVS TKVIRTLKHR RYYHTLIKKD 

     32470      32480      32490      32500      32510      32520 
LNMVVSAARI SCGGAIRSQK GVSVAKVKVA SIEIGPVSGQ IMHAVGEEGG HVKYVCKIEN 

     32530      32540      32550      32560      32570      32580 
YDQSTQVTWY FGVRQLENSE KYEITYEDGV AILYVKDITK LDDGTYRCKV VNDYGEDSSY 

     32590      32600      32610      32620      32630      32640 
AELFVKGVRE VYDYYCRRTM KKIKRRTDTM RLLERPPEFT LPLYNKTAYV GENVRFGVTI 

     32650      32660      32670      32680      32690      32700 
TVHPEPHVTW YKSGQKIKPG DNDKKYTFES DKGLYQLTIN SVTTDDDAEY TVVARNKYGE 

     32710      32720      32730      32740      32750      32760 
DSCKAKLTVT LHPPPTDSTL RPMFKRLLAN AECQEGQSVC FEIRVSGIPP PTLKWEKDGQ 

     32770      32780      32790      32800      32810      32820 
PLSLGPNIEI IHEGLDYYAL HIRDTLPEDT GYYRVTATNT AGSTSCQAHL QVERLRYKKQ 

     32830      32840      32850      32860      32870      32880 
EFKSKEEHER HVQKQIDKTL RMAEILSGTE SVPLTQVAKE ALREAAVLYK PAVSTKTVKG 

     32890      32900      32910      32920      32930      32940 
EFRLEIEEKK EERKLRMPYD VPEPRKYKQT TIEEDQRIKQ FVPMSDMKWY KKIRDQYEMP 

     32950      32960      32970      32980      32990      33000 
GKLDRVVQKR PKRIRLSRWE QFYVMPLPRI TDQYRPKWRI PKLSQDDLEI VRPARRRTPS 

     33010      33020      33030      33040      33050      33060 
PDYDFYYRPR RRSLGDISDE ELLLPIDDYL AMKRTEEERL RLEEELELGF SASPPSRSPP 

     33070      33080      33090      33100      33110      33120 
HFELSSLRYS SPQAHVKVEE TRKDFRYSTY HIPTKAEAST SYAELRERHA QAAYRQPKQR 

     33130      33140      33150      33160      33170      33180 
QRIMAEREDE ELLRPVTTTQ HLSEYKSELD FMSKEEKSRK KSRRQREVTE ITEIEEEYEI 

     33190      33200      33210      33220      33230      33240 
SKHAQRESSS SASRLLRRRR SLSPTYIELM RPVSELIRSR PQPAEEYEDD TERRSPTPER 

     33250      33260      33270      33280      33290      33300 
TRPRSPSPVS SERSLSRFER SARFDIFSRY ESMKAALKTQ KTSERKYEVL SQQPFTLDHA 

     33310      33320      33330      33340      33350      33360 
PRITLRMRSH RVPCGQNTRF ILNVQSKPTA EVKWYHNGVE LQESSKIHYT NTSGVLTLEI 

     33370      33380      33390      33400      33410      33420 
LDCHTDDSGT YRAVCTNYKG EASDYATLDV TGGDYTTYAS QRRDEEVPRS VFPELTRTEA 

     33430      33440      33450      33460      33470      33480 
YAVSSFKKTS EMEASSSVRE VKSQMTETRE SLSSYEHSAS AEMKSAALEE KSLEEKSTTR 

     33490      33500      33510      33520      33530      33540 
KIKTTLAARI LTKPRSMTVY EGESARFSCD TDGEPVPTVT WLRKGQVLST SARHQVTTTK 

     33550      33560      33570      33580      33590      33600 
YKSTFEISSV QASDEGNYSV VVENSEGKQE AEFTLTIQKA RVTEKAVTSP PRVKSPEPRV 

     33610      33620      33630      33640      33650      33660 
KSPEAVKSPK RVKSPEPSHP KAVSPTETKP TPTEKVQHLP VSAPPKITQF LKAEASKEIA 

     33670      33680      33690      33700      33710      33720 
KLTCVVESSV LRAKEVTWYK DGKKLKENGH FQFHYSADGT YELKINNLTE SDQGEYVCEI 

     33730      33740      33750      33760      33770      33780 
SGEGGTSKTN LQFMGQAFKS IHEKVSKISE TKKSDQKTTE STVTRKTEPK APEPISSKPV 

     33790      33800      33810      33820      33830      33840 
IVTGLQDTTV SSDSVAKFAV KATGEPRPTA IWTKDGKAIT QGGKYKLSED KGGFFLEIHK 

     33850      33860      33870      33880      33890      33900 
TDTSDSGLYT CTVKNSAGSV SSSCKLTIKA IKDTEAQKVS TQKTSEITPQ KKAVVQEEIS 

     33910      33920      33930      33940      33950      33960 
QKALRSEEIK MSEAKSQEKL ALKEEASKVL ISEEVKKSAA TSLEKSIVHE EITKTSQASE 

     33970      33980      33990      34000      34010      34020 
EVRTHAEIKA FSTQMSINEG QRLVLKANIA GATDVKWVLN GVELTNSEEY RYGVSGSDQT 

     34030      34040      34050      34060      34070      34080 
LTIKQASHRD EGILTCISKT KEGIVKCQYD LTLSKELSDA PAFISQPRSQ NINEGQNVLF 

     34090      34100      34110      34120      34130      34140 
TCEISGEPSP EIEWFKNNLP ISISSNVSIS RSRNVYSLEI RNASVSDSGK YTIKAKNFRG 

     34150      34160      34170      34180      34190      34200 
QCSATASLMV LPLVEEPSRE VVLRTSGDTS LQGSFSSQSV QMSASKQEAS FSSFSSSSAS 

     34210      34220      34230      34240      34250      34260 
SMTEMKFASM SAQSMSSMQE SFVEMSSSSF MGISNMTQLE SSTSKMLKAG IRGIPPKIEA 

     34270      34280      34290      34300      34310      34320 
LPSDISIDEG KVLTVACAFT GEPTPEVTWS CGGRKIHSQE QGRFHIENTD DLTTLIIMDV 

     34330      34340      34350 
QKQDGGLYTL SLGNEFGSDS ATVNIHIRSI 


Isoform 2 [UniParc].

Checksum: 0F585566A5261872
Length: 34,258 AA. Mass: 3,805,819 Da.
Isoform 3 (Small cardiac N2-B) [UniParc].

Checksum: 49C0716EB3A00B06
Length: 26,926 AA. Mass: 2,992,967 Da.
Isoform 4 (Soleus) [UniParc].

Checksum: FEF6E39E08EAFE3E
Length: 33,445 AA. Mass: 3,716,149 Da.
Isoform 5 [UniParc].

Checksum: 9C1E28B034F5CE4A
Length: 32,900 AA. Mass: 3,653,210 Da.
Isoform 6 (Small cardiac novex-3) [UniParc].

Checksum: 95DCBFC183B0C668
Length: 5,604 AA. Mass: 631,567 Da.
Isoform 7 (Cardiac novex-2) [UniParc].

Checksum: DB03495F9E351EFA
Length: 33,615 AA. Mass: 3,734,769 Da.
Isoform 8 (Cardiac novex-1) [UniParc].

Checksum: C28C55CDBFC91A5F
Length: 34,474 AA. Mass: 3,829,828 Da.

References

[1]"Titins, giant proteins in charge of muscle ultrastructure and elasticity."
Labeit S., Kolmerer B.
Science 270:293-296(1995) [PubMed: 7569978] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 3), NUCLEOTIDE SEQUENCE [MRNA] OF 3336-12202 (ISOFORM 4), VARIANTS ILE-498; GLU-1201; MET-3261; ASN-3419; HIS-12383; GLU-12679; ILE-19762; ILE-20718; ASN-23807; MET-24980 AND THR-27755.
Tissue: Skeletal muscle.
[2]"Series of exon-skipping events in the elastic spring region of titin as the structural basis for myofibrillar elastic diversity."
Freiburg A., Trombitas K., Hell W., Cazorla O., Fougerousse F., Centner T., Kolmerer B., Witt C., Beckmann J.S., Gregorio C.C., Granzier H., Labeit S.
Circ. Res. 86:1114-1121(2000) [PubMed: 10850961] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], ALTERNATIVE SPLICING, VARIANTS LEU-1295; GLN-1572; ILE-2610; ASN-2831; PRO-4215 AND PHE-4283.
[3]"The complete gene sequence of titin, expression of an unusual ~700 kDa titin isoform and its interaction with obscurin identify a novel Z-line to I-band linking system."
Bang M.-L., Centner T., Fornoff F., Geach A.J., Gotthardt M., McNabb M., Witt C.C., Labeit D., Gregorio C.C., Granzier H., Labeit S.
Circ. Res. 89:1065-1072(2001) [PubMed: 11717165] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], ALTERNATIVE SPLICING, TISSUE SPECIFICITY, INTERACTION WITH OBSCN, VARIANTS LEU-1295; GLN-1572; ILE-2610; ASN-2831; PRO-4215 AND PHE-4283.
[4]"Generation and annotation of the DNA sequences of human chromosomes 2 and 4."
Hillier L.W., Graves T.A., Fulton R.S., Fulton L.A., Pepin K.H., Minx P., Wagner-McPherson C., Layman D., Wylie K., Sekhon M., Becker M.C., Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Kremitzki C., Oddy L., Du H., Sun H., Bradshaw-Cordum H., Ali J., Carter J., Cordes M., Harris A., Isak A., van Brunt A., Nguyen C., Du F., Courtney L., Kalicki J., Ozersky P., Abbott S., Armstrong J., Belter E.A., Caruso L., Cedroni M., Cotton M., Davidson T., Desai A., Elliott G., Erb T., Fronick C., Gaige T., Haakenson W., Haglund K., Holmes A., Harkins R., Kim K., Kruchowski S.S., Strong C.M., Grewal N., Goyea E., Hou S., Levy A., Martinka S., Mead K., McLellan M.D., Meyer R., Randall-Maher J., Tomlinson C., Dauphin-Kohlberg S., Kozlowicz-Reilly A., Shah N., Swearengen-Shahid S., Snider J., Strong J.T., Thompson J., Yoakum M., Leonard S., Pearman C., Trani L., Radionenko M., Waligorski J.E., Wang C., Rock S.M., Tin-Wollam A.-M., Maupin R., Latreille P., Wendl M.C., Yang S.-P., Pohl C., Wallis J.W., Spieth J., Bieri T.A., Berkowicz N., Nelson J.O., Osborne J., Ding L., Meyer R., Sabo A., Shotland Y., Sinha P., Wohldmann P.E., Cook L.L., Hickenbotham M.T., Eldred J., Williams D., Jones T.A., She X., Ciccarelli F.D., Izaurralde E., Taylor J., Schmutz J., Myers R.M., Cox D.R., Huang X., McPherson J.D., Mardis E.R., Clifton S.W., Warren W.C., Chinwalla A.T., Eddy S.R., Marra M.A., Ovcharenko I., Furey T.S., Miller W., Eichler E.E., Bork P., Suyama M., Torrents D., Waterston R.H., Wilson R.K.
Nature 434:724-731(2005) [PubMed: 15815621] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[5]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1-626 AND 34071-34350 (ISOFORM 3).
Tissue: Muscle and Skeletal muscle.
[6]"The central Z-disk region of titin is assembled from a novel repeat in variable copy numbers."
Gautel M., Goulding D., Bullard B., Weber K., Furst D.O.
J. Cell Sci. 109:2747-2754(1996) [PubMed: 8937992] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 405-709 (ISOFORMS 1 AND 2).
Tissue: Heart muscle.
[7]"Dissecting titin into its structural motifs: identification of an alpha helix near the N-terminus."
Musco G., Tziatzos C., Schuck P., Pastore A.
Biochemistry 34:553-561(1995) [PubMed: 7819249] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 2023-2060, TISSUE SPECIFICITY.
Tissue: Heart muscle.
[8]"Familial dilated cardiomyopathy locus maps to chromosome 2q31."
Siu B.L., Niimura H., Osborne J.A., Fatkin D., MacRae C., Solomon S., Benson D.W., Seidman J.G., Seidman C.E.
Circulation 99:1022-1026(1999) [PubMed: 10051295] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 3455-4473, VARIANTS PRO-3491; PRO-4215 AND PHE-4283.
[9]"Species variations in cDNA sequence and exon splicing patterns in the extensible I-band region of cardiac titin: relation to passive tension."
Greaser M.L., Berri M., Warren C.M., Mozdziak P.E.
J. Muscle Res. Cell Motil. 23:473-482(2002) [PubMed: 12785098] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 9806-12017 (ISOFORM 5).
Tissue: Heart ventricle.
[10]Lubec G., Chen W.-Q., Sun Y.
Submitted (DEC-2008) to UniProtKB
Cited for: PROTEIN SEQUENCE OF 11773-11783; 17908-17931; 18656-18669; 26545-26553; 28758-28774 AND 32920-32928, MASS SPECTROMETRY.
Tissue: Fetal brain cortex.
[11]"Serological identification of rhabdomyosarcoma antigens."
Behrends U., Gotz C., Mautner J.
Submitted (OCT-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 12947-13797.
Tissue: Embryonic rhabdomyosarcoma.
[12]"Titin antibodies in myasthenia gravis: identification of a major immunogenic region of titin."
Gautel M., Lakey A., Barlow D.P., Holmes Z., Scales S., Leonard K., Labeit S., Mygland A., Gilhus N.E., Aarli J.A.
Neurology 43:1581-1585(1993) [PubMed: 8351016] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE OF 14257-14543.
[13]"Towards a molecular understanding of titin."
Labeit S., Gautel M., Lakey A., Trinick J.
EMBO J. 11:1711-1716(1992) [PubMed: 1582406] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 21021-22120, NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 23754-24284.
[14]"Phosphorylation of KSF motifs in the C-terminal region of titin in differentiating myoblasts."
Gautel M., Leonard K., Labeit S.
EMBO J. 12:3827-3834(1993) [PubMed: 8404852] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 29701-34350, PHOSPHORYLATION.
[15]"The full-ORF clone resource of the German cDNA consortium."
Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H., Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K., Ottenwaelder B., Poustka A., Wiemann S., Schupp I.
BMC Genomics 8:399-399(2007) [PubMed: 17974005] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 33119-34350.
Tissue: Skeletal muscle.
[16]"Two immunoglobulin-like domains of the Z-disc portion of titin interact in a conformation-dependent way with telethonin."
Mues A., van der Ven P.F.M., Young P., Furst D.O., Gautel M.
FEBS Lett. 428:111-114(1998) [PubMed: 9645487] [Abstract]
Cited for: INTERACTION WITH TCAP.
[17]"Interaction of nebulin SH3 domain with titin PEVK and myopalladin: implications for the signaling and assembly role of titin and nebulin."
Ma K., Wang K.
FEBS Lett. 532:273-278(2002) [PubMed: 12482578] [Abstract]
Cited for: INTERACTION WITH NEB.
[18]"Subcellular targeting of metabolic enzymes to titin in heart muscle may be mediated by DRAL/FHL-2."
Lange S., Auerbach D., McLoughlin P., Perriard E., Schafer B.W., Perriard J.-C., Ehler E.
J. Cell Sci. 115:4925-4936(2002) [PubMed: 12432079] [Abstract]
Cited for: INTERACTION WITH FHL2.
[19]"The hydrophilic domain of small ankyrin-1 interacts with the two N-terminal immunoglobulin domains of titin."
Kontrogianni-Konstantopoulos A., Bloch R.J.
J. Biol. Chem. 278:3985-3991(2003) [PubMed: 12444090] [Abstract]
Cited for: INTERACTION WITH ANK1.
[20]"The muscle ankyrin repeat proteins: CARP, ankrd2/Arpp and DARP as a family of titin filament-based stress response molecules."
Miller M.K., Bang M.-L., Witt C.C., Labeit D., Trombitas C., Watanabe K., Granzier H., McElhinny A.S., Gregorio C.C., Labeit S.
J. Mol. Biol. 333:951-964(2003) [PubMed: 14583192] [Abstract]
Cited for: INTERACTION WITH ANKRD1; ANKRD2; ANKRD23 AND CAPN3.
[21]"Association of the chaperone alphaB-crystallin with titin in heart muscle."
Bullard B., Ferguson C., Minajeva A., Leake M.C., Gautel M., Labeit D., Ding L., Labeit S., Horwitz J., Leonard K.R., Linke W.A.
J. Biol. Chem. 279:7917-7924(2004) [PubMed: 14676215] [Abstract]
Cited for: INTERACTION WITH CRYAB.
[22]"MURF-1 and MURF-2 target a specific subset of myofibrillar proteins redundantly: towards understanding MURF-dependent muscle ubiquitination."
Witt S.H., Granzier H., Witt C.C., Labeit S.
J. Mol. Biol. 350:713-722(2005) [PubMed: 15967462] [Abstract]
Cited for: INTERACTION WITH TRIM63 AND TRIM55.
[23]"Tyrosine phosphorylated Par3 regulates epithelial tight junction assembly promoted by EGFR signaling."
Wang Y., Du D., Fang L., Yang G., Zhang C., Zeng R., Ullrich A., Lottspeich F., Chen Z.
EMBO J. 25:5058-5070(2006) [PubMed: 17053785] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT TYR-8490, MASS SPECTROMETRY.
Tissue: Embryonic kidney.
[24]"Nuclear titin interacts with A- and B-type lamins in vitro and in vivo."
Zastrow M.S., Flaherty D.B., Benian G.M., Wilson K.L.
J. Cell Sci. 119:239-249(2006) [PubMed: 16410549] [Abstract]
Cited for: INTERACTION WITH LAMIN, SUBCELLULAR LOCATION.
[25]"Mechanical stress-strain sensors embedded in cardiac cytoskeleton: Z disk, titin, and associated structures."
Hoshijima M.
Am. J. Physiol. 290:H1313-H1325(2006) [PubMed: 16537787] [Abstract]
Cited for: REVIEW.
[26]"Improved titanium dioxide enrichment of phosphopeptides from HeLa cells and high confident phosphopeptide identification by cross-validation of MS/MS and MS/MS/MS spectra."
Yu L.-R., Zhu Z., Chan K.C., Issaq H.J., Dimitrov D.S., Veenstra T.D.
J. Proteome Res. 6:4150-4162(2007) [PubMed: 17924679] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-33938 AND SER-33942, PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-5304 AND SER-5306 (ISOFORM 6), MASS SPECTROMETRY.
Tissue: Cervix carcinoma.
[27]"Global proteomic profiling of phosphopeptides using electron transfer dissociation tandem mass spectrometry."
Molina H., Horn D.M., Tang N., Mathivanan S., Pandey A.
Proc. Natl. Acad. Sci. U.S.A. 104:2199-2204(2007) [PubMed: 17287340] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-22525 AND SER-22534, MASS SPECTROMETRY.
Tissue: Embryonic kidney.
[28]"ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage."
Matsuoka S., Ballif B.A., Smogorzewska A., McDonald E.R. III, Hurov K.E., Luo J., Bakalarski C.E., Zhao Z., Solimini N., Lerenthal Y., Shiloh Y., Gygi S.P., Elledge S.J.
Science 316:1160-1166(2007) [PubMed: 17525332] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-9203 AND THR-9207, MASS SPECTROMETRY.
Tissue: Embryonic kidney.
[29]"Automated phosphoproteome analysis for cultured cancer cells by two-dimensional nanoLC-MS using a calcined titania/C18 biphasic column."
Imami K., Sugiyama N., Kyono Y., Tomita M., Ishihama Y.
Anal. Sci. 24:161-166(2008) [PubMed: 18187866] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-4065 AND SER-4068, MASS SPECTROMETRY.
Tissue: Cervix carcinoma.
[30]"Tertiary structure of an immunoglobulin-like domain from the giant muscle protein titin: a new member of the I set."
Pfuhl M., Pastore A.
Structure 3:391-401(1995) [PubMed: 7613868] [Abstract]
Cited for: STRUCTURE BY NMR OF 33483-33579.
[31]"Structural basis for activation of the titin kinase domain during myofibrillogenesis."
Mayans O., van der Ven P.F.M., Wilm M., Mues A., Young P., Furst D.O., Wilmanns M., Gautel M.
Nature 395:863-869(1998) [PubMed: 9804419] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.0 ANGSTROMS) OF 32172-32489, FUNCTION, ENZYME REGULATION, INTERACTION WITH CALM, PHOSPHORYLATION AT TYR-32341, MUTAGENESIS OF LYS-32207 AND TYR-32341.
[32]"The three-dimensional structure of a type I module from titin: a prototype of intracellular fibronectin type III domains."
Goll C.M., Pastore A., Nilges M.
Structure 6:1291-1302(1998) [PubMed: 9782056] [Abstract]
Cited for: STRUCTURE BY NMR OF 22283-22385.
[33]"Structural evidence for a possible role of reversible disulphide bridge formation in the elasticity of the muscle protein titin."
Mayans O., Wuerges J., Canela S., Gautel M., Wilmanns M.
Structure 9:331-340(2001) [PubMed: 11525170] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.1 ANGSTROMS) OF 2073-2171, DISULFIDE BOND.
[34]"Palindromic assembly of the giant muscle protein titin in the sarcomeric Z-disk."
Zou P., Pinotsis N., Lange S., Song Y.-H., Popov A., Mavridis I., Mayans O.M., Gautel M., Wilmanns M.
Nature 439:229-233(2006) [PubMed: 16407954] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.44 ANGSTROMS) OF 1-196 IN COMPLEX WITH TCAP.
[35]"Structural analysis of the titin gene in hypertrophic cardiomyopathy: identification of a novel disease gene."
Satoh M., Takahashi M., Sakamoto T., Hiroe M., Marumo F., Kimura A.
Biochem. Biophys. Res. Commun. 262:411-417(1999) [PubMed: 10462489] [Abstract]
Cited for: VARIANT CMH9 LEU-740.
[36]"Tibial muscular dystrophy is a titinopathy caused by mutations in TTN, the gene encoding the giant skeletal-muscle protein titin."
Hackman P., Vihola A., Haravuori H., Marchand S., Sarparanta J., De Seze J., Labeit S., Witt C., Peltonen L., Richard I., Udd B.
Am. J. Hum. Genet. 71:492-500(2002) [PubMed: 12145747] [Abstract]
Cited for: INVOLVEMENT IN LIMB-GIRDLE MUSCULAR DYSTROPHY TYPE 2J, VARIANT TMD PRO-34315.
[37]"Titin mutations as the molecular basis for dilated cardiomyopathy."
Itoh-Satoh M., Hayashi T., Nishi H., Koga Y., Arimura T., Koyanagi T., Takahashi M., Hohda S., Ueda K., Nouchi T., Hiroe M., Marumo F., Imaizumi T., Yasunami M., Kimura A.
Biochem. Biophys. Res. Commun. 291:385-393(2002) [PubMed: 11846417] [Abstract]
Cited for: VARIANTS CMD1G MET-54; VAL-743; TYR-3799 AND ASN-4465, VARIANTS CYS-328; GLN-4084 AND PRO-4215, CHARACTERIZATION OF VARIANTS CMD1G MET-54 AND VAL-743.
[38]"Mutations of TTN, encoding the giant muscle filament titin, cause familial dilated cardiomyopathy."
Gerull B., Gramlich M., Atherton J., McNabb M., Trombitas K., Sasse-Klaassen S., Seidman J.G., Seidman C., Granzier H., Labeit S., Frenneaux M., Thierfelder L.
Nat. Genet. 30:201-204(2002) [PubMed: 11788824] [Abstract]
Cited for: VARIANT CMD1G ARG-976.
[39]"Tibial muscular dystrophy in a Belgian family."
Van den Bergh P.Y.K., Bouquiaux O., Verellen C., Marchand S., Richard I., Hackman P., Udd B.
Ann. Neurol. 54:248-251(2003) [PubMed: 12891679] [Abstract]
Cited for: VARIANT TMD ASN-34306.
[40]"Functional analysis of titin/connectin N2-B mutations found in cardiomyopathy."
Matsumoto Y., Hayashi T., Inagaki N., Takahashi M., Hiroi S., Nakamura T., Arimura T., Nakamura K., Ashizawa N., Yasunami M., Ohe T., Yano K., Kimura A.
J. Muscle Res. Cell Motil. 26:367-374(2005) [PubMed: 16465475] [Abstract]
Cited for: VARIANT CMD1G GLN-32996.
[41]"The kinase domain of titin controls muscle gene expression and protein turnover."
Lange S., Xiang F., Yakovenko A., Vihola A., Hackman P., Rostkova E., Kristensen J., Brandmeier B., Franzen G., Hedberg B., Gunnarsson L.G., Hughes S.M., Marchand S., Sejersen T., Richard I., Edstroem L., Ehler E., Udd B., Gautel M.
Science 308:1599-1603(2005) [PubMed: 15802564] [Abstract]
Cited for: VARIANT HMERF TRP-279, CHARACTERIZATION OF VARIANT HMERF TRP-279, INTERACTION WITH NBR1.
[42]"C-terminal titin deletions cause a novel early-onset myopathy with fatal cardiomyopathy."
Carmignac V., Salih M.A.M., Quijano-Roy S., Marchand S., Al Rayess M.M., Mukhtar M.M., Urtizberea J.A., Labeit S., Guicheney P., Leturcq F., Gautel M., Fardeau M., Campbell K.P., Richard I., Estournet B., Ferreiro A.
Ann. Neurol. 61:340-351(2007) [PubMed: 17444505] [Abstract]
Cited for: INVOLVEMENT IN EOMFC.
[43]"Patterns of somatic mutation in human cancer genomes."
Greenman C., Stephens P., Smith R., Dalgliesh G.L., Hunter C., Bignell G., Davies H., Teague J., Butler A., Stevens C., Edkins S., O'Meara S., Vastrik I., Schmidt E.E., Avis T., Barthorpe S., Bhamra G., Buck G., Choudhury B., Clements J., Cole J., Dicks E., Forbes S., Gray K., Halliday K., Harrison R., Hills K., Hinton J., Jenkinson A., Jones D., Menzies A., Mironenko T., Perry J., Raine K., Richardson D., Shepherd R., Small A., Tofts C., Varian J., Webb T., West S., Widaa S., Yates A., Cahill D.P., Louis D.N., Goldstraw P., Nicholson A.G., Brasseur F., Looijenga L., Weber B.L., Chiew Y.-E., DeFazio A., Greaves M.F., Green A.R., Campbell P., Birney E., Easton D.F., Chenevix-Trench G., Tan M.-H., Khoo S.K., Teh B.T., Yuen S.T., Leung S.Y., Wooster R., Futreal P.A., Stratton M.R.
Nature 446:153-158(2007) [PubMed: 17344846] [Abstract]
Cited for: VARIANTS [LARGE SCALE ANALYSIS] TYR-60; MET-115; CYS-328; THR-360; ILE-498; MET-799; ILE-811; HIS-922; ASP-937; THR-1081; ARG-1137; GLU-1201; ALA-1202; LEU-1295; ASP-1345; THR-1347; HIS-1350; LEU-1353; VAL-1393; CYS-1416; PRO-1441; VAL-1544; GLN-1572; GLY-1658; GLN-1664; ASP-1692; LEU-1744; GLY-1772; ILE-1907; HIS-1998; LEU-2107; THR-2118; THR-2164; TYR-2240; SER-2392; PHE-2432; ILE-2610; MET-2771; PHE-2823; ASN-2831; ILE-2930; ARG-3154; GLU-3191; LEU-3238; GLY-3250; MET-3261; GLN-3367; LYS-3482; PRO-3491; LYS-3570; VAL-3590; VAL-3762; PHE-3877; LEU-3965; PRO-4215; TRP-4238; PHE-4283; THR-4291; ASP-4303; GLU-4427; GLU-12310; HIS-12383; ALA-12469; CYS-12642; LYS-12657; GLU-12679; PHE-12720; CYS-12798; GLY-13049; LYS-13083; LEU-13096; ARG-13099; ALA-13297; MET-13399; THR-13418; VAL-13428; THR-13430; LYS-13434; ASN-13469; ASN-13495; SER-13785; HIS-13870; ILE-14109; GLN-14131; THR-14208; VAL-14728; THR-14999; THR-15021; VAL-15520; ILE-15555; GLN-15620; ILE-15629; CYS-15635; GLN-15700; PRO-15705; MET-15837; HIS-16058; ILE-16067; THR-16090; HIS-16195; CYS-16409; PRO-16424; MET-16629; ARG-16877; ASP-17060; VAL-17637; HIS-17838; ASN-17866; GLU-17906; ALA-18094; SER-18109; THR-18164; LEU-18221; THR-18222; GLN-18726; ALA-18835; LYS-18881; SER-18939; GLN-19000; GLN-19060; LYS-19091; SER-19224; ILE-19367; LYS-19392; SER-19480; GLY-19495; HIS-19665; ILE-19762; ARG-19947; MET-19956; GLN-19992; CYS-20057; LEU-20075; LYS-20179; THR-20198; VAL-20198; HIS-20331; THR-20408; LYS-20564; ILE-20718; PRO-20726; ASN-20892; ARG-20894; GLU-21125; SER-21403; CYS-21730; GLN-21747; ARG-21851; ARG-21851; ARG-21925; HIS-21995; VAL-22045; HIS-22149; ILE-22160; THR-22261; ASN-22306; HIS-22357; PRO-22408; HIS-22537; LEU-22584; PRO-22646; ALA-22670; ASP-22770; THR-22801; TRP-22823; GLN-22968; LEU-23074; PHE-23079; ASN-23282; TYR-23303; CYS-23306; SER-23515; GLN-23551; ASN-23807; ASN-23872; ALA-23891; HIS-23933; MET-23939; LEU-23952; GLY-24098; SER-24119; ILE-24133; ALA-24159; ALA-24239; 
LYS-24265; THR-24584; THR-24781; HIS-24799; HIS-24954; MET-24980; HIS-25659; THR-25679; ALA-25720; LYS-25821; LYS-25859; LYS-25879; VAL-25923; ILE-26045; GLU-26059; VAL-26134; CYS-26477; TYR-26843; ARG-27346; CYS-27652; VAL-27728; LEU-27754; THR-27755; VAL-27929; LEU-28132; GLN-28168; HIS-28538; THR-28572; THR-28948; VAL-28986; GLU-28993; VAL-28998; MET-29070; VAL-29090; CYS-29419; PRO-29479; LEU-29880; GLU-29976; GLY-30042; CYS-30107; PHE-30125; PRO-30211; THR-30412; SER-30617; ILE-30674; ILE-30809; ILE-30818; LYS-30825; THR-30856; ASP-30887; SER-30887; HIS-30897; HIS-30907; HIS-30946; PHE-31081; CYS-31107; GLY-31124; SER-31156; THR-31246; HIS-31330; ARG-31690; GLN-31724; ILE-31725; SER-31732; ILE-31886; CYS-32097; ASN-32171; ILE-32248; HIS-32281; HIS-32323; TRP-32411; VAL-32558; VAL-32610; VAL-32637; ALA-32922; ARG-32943; HIS-32953; LEU-33213; CYS-33242; MET-33387; ASP-33419; MET-33536; GLN-33568; LYS-33616; LEU-33620; VAL-33886; THR-33899; PRO-33904; ILE-33955 AND ALA-34115.

Web resources

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
X90568 mRNA. Translation: CAA62188.1.
X90569 mRNA. Translation: CAA62189.1.
AJ277892 Genomic DNA. Translation: CAD12455.1.
AJ277892 Genomic DNA. Translation: CAD12456.1.
AJ277892 Genomic DNA. Translation: CAD12457.1.
AJ277892 Genomic DNA. Translation: CAD12458.1.
AJ277892 Genomic DNA. Translation: CAD12459.1.
AC009948 Genomic DNA. No translation available.
AC010680 Genomic DNA. No translation available.
FJ695199 Genomic DNA. No translation available.
AC023270 Genomic DNA. Translation: AAX88844.1.
BC013396 mRNA. Translation: AAH13396.1.
BC058824 mRNA. Translation: AAH58824.1. Sequence problems.
BC070170 mRNA. Translation: AAH70170.1. Sequence problems.
BC107797 mRNA. Translation: AAI07798.1.
X98114 mRNA. Translation: CAA66795.1.
X98115 mRNA. Translation: CAA66796.1.
X83270 mRNA. Translation: CAA58243.1.
AF058332 Genomic DNA. Translation: AAD22603.1.
AF058332 Genomic DNA. Translation: AAD22604.1.
AF525413 mRNA. Translation: AAP80791.1.
DQ248309 mRNA. Translation: ABB55264.1.
X64698 mRNA. Translation: CAA45939.1.
X64699 Genomic DNA. Translation: CAA45940.1.
X64697 mRNA. Translation: CAA45938.1.
X69490 mRNA. Translation: CAA49245.1.
AL713647 mRNA. Translation: CAD28458.1.
IPI: IPI00023283.
IPI00179357.
IPI00375498.
IPI00759542.
IPI00759613.
IPI00759637.
IPI00759754.
IPI00939967.
PIR: I38344.
I38346.
RefSeq: NP_596870.2. NM_133379.3.
UniGene: Hs.134602.

3D structure databases

PDBe
RCSB PDB
PDBj
Entry  Method  Resolution (Å)  Chain        Positions
1BPV   NMR     -               A            22283-22385
1G1C   X-ray   2.10            A/B          2073-2171
1NCT   NMR     -               A            33483-33579
1NCU   NMR     -               A            33483-33579
1TIT   NMR     -               A            12677-12765
1TIU   NMR     -               A            12677-12765
1TKI   X-ray   2.00            A/B          32172-32492
1TNM   NMR     -               A            33489-33579
1TNN   NMR     -               A            33489-33579
1WAA   X-ray   1.80            A/B/C/D/E/F  12677-12765
1YA5   X-ray   2.44            A/B          1-196
2A38   X-ray   2.00            A/B/C        1-194
2BK8   X-ray   1.69            A            32497-32590
2F8V   X-ray   2.75            A/B/C/D      1-196
2ILL   X-ray   2.20            A            31854-32047
2J8H   X-ray   1.99            A            31854-32047
2J8O   X-ray   2.49            A/B          31854-32047
2NZI   X-ray   2.90            A/B          31854-32155
2RQ8   NMR     -               A            12677-12765
2WP3   X-ray   1.48            T            34252-34350
2WWK   X-ray   1.70            T            34252-34350
2WWM   X-ray   2.30            D/T          34252-34350
2Y9R   X-ray   1.90            T            34252-34350
3B43   X-ray   3.30            A            7945-8511
3KNB   X-ray   1.40            A            34253-34350
3LCY   X-ray   2.50            A/B/C/D      31456-31649
3LPW   X-ray   1.65            A/B          22877-23070
ProteinModelPortal: Q8WZ42.

Protein-protein interaction databases

DIP: DIP-29145N.
DIP-33449N.
IntAct: Q8WZ42. 17 interactions.
MINT: MINT-2881875.

PTM databases

PhosphoSite: Q8WZ42.

Polymorphism databases

DMDM: 108861911.

Proteomic databases

PRIDE: Q8WZ42.

Genome annotation databases

Ensembl: ENST00000360870; ENSP00000354117; ENSG00000155657.
GeneID: 7273.
KEGG: hsa:7273.

Organism-specific databases

GeneCards: GC02M179355.
H-InvDB: HIX0002636.
HGNC: HGNC:12403. TTN.
HPA: CAB022682.
HPA007042.
MIM: 188840. gene.
600334. phenotype.
603689. phenotype.
604145. phenotype.
608807. phenotype.
611705. phenotype.
613765. phenotype.
neXtProt: NX_Q8WZ42.
Orphanet: 140922. Autosomal recessive limb-girdle muscular dystrophy type 2J.
154. Familial isolated dilated cardiomyopathy.
155. Familial isolated hypertrophic cardiomyopathy.
178464. Hereditary myopathy with early respiratory failure.
609. Tibial muscular dystrophy.
PharmGKB: PA37067.

Phylogenomic databases

eggNOG: prNOG15623.
HOVERGEN: HBG080472.

Enzyme and pathway databases

Reactome: REACT_17044. Muscle contraction.
REACT_604. Hemostasis.

Gene expression databases

Genevestigator: Q8WZ42.
GermOnline: ENSG00000155657. Homo sapiens.

Family and domain databases

InterPro: IPR022682. Calpain_domain_III.
IPR003961. Fibronectin_type3.
IPR007110. Ig-like.
IPR013783. Ig-like_fold.
IPR013098. Ig_I-set.
IPR003599. Ig_sub.
IPR003598. Ig_sub2.
IPR011009. Kinase-like_dom.
IPR004168. PPAK_motif.
IPR000719. Prot_kinase_cat_dom.
IPR012337. RNaseH-like_dom.
IPR017442. Se/Thr_kinase-like_dom.
IPR002290. Ser/Thr_kinase_dom.
IPR015129. Titin_Z.
IPR008266. Tyr_kinase_AS.
Gene3D: G3DSA:2.60.40.10. Ig-like_fold. 299 hits.
KO: K12567.
Pfam: PF00041. fn3. 132 hits.
PF07679. I-set. 162 hits.
PF00069. Pkinase. 1 hit.
PF02818. PPAK. 15 hits.
PF09042. Titin_Z. 6 hits.
SMART: SM00060. FN3. 132 hits.
SM00409. IG. 95 hits.
SM00408. IGc2. 65 hits.
SM00220. S_TKc. 1 hit.
SUPFAM: SSF49265. FN_III-like. 132 hits.
SSF56112. Kinase_like. 1 hit.
SSF49758. Peptidase_C2. 1 hit.
SSF53098. RNaseH_fold. 1 hit.
PROSITE: PS50853. FN3. 132 hits.
PS50835. IG_LIKE. 143 hits.
PS00107. PROTEIN_KINASE_ATP. False negative.
PS50011. PROTEIN_KINASE_DOM. 1 hit.
PS00108. PROTEIN_KINASE_ST. False negative.
PS00625. RCC1_1. False negative.
PS00626. RCC1_2. False negative.
PS50012. RCC1_3. False negative.
PS50005. TPR. False negative.
PS50293. TPR_REGION. False negative.
PS00678. WD_REPEATS_1. False negative.
PS50082. WD_REPEATS_2. False negative.
PS50294. WD_REPEATS_REGION. False negative.

Other

NextBio: 28431.

Entry information

Entry name: TITIN_HUMAN
Accession
Primary (citable) accession number: Q8WZ42
Secondary accession number(s): Q10465, Q10466, Q15598, Q2XUS3, Q32Q60, Q4U1Z6, Q4ZG20, Q6NSG0, Q6PDB1, Q6PJP0, Q7KYM2, Q7KYN4, Q7KYN5, Q7LDM3, Q7Z2X3, Q8TCG8, Q8WZ51, Q8WZ52, Q8WZ53, Q8WZB3, Q92761, Q92762, Q9UD97, Q9UP84, Q9Y6L9
Entry history
Integrated into UniProtKB/Swiss-Prot: June 13, 2006
Last sequence update: October 19, 2011
Last modified: January 25, 2012
This is version 101 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human and mouse protein kinases

Human and mouse protein kinases: classification and index

Human chromosome 2

Human chromosome 2: entries, gene names and cross-references to MIM

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families