Q9I7U4 (TITIN_DROME) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 108.


Names and origin

Protein names
Recommended name: Titin
Alternative name(s): D-Titin; Kettin
Gene names
Name: sls
Synonyms: titin
ORF Names: CG1915
Organism: Drosophila melanogaster (Fruit fly) [Reference proteome]
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora

Protein attributes

Sequence length: 18141 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Key component in the assembly and functioning of adult and embryonic striated muscles and muscle tendons. By providing connections at the level of individual microfilaments, it contributes to the fine balance of forces between the two halves of the sarcomere. The size and extensibility of the cross-links are the main determinants of the sarcomere extensibility properties of muscle. In non-muscle cells, seems to play a role in chromosome condensation and chromosome segregation during mitosis. Might link the lamina network to chromatin, to nuclear actin, or to both during interphase. Ref.1 Ref.2 Ref.5 Ref.6 Ref.7 Ref.8 Ref.9

Subcellular location

Cytoplasm. Nucleus. Chromosome. Note: Uniformly distributed along condensed mitotic chromosomes. Ref.5 Ref.7 Ref.8 Ref.9

Tissue specificity

Expressed in the mesoderm at stage 11, several hours before myoblast fusion, and persists in most muscle cells, somatic, visceral and pharyngeal muscles and their precursors, until the third instar. Also expressed in adult flight muscle Z-disks. Ref.1 Ref.2 Ref.5 Ref.6 Ref.8 Ref.9

Developmental stage

Expressed both maternally and zygotically. Ref.9

Disruption phenotype

Flies exhibit chromosome undercondensation, chromosome breakage, loss of diploidy, and premature sister chromatid separation. They also exhibit defects in myoblast fusion, muscle organization and gut morphogenesis. Ref.9

Sequence similarities

Belongs to the protein kinase superfamily. CAMK Ser/Thr protein kinase family.

Contains 5 fibronectin type-III domains.

Contains 53 Ig-like (immunoglobulin-like) domains.

Contains 1 SH3 domain.

Contains 7 TPR repeats.

Sequence caution

The sequence AAM11102.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.

Ontologies

Keywords
   Biological process: Cell cycle, Cell division, Mitosis
   Cellular component: Chromosome, Cytoplasm, Nucleus
   Coding sequence diversity: Alternative splicing
   Domain: Coiled coil, Immunoglobulin domain, Repeat, SH3 domain, TPR repeat
   PTM: Disulfide bond
   Technical term: Complete proteome, Reference proteome
Gene Ontology (GO)
   Biological_process:

locomotion

Inferred from mutant phenotype Ref.1. Source: UniProtKB

mesoderm development

Inferred from expression pattern PubMed 11486054. Source: FlyBase

mitotic chromosome condensation

Inferred from mutant phenotype Ref.9. Source: UniProtKB

muscle attachment

Inferred from mutant phenotype Ref.1. Source: UniProtKB

muscle organ development

Traceable author statement PubMed 12111720. Source: FlyBase

myoblast fusion

Inferred from mutant phenotype Ref.2. Source: UniProtKB

regulation of hemocyte proliferation

Inferred from mutant phenotype PubMed 20688956. Source: FlyBase

sarcomere organization

Inferred from mutant phenotype Ref.1. Source: UniProtKB

sister chromatid cohesion

Inferred from mutant phenotype Ref.9. Source: UniProtKB

skeletal muscle tissue development

Inferred from mutant phenotype Ref.1 Ref.2. Source: UniProtKB

somatic muscle development

Inferred from expression pattern Ref.5. Source: BHF-UCL

visceral muscle development

Inferred from expression pattern Ref.5. Source: BHF-UCL

   Cellular_component:

Z disc

Inferred from direct assay Ref.7. Source: UniProtKB

condensed nuclear chromosome

Inferred from direct assay Ref.5. Source: UniProtKB

microtubule associated complex

Inferred from direct assay PubMed 18433294. Source: FlyBase

sarcomere

Inferred from direct assay Ref.1 Ref.5. Source: UniProtKB

   Molecular_function:

actin binding

Inferred from direct assay Ref.7. Source: UniProtKB

structural constituent of muscle

Inferred from mutant phenotype Ref.1. Source: UniProtKB
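
Each GO annotation in this entry pairs a term with an evidence statement. As a minimal sketch, the spelled-out statements can be mapped to the standard three-letter GO evidence codes and the terms grouped by code. Only a handful of the annotations listed here are included, and the names `EVIDENCE_CODES`, `ANNOTATIONS` and `terms_by_evidence` are illustrative, not part of any UniProt tooling.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Spelled-out evidence statements as they appear in this entry, mapped to the
# standard GO evidence codes.
EVIDENCE_CODES: Dict[str, str] = {
    "Inferred from mutant phenotype": "IMP",
    "Inferred from expression pattern": "IEP",
    "Inferred from direct assay": "IDA",
    "Traceable author statement": "TAS",
}

# (aspect, term, evidence) triples; a subset of the annotations in this entry.
ANNOTATIONS: List[Tuple[str, str, str]] = [
    ("Biological_process", "locomotion", "Inferred from mutant phenotype"),
    ("Biological_process", "mesoderm development", "Inferred from expression pattern"),
    ("Biological_process", "muscle organ development", "Traceable author statement"),
    ("Cellular_component", "Z disc", "Inferred from direct assay"),
    ("Molecular_function", "actin binding", "Inferred from direct assay"),
    ("Molecular_function", "structural constituent of muscle", "Inferred from mutant phenotype"),
]


def terms_by_evidence(annotations: List[Tuple[str, str, str]]) -> Dict[str, List[str]]:
    """Group GO term names by evidence code, preserving input order."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for _aspect, term, evidence in annotations:
        grouped[EVIDENCE_CODES[evidence]].append(term)
    return dict(grouped)


if __name__ == "__main__":
    for code, terms in sorted(terms_by_evidence(ANNOTATIONS).items()):
        print(code, terms)
```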


Alternative products

This entry describes 5 isoforms produced by alternative splicing.
Isoform C Ref.7 (identifier: Q9I7U4-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Note: No experimental confirmation available.
Isoform A Ref.1 Ref.9 (identifier: Q9I7U4-2)

Also known as: Ket; Kettin

The sequence of this isoform differs from the canonical sequence as follows:
     4794-4796: KRR → SKD
     4797-18074: Missing.
Isoform B Ref.9 (identifier: Q9I7U4-3)

The sequence of this isoform differs from the canonical sequence as follows:
     2-57: QRQNPNPYQQ...QRSQVTTQRQ → NKVSSLYNDV...RSLSLLLALG
     586-663: EPIPGPEIIY...YLNGRPLAAS → G
     1106-1118: GEAQQQATMIVET → A
     4751-4753: MIF → QPG
     4754-4835: Missing.
     6626-6634: EKEQGVPPQ → GK
     7030-7213: Missing.
     7214-7218: GNPIP → NVKIQ
     7577-7592: TSQIIESHEAITHVKI → RSTLSLFNYITFT
     9111-9114: NVSE → NVSEVNVYEQTKAIQDQNKHGLFVKVSKNSDTSKAYLTTIQSTFLKEDILPKPNILQ
     9329-9525: VKGNKQEFTK...EVRVVETVTE → GGIPYVQITV...TRLIKKVKGN
     9526-11249: Missing.
     14333-14478: Missing.
     16100-16100: K → KTKLYSSRKRRSRRSPVEEAADELKLQQTVVE
     16492-16498: PGPEDKP → KLLPKQK
     16499-16573: Missing.
     17341-17344: VVDD → DVVS
     17345-17484: Missing.
     17686-18141: GLDIEKYTLE...PKSKKKKSSK → DEDEVKRSAE...TENEQDCICL
Isoform D Ref.9 (identifier: Q9I7U4-4)

The sequence of this isoform differs from the canonical sequence as follows:
     9335-11180: Missing.
     14438-14457: STKVPNEETPVQEQYAKVNV → TICKGQC
Isoform E Ref.9 (identifier: Q9I7U4-5)

The sequence of this isoform differs from the canonical sequence as follows:
     16100-16100: K → KVTEETSDQTVQLKKKKKPQKPVEEAADELKLQQTVVE
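
Each isoform is described as a list of edits against the canonical sequence (isoform C), using 1-based inclusive positions. The sketch below shows one way such edits could be applied, and how an isoform's length follows from the edits alone. The helper names `apply_edits` and `isoform_length` are hypothetical, the demonstration string is a toy sequence rather than titin, and isoform B is left out for brevity.

```python
from typing import Dict, List, Optional, Tuple

# One edit = (start, end, replacement), 1-based inclusive coordinates on the
# canonical sequence; replacement None means "Missing" in that isoform.
Edit = Tuple[int, int, Optional[str]]

CANONICAL_LENGTH = 18141  # isoform C, as stated in this entry

# Edits copied from the isoform descriptions in this entry.
ISOFORM_EDITS: Dict[str, List[Edit]] = {
    "A": [(4794, 4796, "SKD"), (4797, 18074, None)],
    "D": [(9335, 11180, None), (14438, 14457, "TICKGQC")],
    "E": [(16100, 16100, "KVTEETSDQTVQLKKKKKPQKPVEEAADELKLQQTVVE")],
}


def apply_edits(canonical: str, edits: List[Edit]) -> str:
    """Return the sequence obtained by applying non-overlapping edits."""
    seq = canonical
    # Apply from the C-terminal end backwards so that earlier coordinates
    # are not shifted by insertions or deletions further along the chain.
    for start, end, replacement in sorted(edits, key=lambda e: e[0], reverse=True):
        seq = seq[: start - 1] + (replacement or "") + seq[end:]
    return seq


def isoform_length(canonical_length: int, edits: List[Edit]) -> int:
    """Predict the isoform length from the edits, without the sequence."""
    delta = sum(len(repl or "") - (end - start + 1) for start, end, repl in edits)
    return canonical_length + delta


if __name__ == "__main__":
    # Toy sequence, not titin: replace 2-3 with "xy", delete 5-8.
    print(apply_edits("ABCDEFGHIJ", [(2, 3, "xy"), (5, 8, None)]))  # AxyDIJ
    for name, edits in ISOFORM_EDITS.items():
        print(name, isoform_length(CANONICAL_LENGTH, edits))
```

With the edits as listed, this arithmetic gives 4863, 16282 and 18178 residues for isoforms A, D and E. These are derived figures, not values quoted in the entry.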

Sequence annotation (Features)

Feature key         Position(s)    Length  Description              Feature identifier

Molecule processing

Chain               1 – 18141      18141   Titin                    PRO_0000247656

Regions

Domain              86 – 177       92      Ig-like 1
Domain              255 – 343      89      Ig-like 2
Domain              372 – 461      90      Ig-like 3
Domain              471 – 559      89      Ig-like 4
Domain              618 – 708      91      Ig-like 5
Domain              751 – 842      92      Ig-like 6
Domain              890 – 981      92      Ig-like 7
Domain              1024 – 1115    92      Ig-like 8
Domain              1158 – 1249    92      Ig-like 9
Domain              1291 – 1381    91      Ig-like 10
Domain              1424 – 1515    92      Ig-like 11
Domain              1558 – 1643    86      Ig-like 12
Domain              1691 – 1781    91      Ig-like 13
Domain              1824 – 1917    94      Ig-like 14
Domain              1958 – 2050    93      Ig-like 15
Domain              2089 – 2180    92      Ig-like 16
Domain              2222 – 2313    92      Ig-like 17
Domain              2356 – 2449    94      Ig-like 18
Domain              2488 – 2581    94      Ig-like 19
Domain              2622 – 2715    94      Ig-like 20
Domain              2754 – 2844    91      Ig-like 21
Domain              2891 – 2983    93      Ig-like 22
Domain              3029 – 3116    88      Ig-like 23
Domain              3130 – 3221    92      Ig-like 24
Domain              3263 – 3354    92      Ig-like 25
Domain              3401 – 3494    94      Ig-like 26
Domain              3539 – 3625    87      Ig-like 27
Domain              3676 – 3767    92      Ig-like 28
Domain              3811 – 3901    91      Ig-like 29
Repeat              3910 – 3944    35      TPR 1
Domain              3954 – 4047    94      Ig-like 30
Domain              4092 – 4181    90      Ig-like 31
Domain              4394 – 4482    89      Ig-like 32
Repeat              4403 – 4438    36      TPR 2
Domain              4497 – 4585    89      Ig-like 33
Domain              4604 – 4692    89      Ig-like 34
Domain              4703 – 4791    89      Ig-like 35
Repeat              5575 – 5613    39      TPR 3
Domain              6536 – 6624    89      Ig-like 36
Domain              6633 – 6728    96      Ig-like 37
Domain              6741 – 6830    90      Ig-like 38
Domain              6841 – 6929    89      Ig-like 39
Domain              6942 – 7034    93      Ig-like 40
Domain              7066 – 7151    86      Ig-like 41
Domain              7189 – 7279    91      Ig-like 42
Repeat              11872 – 11905  34      TPR 4
Repeat              13566 – 13599  34      TPR 5
Repeat              14904 – 14936  33      TPR 6
Domain              16396 – 16470  75      SH3
Domain              16501 – 16590  90      Ig-like 43
Domain              16625 – 16719  95      Ig-like 44
Domain              16728 – 16811  84      Ig-like 45
Domain              16822 – 16916  95      Ig-like 46
Domain              16919 – 17001  83      Ig-like 47
Domain              17007 – 17091  85      Ig-like 48
Domain              17097 – 17180  84      Ig-like 49
Domain              17184 – 17270  87      Ig-like 50
Domain              17277 – 17363  87      Ig-like 51
Domain              17374 – 17467  94      Fibronectin type-III 1
Domain              17473 – 17558  86      Ig-like 52
Domain              17563 – 17653  91      Ig-like 53
Domain              17660 – 17755  96      Fibronectin type-III 2
Repeat              17694 – 17728  35      TPR 7
Domain              17760 – 17861  102     Fibronectin type-III 3
Domain              17862 – 17958  97      Fibronectin type-III 4
Domain              17982 – 18078  97      Fibronectin type-III 5
Coiled coil         4204 – 4229    26      Potential
Coiled coil         7621 – 7663    43      Potential
Compositional bias  2 – 60         59      Gln-rich
Compositional bias  4979 – 5300    322     Gln-rich
Compositional bias  5299 – 6416    1118    Glu-rich
Compositional bias  6294 – 6318    25      Lys-rich
Compositional bias  7346 – 7451    106     Glu-rich
Compositional bias  9327 – 16356   7030    Glu-rich
Compositional bias  9386 – 11326   1941    Lys-rich
Compositional bias  11740 – 12109  370     Lys-rich
Compositional bias  14784 – 15234  451     Lys-rich
Compositional bias  17528 – 17651  124     Thr-rich
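
In the feature table, the length of every region equals end − start + 1, and some features sit inside others (for example, a TPR repeat within an Ig-like domain). A minimal sketch of both checks follows, using five rows copied from the table. The `Feature` tuple and the helper names are assumptions made for illustration only.

```python
from typing import List, NamedTuple, Tuple


class Feature(NamedTuple):
    key: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    length: int
    description: str


# A few rows copied from the Regions table of this entry.
FEATURES: List[Feature] = [
    Feature("Domain", 4394, 4482, 89, "Ig-like 32"),
    Feature("Repeat", 4403, 4438, 36, "TPR 2"),
    Feature("Domain", 4497, 4585, 89, "Ig-like 33"),
    Feature("Domain", 17660, 17755, 96, "Fibronectin type-III 2"),
    Feature("Repeat", 17694, 17728, 35, "TPR 7"),
]


def length_is_consistent(feature: Feature) -> bool:
    """Inclusive coordinates: length must equal end - start + 1."""
    return feature.length == feature.end - feature.start + 1


def overlapping_pairs(features: List[Feature]) -> List[Tuple[str, str]]:
    """Return description pairs of features whose ranges overlap."""
    ordered = sorted(features, key=lambda f: (f.start, f.end))
    pairs: List[Tuple[str, str]] = []
    for i, first in enumerate(ordered):
        for second in ordered[i + 1:]:
            if second.start > first.end:
                break  # sorted by start, so no later feature can overlap
            pairs.append((first.description, second.description))
    return pairs


if __name__ == "__main__":
    print(all(length_is_consistent(f) for f in FEATURES))
    print(overlapping_pairs(FEATURES))
```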

Amino acid modifications

Disulfide bond      393 ↔ 445        By similarity
Disulfide bond      1312 ↔ 1365      By similarity
Disulfide bond      1446 ↔ 1499      By similarity
Disulfide bond      1579 ↔ 1632      By similarity
Disulfide bond      1846 ↔ 1899      By similarity
Disulfide bond      2111 ↔ 2164      By similarity
Disulfide bond      2775 ↔ 2828      By similarity
Disulfide bond      3152 ↔ 3205      By similarity
Disulfide bond      3560 ↔ 3613      By similarity
Disulfide bond      3698 ↔ 3751      By similarity
Disulfide bond      3832 ↔ 3885      By similarity
Disulfide bond      3976 ↔ 4029      By similarity
Disulfide bond      4625 ↔ 4676      By similarity
Disulfide bond      6557 ↔ 6608      By similarity
Disulfide bond      6964 ↔ 7016      By similarity
Disulfide bond      16940 ↔ 16989    By similarity
Disulfide bond      17494 ↔ 17542    By similarity

Natural variations

Alternative sequence2 – 5756QRQNP…TTQRQ → NKVSSLYNDVMNIFQSFIKL KMDINVVQERLKQEQRQKEQ RERDARDQAEREKAIKEAEA KERLHREEQSRLENQRQQAA IEQAQRELAARELALREQAV REEEARLQAIREQATREQLA REQAAREEELRIQSLRDIAR REEEVRLQNIRDEETRIRRE EEERIRRENESRSKREEEAR IQREEITRLQTLRDQVDQQR IVTENIRKDIQVNSIFTELR YASPLFTRPLKDAVSREGDR FVFECEVTGTPEPAVEWFKD GISIQTNSDYKTTFDKGICR LVIEETFAADSARFSCRASN LVGTCDTNATLSVRENAAEV QLVPPRILRFLQSGKATEGS SFQFACVVAGVPLPTVQWFK NDKCIDDSPDYVISYNNGEA TLKFEEVFLEDDAVYTCSAS NPAGIEHCSASLIVEPLEPT ELPSFKVPLSNAMARVGQKI KLEAIVGGIPRPEVYWLHNG KPFQPRDSKYEYGRVTLIIP QAYPNDAGSYVLSAKNLAGE AYTSCNVIVKGRLPNETSDS EMASDIEPIKPAVHLPLKDV SIFEGKPVRLDCVIVGQPEP EVIWYHNERPVKESADVQLL FQGDRCSLIIQEVYQEDAGH YKVVAINSAGEASSSCELKV TPLNQAEPATRAQAERQSLP KDSQPKFERLLSDVLADEGE QVVLEVQASGDQPLTAQWFL TNKELQLDQRITTQSDSELG VFKLILNNVSGDDKGVYTVK VTNPAGDAKCFSHLIVKSVN APENRRSSQSSVEIIERHQC PEFKELFSDKQGEIDEVIKF ECIVKGKPTPKVHWFFNDQP VHGHNFLVSTSGERQVLTIQ KLTHDAVGKISCVAENEAGK ATCVAFLNIRGSGLPASSDV QTVSQEHNTESSRVTIKKQT FTTTSTSQVNSYEGNAPQTE VHHSSAHIDQSLKQLGQQRP EIVESHHYQELHKSKEMSSP TVQQKSFSFIQSSGANGQSA VAIPDSPTRLRREIAPRFTT PLSGKIVDQGADVSMEAIYD GFPSPEIKVEKNGGQLFEDA HTRISNKCNRVTIELKQVGV GDAGRYAVTASNTVGQSTST ADLVVKKTIFPPVFGRRLQA QVSKKGEKLTMEVEVTGLPE PTVTWLKDDKPLKDAGISEH RLLAQGNSYRLIIEKAQTTD SGKYMVRATNAGGEAKSIAD CAILEPSPERLQEVVKTIVY ETGPVAPASEFKTEVQKQIQ NSENQQSYQNSQTEVTSAND LHGLSESKVITEHRCTTEAT MRLEHKSNYLDLPELTTRPK TPTTNDTITITTNTATVSMD QPDLTQPTITNTTTTNTTKV PPPVPPKPCTPVVATTFGQQ QQQPPQTLTSTRYEQSEHKS TTSSSSFDYFKKIDEETIIQ RPNPLTFKPLDTVVRQPQAQ SLAEELRSLNLIPGDAPEFC YSPKTERSEPKIPLITEKIK ILSEVQPKEPPPQGGVPVFP PPLGLQTVQHESTTTKEVKV EYGQPIVRPAAVLATPAQQN PRSPSPKPSAEGVAMSRLWT PTGVTGYTSDVEQKSEKTVI SKLATPTPTKELNAPFLVNQ IAKSIPPTTAPVTHLVNVEL EPGTPPEICFAPKVEETRRR SLVETMEQKLEQNLIQGPSK VLPHSVPTLTPNTAAPVQPK PLGNTTYRPPPPVLPTRLGV YESDYESDRYKYSGSESDVE PGIRKQPQLTTMETSFKSSG YTADTEEHSSYRKSESSFYE TKSSSTMGGAPQLQTQFPKL QPEPPAPIYFTAKPQPQVPP QVPPSQSNVSSQEAKVRIRS SLCLSVCRYYCRSLSLLLAL G in isoform B. Ref.9
VSP_052098
Alternative sequence  586 – 663      78      EPIPG…PLAAS → G in isoform B. Ref.9
VSP_052099
Alternative sequence  1106 – 1118    13      GEAQQ…MIVET → A in isoform B. Ref.9
VSP_052100
Alternative sequence  4751 – 4753    3       MIF → QPG in isoform B. Ref.9
VSP_052101
Alternative sequence  4754 – 4835    82      Missing in isoform B. Ref.9
VSP_052102
Alternative sequence  4794 – 4796    3       KRR → SKD in isoform A. Ref.1 Ref.9
VSP_052103
Alternative sequence  4797 – 18074   13278   Missing in isoform A. Ref.1 Ref.9
VSP_052104
Alternative sequence  6626 – 6634    9       EKEQGVPPQ → GK in isoform B. Ref.9
VSP_052105
Alternative sequence  7030 – 7213    184     Missing in isoform B. Ref.9
VSP_052106
Alternative sequence  7214 – 7218    5       GNPIP → NVKIQ in isoform B. Ref.9
VSP_052107
Alternative sequence  7577 – 7592    16      TSQII…THVKI → RSTLSLFNYITFT in isoform B. Ref.9
VSP_052108
Alternative sequence  9111 – 9114    4       NVSE → NVSEVNVYEQTKAIQDQNKH GLFVKVSKNSDTSKAYLTTI QSTFLKEDILPKPNILQ in isoform B. Ref.9
VSP_052109
Alternative sequence  9329 – 9525    197     VKGNK…ETVTE → GGIPYVQITVEEAKLVNTRA DVEENITSVTTEQPNMCIAA SIQLPAIEEKKLENALQTPQ FASESILKTSPQISRNAHFE TRTHEEEYSTTTESLVTTQA LRDDIDSTQKNVMQDVQMYK HFATKSLDKTVKVETDTNQA TDTTHIKQKTPTHKLASSTT QITESPEIIKTIETISEDGS PSKKMIRTRLIKKVKGN in isoform B.
VSP_052110
Alternative sequence  9335 – 11180   1846    Missing in isoform D.
VSP_052111
Alternative sequence  9526 – 11249   1724    Missing in isoform B.
VSP_052112
Alternative sequence  14333 – 14478  146     Missing in isoform B.
VSP_052113
Alternative sequence  14438 – 14457  20      STKVP…AKVNV → TICKGQC in isoform D.
VSP_052114
Alternative sequence  16100          1       K → KTKLYSSRKRRSRRSPVEEA ADELKLQQTVVE in isoform B.
VSP_052115
Alternative sequence  16100          1       K → KVTEETSDQTVQLKKKKKPQ KPVEEAADELKLQQTVVE in isoform E.
VSP_052116
Alternative sequence  16492 – 16498  7       PGPEDKP → KLLPKQK in isoform B.
VSP_052117
Alternative sequence  16499 – 16573  75      Missing in isoform B.
VSP_052118
Alternative sequence  17341 – 17344  4       VVDD → DVVS in isoform B.
VSP_052119
Alternative sequence  17345 – 17484  140     Missing in isoform B.
VSP_052120
Alternative sequence17686 – 18141456GLDIE…KKSSK → DEDEVKRSAENAWGKLLPHQ KKYFEVSMSQLCFYATVFTT PVDQVRPKETPVKRMLPAAR KKPKNQKKKAVPKWRYRSRK RPMAKPTPNPNNPAMSTAFI GFLREYQRRNTIVDVKKRLQ RAAKMWSKLSKAQKNKFRTA VSIAAYSLPSADFPPAPATC HLRRRFAWQHIGHVHRSITQ NCQSRAAPSLGDFSTLLTTP HWAKFQVLKVCNWIFQVQGN PFRSPLLTAHFYLLSFKKHT KLPGHLRGKFLQTGRIGLYL GFYNVGYSEQLRQFVCDEQL YVQIMAVVNLLASWLYLCFS HRWIYDQFVILLYVPLYIYF LILRRHLTKLLNECAGLHKS MQMIMGDRLCAKIHRECIYT LLLIIMSILRLLWQIRIYSV YQSIFIFGVAFIYHFELLFF GNYLIWLSCIFRSLNVFLAK DMRSDRLQILKGVLRQQTII WRVHRTVSRYFALHIISFMI QPGIKICIILKCSGIQMNAQ IISLILHLLLLGLFMIIASN LQKQHRTFQKSYIGLKDDPN YFVLKSWRLLQNRTLPQAFG VTFLRKREKVQYKQDVITML LRFSDSQQVYQQRANCCRFL PAVFITMILFLHKLFSYELQ KESKLVNLLQIEKRTLKSEI ELWNENLTSIYIFLTLVCSL VNKNELWKLINEAQLTYKQL KSLLGKHLVLKCSYDVLIHG LLLILLLAVMVVDIIFFNWP KASGERNTVTLTELHQIFDY LMGIPRLLFVLIMAMRILYH LISAGWLQCLGMLRLQRNLK LYQFQLRSIFYNQKCENILA GHYFKVSYMYFLWMMPFRIA ELMQFLKYDYDELVQKQKSQ EDLEDEAIWEGEENSRQQNL QELLMKPLLILSWHFALWML LLAAAYTQQKEYSTLMAKSW NFKSDENGCEMKEFLDEICW TGHAFKQLDILDLLVCTENE QDCICL in isoform B.
VSP_052121

Experimental info

Sequence conflict   7              1       N → S in CAB76253. Ref.6
Sequence conflict   382            1       E → G in AAC23966. Ref.5
Sequence conflict   568            1       N → S in BAA90301. Ref.1
Sequence conflict   568            1       N → S in AAC23966. Ref.5
Sequence conflict   568            1       N → S Ref.6
Sequence conflict   722            1       P → S in CAB76253. Ref.6
Sequence conflict   875 – 880      6       QQRRHV → PASVVM in AAC23966. Ref.5
Sequence conflict   1059           1       E → G in CAB76253. Ref.6
Sequence conflict   1345           1       F → L in CAB76253. Ref.6
Sequence conflict   1405           1       E → G in BAA90301. Ref.1
Sequence conflict   1691 – 1692    2       PV → NS in CAB76253. Ref.6
Sequence conflict   1779           1       S → P in CAB76253. Ref.6
Sequence conflict   1905           1       L → H in BAA90301. Ref.1
Sequence conflict   1905           1       L → H in CAB76253. Ref.6
Sequence conflict   1952           1       Q → H in CAB76253. Ref.6
Sequence conflict   2050           1       T → TVT in CAB76253. Ref.6
Sequence conflict   2104           1       G → R in CAB76253. Ref.6
Sequence conflict   2661           1       N → S in CAB76253. Ref.6
Sequence conflict   2934           1       M → V in CAB76253. Ref.6
Sequence conflict   3091 – 3094    4       VLGT → EGVR in AAF61414. Ref.9
Sequence conflict   3091 – 3094    4       VLGT → EGVR in AAG40155. Ref.9
Sequence conflict   3174           1       R → C in CAB76253. Ref.6
Sequence conflict   3174           1       R → C in CAA09971. Ref.7
Sequence conflict   3303           1       G → D in AAF61414. Ref.9
Sequence conflict   3303           1       G → D in AAG40155. Ref.9
Sequence conflict   3379 – 3381    3       ANL → RGM in CAB76253. Ref.6
Sequence conflict   3500 – 3502    3       IME → FMD in BAA90301. Ref.1
Sequence conflict   3526 – 3528    3       STE → GTG in BAA90301. Ref.1
Sequence conflict   3592           1       D → E in CAB76253. Ref.6
Sequence conflict   3598 – 3599    2       LE → PQ in CAB76253. Ref.6
Sequence conflict   3701           1       E → D in BAA90301. Ref.1
Sequence conflict   3857           1       H → N in BAA90301. Ref.1
Sequence conflict   3891           1       Y → H in BAA90301. Ref.1
Sequence conflict   3997           1       L → F in BAA90301. Ref.1
Sequence conflict   4130           1       H → Q in BAA90301. Ref.1
Sequence conflict   4229           1       Q → H in BAA90301. Ref.1
Sequence conflict   4330           1       I → T in BAA90301. Ref.1
Sequence conflict   4512 – 4529    18      QLARF…PLDVY → SWLASMPRLQALAHWMCT in BAA90301. Ref.1
Sequence conflict   6650           1       E → K in AAF62351. Ref.9
Sequence conflict   6650           1       E → K in AAF44704. Ref.9
Sequence conflict   7412           1       T → S in AAC23965. Ref.5
Sequence conflict   7450           1       A → T in AAC23965. Ref.5
Sequence conflict   7454           1       G → A in AAC23965. Ref.5
Sequence conflict   7525           1       D → G in AAC23965. Ref.5
Sequence conflict   7532           1       T → P in AAC23965. Ref.5
Sequence conflict   7537           1       A → P in CAB93524. Ref.3
Sequence conflict   7552           1       V → A in AAC23965. Ref.5
Sequence conflict   10456 – 10457  2       KI → EF in AAC23963. Ref.5
Sequence conflict   11204 – 11207  4       VPYE → LPFQ in CAB93524. Ref.3
Sequence conflict   11217          1       L → I in CAB93524. Ref.3
Sequence conflict   11444          1       V → T in AAC23964. Ref.5
Sequence conflict   11460 – 11461  2       DG → KK in AAC23964. Ref.5
Sequence conflict   11863          1       Q → H in CAB93524. Ref.3
Sequence conflict   13709          1       K → T in CAB96531. Ref.2
Sequence conflict   13713 – 13715  3       KSP → ESR in CAB96531. Ref.2
Sequence conflict   13723          1       Q → S in CAB96531. Ref.2
Sequence conflict   14433          1       D → K in AAM11102. Ref.10
Sequence conflict   14436          1       S → K in AAM11102. Ref.10
Sequence conflict   15987          1       V → A in CAB43739. Ref.2
Sequence conflict   17206          1       F → C in CAB93524. Ref.3
Sequence conflict   18065          1       E → R in CAB96427. Ref.2
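
Positions in the variation and conflict tables are 1-based and inclusive on the canonical sequence, so the residues named on the left of each arrow can be checked against it. The sketch below does this for the first 60 residues only, copied from the sequence listing in this entry. The function names `residues` and `conflict_matches` are hypothetical.

```python
# First 60 residues of the canonical sequence (isoform C), copied from the
# sequence listing in this entry.
CANONICAL_PREFIX = (
    "MQRQNPNPYQ" "QQNQQHQQVQ" "QFSSQEYSHS"
    "SQEQHQEQRI" "SRTEQHVQRS" "QVTTQRQVQQ"
)


def residues(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}-{end} is outside the sequence")
    return seq[start - 1:end]


def conflict_matches(seq: str, start: int, end: int, original: str) -> bool:
    """True if the canonical residues at start..end equal the reported original."""
    return residues(seq, start, end) == original


if __name__ == "__main__":
    # Sequence conflict at position 7: N -> S in CAB76253.
    print(conflict_matches(CANONICAL_PREFIX, 7, 7, "N"))
    # Alternative sequence 2-57 is abbreviated QRQNP...TTQRQ in the table.
    print(residues(CANONICAL_PREFIX, 2, 6), residues(CANONICAL_PREFIX, 53, 57))
```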

Sequences

Isoform C

Length: 18,141 AA. Mass: 2,065,766 Da.
Last modified March 18, 2008. Version 3.
Checksum: 41CA5977EB611933

        10         20         30         40         50         60 
MQRQNPNPYQ QQNQQHQQVQ QFSSQEYSHS SQEQHQEQRI SRTEQHVQRS QVTTQRQVQQ 

        70         80         90        100        110        120 
HHGGSIGGAY VPPSLTHVYA QGDISPPVFE QIFKNARFAQ GGNALFEGRL RGNPKPFVTW 

       130        140        150        160        170        180 
TRKGAPLLES QKFRMSYNEA TGDVSLLINQ IGPGDEGEYT CTARNQYGEA ICSVYIQPEG 

       190        200        210        220        230        240 
APMPALQPIQ NLEKNIYSNG YSYTSIEEEF RVDTFEYRLL REVSFREAIT RRSGYEQDSQ 

       250        260        270        280        290        300 
LSQELDRNQG PAQAPQISQK PRSSKLIEGS DAVFTARVGS NPKPRLTWFH NGQRLVASQK 

       310        320        330        340        350        360 
YEISYSSGVA TLRVKNATAR DGGHYTLLAE NLQGCVVSSA VLAVEPAAET AYEPKPVDVM 

       370        380        390        400        410        420 
AEQLEAGKAL PPAFVKAFGD REITEGRMTR FDCRVTGNPY PEVFWLINGR QVRDDASHKI 

       430        440        450        460        470        480 
LVNESGSHSL MITNVTRLDA GAVQCLARNK AGEVAIEAQL NVLEKEQVVA PQFVQRFSTM 

       490        500        510        520        530        540 
TVREGEPITM SANAIGTPQP RITWQKDGVQ ISSTAERFVG IDGGATCLEI PRVTANDAGW 

       550        560        570        580        590        600 
YQCTAQNIAG STANRARLYV EVPREQPNYE QRRLNLPRPT KVIEPEPIPG PEIIYLRHVE 

       610        620        630        640        650        660 
RAKPHLRPGE EDRVYPPPQF IIPLQNVQQT EGGRVHMEAR IEPVGDPTMV VEWYLNGRPL 

       670        680        690        700        710        720 
AASARATSVF KFGFIALDLL SIMGHDSGEY MCRVTNASGV AESRAILSVV QRPSIEQSSQ 

       730        740        750        760        770        780 
NPNSLQYINQ LEDYSRYQRT ESIDEQLNQA PQFIRPLRDL GEFEEGKNVH FEAQVTPVND 

       790        800        810        820        830        840 
PSMRVEWYKD GLPITASSRI TAIFNFGYVS LNILHLRAED AGTYTVRAVN RIGEAISQSS 

       850        860        870        880        890        900 
IRVHSRSQVT ADLGIPEQQR YIEKVEELED YRKSQQRRHV QEAAEAIAPP QFKTPIQNQL 

       910        920        930        940        950        960 
DLREHAHAHF EARLEPVGDS TMRVEWLKDG QPLEASSRIT TYHNFGYVAL TIKQLTIYDA 

       970        980        990       1000       1010       1020 
GTYTCRAYNA MGQDTTVAQL TVISKNEIVS ESQHPGGLQK IQHLEDSSRY GRREEEETYI 

      1030       1040       1050       1060       1070       1080 
TQAPRFLGPL KGTTKILEGQ RAHFEARVEP QSDLGLVIEW YHNGRSITAA NRIQTYYDFG 

      1090       1100       1110       1120       1130       1140 
YVALDISQVR AEDAGVYLVV ARNKLGEAQQ QATMIVETRS SIDTSSMHRG LYEKTQNLEN 

      1150       1160       1170       1180       1190       1200 
KPFVEPQYDI EEISKSKPVF VTPLSDPKPI HDGKNIHLEC RLEPMGDPTM RVEWFHNGRP 

      1210       1220       1230       1240       1250       1260 
VTVGSRFRTY YDFGFVALDI IKATAADSGE YTVRATNHLG TAHTSACVRV IDHTDVVTET 

      1270       1280       1290       1300       1310       1320 
QNEQSLEQIQ LLEDSRRRHH QEEDITIMQA PQFTRGLHNI ETIEGTNVHL ECRLQPVGDP 

      1330       1340       1350       1360       1370       1380 
SMRIEWFVNG KPVKTGHRFR PAYEFDYVAL DLLGCYAIDS GVYTCQARNQ LGEAVTSCSV 

      1390       1400       1410       1420       1430       1440 
RIIAKNDLIL ETQNESGLQK IQYLEDSTRH RRSEFVDEVV NIRPRFLTHP KSLTNTREGG 

      1450       1460       1470       1480       1490       1500 
HAHFECKIEP VTDPNLKVEW FKNGRPITVG HRFRPIHDFG YVALDIVHLI AEDSGVYTCR 

      1510       1520       1530       1540       1550       1560 
AVNLIGSDET QVELQCRSGE QIVTVTQNEA GLEQIHYLED RSRYTRREEI DESTKQAPVF 

      1570       1580       1590       1600       1610       1620 
TTSLKNVEIK ENQRAHFECR LIPVSDPSMR VEWYHNNLPL KSGSRFTETN NFGFVALDIM 

      1630       1640       1650       1660       1670       1680 
STLPEDAGTY TCRAYNAVGE AITSAVAVVH TKKSIYLESQ HETALPRLQH LEDGSKRQRI 

      1690       1700       1710       1720       1730       1740 
SVQDEFVSQA PVFTMPVRDV RVAENQAVHF EARLIPVGDP KLTVEWLRNG QPIEASNRTT 

      1750       1760       1770       1780       1790       1800 
TMHDFGYVAL NMKYVNPEDS GTYTCRAVNE LGQAVTSASL IVQSKTSIQL ETQHEAAMHK 

      1810       1820       1830       1840       1850       1860 
IHQLEDHSRY QRREEEEYTV TTAPVFVTKL IGPSNLVEGQ SAHYECRIEP YPDPNLKVEW 

      1870       1880       1890       1900       1910       1920 
FHNGKPLSTG HRFRTTYDFG FAALDILTVY AEDSGEYTCR VTNNLGEAIN SIVLNVTSRS 

      1930       1940       1950       1960       1970       1980 
SIIHETQHEE ALTKIQHLED TSRFQRKTDE EQFHAERPQF GRPLRNAKVN EGAPVHLEAT 

      1990       2000       2010       2020       2030       2040 
LIPVNDPTMK VEWYCNGRPI QTGHRFKTTY DFGFVALDIL YAHAEDTGTY MCKAKNAIGE 

      2050       2060       2070       2080       2090       2100 
AVTTCAVNVT ANKTLDLDTL DAQRLEKIRQ LETYAPPPKP VVEEKGQKPI FLTPLSNLEH 

      2110       2120       2130       2140       2150       2160 
LKEGEHAHLE CRVEPINDPN LKIEWFCNGK QLPTGHRYRT THDFGYVALD ILYVYGEDTG 

      2170       2180       2190       2200       2210       2220 
TYICKATNQL GEAVNTCNVR VLNRRSMILD TQHPDALEKI QKLESKVPNA RTEVGDAPIS 

      2230       2240       2250       2260       2270       2280 
PPHFTAELRG STEIYEGQTA HFEAQVAPVH DPNLRIEFYH NGKPLPSASR FHITFDFGYV 

      2290       2300       2310       2320       2330       2340 
SLDITHAVAE DAGEYSVRAV NALGQAVSST NLRVIPRGTI ISDTQHPEGL EKIRKLESTA 

      2350       2360       2370       2380       2390       2400 
PHQRQEPETP GTRQRPVFTQ PLQNIDRINE HQTAHFEARL IPVGDPNLKV EWYRNEKIIE 

      2410       2420       2430       2440       2450       2460 
DSSRITKQHD FGFVSLDISH IRKEDEGVYM CRAVNPLGEA VTTASMRVVS EASIQMDTQH 

      2470       2480       2490       2500       2510       2520 
PDSISRIHQL EKPLAPRPTE PERLFEKPIF TQLLTGPSEL WEGTHAHFEA RVVPVGDPSL 

      2530       2540       2550       2560       2570       2580 
KFEWFINGVE LQMGSRLRTT HDFGFVTLDI TAVVPEDAGV YMCRAYNAAG EAVSSTAMKV 

      2590       2600       2610       2620       2630       2640 
KTKSNIDGQP LIPESWEAIR LKEAAMNRVP EMFVDSTPQQ APVFTTHLQS YDKLHEGQHV 

      2650       2660       2670       2680       2690       2700 
LLEAQVEPRA DPNLRIEWFK NGISLTTGSR IRSTFDFGLV TLSINGLRAD DSAIYTCKAT 

      2710       2720       2730       2740       2750       2760 
NQVGEAVSTS SLKIEDRHWL QAESLHPDSL PRIGELEAPK EGRPEAPEPT YETPVFITHL 

      2770       2780       2790       2800       2810       2820 
NNIECKESDN VRFECNVEPA RDPTMSIEWF YNGQPLQAAA KFKSIYDFGY CALDLTNSYA 

      2830       2840       2850       2860       2870       2880 
ENSGVYTCKA TNSKGSATTS GTLKCTGGKT MFLDTQHPQG EAGLEAVQET EEELANRYTS 

      2890       2900       2910       2920       2930       2940 
KTTKPETQYP PPVWTKPLQA EFHLSEAQPI HLEANVEPKE DPNLFIEWYF NGKMLNHGSR 

      2950       2960       2970       2980       2990       3000 
FKMTSEFGFV TMDMIEVYAR DQGIYTCKAY NKAGEAFTST TIFCSSKENI IESTQHPKGA 

      3010       3020       3030       3040       3050       3060 
EGLEQIQDLE DSLRKDGSKP EQPDLGIPPR FTTEFVNIAD IGEGELAHFE ANLIPVGDQS 

      3070       3080       3090       3100       3110       3120 
MVIEWFYNGK VLEASHRVRT IYAFGTVALE VLGTKIEDTG TYTCRATNKH GTAEISCNLE 

      3130       3140       3150       3160       3170       3180 
CVDKPRGQKP RFTSHIQPLE GLKDGQSAHF ECTLIPVNDP DLKVEWYHNG KLMRHSNRIK 

      3190       3200       3210       3220       3230       3240 
TVSDFGYVVL DISYLQDHDS GEYVCRAWNK YGEDFTRTTL NCGGRGGVFY DSLQPDSLQR 

      3250       3260       3270       3280       3290       3300 
IRELECPQGQ QADTSAPLVA EPPKFITQIV DVTKLVEGQS AHFEARLTPI TDPDLVVEWY 

      3310       3320       3330       3340       3350       3360 
FNGKKLPHGH RFRTFHDFGI VILDILYCYE ENSGVYEARA RNKYGEDVTR ASLKCASKSS 

      3370       3380       3390       3400       3410       3420 
LILDSQLPRG MEGGLEKIAN LEYSMVRTRE ETTEETKGKA PVFTVPLENI ENLREGENAH 

      3430       3440       3450       3460       3470       3480 
FEARITPADD PKLKVEWYWN GRPLKAGSRF RTFCDFGFVI LEISPVYPED SGEYSCRAIN 

      3490       3500       3510       3520       3530       3540 
EYGEAVTTAT MKIQGKRSII MESQLPKGME GTIDRIAELE GLGSRSTEFV PDDDTGKPPE 

      3550       3560       3570       3580       3590       3600 
FITSPFDMVI GENALAHFEC RLQPINDPSM RVDWFHNGKA LWAGSRIKTI NDFGFVILEI 

      3610       3620       3630       3640       3650       3660 
AGCYQRDSGL YTCKATNKHG EATVSCKLQV KGRQGIVMEP QLPSNFRTGT ESLQKLEETM 

      3670       3680       3690       3700       3710       3720 
HKREELVTED EQPNPPKFTE EIKDNLDVPE GGPIHFDCRV EPVGDPTMRI EWFYNGHVMA 

      3730       3740       3750       3760       3770       3780 
TGSRVHQLND FGFIALDVDY IYARDSGEYT CRATNKWGTA TTSAKVTCKG KHNIVYESQL 

      3790       3800       3810       3820       3830       3840 
PEGMTSEKLK ELERGRIPEA PKVVEEVFGP PKFTTQITSV TVDEAEAVRF ECQVEPKTDP 

      3850       3860       3870       3880       3890       3900 
SLRVEWYRNG KPLPSGHRYR NIFDMGFVSL DILYVYGEDS GEYVCRAINN YGEDRTRATV 

      3910       3920       3930       3940       3950       3960 
SCKKLPTILL QNQVPRGMKR SDALTQMEAT IKKYTSEVHL TEDDLFDPDR KQPPRFVTQI 

      3970       3980       3990       4000       4010       4020 
KEQLTLTEMA VTKFECQLAP VGDPNMKVEW FFNGKPLLHK NRFQPIYDFG YVAMNFGWVY 

      4030       4040       4050       4060       4070       4080 
PEDSGEYVCR ATNLYGKDET RAIIKVSGKP GIVYDSQLPA HMQSIDRIRE MEASWQVVPD 

      4090       4100       4110       4120       4130       4140 
EVDPDAKPRT KPVFVSKLEP QTVEEGDPAR FCVRVTGHPR PRVMWLINGH TVVHGSRYKL 

      4150       4160       4170       4180       4190       4200 
TNDGMFHLDV PKTRQYDTGK VEVIARNSVG ESIATTELKV VARSDDYRNV LKNSPRPWYD 

      4210       4220       4230       4240       4250       4260 
YELAAYQKER QENELEKVFD ERKQVLSEQS SHTLKGVEHL KPKQYKPPTP DWQQNVKAKK 

      4270       4280       4290       4300       4310       4320 
SEDYYNKLQT LETEQLLKET NLRRDTHQYA IPGEKVVSSS QAKGMAQSYE ENLQEKTSTT 

      4330       4340       4350       4360       4370       4380 
EVQAAPPKGI AQPSESSVHG REVHMNKQQQ VQKEIQGDLE ITRKITATET TEVEHKGTIQ 

      4390       4400       4410       4420       4430       4440 
ERVVQGPVKP AKAPVFTKKI QPCRVFENEQ AKFEVEFEGE PNPTVKWYRE SFPIQNSPDL 

      4450       4460       4470       4480       4490       4500 
QIHTFSGKSI LIIRQVFVED SAVFSCVAEN RGGTAKCSAN LVVEERRRAG KGGIQPPSFV 

      4510       4520       4530       4540       4550       4560 
TTIQSTTVAT GQLARFDAKV TGTRPLDVYW LKNGMKIQPS IKFKMLEEDS VHTLLIIEPF 

      4570       4580       4590       4600       4610       4620 
AEDSGRYECV AVNAAGEARC DGDCIVQSPS KPEKPTTPGS EKAPHIVEQL KSQTVEEGSK 

      4630       4640       4650       4660       4670       4680 
VIFRCRVDGK PTPTARWMRG ENFVKPSRYF QMSRQGEYYQ LVISEAFPED EGTYKCVAEN 

      4690       4700       4710       4720       4730       4740 
KLGSIQTSAQ LKVRPIENLD APPTITALKD VSVTEGMPAQ FKTTVTGKVK ATSVQWFREG 

      4750       4760       4770       4780       4790       4800 
QLIPETPDFQ MIFDGNSAVL LIGTTYEEDS GIFTVRVTSS TGQVESSAKL TVKKRRISAF 

      4810       4820       4830       4840       4850       4860 
QLRTIDSAED ESSSSGREDS APESPHAFQP GQQPGQQFGQ FLGVNGQGQH QGRSRQKKPK 

      4870       4880       4890       4900       4910       4920 
VRSKSLQPAT KVIPWRKSSR PTRGRSLDKG VFLPGFKPEP VKSWTEETIN LKATPIEKKK 

      4930       4940       4950       4960       4970       4980 
PAPKLEAAKV VLKSIKTERD QGIMSLGATL EQIIAGKTEK EAIPWITMRE KLKAVESVQQ 

      4990       5000       5010       5020       5030       5040 
QLNKFDLDEV YLQPLEGQIE TEGQLPQQAQ VEQVQRTKEI QRLKSMESVE IMEMTDQIDK 

      5050       5060       5070       5080       5090       5100 
LITQQQNAKD LIPWKEMRQQ LKSVQRVTKQ IDKFKIEEVE LRHLQAQQAI TEEYQTGTAE 

      5110       5120       5130       5140       5150       5160 
ETVVMIDESS KGSISKVLRR DEQLQYEDQS NIYKQKFITT EDVNIMHVSE REKLEAQRLI 

      5170       5180       5190       5200       5210       5220 
REQQAVNWRQ QQQRPQLQPL TSVEDTVISQ TSERQKLVQQ QSFIEEAQRQ QFVQVEDSQM 

      5230       5240       5250       5260       5270       5280 
MSLEEYEHQK IINQRTQQEA FSWRQPREPQ KFIQVEDSTL LHLQERHDTQ EQQLLQQQPV 

      5290       5300       5310       5320       5330       5340 
MWDRGRKKPD QPQYVQPQEQ RVKEEFVEKP KTYEEMHDEL VEPTPIEQPQ PVPVMWERGK 

      5350       5360       5370       5380       5390       5400 
KKPQPQEKTF EEAHDELVEP TPVQQPEPVP VMWERGKKKV AQQETVLSQE VVQTSQVVEQ 

      5410       5420       5430       5440       5450       5460 
QIVEETKKTA VRRVIPPREP EQKVEQVTLK PTPRPRPKEA VKAEEIQLKP LRSTRPVPQP 

      5470       5480       5490       5500       5510       5520 
VEAEQKAYEE ATDELTEEPI PQPQPVMWER GKKKPQKPQE EVTEIPKTLE IAVDTLEEEV 

      5530       5540       5550       5560       5570       5580 
PKPTEPQPQP VLWARGQKKP QKPDEQKQEL PKSLEIAVDT IEEDLIKPVQ PEPQPVLWER 

      5590       5600       5610       5620       5630       5640 
KKKKPQPQDV IEEKLDVAPT KTYEKAVDVL PDEPKVEEKP EPVLWQRGKK KIPKSEPTEE 

      5650       5660       5670       5680       5690       5700 
VHPDEVDAQI ETVVKEDEMI VEEKRRIKKT KRPKSTKEVT EELFEEQPEE EISPEEEVPQ 

      5710       5720       5730       5740       5750       5760 
KEVIEEIEEI VEEKRRLKKT KKPKLTQQVT EEETPHEEII KESEEVVQEQ EEIVEEKKKV 

      5770       5780       5790       5800       5810       5820 
KKVKKPKTVA EKQLKEEEIP TEETVEEEET AEDQQLVVEE SKKVKKVKKP TGTVEKTDVE 

      5830       5840       5850       5860       5870       5880 
ELPGEEVPVE EVPVEEVPED VAPEEELIEE QEEIVDQDEI QEQKRKVKKA KKPKKTIEKT 

      5890       5900       5910       5920       5930       5940 
EIEIEEDQPE EEVLQEEIIG EQEEITERQR KVKSIKKPKK VVTEKTVDQT EQPEKPEESQ 

      5950       5960       5970       5980       5990       6000 
AEEVKETVTE EPKKPKPAPE EAKVEQVEKI SLKPAPRKQR LLPEKEQVEE VLLKPVKKIV 

      6010       6020       6030       6040       6050       6060 
AVSEAEQPET PETEFEVKEF AITTTEDILD VTKKRVKKKK PKTKVAAEES TEEPAEETEE 

      6070       6080       6090       6100       6110       6120 
FEEEATQPEE VQPVEEIPEE PQVKEVADER KTAPKPKPRK EEIIEKVEEV ALKRVTRPKK 

      6130       6140       6150       6160       6170       6180 
ELPQEATIEE VRLKPTQRTS IKPEEVKLEE VDLQHVEKKE DEIVQEEKRK TRKVKKPKHE 

      6190       6200       6210       6220       6230       6240 
DLPEIPDAEP TQLEEAEHIE LEKQPKPEED QPQVPWKRGE KKQPVEEVLE EKKWPSGKRR 

      6250       6260       6270       6280       6290       6300 
PLPEQQPEEV QLKPIPSKPI EEQQKPEKAI PGPQLVPEEK PESEEEELEL EPLKLPEDKK 

      6310       6320       6330       6340       6350       6360 
PKEPKAKKEK KKKPKLKKAT PSVDEVSEEV AEPFDEPIAE EDEVEEMPVD DVKVVAVSED 

      6370       6380       6390       6400       6410       6420 
VLPEEEVVPT EETPEAKQKA HKKRTKRLKE ASVEGQPQLL EAAIAEIEKV DEISQEISQK 

      6430       6440       6450       6460       6470       6480 
TITLLKKTED TRPQFITTEQ LIELDVEDVR RDLEMKVTSN IIKKEKRRVV LDDSQPLPEL 

      6490       6500       6510       6520       6530       6540 
ELITQKRIQE GIDKVADEEL IEDQQLIQNQ QETTTSEVIG QERKLVKKKK KEIKPPRITE 

      6550       6560       6570       6580       6590       6600 
KLRPRQCVPE EPTVLECKVE GVPFPEIKWY FNDILLFASE KYEITVMEQV AKLKIAKVTP 

      6610       6620       6630       6640       6650       6660 
SDVGVYTCEA KNEAGVATSR TNIILEKEQG VPPQFTKPLK IEFIEEKQPE RLKVTVTCQV 

      6670       6680       6690       6700       6710       6720 
TGKPNPEVKW YRGIEEVIPS ETVQMFYDEK TGDVALEVIN PTPNEAVVYS VQAQNQFGRA 

      6730       6740       6750       6760       6770       6780 
IGNANILSRV DEVPREILKA PTVTPLSAVV VPTGGTLFFE AKYDGLPRPE VKWMRNGREI 

      6790       6800       6810       6820       6830       6840 
IENEETIIET TETTTTIKVV NMTRKRTGKY EVWAKNKVGE AKSSGSVVVS DQKPDEQIKP 

      6850       6860       6870       6880       6890       6900 
PRFIQPLEPK YFGEHEVAII EAIVESEPLS SFQWFVHNEP IKSSNEVRIV SQANKSTLLI 

      6910       6920       6930       6940       6950       6960 
ENFQSKFVGP FTCRAENVGG SVTSTATVNL IPQEEAEEFE SPRFVEELVQ PVEVMDGEAL 

      6970       6980       6990       7000       7010       7020 
LLTCQVTGKP TPKVEWYHNA EKITENKETT ISQDLQGVCQ LQITEVFPEN EGQYECVATN 

      7030       7040       7050       7060       7070       7080 
KIGKSVSKTN VKIQAFEYIP DSEITGLTGS EEDLLDRTLS IDEQAPKIIK KLPEKIEPKE 

      7090       7100       7110       7120       7130       7140 
GEQAKLEVKV VGKPKPKVKW LRDDEQIFAS EEYQIENFED GTSVLVINHV YPDDLGTISF 

      7150       7160       7170       7180       7190       7200 
EAYNPLGVAV TTALFAVEGI VGSKDYRKPE WVSQMEEMQV ALKAAKCSPS LLNEMRDCRA 

      7210       7220       7230       7240       7250       7260 
ALGETAKFSI QFAGNPIPDI QWYFNNVQLR ASEKYRMVVQ EQEATLEIMK ITSEDCGYYN 

      7270       7280       7290       7300       7310       7320 
CKLINEIGMT MTRAKFDISS TSTIVEETKA KTTVKKKSGK KTMVKRSGAS ESQNVQKTEI 

      7330       7340       7350       7360       7370       7380 
RIIPTSAVET SMNVIKVKQP VSVLVEKSEI SEVLVVKDRE VADAEERSSQ LIEEIEEEEE 

      7390       7400       7410       7420       7430       7440 
IEEKVQHDEE DEVEVQVEQK ETYTSSKKIE ITKTVELIRT KISEKIITIE DVQVLSHHEE 

      7450       7460       7470       7480       7490       7500 
VQWLLESIEA ESFGQIGESA LRDLATIGLL LRYGCEHYEI TYMYEQNIFI SLKKPESQSA 

      7510       7520       7530       7540       7550       7560 
LVQLVEREGH EELISQILSE SSNEDETILA ATVGFKAFIR MIQTYEITIE IVIRKFVRED 

      7570       7580       7590       7600       7610       7620 
FISQDWKICG KERIVETSQI IESHEAITHV KIETATTKVE KLFKKQEQEH VQNLEQQEQV 

      7630       7640       7650       7660       7670       7680 
KIQVQTKQIA QMNTKIKKHK KHKQQEQEVS ETTIQCEQKE TLAHETSAEL PQSETLEQIE 

      7690       7700       7710       7720       7730       7740 
ESLSTYETLP IQNLSKDTLQ TVAVSVTTEL STPSPTASRV QEEILPQKVL AINEEVLPLD 

      7750       7760       7770       7780       7790       7800 
EFGLRKESPR PKENKLTENI EVRLKHALNV SHAKTAESSK ELPSKIPKSV KAQRKMKESR 

      7810       7820       7830       7840       7850       7860 
SLVVEAPNAE EAIEDLKPLK AVSQEVQSDI LFSHEITEEQ HQALETIEKL KPTSAIEDTV 

      7870       7880       7890       7900       7910       7920 
QQKLLSQEEL IIAEVLPSET VGRDVTDVRP PGETISPRLT PNMSLCITEC QPEDSIGEMQ 

      7930       7940       7950       7960       7970       7980 
QAAKERMETP SMSVTESKAV GGQELEVLEN VDHMPLITQP TKGLADYTIK AEEVPVQVQE 

      7990       8000       8010       8020       8030       8040 
IITFDSLERE TVAKTQTAKS NALELFELSE GLVSSTADSH SPIAEDLPIF EKDVKEATID 

      8050       8060       8070       8080       8090       8100 
MQMQHHVTTS ETVSNENAVK DLKAVDTPKM AEGTLGQSSA LTIGETQQMN LVETTVELIE 

      8110       8120       8130       8140       8150       8160 
PNVESTKPAK GALTEAYGTA ESNEETLLES LGLVPDDNRK IEQGKVNISE GEYVAKVQTT 

      8170       8180       8190       8200       8210       8220 
TVTDTEGEFV SVAPKLVNPK FDFVEQSALQ IKQDTTVEKE EILSSNIELA PQLATSNMFP 

      8230       8240       8250       8260       8270       8280 
AELKVTSIYE VQPGLTSSDI ITEQTKSVSA NQVFETMSIG VTSKPDMLES TSHIDAFQHP 

      8290       8300       8310       8320       8330       8340 
EFKTGDTILD ENQQPLEVTN VQITESSTDI IDVLPNQKLT KAETVTDGFK YAEGLVVLPM 

      8350       8360       8370       8380       8390       8400 
ESTIDKTEDT KPTAVNADIS MHQQFGTDVR EQEPLESTLT RTEDLKPQTQ TTESQFGLLQ 

      8410       8420       8430       8440       8450       8460 
SLETSSCVTL EGESVLSVKE RHPEQSAAIG TSSALQVANI TRPQHMESLD RLDEQKVPYY 

      8470       8480       8490       8500       8510       8520 
QANVNIGEIT LPNVEKIDSF DVLSDLNTPD YNKSSKGRVQ LIESTTSLKT TTAVVSESTE 

      8530       8540       8550       8560       8570       8580 
ELKDLNITQP VHIKPKPYES DQKISISEQT NVLEHVSSLN PVLPALETIQ SSIKSLHEIN 

      8590       8600       8610       8620       8630       8640 
VRETDILEKE ESLKDVDHIS GRLAKIILDC TTGIAQVRQE ETLEHEEDLK APLIPLEKAI 

      8650       8660       8670       8680       8690       8700 
PASSELHRLP LTEYVQEQQG TSDMTDFKVS NKCASPNIDH LYETKSSEMI VYDSSINSVD 

      8710       8720       8730       8740       8750       8760 
SEFPAGIVPQ KSLVPFRHTM VTENVAFNAS ENFEILSADQ QIATNVQDSL SQSIIAEDQI 

      8770       8780       8790       8800       8810       8820 
AFETEQNLGL ETTPTHKPKL LKDDQNLHAK LVDEATVYEA MGQEQKVDKY NIQQAEITHD 

      8830       8840       8850       8860       8870       8880 
LPQVYATDLQ QTFEAEKEIT TREQSYVAAT TDIIPSRLGL AMTTKTHPVE GIDVLLSSPP 

      8890       8900       8910       8920       8930       8940 
KPSLAQTNYE ETQHEVRVRE TQAIEESEEL TDGRLLPVSA VESIDSTFKV TSDSQQPPVF 

      8950       8960       8970       8980       8990       9000 
DKELSIPTVS PLEARAKPSL NLLQGTTTFD VIPLESSVLL KDTHVAVQKA QQEYVAQVES 

      9010       9020       9030       9040       9050       9060 
NKVHVQMDNL VMHKEDIFEN AEIENFCKPI TEGTQLETVV IEVVPIDNVG GIHLAPQPST 

      9070       9080       9090       9100       9110       9120 
LLATLTSTDI VNQSHVIDTQ VPLEMESEAQ APLDNIAQAR IKSAEDHVHT NVSEVNVYEQ 

      9130       9140       9150       9160       9170       9180 
TKAIQDQNKH GLFVKVSKNS DTSKAYLTTI QSTFLKEDIL PKPNILQDTA QAAADELQSL 

      9190       9200       9210       9220       9230       9240 
VTEEVVSVSS IQETYELKIP LQKTANLTQQ TPQNSVNVCQ QLAYEETPDI AFEPHALTRA 

      9250       9260       9270       9280       9290       9300 
TTSSVPTFLK PAENATVNIY ENIEGHGDFK PGTVNLTSNS NLNSELVVSV VQEVTSVPSL 

      9310       9320       9330       9340       9350       9360 
GSLATVEPQE LKAMPVTKSS TNLAYSEEVK GNKQEFTKIE TVEEDDKQPE TTVTVEELPY 

      9370       9380       9390       9400       9410       9420 
EEEKPEEIQE LPEEVCVVET VTEDGKPKKK KIRTRVIKKV KGDKQEVTKI ETVEEDDKQP 

      9430       9440       9450       9460       9470       9480 
ETTVTVEEVP YEEEKPEEIQ ELPEEVRVVE TVTEDGKPKK KKIRTRVIKK VKGDKQEVTK 

      9490       9500       9510       9520       9530       9540 
IETVEEDDKQ PETTVTVEEV PYEEEKPEEI QELPEEVRVV ETVTEDGKPK KKKIRTRFIK 

      9550       9560       9570       9580       9590       9600 
KVKGDKQEVT KIETVEEDDK QPETTVTVEE VPYEEEKPEE IQELPEEVRV VETVTEDGKP 

      9610       9620       9630       9640       9650       9660 
KKKKIRTRVI KKVKGDKQEV TKIETVEEDD KQPETTVTVE EVPYEEEKPE EIQELPEEVR 

      9670       9680       9690       9700       9710       9720 
VVETVTEDGK PKKKKIRTRV IKKVKGDKQE VTKIETVEED DKQPETTVTV EEVPYEEEKP 

      9730       9740       9750       9760       9770       9780 
EEIQELPEEV RVVETVTEDG KPKKKKIRTR VIKKVKGDKQ EVTKIETVEE DDKQPETTVT 

      9790       9800       9810       9820       9830       9840 
VEEVPYEEEK PEEIQELPEE VRVVETVTED GKPKKKKIRT RVIKKVKGDK QEVTKIETAE 

      9850       9860       9870       9880       9890       9900 
EDDKQPETTV TVEEVPYEEE KPEEIQELPE EVRVVETVTE DGKPKKKKIR TRVIKKVKGD 

      9910       9920       9930       9940       9950       9960 
KQEVTKIETV EEDDKQPETT VTVEEVPYEE EKPEEIQELP EEVRVVETVT EDGKPKKKKI 

      9970       9980       9990      10000      10010      10020 
RTRVIKKVKG DKQEVTKIET VEEDDKQPET TVTVEEVPYE EEKPEEIQEL PEEVRVVETV 

     10030      10040      10050      10060      10070      10080 
TEDGKPKKKK IRTRVIKKVK GDKQEVTKIE TVEEDDKQPE TTVTVEEVPY EEEKPEEIQE 

     10090      10100      10110      10120      10130      10140 
LPEEVRVVET VTEDGKPKKK KIRTRVIKKV KGDKQEVTKI ETVEENDKQP ETTVTVEEVP 

     10150      10160      10170      10180      10190      10200 
YEEEKPEEIQ ELPEEVRVVE TVTEDGKPKK KKIRTRVIKK VKGDKQEVTK IETVEEDDKQ 

     10210      10220      10230      10240      10250      10260 
PKTTVTVEEV PYEEEKPEEI QELPEEVRVV ETVTEDGKPK KKKIRTRVIK KVKGDNQEVT 

     10270      10280      10290      10300      10310      10320 
KIETVEEDDK QPETTVTVEE VPYEEEKPEE IQELPEEVRV VETVTEDGKP KKKKIRTRVI 

     10330      10340      10350      10360      10370      10380 
KKVKGDKQEV TKIETVEEDD KQPETTVTVE EVPYEEEKPE EIQELPEEVR VVETVTEDGK 

     10390      10400      10410      10420      10430      10440 
PKKKKIRTRV IKKVKGDMQE VTKIETVEED DKQPETTVTV EEVPYEEEKP EEIQELPEEV 

     10450      10460      10470      10480      10490      10500 
RVVETVTEDG KPKKKKIRTR VIKKVKGDKQ EVTKIETVEE DDKQPETTVT VEEVPYEEEK 

     10510      10520      10530      10540      10550      10560 
PEEIQELPEE VRVVETVTED GKPKKKKIRT RVIKKVKGDK QEVTKIETVE EDDKQPETTV 

     10570      10580      10590      10600      10610      10620 
TVEEVPYEEE KPEEIQELPE EVRVVETVTE DGKPKKKKIR TRVIKKVKGD KQEVTKIETV 

     10630      10640      10650      10660      10670      10680 
EEDDKQPETT VTVEEVPYEE EKPEEIQELP EEVRVVETVT EDGKPKKKKI RTRVIKKVKG 

     10690      10700      10710      10720      10730      10740 
DKQEVTKIET VEEDDKQPET TVTVEEVPYE EEKPEEIQEL PEEVRVVETV TEDGKPKKKK 

     10750      10760      10770      10780      10790      10800 
IRTRVIKKVK GDKQEVTKIE TVEEDDKQPE TTVTVEEVPY EEEKPEEIQE LPEEVRVVET 

     10810      10820      10830      10840      10850      10860 
VTEDGKPKKK KIRTRVIKKV KGDKQEVTKI ETVEEDDKQP ETTVTVEEVP YEEEKPEEIQ 

     10870      10880      10890      10900      10910      10920 
ELPEEVRVVE TVTEDGKPKK KKIRTRVIKK VKGDKQEVTK IETVEEDDKQ PETTVTVEEV 

     10930      10940      10950      10960      10970      10980 
PYEEEKPEEI QELPEEVRVV ETVTEDGKPK KKKIRTRVIK KVKGDKQEVT KIETVEEDDK 

     10990      11000      11010      11020      11030      11040 
QPETTVTVEE VPYEVEKPDE IQELPEEVRV VETVTEDGKP KKKKIRTRVI KKVKGDKQEV 

     11050      11060      11070      11080      11090      11100 
TKIETVEEDD KQPETTVTVE EVPYEEEKPE EIQELPEEVR VVETVTEDGK PKKKKIRTRV 

     11110      11120      11130      11140      11150      11160 
IKKVKGDKQE VTKIETVEED DKQPETTVTV EEVPYEEEKP EEIQELPEEV RVVETVTEDG 

     11170      11180      11190      11200      11210      11220 
KPKKKKIRTR VIKKVKGDKQ EVTKIETVEE DDKQPETTVT VEEVPYEEEK PEEIQELPEE 

     11230      11240      11250      11260      11270      11280 
VRVVETVTED GKPKKKKIRT RVIKKVKGDK QEVTKIETVE EDDKQPETTV TVEEVPYEEE 

     11290      11300      11310      11320      11330      11340 
KPEEIQELPE EVRVVETVTE DGKPKKKKIR TRVIKKVKGD KQEVTKIETV EEDDKQPETT 

     11350      11360      11370      11380      11390      11400 
VTVEEVPYEE EKLEEIQELP EEVRVVETVT EDGKPKKKKI RTRVIKKVKG DKQEVTTIET 

     11410      11420      11430      11440      11450      11460 
VEEDDKKAET TVTVEETELS APSVGKVQLK KRVIVQKPED AVTVFELPER KSVILSEKED 

     11470      11480      11490      11500      11510      11520 
GTPTKTVIKT RIIKKIQGPN MEVTKVQTVE EYEKAPQTIV SVEKFNTPFP ELPEERLSEV 

     11530      11540      11550      11560      11570      11580 
VMLPDEVFES EAVDEEGRLK MIKTKKRIIR KPALDNTEEV TEIGIIEQDN VEPIYSVKIQ 

     11590      11600      11610      11620      11630      11640 
ERPLTESKPE DSKLIELPEH VTELNVILPD GKKKRRTVKS RAFKKSLDDD LDEVTTIHII 

     11650      11660      11670      11680      11690      11700 
EEEDKEPLTK VNIEVVPSDE ISITPIPIEE LPEETVFTEE LDENKKPKKK TTKTRTFKKR 

     11710      11720      11730      11740      11750      11760 
GPDDDEYFQI QTIDEEGKEP ISLIRVVSDE NIADIIDISK LDDEKVLKHK QKPHKHKDQY 

     11770      11780      11790      11800      11810      11820 
YKEYTITEPE EASADALQKP TKDKTPKQKK TLETPIEEVD ETVIIDEGTG EQTDQIAIKR 

     11830      11840      11850      11860      11870      11880 
KPRKVQGNVQ VEAVDEKPIE KKEKAKKKKV VKTKRDEMDD YIQFLIHQEI PKTVLQPYQR 

     11890      11900      11910      11920      11930      11940 
TEMELPQRAR RDSSFKQPVK LTPMKIEKVE FKKPKMVEIS SVVEFPQMLK LKAPKQRPQE 

     11950      11960      11970      11980      11990      12000 
EKKKKNEASF KNKKLKSWIR FVPYAPYCFP YVVTELETNR EVGELSRNVD EAEEVLKLRP 

     12010      12020      12030      12040      12050      12060 
KKFKHSKPEK AELEEADLGA YESDHSDKSN KELLHPKYKR GKKEKIETPD ESRKLKFGKG 

     12070      12080      12090      12100      12110      12120 
KVPQNEEASE EVNLKPVKLD IAEIEDAEMP VRTQEEEVVK KKKPKKSSKP EEGLQFEPIE 

     12130      12140      12150      12160      12170      12180 
FEEMERTSDI REESDTSVSV DTSTQEKPIY KKKKKVTPSP QKNQYKILPG QPREFEETPE 

     12190      12200      12210      12220      12230      12240 
DDLNLRKRQG ERPDDDKADT KLKPFYKFEV LDLEPEGVQA ETVPLSETIA KEPKKKRKIK 

     12250      12260      12270      12280      12290      12300 
VKTEQEDNTI EIVPLSPEDN DEQIFEITVT SSEIPQGDAK AKTIGKKKVK RMNKQELDDF 

     12310      12320      12330      12340      12350      12360 
VTELQEEPNQ EVYETRMSDF YEVKLTELPS EMDSDKPTKR ILRHEKGDEV QVLEIVESVV 

     12370      12380      12390      12400      12410      12420 
APGEEPFYEI NVISSANTEG DSEEITTDKI KKKSRKIKKD DLDAYIQQLI NAEIPVTELE 

     12430      12440      12450      12460      12470      12480 
KYEKIDVDGK AKKPKKLKAK TKKPIIDEGE TLQVGVTEHE PTKKLKTKKP EEKKNVIEKE 

     12490      12500      12510      12520      12530      12540 
LAEHEAVPEY DEFLINKTES ERPQEKRVEE TEKDIVPIVD KVLADLNDCL PFVVVEEDLK 

     12550      12560      12570      12580      12590      12600 
DMPLATDVIA LEDEKIIRKR KVRAKKDSKQ YEIEIIETEK PGDIPDEARV IVITTEVSGD 

     12610      12620      12630      12640      12650      12660 
TIDGPAPSTT EAPKKSVRKV KKEKLKEFIV NIVEEAPLDH VSEIYEDVLR TPFRESSEKE 

     12670      12680      12690      12700      12710      12720 
DIPSFTTTVV EDEIVNPVLP EKIKTVDDVR VPKDKKKKID NQKKIKISEF EPTPTSEDST 

     12730      12740      12750      12760      12770      12780 
IEEYTPKLSE HDEDLQTDEY SVDVKDSLPK SKKKSTKKQK KESLPGPISL YTIRIEETTP 

     12790      12800      12810      12820      12830      12840 
EPITEKIYED GKEVVRVINK RRIKKKAGPK EYLIEVIETY EDNNPEADVV IRTIETTPSI 

     12850      12860      12870      12880      12890      12900 
DSKPQEDHKI QVVQEKKPKT ESLDNYIQKL IDQEIPQVDH KEFKATVLET SPESKKAKKI 

     12910      12920      12930      12940      12950      12960 
KKHHKKTTEV IDGIPITVIE VTIQETETDD EDFKPDEVTL KEIDHENAEE APKVLKSKVS 

     12970      12980      12990      13000      13010      13020 
EEKPKSKKEK SLEFKIAEED KPKPVLEDIS EDVQVVQIIE EDGTPKQVEI KKKKVSRKHG 

     13030      13040      13050      13060      13070      13080 
PKEQVFEITE TKAIDEPLSE VTVVEITDEQ PQEEVLPAQE KKPIKKQKKL KPEDVNTYVV 

     13090      13100      13110      13120      13130      13140 
KVLEELTEPT QFETIPEDAD DKPQPVIEDI SENVQVVQII EEDGTPKQVE IKKKKVSPKH 

     13150      13160      13170      13180      13190      13200 
GPKEQVFEIT ETRPSDEPLA EVTIVELTEE GLNKDIVIPQ EKKTVKKPKK LKPEDIQSYV 

     13210      13220      13230      13240      13250      13260 
IRVLEEFNEP QWPASTEKPI IEDIAESIEI VPVTEEDGIT KEVEVKKKKV SRKQGTKNQV 

     13270      13280      13290      13300      13310      13320 
FEIIETKTSD EPLAEVTILE LSGDKSQEVT ILPKEKKPIK KTIKLKPEDV ESYVVNVLEE 

     13330      13340      13350      13360      13370      13380 
FCEPQSFESP EPTEGEAHET KTKTKKPKKP IVKAPENVIL IEEMAPETVI ENIVNEIGEE 

     13390      13400      13410      13420      13430      13440 
VKQVKTTKKL KKKEGPKEYL IEIKETYEEN KPEGDIEITT TELVPEGSPD ASDDQPVIVV 

     13450      13460      13470      13480      13490      13500 
QKIKKKKPVK DDLDKYIQQL IEQEITKTPL EEYEPTEMDS KKKPKKKVKS HNKKTIEVID 

     13510      13520      13530      13540      13550      13560 
GLPVTIHEFN VEDIVSEPED METPKTLLDE IKEIPQLPDD SSKYLVNISD EFGEADKPIK 

     13570      13580      13590      13600      13610      13620 
QPTQDQPIKK EKPLKKKKDV EYPVSLEAFD HTVKVVSEPT LEGTVKEVTV KKRKVSRRKG 

     13630      13640      13650      13660      13670      13680 
SKDHIFEITE TTSEDRPTAE VTVVELSSDE VLDSEEKPKH ERKIVKKPKQ LKKDDVEEYI 

     13690      13700      13710      13720      13730      13740 
INIIEEFIQP IPVGLVEDEV EKVQKEETKK PKKSPITYIA TEQEDNDNNY DALVKEDLDQ 

     13750      13760      13770      13780      13790      13800 
PIERALEKPS SPLEYTISVE EDSVGEEQKQ PKPKKISKPK SIKQPSVDKS PDYLVNVISE 

     13810      13820      13830      13840      13850      13860 
ESIIDEPIPE DYVVTEAAEE KPSEEPTFKV EELETEAVEK EVTDDDKGET TKQSVTKRKI 

     13870      13880      13890      13900      13910      13920 
KKLVGPKEEI IEIVETKTGD TPEYEVIVTT EEVQEKSKEA PEEKKAKTVR KAKKIPKDDL 

     13930      13940      13950      13960      13970      13980 
QDYIQKLIEQ DIPKTELEKY EKIDLDEPVK MKRKPIKKVK QSEEQPKEET EEPIEDKPVE 

     13990      14000      14010      14020      14030      14040 
KISEYSEVDS DEPKLTVAVK EFIPEKPEEK PFEIVVLEET VESKREPDEE GKVREKVVKT 

     14050      14060      14070      14080      14090      14100 
KKIKQNRGSV EVVHDIVEEI DTDTNESVIT VTTTVPTETP DQDQPSVKQK RTKKIKKDEV 

     14110      14120      14130      14140      14150      14160 
EDFVKRVIEE EAPQPEGSVD LVVIEDFVPK PSSEKRKKKP IKDKHTSVEE ETPHEDEVLL 

     14170      14180      14190      14200      14210      14220 
IESVPEDSPL SDDLITVVDS VPIEEEPENK VNQIEDTKKP EKKKKPKPSA KILEENVPED 

     14230      14240      14250      14260      14270      14280 
TVEKPLEALH TDSDLEKPDV QEFSISIKEE EQKHTHPEKK KSSKISSEQP KQPSTEQYEI 

     14290      14300      14310      14320      14330      14340 
SVTEHDLKPE EEKPFTVQVI QSETNVEETK DDTGKVHKQV TTKRMLRRPA GEGEIEIIEV 

     14350      14360      14370      14380      14390      14400 
VRDDQPEAEI TIVEYEPEPV NQDEKPKEPK KKTRKVKKDD IHDYIQKLIE LETPKTELEK 

     14410      14420      14430      14440      14450      14460 
YEKIEFEPIV KDKPLDSPID VLDESPKEVQ KKDKKSRSTK VPNEETPVQE QYAKVNVVEE 

     14470      14480      14490      14500      14510      14520 
EAPEQPEIPV QILEVKPVEV DVKEVITEDG KPVQEKTTKR VLKKIGPEEQ TTFKITMIES 

     14530      14540      14550      14560      14570      14580 
EDNDSVTVIV DEEPEIASPQ SIEEHPEQSK EKLAPKPKKT VRKVKKDDLS DYVKKLIEEE 

     14590      14600      14610      14620      14630      14640 
IPKVDLEKYE KVEMPEKPVK LTVSDSIPEE PKPDKSQPIS VLPDTTKPKK TKTPKTPKTE 

     14650      14660      14670      14680      14690      14700 
DTDQQVPDEP TETTVDTTDI PELTPTQTAQ PEDTATAQIT PSAQEEKSTQ DDTKDTIQKT 

     14710      14720      14730      14740      14750      14760 
VKHKKTKPDT QKSVETSELP EVHKDYQISI IHEELVEEEQ PEKILEVRVI DEVAEVEESQ 

     14770      14780      14790      14800      14810      14820 
PIVEEVEDEE PQPATEETVE DVTKPKSKKK KVVKKKTDDH DELIKKMLEQ EIEKTELEKY 

     14830      14840      14850      14860      14870      14880 
EKIEFDVPKK LKPEFAALEP IKIERKEQKP TKVTILDATD VPKTVKLKPS KRKEKPAEEL 

     14890      14900      14910      14920      14930      14940 
TVQLPKFRLK ARMVLVEYPP APLIPKTTDI GAIKDNGELS RNIEEAEEIL KFKPHKTKKI 

     14950      14960      14970      14980      14990      15000 
KKIKDDLEKV ELEKYEKYIS SEEEPEEKTP YKKPEKAPKP EEKQEDVKLK LGKGKKKPKE 

     15010      15020      15030      15040      15050      15060 
EEAPENVTLK NIPQKPQEVE EEVELKQKPK EVEIVEEQTK KPKDGEFVVE PFEPSEFDRP 

     15070      15080      15090      15100      15110      15120 
EYVPDELEQI EHPEIPEKVK KPSKTKYKPK DKSKSEPETI VSEIVAGVPK EEEAIPEQDV 

     15130      15140      15150      15160      15170      15180 
KFRKPERDAP EETDSEIKLR PVPQASKDEN PDEQALVTPK AEEPIPQEIE DKAIDDEKKP 

     15190      15200      15210      15220      15230      15240 
KKSKPKKVQP KEQEIAKEEP EEFEVSVKEE EALVDKPIEI EKPKDVKVKE KKPKEAPVSE 

     15250      15260      15270      15280      15290      15300 
VVVIEEEPKP EEVPEEIPVE YKITTTVLEP EDAPKEHQVK VIDFDERQET TEEVIEEKVV 

     15310      15320      15330      15340      15350      15360 
TRKKKPKPQQ PEEFEVTLKE PKEEQIQPDV VSAEISLPIE EPEQKPEQYE VELKITQTTP 

     15370      15380      15390      15400      15410      15420 
EEPNDVQIAV KEKVKTKPVK KVKEDKIVVV EAEEEKQPVE ETIVEVEKQE EKKKSEKPKS 

     15430      15440      15450      15460      15470      15480 
YEFKISETQS IEEKPIEVAE EAPEETPKVV EKKVAEKFDS YEFTLKETDE EKVITVDDQP 

     15490      15500      15510      15520      15530      15540 
EEEAPVEVVF KKKPKEPEAV EAEFVMTEPK IVEETSVETA IKQKKTKKPK KDEEEAQLAI 

     15550      15560      15570      15580      15590      15600 
KVVESEAPVA EEVFSEAPES KIVEEEVIAE EKPKEFTIRV SESEPKPEEP SVEQFTVKKR 

     15610      15620      15630      15640      15650      15660 
KPSVTFADEP ATEIVIKESK PAEVVTEDAH IKTKKPKKKV TDVEAEELKI KITEEVPQEI 

     15670      15680      15690      15700      15710      15720 
PILEEVSEEE VITETKKTAP VVEEKTYKIG IKETEPEKPA EAIVEEEEPV VTEPIEEAPK 

     15730      15740      15750      15760      15770      15780 
PEVFEEHKVR VIEETPRELV EEVIEEEVKV IRRKKPKPEI KEEPEAEVTV STPKPVEEVE 

     15790      15800      15810      15820      15830      15840 
ATSSIAVIPE QPTEEEAADL KITIIEEETP PQELVQEIEE IEIVEEPKAP EEQPTDFTFA 

     15850      15860      15870      15880      15890      15900 
TKDSEKKPTV EELPEEQVTI QKKKKKAPVP EVVEEPEAEF LVKPKTPVQE VTEEAKITKS 

     15910      15920      15930      15940      15950      15960 
KKPVKEEEAA AELKVTITEE IPTEPEVQEI IEEIEEIEEE KPAEYVIEVK ESQPEAVEDK 

     15970      15980      15990      16000      16010      16020 
EVSLPKKKPK APIVEEPEAE ITLKPKVKSE EVQEEAKIVK KKPKKIDEVA VADELTVKVE 

     16030      16040      16050      16060      16070      16080 
EEVVPEPIVE EEVIEEFEIK KKPKEPEPED IVDAAIVKLK KPEPVDADEV VAEVTLKPKA 

     16090      16100      16110      16120      16130      16140 
KTEVTEEEFS VDVKLPKEKK ERPVEIEEEE IIEEAVVIRK KPKKPFEPTV EDLEETEFSL 

     16150      16160      16170      16180      16190      16200 
SFKKPHTINE GVEEAATVLK KRPVKPTTLD EAAAELSIKR QEEEYEEGED IEEFVVSQQR 

     16210      16220      16230      16240      16250      16260 
KPKPLQITEE DEEAYTVKKL KRRKQVDIPE YADVENVTFR ARSTKTKEDV DQEFNIALDS 

     16270      16280      16290      16300      16310      16320 
YAEEEISMSG KVKLKKPIKK TFSEAADEAK IKIIQDFDDG EEPIIEEIRD DEDTIDEVEE 

     16330      16340      16350      16360      16370      16380 
PEEYFVEELP PDEVDFKLKP KKHPKPAYSV QDEEEEQFLI GIRHPKRDSV TYDEDSLTFK 

     16390      16400      16410      16420      16430      16440 
KKRKVVQQLF NEDGASLNIT REMNVEESEN LNIMYSICNY IADNNEAINL VEGEKVTVVG 

     16450      16460      16470      16480      16490      16500 
RHSSEWWYVK KSTTEEEGWV PAQYLMEPEE YAQYVQNKLH EKIDKLPVFE RPGPEDKPIA 

     16510      16520      16530      16540      16550      16560 
PRFIEKLQPI HTPDGYTVQF ECKVEGNPRP QIAWFRETAI IKPSQDFQMF YDDDNVATLI 

     16570      16580      16590      16600      16610      16620 
IREVFPEDAG QFTVVAKNAA GFTSSTTELI VESPLSDHGS DATALSRRSM SRESSLADIL 

     16630      16640      16650      16660      16670      16680 
EGIPPTFSKK PKAQYVDENT NVILECRLVA VPEPDIVWTF NGEDIDEEEI KNVRIVTESD 

     16690      16700      16710      16720      16730      16740 
MHMYCSVVHI SKVKKSQEGT YEVIATNREG EARLPITLKV RTTDKEAPQI LEPLRNMVIR 

     16750      16760      16770      16780      16790      16800 
EGESVVLSTQ IVGNPPPKVT WYKDGKPVKN AKSDKDLHTL TLITPQKSEK GEYTVKAVNP 

     16810      16820      16830      16840      16850      16860 
LGSVETTANL TIEEPAGGNA EPPLFVERFE EQNVPQKGEI RLPAKVSGNP VPEVQWLFNN 

     16870      16880      16890      16900      16910      16920 
TPLFPSERIQ QVYDGENIEL IIKDANPETD SGDYKCIASN PIGKTSHGAR VIVEVDEVTF 

     16930      16940      16950      16960      16970      16980 
TKKLKKTITI EEVQSLTLEC ETSHVVTTKW FFNGKELSGM DHRVVVEDGK THKLVIRNTN 

     16990      17000      17010      17020      17030      17040 
LRDSGTYTCK VKKQETQSTV EVLQRKPDFI KVLEDYEVTE KDTAILDVEL TTEATEVTWY 

     17050      17060      17070      17080      17090      17100 
KDGEKITPEN KNVEFIKDGK ARRLVIRDVT IHDEGQYTCK IEGQECSCEL VVIELPPEIV 

     17110      17120      17130      17140      17150      17160 
EPLNDVAVTK GENAVFEVEL SKGDALVKWF KNGKEIVFNE RIQLAIDGKK QSLRIVKAKP 

     17170      17180      17190      17200      17210      17220 
EDVGEYSVQV GEQTSKAKLT VEEPLVDFVI RLPDITLATK TTDAEFTVQL SQPDVEVTWC 

     17230      17240      17250      17260      17270      17280 
KKGKPIKPNQ KHEVFVEGTV RRLVIHDASD EDAGEISCVA ENVTSSTKLC VEELKLPPVI 

     17290      17300      17310      17320      17330      17340 
TSDKDQTIKV KENDDVTFTV KYTGVPTPEA CWTTRKVVIP KSKRTIPTID EQSAKLTIKK 

     17350      17360      17370      17380      17390      17400 
VVDDDEGEYT VKLVNPVGEA EASLHLVIMR KPTAPGTPQP LEIMHDSITL YWKAPEDDGK 

     17410      17420      17430      17440      17450      17460 
SEIIEYILEY QDVKEEKWTE IRKIKDTTYT ISKLKIDTEY VFRSIAVNEV GPSPPSPLSP 

     17470      17480      17490      17500      17510      17520 
PIRLVPKVET KAPSVQEPLQ DVVSELDKEV TLSCVFGGIP EPKVTWKKNG QVFESRSIRY 

     17530      17540      17550      17560      17570      17580 
ENRVAKYTIE KTTIETEATY TCVATNEKGS AETSCRLKLQ QKPVLEVEDK YLTQKLRTGS 

     17590      17600      17610      17620      17630      17640 
ILTIPATVRG YPQPTVTWHK ETIEQKTTKS VTIETTETTS TYTVKKVTRE QSGKYKVTAT 

     17650      17660      17670      17680      17690      17700 
NESGTTYVEC TVQVIDKPSR PQSLEIKDIK KDSIVLEWTP PVDDGGLDIE KYTLEKCDVQ 

     17710      17720      17730      17740      17750      17760 
NNVWMKVSDF NKDIKSYAVQ KLSMNAQYMF RVVAANPIGE SEPTESDPVT ITKKFEKPSP 

     17770      17780      17790      17800      17810      17820 
PRGPTTVSGM NDTSFNLAWE PSETDGGSKI IEYIVEIREE TETTYRSVGV TLGTVTNIHV 

     17830      17840      17850      17860      17870      17880 
EKVVRNKGYF FRIYARNEVG TSEAFETTEK IVLGRKITPP SPPQNLRAPD VTSRSVTLDW 

     17890      17900      17910      17920      17930      17940 
EVPARNGGSE ITGYCVEKRS STSTNWTKVI TLDAHQLHYT IDNLKEKCEY WFRVSAENEV 

     17950      17960      17970      17980      17990      18000 
GLGAPAVTES ISLKTHASEF ILVVRNCGSY TEFGFLKAVP SPPTGPLEAR VLAANAHIFE 

     18010      18020      18030      18040      18050      18060 
WGLPESDGGA PLLGYHIAIR DMKKTMWIEV GRVPAGVLKF QIRDLQENHE YMIRIFAKNE 

     18070      18080      18090      18100      18110      18120 
IGLSEPLESE EPYKAMTAGH ESLPDEPRTE MSSCNTSSWL RDHHMDADIH SYARGRLLQR 

     18130      18140 
DEYFFRLWAE LPKSKKKKSS K 


Isoform A (Ket) (Kettin) [UniParc].

Length: 4,863 AA. Mass: 548,598 Da.
Checksum: F39A3B9B7230A7E5

Isoform B [UniParc].

Length: 17,970 AA. Mass: 2,041,306 Da.
Checksum: C16FBFC3C09569CA

Isoform D [UniParc].

Length: 16,282 AA. Mass: 1,849,153 Da.
Checksum: 8419A8664D293917

Isoform E [UniParc].

Length: 18,178 AA. Mass: 2,069,971 Da.
Checksum: 737C3E6C5D524D9B

References

[1] "Requirements of Kettin, a giant muscle protein highly conserved in overall structure in evolution, for normal muscle function, viability, and flight activity of Drosophila."
Hakeda S., Endo S., Saigo K.
J. Cell Biol. 148:101-114(2000)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM A), FUNCTION, TISSUE SPECIFICITY.
[2] "Drosophila D-titin is required for myoblast fusion and skeletal muscle striation."
Zhang Y., Featherstone D., Davis W., Rushton E., Broadie K.
J. Cell Sci. 113:3103-3115(2000)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] (ISOFORM D), NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-187, NUCLEOTIDE SEQUENCE [MRNA] OF 4789-4963 AND 13411-13939 (ISOFORMS B/C/D/E), NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 15913-16691 (ISOFORM E), NUCLEOTIDE SEQUENCE [MRNA] OF 18064-18141 (ISOFORMS C/D/E), FUNCTION, TISSUE SPECIFICITY.
Strain: Berkeley and Oregon-R.
Tissue: Embryo, Larva and Ovary.
[3] "The genome sequence of Drosophila melanogaster."
Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., Kraft C.L., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.J., Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.C., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.
Science 287:2185-2195(2000)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Berkeley.
[4] "Annotation of the Drosophila melanogaster euchromatic genome: a systematic review."
Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.
Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002)
Cited for: GENOME REANNOTATION, ALTERNATIVE SPLICING (ISOFORMS A AND C).
Strain: Berkeley.
[5] "Human autoantibodies reveal titin as a chromosomal protein."
Machado C., Sunkel C.E., Andrew D.J.
J. Cell Biol. 141:321-333(1998)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1-880 (ISOFORMS A/C/D/E), NUCLEOTIDE SEQUENCE [MRNA] OF 7219-7572 (ISOFORMS B/C/D/E), NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 10456-10723, NUCLEOTIDE SEQUENCE [MRNA] OF 11358-11461 (ISOFORMS B/C/D/E), FUNCTION, SUBCELLULAR LOCATION, TISSUE SPECIFICITY.
[6] "Sequence and expression of the kettin gene in Drosophila melanogaster and Caenorhabditis elegans."
Kolmerer B., Clayton J., Benes V., Allen T., Ferguson C., Leonard K., Weber U., Knekt M., Ansorge W., Labeit S., Bullard B.
J. Mol. Biol. 296:435-448(2000)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-3999, FUNCTION, TISSUE SPECIFICITY.
Strain: Berkeley.
[7] "Association of kettin with actin in the Z-disc of insect flight muscle."
van Straaten M., Goulding D., Kolmerer B., Labeit S., Clayton J., Leonard K., Bullard B.
J. Mol. Biol. 285:1549-1562(1999)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 371-500 AND 3126-3354, ACTIN-BINDING, SUBCELLULAR LOCATION.
[8] "Kettin, a large modular protein in the Z-disc of insect muscles."
Lakey A., Labeit S., Gautel M., Ferguson C., Barlow D., Leonard K., Bullard B.
EMBO J. 12:2863-2871(1993)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1691-2214, FUNCTION, SUBCELLULAR LOCATION, TISSUE SPECIFICITY.
[9] "D-Titin: a giant protein with dual roles in chromosomes and muscles."
Machado C., Andrew D.J.
J. Cell Biol. 151:639-652(2000)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 3091-3407, NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 6635-7000, FUNCTION, SUBCELLULAR LOCATION, TISSUE SPECIFICITY, IDENTIFICATION (ISOFORMS A AND B), DEVELOPMENTAL STAGE, DISRUPTION PHENOTYPE.
Tissue: Muscle.
[10] "A Drosophila full-length cDNA resource."
Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., George R.A., Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., Rubin G.M., Celniker S.E.
Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 13581-14436 (ISOFORMS B/C/D/E).
Strain: Berkeley.
Tissue: Ovary.

Cross-references

Sequence databases

EMBL/GenBank/DDBJ:
AB026845 mRNA. Translation: BAA90301.2.
AJ238575 Genomic DNA. Translation: CAB43710.1.
AJ238577 Genomic DNA. Translation: CAB43739.2.
AJ271740 Genomic DNA. Translation: CAB93524.1.
AJ400900 mRNA. Translation: CAB96426.1.
AJ400901 mRNA. Translation: CAB96427.1.
AJ400902 mRNA. Translation: CAB96531.1.
AE014296 Genomic DNA. Translation: AAF47604.1.
AE014296 Genomic DNA. Translation: AAG22226.3.
AF045775 mRNA. Translation: AAC23966.1.
AF045776 mRNA. Translation: AAC23965.1.
AF045777 mRNA. Translation: AAC23964.1.
AF045778 Genomic DNA. Translation: AAC23963.1.
AJ245406 Genomic DNA. Translation: CAB76253.1.
AJ012279 mRNA. Translation: CAA09970.1.
AJ012280 mRNA. Translation: CAA09971.1.
AF135062 mRNA. Translation: AAF61414.1.
AF135167 Transcribed RNA. Translation: AAF62351.1.
AF241648 mRNA. Translation: AAG40155.1.
AF241649 Genomic DNA. Translation: AAF44704.1.
BK000146 mRNA. Translation: DAA00021.1.
AY094749 mRNA. Translation: AAM11102.1. Different initiation.
PIR: S35341.
RefSeq: NP_524676.2. NM_079937.3.
UniGene: Dm.2974.

3D structure databases

ProteinModelPortal: Q9I7U4.

Protein-protein interaction databases

BioGrid: 68776. 3 interactions.
IntAct: Q9I7U4. 1 interaction.
MINT: MINT-1723019.

Protein family/group databases

MEROPS: I43.001.

Proteomic databases

PaxDb: Q9I7U4.
PRIDE: Q9I7U4.

Genome annotation databases

GeneID: 44013.
KEGG: dme:Dmel_CG1915.

Organism-specific databases

CTD: 44013.
FlyBase: FBgn0086906. sls.

Phylogenomic databases

eggNOG: NOG12793.
InParanoid: Q9I7U4.
OrthoDB: EOG7327MW.

Enzyme and pathway databases

SignaLink: Q9I7U4.

Gene expression databases

Bgee: Q9I7U4.

Family and domain databases

Gene3D: 2.60.40.10. 58 hits.
InterPro: IPR010939. DUF1136.
IPR003961. Fibronectin_type3.
IPR007110. Ig-like_dom.
IPR013783. Ig-like_fold.
IPR013098. Ig_I-set.
IPR003599. Ig_sub.
IPR003598. Ig_sub2.
IPR001452. SH3_domain.
Pfam: PF06582. DUF1136. 13 hits.
PF00041. fn3. 5 hits.
PF07679. I-set. 53 hits.
PF00018. SH3_1. 1 hit.
SMART: SM00060. FN3. 5 hits.
SM00409. IG. 34 hits.
SM00408. IGc2. 15 hits.
SM00326. SH3. 1 hit.
SUPFAM: SSF49265. SSF49265. 3 hits.
SSF50044. SSF50044. 1 hit.
PROSITE: PS50853. FN3. 5 hits.
PS50835. IG_LIKE. 51 hits.
PS50002. SH3. 1 hit.

Other

ChiTaRS: SLS. drosophila.
GenomeRNAi: 44013.
NextBio: 836598.

Entry information

Entry name: TITIN_DROME
Accession: Primary (citable) accession number: Q9I7U4
Secondary accession number(s): O76275, O76276, O76277, O76278, O96767, O96768, Q7KA82, Q7KAJ1, Q7KPQ8, Q7RTL4, Q8T3N3, Q9N2P7, Q9N680, Q9N9Y5, Q9N9Y6, Q9N9Y7, Q9NFS3, Q9NJP4, Q9NL88, Q9UB79, Q9W055, Q9XZT9
Entry history:
Integrated into UniProtKB/Swiss-Prot: July 25, 2006
Last sequence update: March 18, 2008
Last modified: April 16, 2014
This is version 108 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Relevant documents

SIMILARITY comments

Index of protein domains and families

Drosophila

Drosophila: entries, gene names and cross-references to FlyBase