ID CO3A1_HUMAN Reviewed; 1466 AA. AC P02461; D2JYH5; D3DPH4; P78429; Q15112; Q16403; Q53S91; Q541P8; AC Q6LDB3; Q6LDJ2; Q6LDJ3; Q7KZ56; Q8N6U4; Q9UC88; Q9UC89; Q9UC90; AC Q9UC91; R4N3C5; V9GZI1; DT 21-JUL-1986, integrated into UniProtKB/Swiss-Prot. DT 23-JAN-2007, sequence version 4. DT 11-NOV-2015, entry version 181. DE RecName: Full=Collagen alpha-1(III) chain; DE Flags: Precursor; GN Name=COL3A1; OS Homo sapiens (Human). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. OX NCBI_TaxID=9606; RN [1] RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), AND VARIANT GLN-1353. RC TISSUE=Skin fibroblast; RX PubMed=2764886; RA Ala-Kokko L., Kontusaari S., Baldwin C.T., Kuivaniemi H., RA Prockop D.J.; RT "Structure of cDNA clones coding for the entire prepro alpha 1 (III) RT chain of human type III procollagen. Differences in protein structure RT from type I procollagen and conservation of codon preferences."; RL Biochem. J. 260:509-516(1989). RN [2] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] (ISOFORM 1), AND VARIANT GLN-1353. RX PubMed=11566270; DOI=10.1016/S0945-053X(01)00145-7; RA Valkkila M., Melkoniemi M., Kvist L., Kuivaniemi H., Tromp G., RA Ala-Kokko L.; RT "Genomic organization of the human COL3A1 and COL5A2 genes: COL5A2 has RT evolved differently than the other minor fibrillar collagen genes."; RL Matrix Biol. 20:357-366(2001). RN [3] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA], AND VARIANT ARG-726. RA Fang H.; RL Submitted (JAN-2013) to the EMBL/GenBank/DDBJ databases. RN [4] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA]. RG NHLBI resequencing and genotyping service (RS&G); RL Submitted (SEP-2009) to the EMBL/GenBank/DDBJ databases. RN [5] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX PubMed=15815621; DOI=10.1038/nature03466; RA Hillier L.W., Graves T.A., Fulton R.S., Fulton L.A., Pepin K.H., RA Minx P., Wagner-McPherson C., Layman D., Wylie K., Sekhon M., RA Becker M.C., Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., RA Kremitzki C., Oddy L., Du H., Sun H., Bradshaw-Cordum H., Ali J., RA Carter J., Cordes M., Harris A., Isak A., van Brunt A., Nguyen C., RA Du F., Courtney L., Kalicki J., Ozersky P., Abbott S., Armstrong J., RA Belter E.A., Caruso L., Cedroni M., Cotton M., Davidson T., Desai A., RA Elliott G., Erb T., Fronick C., Gaige T., Haakenson W., Haglund K., RA Holmes A., Harkins R., Kim K., Kruchowski S.S., Strong C.M., RA Grewal N., Goyea E., Hou S., Levy A., Martinka S., Mead K., RA McLellan M.D., Meyer R., Randall-Maher J., Tomlinson C., RA Dauphin-Kohlberg S., Kozlowicz-Reilly A., Shah N., RA Swearengen-Shahid S., Snider J., Strong J.T., Thompson J., Yoakum M., RA Leonard S., Pearman C., Trani L., Radionenko M., Waligorski J.E., RA Wang C., Rock S.M., Tin-Wollam A.-M., Maupin R., Latreille P., RA Wendl M.C., Yang S.-P., Pohl C., Wallis J.W., Spieth J., Bieri T.A., RA Berkowicz N., Nelson J.O., Osborne J., Ding L., Meyer R., Sabo A., RA Shotland Y., Sinha P., Wohldmann P.E., Cook L.L., Hickenbotham M.T., RA Eldred J., Williams D., Jones T.A., She X., Ciccarelli F.D., RA Izaurralde E., Taylor J., Schmutz J., Myers R.M., Cox D.R., Huang X., RA McPherson J.D., Mardis E.R., Clifton S.W., Warren W.C., RA Chinwalla A.T., Eddy S.R., Marra M.A., Ovcharenko I., Furey T.S., RA Miller W., Eichler E.E., Bork P., Suyama M., Torrents D., RA Waterston R.H., Wilson R.K.; RT "Generation and annotation of the DNA sequences of human chromosomes 2 RT and 4."; RL Nature 434:724-731(2005). RN [6] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], AND VARIANT GLN-1353. RA Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., RA Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., RA Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., RA Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R., RA Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., RA Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., RA Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., RA Venter J.C.; RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. RN [7] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2), AND VARIANT RP GLN-1353. RC TISSUE=Liver; RX PubMed=15489334; DOI=10.1101/gr.2596504; RG The MGC Project Team; RT "The status, quality, and expansion of the NIH full-length cDNA RT project: the Mammalian Gene Collection (MGC)."; RL Genome Res. 14:2121-2127(2004). RN [8] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-176. RX PubMed=2777083; DOI=10.1016/0378-1119(89)90228-X; RA Benson-Chanda V., Su M.W., Weil D., Chu M.-L., Ramirez F.; RT "Cloning and analysis of the 5' portion of the human type-III RT procollagen gene (COL3A1)."; RL Gene 78:255-265(1989). RN [9] RP NUCLEOTIDE SEQUENCE [MRNA] OF 1-170 (ISOFORMS 1/2). RC TISSUE=Placenta; RX PubMed=3405773; DOI=10.1093/nar/16.14.7201; RA Toman D., Ricca G., de Crombrugghe B.; RT "Nucleotide sequence of a cDNA coding for the amino-terminal region of RT human prepro alpha 1(III) collagen."; RL Nucleic Acids Res. 16:7201-7201(1988). RN [10] RP NUCLEOTIDE SEQUENCE [MRNA] OF 149-1225 (ISOFORM 1). RX PubMed=2780304; DOI=10.1093/nar/17.16.6742; RA Janeczko R.A., Ramirez F.; RT "Nucleotide and amino acid sequences of the entire human alpha 1 (III) RT collagen."; RL Nucleic Acids Res. 17:6742-6742(1989). RN [11] RP PROTEIN SEQUENCE OF 168-398, AND HYDROXYLATION. RX PubMed=557335; DOI=10.1021/bi00625a020; RA Seyer J.M., Kang A.H.; RT "Covalent structure of collagen: amino acid sequence of cyanogen RT bromide peptides from the amino-terminal segment of type III collagen RT of human liver."; RL Biochemistry 16:1158-1164(1977). RN [12] RP SEQUENCE REVISION. RA Seyer J.M.; RL Submitted (DEC-1977) to the PIR data bank. RN [13] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 186-194 (ISOFORMS 1/2). RX PubMed=8317500; RA Milewicz D.M., Witz A.M., Smith A.C., Manchester D.K., Waldstein G., RA Byers P.H.; RT "Parental somatic and germ-line mosaicism for a multiexon deletion RT with unusual endpoints in a type III collagen (COL3A1) allele produces RT Ehlers-Danlos syndrome type IV in the heterozygous offspring."; RL Am. J. Hum. Genet. 53:62-70(1993). RN [14] RP NUCLEOTIDE SEQUENCE [MRNA] OF 302-423 (ISOFORMS 1/2). RX PubMed=7487954; RA Chiodo A.A., Sillence D.O., Cole W.G., Bateman J.F.; RT "Abnormal type III collagen produced by an exon-17-skipping mutation RT of the COL3A1 gene in Ehlers-Danlos syndrome type IV is not RT incorporated into the extracellular matrix."; RL Biochem. J. 311:939-943(1995). RN [15] RP PROTEIN SEQUENCE OF 395-416; 579-595; 800-814 AND 1064-1079. RC TISSUE=Colon carcinoma; RX PubMed=7864881; DOI=10.1006/bbrc.1995.1264; RA Minafra I.P., Andriolo M., Basirico L., Aquino A., Minafra S., RA Boutillon M.-M., van der Rest M.; RT "Onco-fetal/laminin-binding collagen from colon carcinoma: detection RT of new sequences."; RL Biochem. Biophys. Res. Commun. 207:852-859(1995). RN [16] RP PROTEIN SEQUENCE OF 399-727. RX PubMed=687591; DOI=10.1021/bi00609a034; RA Seyer J.M., Kang A.H.; RT "Covalent structure of collagen: amino acid sequence of five RT consecutive CNBr peptides from type III collagen of human liver."; RL Biochemistry 17:3404-3411(1978). RN [17] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 537-605. RX PubMed=1672129; RA Lee B., Vitale E., Superti-Furga A., Steinmann B., Ramirez F.; RT "G to T transversion at position +5 of a splice donor site causes RT skipping of the preceding exon in the type III procollagen transcripts RT of a patient with Ehlers-Danlos syndrome type IV."; RL J. Biol. Chem. 266:5256-5259(1991). RN [18] RP PROTEIN SEQUENCE OF 728-964. RX PubMed=6246925; DOI=10.1021/bi00549a008; RA Seyer J.M., Mainardi C., Kang A.H.; RT "Covalent structure of collagen: amino acid sequence of alpha 1 (III)- RT CB5 from type III collagen of human liver."; RL Biochemistry 19:1583-1589(1980). RN [19] RP NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA] OF 861-1015 (ISOFORM 1). RX PubMed=2145268; RA Cole W.G., Chiodo A.A., Lamande S.R., Janeczko R., Ramirez F., RA Dahl H.-H.M., Chan D., Bateman J.F.; RT "A base substitution at a splice site in the COL3A1 gene causes exon RT skipping and generates abnormal type III procollagen in a patient with RT Ehlers-Danlos syndrome type IV."; RL J. Biol. Chem. 265:17070-17077(1990). RN [20] RP NUCLEOTIDE SEQUENCE [MRNA] OF 950-1466 (ISOFORM 1), AND VARIANT RP GLN-1353. RX PubMed=3357782; DOI=10.1093/nar/16.5.2337; RA Mankoo B.S., Dalgleish R.; RT "Human pro alpha 1(III) collagen: cDNA sequence for the 3' end."; RL Nucleic Acids Res. 16:2337-2337(1988). RN [21] RP SEQUENCE REVISION TO 1184. RX PubMed=3211760; DOI=10.1093/nar/16.24.11833; RA Molyneux K., Dalgleish R.; RT "Human type III collagen 'variant' is a cDNA cloning artefact."; RL Nucleic Acids Res. 16:11833-11833(1988). RN [22] RP PROTEIN SEQUENCE OF 965-1200, AND HYDROXYLATION. RX PubMed=7016180; DOI=10.1021/bi00512a040; RA Seyer J.M., Kang A.H.; RT "Covalent structure of collagen: amino acid sequence of alpha 1(III)- RT CB9 from type III collagen of human liver."; RL Biochemistry 20:2621-2627(1981). RN [23] RP NUCLEOTIDE SEQUENCE [MRNA] OF 1065-1466 (ISOFORM 1). RX PubMed=6096827; DOI=10.1093/nar/12.24.9383; RA Loidl H.R., Brinker J.M., May M., Pihlajaniemi T., Morrow S., RA Rosenbloom J., Myers J.C.; RT "Molecular cloning and carboxyl-propeptide analysis of human type III RT procollagen."; RL Nucleic Acids Res. 12:9383-9394(1984). RN [24] RP NUCLEOTIDE SEQUENCE [MRNA] OF 1161-1200 (ISOFORMS 1/2). RX PubMed=3754462; DOI=10.1021/bi00354a033; RA Miskulin M., Dalgleish R., Kluve-Beckerman B., Rennard S.I., RA Tolstoshev P., Brantly M., Crystal R.G.; RT "Human type III collagen gene expression is coordinately modulated RT with the type I collagen genes during fibroblast growth."; RL Biochemistry 25:1408-1413(1986). RN [25] RP NUCLEOTIDE SEQUENCE [MRNA] OF 1165-1196 (ISOFORMS 1/2). RX PubMed=3858826; DOI=10.1073/pnas.82.10.3385; RA Emanuel B.S., Cannizzaro L.A., Seyer J.M., Myers J.C.; RT "Human alpha 1(III) and alpha 2(V) procollagen genes are located on RT the long arm of chromosome 2."; RL Proc. Natl. Acad. Sci. U.S.A. 82:3385-3389(1985). RN [26] RP NUCLEOTIDE SEQUENCE [MRNA] OF 1122-1466 (ISOFORMS 1/2), AND VARIANT RP GLN-1353. RX PubMed=2579949; RA Chu M.-L., Weil D., de Wet W.J., Bernard M.P., Sippola M., Ramirez F.; RT "Isolation of cDNA and genomic clones encoding human pro-alpha 1 (III) RT collagen. Partial characterization of the 3' end region of the gene."; RL J. Biol. Chem. 260:4357-4363(1985). RN [27] RP REVIEW ON VARIANTS. RX PubMed=9101290; RX DOI=10.1002/(SICI)1098-1004(1997)9:4<300::AID-HUMU2>3.0.CO;2-9; RA Kuivaniemi H., Tromp G., Prockop D.J.; RT "Mutations in fibrillar collagens (types I, II, III, and XI), fibril- RT associated collagen (type IX), and network-forming collagen (type X) RT cause a spectrum of diseases of bone, cartilage, and blood vessels."; RL Hum. Mutat. 9:300-315(1997). RN [28] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RX PubMed=22905912; DOI=10.1021/pr300539b; RA Rosenow A., Noben J.P., Jocken J., Kallendrusch S., RA Fischer-Posovszky P., Mariman E.C., Renes J.; RT "Resveratrol-induced changes of the human adipocyte secretion RT profile."; RL J. Proteome Res. 11:4733-4743(2012). RN [29] RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS]. RC TISSUE=Liver; RX PubMed=24275569; DOI=10.1016/j.jprot.2013.11.014; RA Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., RA Wang L., Ye M., Zou H.; RT "An enzyme assisted RP-RPLC approach for in-depth analysis of human RT liver phosphoproteome."; RL J. Proteomics 96:253-262(2014). RN [30] RP X-RAY CRYSTALLOGRAPHY (2.3 ANGSTROMS) OF 1158-1199, AND INTERCHAIN RP DISULFIDE BONDS. RX PubMed=18805790; DOI=10.1074/jbc.M805394200; RA Boudko S.P., Engel J., Okuyama K., Mizuno K., Bachinger H.P., RA Schumacher M.A.; RT "Crystal structure of human type III collagen Gly991-Gly1032 cystine RT knot-containing peptide shows both 7/2 and 10/3 triple helical RT symmetries."; RL J. Biol. Chem. 283:32580-32589(2008). RN [31] RP X-RAY CRYSTALLOGRAPHY (1.68 ANGSTROMS) OF 1222-1466, PROPEPTIDE, RP DISULFIDE BONDS, AND CALCIUM-BINDING SITES. RX PubMed=23001006; DOI=10.1038/nsmb.2389; RA Bourhis J.M., Mariano N., Zhao Y., Harlos K., Exposito J.Y., RA Jones E.Y., Moali C., Aghajari N., Hulmes D.J.; RT "Structural basis of fibrillar collagen trimerization and related RT genetic disorders."; RL Nat. Struct. Mol. Biol. 19:1031-1036(2012). RN [32] RP VARIANT AAA ARG-303, AND VARIANT THR-668. RX PubMed=8514866; DOI=10.1172/JCI116490; RA Tromp G., Wu Y., Prockop D.J., Madhatheri S.L., Kleinert C., RA Earley J.J., Zhuang J., Noerrgaard O., Darling R.C., Abbott W.M., RA Cole C.W., Jaakkola P., Ryynaenen M., Pearce W.H., Yao J.S.T., RA Majamaa K., Smullens S.N., Gatalica Z., Ferrell R.E., Jimenez S.A., RA Jackson C.E., Michels V.V., Kaye M., Kuivaniemi H.; RT "Sequencing of cDNA from 50 unrelated patients reveals that mutations RT in the triple-helical domain of type III procollagen are an infrequent RT cause of aortic aneurysms."; RL J. Clin. Invest. 91:2539-2545(1993). RN [33] RP VARIANT THR-698. RX PubMed=2235526; DOI=10.1093/nar/18.20.6180; RA Zafarullah K., Kleinert C., Tromp G., Kuivaniemi H., Kontusaari S., RA Wu Y., Ganguly A., Prockop D.J.; RT "G to A polymorphism in exon 31 of the COL3A1 gene."; RL Nucleic Acids Res. 18:6180-6180(1990). RN [34] RP VARIANT AAA ARG-786. RX PubMed=2243125; DOI=10.1172/JCI114863; RA Kontusaari S., Tromp G., Kuivaniemi H., Romanic A.M., Prockop D.J.; RT "A mutation in the gene for type III procollagen (COL3A1) in a family RT with aortic aneurysms."; RL J. Clin. Invest. 86:1465-1473(1990). RN [35] RP VARIANT EDS4 830-PRO--PRO-838 DEL. RX PubMed=1370809; DOI=10.1007/BF00197268; RA Richards A.J., Lloyd J.C., Narcisi P., Ward P.N., Nicholls A.C., RA De Paepe A., Pope F.M.; RT "A 27-bp deletion from one allele of the type III collagen gene RT (COL3A1) in a large family with Ehlers-Danlos syndrome type IV."; RL Hum. Genet. 88:325-330(1992). RN [36] RP VARIANT EDS4 ARG-828. RX PubMed=8411057; DOI=10.1136/jmg.30.8.690; RA Richards A.J., Narcisi P., Lloyd J.C., Ferguson C., Pope F.M.; RT "The substitution of glycine 661 by arginine in type III collagen RT produces mutant molecules with different thermal stabilities and RT causes Ehlers-Danlos syndrome type IV."; RL J. Med. Genet. 30:690-693(1993). RN [37] RP VARIANT EDS4 SER-957. RX PubMed=2492273; RA Tromp G., Kuivaniemi H., Shikata H., Prockop D.J.; RT "A single base mutation that substitutes serine for glycine 790 of the RT alpha 1 (III) chain of type III procollagen exposes an arginine and RT causes Ehlers-Danlos syndrome IV."; RL J. Biol. Chem. 264:1349-1352(1989). RN [38] RP VARIANT EDS4 VAL-960. RX PubMed=7749417; DOI=10.1002/humu.1380050213; RA Tromp G., de Paepe A., Nuytinck L., Madhatheri S.L., Kuivaniemi H.; RT "Substitution of valine for glycine 793 in type III procollagen in RT Ehlers-Danlos syndrome type IV."; RL Hum. Mutat. 5:179-181(1995). RN [39] RP VARIANT EDS4 GLU-1014. RX PubMed=1352273; DOI=10.1007/BF00194313; RA Richards A.J., Ward P.N., Narcisi P., Nicholls A.C., Lloyd J.C., RA Pope F.M.; RT "A single base mutation in the gene for type III collagen (COL3A1) RT converts glycine 847 to glutamic acid in a family with Ehlers-Danlos RT syndrome type IV. An unaffected family member is mosaic for the RT mutation."; RL Hum. Genet. 89:414-418(1992). RN [40] RP VARIANT EDS4 ASP-1050. RX PubMed=2808425; RA Tromp G., Kuivaniemi H., Stolle C.A., Pope F.M., Prockop D.J.; RT "Single base mutation in the type III procollagen gene that converts RT the codon for glycine 883 to aspartate in a mild variant of Ehlers- RT Danlos syndrome IV."; RL J. Biol. Chem. 264:19313-19317(1989). RN [41] RP VARIANT AAA ARG-786. RX PubMed=2349939; RA Kontusaari S., Tromp G., Kuivaniemi H., Ladda R.L., Prockop D.J.; RT "Inheritance of an RNA splicing mutation (G+ 1 IVS20) in the type III RT procollagen gene (COL3A1) in a family having aortic aneurysms and easy RT bruisability: phenotypic overlap between familial arterial aneurysms RT and Ehlers-Danlos syndrome type IV."; RL Am. J. Hum. Genet. 47:112-120(1990). RN [42] RP VARIANT EDS4 VAL-1077. RX PubMed=1895316; DOI=10.1136/jmg.28.7.458; RA Richards A.J., Lloyd J.C., Ward P.N., de Paepe A., Narcisi P., RA Pope F.M.; RT "Characterisation of a glycine to valine substitution at amino acid RT position 910 of the triple helical region of type III collagen in a RT patient with Ehlers-Danlos syndrome type IV."; RL J. Med. Genet. 28:458-463(1991). RN [43] RP VARIANT EDS4 GLU-1173. RX PubMed=1357232; DOI=10.1007/BF02435995; RA Johnson P.H., Richards A.J., Pope F.M., Hopkinson D.A.; RT "A COL3A1 glycine 1006 to glutamic acid substitution in a patient with RT Ehlers-Danlos syndrome type IV detected by denaturing gradient gel RT electrophoresis."; RL J. Inherit. Metab. Dis. 15:426-430(1992). RN [44] RP VARIANT EDS4 ASP-1185. RX PubMed=1496983; RA Kontusaari S., Tromp G., Kuivaniemi H., Stolle C.A., Pope F.M., RA Prockop D.J.; RT "Substitution of aspartate for glycine 1018 in the type III RT procollagen (COL3A1) gene causes type IV Ehlers-Danlos syndrome: the RT mutated allele is present in most blood leukocytes of the asymptomatic RT and mosaic mother."; RL Am. J. Hum. Genet. 51:497-507(1992). RN [45] RP VARIANT EDS4 GLU-1188. RX PubMed=8098182; DOI=10.1002/ajmg.1320460308; RA Narcisi P., Wu Y., Tromp G., Earley J.J., Richards A.J., Pope F.M., RA Kuivaniemi H.; RT "Single base mutation that substitutes glutamic acid for glycine 1021 RT in the COL3A1 gene and causes Ehlers-Danlos syndrome type IV."; RL Am. J. Med. Genet. 46:278-283(1993). RN [46] RP VARIANTS EDS4 VAL-1167; ASP-1170 AND GLU-1173. RA Richards A.J., Narcisi P., Lloyd J.C., Johnson P.H., Hopkinson D.A., RA Pope F.M.; RT "Substitution of glycines 1000, 1003 and 1006 in type III collagen all RT cause the acrogeric form of EDS-IV, and destabilise the collagen RT triple helix."; RL Matrix 13:47-47(1993). RN [47] RP VARIANTS THR-602 AND LEU-635. RX PubMed=8255472; DOI=10.1212/WNL.43.12.2652; RA Kuivaniemi H., Prockop D.J., Wu Y., Madhatheri S.L., Kleinert C., RA Earley J.J., Jokinen A., Stolle C.A., Majamaa K., Mylllylae V.V., RA Noerrgaard O., Schievink W.I., Mokri B., Fukawa O., Ter Berg J.W.M., RA de Paepe A., Lozano A.M., Leblanc R., Ryynaenen M., Baxter B.T., RA Shikata H., Ferrell R.E., Tromp G.; RT "Exclusion of mutations in the gene for type III collagen (COL3A1) as RT a common cause of intracranial aneurysms or cervical artery RT dissections: results from sequence analysis of the coding sequences of RT type III collagen from 55 unrelated patients."; RL Neurology 43:2652-2658(1993). RN [48] RP VARIANT EDS4 GLU-756. RX PubMed=7912131; DOI=10.1093/hmg/3.3.511; RA Madhatheri S.L., Tromp G., Gustavson K.H., Kuivaniemi H.; RT "Substitution of glutamic acid for glycine 589 in the triple-helical RT domain of type III procollagen (COL3A1) in a family with variable RT phenotype of the Ehlers-Danlos syndrome type IV."; RL Hum. Mol. Genet. 3:511-512(1994). RN [49] RP VARIANT EDS3 SER-804. RX PubMed=7833919; DOI=10.1093/hmg/3.9.1617; RA Narcisi P., Richards A.J., Ferguson S.D., Pope F.M.; RT "A family with Ehlers-Danlos syndrome type III/articular hypermobility RT syndrome has a glycine 637-to-serine substitution in type III RT collagen."; RL Hum. Mol. Genet. 3:1617-1620(1994). RN [50] RP VARIANT EDS4 VAL-1176. RX PubMed=8019562; DOI=10.1002/humu.1380030315; RA Nuytinck L., De Paepe A., Renard J.P., Adriaens F., Leroy J.; RT "Single-strand conformation polymorphism (SSCP) analysis of the COL3A1 RT gene detects a mutation that results in the substitution of glycine RT 1009 to valine and causes severe Ehlers-Danlos syndrome type IV."; RL Hum. Mutat. 3:268-274(1994). RN [51] RP VARIANTS EDS4 CYS-183; ARG-201; GLU-228; ARG-540; ARG-936; GLU-996; RP VAL-1071; ALA-1104; GLU-1182; VAL-1185; GLU-1188 AND ARG-1188. RA Goldstein J.A., Schwarze U., Witz A., Byers P.H.; RT "Marked heterogeneity in COL3A1 mutations that produce the EDS type IV RT phenotype."; RL Matrix Biol. 14:392-393(1994). RN [52] RP VARIANTS EDS4 ASP-909 AND ASP-939. RX PubMed=8680408; DOI=10.1002/humu.1380060408; RA Johnson P.H., Richards A.J., Lloyd J.C., Pope F.M., Hopkinson D.A.; RT "Efficient strategy for the detection of mutations in acrogeric RT Ehlers-Danlos syndrome type IV."; RL Hum. Mutat. 6:336-342(1995). RN [53] RP VARIANTS EDS4 GLU-567; CYS-762 AND ASP-1170. RX PubMed=8884076; RA Mackay K., Raghunath M., Superti-Furga A., Steinmann B., Dalgleish R.; RT "Ehlers-Danlos syndrome type IV caused by Gly400Glu, Gly595Cys and RT Gly1003Asp substitutions in collagen III: clinical features, RT biochemical screening, and molecular confirmation."; RL Clin. Genet. 49:286-295(1996). RN [54] RP VARIANT EDS4 GLU-1101. RX PubMed=9147870; RA McGrory J., Weksberg R., Thorner P., Cole W.G.; RT "Abnormal extracellular matrix in Ehlers-Danlos syndrome type IV due RT to the substitution of glycine 934 by glutamic acid in the triple RT helical domain of type III collagen."; RL Clin. Genet. 50:442-445(1996). RN [55] RP VARIANT EDS4 ASP-666. RX PubMed=8664902; RX DOI=10.1002/(SICI)1098-1004(1996)7:1<59::AID-HUMU8>3.3.CO;2-R; RA McGrory J., Costa T., Cole W.G.; RT "A novel G499D substitution in the alpha 1(III) chain of type III RT collagen produces variable forms of Ehlers-Danlos syndrome type IV."; RL Hum. Mutat. 7:59-60(1996). RN [56] RP VARIANT EDS4 SER-582. RX PubMed=8990011; RX DOI=10.1002/(SICI)1098-1004(1997)9:1<62::AID-HUMU11>3.3.CO;2-B; RA Anderson D.W., Thakker-Varia S., Tromp G., Kuivaniemi H., Stolle C.A.; RT "A glycine (415)-to-serine substitution results in impaired secretion RT and decreased thermal stability of type III procollagen in a patient RT with Ehlers-Danlos syndrome type IV."; RL Hum. Mutat. 9:62-63(1997). RN [57] RP VARIANTS EDS4 CYS-183; ARG-201; GLU-228; ARG-540 AND ARG-936. RX PubMed=9036918; DOI=10.1111/1523-1747.ep12286441; RA Smith L.T., Schwarze U., Goldstein J., Byers P.H.; RT "Mutations in the COL3A1 gene result in the Ehlers-Danlos syndrome RT type IV and alterations in the size and distribution of the major RT collagen fibrils of the dermis."; RL J. Invest. Dermatol. 108:241-247(1997). RN [58] RP VARIANT EDS4 ARG-726. RX PubMed=9452103; RA Bateman J.F., Chiodo A.A., Weng Y.M., Chan D., Haan E.; RT "A type III collagen Gly559 to Arg helix mutation in Ehler's-Danlos RT syndrome type IV."; RL Hum. Mutat. Suppl. 1:S257-S259(1998). RN [59] RP VARIANT EDS-IV ARG-1173. RX PubMed=10819545; DOI=10.1046/j.1365-2133.2000.03266.x; RA Jansen T., de Paepe A., Luytinck N., Plewig G.; RT "COL3A1 mutation leading to acrogeria (Gottron Type)."; RL Br. J. Dermatol. 142:178-180(2000). RN [60] RP VARIANTS EDS4 ASP-204; ASP-210; ARG-264; ASP-327; ASP-1098 AND RP GLU-1173. RX PubMed=10923041; RX DOI=10.1002/1098-1004(200008)16:2<176::AID-HUMU12>3.0.CO;2-E; RA Giunta C., Steinmann B.; RT "Characterization of 11 new mutations in COL3A1 of individuals with RT Ehlers-Danlos syndrome type IV: preliminary comparison of RNase RT cleavage, EMC and DHPLC assays."; RL Hum. Mutat. 16:176-177(2000). RN [61] RP VARIANTS EDS4. RX PubMed=10706896; DOI=10.1056/NEJM200003093421001; RA Pepin M., Schwarze U., Superti-Furga A., Byers P.H.; RT "Clinical and genetic features of Ehlers-Danlos syndrome type IV, the RT vascular type."; RL N. Engl. J. Med. 342:673-680(2000). RN [62] RP VARIANT EDS4 ASP-1044. RX PubMed=11168790; DOI=10.1046/j.1365-2796.2001.00761.x; RA Nishiyama Y., Nejima J., Watanabe A., Kotani E., Sakai N., RA Hatamochi A., Shinkai H., Kiuchi K., Tamura K., Shimada T., Takano T., RA Katayama Y.; RT "Ehlers-Danlos syndrome type IV with a unique point mutation in COL3A1 RT and familial phenotype of myocardial infarction without organic RT coronary stenosis."; RL J. Intern. Med. 249:103-108(2001). RN [63] RP VARIANT EDS4 ARG-297. RX PubMed=12694234; DOI=10.1034/j.1399-0004.2003.00047.x; RA Kroes H.Y., Pals G., van Essen A.J.; RT "Ehlers-Danlos syndrome type IV: unusual congenital anomalies in a RT mother and son with a COL3A1 mutation and a normal collagen III RT protein profile."; RL Clin. Genet. 63:224-227(2003). RN [64] RP ERRATUM. RA Kroes H.Y., Pals G., van Essen A.J.; RL Clin. Genet. 64:375-375(2003). RN [65] RP VARIANT EDS4 VAL-1050. RX PubMed=12786757; DOI=10.1034/j.1399-0004.2003.00075.x; RA Palmeri S., Mari F., Meloni I., Malandrini A., Ariani F., RA Villanova M., Pompilio A., Schwarze U., Byers P.H., Renieri A.; RT "Neurological presentation of Ehlers-Danlos syndrome type IV in a RT family with parental mosaicism."; RL Clin. Genet. 63:510-515(2003). RN [66] RP VARIANTS [LARGE SCALE ANALYSIS] SER-420 AND CYS-1434. RX PubMed=16959974; DOI=10.1126/science.1133427; RA Sjoeblom T., Jones S., Wood L.D., Parsons D.W., Lin J., Barber T.D., RA Mandelker D., Leary R.J., Ptak J., Silliman N., Szabo S., RA Buckhaults P., Farrell C., Meeh P., Markowitz S.D., Willis J., RA Dawson D., Willson J.K.V., Gazdar A.F., Hartigan J., Wu L., Liu C., RA Parmigiani G., Park B.H., Bachman K.E., Papadopoulos N., RA Vogelstein B., Kinzler K.W., Velculescu V.E.; RT "The consensus coding sequences of human breast and colorectal RT cancers."; RL Science 314:268-274(2006). RN [67] RP VARIANTS THR-679; THR-698; VAL-1205 AND GLN-1353. RX PubMed=18272325; DOI=10.1016/j.ygeno.2007.12.008; RA Chan T.F., Poon A., Basu A., Addleman N.R., Chen J., Phong A., RA Byers P.H., Klein T.E., Kwok P.Y.; RT "Natural variation in four human collagen genes across an ethnically RT diverse population."; RL Genomics 91:307-314(2008). CC -!- FUNCTION: Collagen type III occurs in most soft connective tissues CC along with type I collagen. Involved in regulation of cortical CC development. Is the major ligand of GPR56 in the developing brain CC and binding to GPR56 inhibits neuronal migration and activates the CC RhoA pathway by coupling GPR56 to GNA13 and possibly GNA12. CC -!- SUBUNIT: Trimers of identical alpha 1(III) chains. The chains are CC linked to each other by interchain disulfide bonds. Trimers are CC also cross-linked via hydroxylysines. CC -!- INTERACTION: CC O01949:AAEL010235 (xeno); NbExp=2; IntAct=EBI-2431491, EBI-7685554; CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular CC matrix {ECO:0000255|PROSITE-ProRule:PRU00793}. CC -!- ALTERNATIVE PRODUCTS: CC Event=Alternative splicing; Named isoforms=2; CC Name=1; CC IsoId=P02461-1; Sequence=Displayed; CC Name=2; CC IsoId=P02461-2; Sequence=VSP_022502; CC Note=No experimental confirmation available.; CC -!- DOMAIN: The C-terminal propeptide, also known as COLFI domain, CC have crucial roles in tissue growth and repair by controlling both CC the intracellular assembly of procollagen molecules and the CC extracellular assembly of collagen fibrils. It binds a calcium ion CC which is essential for its function. CC -!- PTM: Proline residues at the third position of the tripeptide CC repeating unit (G-X-Y) are hydroxylated in some or all of the CC chains. {ECO:0000269|PubMed:557335, ECO:0000269|PubMed:7016180}. CC -!- PTM: O-linked glycan consists of a Glc-Gal disaccharide bound to CC the oxygen atom of a post-translationally added hydroxyl group. CC -!- DISEASE: Ehlers-Danlos syndrome 3 (EDS3) [MIM:130020]: A CC connective tissue disorder characterized by hyperextensible skin, CC atrophic cutaneous scars due to tissue fragility and joint CC hyperlaxity. It is a form of Ehlers-Danlos syndrome characterized CC by marked joint hyperextensibility without skeletal deformity. CC {ECO:0000269|PubMed:7833919}. Note=The disease is caused by CC mutations affecting the gene represented in this entry. CC -!- DISEASE: Ehlers-Danlos syndrome 4 (EDS4) [MIM:130050]: The most CC severe form of Ehlers-Danlos syndrome, a connective tissue CC disorder characterized by hyperextensible skin, atrophic cutaneous CC scars due to tissue fragility and joint hyperlaxity. Characterized CC by the joint and dermal manifestations as in other forms of the CC syndrome, characteristic facial features (acrogeria) in most CC patients, and by proneness to spontaneous rupture of bowel and CC large arteries. The vascular complications may affect all CC anatomical areas. {ECO:0000269|PubMed:10706896, CC ECO:0000269|PubMed:10923041, ECO:0000269|PubMed:11168790, CC ECO:0000269|PubMed:12694234, ECO:0000269|PubMed:12786757, CC ECO:0000269|PubMed:1352273, ECO:0000269|PubMed:1357232, CC ECO:0000269|PubMed:1370809, ECO:0000269|PubMed:1496983, CC ECO:0000269|PubMed:1895316, ECO:0000269|PubMed:2492273, CC ECO:0000269|PubMed:2808425, ECO:0000269|PubMed:7749417, CC ECO:0000269|PubMed:7912131, ECO:0000269|PubMed:8019562, CC ECO:0000269|PubMed:8098182, ECO:0000269|PubMed:8411057, CC ECO:0000269|PubMed:8664902, ECO:0000269|PubMed:8680408, CC ECO:0000269|PubMed:8884076, ECO:0000269|PubMed:8990011, CC ECO:0000269|PubMed:9036918, ECO:0000269|PubMed:9147870, CC ECO:0000269|PubMed:9452103, ECO:0000269|Ref.46, CC ECO:0000269|Ref.51}. Note=The disease is caused by mutations CC affecting the gene represented in this entry. CC -!- DISEASE: Aortic aneurysm, familial abdominal (AAA) [MIM:100070]: A CC common multifactorial disorder characterized by permanent dilation CC of the abdominal aorta, usually due to degenerative changes in the CC aortic wall. Histologically, AAA is characterized by signs of CC chronic inflammation, destructive remodeling of the extracellular CC matrix, and depletion of vascular smooth muscle cells. CC {ECO:0000269|PubMed:2243125, ECO:0000269|PubMed:2349939, CC ECO:0000269|PubMed:8514866}. Note=Disease susceptibility is CC associated with variations affecting the gene represented in this CC entry. CC -!- SIMILARITY: Belongs to the fibrillar collagen family. CC {ECO:0000255|PROSITE-ProRule:PRU00793}. CC -!- SIMILARITY: Contains 1 fibrillar collagen NC1 domain. CC {ECO:0000255|PROSITE-ProRule:PRU00793}. CC -!- SIMILARITY: Contains 1 VWFC domain. {ECO:0000255|PROSITE- CC ProRule:PRU00220}. CC -!- WEB RESOURCE: Name=COL3A1; Note=Collagen type III alpha-1 chain CC mutations; CC URL="http://www.le.ac.uk/genetics/collagen/col3a1.html"; CC -!- WEB RESOURCE: Name=Wikipedia; Note=Type-III collagen entry; CC URL="https://en.wikipedia.org/wiki/Type-III_collagen"; CC --------------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution (CC BY 4.0) License CC --------------------------------------------------------------------------- DR EMBL; X14420; CAA32583.1; -; mRNA. DR EMBL; AY054301; AAL13167.1; -; Genomic_DNA. DR EMBL; AY016295; AAL13167.1; JOINED; Genomic_DNA. DR EMBL; KC567894; AGL34959.1; -; Genomic_DNA. DR EMBL; GU143397; ACZ58371.1; -; Genomic_DNA. DR EMBL; AC066694; AAY24164.1; -; Genomic_DNA. DR EMBL; CH471058; EAX10910.1; -; Genomic_DNA. DR EMBL; CH471058; EAX10911.1; -; Genomic_DNA. DR EMBL; BC028178; AAH28178.1; -; mRNA. DR EMBL; M26939; AAA52040.1; -; Genomic_DNA. DR EMBL; X07240; CAA30229.1; -; mRNA. DR EMBL; X15332; CAA33387.1; -; mRNA. DR EMBL; S62925; AAD13937.1; -; Genomic_DNA. DR EMBL; S79877; AAB35615.1; -; mRNA. DR EMBL; M59312; AAA52041.1; -; Genomic_DNA. DR EMBL; M59227; AAB59383.1; -; mRNA. DR EMBL; M55603; -; NOT_ANNOTATED_CDS; Genomic_DNA. DR EMBL; X06700; CAA29886.1; -; mRNA. DR EMBL; X01655; CAA25821.1; -; mRNA. DR EMBL; X01742; CAA25879.1; -; mRNA. DR EMBL; M13146; AAA52003.1; -; mRNA. DR EMBL; M11134; AAA52004.1; -; mRNA. DR EMBL; M10795; AAA52002.1; -; Genomic_DNA. DR EMBL; M10615; AAA52002.1; JOINED; Genomic_DNA. DR EMBL; M10793; AAA52002.1; JOINED; Genomic_DNA. DR EMBL; M10794; AAA52002.1; JOINED; Genomic_DNA. DR EMBL; M10800; AAA52002.1; JOINED; Genomic_DNA. DR EMBL; M10801; AAA52002.1; JOINED; Genomic_DNA. DR CCDS; CCDS2297.1; -. [P02461-1] DR PIR; S05272; CGHU7L. DR RefSeq; NP_000081.1; NM_000090.3. DR UniGene; Hs.443625; -. DR PDB; 2V53; X-ray; 3.20 A; B/C/D=564-584. DR PDB; 3DMW; X-ray; 2.30 A; A/B/C=1158-1199. DR PDB; 4AE2; X-ray; 1.68 A; A/B/C=1222-1466. DR PDB; 4AEJ; X-ray; 2.21 A; A/B/C=1222-1466. DR PDB; 4AK3; X-ray; 3.50 A; A=1222-1466. DR PDB; 4GYX; X-ray; 1.49 A; A/B/C=1158-1200. DR PDBsum; 2V53; -. DR PDBsum; 3DMW; -. DR PDBsum; 4AE2; -. DR PDBsum; 4AEJ; -. DR PDBsum; 4AK3; -. DR PDBsum; 4GYX; -. DR ProteinModelPortal; P02461; -. DR SMR; P02461; 1249-1466. DR BioGrid; 107678; 9. DR DIP; DIP-57177N; -. DR IntAct; P02461; 19. DR MINT; MINT-7299332; -. DR STRING; 9606.ENSP00000304408; -. DR ChEMBL; CHEMBL2364188; -. DR DrugBank; DB00048; Collagenase clostridium histolyticum. DR PhosphoSite; P02461; -. DR BioMuta; COL3A1; -. DR DMDM; 124056490; -. DR PaxDb; P02461; -. DR PRIDE; P02461; -. DR Ensembl; ENST00000304636; ENSP00000304408; ENSG00000168542. [P02461-1] DR Ensembl; ENST00000317840; ENSP00000315243; ENSG00000168542. [P02461-2] DR GeneID; 1281; -. DR KEGG; hsa:1281; -. DR UCSC; uc002uqj.1; human. [P02461-1] DR CTD; 1281; -. DR GeneCards; COL3A1; -. DR GeneReviews; COL3A1; -. DR HGNC; HGNC:2201; COL3A1. DR HPA; CAB016766; -. DR HPA; HPA007583; -. DR MIM; 100070; phenotype. DR MIM; 120180; gene. DR MIM; 130020; phenotype. DR MIM; 130050; phenotype. DR neXtProt; NX_P02461; -. DR Orphanet; 2500; Acrogeria. DR Orphanet; 286; Ehlers-Danlos syndrome, vascular type. DR Orphanet; 86; Familial abdominal aortic aneurysm. DR Orphanet; 231160; Familial cerebral saccular aneurysm. DR PharmGKB; PA26716; -. DR eggNOG; KOG3544; Eukaryota. DR eggNOG; ENOG410XNMM; LUCA. DR GeneTree; ENSGT00810000125362; -. DR HOVERGEN; HBG004933; -. DR InParanoid; P02461; -. DR KO; K06236; -. DR OMA; AEKKHVW; -. DR OrthoDB; EOG7TJ3HH; -. DR PhylomeDB; P02461; -. DR TreeFam; TF344135; -. DR Reactome; R-HSA-1442490; Collagen degradation. DR Reactome; R-HSA-1474244; Extracellular matrix organization. DR Reactome; R-HSA-1650814; Collagen biosynthesis and modifying enzymes. DR Reactome; R-HSA-186797; Signaling by PDGF. DR Reactome; R-HSA-2022090; Assembly of collagen fibrils and other multimeric structures. DR Reactome; R-HSA-216083; Integrin cell surface interactions. DR Reactome; R-HSA-3000170; Syndecan interactions. DR Reactome; R-HSA-3000171; Non-integrin membrane-ECM interactions. DR Reactome; R-HSA-3000178; ECM proteoglycans. DR Reactome; R-HSA-3000480; Scavenging by Class A Receptors. DR Reactome; R-HSA-419037; NCAM1 interactions. DR ChiTaRS; COL3A1; human. DR EvolutionaryTrace; P02461; -. DR GeneWiki; Collagen,_type_III,_alpha_1; -. DR GenomeRNAi; 1281; -. DR NextBio; 35489581; -. DR PRO; PR:P02461; -. DR Proteomes; UP000005640; Chromosome 2. DR Bgee; P02461; -. DR ExpressionAtlas; P02461; baseline and differential. DR Genevisible; P02461; HS. DR GO; GO:0005586; C:collagen type III trimer; IMP:UniProtKB. DR GO; GO:0005788; C:endoplasmic reticulum lumen; TAS:Reactome. DR GO; GO:0031012; C:extracellular matrix; IMP:UniProtKB. DR GO; GO:0005576; C:extracellular region; TAS:Reactome. DR GO; GO:0005615; C:extracellular space; IDA:UniProtKB. DR GO; GO:0005201; F:extracellular matrix structural constituent; IMP:UniProtKB. DR GO; GO:0005178; F:integrin binding; IMP:UniProtKB. DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW. DR GO; GO:0048407; F:platelet-derived growth factor binding; IDA:MGI. DR GO; GO:0007568; P:aging; IEA:Ensembl. DR GO; GO:0060414; P:aorta smooth muscle tissue morphogenesis; IEA:Ensembl. DR GO; GO:0007411; P:axon guidance; TAS:Reactome. DR GO; GO:0007160; P:cell-matrix adhesion; IDA:UniProtKB. DR GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl. DR GO; GO:0021987; P:cerebral cortex development; ISS:UniProtKB. DR GO; GO:0030574; P:collagen catabolic process; TAS:Reactome. DR GO; GO:0030199; P:collagen fibril organization; IMP:UniProtKB. DR GO; GO:0048565; P:digestive tract development; IEA:Ensembl. DR GO; GO:0043206; P:extracellular fibril organization; IMP:UniProtKB. DR GO; GO:0022617; P:extracellular matrix disassembly; TAS:Reactome. DR GO; GO:0030198; P:extracellular matrix organization; TAS:Reactome. DR GO; GO:0007507; P:heart development; IMP:UniProtKB. DR GO; GO:0007229; P:integrin-mediated signaling pathway; IMP:UniProtKB. DR GO; GO:0050777; P:negative regulation of immune response; IMP:UniProtKB. DR GO; GO:2001223; P:negative regulation of neuron migration; ISS:UniProtKB. DR GO; GO:0018149; P:peptide cross-linking; IDA:UniProtKB. DR GO; GO:0030168; P:platelet activation; NAS:UniProtKB. DR GO; GO:0035025; P:positive regulation of Rho protein signal transduction; ISS:UniProtKB. DR GO; GO:0006898; P:receptor-mediated endocytosis; TAS:Reactome. DR GO; GO:0034097; P:response to cytokine; IDA:UniProtKB. DR GO; GO:0009612; P:response to mechanical stimulus; IEA:Ensembl. DR GO; GO:0009314; P:response to radiation; IDA:UniProtKB. DR GO; GO:0001501; P:skeletal system development; IEA:Ensembl. DR GO; GO:0043588; P:skin development; IMP:UniProtKB. DR GO; GO:0007179; P:transforming growth factor beta receptor signaling pathway; IDA:UniProtKB. DR GO; GO:0042060; P:wound healing; IDA:UniProtKB. DR InterPro; IPR008160; Collagen. DR InterPro; IPR000885; Fib_collagen_C. DR InterPro; IPR001007; VWF_dom. DR Pfam; PF01410; COLFI; 1. DR Pfam; PF01391; Collagen; 8. DR Pfam; PF00093; VWC; 1. DR ProDom; PD002078; Fib_collagen_C; 1. DR SMART; SM00038; COLFI; 1. DR SMART; SM00214; VWC; 1. DR PROSITE; PS51461; NC1_FIB; 1. DR PROSITE; PS01208; VWFC_1; 1. DR PROSITE; PS50184; VWFC_2; 1. PE 1: Evidence at protein level; KW 3D-structure; Alternative splicing; Aortic aneurysm; Calcium; KW Collagen; Complete proteome; Direct protein sequencing; KW Disease mutation; Disulfide bond; Ehlers-Danlos syndrome; KW Extracellular matrix; Glycoprotein; Hydroxylation; Metal-binding; KW Polymorphism; Reference proteome; Repeat; Secreted; Signal. FT SIGNAL 1 23 FT PROPEP 24 153 N-terminal propeptide. FT /FTId=PRO_0000005740. FT CHAIN 154 1221 Collagen alpha-1(III) chain. FT /FTId=PRO_0000005741. FT PROPEP 1222 1466 C-terminal propeptide. FT /FTId=PRO_0000005742. FT DOMAIN 30 89 VWFC. {ECO:0000255|PROSITE- FT ProRule:PRU00220}. FT DOMAIN 1232 1466 Fibrillar collagen NC1. FT {ECO:0000255|PROSITE-ProRule:PRU00793}. FT REGION 149 167 Nonhelical region (N-terminal). FT REGION 168 1196 Triple-helical region. FT REGION 1197 1205 Nonhelical region (C-terminal). FT METAL 1280 1280 Calcium. FT METAL 1282 1282 Calcium. FT METAL 1283 1283 Calcium; via carbonyl oxygen. FT METAL 1285 1285 Calcium; via carbonyl oxygen. FT METAL 1288 1288 Calcium. FT MOD_RES 263 263 5-hydroxylysine; alternate. FT MOD_RES 284 284 5-hydroxylysine. FT MOD_RES 860 860 5-hydroxylysine. FT MOD_RES 977 977 5-hydroxylysine. FT MOD_RES 1094 1094 5-hydroxylysine; partial. FT MOD_RES 1106 1106 5-hydroxylysine. FT CARBOHYD 263 263 O-linked (Gal...); alternate. FT DISULFID 1196 1196 Interchain. {ECO:0000255|PROSITE- FT ProRule:PRU00793, FT ECO:0000269|PubMed:23001006}. FT DISULFID 1197 1197 Interchain. {ECO:0000255|PROSITE- FT ProRule:PRU00793, FT ECO:0000269|PubMed:23001006}. FT DISULFID 1262 1294 {ECO:0000255|PROSITE-ProRule:PRU00793, FT ECO:0000269|PubMed:23001006}. FT DISULFID 1268 1268 Interchain (with C-1285). FT {ECO:0000255|PROSITE-ProRule:PRU00793, FT ECO:0000269|PubMed:23001006}. FT DISULFID 1285 1285 Interchain (with C-1268). FT {ECO:0000255|PROSITE-ProRule:PRU00793, FT ECO:0000269|PubMed:23001006}. FT DISULFID 1302 1464 {ECO:0000255|PROSITE-ProRule:PRU00793, FT ECO:0000269|PubMed:23001006}. FT DISULFID 1372 1417 {ECO:0000255|PROSITE-ProRule:PRU00793, FT ECO:0000269|PubMed:23001006}. FT VAR_SEQ 847 1149 Missing (in isoform 2). FT {ECO:0000303|PubMed:15489334}. FT /FTId=VSP_022502. FT VARIANT 169 169 L -> F (in AAA). FT /FTId=VAR_001767. FT VARIANT 183 183 G -> C (in EDS4). FT {ECO:0000269|PubMed:9036918, FT ECO:0000269|Ref.51}. FT /FTId=VAR_001768. FT VARIANT 183 183 G -> D (in EDS4). FT /FTId=VAR_011095. FT VARIANT 183 183 G -> S (in EDS4). FT /FTId=VAR_011096. FT VARIANT 192 192 G -> V (in EDS4). FT /FTId=VAR_011097. FT VARIANT 201 201 G -> R (in EDS4). FT {ECO:0000269|PubMed:9036918, FT ECO:0000269|Ref.51}. FT /FTId=VAR_001769. FT VARIANT 204 204 G -> D (in EDS4). FT {ECO:0000269|PubMed:10923041}. FT /FTId=VAR_011098. FT VARIANT 204 204 G -> S (in EDS4). FT /FTId=VAR_011099. FT VARIANT 210 210 G -> D (in EDS4). FT {ECO:0000269|PubMed:10923041}. FT /FTId=VAR_011100. FT VARIANT 219 219 G -> C (in EDS4). FT /FTId=VAR_011101. FT VARIANT 225 225 G -> V (in EDS4). FT /FTId=VAR_011102. FT VARIANT 228 228 G -> E (in EDS4). FT {ECO:0000269|PubMed:9036918, FT ECO:0000269|Ref.51}. FT /FTId=VAR_001770. FT VARIANT 240 240 G -> R (in EDS4). FT /FTId=VAR_011103. FT VARIANT 243 243 G -> V (in EDS4). FT /FTId=VAR_011104. FT VARIANT 249 249 G -> D (in EDS4). FT /FTId=VAR_011105. FT VARIANT 249 249 G -> V (in EDS4). FT /FTId=VAR_011106. FT VARIANT 252 252 G -> D (in EDS4). FT /FTId=VAR_011107. FT VARIANT 252 252 G -> R (in EDS4). FT /FTId=VAR_011108. FT VARIANT 252 252 G -> V (in EDS4). FT /FTId=VAR_011109. FT VARIANT 255 255 G -> V (in EDS4). FT /FTId=VAR_011110. FT VARIANT 264 264 G -> R (in EDS4). FT {ECO:0000269|PubMed:10923041}. FT /FTId=VAR_011111. FT VARIANT 267 267 G -> V (in EDS4). FT /FTId=VAR_011112. FT VARIANT 297 297 G -> R (in EDS4). FT {ECO:0000269|PubMed:12694234}. FT /FTId=VAR_037007. FT VARIANT 303 303 G -> R (in fibromuscular dysplasia and FT aortic aneurysm). FT {ECO:0000269|PubMed:8514866}. FT /FTId=VAR_001771. FT VARIANT 321 321 G -> V (in EDS4). FT /FTId=VAR_011113. FT VARIANT 327 327 G -> D (in EDS4). FT {ECO:0000269|PubMed:10923041}. FT /FTId=VAR_011114. FT VARIANT 345 345 G -> R (in EDS4). FT /FTId=VAR_011115. FT VARIANT 417 417 G -> R (in EDS4). FT /FTId=VAR_011116. FT VARIANT 420 420 G -> S (in a colorectal cancer sample; FT somatic mutation). FT {ECO:0000269|PubMed:16959974}. FT /FTId=VAR_035738. FT VARIANT 444 444 G -> R (in EDS4). FT /FTId=VAR_011117. FT VARIANT 489 489 G -> E (in EDS4). FT /FTId=VAR_011118. FT VARIANT 501 501 G -> R (in EDS4). FT /FTId=VAR_011119. FT VARIANT 519 519 G -> V (in EDS4). FT /FTId=VAR_011120. FT VARIANT 534 534 G -> E (in dbSNP:rs41263744). FT /FTId=VAR_055665. FT VARIANT 540 540 G -> R (in EDS4). FT {ECO:0000269|PubMed:9036918, FT ECO:0000269|Ref.51}. FT /FTId=VAR_001772. FT VARIANT 549 549 G -> E (in EDS4). FT /FTId=VAR_011121. FT VARIANT 552 552 G -> E (in EDS4). FT /FTId=VAR_011122. FT VARIANT 567 567 G -> E (in EDS4). FT {ECO:0000269|PubMed:8884076}. FT /FTId=VAR_001773. FT VARIANT 582 582 G -> S (in EDS4). FT {ECO:0000269|PubMed:8990011}. FT /FTId=VAR_001774. FT VARIANT 588 588 G -> D (in EDS4). FT /FTId=VAR_011123. FT VARIANT 602 602 P -> T (in dbSNP:rs35795890). FT {ECO:0000269|PubMed:8255472}. FT /FTId=VAR_001775. FT VARIANT 635 635 P -> L. {ECO:0000269|PubMed:8255472}. FT /FTId=VAR_001776. FT VARIANT 636 636 G -> R (in EDS4). FT /FTId=VAR_011124. FT VARIANT 657 657 G -> E (in EDS4). FT /FTId=VAR_011125. FT VARIANT 660 660 G -> D (in EDS4). FT /FTId=VAR_011126. FT VARIANT 666 666 G -> D (in EDS4). FT {ECO:0000269|PubMed:8664902}. FT /FTId=VAR_001777. FT VARIANT 668 668 P -> T (in dbSNP:rs1801183). FT {ECO:0000269|PubMed:8514866}. FT /FTId=VAR_011127. FT VARIANT 679 679 A -> T (in dbSNP:rs41263773). FT {ECO:0000269|PubMed:18272325}. FT /FTId=VAR_055666. FT VARIANT 686 686 P -> A (in dbSNP:rs41263775). FT /FTId=VAR_055667. FT VARIANT 698 698 A -> T (in dbSNP:rs1800255). FT {ECO:0000269|PubMed:18272325, FT ECO:0000269|PubMed:2235526}. FT /FTId=VAR_001778. FT VARIANT 699 699 G -> R (in EDS4). FT /FTId=VAR_011128. FT VARIANT 726 726 G -> R (in EDS4). FT {ECO:0000269|PubMed:9452103, FT ECO:0000269|Ref.3}. FT /FTId=VAR_001779. FT VARIANT 738 738 G -> S (in EDS4). FT /FTId=VAR_011129. FT VARIANT 738 738 G -> V (in EDS4). FT /FTId=VAR_011130. FT VARIANT 744 744 G -> V (in EDS4). FT /FTId=VAR_011131. FT VARIANT 756 756 G -> E (in EDS4). FT {ECO:0000269|PubMed:7912131}. FT /FTId=VAR_001780. FT VARIANT 762 762 G -> C (in EDS4). FT {ECO:0000269|PubMed:8884076}. FT /FTId=VAR_001781. FT VARIANT 786 786 G -> R (in AAA). FT {ECO:0000269|PubMed:2243125, FT ECO:0000269|PubMed:2349939}. FT /FTId=VAR_001782. FT VARIANT 804 804 G -> S (in EDS3). FT {ECO:0000269|PubMed:7833919}. FT /FTId=VAR_001783. FT VARIANT 828 828 G -> R (in EDS4). FT {ECO:0000269|PubMed:8411057}. FT /FTId=VAR_001784. FT VARIANT 828 828 G -> W (in EDS4). FT /FTId=VAR_011132. FT VARIANT 830 838 Missing (in EDS4). FT {ECO:0000269|PubMed:1370809}. FT /FTId=VAR_037008. FT VARIANT 852 852 G -> C (in EDS4). FT /FTId=VAR_011133. FT VARIANT 879 879 G -> V (in EDS4). FT /FTId=VAR_011134. FT VARIANT 882 882 G -> D (in EDS4). FT /FTId=VAR_011135. FT VARIANT 900 900 G -> D (in EDS4). FT /FTId=VAR_011136. FT VARIANT 903 903 G -> E (in EDS4). FT /FTId=VAR_011137. FT VARIANT 909 909 G -> D (in EDS4). FT {ECO:0000269|PubMed:8680408}. FT /FTId=VAR_001785. FT VARIANT 909 909 G -> V (in EDS4). FT /FTId=VAR_011138. FT VARIANT 918 918 G -> E (in EDS4). FT /FTId=VAR_011139. FT VARIANT 924 924 G -> C (in EDS4). FT /FTId=VAR_011140. FT VARIANT 936 936 G -> R (in EDS4). FT {ECO:0000269|PubMed:9036918, FT ECO:0000269|Ref.51}. FT /FTId=VAR_001786. FT VARIANT 936 936 G -> S (in EDS4). FT /FTId=VAR_001787. FT VARIANT 939 939 G -> D (in EDS4). FT {ECO:0000269|PubMed:8680408}. FT /FTId=VAR_001788. FT VARIANT 942 942 G -> E (in EDS4). FT /FTId=VAR_011141. FT VARIANT 957 957 G -> S (in EDS4; severe variant). FT {ECO:0000269|PubMed:2492273}. FT /FTId=VAR_001789. FT VARIANT 960 960 G -> V (in EDS4; severe variant). FT {ECO:0000269|PubMed:7749417}. FT /FTId=VAR_001790. FT VARIANT 966 966 G -> V (in EDS4). FT /FTId=VAR_011142. FT VARIANT 972 972 G -> A (in EDS4). FT /FTId=VAR_011143. FT VARIANT 984 984 G -> T (in EDS4; requires 2 nucleotide FT substitutions). FT /FTId=VAR_011144. FT VARIANT 996 996 G -> E (in EDS4). {ECO:0000269|Ref.51}. FT /FTId=VAR_001791. FT VARIANT 999 999 G -> R (in EDS4). FT /FTId=VAR_011145. FT VARIANT 1011 1011 G -> E (in EDS4). FT /FTId=VAR_011146. FT VARIANT 1014 1014 G -> E (in EDS4). FT {ECO:0000269|PubMed:1352273}. FT /FTId=VAR_001792. FT VARIANT 1032 1032 G -> V (in EDS4). FT /FTId=VAR_011147. FT VARIANT 1035 1035 G -> C (in EDS4). FT /FTId=VAR_011148. FT VARIANT 1044 1044 G -> D (in EDS4). FT {ECO:0000269|PubMed:11168790}. FT /FTId=VAR_011149. FT VARIANT 1050 1050 G -> D (in EDS4; mild variant). FT {ECO:0000269|PubMed:2808425}. FT /FTId=VAR_001793. FT VARIANT 1050 1050 G -> V (in EDS4). FT {ECO:0000269|PubMed:12786757}. FT /FTId=VAR_011150. FT VARIANT 1071 1071 G -> V (in EDS4). {ECO:0000269|Ref.51}. FT /FTId=VAR_001794. FT VARIANT 1077 1077 G -> V (in EDS4). FT {ECO:0000269|PubMed:1895316}. FT /FTId=VAR_001795. FT VARIANT 1089 1089 G -> D (in EDS4). FT /FTId=VAR_011151. FT VARIANT 1098 1098 G -> D (in EDS4). FT {ECO:0000269|PubMed:10923041}. FT /FTId=VAR_011152. FT VARIANT 1098 1098 G -> V (in EDS4). FT /FTId=VAR_011153. FT VARIANT 1101 1101 G -> E (in EDS4). FT {ECO:0000269|PubMed:9147870}. FT /FTId=VAR_001796. FT VARIANT 1104 1104 G -> A (in EDS4). {ECO:0000269|Ref.51}. FT /FTId=VAR_001797. FT VARIANT 1161 1161 G -> V (in EDS4). FT /FTId=VAR_011154. FT VARIANT 1164 1164 G -> E (in EDS4). FT /FTId=VAR_011155. FT VARIANT 1164 1164 G -> R (in EDS4). FT /FTId=VAR_011156. FT VARIANT 1164 1164 G -> S (in spondyloepiphyseal dysplasia). FT /FTId=VAR_001798. FT VARIANT 1167 1167 G -> V (in EDS4). {ECO:0000269|Ref.46}. FT /FTId=VAR_001799. FT VARIANT 1170 1170 G -> D (in EDS4). FT {ECO:0000269|PubMed:8884076, FT ECO:0000269|Ref.46}. FT /FTId=VAR_001800. FT VARIANT 1170 1170 G -> V (in EDS4). FT /FTId=VAR_011157. FT VARIANT 1173 1173 G -> E (in EDS4). FT {ECO:0000269|PubMed:10923041, FT ECO:0000269|PubMed:1357232, FT ECO:0000269|Ref.46}. FT /FTId=VAR_001801. FT VARIANT 1173 1173 G -> R (in EDS4; Gottron type acrogeria). FT {ECO:0000269|PubMed:10819545}. FT /FTId=VAR_011158. FT VARIANT 1176 1176 G -> V (in EDS4; severe). FT {ECO:0000269|PubMed:8019562}. FT /FTId=VAR_001802. FT VARIANT 1179 1179 G -> R (in EDS4). FT /FTId=VAR_011159. FT VARIANT 1182 1182 G -> E (in EDS4). {ECO:0000269|Ref.51}. FT /FTId=VAR_001803. FT VARIANT 1185 1185 G -> D (in EDS4; severe variant). FT {ECO:0000269|PubMed:1496983}. FT /FTId=VAR_001804. FT VARIANT 1185 1185 G -> V (in EDS4). {ECO:0000269|Ref.51}. FT /FTId=VAR_001805. FT VARIANT 1188 1188 G -> E (in EDS4; severe variant). FT {ECO:0000269|PubMed:8098182, FT ECO:0000269|Ref.51}. FT /FTId=VAR_001806. FT VARIANT 1188 1188 G -> R (in EDS4). {ECO:0000269|Ref.51}. FT /FTId=VAR_001807. FT VARIANT 1205 1205 I -> V (in dbSNP:rs2271683). FT {ECO:0000269|PubMed:18272325}. FT /FTId=VAR_020012. FT VARIANT 1353 1353 H -> Q (in dbSNP:rs1516446). FT {ECO:0000269|PubMed:11566270, FT ECO:0000269|PubMed:15489334, FT ECO:0000269|PubMed:18272325, FT ECO:0000269|PubMed:2579949, FT ECO:0000269|PubMed:2764886, FT ECO:0000269|PubMed:3357782, FT ECO:0000269|Ref.6}. FT /FTId=VAR_030115. FT VARIANT 1434 1434 R -> C (in a colorectal cancer sample; FT somatic mutation). FT {ECO:0000269|PubMed:16959974}. FT /FTId=VAR_035739. FT CONFLICT 163 163 G -> GG (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 168 168 G -> V (in Ref. 11; AA sequence). FT {ECO:0000305}. FT CONFLICT 226 228 Missing (in Ref. 11; AA sequence). FT {ECO:0000305}. FT CONFLICT 241 241 E -> D (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 278 278 T -> A (in Ref. 11; AA sequence). FT {ECO:0000305}. FT CONFLICT 293 295 NGA -> DGS (in Ref. 11; AA sequence). FT {ECO:0000305}. FT CONFLICT 401 401 A -> L (in Ref. 15; AA sequence). FT {ECO:0000305}. FT CONFLICT 409 409 L -> P (in Ref. 15; AA sequence). FT {ECO:0000305}. FT CONFLICT 472 472 E -> D (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 488 490 PGF -> LGS (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 589 589 A -> E (in Ref. 15; AA sequence). FT {ECO:0000305}. FT CONFLICT 614 614 T -> Y (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 635 635 P -> R (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 664 664 D -> E (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 676 676 D -> N (in Ref. 16; AA sequence). FT {ECO:0000305}. FT CONFLICT 803 803 T -> P (in Ref. 15; AA sequence). FT {ECO:0000305}. FT CONFLICT 896 896 S -> A (in Ref. 18; AA sequence). FT {ECO:0000305}. FT CONFLICT 980 980 S -> A (in Ref. 22; AA sequence). FT {ECO:0000305}. FT CONFLICT 985 989 ANGLS -> PSGQN (in Ref. 22; AA sequence). FT {ECO:0000305}. FT CONFLICT 1019 1019 D -> Y (in Ref. 20; CAA29886). FT {ECO:0000305}. FT CONFLICT 1067 1067 S -> P (in Ref. 15; AA sequence). FT {ECO:0000305}. FT CONFLICT 1070 1070 A -> P (in Ref. 15; AA sequence). FT {ECO:0000305}. FT CONFLICT 1097 1097 T -> P (in Ref. 22; AA sequence). FT {ECO:0000305}. FT CONFLICT 1153 1154 TS -> AT (in Ref. 22; AA sequence). FT {ECO:0000305}. FT CONFLICT 1156 1156 H -> S (in Ref. 22; AA sequence). FT {ECO:0000305}. FT CONFLICT 1184 1184 P -> S (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 1203 1203 A -> P (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 1210 1210 G -> A (in Ref. 10; CAA33387). FT {ECO:0000305}. FT CONFLICT 1222 1222 D -> P (in Ref. 26; AAA52002). FT {ECO:0000305}. FT CONFLICT 1235 1235 M -> I (in Ref. 26; AAA52002). FT {ECO:0000305}. FT CONFLICT 1241 1241 V -> A (in Ref. 20; CAA29886 and 23; FT CAA25879). {ECO:0000305}. FT CONFLICT 1274 1274 S -> T (in Ref. 26; AAA52002). FT {ECO:0000305}. FT CONFLICT 1332 1332 M -> I (in Ref. 26; AAA52002). FT {ECO:0000305}. FT CONFLICT 1357 1357 L -> P (in Ref. 26; AAA52002). FT {ECO:0000305}. FT HELIX 1236 1249 {ECO:0000244|PDB:4AK3}. FT STRAND 1253 1257 {ECO:0000244|PDB:4AE2}. FT HELIX 1262 1268 {ECO:0000244|PDB:4AE2}. FT STRAND 1274 1279 {ECO:0000244|PDB:4AE2}. FT HELIX 1286 1288 {ECO:0000244|PDB:4AE2}. FT STRAND 1290 1295 {ECO:0000244|PDB:4AE2}. FT TURN 1296 1299 {ECO:0000244|PDB:4AE2}. FT STRAND 1300 1303 {ECO:0000244|PDB:4AE2}. FT STRAND 1305 1307 {ECO:0000244|PDB:4AE2}. FT STRAND 1309 1313 {ECO:0000244|PDB:4AE2}. FT STRAND 1320 1322 {ECO:0000244|PDB:4AK3}. FT HELIX 1328 1331 {ECO:0000244|PDB:4AE2}. FT STRAND 1339 1341 {ECO:0000244|PDB:4AEJ}. FT STRAND 1343 1345 {ECO:0000244|PDB:4AK3}. FT HELIX 1347 1360 {ECO:0000244|PDB:4AE2}. FT STRAND 1364 1374 {ECO:0000244|PDB:4AE2}. FT TURN 1381 1384 {ECO:0000244|PDB:4AE2}. FT STRAND 1391 1393 {ECO:0000244|PDB:4AE2}. FT STRAND 1395 1397 {ECO:0000244|PDB:4AE2}. FT STRAND 1399 1404 {ECO:0000244|PDB:4AE2}. FT HELIX 1406 1408 {ECO:0000244|PDB:4AK3}. FT STRAND 1411 1415 {ECO:0000244|PDB:4AE2}. FT STRAND 1422 1434 {ECO:0000244|PDB:4AE2}. FT HELIX 1436 1438 {ECO:0000244|PDB:4AE2}. FT STRAND 1443 1445 {ECO:0000244|PDB:4AE2}. FT STRAND 1455 1465 {ECO:0000244|PDB:4AE2}. SQ SEQUENCE 1466 AA; 138564 MW; B904B4E05E17D339 CRC64; MMSFVQKGSW LLLALLHPTI ILAQQEAVEG GCSHLGQSYA DRDVWKPEPC QICVCDSGSV LCDDIICDDQ ELDCPNPEIP FGECCAVCPQ PPTAPTRPPN GQGPQGPKGD PGPPGIPGRN GDPGIPGQPG SPGSPGPPGI CESCPTGPQN YSPQYDSYDV KSGVAVGGLA GYPGPAGPPG PPGPPGTSGH PGSPGSPGYQ GPPGEPGQAG PSGPPGPPGA IGPSGPAGKD GESGRPGRPG ERGLPGPPGI KGPAGIPGFP GMKGHRGFDG RNGEKGETGA PGLKGENGLP GENGAPGPMG PRGAPGERGR PGLPGAAGAR GNDGARGSDG QPGPPGPPGT AGFPGSPGAK GEVGPAGSPG SNGAPGQRGE PGPQGHAGAQ GPPGPPGING SPGGKGEMGP AGIPGAPGLM GARGPPGPAG ANGAPGLRGG AGEPGKNGAK GEPGPRGERG EAGIPGVPGA KGEDGKDGSP GEPGANGLPG AAGERGAPGF RGPAGPNGIP GEKGPAGERG APGPAGPRGA AGEPGRDGVP GGPGMRGMPG SPGGPGSDGK PGPPGSQGES GRPGPPGPSG PRGQPGVMGF PGPKGNDGAP GKNGERGGPG GPGPQGPPGK NGETGPQGPP GPTGPGGDKG DTGPPGPQGL QGLPGTGGPP GENGKPGEPG PKGDAGAPGA PGGKGDAGAP GERGPPGLAG APGLRGGAGP PGPEGGKGAA GPPGPPGAAG TPGLQGMPGE RGGLGSPGPK GDKGEPGGPG ADGVPGKDGP RGPTGPIGPP GPAGQPGDKG EGGAPGLPGI AGPRGSPGER GETGPPGPAG FPGAPGQNGE PGGKGERGAP GEKGEGGPPG VAGPPGGSGP AGPPGPQGVK GERGSPGGPG AAGFPGARGL PGPPGSNGNP GPPGPSGSPG KDGPPGPAGN TGAPGSPGVS GPKGDAGQPG EKGSPGAQGP PGAPGPLGIA GITGARGLAG PPGMPGPRGS PGPQGVKGES GKPGANGLSG ERGPPGPQGL PGLAGTAGEP GRDGNPGSDG LPGRDGSPGG KGDRGENGSP GAPGAPGHPG PPGPVGPAGK SGDRGESGPA GPAGAPGPAG SRGAPGPQGP RGDKGETGER GAAGIKGHRG FPGNPGAPGS PGPAGQQGAI GSPGPAGPRG PVGPSGPPGK DGTSGHPGPI GPPGPRGNRG ERGSEGSPGH PGQPGPPGPP GAPGPCCGGV GAAAIAGIGG EKAGGFAPYY GDEPMDFKIN TDEIMTSLKS VNGQIESLIS PDGSRKNPAR NCRDLKFCHP ELKSGEYWVD PNQGCKLDAI KVFCNMETGE TCISANPLNV PRKHWWTDSS AEKKHVWFGE SMDGGFQFSY GNPELPEDVL DVHLAFLRLL SSRASQNITY HCKNSIAYMD QASGNVKKAL KLMGSNEGEF KAEGNSKFTY TVLEDGCTKH TGEWSKTVFE YRTRKAVRLP IVDIAPYDIG GPDQEFGVDV GPVCFL //