Protein assembler
10428137 ยท 2019-10-01
Assignee
Inventors
Cpc classification
C12N15/1031
CHEMISTRY; METALLURGY
C07K14/78
CHEMISTRY; METALLURGY
C12N15/66
CHEMISTRY; METALLURGY
International classification
C07K14/78
CHEMISTRY; METALLURGY
A61K47/64
HUMAN NECESSITIES
C12N15/10
CHEMISTRY; METALLURGY
Abstract
The present invention relates to a method for assembling (monomeric or oligomeric) proteins and peptide structures to multimeric protein or peptide structures. The present invention also provides a method for preparing peptide based polymers by crosslinking such multimeric proteins or peptides obtainable according to the inventive method and their use as polymers, for amphiphilic applications, as protein based detergents, for forming artificial organelles, etc. Disclosed are furthermore novel protein or peptide structures, nucleic acids encoding same and cloning and expression vectors suitable for carrying out the inventive method for assembling multimeric proteins or peptides. The novel method for assembling proteins and peptide structures may be furthermore be used as a novel scalable peptide generator technique, which are also described, by preparing first a multimeric protein or peptide structure using the inventive method and then specifically degrading the multimeric protein or peptide structure into its monomeric or smaller units.
Claims
1. Method for assembling multimeric protein or peptide structures, the method comprising the following steps: a) Providing or preparing a circular expression vector having a first unique restriction site for a first type II S restriction enzyme and a second unique restriction site for a second type II S restriction enzyme, the first type II S restriction enzyme suitable to generate a 3 base pair 5-overhang in the lower strand of the vector upon cleavage with the first type II S restriction enzyme, and the second type II S restriction enzyme suitable to generate a 3 base pair 5-overhang in the upper strand of the vector upon cleavage with the second type II S restriction enzyme, wherein the 3 base pair 5-overhang in the upper strand and the 3 base pair 5-overhang in the lower strand of the resulting insert and of the vector are complementary to each other; the circular expression vector also having a third unique restriction site for a third restriction enzyme, located upstream to or overlapping with the recognition sequence and/or restriction site of the first unique restriction enzyme, the third unique restriction site being different to the first and the second unique restriction site, wherein upon cleavage with the third restriction enzyme an at least 3 or 4 base pair 5-overhang or an at least 3 or 4 base pair 3-overhang is generated in the upper or lower strand of the vector different from the 5-overhang generated by the first type II S restriction enzyme and the second type II S restriction enzyme; b) Digesting the circular expression vector with the first type II S restriction enzyme and the second type II S restriction enzyme to create a 3 base pair 5-overhang in the lower strand and a 3 base pair 5-overhang in the upper strand of the vector, wherein the 3 base pair 5-overhang in the lower strand and the 3 base pair 5-overhang in the upper strand are complementary to each other and complementary to the 3 base pair 5-overhang in the upper strand of the resulting insert and the 3 base pair 5-overhang in the lower strand of the insert; c) Inserting into the digested expression vector a multiple number of the same or different double stranded nucleic acid sequences encoding a protein or peptide sequence and exhibiting a complementary 3 base pair 5-overhang in the upper strand and a complementary 3 base pair 5-overhang in the lower strand; d) Circular ligation of the multiple number of double stranded nucleic acid sequences into the expression vector using a ligase; e) Optionally propagating the circular expression vector obtained according to step d); f) Optionally sequencing the circular expression vector obtained according to step d) or e); g) Selection of circular expression vectors obtained according to step d), e) and/or f), comprising a predetermined repeat number of double stranded nucleic acid sequences of at least 2 to 15; h) Digesting the circular expression vector selected according to step g) with the first Type II S restriction enzyme; i) Inserting into the digested expression vector a further double stranded nucleic acid sequence encoding a protein or peptide sequence and exhibiting a complementary 3 base pair 5-overhang in the upper strand and a complementary 3 base pair 5-overhang in the lower strand, wherein the further double stranded nucleic acid sequence may be identical or different to the double stranded nucleic acid sequence of step c); j) Circular ligation of the further double stranded nucleic acid sequence of step i) into the expression vector using a ligase; k) Repeating steps h) to j) at least once; l) Digesting the circular expression vector with the first type II S restriction enzyme and then subsequently with a type II or type II S restriction enzyme recognizing the third unique restriction site and cutting within the first unique type II S recognition sequence, inserting an insert into said digested expression vector, wherein the insert has been prepared by digesting the circular expression vector with the third unique restriction enzyme first and then subsequently with the second unique type II S restriction enzyme and then ligating the digested expression vector and the insert; m) Optionally expressing, isolating and/or purifying the encoded multimeric protein or peptide structure.
2. Method according to claim 1, wherein the double stranded nucleic acid sequence is selected from n) a nucleic acid sequence encoding an ELP-like protein sequence according to the following general formula (I):
[(VZ.sub.1PZ.sub.2GX.sub.1G).sub.n(VZ.sub.3PZ.sub.4GX.sub.2G).sub.m].sub.p(SEQ ID NO: 95) wherein: V is Val P is Pro; G is Gly; X.sub.1 is Glu, Asp, Arg, Val, Lys, His, Ser, Thr, Asn or Gln, or is a mixture thereof; X.sub.2 is Tyr, Cys, Phe, Ile, Leu, Met, or Trp, or is a mixture thereof; Z.sub.1-4 is present or is not present and is independently from each other selected from Ala or Gly; n+m1 n is an integer selected from 1 to 300; m is an integer selected from 1 to 300; p is an integer selected from 1 to 300; o) a nucleic acid sequence encoding a resilintype ELP-like protein sequence according to the following general formula (Ia):
[(VPGVG).sub.n(VPGX.sub.2G).sub.m].sub.p(SEQ ID NO: 96) wherein: V is Val P is Pro; G is Gly; X.sub.2 is Tyr, Cys or Trp; n+m1 n is an integer selected from 1 to 300; m is an integer selected from 1 to 300; p is an integer selected from 1 to 300; and/or p) a nucleic acid sequence encoding an amphiphilic ELP-like protein sequence according to the following general formula (Ib):
[(VZ.sub.1PZ.sub.2GX.sub.1G).sub.n(VZ.sub.3PZ.sub.4GX.sub.1G).sub.m].sub.p(SEQ ID NO: 129) or according to following general formula (Ic):
[(VPGX.sub.1G).sub.n(VPGX.sub.2G).sub.m].sub.p(SEQ ID NO: 113) or according to following general formula (Id):
[Yq(VPGX.sub.1G).sub.nY.sub.q(VPGX.sub.2G).sub.mYq].sub.p(SEQ ID NO:114) wherein (in Ib or Ic or Id): V is Val P is Pro; G is Gly; X.sub.1 is Glu, Asp, Arg, Val, Lys, His, Ser, Thr, Asn or Gln or is a mixture thereof; X.sub.2 is Tyr, Cys, Phe, Ile, Leu, Met; or Trp or is a mixture thereof, Y is GFP (green fluorescent protein), mEGFP (monomeric enhanced GFP), EYFP (enhanced yellow FP), ECFP (C=cyan) or a derivative thereof; Z.sub.1-4 is present or is not present and is independently from each other selected from Ala or Gly; n+m1 n is an integer selected from 1 to 300; m is an integer selected from 1 to 300; p is an integer selected from 1 to 300; q is an integer of 0 and 1.
3. Method according to claim 1, wherein the double stranded nucleic acid sequence is additionally selected from n) a nucleic acid sequence consisting of a nucleic acid sequence selected from any of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 90 or 91, or a nucleic acid comprising an identity of at least about 95% to one of the afore mentioned nucleic acid sequences; or o) a nucleic acid sequence consisting of a nucleic acid sequence encoding a protein sequence according to any of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 to 77 or 80 to 89, or a protein comprising an identity of at least about 95% to one of the afore mentioned protein sequences.
4. Method according to claim 1, wherein the double stranded nucleic acid sequence encodes 1 protein or a consecutive number of 2 to 100 proteins, and wherein the double stranded nucleic acid sequence is selected from n) a nucleic acid sequence encoding an ELP-like protein sequence according to the following general formula (I):
[(VZ.sub.1PZ.sub.2GX.sub.1G).sub.n(VZ.sub.3PZ.sub.4GX.sub.2G).sub.m].sub.p(SEQ ID NO: 95) wherein: V is Val P is Pro; G is Gly; X.sub.1 is Glu, Asp, Arg, Val, Lys, His, Ser, Thr, Asn or Gln, or is a mixture thereof; X.sub.2 is Tyr, Cys, Phe, Ile, Leu, Met, or Trp, or is a mixture thereof; Z.sub.1-4 is present or is not present and is independently from each other selected from Ala or Gly; n+m1 n is an integer selected from 1 to 300; m is an integer selected from 1 to 300; p is an integer selected from 1 to 300; o) a nucleic acid sequence encoding a resilintype ELP-like protein sequence according to the following general formula (Ia):
[(VPGVG).sub.n(VPGX.sub.2G).sub.m].sub.p(SEQ ID NO: 96) wherein: V is Val P is Pro; G is Gly; X.sub.2 is Tyr, Cys or Trp; n+m1 n is an integer selected from 1 to 300; m is an integer selected from 1 to 300; p is an integer selected from 1 to 300; p) a nucleic acid sequence encoding an amphiphilic ELP-like protein sequence according to the following general formula (Ib):
[(VZ.sub.1PZ.sub.2GX.sub.1G).sub.n(VZ.sub.3PZ.sub.4GX.sub.2G).sub.m].sub.p(SEQ ID NO: 129) or according to following general formula (Ic):
[(VPGX.sub.1G).sub.n(VPGX.sub.2G).sub.m].sub.p(SEQ ID NO: 113) or according to following general formula (Id):
[Yq(VPGX.sub.1G).sub.nY.sub.q(VPGX.sub.2G).sub.mYq].sub.1(SEQ ID NO: 114) wherein (in Ib or Ic or Id): V is Val P is Pro; G is Gly; X.sub.1 is Glu, Asp, Arg, Val, Lys, His, Ser, Thr, Asn or Gln or is a mixture thereof; X.sub.2 is Tyr, Cys, Phe, Ile, Leu, Met; or Trp or is a mixture thereof, Y is GFP (green fluorescent protein), mEGFP (monomeric enhanced GFP), EYFP (enhanced yellow FP), ECFP (C=cyan) or a derivative thereof; Z.sub.1-4 is present or is not present and is independently from each other selected from Ala or Gly; n+m1 n is an integer selected from 1 to 300; m is an integer selected from 1 to 300; p is an integer selected from 1 to 300; q is an integer of 0 and 1; q) a nucleic acid sequence consisting of a nucleic acid sequence selected from any of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 90 or 91, or a nucleic acid comprising an identity of at least about 95% to one of the afore mentioned nucleic acid sequences; or r) a nucleic acid sequence consisting of a nucleic acid sequence encoding a protein sequence according to any of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 to 77 or 80 to 89, or a protein comprising an identity of at least about 95% to one of the afore mentioned protein sequences.
5. Method according to claim 1, wherein in step i) a double stranded nucleic acid is provided encoding a protein, and wherein the double stranded nucleic acid sequence is selected from n) a nucleic acid sequence encoding an ELP-like protein sequence according to the following general formula (I):
[(VZ.sub.1PZ.sub.2GX.sub.1G).sub.n(VZ.sub.3PZ.sub.4GX.sub.2G).sub.m].sub.p(SEQ ID NO: 95) wherein: V is Val P is Pro; G is Gly; X.sub.1 is Glu, Asp, Arg, Val, Lys, His, Ser, Thr, Asn or Gln, or is a mixture thereof; X.sub.2 is Tyr, Cys, Phe, Ile, Leu, Met, or Trp, or is a mixture thereof; Z.sub.1-4 is present or is not present and is independently from each other selected from Ala or Gly; n+m1 n is an integer selected from 1 to 300; m is an integer selected from 1 to 300; p is an integer selected from 1 to 300; o) a nucleic acid sequence encoding a resilintype ELP-like protein sequence according to the following general formula (Ia):
[(VPGVG).sub.n(VPGX.sub.2G).sub.m].sub.p(SEQ ID NO: 96) wherein: V is Val P is Pro; G is Gly; X.sub.2 is Tyr, Cys or Trp; n+m1 n is an integer selected from 1 to 300; m is an integer selected from 1 to 300; p is an integer selected from 1 to 300; p) a nucleic acid sequence encoding an amphiphilic ELP-like protein sequence according to the following general formula (Ib):
[(VZ.sub.1PZ.sub.2GX.sub.1G).sub.n(VZ.sub.3PZ.sub.4GX.sub.1G).sub.m].sub.p(SEQ ID NO: 129) or according to following general formula (Ic):
[(VPGX.sub.1G).sub.n(VPGX.sub.2G).sub.m].sub.p(SEQ ID NO: 113) or according to following general formula (Id):
[Yq(VPGX.sub.1G).sub.nY.sub.q(VPGX.sub.2G).sub.mYq].sub.p(SEQ ID NO: 114) wherein (in Ib or Ic or Id): V is Val P is Pro; G is Gly; X.sub.1 is Glu, Asp, Arg, Val, Lys, His, Ser, Thr, Asn or Gln or is a mixture thereof; X.sub.2 is Tyr, Cys, Phe, Ile, Leu, Met; or Trp or is a mixture thereof, Y is GFP (green fluorescent protein), mEGFP (monomeric enhanced GFP), EYFP (enhanced yellow FP), ECFP (C=cyan) or a derivative thereof; Z.sub.1-4 is present or is not present and is independently from each other selected from Ala or Gly; n+m1 n is an integer selected from 1 to 300; m is an integer selected from 1 to 300; p is an integer selected from 1 to 300; q is an integer of 0 and 1; q) a nucleic acid sequence consisting of a nucleic acid sequence selected from any of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 90 or 91, or a nucleic acid comprising an identity of at least about 95% to one of the afore mentioned nucleic acid sequences; or r) a nucleic acid sequence consisting of a nucleic acid sequence encoding a protein sequence according to any of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42 to 77 or 80 to 89, or a protein comprising an identity of at least about 95% to one of the afore mentioned protein sequences; in a subsequently following repetition of steps h) to j) a double stranded nucleic acid is provided encoding a cleavable protein sequence selected from a chemical cleavage site or a protease cleavage site or a hydroxyl amine cleavage site at one of its terminal ends, or alternatively in step i) a double stranded nucleic acid is provided encoding a protein, the protein being selected from one of n), o), p), q), or r), and additionally a protease cleavage site or a hydroxyl amine cleavage site at one of its terminal ends.
6. Method according to claim 5, wherein the cleavable protein sequences of a multimeric protein or peptide structure expressed isolated and/or purified according to final step 1) are cleaved to obtain the monomeric proteins or peptides.
7. Multimeric protein or peptide structure consisting of a ELP-like protein sequence according to any of the following formulae (I), (Ia), (Ib), (Ic) or (Id), as defined in claim 2.
8. Multimeric protein or peptide structure according to claim 7, wherein the ELP-like protein sequence consists of a resilintype ELP-like protein sequence according to any of the following formulae:
[(VPGVG).sub.1-25(VPGYG).sub.1-100].sub.3-300(SEQ ID NO: 97)
[(VPGVG).sub.1-10(VPGYG).sub.1-5].sub.3-200(SEQ ID NO: 98)
[(VPGVG).sub.1-10(VPGYG).sub.1].sub.3-200(SEQ ID NO: 99)
[(VPGVG).sub.1-5(VPGYG).sub.1].sub.3-200(SEQ ID NO: 100)
[(VPGVG).sub.2(VPGYG).sub.1].sub.3-200(SEQ ID NO: 101)
[(VPGVG).sub.2(VPGYG).sub.1].sub.10-100(SEQ ID NO: 102)
[(VPGVG).sub.2(VPGYG).sub.1].sub.10-50(SEQ ID NO: 103)
[(VPGVG).sub.2(VPGYG).sub.1].sub.15(SEQ ID NO: 104)
[(VPGVG).sub.1-25(VPGCG).sub.1-100].sub.3-300(SEQ ID NO: 105)
[(VPGVG).sub.1-10(VPGCG).sub.1-5].sub.3-200(SEQ ID NO: 106)
[(VPGVG).sub.1-10(VPGCG).sub.1].sub.3-200(SEQ ID NO: 107)
[(VPGVG).sub.1-5(VPGCG).sub.1].sub.3-200(SEQ ID NO: 108)
[(VPGVG).sub.2(VPGCG).sub.1].sub.3-200(SEQ ID NO: 109)
[(VPGVG).sub.2(VPGCG).sub.1].sub.10-100(SEQ ID NO: 110)
[(VPGVG).sub.2(VPGCG).sub.1].sub.10-50(SEQ ID NO: 111), or
[(VPGVG).sub.2(VPGCG).sub.1].sub.15(SEQ ID NO: 112).
9. Multimeric protein or peptide structure according to claim 7, wherein the ELP-like protein sequence consists of an amphiphilic ELP-like protein sequence according to one of the following formulae:
[(VPGX.sub.1G).sub.1-300(VPGX.sub.2G).sub.1-300].sub.1-300(SEQ ID NO: 115)
[(VPGX.sub.1G).sub.1-200(VPGX.sub.2G).sub.1-200].sub.1-200(SEQ ID NO: 116)
[(VPGX.sub.1G).sub.1-100(VPGX.sub.2G).sub.1-100].sub.1-100(SEQ ID NO: 117)
[(VPGX.sub.1G).sub.2-100(VPGX.sub.2G).sub.2-100].sub.1-50(SEQ ID NO: 118)
[(VPGX.sub.1G).sub.5-100(VPGX.sub.2G).sub.5-100].sub.1(SEQ ID NO: 119), or
[(VPGX.sub.1G).sub.10-50(VPGX.sub.2G).sub.10-50].sub.1(SEQ ID NO: 120) wherein: V is Val P is Pro; G is Gly; X.sub.1 is Glu, Asp, Arg, (Lys, Asn, Gln, Ser, Thr, His) or is a mixture thereof; X.sub.2 is Phe, Ile, Leu, (Tyr, Trp, Meth, Val) or is a mixture thereof.
10. Multimeric protein or peptide structure according to claim 7, wherein the ELP-like protein sequence consists of an amphiphilic ELP-like protein sequence according to one of the following formulae:
[Y.sub.q(VPGX.sub.1G).sub.1-300Y.sub.q(VPGX.sub.2G).sub.1-300Y.sub.q].sub.1-300(SEQ ID NO: 121)
[Y.sub.q(VPGX.sub.1G).sub.1-200Y.sub.q(VPGX.sub.2G).sub.1-200Y.sub.q].sub.1-200(SEQ ID NO: 122)
[Y.sub.q(VPGX.sub.1G).sub.1-100Y.sub.q(VPGX.sub.2G).sub.1-100Y.sub.q].sub.1-100(SEQ ID NO: 123)
[Y.sub.q(VPGX.sub.1G).sub.2-100Y.sub.q(VPGX.sub.2G).sub.2-100Y.sub.q].sub.1-50(SEQ ID NO: 124)
[Y.sub.q(VPGX.sub.1G).sub.5-100Y.sub.q(VPGX.sub.2G).sub.5-100Y.sub.q].sub.1(SEQ ID NO: 125),
[Y.sub.q(VPGX.sub.1G).sub.10-50Y.sub.q(VPGX.sub.2G).sub.10-50Y.sub.q].sub.1(SEQ ID NO: 126) wherein: V is Val P is Pro; G is Gly; X.sub.1 is Glu, Asp, Arg, (Lys, Asn, Gln, Ser, Thr, His) or is a mixture thereof; X.sub.2 is Phe, Ile, Leu, (Tyr, Trp, Meth, Val) or is a mixture thereof, Y is GFP, EGFP, mEGFP, EYFP, ECFP or any derivative or mixture thereof of them; and q is an integer of 0 or 1.
Description
FIGURES
(1) The figures shown in the following are merely illustrative and shall describe the present invention in a further way. These figures shall not be construed to limit the present invention thereto.
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
EXAMPLES
(23) The examples shown in the following are merely illustrative and shall describe the present invention in a further way. These examples shall not be construed to limit the present invention thereto.
Example 01: De Novo Synthesis of Elastin-Like Protein (ELP) Encoding Nucleic Acid Fragments
(24) De novo chemical synthesized oligonucleotides of the respective nucleic acid templates for ELP-monomers (for V templateupper strand: 5-GGTGTTCCGGGTGTT-3 (SEQ ID NO: 90); lower strand: 5-ACCAACACCCGGAAC-3 (SEQ ID NO: 91)) were phosphorylated, annealed and ligated into the pET28-NMCysL vector which was previously prepared by EarI digest followed by a dephosphorylation reaction (compare overview
(25) The resulting pET28-NMCysL-V20n-His (n=1-5) vectors were transformed into E. coli BLR cells. E. coli were grown over night at 37 C. in LB-medium while shaking at 200 rpm. After IPTG induction (with f.c. 1 M IPTG) the cells were grown for another 20 h at 20 C. Protein extraction, His-Tag purification and SDS-PAG were performed as described in Sambrook et al.: Molecular cloning: A Laboratory Manual; 3.sup.rd Ed. Cold Spring Harbour Laboratory Press; Cold Spring Harbour (2001). The expressed elastin-like protein multimers were shown in
(26) To supplement homopolymeric ELP-molecules (e.g. V20, R20, F20 and F40) with a functional group (e.g. amino acid cysteine which can built disulfide bridges or could be used for oxidative coupling of the protein on gold-surfaces) the respective pIB-ELPQ2-Vn/Rn/Fn cloning vectors (n=20 or 40) were used as providers (EarI digestion) for inserts to clone them into pIB-ELPQ2-C1 (Bsp. an ELP-C1 monomer=nucleic acid template coding for (VPGCG)1-pentamer)(SEQ ID NO: 127) vector. Inserts were implemented upstream to the C1 sequence motif (see
Example 02: De Novo Synthesis of Very Large Recombinant Resilin and Spider Silk Encoding Nucleic Acid Fragments
(27) De novo chemical synthesized oligonucleotides of the respective nucleic acid templates for recombinant resilin rec1-monomers (=res1):
(28) TABLE-US-00005 (upperstrand: (SEQIDNO:69) 5 GGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGG TAAT3; lowerstrand: (SEQIDNO:70) 5 ACCATTACCACCACCAGGAGCACCGTAAGAATCAGAAGGTC GACC3
were phosphorylated, annealed and ligated into the pET28-MCL vector which was previously prepared by EarI digestion followed by a dephosphorylation reaction. This vector is a precursor construct of the pET28-NMCysL vector which possesses the same constellation of the 3 unique type II (S) restriction sites in the linker region but differs in the absence of the C-terminal His-Tag. Within this first step approximately a regimen of 1 to 6 units integrate into the vector. The insert length was verified by DNA sequencing and cyclic ligation was confirmed with sequence verified res6 repeat units as described above. Target vector preparation (pET28-MCL-res6) was accomplished by BspQI digestion followed by a CIP reaction. Insert preparation was performed by EarI digestion of the same vector. Both reaction products were gel-purified for further reactions. In two rounds of cyclic ligation the resilin monomer template could be multimerised up to 30 repeat units (see
(29) For the de novo synthesis of nucleic acid templates for recombinant spider silk protein-monomers (=spisi1) four complementary chemically synthesized oligonucleotides of the respective sequence were phosphorylated, annealed and ligated into the pIB-ELPQ2 vector which was previously prepared by EarI digestion followed by a dephosphorylation reaction.
(30) TABLE-US-00006 Upperstrandoligo1: (SEQIDNO:71) 5 GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGC TGCTGCTGC3; upperstrandoligo2: (SEQIDNO:72) 5 TATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCT3; lowerstrandoligo1: (SEQIDNO:73) 5 AGCACCACCCATAGCAGCAGCAGCAGCCATACCAGCACCCT GACCACCCAGACCACCACG3; lowerstrandoligo2: (SEQIDNO:74) 5 ACCAGAGGTACCCTGAGAACCCAGACCACCGTAACCACCCT GACC3;
(31) After the first annealing of the complete nucleic acid template-monomer for the recombinant spider-silk a first round of cyclic ligation (as described above) leads to repeat unit numbers up to 5 units (spisi5). These fragments were sequence verified and transferred as EarI digested inserts into the pET28-MCL vector for further multimerisation and expression. With one cyclic ligation reaction (usage of spisi5 EarI fragment as insert) the construct could be multimerised up to 20 repeat units (2.2 kb). Further multimerisation must be performed with the directional ligation process (compare overview
Example 03: Oxidative Dimerisation of De Novo Synthesized Protein-Polymers
(32) Homopolymeric ELP-molecules (e.g. V20, R20, F20 and F40) were functionalized with a C-terminal, peripherical cysteine group (see
(33) As examples for the covalent connection of separately expressed proteins we demonstrate the oxidative homo- and heterodimerisation of different homopolymeric ELPs via disulfide-bridge formation of the respective proteins (see
Example 04: Photocrosslinking of Tyrosine Protein-Polymers
(34) ELP-molecules (e.g. ELP (V2V)15, ELP (V2Y)30) were dissolved in DMSO at concentrations of 5-20% w/w with APS 10 mM and Ru (or Fe) catalyst 0.5-1 mM (10 mM). The crosslinking is carried out with UV-light 400-500 nm using an UV-lamp. The concentration of the protein and the illumination time allow for the exact control of the elastic modulus (e.g. between several kPa and several hundred MPa to GPa) (see
Example 05: Synthetic Genes as Basic Constructs to Multimeric Peptides
(35) In the following exemplary multimeric peptide sequences have been prepared on the basis of RGD-based sequences using the inventive method for multimerization as generally described above.
(36) A. Basic Constructs for Peptide Multimers
(37) 1. Ordered Constructs
(38) The synthetic genes encoding peptides according to SEQ ID NO: 58 to 67 were ordered at Geneart GmbH (now: Invitrogene) Regensburg and other suppliers to demonstrate feasibility of the inventive concept with synthetic genes. Several suppliers were extremely expensive or not able to synthesize basic repetitive DNA sequences encoding petide multimers at the requested length and timescale.
(39) TABLE-US-00007 TABLE1 Exemplarypeptidemultimers synthetic genes shortdescription SEQ cleavage abbrevi- ofthepeptide/ ID site/ No. ation peptidemonomers NO: principle 1.1 (RGDser- GCGGNSEPRGDTYRAYN 58 Hydroxylamine/ NG)10 chemical 1.2 (RGD-I CGGNGEPRGDTYRAYIE 59 FaktorXA/ EGR)10 GR enzymatic 1.3 (RGD- CGGNGEPRGDTYRAYLV 60 Thrombin/ LVPR)1 PR enzymatic 1.4 (RGD- GCGNGEPRGDTYRAYENL 61 TEVprotease/ TEV)1 YFQ enzymatic 1.5 (fibro- GCGGGEEIQIGHIPREDV 62 Hydroxylamine/ nectin DYHLYPN chemical CS5-NG)10 1.6 (NCAM- GCGGEVYVVAENQQGKSK 63 Hydroxylamine/ FGL-NG)10 AN chemical 1.7 (NCAM- GCGGTIMGLKPETRYAVR 64 TEVprotease/ EncaminE- enzymatic TEV)10 ENLYFQ 1.8 (FDFDFDFD- GFDFDFDFDN 65 Hydroxylamine/ NG)5 chemical 1.9 (GoSi- GAYSSGAPPMPPFN 66 Hydroxylamine/ NG)5 chemical 1.10 (IronOx- GLSTVQTISPSNHN 67 Hydroxylamine/ NG)5 chemical
(40) TABLE-US-00008 TABLE2 Status expression pep- expres- construct tide sion peptide sta- inpET28- re- veri- No. monomer tus NMCysL* peats kDa fied 1.1 GCGGNSEPR 5 (RGDser- 10 20.2 + GDTYRAYN NG10)1 5 (RGDser- 20 38.2 + NG10)2 4 (RGDser- 40 74.2 + NG10)4 4 (RGDser- 80 146.2 + NG10)8 3 (RGDser- 100 NG10)10 1.2 CGGNGEPRG 5 (RGD- 10 22.7 + DTYRAYIEG IEGR10)1 R 5 (RGD- 20 43.3 + IEGR10)2 5 (RGD- 40 84.4 + IEGR10)4 3 (RGD- 50 IEGR10)5 1.3 CGGNGEPRG 2 (RGD- 1 DTYRAYLVP LVPR)1 R 1.4 GCGNGEPRG 2 (RGD- 1 DTYRAYENL TEV)1 YFQ 1.5 GCGGGEEIQ 4 (fibroCS5- 10 29.8 + IGHIPREDV NG10)1 DYHLYPN 4 (fibroCS5- 20 57.3 + NG10)2 4 (fibroCS5- 40 112.3 + NG10)4 3 (fibroCS5- 50 NG10)5 1.6 GCGGEVYVV 2 (NCAM-FGL- 10 22.2 + AENQQGKSK NG10)1 AN 1.7 GCGGTIMGL 0 (NCAM- 10 KPETRYAVR encam- ENLYFQ TEV10)1 1.8 GFDFDFDFD 2 (FDFDFDFD- 5 8.1 + N NG5)1 1.9 GAYSSGAPP 0 (GoSi-NG5)1 1 MPPFN 1.10 GLSTVQTIS 1 (IronOx- 1 PSNHN NG5)1 Status: 0-ordered, but not delivered because of synthesis problems of supplier 1 - ordered, received and cloned into pET28-NMCysL expression vector 2 - in process of multimerisation 3 - multimerised in pET28-NMCysL vector to higher order polymers and sequence verified 4 - transformation in expression cells, protein expression and purification 5 - cleavage of multimers with proteolytic substance, analysis *All constructs were cloned into pET28-NMCysL expression vector that allows expression and further polymerisation to higher order polymers by flanking restriction sites provided in the newly developed NMCys Linker region.
B. Cloning, Multimerisation and Expression of Peptide Multimers
(41) The delivered synthetic genes were cloned into the herein described and newly developed pET28-NMCysL expression and cloning vector.
(42) Therefore the EarI recognition sequences in the vector backbone were removed and a special Linker region (NMCys) was integrated to allow an efficient multimerisation of the repetitive multimeric peptide DNA template sequences within this vector (see
(43) At each step of multimerisation (see Table 2) the expression ability and expression level of the produced multimer could be tested. Successful expression of the multimerised peptide templates were marked with (+) in Table 2 (see also
(44) C. Hydroxylamine Cleavage of (RGDser-NG10).sub.2 Multipetides to Peptides
(45) As an example for chemical hydroxylamine cleavage of peptide multimers we use so far the (RGDser-NG10).sub.1 and (RGDser-NG10).sub.2 multimers
(46) The conditions for the proteolytic reactions were the following:
(47) (RGDser-NG10).sub.2 peptide multimeres (10 l with 50 g) were supplemented with 40 l of hydroxylamine cleavage-buffer (2 M H.sub.3HNO.sub.3HCl, 0.2 M Tris-HCl pH 8.0, 6.4 M GuHCl) to a final volume of 50 l.
(48) The reaction was incubated for different periods of time at 45 C., shaking with 300 rpm. As a control we used a repetitive elastin like protein: (V2Y1).sub.30 (39.7 kDa) with comparable molecular weight that displays no hydroxylamine recognition sequence.
(49) Under these conditions the protein (RGDser-NG10)2 (II.) was cleaved with H3NO-HCl (hydroxylamine) to the monomeric peptide RGDser-NG (I.) and multimers of these (see
(50) I. RGDser-NG Monomeric Amino Acid Sequence
(51) TABLE-US-00009 (SEQIDNO:58) GCGGNSEPRGDTYRAYN
II. (RGDser-NG10)2 Multimeric Amino Acid Sequence
(52) TABLE-US-00010 (SEQIDNO:68) MVHPSSSGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCG GNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTY RAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNS EPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAY NGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPR GDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGC GGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDT YRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGGRGI LEHHHHHH
(53) The resulting peptides after hydroxylamine cleavage are marked by different colored blocks. The protein (RGDser-NG10)2 (B.) should be cleaved with H3NO-HCl (hydroxylamine) to the monomeric peptide RGDser-NG (A.)
(54) D. Proteolytic Digest of (RGD-IEGR10)2 (RGDser-NG10)4 Multiptides with Factor XA
(55) As an example for enzymatical Faktor XA digest of peptide multimers the (RGD-IEGR10).sub.2 and (RGD-IEGR10).sub.4 multimers were used.
(56) The conditions for the enzymatic digest with Factor XA protease were the following. (RGDser-IEGR10)2 and 4 peptide multimers (25 g) were resuspended in 29.5 l in 1Faktor XA reaction-buffer (20 mM Tris-HCl pH 6.8, 50 mM NaCl, 1 mM CaCl.sub.2). One unit of Factor XA (0.5 l) was added to the reaction to a final volume of 30 l.
(57) The reaction was incubated for different periods of time at different temperatures (25 C., 30 C., 37 C.). As a control we used a repetitive elastin like protein: (V.sub.2Y.sub.1)30 (39.7 kDa) with comparable molecular weight that displays no Factor XA recognition sequence. Results of the Faktor XA cleavage reaction are presented in
(58) E. Expression, Purification and Quantification of Peptide Multimers
(59) Some of the cloned and expressed peptide multimers were so far expressed, purified and an average amount of protein per liter culture volume was quantified (see Table 3).
(60) TABLE-US-00011 TABLE 3 Quantification of peptide expression peptide concentration (mg/ml) average amount of multimers mg protein/ml culture protein(mg)/L culture (RGDser-NG10)1 0.41 412 (RGDser-NG10)2 0.54 531 (RGD-IEGR10)2 0.14 139 (RGD-IEGR10)4 0.08 76
Annex:
(61) The Annex shows sequences and sequence constructs exemplarily used and prepared and shall describe the present invention in a further way. These annex shall not be construed to limit the present invention thereto.
(62) 1. Exemplary Structural & Functional Macromolecular Protein Polymers Prepared According to the Present Invention:
(63) 1.1 Protein-Polymers (Based on Elastin-Like Protein (ELP) Sequences)
(64) 1.1.1 Homogeneous Protein-Polymers
(65) 1.1.1.1 ELP-V20-his (10.7 kDa)
(66) TABLE-US-00012 DNA-sequence (SEQIDNO:1) ATGACTGCAGGGAGCTCTTCTGGTGTTCCGGGTGTTGGTGTTCCGGGTG TTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGG TGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTT CCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGG GTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGT TGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGT GTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGGAAGAGAAGCTTGGATCC TCGAGCACCACCACCACCACCAC proteinsequence (SEQIDNO:2) MTAGSSSGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV PGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG VPGVGVPGVGGREAWILEHHHHHH
1.1.1.2 ELP-V100-His (43.4 kDa)
(67) TABLE-US-00013 DNA-sequence (SEQIDNO:3) ATGACTGCAGGGAGCTCTTCTGGTGTTCCGGGTGTTGGTGTTCCGGGTG TTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGG TGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTT CCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGG GTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGT TGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGT GTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTC CGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGG TGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTT GGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTG TTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCC GGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGT GTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTG GTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGT TCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCG GGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTG TTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGG TGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTT CCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGG GTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGT TGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGT GTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTC CGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGG TGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTT GGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTG TTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCC GGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGT GTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTG GTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGT TCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCG GGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTG TTGGTGGAAGAGAAGCTTGGATCCTCGAGCACCACCACCACCACCAC proteinsequence (SEQIDNO:4) MTAGSSSGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV PGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG VPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGV GVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPG VGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVP GVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV PGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG VPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGV GVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPG VGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVP GVGVPGVGVPGVGVPGVGGREAWILEHHHHHH
1.1.1.3 ELP-F20-His (11.2 kDa)
(68) TABLE-US-00014 DNA-sequence (SEQIDNO:5) ATGGTGCACCCGAGCTCTTCTGGTGTTCCGGGTTTCGGTGTTCCGGGTT TCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGG TGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTT CCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGG GTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTT CGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGT GTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGGAAGAGGAATTCACCACC ACCACCACCAC proteinsequence (SEQIDNO:6) MVHPSSSGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGV PGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFG VPGFGVPGFGGRGIHHHHHH
1.1.1.4 ELP-F100-His (47.8 kDa)
(69) TABLE-US-00015 DNA-sequence (SEQIDNO:7) ATGGTGCACCCGAGCTCTTCTGGTGTTCCGGGTTTCGGTGTTCCGGGTT TCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGG TGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTT CCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGG GTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTT CGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGT GTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTC CGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGG TTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTC GGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTG TTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCC GGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGT TTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCG GTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGT TCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCG GGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTT TCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGG TGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTT CCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGG GTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTT CGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGT GTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTC CGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGG TTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTC GGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTG TTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCC GGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGT TTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCG GTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGT TCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCG GGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTT TCGGTGGAAGAGGAATTCACCACCACCACCACCAC proteinsequence (SEQIDNO:8) MVHPSSSGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGV PGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFG VPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGF GVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPG FGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVP GFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGV PGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFG VPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGF GVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPG FGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVP GFGVPGFGVPGFGVPGFGGRGIHHHHHH
1.1.2 Heterogeneous Protein-Polymers (Mixed Functionalities)
1.1.2.1 V20-EYFP-V40 (EYFP=Enhanced Yellow Fluorescent Protein) (54.1 kDa)
(70) TABLE-US-00016 DNA-sequence (SEQIDNO:9) ATGACTGCAGGGAGCTCTTCTGGTGTTCCGGGTGTTGGTGTTCCGGGTG TTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGG TGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTT CCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGG GTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGT TGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGT GTTCCGGGTGTTGGTGTTCCGGGTGTTGGTATGGTGAGCAAGGGCGAGG AGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGT AAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACC TACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCG TGCCCTGGCCCACCCTCGTGACCACCTTCGGCTACGGCCTGCAGTGCTT CGCCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCC ATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACG GCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGT GAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATC CTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCA TGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCA CAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAAC ACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGA GCTACCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACAT GGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGAC GAGCTGTACAAGGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTC CGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGG TGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTT GGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTG TTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCC GGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGT GTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTG GTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGT TCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCG GGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTG TTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGG TGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTT CCGGGTGTTGGTGTTCCGGGTGTTGGTGGAAGAGAAGCTTGGATCCTCG AGCACCACCACCACCACCAC proteinsequence (SEQIDNO:10) MTAGSSSGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV PGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG VPGVGVPGVGMVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDAT YGKLTLKFICTTGKLPVPWPTLVTTFGYGLQCFARYPDHMKQHDFFKSA MPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNI LGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQN TPIGDGPVLLPDNHYLSYQSALSKDPNEKRDHMVLLEFVTAAGITLGMD ELYKGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGV GVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPG VGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVP GVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV PGVGVPGVGGREAWILEHHHHHH
1.1.2.2 ECFP-V40-EYFP
1.1.2.1 ECFP-V40-EYFP (ECFP=Enhanced Cyan Flourescent Protein) (72.6 kDa)
(71) TABLE-US-00017 DNA-sequence (SEQIDNO:11) ATGGTGCACCCGAGCTCTTCTGGTATGGTGAGCAAGGGCGAGGAGCTGT TCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGG CCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGC AAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCT GGCCCACCCTCGTGACCACCCTGACCTGGGGCGTGCAGTGCTTCAGCCG CTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCC GAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACT ACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCG CATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGG CACAAGCTGGAGTACAACTACATCAGCCACAACGTCTATATCACCGCCG ACAAGCAGAAGAACGGCATCAAGGCCAACTTCAAGATCCGCCACAACAT CGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCC ATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCC AGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCT GCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTG TACAAGGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTG TTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGG TGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTT CCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGG GTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGT TGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGT GTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTC CGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGG TGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTT GGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTG TTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCC GGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGT GTTGGTGTTCCGGGTGTTGGTATGGTGAGCAAGGGCGAGGAGCTGTTCA CCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCA CAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAG CTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC CCACCCTCGTGACCACCTTCGGCTACGGCCTGCAGTGCTTCGCCCGCTA CCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAA GGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACA AGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCAT CGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCAC AAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACA AGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGA GGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATC GGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCTACCAGT CCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCT GGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTAC AAGGGTGGAAGAGGAATTCTCGAGCACCACCACCACCACCAC proteinsequence (SEQIDNO:12) MVHPSSSGMVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYG KLTLKFICTTGKLPVPWPTLVTTLTWGVQCFSRYPDHMKQHDFFKSAMP EGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILG HKLEYNYISHNVYITADKQKNGIKANFKIRHNIEDGSVQLADHYQQNTP IGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDEL YKGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV PGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG VPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGV GVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPG VGVPGVGMVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGK LTLKFICTTGKLPVPWPTLVTTFGYGLQCFARYPDHMKQHDFFKSAMPE GYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGH KLEYNYNSHNVYIMADKQKNGIKVNEKIRHNIEDGSVQLADHYQQNTPI GDGPVLLPDNHYLSYQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELY KGGRGILEHHHHHH
1.1.2.3 ELP-His-V20C1 (11.7 kDa)
(72) TABLE-US-00018 DNA-sequence (SEQIDNO:13) ATGGGCAGCAGCCATCATCATCATCATCACAGCAGCGGCCTGGTGCCGC GCGGCAGCCATATGGCCATGGGTGTTCCGGGTGTTGGTGTTCCGGGTGT TGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGT GTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTC CGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGG TGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTT GGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTG TTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTGCGGTGGAAG AGAATTC proteinsequence (SEQIDNO:14) MGSSHHHHHHSSGLVPRGSHMAMGVPGVGVPGVGVPGVGVPGVGVPGVG VPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGV GVPGVGVPGVGVPGVGVPGVGVPGVGVPGCGGREF
1.1.2.4 ELP-His-R20C1 (12.8 kDa)
(73) TABLE-US-00019 DNA-sequence (SEQIDNO:15) ATGGGCAGCAGCCATCATCATCATCATCACAGCAGCGGCCTGGTGCCGC GCGGCAGCCATATGGCCATGGGTGTTCCGGGTAGAGGTGTTCCGGGTAG AGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGT GTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTC CGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGG TAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGA GGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTG TTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTTGCGGTGGAAG AGAATTC proteinsequence (SEQIDNO:16) MGSSHHHHHHSSGLVPRGSHMAMGVPGRGVPGRGVPGRGVPGRGVPGRG VPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGR GVPGRGVPGRGVPGRGVPGRGVPGRGVPGCGGREF
1.1.2.5 R20C1F20-His (20.8 kDa)
(74) TABLE-US-00020 DNA-sequence (SEQIDNO:17) ATGGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAG AGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAG GTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGT GTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGT TCCGGGTTGCGGT
GGAAGAGAATTGGCCGCACTCGAGCACCACCACCACCACCAC proteinsequence (SEQIDNO:18) MGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRG VPGRGVPGRGVPGRGVPGRGVPGRGVPGCGVP
GRELAALEHHHHHH
1.2 Resilin-Like ELP (Elastin-Like Protein): (V2Y1)15. (V2C1)20
1.2.1 ELP-(V2Y1)15 (22.7 kDa)
(75) TABLE-US-00021 DNA-sequence (SEQIDNO:19) ATGGGCAGCAGCCATCATCATCATCATCACAGCAGCGGCCTGGTGCCGC GCGGCAGCCATATGGCCATGGGCTCTTCTGGTGTTCCGGGTGTTGGTGT TCCGGGTGTTGGTGTTCCGGGTTACGGTGTTCCGGGTGTTGGTGTTCCG GGTGTTGGTGTTCCGGGTTACGGTGTTCCGGGTGTTGGTGTTCCGGGTG TTGGTGTTCCGGGTTACGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGG TGTTCCGGGTTACGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTT CCGGGTTACGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGG GTTACGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTA CGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTACGGT GTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTACGGTGTTC CGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTACGGTGTTCCGGG TGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTACGGTGTTCCGGGTGTT GGTGTTCCGGGTGTTGGTGTTCCGGGTTACGGTGTTCCGGGTGTTGGTG TTCCGGGTGTTGGTGTTCCGGGTTACGGTGTTCCGGGTGTTGGTGTTCC GGGTGTTGGTGTTCCGGGTTACGGTGTTCCGGGTGTTGGTGTTCCGGGT GTTGGTGTTCCGGGTTACGGTGGAAGAGAATTC proteinsequence (SEQIDNO:20) MGSSHHHHHHSSGLVPRGSHMAMGSSGVPGVGVPGVGVPGYGVPGVGVP GVGVPGYGVPGVGVPGVGVPGYGVPGVGVPGVGVPGYGVPGVGVPGVGV PGYGVPGVGVPGVGVPGYGVPGVGVPGVGVPGYGVPGVGVPGVGVPGYG VPGVGVPGVGVPGYGVPGVGVPGVGVPGYGVPGVGVPGVGVPGYGVPGV GVPGVGVPGYGVPGVGVPGVGVPGYGVPGVGVPGVGVPGYGVPGVGVPG VGVPGYGGREF
1.2.2 ELP-(V2C1)20-His (26.7 kDa)
(76) TABLE-US-00022 DNA-sequence (SEQIDNO:21) ATGGTGCACCCGAGCTCTTCTGGTGTTCCGGGTGTTGGTGTTCCGGGTG TTGGTGTTCCGGGTTGCGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGG TGTTCCGGGTTGCGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTT CCGGGTTGCGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGG GTTGCGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTG CGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTGCGGT GTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTGCGGTGTTC CGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTGCGGTGTTCCGGG TGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTGCGGTGTTCCGGGTGTT GGTGTTCCGGGTGTTGGTGTTCCGGGTTGCGGTGTTCCGGGTGTTGGTG TTCCGGGTGTTGGTGTTCCGGGTTGCGGTGTTCCGGGTGTTGGTGTTCC GGGTGTTGGTGTTCCGGGTTGCGGTGTTCCGGGTGTTGGTGTTCCGGGT GTTGGTGTTCCGGGTTGCGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTG GTGTTCCGGGTTGCGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGT TCCGGGTTGCGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCG GGTTGCGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTT GCGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTGCGG TGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTGCGGTGTT CCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTTGCGGTGGAAGAG GAATTCACCACCACCACCACCAC proteinsequence (SEQIDNO:22) MVHPSSSGVPGVGVPGVGVPGCGVPGVGVPGVGVPGCGVPGVGVPGVGV PGCGVPGVGVPGVGVPGCGVPGVGVPGVGVPGCGVPGVGVPGVGVPGCG VPGVGVPGVGVPGCGVPGVGVPGVGVPGCGVPGVGVPGVGVPGCGVPGV GVPGVGVPGCGVPGVGVPGVGVPGCGVPGVGVPGVGVPGCGVPGVGVPG VGVPGCGVPGVGVPGVGVPGCGVPGVGVPGVGVPGCGVPGVGVPGVGVP GCGVPGVGVPGVGVPGCGVPGVGVPGVGVPGCGVPGVGVPGVGVPGCGV PGVGVPGVGVPGCGGRGIHHHHHH
1.3 Amphiphilic Protein Block Copolymers
1.3.1 ELP-(V20R20) (18.2 kDa)
(77) TABLE-US-00023 DNA-sequence (SEQIDNO:23) ATGGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGT TGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTG GTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGT GTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGTGTTCCGGGTGTTGGT
GGAAGAGAAT TC proteinsequence (SEQIDNO:24) MGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG VPGVGVPGVGVPGVGVPGVGVPGVG
GREF
1.3.2 ELP-(R40F20) (28.5 kDa)
(78) TABLE-US-00024 DNA-sequence (SEQIDNO:25) ATGGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAG AGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAG GTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGT GTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGT TCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTC CGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGTGTTCCG GGTAGAGGTGTTCCGGGTAGAGGTGTTCCGGGTAGAGGT
GGAAGAGAATTC proteinsequence (SEQIDNO:26) MGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRG VPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVP GRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRGVPGRG
GREF
1.3.3 ELP-1E20F201 (18.6 kDa)
(79) TABLE-US-00025 DNA-sequence (SEQIDNO:27) ATGGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGA AGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAG GTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGT GTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGT
GGAAGAGAAT TC proteinsequence (SEQIDNO:28) MGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEG VPGEGVPGEGVPGEGVPGEGVPGEGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVP GFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGFGVPGEFGVPGFGVPGFGGREF
1.3.4 EYFP-E20F20-His (47.7 kDa)
(80) TABLE-US-00026 DNA-sequence (SEQIDNO:29) ATGGTGCACCCGAGCTCTTCTGGTATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGA GCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGA CCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTCGGCTACGGCCTG CAGTGCTTCGCCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCA GGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGG TGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTAC AACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACAT CGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCG ACAACCACTACCTGAGCTACCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAG TTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGGCGGCCGCAACCGGTGTTCCGGGTGAAGG TGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTG TTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTT CCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCC GGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTGAAGGTGTTCCGGGTTTCGGTGTTCCGG GTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGT TTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTT CGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCG GTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGGAAGAGAATTGATCCTCGAGCACCACCAC CACCACCAC proteinsequence (SEQIDNO:30) MVHPSSSGMVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTFGYGL QCFARYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNY NSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSYQSALSKDPNEKRDHMVLLE FVTAAGITLGMDELYKAAATGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGV PGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEGVPGEG
GRELILEHHH HHH
1.3.5 F70E40-mEGFP-His_(80.2 kDa)
(81) TABLE-US-00027 DNA-sequence (SEQIDNO:31) ATGACTGCAGGGAGCTCTTCTGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTT CCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGT GTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTC GGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGT TTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCG GGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTT CCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGT GTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTC GGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGT TTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCG GGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTT CCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGT GTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTC GGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGTTTCGGTGTTCCGGGT
1.4 Structural Protein-Polymers (not ELP Derived)
1.4.1 Recombinant Resilin Protein-Polymer: Res30-his (30 Repeat Units of 15 AS Monomer) (41.8 kDa)
(82) TABLE-US-00028 DNA-sequence (SEQIDNO:33) ATGGGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATG GTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGG TCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGA CCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACCTT CTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACCTTCTGA TTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACCTTCTGATTCT TACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACCTTCTGATTCTTACG GTGCTCCTGGTGGTGGTAATGGTGGTCGACCTTCTGATTCTTACGGTGC TCCTGGTGGTGGTAATGGTGGTCGACCTTCTGATTCTTACGGTGCTCCT GGTGGTGGTAATGGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTG GTGGTAATGGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGG TAATGGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAAT GGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTG GTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCG ACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACCT TCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACCTTCTG ATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACCTTCTGATTC TTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACCTTCTGATTCTTAC GGTGCTCCTGGTGGTGGTAATGGTGGTCGACCTTCTGATTCTTACGGTG CTCCTGGTGGTGGTAATGGTGGTCGACCTTCTGATTCTTACGGTGCTCC TGGTGGTGGTAATGGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGT GGTGGTAATGGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTG GTAATGGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAA TGGTGGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGT GGTCGACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTC GACCTTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACC TTCTGATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGTCGACCTTCT GATTCTTACGGTGCTCCTGGTGGTGGTAATGGTGGAAGAGAATTGATCC TCGAGCACCACCACCACCACCAC proteinsequence (SEQIDNO:34) MGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGR PSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDS YGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAP GGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGN GGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRP SDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSY GAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPG GGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNG GRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPSDSYGAPGGGNGGRPS DSYGAPGGGNGGRELILEHHHHHH
1.4.2 Recombinant Spidersilk-Protein-Polymer: Spisi80 (80 Repeat Units of 35 AS Monomer) (231.4 kDa)
(83) TABLE-US-00029 DNA-sequence (SEQIDNO:35) ATGACTGCAGGGAGCTCTTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTG GTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGG TGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCT GGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTG CTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGG TACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTATGGCTGCT GCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGTCTGGGTT CTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTGCTGGTAT GGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTACGGTGGT CTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGTCAGGGTG CTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGGGTGGTTA CGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCTGGGTGGT CAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCTGGTCAGG GTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTCGTGGTGGTCT GGGTGGTCAGGGTGCTGGTATGGCTGCTGCTGCTGCTATGGGTGGTGCT GGTCAGGGTGGTTACGGTGGTCTGGGTTCTCAGGGTACCTCTGGTGGAA GAGAATTC proteinsequence (SEQIDNO:36) MTAGSSSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTS GRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAA AAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGG LGSQGTSGRGGLGGQGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGG QGAGMAAAAAMGGAGQGGYGGLGSQGTSGRGGLGGQGAGMAAAAAMGGA GQGGYGGLGSQGTSGGREF
2. Protein-PolymersCleavable Multipeptides (e.g. Signalpeptides, Cellular Adhesion Peptides with Nested Chemical or Enzymatic Cleavage Sites)
2.1 Peptide-Multimer: (RGDser-NG10)10 (182.1 kDa) (100 repeat units of 17 AS monomer with hydroxylamine cleavage site (NG) for chemical cleavage)
(84) TABLE-US-00030 DNA-sequence (SEQIDNO:37) ATGGTGCACCCGAGCTCTTCTGGTTGCGGTGGTAACTCTGAACCGCGTG GTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCG TGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCG CGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAAC CGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGA ACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCT GAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACT CTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAA CTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGT AACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTG GTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGG TGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGC GGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTT GCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGG TTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAAC GGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACA ACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTA CAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCT TACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTG CTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCG TGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTAC CGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCT ACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACAC CTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGAC ACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTG ACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGG TGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGT GGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGC GTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACC GCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAA CCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTG AACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTC TGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAAC TCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTA ACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGG TAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGT GGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCG GTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTG CGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGT TGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACG GTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAA CGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTAC AACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTT ACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGC TTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGT GCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACC GTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTA CCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACC TACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACA CCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGA CACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGT GACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTG GTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCG TGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCG CGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAAC CGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGA ACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCT GAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACT CTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAA CTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGT AACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTG GTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGG TGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGC GGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTT GCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGG TTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAAC GGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACA ACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTA CAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCT TACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTG CTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCG TGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTAC CGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCT ACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACAC CTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGAC ACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTG ACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGG TGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGT GGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGC GTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACC GCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAA CCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTG AACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTC TGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAAC TCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGGTA ACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGTGG TAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCGGT GGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTGCG GTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGTTG CGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACGGT TGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAACG GTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTACAA CGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTTAC AACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGCTT ACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGTGC TTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACCGT GCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTACC GTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACCTA CCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACACC TACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGACA CCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGTGA CACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTGGT GACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCGTG GTGACACCTACCGTGCTTACAACGGTTGCGGTGGTAACTCTGAACCGCG TGGTGACACCTACCGTGCTTACAACGGTGGAAGAGGAATTCTCGAGCAC CACCACCACCACCAC proteinsequence (SEQIDNO:38) MVHPSSSGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEP RGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNS EPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGG NSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGC GGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYN GCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRA YNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTY RAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGD TYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPR GDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSE PRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGN SEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCG GNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNG CGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAY NGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYR AYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDT YRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRG DTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEP RGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNS EPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGG NSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGC GGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYN GCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRA YNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTY RAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGD TYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPR GDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSE PRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGN SEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCG GNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNG CGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAY NGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYR AYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDT YRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRG DTYRAYNGCGGNSEPRGDTYRAYNGCGGNSEPRGDTYRAYNGGRGILEH HHHHH
2.2 Peptide-Multimer: (RGDser-IEGR10)5 (104.9 kDa) (50 repeat units of 19 AS monomer with Factor Xa protease recognition sequence (IEGR))
(85) TABLE-US-00031 DNA-sequence (SEQIDNO:39) ATGGTGCACCCGAGCTCTTCTGGTGGTAACGGTGAACCGCGTGGTGACA CCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCG TGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGT GAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTG GTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCG TTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATC GAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTG CTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACAC CTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGT GGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTG AACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGG TAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGT TGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCG AAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGC TTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACC TACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTG GTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGA ACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGT AACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTT GCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGA AGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCT TACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCT ACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGG TGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAA CCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTA ACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTG CGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAA GGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTT ACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTA CCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGT GACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAAC CGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAA CGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGC GGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAG GTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTA CATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTAC CGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTG ACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACC GCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAAC GGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCG GTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGG TCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTAC ATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACC GTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGA CACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCG CGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACG GTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGG TGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGT CGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACA TCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCG TGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGAC ACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGC GTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGTAACGG TGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTCGTTGCGGT GGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACATCGAAGGTC GTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGTGCTTACAT CGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACACCTACCGT GCTTACATCGAAGGTCGTTGCGGTGGTAACGGTGAACCGCGTGGTGACA CCTACCGTGCTTACATCGAAGGTCGTTGCGGTGGAAGAGGAATTCTCGA GCACCACCACCACCACCAC proteinsequence (SEQIDNO:40) MVHPSSSGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNG EPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYI EGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPR GDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGR CGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDT YRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGG NGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRA YIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGE PRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIE GRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRG DTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRC GGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTY RAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGN GEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAY IEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEP RGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEG RCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGD TYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCG GNGEPRGDTYRAYIEGRCGGNGEPRGDTYRAYIEGRCGGNGEPRGDTYR AYIEGRCGGNGEPRGDTYRAYIEGRCGGRGILEHHHHHH
2.3 Peptide-Multimer: (NCAMenc-TEV10)1 (28.9 kDa) (10 repeat units of 24 AS monomer with TEV protease recognition sequence (ENLYFQ))
(86) TABLE-US-00032 DNA-sequence (SEQIDNO:41) ATGGTGCACCCGAGCTCTTCTGGCTGCGGCGGCACCATTATGGGCCTGA AACCGGAAACCCGCTATGCGGTGCGCGAAAACCTGTACTTTCAGGGCTG CGGCGGCACCATTATGGGCCTGAAACCGGAAACCCGCTATGCGGTGCGC GAAAACCTGTACTTTCAGGGCTGCGGCGGCACCATTATGGGCCTGAAAC CGGAAACCCGCTATGCGGTGCGCGAAAACCTGTACTTTCAGGGCTGCGG CGGCACCATTATGGGCCTGAAACCGGAAACCCGCTATGCGGTGCGCGAA AACCTGTACTTTCAGGGCTGCGGCGGCACCATTATGGGCCTGAAACCGG AAACCCGCTATGCGGTGCGCGAAAACCTGTACTTTCAGGGCTGCGGCGG CACCATTATGGGCCTGAAACCGGAAACCCGCTATGCGGTGCGCGAAAAC CTGTACTTTCAGGGCTGCGGCGGCACCATTATGGGCCTGAAACCGGAAA CCCGCTATGCGGTGCGCGAAAACCTGTACTTTCAGGGCTGCGGCGGCAC CATTATGGGCCTGAAACCGGAAACCCGCTATGCGGTGCGCGAAAACCTG TACTTTCAGGGCTGCGGCGGCACCATTATGGGCCTGAAACCGGAAACCC GCTATGCGGTGCGCGAAAACCTGTACTTTCAGGGCTGCGGCGGCACCAT TATGGGCCTGAAACCGGAAACCCGCTATGCGGTGCGCGAAAACCTGTAC TTTCAGGGTGGAAGAGGAATTCACCACCACCACCACCAC proteinsequence (SEQIDNO:42) MVHPSSSGCGGTIMGLKPETRYAVRENLYFQGCGGTIMGLKPETRYAVR ENLYFQGCGGTIMGLKPETRYAVRENLYFQGCGGTIMGLKPETRYAVRE NLYFQGCGGTIMGLKPETRYAVRENLYFQGCGGTIMGLKPETRYAVREN LYFQGCGGTIMGLKPETRYAVRENLYFQGCGGTIMGLKPETRYAVRENL YFQGCGGTIMGLKPETRYAVRENLYFQGCGGTIMGLKPETRYAVRENLY FQGGRGIHHHHHH