SPLIT INTEIN MEDIATED POLYMERIZATION AND PRODUCTION OF MUSSEL FOOT ADHESIVE PROTEIN MATERIALS
20200071368 ยท 2020-03-05
Assignee
Inventors
Cpc classification
C07K2319/20
CHEMISTRY; METALLURGY
C07K2319/92
CHEMISTRY; METALLURGY
International classification
Abstract
Mussels strongly adhere to a variety of surfaces by secreting byssal threads that contain mussel foot proteins (Mfps). Recombinant production of Mfps presents an attractive route for preparing advanced adhesive materials. Using synthetic biology strategies, Mfp5 together with Mfp5 oligomers containing two or three consecutive, covalently-linked Mfp5 sequences (named Mfp5.sup.2 and Mfp5.sup.3) were synthesized. Positive correlations were found between Mfp5 molecular weight and underwater adhesive properties, including adhesion force, adhesion work, protein layer thickness, and recovery distance. Dopa-modified Mfp5.sup.3 displayed a high adhesion force (20136 nN m.sup.1) and a high adhesion work (6821 fJ m.sup.1) for 200 s cure times, higher than previously reported Mfp-mimetic adhesives. Results disclosed herein highlight the power of synthetic biology in producing biocompatible and highly adhesive Mfp-based materials.
Claims
1. A mussel foot protein (Mfp) comprising a full length polypeptide sequence from a mussel species, a partial polypeptide sequence from a mussel species, or combinations thereof.
2. The Mfp of claim 1, wherein the Mfp is selected from the group consisting of Mfp1, Mfp2, Mfp3, Mfp4, and Mfp5.
3. The Mfp of claim 1, wherein the Mfp is an oligomer protein comprising a plurality of repeats of the Mfp within one protein molecule.
4. The Mfp of claim 3, wherein the oligomer protein is synthesized from a mixture of a Mfp-Int.sup.N fusion protein and an Int.sup.C-Mfp fusion protein.
5. The Mfp of claim 4, wherein the Mfp-Int.sup.N fusion protein is produced by fusing an N-terminal split intein (Int.sup.N) to a first Mfp repeat, and wherein the Int.sup.C-Mfp fusion protein is produced by fusing a C-terminal split intein (Int.sup.C) to a second Mfp repeat.
6. The Mfp of claim 5, wherein the first Mfp repeat is Mfp5.sup.2, and wherein the second repeat is Mfp5.sup.1.
7. The Mfp of claim 6, wherein the oligomer protein is Mfp5.sup.3, the Mfp-Int.sup.N fusion protein is Mfp5.sup.2-Int.sup.N, and the Int.sup.C-Mfp fusion protein is Int.sup.C-Mfp5.sup.1.
8. The Mfp of claim 5, wherein the first Mfp repeat is Mfp5.sup.1, and wherein the second repeat is Mfp5.sup.2.
9. The Mfp of claim 8, wherein the oligomer protein is Mfp5.sup.3, the Mfp-Int.sup.N fusion protein is Mfp5.sup.1-Int.sup.N, and the Int.sup.C-Mfp fusion protein is Int.sup.C-Mfp5.sup.2.
10. The Mfp of claim 5, wherein the Int.sup.N and Int.sup.C are from a Cfa split intein (SI).
11. The Mfp of claim 4, wherein the Mfp-Int.sup.N fusion protein and the Int.sup.C-Mfp fusion protein are separately expressed from E. coli.
12. A method for synthesizing a mussel foot protein (Mfp) oligomer protein, the method comprising: fusing an N-terminal split intein (Int.sup.N) to a first Mfp repeat to produce a Mfp-Int.sup.N fusion protein; fusing a C-terminal split intein (Int.sup.C) to a second Mfp repeat to produce an Int.sup.C-Mfp fusion protein; and mixing the Mfp-Int.sup.N fusion protein and the Int.sup.C-Mfp fusion protein to produce a Mfp oligomer protein.
13. The method of claim 12, wherein the Mfp-Int.sup.N fusion protein and the Int.sup.C-Mfp fusion protein are separately expressed from E. coli.
14. The method of claim 12, wherein the Int.sup.N and Int.sup.C are from a Cfa split intein (SI).
15. The method of claim 12, wherein the Mfp is selected from the group consisting of Mfp1, Mfp2, Mfp3, Mfp4, and Mfp5.
16. The method of claim 12, wherein the first Mfp repeat is Mfp5.sup.2, and wherein the second repeat is Mfp5.sup.1.
17. The method of claim 16, wherein the Mfp oligomer protein is Mfp5.sup.3, the Mfp-Int.sup.N fusion protein is Mfp5.sup.2-Int.sup.N, and the Int.sup.C-Mfp fusion protein is Int.sup.C-Mfp5.sup.1.
18. The method of claim 12, wherein the first Mfp repeat is Mfp5.sup.1, and wherein the second repeat is Mfp5.sup.2.
19. The method of claim 18, wherein the Mfp oligomer protein is Mfp5.sup.3, the Mfp-Int.sup.N fusion protein is Mfp5.sup.1-Int.sup.N, and the Int.sup.C-Mfp fusion protein is Int.sup.C-Mfp5.sup.2.
20. A system for producing a mussel foot protein (Mfp) adhesive, the system comprising a Mfp-Int.sup.N fusion protein and an Int.sup.C-Mfp fusion protein.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0010] The embodiments described herein may be better understood by referring to the following description in conjunction with the accompanying drawings.
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
DETAILED DESCRIPTION OF THE DISCLOSURE
[0029] Using synthetic DNA with codons optimized for E. coli expression, Mytilus galloprovincialis Mfp5 were designed together with Mfp5 oligomers containing two or three consecutive Mfp5 sequences (named Mfp5.sup.2 and Mfp5.sup.3) and cloned them into standardized expression vectors (
[0030] Multiple Mfps from different mussel species exist, each having a slightly different amino acid sequence. In some embodiments, different Mfps (e.g., Mfp1, Mfp2, Mfp3, Mfp4, Mfp5, etc.) are utilized in the methods disclosed herein to produce Mfp oligomers with any number of repeats. Further, as described herein, a Mfp oligomer can be made from several different combinations via split intein reactions. For example, in some embodiments Mfp5.sup.3 is made from Mfp5.sup.2-Int.sup.N and Int.sup.C-Mfp5.sup.1. In other embodiments, Mfp5.sup.3 is made from Mfp5.sup.1-Int.sup.N and Int.sup.C-Mfp5.sup.2.
[0031] In some embodiments, both Mfp5.sup.1 and Mfp5.sup.2 are produced in E. coli and purified using affinity chromatography to purities of 99.9% and 92.2%, respectively (
[0032] SIs are auto-catalytic peptides that catalyze the spontaneous splicing-ligation reactions between two SI-fused target proteins, assembling the two target proteins covalently in trans (
[0033] MALDI-TOF spectra of unmodified Mfp5.sup.1 and Mfp5.sup.1.sub.DOPA are shown in
[0034] MALDI-TOF spectra of unmodified Mfp5.sup.2 and Mfp5.sup.2.sub.DOPA are shown in
[0035] Peak force tapping atomic force microscopy (PFT-AFM) was used to examine asymmetric adhesive and cohesive characteristics of the Mfp5 proteins between a colloidal probe and a protein-adsorbed mica surface under aqueous conditions (see
[0036]
[0037]
[0038] A similar positive correlation was also observed using synthetic poly-dihydroxystyrene-co-styrene polymers. From the retract curves, the recovery distance was also measured, which is defined as the length between the point where maximum adhesion force is achieved and the point where the probe is re-established to the initial zero adhesion baseline.
[0039] Related to recovery distance, another critical material property for adhesives is adhesion work, also known as peel force or work of separation. In
[0040] The longest curing time that allowed a reliable approach-retract curve measurement for Mfp5.sup.3.sub.DOPA was 200 s, because longer cure times produced adhesion strengths that were too high and prevented recovery of retract curves, even at the maximum AFM probe separation of our equipment (1 m). At a cure time of 200 s, Mfp5.sup.3.sub.DOPA exhibited an adhesion force of 201 nN m.sup.1 and an adhesion work of 68 fJ m.sup.1, values which are higher than previously reported for Mfp-mimetic adhesives (
[0041] Protein-protein interactions confer high cohesion, and thus overall adhesion between two surfaces. To analyze protein absorption to one surface, the Mfps disclosed herein were subjected to steady shear-flows in phosphate-buffered saline (PBS) buffer and quantified adsorbed protein amounts using a quartz crystal microbalance with dissipation monitoring (QCM-D). According to polymer physics, these conditions should lead to physical chain disentanglement, resulting in a thin and disperse protein layer from a soluble protein solution.
[0042]
[0043] In accordance with the present disclosure, E. coli was successfully engineered to produce Mfp5-based adhesive proteins with strong underwater adhesion capabilities. Under the experimental conditions, Mfp5.sup.3.sub.DOPA displayed high adhesion forces and adhesion works that were comparable to or higher than previously reported Mfp-mimetic adhesives. With a longer curing time, the adhesion force of Mfp5.sup.3 could be even higher than what was measured, but was beyond the detection limit of the method disclosed herein. Such adhesive proteins can potentially be used to replace natural Mfps in a variety of applications, with even better performance due to their stronger underwater adhesion. The results demonstrated that protein-protein cohesive interactions play an important role in determining both adhesion force and adhesion work. Positive correlations were observed between Mfp5's molecular weight and its measured adhesion force and adhesion work. These correlations indicate that producing high molecular weight proteins might be a natural strategy to obtain extensive cohesive interactions, as evidenced in Mfp2 and Mfp4, which have the highest molecular weights within the byssal plaque core (45 kDa and 90 kDa, respectively) among all Mfps and play cohesive roles in mussel plaques. The observed molecular weight dependent adhesion can be used in design rules to guide future engineering efforts, creating even stronger underwater adhesives.
[0044] In some embodiments, a Mfp is disclosed comprising a full length polypeptide sequence from a mussel species, a partial polypeptide sequence from a mussel species, or combinations thereof. In some embodiments, the Mfp is selected from the group consisting of Mfp1, Mfp2, Mfp3, Mfp4, and Mfp5.
[0045] In some embodiments, the Mfp is an oligomer protein comprising a plurality of repeats of the Mfp within one protein molecule. In some embodiments, the oligomer protein is synthesized from a mixture of a Mfp-Int.sup.N fusion protein and an Int.sup.C-Mfp fusion protein. In some embodiments, the Mfp-Int.sup.N fusion protein is produced by fusing an N-terminal split intein (Int.sup.N) to a first Mfp repeat, and the Int.sup.C-Mfp fusion protein is produced by fusing a C-terminal split intein (Int.sup.C) to a second Mfp repeat.
[0046] In some embodiments, the first Mfp repeat is Mfp5.sup.2, and the second repeat is Mfp5.sup.1. In some embodiments, the oligomer protein is Mfp5.sup.3, the Mfp-Int.sup.N fusion protein is Mfp5.sup.2-Int.sup.N, and the Int.sup.C-Mfp fusion protein is Int.sup.C-Mfp5.sup.1.
[0047] In some embodiments, the first Mfp repeat is Mfp5.sup.1, and the second repeat is Mfp5.sup.2. In some embodiments, the oligomer protein is Mfp5.sup.3, the Mfp-Int.sup.N fusion protein is Mfp5.sup.1-Int.sup.N, and the Int.sup.C-Mfp fusion protein is Int.sup.C-Mfp5.sup.2. In some embodiments, the Int.sup.N and Int.sup.C are from a Cfa split intein (SI). In some embodiments, the Mfp-Int.sup.N fusion protein and the Int.sup.C-Mfp fusion protein are separately expressed from E. coli.
EXPERIMENTAL SECTION
[0048] Chemicals and Reagents. Unless otherwise noted, all chemicals and reagents were obtained from Millipore Sigma (Saint Louis, Mo., USA). Plasmid purification and gel extraction kits were purchased from iNtRON Biotechnology (Seoul, South Korea). FastDigest restriction enzymes and T4 DNA ligase were purchased from Thermo Fisher Scientific (Austin, Tex., USA) and were used according to the suggested protocols from the manufacturer.
[0049] Plasmid Construction. A BioBrick system was used to facilitate cloning of two-part split intein-fused material protein domains. This system takes the following conditions into account: (a) split intein genes, ribosome binding sites (RBS), 5-untranslated regions (5-UTR), antibiotic resistance markers, promoters, and origins of replication are flanked with appropriate restriction sites that allow them to be easily swapped with other corresponding genes of interest, (b) restriction sites that exist within open reading frames introduce amino acids that are small, flexible, and are not expected to change the protein behavior and dynamics, (c) restriction sites are all distinct with respect to one another to facilitate the assembly of complete plasmids in one pot and in one step. All restriction sites used are schematically mapped out in
[0050] E. coli strain MDS42pdu was used as a host strain for cloning of all genes and plasmids in this study. The amino acid sequences of Mytilus galloprovincialis Mfp5.sup.1, Mfp5.sup.2, and Mfp5.sup.3 were codon-optimized for E. coli expression using the Gene Designer 2.0 software package (DNA 2.0 Inc.). All designed DNA sequences were chemically synthesized by Integrated DNA Technologies Inc. (San Jose, Calif., USA) (Table 1). These synthetic genes were then amplified using polymerase chain reaction (PCR) with corresponding forward and reverse primers as listed in Table 2. All mfp5 genes were amplified with BglII and BamHI sites on the 5 and 3 ends, respectively, for insertion into the pE7a-AKTK-H6 backbone containing the same sites, which was previously PCR amplified from plasmid pE7a-GFP1, with the addition of short coding sequences, 5-ATGGCTAAGACTAAACATCATCACCATCATCAC-3, translating to N-MAKTK-H6-C (
[0051] To construct plasmids containing split intein-fused Mfp5 proteins, the amino acid sequences of Cfa N- and C-inteins (Cfa.sup.N and Cfa.sup.C, respectively) were first codon-optimized and chemically synthesized using the same method as described above. The synthesized gene containing the Cfa.sup.N domain, which is flanked by NdeI and KpnI sites, and the Cfa.sup.C domain, which is flanked by Kpn2I and XhoI sites, were digested directly from the synthesized DNA. The mfp52 and mfp51 genes were amplified with corresponding primers (Table 2) containing KpnI and Kpn2I restriction sites. Amplified mfp52 and mfp51 fragments were digested (
[0052] Expression of Recombinant Proteins. E. coli strain BL21(DE3) (Thermo Fisher Scientific, Waltham, Mass.) was used as a host strain for expression of Mfps. E. coli strains containing the plasmids listed in Table 3 were cultured in shake flasks with Luria-Bertani (LB) broth containing 10 g/L tryptone, 10 g/L sodium chloride, and 5 g/L yeast extract with the appropriate antibiotic (100 g/mL ampicillin). Fresh transformants were cultivated overnight in 50 mL LB medium at 37 C. Overnight cultures were then used to inoculate 1 L fresh LB medium in Erlenmeyer flasks at an initial OD600=0.08. Cultures were grown at 37 C. with shaking to OD600=0.6, then induced by addition of 500 M (for Mfp5.sup.1 and Cfa.sup.C-Mfp5.sup.1) or 50 M (Mfp5.sup.2 and Mfp5.sup.2-Cfa.sup.N) IPTG. The culture was further cultivated at 37 C. at 250 rpm for another 5-7 hours. Cells were harvested by centrifugation at 4,500g for 20 min at 4 C. Centrifuged cell pellets were either directly extracted or stored at 80 C. until needed.
[0053] Protein Purification. For Mfp5.sup.1 and Mfp5.sup.2, cell pellets were resuspended in 10 mL of guanidine lysis buffer (6 M guanidine hydrochloride (BioBasic Inc., Amherst, N.Y., USA), 50 mM potassium phosphates, and 300 mM sodium chloride at pH 7.4) per gram of wet cells and lysed by agitation at 250 rpm. The lysates were centrifuged at 20,000g for 20 min at 18 C. To reduce the viscosity, collected lysates were further sonicated on ice (to maintain a roughly ambient temperature) for 30 minutes with a QSonica probe sonicator using 5 seconds on/5 seconds off cycles. The lysates were filtered through 0.2 m filter membranes. Both proteins were purified using an AktaPure Fast Protein Liquid Chromatograph (FPLC, GE Healthcare Inc., Chicago, Ill., USA) equipped with a 5 mL nickel affinity chromatography column (GE Healthcare). The column was pre-equilibrated with guanidine lysis buffer followed by sample loading. After washing with 5-10 column volumes (CVs) of guanidine wash buffer (6 M guanidine hydrochloride, 50 mM potassium phosphates, 300 mM sodium chloride, and 50 mM imidazole at pH 7.4), proteins were eluted and fractionated with 5-10 CVs of guanidine elution buffer (6 M guanidine hydrochloride, 50 mM potassium phosphates, 300 mM sodium chloride, and 250 mM imidazole at pH 7.4). Purified Mfp5.sup.1 and Mfp5.sup.2 proteins were examined by SDS-PAGE as shown in
[0054] Split-Intein Mediated Ligation and Purification of Mfp5.sup.3. Cell pellets containing Cfa.sup.C-Mfp5.sup.1 and Mfp5.sup.2-Cfa.sup.N-His10 fusion proteins were separately resuspended in 10 mL of urea lysis buffer (8 M urea, 100 mM sodium phosphates, and 300 mM sodium chloride at pH 7.4) per gram of wet cells and lysed by agitation at 250 rpm overnight. The lysates were centrifuged at 20,000g for 20 min at 18 C. Clear cell lysate was then mixed at a final reactant ratio of 4:1 (Cfa.sup.C-Mfp5.sup.1:Mfp5.sup.2-Cfa.sup.N-His10) based on densitometric analysis of Coomassie Blue-stained SDS-PAGE gels. The excess Int.sup.C-Mfp5.sup.1 does not contain His-tag, thus can be easily separated from the ligated product. The lysate mixture was stirred at 30 C. for 8 hours. The mixed lysate was then filtered through a 0.2 m filter membrane and purified by nickel affinity chromatography as described above.
[0055] Post-Translational Modification with Tyrosinase. Purified protein solutions were first dialyzed against 100 mM sodium acetates buffer at pH 5.5 using a 10 kDa molecular weight cut off (MWCO) dialysis membrane (Thermo Fisher Scientific). Dialyzed proteins were then diluted to a concentration of 4 mg/mL in 100 mM sodium acetates buffer at pH 5.5 with 100 mM ascorbic acid and filtered. Tyrosinase was added to a final concentration of 250 U mL.sup.1, and the mixture was incubated at 37 C. with agitation at 250 rpm for 30 minutes. After the reaction, the solution was filtered, and the enzyme activity in the flow-through was quenched by adding 0.2 mL of 6 N HCl per mL of reaction. The solution was filtered a final time and then was dialyzed extensively in 5% acetic acid at 4 C. and lyophilized.
[0056] Matrix-Assisted Laser Desorption/Ionization-Time of Flight (MALDI-TOF) Mass Spectrometry (MS) Analysis. Purified proteins at a final concentration of 10 M in 0.1 v/v %. trifluoroacetic acid (TFA) solution were mixed with dihydroxyacetophenone (DHAP) and spotted on a stainless steel MALDI target plate. Samples were analyzed using a Shimadzu AXIMA Resonance MALDI-TOF Mass Spectrometer (Shimadzu, Columbia, Md., USA) at the Saint Louis University Protein Core Facility. Positive-ion mass spectra analysis was conducted in standard linear mode with a laser power/acceleration voltage of 60-100 volts. The quadruple ion trap available for MS analysis was limited by a maximum m/z ratio of 15 kDa. For Mfp5.sup.1.sub.DOPA with an expected molecular weight of 10978.02 Da, the single- and double-charged state peaks could be assigned at m/z values of 5489.01 and 3659.34, respectively (
[0057] AFM Colloidal Probe Analysis. Peak Force Tapping-Atomic Force Microscopy (PFT-AFM) was used to characterize the protein samples by measuring adhesion force and adhesion works using a colloidal probe AFM cantilever (
[0058] For sample preparation, 10 L of protein solution in PBS buffer was pipetted on a mica surface and set quiescently for 30 minutes. After extensive washing, the mica surface was mounted on the AFM stage and the adsorbed protein was probed under a buffered condition (100 mM sodium phosphates pH 7.4). Bovine serum albumin (BSA) and unmodified Mfp5.sup.1 were used as controls and measured under the same condition.
[0059] Approach curves were fitted to the Alexander-de Gennes (AdG) model:
where F is the measured adhesion force, Z is the measured separation distance, k.sub.B is the Boltzmann constant, T is the absolute ambient temperature (298 K), R is the contact probe radius (5 m). L, h, and s are fitted parameters that represent equilibrium film thickness, the offset distance that considers material compressibility upon maximum probe contact, and the average distance between occupied sorption sites, respectively. This model has been used to describe the interaction of random-coil polymers and proteins on surfaces. The data was fit using MATLAB. Data far beyond the fitted equilibrium thicknesses were not included in order to prevent misfitting at larger separation distances. All fitted data had correlation coefficients (r.sup.2) of at least 99%.
[0060] The fitted equilibrium thickness (L) parameters were further compared with both the radii of gyration assuming random-coil configurations, <R>, and the end-to-end distances when the proteins are fully stretched out as rigid rods, L.sub.r. These distances were calculated using the formulas:
<R>={square root over (N <l>.sup.2)}(2)
and
L.sub.r=Nl (3)
where N is the number of amino acids in the protein and 1 is the length of one amino acid (3.5 angstroms).
[0061] Following probe approaching, the probe was cured on the protein sample for a delay time varying from 0 to 50 seconds for most measurements. The adhesion force was taken at the point of maximum force (most negative force). The adhesion work was calculated as the area under the retraction curve, relative to the zero nN baseline (
[0062] For the strongest oligomer tested, Mfp5.sup.3.sub.DOPA adhesion forces with each respective cure time tested were averaged and fit to a logistic model of the form:
where F is the measured adhesion force, t is the cure time, and A, B, and Q are fitted constants. This model was appropriate given the assumption that an infinitesimally small cure time would result in zero adhesion force, while an infinite cure time would result in a theoretical maximum adhesion force (represented by the fit constant A) upon complete underwater curing of the material. The log-log scaling of the adhesion force and cure time axes allowed for the fitted model to give rise to an unpatterned residual, which indicates a high quality of fitting (
[0063] Quartz Crystal Microbalance with Dissipation Monitoring (QCM-D) Analysis. Gold-coated quartz crystal sensors (QSX 301, Biolin Scientific, Gothenburg, Sweden) were used for QCM analysis. The sensors were cleaned by UV irradiation for 15 minutes, heated in a mixture of ammonia (25%) and hydrogen peroxide (30%) at 75 C. for 5 minutes, then thoroughly rinsed with distilled water, dried with N.sub.2, and subjected to 10 minutes of UV irradiation before being mounted in the QCM flow modules (Biolin Scientific). PBS buffer carrier solution was flowed through an injector valve into the flow cell modules containing the quartz sensors at a flow rate of 0.1 mL min.sup.1 until stable baselines were achieved. Protein solutions were prepared to final concentrations of 10 M (or 2 mg mL.sup.1 in the case of BSA) in PBS buffer (100 mM sodium phosphates pH 7.4) and flowed through the flow cell at a flow rate of 10 L min.sup.1 for 60 minutes. PBS buffer carrier solution was flowed through to wash unbound protein off the quartz sensors until the frequency shift was stable (
where m is change in mass, f is change in crystal resonance frequency, A is the active area of the crystal between electrodes, .sub.quartz is the shear modulus of the quartz crystal, .sub.quartz is the density of the quartz crystal, and F.sub.quartz is the reference frequency. This equation can be simplified to:
m=C.Math.f (6)
where C(=17.7 ng cm.sup.2 Hz.sup.1) is a constant that combines all constants relating to using gold-coated quartz sensors for deposition with the QCM instrument utilized herein.
[0064] When introducing elements of the present invention or the preferred embodiment(s) thereof, the articles a, an, the, and said are intended to mean that there are one or more of the elements. The terms comprising, including, and having are intended to be inclusive and mean that there may be additional elements other than the listed elements.
[0065] As various changes could be made in the above constructions without departing from the scope of the invention, it is intended that all matter contained in the above description or shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.
TABLE-US-00001 TABLE1 Codingsequences Name Sequence Description mfp5.sup.1 GCTAAGACTAAACATCATCACCATCACCACGGT Codingsequenceencoding GGCGGTGGCAGCAGATCTGGATCTAGCAGCGAAG Mfp5.sup.1protein,which AGTATAAAGGTGGTTACTACCCGGGCAACACTTA includescodingsequencesfor CCACTACCATAGCGGCGGCTCCTACCACGGTTCC theAKTKexpressiontag, GGCTATCACGGTGGCTACAAAGGTAAATACTACG His.sub.6affinitytag,andlinker GCAAAGCGAAAAAGTACTACTACAAATATAAAA &BioBrickcutsites ACTCCGGCAAATACAAGTATCTGAAGAAGGCTC GTAAATATCACCGTAAAGGTTACAAAAAGTATTA CGGCGGTGGTTCTTCTGGATCC mfp5.sup.2 GCTAAGACTAAACATCATCACCATCACCACGGT Codingsequenceencoding GGCGGTGGCAGCAGATCTTCAAGTGAAGAATATA Mfp5.sup.2protein,which AGGGCGGCTATTACCCTGGGAACACCTACCATTA includescodingsequencesfor CCACTCGGGTGGTTCGTATCATGGTTCCGGCTAC theAKTKexpressiontag, CATGGTGGTTACAAGGGGAAGTATTATGGTAAA His.sub.6affinitytag,andlinkers GCGAAGAAATACTATTACAAATACAAGAATTCTG &BioBrickcutsites GAAAGTATAAATATCTGAAAAAAGCTCGCAAAT ATCATCGTAAAGGATATAAAAAATACTATGGAG GTGGCAGTAGCAGTTCCGAAGAATATAAAGGGG GTTACTACCCTGGAAACACTTATCATTATCATAG TGGAGGTTCTTACCATGGCAGTGGATATCACGGC GGTTATAAGGGTAAGTATTATGGGAAAGCCAAA AAGTATTACTATAAGTACAAGAATTCTGGCAAAT ACAAGTACTTGAAGAAGGCTCGTAAGTACCACC GCAAAGGTTACAAGAAATACTATGGAGGCGGTT CAAGTCTCGAG Cfa.sup.N AAAAAACATATGGTCAAGATCATTAGTCGTAAGA CodonoptimizedN-terminal GTCTGGGCACTCAAAACGTCTACGATATTGGAGT SI.sup.Ccodingsequencefor AGAAAAAGATCATAATTTTTTGCTGAAGAATGGG mfp5.sup.3assemblywithmfp5 CTGGTGGCCTCTAACTGCTTCAACGGTACC sequence,whichincludesthe codingsequenceforSI.sup.N nativeexteinaminoacids& KpnIrestrictionsite Cfa.sup.C TCCGGAGCAGAATATTGCCTGTCTTACGACACA CodonoptimizedC-terminal GAGATTCTGACCGTTGAATATGGATTCCTTCCTA SI.sup.Ncodingsequencefor TCGGTAAGATCGTGGAGGAACGGATTGAATGCA mfp5.sup.3assemblywithmfp5.sup.2 CAGTCTATACGGTAGATAAAAATGGCTTTGTGTA sequence,whichincludesthe TACACAACCTATTGCTCAGTGGCATAACCGGGGA codingsequenceforSI.sup.C GAACAGGAAGTTTTCGAATACTGCTTAGAAGACG nativeexteinaminoacids& GTTCGATTATCCGTGCAACGAAAGATCACAAATT Kpn2Irestrictionsite TATGACGACCGACGGTCAGATGTTACCGATTGAT GAGATTTTCGAACGGGGGTTAGACCTGAAACAA GTTGATGGTTTGCCGTAAGGATCCAAAAAA
TABLE-US-00002 TABLE2 Primers Name Sequence Description BglII- AAAAAAAGATCTAGCAGCGAAGAGTAT Forwardprimerforamplificationof Mfp5.sup.1-F AAAGGTG mfp5.sup.1withBglIIoverhangfor insertionintopE7a-AKTK-H6 backbone BamHI- AAAAAAGGATCCAGAAGAACCACCGCC Reverseprimerforamplificationof Mfp5.sup.1-R G mfp5.sup.1withBamHIoverhangfor insertionintopE7a-AKTK-H6 backbone BglII- AAAAAAAGATCTTCAAGTGAAGAATAT Forwardprimerforamplificationof Mfp5.sup.2-F AAGGGCGGCTAT mfp5.sup.2withBglIIoverhangfor insertionintopE7a-AKTK-H6 backbone XhoI- AAAAAACTCGAGACTTGAACCGCCTCC Reverseprimerforamplificationof Mfp5.sup.2-R ATAGTATTTCTTG mfp5.sup.2withXhoIoverhangfor insertionintopE7a-AKTK-H6 backbone KpnI- AAAAAAGGTACCAGCAGCGAAGAGTAT Forwardprimerforamplificationof Mfp5.sup.1-F AAAGGTGGTTACTACC mfp5.sup.1withKpnIoverhangfor insertionintopE7a-SIbackbone Kpn2I- AAAAAATCCGGAAGAAGAACCACCGCC Reverseprimerforamplificationof Mfp5.sup.1-R GTAATAC mfp5.sup.1withKpn2Ioverhangfor insertionintopE7a-SIbackbone KpnI- AAAAAAGGTACCTCAAGTGAAGAATAT Forwardprimerforamplificationof Mfp5.sup.2-F AAGGGCGGCTATTACCC mfp5.sup.2withKpnIoverhangfor insertionintopE7a-SIbackbone Kpn2I- AAAAAATCCGGAACTTGAACCGCCTCC Reverseprimerforamplificationof Mfp5.sup.2-R ATAGTATTTCTTGTAAC mfp5.sup.2withKpn2Ioverhangfor insertionintopE7a-SIbackbone
TABLE-US-00003 TABLE 3 Plasmids Name ORI Promoter Resistance Gene pE7a ColE1 P.sub.T7 Amp.sup.R N/A pE7a-mfp5.sup.1 ColE1 P.sub.T7 Amp.sup.R mfp5.sup.l pE7a-mfp5.sup.2 ColE1 P.sub.T7 Amp.sup.R mfp5.sup.2 pE7a-mfp5.sup.2-Cfa.sup.N ColE1 P.sub.T7 Amp.sup.R mfp5.sup.2 + Cfa.sup.C pE7a-Cfa.sup.C-mfp5.sup.1 ColE1 P.sub.T7 Amp.sup.R Cfa.sup.N + mfp5.sup.1
TABLE-US-00004 TABLE 4 Strains Name Genotype MDS42pdu MDS42 polB dinB umuDC BL21(DE3) F.sup. ompT gal dcm lon hsdS.sub.B(r.sub.B.sup.m.sub.B.sup.) (DE3 [lacI lacUV5-T7p07 indl sam7 nin5]) [malB.sup.+].sub.K-12(.sup.S) sM-Mfp5 MDS42pdu containing pE7a-mfp-5 sM-Mfp5.sup.2 MDS42pdu containing pE7a-mfp-5.sup.2 sM-Mfp-5.sup.2-Cfa.sup.N MDS42pdu containing pE7a-mfp-5.sup.2-Cfa.sup.N sM-Cfa.sup.C-mfp-5 MDS42pdu containing pE7a-Cfa.sup.C-mfp-5 sB-Mfp5 BL21(DE3) containing pE7a-mfp-5 sB-Mfp5.sup.2 BL21(DE3) containing pE7a-mfp-5.sup.2 sB-Mfp-5.sup.2-Cfa.sup.N BL21(DE3) containing pE7a-mfp-5.sup.2-Cfa.sup.N sB-Cfa.sup.C-Mfp-5 BL21(DE3) containing pE7a-Cfa.sup.C-mfp-5
TABLE-US-00005 TABLE5 Proteinsequences Name Sequence Description Mfp5.sup.1 AKTKHHHHHHGGGGSRSGSSSEEYKG Mfp5protein,which GYYPGNTYHYHSGGSYHGSGYHGGYK includestheAKTK GKYYGKAKKYYYKYKNSGKYKYLKK expressiontag,His.sub.6 ARKYHRKGYKKYYGGGSSGS affinitytag,andlinker &BioBrickcutsites Mfp5.sup.2 AKTKHHHHHHGGGGSRSSSEEYKGGY Mfp5.sup.2protein,which YPGNTYHYHSGGSYHGSGYHGGYKGK includestheAKTK YYGKAKKYYYKYKNSGKYKYLKKAR expressiontag,His.sub.6 KYHRKGYKKYYGGGSSSSEEYKGGYY affinitytag,andlinker PGNTYHYHSGGSYHGSGYHGGYKGKY &BioBrickcutsites YGKAKKYYYKYKNSGKYKYLKKARK YHRKGYKKYYGGGSSLE Cfa.sup.N AEYCLSYDTEILTVEYGFLPIGKIVEERI C-terminalSI.sup.Nsequence ECTVYTVDKNGFVYTQPIAQWHNRGE formfp5.sup.3assemblywith QEVFEYCLEDGSIIRATKDHKFMTTDG mfp5.sup.2,whichincludes QMLPIDEIFERGLDLKQVDGLP theSI.sup.Nnativeextein aminoacids Cfa.sup.C VKIISRKSLGTQNVYDIGVEKDHNFLLK N-terminalSI.sup.Csequence NGLVASNCFN formfp5.sup.3assemblywith mfp5,whichincludes theSI.sup.Cnativeextein aminoacids