Split intein mediated polymerization and production of mussel foot adhesive protein materials
11530245 · 2022-12-20
Assignee
Inventors
Cpc classification
C07K2319/20
CHEMISTRY; METALLURGY
C07K2319/92
CHEMISTRY; METALLURGY
International classification
Abstract
Mussels strongly adhere to a variety of surfaces by secreting byssal threads that contain mussel foot proteins (Mfps). Recombinant production of Mfps presents an attractive route for preparing advanced adhesive materials. Using synthetic biology strategies, Mfp5 together with Mfp5 oligomers containing two or three consecutive, covalently-linked Mfp5 sequences (named Mfp5.sup.2 and Mfp5.sup.3) were synthesized. Positive correlations were found between Mfp5 molecular weight and underwater adhesive properties, including adhesion force, adhesion work, protein layer thickness, and recovery distance. Dopa-modified Mfp5.sup.3 displayed a high adhesion force (201±36 nN μm.sup.−1) and a high adhesion work (68±21 fJ μm.sup.−1) for 200 s cure times, higher than previously reported Mfp-mimetic adhesives. Results disclosed herein highlight the power of synthetic biology in producing biocompatible and highly adhesive Mfp-based materials.
Claims
1. A synthetic Mfp5 mussel foot protein, wherein the synthetic Mfp5 mussel foot protein is an oligomer protein comprising a plurality of Mfp5 repeats that are covalently linked; wherein the oligomer protein is synthesized from a mixture comprising: a Mfp5-Int.sup.N fusion protein, formed from an N-terminal split intein-fused Mfp5 repeat; and an Int.sup.C-Mfp5 fusion protein, formed from a C-terminal split intein-fused Mfp5 repeat; wherein the Mfp5-Int.sup.N fusion protein is produced by fusing an N-terminal split intein (Int.sup.N) to a first Mfp5 repeat, and wherein the Int.sup.C-Mfp5 fusion protein is produced by fusing a C-terminal split intein (Int.sup.C) to a second Mfp5 repeat; wherein the first Mfp5 repeat is selected from the group consisting of SEQ ID NO: 15 and SEQ ID NO: 16; and wherein the second Mfp5 repeat is selected from the group consisting of SEQ ID NO: 15 and SEQ ID NO: 16.
2. The synthetic Mfp5 mussel foot protein of claim 1, wherein the Int.sup.N is a Cfa.sup.N protein consisting of SEQ ID NO: 17 and Int.sup.C is a Cfa.sup.C protein consisting of SEQ ID NO: 18.
3. The synthetic Mfp5 mussel foot protein of claim 1, wherein the Mfp5-Int.sup.N fusion proteins and the Int.sup.C-Mfp5 fusion proteins are separately expressed from E. coli.
4. A method for producing a synthetic Mfp5 mussel foot protein oligomer, the method comprising: fusing an N-terminal split intein (Int.sup.N) to a first Mfp5 repeat to produce a Mfp5-Int.sup.N fusion protein, wherein the first Mfp5 repeat is selected from the group consisting of SEQ ID NO: 15 and SEQ ID NO: 16, and wherein the Int.sup.N is an N-terminal Cfa split intein; fusing a C-terminal split intein (Int.sup.C) to a second Mfp5 repeat to produce an Int.sup.C-Mfp5 fusion protein, wherein the second Mfp5 repeat is selected from the group consisting of SEQ ID NO: 15 and SEQ ID NO: 16, and wherein the Int.sup.C is a C-terminal Cfa split intein; and mixing the Mfp5-Int.sup.N fusion protein and the Int.sup.C-Mfp5 fusion protein to produce the synthetic Mfp5 mussel foot protein oligomer.
5. The method of claim 4, wherein the Mfp5-Int.sup.N fusion protein and the Int.sup.C-Mfp5 fusion protein are separately expressed from E. coli.
6. The method of claim 4, wherein the Int.sup.N is a Cfa.sup.N protein consisting of SEQ ID NO: 17 and Int.sup.C is a Cfa.sup.C protein consisting of SEQ ID NO: 18.
7. The method of claim 4, further comprising reacting the synthetic Mfp5 mussel foot protein oligomer with tyrosinase.
8. A system for producing a synthetic Mfp5 mussel foot protein adhesive, the system comprising: a Mfp5-Int.sup.N fusion protein, formed from an N-terminal split intein-fused Mfp5 repeat; and an Int.sup.C-Mfp5 fusion protein, formed from a C-terminal split intein-fused Mfp5 repeat; wherein the Mfp5-Int.sup.N fusion protein is produced by fusing an N-terminal split intein (Int.sup.N) to a first Mfp5 repeat, and wherein the Int.sup.C-Mfp5 fusion protein is produced by fusing a C-terminal split intein (Int.sup.C) to a second Mfp5 repeat; wherein the first Mfp5 repeat is selected from the group consisting of SEQ ID NO: 15 and SEQ ID NO: 16; and wherein the second Mfp5 repeat is selected from the group consisting of SEQ ID NO: 15 and SEQ ID NO: 16.
9. The system of claim 8, wherein the Int.sup.N is a Cfa.sup.N protein consisting of SEQ ID NO: 17 and Int.sup.C is a Cfa.sup.C protein consisting of SEQ ID NO: 18.
10. The system of claim 8, wherein the Int.sup.N is a Cfa.sup.N protein consisting of SEQ ID NO: 17 and Int.sup.C is a Cfa.sup.C protein consisting of SEQ ID NO: 18.
11. The system of claim 8, wherein the Mfp5-Int.sup.N fusion protein and the Int.sup.C-Mfp5 fusion protein are separately expressed from E. coli.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The embodiments described herein may be better understood by referring to the following description in conjunction with the accompanying drawings.
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
DETAILED DESCRIPTION OF THE DISCLOSURE
(22) Using synthetic DNA with codons optimized for E. coli expression, Mytilus galloprovincialis Mfp5 were designed together with Mfp5 oligomers containing two or three consecutive Mfp5 sequences (named Mfp5.sup.2 and Mfp5.sup.3) and cloned them into standardized expression vectors (
(23) Multiple Mfps from different mussel species exist, each having a slightly different amino acid sequence. In some embodiments, different Mfps (e.g., Mfp1, Mfp2, Mfp3, Mfp4, Mfp5, etc.) are utilized in the methods disclosed herein to produce Mfp oligomers with any number of repeats. Further, as described herein, a Mfp oligomer can be made from several different combinations via split intein reactions. For example, in some embodiments Mfp5.sup.3 is made from Mfp5.sup.2-Int.sup.N and Int.sup.C-Mfp5.sup.1. In other embodiments, Mfp5.sup.3 is made from Mfp5.sup.1-Int.sup.N and Int.sup.C-Mfp5.sup.2.
(24) In some embodiments, both Mfp5.sup.1 and Mfp5.sup.2 are produced in E. coli and purified using affinity chromatography to purities of 99.9% and 92.2%, respectively (
(25) SIs are auto-catalytic peptides that catalyze the spontaneous splicing-ligation reactions between two SI-fused target proteins, assembling the two target proteins covalently in trans (
(26) MALDI-TOF spectra of unmodified Mfp5.sup.1 and Mfp5.sup.1.sub.DOPA are shown in
(27) MALDI-TOF spectra of unmodified Mfp5.sup.2 and Mfp5.sup.2.sub.DOPA are shown in
(28) Peak force tapping atomic force microscopy (PFT-AFM) was used to examine asymmetric adhesive and cohesive characteristics of the Mfp5 proteins between a colloidal probe and a protein-adsorbed mica surface under aqueous conditions (see
(29)
(30)
(31) A similar positive correlation was also observed using synthetic poly-dihydroxystyrene-co-styrene polymers. From the retract curves, the recovery distance was also measured, which is defined as the length between the point where maximum adhesion force is achieved and the point where the probe is re-established to the initial zero adhesion baseline.
(32) Related to recovery distance, another critical material property for adhesives is adhesion work, also known as peel force or work of separation. In
(33) The longest curing time that allowed a reliable approach-retract curve measurement for Mfp5.sup.3.sub.DOPA was 200 s, because longer cure times produced adhesion strengths that were too high and prevented recovery of retract curves, even at the maximum AFM probe separation of our equipment (1 μm). At a cure time of 200 s, Mfp5.sup.3.sub.DOPA exhibited an adhesion force of ˜201 nN μm.sup.−1 and an adhesion work of ˜68 fJ μm.sup.−1, values which are higher than previously reported for Mfp-mimetic adhesives (
(34) Protein-protein interactions confer high cohesion, and thus overall adhesion between two surfaces. To analyze protein absorption to one surface, the Mfps disclosed herein were subjected to steady shear-flows in phosphate-buffered saline (PBS) buffer and quantified adsorbed protein amounts using a quartz crystal microbalance with dissipation monitoring (QCM-D). According to polymer physics, these conditions should lead to physical chain disentanglement, resulting in a thin and disperse protein layer from a soluble protein solution.
(35)
(36) In accordance with the present disclosure, E. coli was successfully engineered to produce Mfp5-based adhesive proteins with strong underwater adhesion capabilities. Under the experimental conditions, Mfp5.sup.3.sub.DOPA displayed high adhesion forces and adhesion works that were comparable to or higher than previously reported Mfp-mimetic adhesives. With a longer curing time, the adhesion force of Mfp5.sup.3 could be even higher than what was measured, but was beyond the detection limit of the method disclosed herein. Such adhesive proteins can potentially be used to replace natural Mfps in a variety of applications, with even better performance due to their stronger underwater adhesion. The results demonstrated that protein-protein cohesive interactions play an important role in determining both adhesion force and adhesion work. Positive correlations were observed between Mfp5's molecular weight and its measured adhesion force and adhesion work. These correlations indicate that producing high molecular weight proteins might be a natural strategy to obtain extensive cohesive interactions, as evidenced in Mfp2 and Mfp4, which have the highest molecular weights within the byssal plaque core (45 kDa and 90 kDa, respectively) among all Mfps and play cohesive roles in mussel plaques. The observed molecular weight dependent adhesion can be used in design rules to guide future engineering efforts, creating even stronger underwater adhesives.
(37) In some embodiments, a Mfp is disclosed comprising a full length polypeptide sequence from a mussel species, a partial polypeptide sequence from a mussel species, or combinations thereof. In some embodiments, the Mfp is selected from the group consisting of Mfp1, Mfp2, Mfp3, Mfp4, and Mfp5.
(38) In some embodiments, the Mfp is an oligomer protein comprising a plurality of repeats of the Mfp within one protein molecule. In some embodiments, the oligomer protein is synthesized from a mixture of a Mfp-Int.sup.N fusion protein and an Int.sup.C-Mfp fusion protein. In some embodiments, the Mfp-Int.sup.N fusion protein is produced by fusing an N-terminal split intein (Int.sup.N) to a first Mfp repeat, and the Int.sup.C-Mfp fusion protein is produced by fusing a C-terminal split intein (Int.sup.C) to a second Mfp repeat.
(39) In some embodiments, the first Mfp repeat is Mfp5.sup.2, and the second repeat is Mfp5.sup.1. In some embodiments, the oligomer protein is Mfp5.sup.3, the Mfp-Int.sup.N fusion protein is Mfp5.sup.2-Int.sup.N, and the Int.sup.C-Mfp fusion protein is Int.sup.C-Mfp5.sup.1.
(40) In some embodiments, the first Mfp repeat is Mfp5.sup.1, and the second repeat is Mfp5.sup.2. In some embodiments, the oligomer protein is Mfp5.sup.3, the Mfp-Int.sup.N fusion protein is Mfp5.sup.1-Int.sup.N, and the Int.sup.C-Mfp fusion protein is Int.sup.C-Mfp5.sup.2. In some embodiments, the Int.sup.N and Int.sup.C are from a Cfa split intein (SI). In some embodiments, the Mfp-Int.sup.N fusion protein and the Int.sup.C-Mfp fusion protein are separately expressed from E. coli.
EXPERIMENTAL SECTION
(41) Chemicals and Reagents. Unless otherwise noted, all chemicals and reagents were obtained from Millipore Sigma (Saint Louis, Mo., USA). Plasmid purification and gel extraction kits were purchased from iNtRON Biotechnology (Seoul, South Korea). FastDigest restriction enzymes and T4 DNA ligase were purchased from Thermo Fisher Scientific (Austin, Tex., USA) and were used according to the suggested protocols from the manufacturer.
(42) Plasmid Construction. A BioBrick system was used to facilitate cloning of two-part split intein-fused material protein domains. This system takes the following conditions into account: (a) split intein genes, ribosome binding sites (RBS), 5′-untranslated regions (5′-UTR), antibiotic resistance markers, promoters, and origins of replication are flanked with appropriate restriction sites that allow them to be easily swapped with other corresponding genes of interest, (b) restriction sites that exist within open reading frames introduce amino acids that are small, flexible, and are not expected to change the protein behavior and dynamics, (c) restriction sites are all distinct with respect to one another to facilitate the assembly of complete plasmids in one pot and in one step. All restriction sites used are schematically mapped out in
(43) E. coli strain MDS42pdu was used as a host strain for cloning of all genes and plasmids in this study. The amino acid sequences of Mytilus galloprovincialis Mfp5.sup.1, Mfp5.sup.2, and Mfp5.sup.3 were codon-optimized for E. coli expression using the Gene Designer 2.0 software package (DNA 2.0 Inc.). All designed DNA sequences were chemically synthesized by Integrated DNA Technologies Inc. (San Jose, Calif., USA) (Table 1). These synthetic genes were then amplified using polymerase chain reaction (PCR) with corresponding forward and reverse primers as listed in Table 2. All mfp5 genes were amplified with BglII and BamHI sites on the 5′ and 3′ ends, respectively, for insertion into the pE7a-AKTK-H6 backbone (AKTK is SEQ ID NO: 19) containing the same sites, which was previously PCR amplified from plasmid pE7a-GFP1, with the addition of short coding sequences, 5′-ATGGCTAAGACTAAACATCATCACCATCATCAC-3′(SEQ ID NO: 1), translating to N′-MAKTK-H6-C′ (SEQ ID NO: 2) (
(44) To construct plasmids containing split intein-fused Mfp5 proteins, the amino acid sequences of Cfa N- and C-inteins (Cfa.sup.N and Cfa.sup.C, respectively) were first codon-optimized and chemically synthesized using the same method as described above. The synthesized gene containing the Cfa.sup.N domain, which is flanked by NdeI and KpnI sites, and the Cfa.sup.C domain, which is flanked by Kpn2I and XhoI sites, were digested directly from the synthesized DNA. The mfp52 and mfp51 genes were amplified with corresponding primers (Table 2) containing KpnI and Kpn2I restriction sites. Amplified mfp52 and mfp51 fragments were digested (
(45) Expression of Recombinant Proteins. E. coli strain BL21(DE3) (Thermo Fisher Scientific, Waltham, Mass.) was used as a host strain for expression of Mfps. E. coli strains containing the plasmids listed in Table 3 were cultured in shake flasks with Luria-Bertani (LB) broth containing 10 g/L tryptone, g/L sodium chloride, and 5 g/L yeast extract with the appropriate antibiotic (100 μg/mL ampicillin). Fresh transformants were cultivated overnight in 50 mL LB medium at 37° C. Overnight cultures were then used to inoculate 1 L fresh LB medium in Erlenmeyer flasks at an initial OD600=0.08. Cultures were grown at 37° C. with shaking to OD600=0.6, then induced by addition of 500 μM (for Mfp5.sup.1 and Cfa.sup.C-Mfp5.sup.1) or 50 μM (Mfp5.sup.2 and Mfp5.sup.2-Cfa.sup.N) IPTG. The culture was further cultivated at 37° C. at 250 rpm for another 5-7 hours. Cells were harvested by centrifugation at 4,500×g for 20 min at 4° C. Centrifuged cell pellets were either directly extracted or stored at −80° C. until needed.
(46) Protein Purification. For Mfp5.sup.1 and Mfp5.sup.2, cell pellets were resuspended in 10 mL of guanidine lysis buffer (6 M guanidine hydrochloride (BioBasic Inc., Amherst, N.Y., USA), 50 mM potassium phosphates, and 300 mM sodium chloride at pH 7.4) per gram of wet cells and lysed by agitation at 250 rpm. The lysates were centrifuged at 20,000×g for 20 min at 18° C. To reduce the viscosity, collected lysates were further sonicated on ice (to maintain a roughly ambient temperature) for 30 minutes with a QSonica probe sonicator using 5 seconds on/5 seconds off cycles. The lysates were filtered through 0.2 μm filter membranes. Both proteins were purified using an AktaPure Fast Protein Liquid Chromatograph (FPLC, GE Healthcare Inc., Chicago, Ill., USA) equipped with a 5 mL nickel affinity chromatography column (GE Healthcare). The column was pre-equilibrated with guanidine lysis buffer followed by sample loading. After washing with 5-10 column volumes (CVs) of guanidine wash buffer (6 M guanidine hydrochloride, 50 mM potassium phosphates, 300 mM sodium chloride, and 50 mM imidazole at pH 7.4), proteins were eluted and fractionated with 5-10 CVs of guanidine elution buffer (6 M guanidine hydrochloride, 50 mM potassium phosphates, 300 mM sodium chloride, and 250 mM imidazole at pH 7.4). Purified Mfp5.sup.1 and Mfp5.sup.2 proteins were examined by SDS-PAGE as shown in
(47) Split-Intein Mediated Ligation and Purification of Mfp5.sup.3. Cell pellets containing Cfa.sup.C-Mfp5.sup.1 and Mfp5.sup.2-Cfa.sup.N-His10 fusion proteins were separately resuspended in 10 mL of urea lysis buffer (8 M urea, 100 mM sodium phosphates, and 300 mM sodium chloride at pH 7.4) per gram of wet cells and lysed by agitation at 250 rpm overnight. The lysates were centrifuged at 20,000×g for 20 min at 18° C. Clear cell lysate was then mixed at a final reactant ratio of 4:1 (Cfa.sup.C-Mfp5.sup.1: Mfp5.sup.2-Cfa.sup.N-His10) based on densitometric analysis of Coomassie Blue-stained SDS-PAGE gels. The excess Int.sup.C-Mfp5.sup.1 does not contain His-tag, thus can be easily separated from the ligated product. The lysate mixture was stirred at 30° C. for 8 hours. The mixed lysate was then filtered through a 0.2 μm filter membrane and purified by nickel affinity chromatography as described above.
(48) Post-Translational Modification with Tyrosinase. Purified protein solutions were first dialyzed against 100 mM sodium acetates buffer at pH 5.5 using a 10 kDa molecular weight cut off (MWCO) dialysis membrane (Thermo Fisher Scientific). Dialyzed proteins were then diluted to a concentration of 4 mg/mL in 100 mM sodium acetates buffer at pH 5.5 with 100 mM ascorbic acid and filtered. Tyrosinase was added to a final concentration of 250 U mL.sup.−1, and the mixture was incubated at 37° C. with agitation at 250 rpm for 30 minutes. After the reaction, the solution was filtered, and the enzyme activity in the flow-through was quenched by adding 0.2 mL of 6 N HCl per mL of reaction. The solution was filtered a final time and then was dialyzed extensively in 5% acetic acid at 4° C. and lyophilized.
(49) Matrix-Assisted Laser Desorption/Ionization-Time of Flight (MALDI-TOF) Mass Spectrometry (MS) Analysis. Purified proteins at a final concentration of ˜10 μM in 0.1 v/v %. trifluoroacetic acid (TFA) solution were mixed with dihydroxyacetophenone (DHAP) and spotted on a stainless steel MALDI target plate. Samples were analyzed using a Shimadzu AXIMA Resonance MALDI-TOF Mass Spectrometer (Shimadzu, Columbia, Md., USA) at the Saint Louis University Protein Core Facility. Positive-ion mass spectra analysis was conducted in standard linear mode with a laser power/acceleration voltage of 60-100 volts. The quadruple ion trap available for MS analysis was limited by a maximum m/z ratio of 15 kDa. For Mfp5.sup.1.sub.DOPA with an expected molecular weight of 10978.02 Da, the single- and double-charged state peaks could be assigned at m/z values of 5489.01 and 3659.34, respectively (
(50) AFM Colloidal Probe Analysis. Peak Force Tapping-Atomic Force Microscopy (PFT-AFM) was used to characterize the protein samples by measuring adhesion force and adhesion works using a colloidal probe AFM cantilever (
(51) For sample preparation, 10 μL of protein solution in PBS buffer was pipetted on a mica surface and set quiescently for 30 minutes. After extensive washing, the mica surface was mounted on the AFM stage and the adsorbed protein was probed under a buffered condition (100 mM sodium phosphates pH 7.4). Bovine serum albumin (BSA) and unmodified Mfp5.sup.1 were used as controls and measured under the same condition.
(52) Approach curves were fitted to the Alexander-de Gennes (AdG) model:
(53)
where F is the measured adhesion force, Z is the measured separation distance, k.sub.B is the Boltzmann constant, T is the absolute ambient temperature (˜298 K), R is the contact probe radius (5 μm). L, h, and s are fitted parameters that represent equilibrium film thickness, the offset distance that considers material compressibility upon maximum probe contact, and the average distance between occupied sorption sites, respectively. This model has been used to describe the interaction of random-coil polymers and proteins on surfaces. The data was fit using MATLAB. Data far beyond the fitted equilibrium thicknesses were not included in order to prevent misfitting at larger separation distances. All fitted data had correlation coefficients (r.sup.2) of at least 99%.
(54) The fitted equilibrium thickness (L) parameters were further compared with both the radii of gyration assuming random-coil configurations, <R>, and the end-to-end distances when the proteins are fully stretched out as rigid rods, L.sub.r. These distances were calculated using the formulas:
<R>=√{square root over (N<l>.sup.2)} (2)
and
L.sub.r=Nl (3)
where N is the number of amino acids in the protein and 1 is the length of one amino acid (3.5 angstroms).
(55) Following probe approaching, the probe was cured on the protein sample for a delay time varying from 0 to 50 seconds for most measurements. The adhesion force was taken at the point of maximum force (most negative force). The adhesion work was calculated as the area “under” the retraction curve, relative to the zero nN baseline (
(56) For the strongest oligomer tested, Mfp5.sup.3.sub.DOPA, adhesion forces with each respective cure time tested were averaged and fit to a logistic model of the form:
(57)
where F is the measured adhesion force, t is the cure time, and A, B, and Q are fitted constants. This model was appropriate given the assumption that an infinitesimally small cure time would result in zero adhesion force, while an infinite cure time would result in a theoretical maximum adhesion force (represented by the fit constant ‘A’) upon complete underwater curing of the material. The log-log scaling of the adhesion force and cure time axes allowed for the fitted model to give rise to an unpatterned residual, which indicates a high quality of fitting (
(58) Quartz Crystal Microbalance with Dissipation Monitoring (QCM-D) Analysis. Gold-coated quartz crystal sensors (QSX 301, Biolin Scientific, Gothenburg, Sweden) were used for QCM analysis. The sensors were cleaned by UV irradiation for 15 minutes, heated in a mixture of ammonia (25%) and hydrogen peroxide (30%) at 75° C. for 5 minutes, then thoroughly rinsed with distilled water, dried with N2, and subjected to 10 minutes of UV irradiation before being mounted in the QCM flow modules (Biolin Scientific). PBS buffer carrier solution was flowed through an injector valve into the flow cell modules containing the quartz sensors at a flow rate of 0.1 mL min.sup.−1 until stable baselines were achieved. Protein solutions were prepared to final concentrations of 10 μM (or 2 mg mL.sup.−1 in the case of BSA) in PBS buffer (100 mM sodium phosphates pH 7.4) and flowed through the flow cell at a flow rate of 10 μL min.sup.−1 for 60 minutes. PBS buffer carrier solution was flowed through to wash unbound protein off the quartz sensors until the frequency shift was stable (
(59)
where Δm is change in mass, Δf is change in crystal resonance frequency, A is the active area of the crystal between electrodes, μ.sub.quartz is the shear modulus of the quartz crystal, ρ.sub.quartz is the density of the quartz crystal, and F.sub.quartz is the reference frequency. This equation can be simplified to:
Δm=−C.Math.Δf (6)
where C (=17.7 ng cm.sup.2 Hz.sup.−1) is a constant that combines all constants relating to using gold-coated quartz sensors for deposition with the QCM instrument utilized herein.
(60) When introducing elements of the present invention or the preferred embodiment(s) thereof, the articles “a”, “an”, “the”, and “said” are intended to mean that there are one or more of the elements. The terms “comprising,” “including”, and “having” are intended to be inclusive and mean that there may be additional elements other than the listed elements.
(61) As various changes could be made in the above constructions without departing from the scope of the invention, it is intended that all matter contained in the above description or shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.
(62) TABLE-US-00001 TABLE 1 Coding sequences Name Sequence Description mfp5.sup.1 GCTAAGACTAAACATCATCACCATCACCACGGT Coding sequence encoding (SEQ GGCGGTGGCAGCAGATCTGGATCTAGCAGCGAAG Mfp5.sup.1 protein, which ID NO: AGTATAAAGGTGGTTACTACCCGGGCAACACTTA includes coding sequences for 3) CCACTACCATAGCGGCGGCTCCTACCACGGTTCC the AKTK (SEQ ID NO: 19) GGCTATCACGGTGGCTACAAAGGTAAATACTACG expression tag, GCAAAGCGAAAAAGTACTACTACAAATATAAAA His.sub.6 affinity tag, and linker ACTCCGGCAAATACAAGTATCTGAAGAAGGCTC & BioBrick cut sites GTAAATATCACCGTAAAGGTTACAAAAAGTATTA CGGCGGTGGTTCTTCTGGATCC mfp5.sup.2 GCTAAGACTAAACATCATCACCATCACCACGGT Coding sequence encoding (SEQ GGCGGTGGCAGCAGATCTTCAAGTGAAGAATATA Mfp5.sup.2 protein, which ID NO: AGGGCGGCTATTACCCTGGGAACACCTACCATTA includes coding sequences for 4) CCACTCGGGTGGTTCGTATCATGGTTCCGGCTAC the AKTK (SEQ ID NO: 19) CATGGTGGTTACAAGGGGAAGTATTATGGTAAA expression tag, GCGAAGAAATACTATTACAAATACAAGAATTCTG His.sub.6 affinity tag, and linker GAAAGTATAAATATCTGAAAAAAGCTCGCAAAT & BioBrick cut sites ATCATCGTAAAGGATATAAAAAATACTATGGAG GTGGCAGTAGCAGTTCCGAAGAATATAAAGGGG GTTACTACCCTGGAAACACTTATCATTATCATAG TGGAGGTTCTTACCATGGCAGTGGATATCACGGC GGTTATAAGGGTAAGTATTATGGGAAAGCCAAA AAGTATTACTATAAGTACAAGAATTCTGGCAAAT ACAAGTACTTGAAGAAGGCTCGTAAGTACCACC GCAAAGGTTACAAGAAATACTATGGAGGCGGTT CAAGTCTCGAG Cfa.sup.N AAAAAACATATGGTCAAGATCATTAGTCGTAAGA Codon optimized N-terminal (SEQ GTCTGGGCACTCAAAACGTCTACGATATTGGAGT SI.sup.C coding sequence for ID NO: AGAAAAAGATCATAATTTTTTGCTGAAGAATGGG mfp5.sup.3 assembly with mfp5 5) CTGGTGGCCTCTAACTGCTTCAACGGTACC sequence, which includes the coding sequence for SI.sup.N native extein amino acids & KpnI restriction site Cfa.sup.C TCCGGAGCAGAATATTGCCTGTCTTACGACACA Codon optimized C-terminal (SEQ GAGATTCTGACCGTTGAATATGGATTCCTTCCTA SI.sup.N coding sequence for ID NO: TCGGTAAGATCGTGGAGGAACGGATTGAATGCA mfp5.sup.3 assembly with mfp5.sup.2 6) CAGTCTATACGGTAGATAAAAATGGCTTTGTGTA sequence, which includes the TACACAACCTATTGCTCAGTGGCATAACCGGGGA coding sequence for SI.sup.C GAACAGGAAGTTTTCGAATACTGCTTAGAAGACG native extein amino acids & GTTCGATTATCCGTGCAACGAAAGATCACAAATT Kpn2I restriction site TATGACGACCGACGGTCAGATGTTACCGATTGAT GAGATTTTCGAACGGGGGTTAGACCTGAAACAA GTTGATGGTTTGCCGTAAGGATCCAAAAAA
(63) TABLE-US-00002 TABLE 2 Primers Name Sequence Description BglII- AAAAAAAGATCTAGCAGCGAAGAGTAT Forward primer for amplification of Mfp5.sup.1-F AAAGGTG mfp5.sup.1 with BglII overhang for (SEQ ID insertion into pE7a-AKTK-H6 NO: 7) backbone (AKTK is SEQ ID NO: 19) BamHI- AAAAAAGGATCCAGAAGAACCACCGCC Reverse primer for amplification of Mfp5.sup.1-R G mfp5.sup.1 with BamHI overhang for (SEQ ID insertion into pE7a-AKTK-H6 NO: 8) backbone (AKTK is SEQ ID NO: 19) BglII- AAAAAAAGATCTTCAAGTGAAGAATAT Forward primer for amplification of Mfp5.sup.2-F AAGGGCGGCTAT mfp5.sup.2 with BglII overhang for (SEQ ID insertion into pE7a-AKTK-H6 NO: 9) backbone (AKTK is SEQ ID NO: 19) XhoI- AAAAAACTCGAGACTTGAACCGCCTCC Reverse primer for amplification of Mfp5.sup.2-R ATAGTATTTCTTG mfp5.sup.2 with XhoI overhang for (SEQ ID insertion into pE7a-AKTK-H6 NO: 10) backbone (AKTK is SEQ ID NO: 19) KpnI- AAAAAAGGTACCAGCAGCGAAGAGTAT Forward primer for amplification of Mfp5.sup.1-F AAAGGTGGTTACTACC mfp5.sup.1 with KpnI overhang for (SEQ ID insertion into pE7a-SI backbone NO: 11) Kpn2I- AAAAAATCCGGAAGAAGAACCACCGCC Reverse primer for amplification of Mfp5.sup.1-R GTAATAC mfp5.sup.1 with Kpn2I overhang for (SEQ ID insertion into pE7a-SI backbone NO: 12) KpnI- AAAAAAGGTACCTCAAGTGAAGAATAT Forward primer for amplification of Mfp5.sup.2-F AAGGGCGGCTATTACCC mfp5.sup.2 with KpnI overhang for (SEQ ID insertion into pE7a-SI backbone NO: 13) Kpn2I- AAAAAATCCGGAACTTGAACCGCCTCC Reverse primer for amplification of Mfp5.sup.2-R ATAGTATTTCTTGTAAC mfp5.sup.2 with Kpn2I overhang for (SEQ ID insertion into pE7a-SI backbone NO: 14)
(64) TABLE-US-00003 TABLE 3 Plasmids Name ORI Promoter Resistance Gene pE7a ColE1 P.sub.T7 Amp.sup.R N/A pE7a-mfp5.sup.1 ColE1 P.sub.T7 Amp.sup.R mfp5.sup.l pE7a-mfp5.sup.2 ColE1 P.sub.T7 Amp.sup.R mfp5.sup.2 pE7a-mfp5.sup.2-Cfa.sup.N ColE1 P.sub.T7 Amp.sup.R mfp5.sup.2 + Cfa.sup.C pE7a-Cfa.sup.C-mfp5.sup.1 ColE1 P.sub.T7 Amp.sup.R Cfa.sup.N + mfp5.sup.1
(65) TABLE-US-00004 TABLE 4 Strains Name Genotype MDS42pdu MDS42 polB dinB umuDC BL21(DE3) F.sup.− ompT gal dcm lon hsdS.sub.B(r.sub.B.sup.−m.sub.B.sup.−) λ(DE3 [lacI lacUV5-T7p07 indl sam7 nin5]) [malB.sup.+].sub.K-12(λ.sup.S) sM-Mfp5 MDS42pdu containing pE7a-mfp-5 sM-Mfp5.sup.2 MDS42pdu containing pE7a-mfp-5.sup.2 sM-Mfp-5.sup.2-Cfa.sup.N MDS42pdu containing pE7a-mfp-5.sup.2-Cfa.sup.N sM-Cfa.sup.C-mfp-5 MDS42pdu containing pE7a-Cfa.sup.C-mfp-5 sB-Mfp5 BL21(DE3) containing pE7a-mfp-5 sB-Mfp5.sup.2 BL21(DE3) containing pE7a-mfp-5.sup.2 sB-Mfp-5.sup.2-Cfa.sup.N BL21(DE3) containing pE7a-mfp-5.sup.2-Cfa.sup.N sB-Cfa.sup.C-Mfp-5 BL21(DE3) containing pE7a-Cfa.sup.C-mfp-5
(66) TABLE-US-00005 TABLE 5 Protein sequences Name Sequence Description Mfp5.sup.1 AKTKHHHHHHGGGGSRSGSSSEEYKG Mfp5 protein, which (SEQ ID GYYPGNTYHYHSGGSYHGSGYHGGYK includes the AKTK NO: 15) GKYYGKAKKYYYKYKNSGKYKYLKK (SEQ ID NO: 19) ARKYHRKGYKKYYGGGSSGS expression tag, His.sub.6 affinity tag, and linker & BioBrick cut sites Mfp5.sup.2 AKTKHHHHHHGGGGSRSSSEEYKGGY Mfp5.sup.2 protein, which (SEQ ID YPGNTYHYHSGGSYHGSGYHGGYKGK includes the AKTK NO: 16) YYGKAKKYYYKYKNSGKYKYLKKAR (SEQ ID NO: 19) KYHRKGYKKYYGGGSSSSEEYKGGYY expression tag, His.sub.6 PGNTYHYHSGGSYHGSGYHGGYKGKY affinity tag, and linker YGKAKKYYYKYKNSGKYKYLKKARK & BioBrick cut sites YHRKGYKKYYGGGSSLE Cfa.sup.N AEYCLSYDTEILTVEYGFLPIGKIVEERI C-terminal SI.sup.N sequence (SEQ ID ECTVYTVDKNGFVYTQPIAQWHNRGE for mfp5.sup.3 assembly with NO: 17) QEVFEYCLEDGSIIRATKDHKFMTTDG mfp5.sup.2, which includes QMLPIDEIFERGLDLKQVDGLP the SI.sup.N native extein amino acids Cfa.sup.C VKIISRKSLGTQNVYDIGVEKDHNFLLK N-terminal SI.sup.C sequence (SEQ ID NGLVASNCFN for mfp5.sup.3 assembly with NO: 18) mfp5, which includes the SI.sup.C native extein amino acids