Systems and methods for increased production of recombinant biopolymers via genome engineering and downregulation of basal expression
12391912 ยท 2025-08-19
Assignee
Inventors
- Alexander Joseph CONNOR (Slingerlands, NY, US)
- Mattheos KOFFAS (Niskayuna, NY, US)
- Runye ZHA (Troy, NY, US)
- Derek NELSON (Troy, NY, US)
Cpc classification
C12N15/70
CHEMISTRY; METALLURGY
International classification
C12N15/70
CHEMISTRY; METALLURGY
Abstract
Recombinant E. coli strains and synthetic protein sequence designs are leveraged for production of disordered polypeptides such as spidroins and elastin-like peptides (ELPs). These disordered polypeptides, the high-titer production of which has proven difficult, include repeating structural motifs from a small selection of amino acid residues, resulting in lack of well-defined tertiary and quaternary structure. The recombinant E. coli include expression vectors with genes encoding for the disordered polypeptide product. Expression of these genes is controlled by a promoter that downregulates and substantially inhibits basal expression in the recombinant bacteria. Further, the recombinant bacteria include mutations to one or more stress-response genes from wild-type E. coli, such as yggw, yedv, yedw, yedy, spec, speb, uspc, hcha, loip, mltc, envz, ompr, yhgf, or hupb. The recombinant E. coli enable production of high titers of disordered protein product while minimizing the toxic effects thereof on the host.
Claims
1. A recombinant bacteria for producing polypeptides, comprising: an E. coli strain, wherein the E. coli strain includes: one or more exogenous nucleotide sequences encoding a disordered polypeptide, the disordered peptide including a primary sequence selected from the group consisting of: (a) GPGQQ AAAAA GPGQQ GPGQQ GPGQQ GPGEQ GPGSG (SEQ. ID NO.: 1), (b) GPGQQ AAAAA AAAAA GPGQQ GPGQQ GPGEQ GPGSG (SEQ. ID NO.: 2), (c) (VPGAGVPGAGVPGAGVPGAGVPGYGVPGAGVPGAGVPGAGVPGAG VPGYG).sub.2 GRGDS (SEQ. ID NO.: 3), and (d) combinations thereof; one or more mutations to stress-response genes from wild-type E. coli B; and at least one promoter regulating the expression, in the E. coli strain, of the one or more exogenous nucleotide sequences; wherein basal expression of the one or more exogenous nucleotide sequences is inhibited such that incubation in an about 0.6 to about 0.8 OD600 culture of the E. coli strain for about 4 hours produces less than about 7 mg/L disordered polypeptide in the culture.
2. The recombinant bacteria according to claim 1, wherein the E. coli strain includes a SoluBL21 genome and a pLysS plasmid.
3. The recombinant bacteria according to claim 1, wherein the E. coli strain includes one or more mutations in at least one of the following genes: yggw, yedv, yedw, yedy, spec, speb, uspc, hcha, loip, mltc, envz, ompr, yhgf, and hupb.
4. The recombinant bacteria according to claim 1, wherein the disordered polypeptide includes: between about 30% and about 40% glycine residues; and between about 10% and about 20% proline residues.
5. The recombinant bacteria according to claim 1, wherein the disordered polypeptide includes recombinant spidroins, elastin-like peptides (ELPs), or combinations thereof.
6. The recombinant bacteria according to claim 1, wherein the at least one promoter regulating the expression of the one or more exogenous nucleotide sequences includes a T7 polymerase promoter, and wherein the basal expression of the one or more exogenous nucleotide sequences is inhibited via production by the E. coli strain of T7 lysozyme.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The drawings show embodiments of the disclosed subject matter for the purpose of illustrating the invention. However, it should be understood that the present application is not limited to the precise arrangements and instrumentalities shown in the drawings, wherein:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
DETAILED DESCRIPTION
(18) Some embodiments of the present disclosure are directed to a recombinant bacteria for producing polypeptides. In some embodiments, the recombinant bacteria is produced by making one or more genetic modifications to a natural or engineered strain of bacteria. In some embodiments, the recombinant bacteria is produced by making one or more genetic modifications to a strain of E. coli, i.e., is a modified E. coli strain. In some embodiments, the recombinant bacteria is a modified E. coli B strain. In some embodiments, the recombinant bacteria is a modified E. coli SoluBL21 strain. As will be discussed in greater detail below, the recombinant bacteria includes one or more exogenous genes. In some embodiments, the one or more exogenous genes are inserted into the endogenous genetic material of the recombinant bacteria. In some embodiments, the recombinant bacteria includes one or more exogenous plasmids. In some embodiments, the one or more exogenous genes are included in one of the exogenous plasmids. In some embodiments, one or more of the exogenous plasmids is an expression vector. The expression vector is any suitable vector compatible with the recombinant bacterial host and capable of facilitating expression of the one or more exogenous genes included therein. In some embodiments, the expression vector is pET-19b.
(19) As discussed above, the recombinant bacteria includes one or more exogenous nucleotide sequences. In some embodiments, the recombinant bacteria produce at least one exogenous polypeptide via expression of the one or more exogenous nucleotide sequences. In some embodiments, the recombinant bacteria produce two or more exogenous polypeptides. In some embodiments, the recombinant bacteria produce two or more exogenous polypeptides simultaneously. In some embodiments, the exogenous polypeptides include wild-type polypeptides, recombinant polypeptides, or combination thereof.
(20) In some embodiments, the one or more exogenous nucleotides encode a disordered polypeptide. As used herein, the term disordered polypeptide is used to refer to a polypeptide with repeating structural motifs made up of a small selection of amino acid residues, resulting in lack of well-defined tertiary and quaternary structure in the polypeptide. In some embodiments, the disordered polypeptide includes between about 20% and about 50% glycine residues. In some embodiments, the disordered polypeptide includes between about 25% and about 45% glycine residues. In some embodiments, the disordered polypeptide includes between about 30% and about 40% glycine residues. In some embodiments, the disordered polypeptide includes between about 5% and about 25% proline residues. In some embodiments, the disordered polypeptide includes between about 10% and about 20% proline residues. In some embodiments, the disordered polypeptide includes more than about 15% glutamine residues. In some embodiments, the disordered polypeptide includes more than about 20% glutamine residues. In some embodiments, the disordered polypeptide includes more than about 10% valine residues. In some embodiments, the disordered polypeptide includes more than about 15% valine residues. In some embodiments, the disordered polypeptide includes above about 0% and below about 5% tyrosine residues. Structurally, in some embodiments, the disordered polypeptide includes above about 0% and below about 35% beta sheets. In some embodiments, the disordered polypeptide includes between about 5% and about 30% beta sheets. In some embodiments, the disordered polypeptide includes between about 5% and about 20% beta sheets. In some embodiments, the disordered polypeptide includes between about 15% and about 30% beta sheets. In some embodiments, the disordered polypeptide includes between about 20% and about 75% alpha helices. In some embodiments, the disordered polypeptide includes between about 25% and about 70% alpha helices. In some embodiments, the disordered polypeptide includes between about 25% and about 50% alpha helices. In some embodiments, the disordered polypeptide includes between about 50% and about 55% alpha helices. In some embodiments, the disordered polypeptide includes between about 45% and about 70% alpha helices. In some embodiments, the disordered polypeptide includes between 0% and about 55% random coils. In some embodiments, the disordered polypeptide includes between 0% and about 50% random coils. In some embodiments, the disordered polypeptide includes between 0% and about 5% random coils. In some embodiments, the disordered polypeptide includes between 25% and about 50% random coils. In some embodiments, the disordered polypeptide includes above about 0% and below about 50% beta-turns. In some embodiments, the disordered polypeptide includes between about 5% and about 45% beta-turns. In some embodiments, the disordered polypeptide includes between about 10% and about 45% beta-turns. In some embodiments, the disordered polypeptide includes between about 20% and about 30% beta-turns. In some embodiments, the disordered polypeptide includes recombinant spidroins, elastin-like peptides (ELPs), or combinations thereof.
(21) In some embodiments, the disordered polypeptide includes a primary sequence including GPGQQ AAAAA GPGQQ GPGQQ GPGQQ GPGEQ GPGSG (SEQ. ID NO.: 1). In some embodiments, the disordered polypeptide includes a primary sequence including GPGQQ AAAAA AAAAA GPGQQ GPGQQ GPGEQ GPGSG (SEQ ID. NO.: 2). In some embodiments, the disordered polypeptide includes ELP constructs combining one or more native elastin motifs of VPGAG with specific placement of tyrosine and GRGDS domains for targeted neurological drug delivery. In some embodiments, the disordered polypeptide includes a primary sequence including: (VPGAGVPGAGVPGAGVPGAGVPGYGVPGAGVPGAGVPGAGVPGAGVPGYG)2 GRGDS (SEQ. ID NO.: 3). In some embodiments, the disordered polypeptide is based on two or more distinct and purposefully designed monomers. In some embodiments, the disordered polypeptide is based on two or more distinct and purposefully designed monomers which are duplicated one or more times. In some embodiments, the disordered polypeptide includes a 2, 4, 8, 16, 32, or 64mer of the primary sequence.
(22) In some embodiments, the recombinant bacteria includes one or more mutations to stress-response genes from wild-type E. coli B. In some embodiments, the recombinant bacteria includes one or more mutations to one or more of the following stress-response genes: coproporphyrinogen-III oxidase-like protein yggw, sensory kinase yedv, transcriptional regulatory protein yedw, reductase catalytic subunit yedy, ornithine decarboxylase spec, streptococcus pyrogenic exotoxin B speb, universal stress protein C uspc, protein and nucleotide deglycase hcha, metalloprotease loip, membrane-bound lytic murein transglycosylase C mltc, sensor histidine kinase envz, DNA-binding dual transcriptional regulator ompr, RNA-binding protein yhgf, and DNA-binding protein HU- hupb. In some embodiments, the one or more mutations includes a base substitution, deletion, insertion, or combinations thereof. Representative mutations consistent with the embodiments of the present disclosure are provided below at Table 1:
(23) TABLE-US-00001 TABLE 1 Genetic Mutations in Stress-Response Genes of Recombinant Bacteria consistent with some embodiments of the present disclosure. Gene Mutations yggw Gln60Lys Thr91Ala Asn150Lys Ala305Thr Ser334Stop Glu336Asp yedv Gln227His Asp99Gly Gly86Ser yedw Gln137His yedy Ala34Thr Pro63Ala spec Met702Val Glu631Asp Ala565Val Asn496Asp Ala418Glu Phe275Tyr Ile146Val Thr18Ser speb Ser33Ala uspc Gln62His hcha Asn164Thr loip Ala47Thr mltc Gln164Lys envz Leu41Pro ompr Asp183Tyr Asn6Lys yhgf Asp147Tyr Val723Glu hupb Residues Ala57-Thr59 deleted
(24) In some embodiments, the recombinant bacteria includes at least one promoter regulating the expression of the one or more exogenous nucleotide sequences. In some embodiments, the promoter is any suitable promoter compatible with the recombinant bacteria and capable of being activated in the presence of one or more inducing agents. In some embodiments, the promoter is at least substantially deactivated in the absence of the inducing agent. In some embodiments, the recombinant bacteria has little to no expression of the one or more exogenous nucleotide sequences in the absence of inducing agent, i.e., basal expression of the one or more exogenous nucleotide sequences is low or inhibited. As will be discussed in greater detail below, controlled expression of exogenous polypeptides can be disrupted by so called leaky promoters, where a baseline level of polypeptide expression occurs even in the absence of the inducing agent. In some embodiments of the present disclosure, the promoters of the one or more exogenous nucleotide sequences are sufficiently strong and/or downregulated so that such basal expression is limited or inhibited altogether in the absence of inducing agent. In some embodiments, basal expression of the one or more exogenous nucleotide sequences in an about 0.6 to about 0.8 OD600 culture of the recombinant bacteria for about 4 hours of incubation produces less than about 70 mg/L, produces less than about 60 mg/L, produces less than about 50 mg/L, produces less than about 40 mg/L, produces less than about 30 mg/L, produces less than about 20 mg/L, less than about 10 mg/L, less than about 9 mg/L, less than about 8 mg/L, or less than about 7 mg/L disordered polypeptide in the culture. In some embodiments, basal expression of the one or more exogenous nucleotide sequences in a culture of the recombinant bacteria at about 3 hours of incubation post-inoculation produces less than about 7 mg/L. In these embodiments, the term incubation is used to refer to bacterial culture performed under conditions suitable to foster bacterial growth, e.g., at about 37 C. for the modified E. coli identified in the exemplary embodiments below. In some embodiments, basal expression of the one or more exogenous nucleotide sequences is inhibited. In some embodiments, the recombinant bacteria includes a SoluBL21 genome. In some embodiments, the recombinant bacteria includes a pLysS plasmid.
(25) Referring now to
(26) In some embodiments, the disordered polypeptide includes a primary sequence including GPGQQ AAAAA GPGQQ GPGQQ GPGQQ GPGEQ GPGSG (SEQ. ID NO.: 1). In some embodiments, the disordered polypeptide includes a primary sequence including GPGQQ AAAAA AAAAA GPGQQ GPGQQ GPGEQ GPGSG (SEQ. ID NO.: 2). In some embodiments, the disordered polypeptide includes ELP constructs combining one or more native elastin motifs of VPGAG with specific placement of tyrosine and GRGDS domains for targeted neurological drug delivery. In some embodiments, the disordered polypeptide includes a primary sequence including: (VPGAGVPGAGVPGAGVPGAGVPGYGVPGAGVPGAGVPGAGVPGAGVPGYG)2 GRGDS (SEQ. ID NO.: 3). In some embodiments, the disordered polypeptide is based on two or more distinct and purposefully designed monomers. In some embodiments, the disordered polypeptide is based on two or more distinct and purposefully designed monomers which are duplicated one or more times. In some embodiments, the disordered polypeptide includes a 2, 4, 8, 16, 32, or 64mer of the primary sequence.
(27) At 102A, one or more exogenous gene fragments for the disordered polypeptides are inserted into a recombinant bacterial host. As discussed above, in some embodiments, the recombinant bacterial host is a modified E. coli strain. Referring specifically to
(28) As also discussed above, in some embodiments, the recombinant E. coli strain includes one or more mutations to one or more stress-response genes. In some embodiments, the one or more stress-response genes is yggw, yedv, yedw, yedy, spec, speb, uspc, hcha, loip, mltc, envz, ompr, yhgf, or hupb. In some embodiments, basal expression of the one or more exogenous nucleotide sequences in an about 0.6 to about 0.8 OD600 culture of the recombinant bacteria for about 4 hours of incubation produces less than about 70 mg/L, produces less than about 60 mg/L, produces less than about 50 mg/L, produces less than about 40 mg/L, produces less than about 30 mg/L, produces less than about 20 mg/L, less than about 10 mg/L, less than about 9 mg/L, less than about 8 mg/L, or less than about 7 mg/L disordered polypeptide in the culture. In some embodiments, basal expression of the one or more exogenous nucleotide sequences in a culture of the recombinant bacteria at about 3 hours of incubation post-inoculation produces less than about 7 mg/L.
(29) Referring again to
(30) Referring now to
(31) In some embodiments, the disordered polypeptide includes a primary sequence including GPGQQ AAAAA GPGQQ GPGQQ GPGQQ GPGEQ GPGSG (SEQ. ID NO.: 1). In some embodiments, the disordered polypeptide includes a primary sequence including GPGQQ AAAAA AAAAA GPGQQ GPGQQ GPGEQ GPGSG (SEQ. ID NO.: 2). In some embodiments, the disordered polypeptide includes ELP constructs combining one or more native elastin motifs of VPGAG with specific placement of tyrosine and GRGDS domains for targeted neurological drug delivery. In some embodiments, the disordered polypeptide includes a primary sequence including: (VPGAGVPGAGVPGAGVPGAGVPGYGVPGAGVPGAGVPGAGVPGAGVPGYG)2 GRGDS (SEQ. ID NO.: 3). In some embodiments, the disordered polypeptide is based on two or more distinct and purposefully designed monomers. In some embodiments, the disordered polypeptide is based on two or more distinct and purposefully designed monomers which are duplicated one or more times. In some embodiments, the disordered polypeptide includes a 2, 4, 8, 16, 32, or 64mer of the primary sequence.
(32) At 202, a recombinant E. coli strain is prepared. As discussed above, in some embodiments, the recombinant E. coli strain includes an expression vector. In some embodiments, the expression vector includes one or more exogenous nucleotide sequences encoding a disordered polypeptide. In some embodiments, the expression vector includes at least one promoter regulating the expression of the one or more exogenous nucleotide sequences.
(33) As also discussed above, in some embodiments, the recombinant E. coli strain includes one or more mutations to one or more stress-response genes. In some embodiments, the one or more stress-response genes is yggw, yedv, yedw, yedy, spec, speb, uspc, hcha, loip, mltc, envz, ompr, yhgf, or hupb. In some embodiments, basal expression of the one or more exogenous nucleotide sequences in an about 0.6 to about 0.8 OD600 culture of the recombinant bacteria for about 4 hours of incubation produces less than about 70 mg/L, produces less than about 60 mg/L, produces less than about 50 mg/L, produces less than about 40 mg/L, produces less than about 30 mg/L, produces less than about 20 mg/L, less than about 10 mg/L, less than about 9 mg/L, less than about 8 mg/L, or less than about 7 mg/L disordered polypeptide in the culture. In some embodiments, basal expression of the one or more exogenous nucleotide sequences in a culture of the recombinant bacteria at about 3 hours of incubation post-inoculation produces less than about 7 mg/L.
(34) At 204, the one or more exogenous nucleotide sequences are expressed. As discussed above, in some embodiments, expression 204 of the one or more exogenous nucleotide sequences includes induction via application of one or more inducing agents. In some embodiments, the one or more inducing agents includes IPTG. In other embodiments, the expression of the one or more exogenous nucleotide sequences is basal expression, e.g., in the absence of inducing agent. As discussed above, in some embodiments, longer culture times in the absence of inducer are utilized. In some embodiments, the basal expression of the one or more exogenous nucleotide sequences in culture is allowed for proceed for greater than 4, 5, 6, 7, 8, 9, 10, 15, 20 hours, etc. At 206, the disordered polypeptide is isolated from the E. coli strain as a polypeptide product. Isolation 206 of the polypeptide product is performed via any suitable process, e.g., cellular secretion, cellular lysis and subsequent isolation via decanting, centrifugation, chromatography, membrane separation, etc., or combinations thereof.
(35) Exemplary recombinant bacteria were prepared consistent with the embodiments described above and compared to 9 commercially available E. coli strains: BL21, BL21 pLysS, RosettaGami B, BL21 pGro7, BLR, HMS174, MG1655, SoluB21, and Origami B. Some strains were chosen based on factors previously shown or hypothesized to affect recombinant silk production, such as codon usage and inclusion body formation. These strains include RosettaGami B, which has upregulation of seven tRNAs for rare codons including those for glycine and proline, as well as BLR, which has a recA-mutation that may facilitate increased stability of plasmids containing repetitive sequences. The strain SoluBL2 has been developed through directed evolution to produce soluble protein when its ancestral strain, BL21(DE3), does not yield detectable soluble product. Likewise, strain pGro7 expresses a chaperone protein that prevents inclusion body formation and promotes soluble production. Strains HMS 174 and MG1655, which unlike other strains tested, are from the K-12 E. coli lineage instead of the B lineage. Strain pLysS restricts basal expression, while strain Origami B includes mutations that change the cytoplasmic environment and cellular stress responses through alterations to the thiol-redox equilibrium, glutathione metabolism, and oxidative stress response. All strains were DE3 lysogens and proteins were expressed from the pET19b vector under control of the T7 promoter.
(36) Four different de novo designed spidroin constructs were expressed in these E. coli stains, with titer, plasmid maintenance, and OD600 measured as expression outcomes. The primary sequences and polymeric structure of the spidroin constructs are depicted in Table 2 below:
(37) TABLE-US-00002 TABLE2 PrimarySequenceofRecombinantSpidroinConstructs Molecular Spidroin Number Weight Construct PrimarySequence ofRepeats (kDa) A54mer MGHHHHHHHHHHSSGHIDDDDKHMLEHMPG n=4 16.1 (GPGQQAAAAAGPGQQGPGQQGPGQQGPGEQGPGSG)n TSGS(SEQ.IDNO.:4) A516mer MGHHHHHHHHHHSSGHIDDDDKHMLEHMPG n=16 52.7 (GPGQQAAAAAGPGQQGPGQQGPGQQGPGEQGPGSG)n TSGS(SEQ.IDNO.:5) A104mer MGHHHHIHHHHSSGHIDDDDKHMLEHMPG n=4 15.6 (GPGQQAAAAAAAAAAGPGQQGPGQQGPGEQGPGSG)n TSGS(SEQ.IDNO.:6) A1016mer MGHHHHHHHHHHSSGHIDDDDKHMLEHMPG n=16 50.9 (GPGQQAAAAAAAAAAGPGQQGPGQQGPGEQGPGSG)n TSGS(SEQ.IDNO.:7)
(38) To assess the effect of protein size, recombinant spidroins were designed to have either four or sixteen identical monomer units in tandem (referred to as 4mers and 16mers, respectively). To assess the effect of modulating primary sequence, two different monomer units of 35 amino acids were designed, with one containing a segment of five tandem alanine residues (A5) and the other a segment of ten tandem alanine residues (A10). The remaining amino acids in the monomer sequences included multiple GPGQQ motifs (four for the A5 monomer and three for the A10 monomer) and single GPGEQ and GPGSG motifs. Both monomer units were designed based on naturally occurring primary sequences found in the MaSp2 dragline spidroin of orb-weaving spiders. Modulating the tandem alanine length and total construct length were chosen as focal points to demonstrate the effect of construct design on expression outcomes. All constructs expressed had a starting sequence that is present on the pET-19b expression vector, which contains a 10 histidine tag for purification followed by an enterokinase cleavage sequence.
(39) Referring to
(40) In this exemplary embodiment, the pLysS plasmid from the pLysS strain was extracted and transformed into SoluBL21, referred to hereinafter as SoluBL21-pLysS. The hybrid strain was able to produce the small spidroins at 201 (6) and 189 (10) mg/L for the A5 4mer and A10 4mer, respectively. These titers were approximately twice that of either parent strain. Moreover, when compared to BL21, these titers represented a 13-fold increase for the A5 4mer and a 33-fold increase for the A10 4mer (see
(41) To demonstrate the increased spidroin titers achieved with the hybrid SoluBL21-pLysS strain could be extended to other repetitive, structural proteins, an ELP was produced in strains BL21, pLysS, SoluBL21, and SoluBL21-pLysS. The recombinant ELP, A4Y1, was chosen for production in these four strains based on structural similarity when compared to the A5 4mer primary sequence, as shown in Table 3 below:
(42) TABLE-US-00003 TABLE3 A54merandA4Y1ELPPrimarySequencesandAminoAcidComposition Glycine Proline Alanine Glutamine Valine Tyrosine Protein Sequence % % % % % % A5 MGHHHHHHHHHHSSGHI 32 14 12 21 0 0 4mer DDDDKHMLEHMPG (MW: (GPGQQAAAAAGPGQQGP 16kDa) GQQGPGQQGPGEQGPGSG)n TSGS(SEQ.IDNO.:4) A4Y1 MGHHHHHHHHHHSSGHI 35 16 13 0 16 3 ELP DDDDKHMLEHM (MW: VPGAGVPGAGVPGAGVPG 16kDa) AGVPGYGVPGAGVPGAG VPGAGVPGAGVPGYGGRG DSVPGAGVPGAGVPGAGV PGAGVPGYGVPGAGVPGA GVPGAGVPGAGVPGYGGR GDSVPGAGVPGAGVPGAG VPGAGVPGYGVPGAGVPG AGVPGAGVPGAGVPGYG (SEQ.IDNO.:8)
(43) The A5 4mer and the A4Y1 ELP both have a 4mer polymeric structure, along with similar molecular weight and glycine, proline, and alanine contents. Furthermore, both recombinant spidroins and ELPs self-assemble into supramolecular materials when triggered by external stimuli. However, the A5 4mer has tandem alanines (A.sub.n) while the ELP has alanine residues distributed throughout the construct. Additionally, the A5 4mer has a high amount of glutamine (21%), which the ELP lacks, and the ELP has a high percentage of valine (16%) and some tyrosine (3%), both of which are missing from the A5 4mer.
(44) Referring now to
(45) During exogenous polypeptide expression, cells can potentially lose the plasmid vector that was transformed into them. Plasmid loss is indicative of excessive metabolic burden, which may stem from repetitive recombinant DNA sequences or toxic recombinant protein products and exacerbated by depletion of antibiotic selection factors. Plasmid-free cells can continue to divide, leading to a substantial decrease in the overall number of cells in a culture that are producing recombinant protein. A high level of plasmid maintenance contributes to the achievement of high titers, particularly for high-density cell cultivation in bioreactors.
(46) Referring to
(47) The possibility that expressing A5 and A10 spidroins causes host cell toxicity was further supported by observations made during the plasmid maintenance assay, in which 0.1 ml of a 10,000 culture dilution (generated through serial dilutions) was plated for colony counting. For most strains observed, this procedure resulted in several hundred single colonies on LB agar plates. However, the strains that showed moderate to high levels of plasmid maintenance but low titers and inhibited growth after induction (RosettaGami, BLR, HMS174, and MG1655) displayed a lack of colony forming units on LB plates using this protocol. Compared to other strains at the same OD600, cultures of RosettaGami, BLR, HMS174, and MG1655 used a 100 (instead of a 10,000) dilution of a culture sample to obtain enough isolated colonies for the plasmid maintenance assay (minimum of 50 colonies). This lack of colony-forming-units after recombinant protein induction is a documented effect of toxic protein expression in cases where the vector is still maintained.
(48) Referring now to
(49) Referring now to
(50) Without wishing to be bound by theory, the disordered nature of the spidroin constructs is a factor underlying their toxicity, low titers, and low plasmid maintenance. This toxicity may result from promiscuous and harmful binding interactions by disordered proteins within the intracellular milieu. As recombinant MaSp2-mimetic spidroins can be high in disorder-promoting amino acids (proline, glutamine, glycine, serine) and low in order-promoting amino acids, the recombinant bacteria of the present disclosure are advantageous in their ability to produce high titers of recombinant spidroins without experiencing these toxic effects. The hybrid SoluBL21-pLysS strain showed substantially improved recombinant spidroin expression compared to other E. coli strains (see again
(51) To demonstrate the underlying cellular mechanisms of this exemplary embodiment, basal expression of the recombinant nucleotides was examined. As discussed above, basal expression refers to the expression of a recombinant gene without induction, which can cause plasmid loss and subsequent low titers if the gene product is toxic. In this exemplary embodiment, SoluBL21-pLysS exerts tight control over basal expression via production of T7 lysozyme, which inhibits action of T7 polymerase and prevents basal expression of recombinant genes placed on pET vectors. Upon addition of IPTG, T7 polymerase concentration increases and overcomes the inhibition of T7 polymerase.
(52) Referring now to
(53) Referring now to
(54) Sequencing of the recombinant bacterial strain of the exemplary embodiment revealed a number of mutations that differentiate it from E. coli BL21. Mutations were identified on 47 genes in the SoluBL21 genomic material of the exemplary embodiment. Of these, there are 14 genes with mutations that are directly involved in stress responses in E. coli. These genes include yggw, yedv, yedw, yedy, spec, speb, uspc, hcha, loip, mltc, envz, ompr, yhgf, hupb. Referring again to Table 1, these genes play roles in stress responses pertaining to heat, reactive oxygen species, cell surface damage, salt changes, acid exposure, carbonyls, osmotic changes, putrescine production, nutrient starvation, ethanol exposure, radiation, and the SOS response. Several of the mutations occur on genes responsible for extensive and broad stress pathway signaling within E. coli. This includes the envz/ompr two-component system, in which a mutation on the envz gene causes constant phosphorylation of the ompr transcriptional regulator. This results in a decreased repression of several stress response pathways, including those related to osmotic and acid stress. Furthermore, mutations in the spec and speb genes may alter putrescine production pathways, with putrescine production representing a way that several organisms, including E. coli, respond to a myriad of harmful conditions. Likewise, the uspc gene potentially promotes a more favorable cell phenotype in response to silk protein production, as uspc is induced by a diversity of stress factors that include nutrient starvation (of multiple types), oxidative stress, DNA damage, radiation, heat shock, and ethanol exposure.
(55) There were also mutations in the DNA-binding region of hupb, a transcriptional factor that controls expression of 8% of the entire genome in regions where the genes are associated with adaptations to harsh environments, including the SOS response system, and oxidative and radiative stress systems. These mutations promote a cell phenotype that is more tolerant to the expression of disordered and potentially toxic proteins.
(56) Nonrepetitive terminal domains are present in natural spidroins, typically 100-200 amino acids in length, and promote solubility and controlled self-assembly of recombinant spidroins. An exemplary embodiment of the present disclosure was prepared to demonstrate the use of terminal domains on increasing titers and decreasing toxicity. cDNA copies of the terminal regions of L. hesperus (western black widow) MaSp1 dragline silk were inserted at either end of the A10 4mer gene to form the A10 4mer BWT construct shown below:
(57) TABLE-US-00004 (SEQ.IDNO.:9) MGHHHHHHHHHHSSGHIDDDDKHMLEHMQANTPWSSKANADAFINSFISA ASNTGSFSQDQMEDMSLIGNTLMAAMDNMGGRITPSKLQALDMAFASSVA EIAASEGGDLGVTTNAIADALTSAFYQTTGVVNSRFISEIRSLIGMFAQA SANDVYASAGSSGGGGYGASSASAASASAAAPSGVAYQAPAQAQISFTLR GQQPVSGPGQQAAAAAAAAAAGPGQQGPGQQGPGEQGPGSGGPGQQAAAA AAAAAAGPGQQGPGQQGPGEQGPGSGGPGQQAAAAAAAAAAGPGQQGPGQ QGPGEQGPGSGGPGQQAAAAAAAAAAGPGQQGPGQQGPGEQGPGSGSGPG QIYYGPQSVAAPAAAAASALAAPATSARISSHASALLSNGPTNPASISNV ISNAVSQISSSNPGASACDVLVQALLELVTALLTIIGSSNIGSVNYDSSG QYAQWTQSVQNAFA
The A10 4mer BWT protein was readily produced and purified from the SoluBL21-pLysS strain at 96 (12) mg/L.
(58)
(59) Methods and systems of the present disclosure advantageously leverage recombinant E. coli strains and synthetic protein designs for production of disordered polypeptides such as spidroins and ELPs. Downregulation of basal expression in the recombinant bacteria of the present disclosure result in high titers of disordered protein product while minimizing the toxic effects thereof, enabling the economical production of these desired polypeptide constructs at industrially-relevant scales. Examples of functionalities related to the improved titers for the E. coli platform include an increased potential for next-generation biopolymers to function as enhanced replacements for current materials by way of improved unit economics. Examples of functionalities related to the production of recombinant spidroins and ELPs include tissue regeneration, sustainable clothing and textiles, drug delivery, food preservation, and biomedical implants. These methods can also be applied to different permutations of de novo designed recombinant spidroin or elastin-like peptides, as well as other protein-based biopolymers of interest, such as collagen, that share similarities in primary sequence composition, aggregation propensity, and intrinsic disorder.
(60) Although the invention has been described and illustrated with respect to exemplary embodiments thereof, it should be understood by those skilled in the art that the foregoing and various other changes, omissions and additions may be made therein and thereto, without parting from the spirit and scope of the present invention.