PYRENOID-LIKE STRUCTURES
20220282268 · 2022-09-08
Inventors
Cpc classification
C12N15/8257
CHEMISTRY; METALLURGY
International classification
Abstract
Aspects of the present disclosure relate to genetically altered plants having a modified Rubisco and further having a modified Essential Pyrenoid Component 1 (EPYC1) for formation of an aggregate of modified Rubisco and EPYC1 polypeptides. Other aspects of the present disclosure relate to methods of making such plants as well as cultivating these genetically altered plants.
Claims
1. A genetically altered higher plant or part thereof, comprising a modified Rubisco for formation of an aggregate of Essential Pyrenoid Component 1 (EPYC1) polypeptides and modified Rubiscos, wherein the modified Rubisco comprises an algal Rubisco small subunit (SSU) polypeptide or a modified higher plant Rubisco SSU polypeptide wherein at least part of the higher plant Rubisco SSU polypeptide is replaced with at least part of an algal Rubisco SSU polypeptide.
2. The plant or part thereof of claim 1, further comprising the EPYC1 polypeptides and the aggregate.
3. The plant or part thereof of claim 1, wherein the modified Rubisco comprising the algal Rubisco SSU polypeptide has increased affinity for the EPYC1 polypeptides as compared to unmodified Rubisco.
4. The plant or part thereof of claim 1, wherein the modified higher plant Rubisco SSU polypeptide was modified by substituting one or more higher plant Rubisco SSU α-helices with one or more algal Rubisco SSU α-helices; substituting one or more higher plant Rubisco SSU β-strands with one or more algal Rubisco SSU β-strands; and/or substituting a higher plant Rubisco SSU βA-βB loop with an algal Rubisco SSU βA-βB loop.
5. The plant or part thereof of claim 1, wherein the modified higher plant Rubisco SSU polypeptide has increased affinity for the EPYC1 polypeptides as compared to the higher plant Rubisco SSU polypeptide without the modification.
6. A genetically altered higher plant or part thereof, comprising EPYC1 polypeptides for formation of an aggregate of the EPYC1 polypeptides and modified Rubiscos.
7. The plant or part thereof of claim 6, wherein the EPYC1 polypeptides are algal EPYC1 polypeptides or modified EPYC1 polypeptides comprising one or more, two or more, four or more, or eight tandem copies of a first algal EPYC1 repeat region.
8. The plant or part thereof of claim 7, wherein the algal EPYC1 polypeptides are truncated mature EPYC1 polypeptides.
9. The plant or part thereof of claim 8, wherein the truncated mature EPYC1 polypeptides have increased affinity for the modified Rubiscos as compared to the non-truncated EPYC1 polypeptides.
10. The plant or part thereof of claim 7, wherein the modified EPYC1 polypeptides are expressed without the native EPYC1 leader sequence and/or comprise a C-terminal cap.
11. The plant or part thereof of claim 10, wherein the modified EPYC1 polypeptides have increased affinity for the modified Rubiscos as compared to the corresponding unmodified EPYC1 polypeptide.
12. The plant or part thereof of claim 6, wherein the aggregate is localized to a chloroplast stroma of at least one chloroplast of a plant cell, and wherein the plant cell is a leaf mesophyll cell.
13. A genetically altered higher plant or part thereof, comprising a first nucleic acid sequence encoding an EPYC1 polypeptide and a second nucleic acid sequence encoding a modified Rubisco polypeptide.
14. The plant or part thereof of claim 13, wherein the first nucleic acid sequence is operably linked to a third nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell, and wherein the first nucleic acid sequence does not comprise the native EPYC1 leader sequence and is not operably linked to the native EPYC1 leader sequence, and wherein the second nucleic acid sequence is operably linked to a fourth nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell and wherein the second nucleic acid sequence does not encode the native algal SSU leader sequence and is not operably linked to a nucleic acid sequence encoding the native algal SSU leader sequence.
15. The plant or part thereof of claim 13, wherein the EPYC1 polypeptide is a truncated mature EPYC1 polypeptide or a modified EPYC1 polypeptide comprising one or more, two or more, four or more, or eight tandem copies of a first algal EPYC1 repeat region.
16. The plant or part thereof of claim 13, wherein the modified Rubisco polypeptide comprises an algal Rubisco small subunit (SSU) polypeptide or a modified higher plant Rubisco SSU polypeptide wherein at least part of the higher plant Rubisco SSU polypeptide is replaced with at least part of an algal Rubisco SSU polypeptide.
17. The plant or part thereof of claim 13, wherein the plant or part thereof further comprises an aggregate of the modified Rubisco polypeptides and the EPYC1 polypeptides.
18. A method of producing the genetically altered higher plant of claim 1, comprising: a) introducing a first nucleic acid sequence encoding an EPYC1 polypeptide into a plant cell, tissue, or other explant; b) regenerating the plant cell, tissue, or other explant into a genetically altered plantlet; and c) growing the genetically altered plantlet into a genetically altered plant with the first nucleic acid encoding the EPYC1 polypeptide.
19. The method of claim 18, further comprising introducing a second nucleic acid sequence encoding a modified Rubisco SSU polypeptide into a plant cell, tissue, or other explant prior to step (a) or concurrently with step (a), wherein the genetically altered plant of step (c) further comprises the second nucleic acid encoding the modified Rubisco SSU polypeptide.
20. The method of claim 18, wherein the first nucleic acid sequence is introduced with a first vector, and wherein the first vector comprises a first copy of the first nucleic acid sequence wherein the first nucleic acid sequence does not comprise the native EPYC1 leader sequence and is not operably linked to the native EPYC1 leader sequence, wherein the first nucleic acid sequence is operably linked to the third nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell, wherein the first nucleic acid sequence is operably linked to the first promoter, and wherein the first nucleic acid sequence is operably linked to one terminator; and wherein the first vector further comprises a second copy of the first nucleic acid sequence wherein the first nucleic acid sequence does not comprise the native EPYC1 leader sequence and is not operably linked to the native EPYC1 leader sequence, wherein the first nucleic acid sequence is operably linked to the third nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell, wherein the first nucleic acid sequence is operably linked to a third promoter, and wherein the first nucleic acid sequence is operably linked to two terminators.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0032] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
[0052]
[0053]
DETAILED DESCRIPTION
[0054] The following description sets forth exemplary methods, parameters, and the like. It should be recognized, however, that such description is not intended as a limitation on the scope of the present disclosure but is instead provided as a description of exemplary embodiments.
Genetically Altered Plants
[0055] An aspect of the disclosure includes a genetically altered higher plant or part thereof including a modified Rubisco for formation of an aggregate of modified Rubisco and Essential Pyrenoid Component 1 (EPYC1) polypeptides. An aggregate of modified Rubisco and EPYC1 may also be referred to as a condensate of modified Rubisco and EPYC1. An additional embodiment of this aspect includes the modified Rubisco being an algal Rubisco small subunit (SSU) polypeptide or a modified higher plant Rubisco SSU polypeptide wherein at least part of the higher plant Rubisco SSU polypeptide is replaced with at least part of an algal Rubisco SSU polypeptide. In a further embodiment of this aspect, which may be combined with any of the preceding embodiments, the genetically altered higher plant or part thereof further includes the EPYC1 polypeptides and the aggregate. Yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, includes the aggregate being detectable by confocal microscopy, transmission electron microscopy (TEM), cryo-electron microscopy (cryo-EM), a liquid-liquid phase separation assay, or a phase separation assay. Yet another embodiment of this aspect includes the aggregate being detectable by assaying chlorophyll autofluorescence and observing a displacement of chlorophyll autofluorescence when the aggregate is present. A preferred embodiment, which may be combined with any of the preceding embodiments, includes the aggregate being detectable by confocal microscopy in vivo. A further embodiment of this aspect includes the aggregate undergoing internal mixing. An additional embodiment of this aspect includes the aggregate displacing chloroplast thylakoid membranes. Still another embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified higher plant Rubisco, includes the modified higher plant Rubisco polypeptide including an endogenous Rubisco SSU polypeptide. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified higher plant Rubisco, the modified higher plant Rubisco SSU polypeptide was modified by substituting one or more higher plant Rubisco SSU α-helices with one or more algal Rubisco SSU α-helices; substituting one or more higher plant Rubisco SSU β-strands with one or more algal Rubisco SSU β-strands; and/or substituting a higher plant Rubisco SSU βA-βB loop with an algal Rubisco SSU βA-βB loop. An additional embodiment of this aspect includes the higher plant Rubisco SSU polypeptide being modified by substituting two higher plant Rubisco SSU α-helices with two algal Rubisco SSU α-helices. A further embodiment of this aspect includes the two higher plant Rubisco SSU α-helices corresponding to amino acids 23-35 and amino acids 80-93 in SEQ ID NO: 1 and the two algal Rubisco SSU α-helices corresponding to amino acids 23-35 and amino acids 86-99 in SEQ ID NO: 2. Yet another embodiment of this aspect that can be combined with any of the preceding embodiments that has two higher plant Rubisco SSU α-helices being substituted with two algal Rubisco SSU α-helices, the higher plant Rubisco SSU polypeptide being further modified by substituting four higher plant Rubisco SSU β-strands with four algal Rubisco SSU β-strands, and by substituting a higher plant Rubisco SSU βA-βB loop with an algal Rubisco SSU βA-βB loop. An additional embodiment of this aspect includes the four higher plant Rubisco SSU β-strands corresponding to amino acids 39-45, amino acids 68-70, amino acids 98-105, and amino acids 110-118 in SEQ ID NO: 1, the four algal Rubisco SSU β-strands corresponding to amino acids 39-45, amino acids 74-76, amino acids 104-111, and amino acids 116-124 in SEQ ID NO: 2, the higher plant Rubisco SSU βA-βB loop corresponding to amino acids 46-67 in SEQ ID NO: 1, and the algal Rubisco SSU βA-βB loop corresponding to amino acids 46-73 in SEQ ID NO: 2.
[0056] Still another embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified higher plant Rubisco, includes the higher plant Rubisco SSU polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, or SEQ ID NO: 156. Yet another embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified higher plant Rubisco, includes the algal Rubisco SSU polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 2, SEQ ID NO: 30, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, or SEQ ID NO: 164. In an additional embodiment of this aspect, the algal Rubisco SSU polypeptide is SEQ ID NO: 2, SEQ ID NO: 30, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, or SEQ ID NO: 164. A further embodiment of this aspect, which may be combined with any of the preceding embodiments that has a modified higher plant Rubisco, includes the modified higher plant Rubisco SSU polypeptide having increased or altered affinity for the EPYC1 polypeptide as compared to the higher plant Rubisco SSU polypeptide without the modification.
[0057] An additional aspect of the disclosure includes a genetically altered higher plant or part thereof including EPYC1 polypeptides for formation of an aggregate of modified Rubiscos and the EPYC1 polypeptides. An aggregate of modified Rubisco and EPYC1 may also be referred to as a condensate of modified Rubisco and EPYC1. A further embodiment of any of the preceding aspects includes the EPYC1 polypeptides being algal EPYC1 polypeptides. An additional embodiment of this aspect includes the algal EPYC1 polypeptides having an amino acid sequence having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 165, SEQ ID NO: 166, or SEQ ID NO: 167. In yet another embodiment of this aspect, the algal EPYC1 polypeptide is SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 165, SEQ ID NO: 166, or SEQ ID NO: 167. An additional embodiment of this aspect includes EPYC1 being the mature or truncated form of EPYC1 corresponding to SEQ ID NO: 35. A further embodiment of this aspect includes the full-length form of EPYC1 corresponding to SEQ ID NO: 34 being truncated between residues 26 (V) and 27 (A) to form the mature native form of EPYC1 corresponding to SEQ ID NO: 35. Still another embodiment of any of the preceding aspects includes the EPYC1 polypeptides being modified EPYC1 polypeptides. A further embodiment of this aspect includes the modified EPYC1 polypeptides including one or more, two or more, four or more, or eight tandem copies of a first algal EPYC1 repeat region. An additional embodiment of this aspect includes the modified EPYC1 polypeptides including four tandem copies or eight tandem copies of the first algal EPYC1 repeat region. Yet another embodiment of this aspect, which may be combined with any of the preceding embodiments including modified EPYC1 polypeptides including tandem copies of a first algal EPYC1 repeat region, includes the first algal EPYC1 repeat region being a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 36. A further embodiment of this aspect includes the first algal EPYC1 repeat region being SEQ ID NO: 36. Still another embodiment of this aspect, which may be combined with any of the preceding embodiments including modified EPYC1, includes the modified EPYC1 polypeptides being expressed without the native EPYC1 leader sequence and/or including a C-terminal cap. Yet another embodiment of this aspect includes the native EPYC1 leader sequence including a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 42, and the C-terminal cap including a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 41. A further embodiment of this aspect includes the C-terminal cap being SEQ ID NO: 41. Still another embodiment of this aspect, which may be combined with any of the preceding embodiments including modified EPYC1, includes the modified EPYC1 polypeptide having increased affinity for Rubisco SSU polypeptide as compared to the corresponding unmodified EPYC1 polypeptide.
[0058] In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the aggregate is localized to a chloroplast stroma of at least one chloroplast of a plant cell. The aggregate may also be referred to as the condensate. A further embodiment of this aspect includes the plant cell being a leaf mesophyll cell. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments, the plant is selected from the group of cowpea (e.g., black-eyed pea, catjang, yardlong bean, Vigna unguiculata), soy (e.g., soybean, soya bean, Glycine max, Glycine soja), cassava (e.g., manioc, yucca, Manihot esculenta), rice (e.g., indica rice, japonica rice, aromatic rice, glutinous rice, Oryza sativa, Oryza glaberrima), wheat (e.g., common wheat, spelt, durum, einkorn, emmer, kamut, Triticum aestivum, Triticum spelta, Triticum durum, Triticum urartu, Triticum monococcum, Triticum turanicum, Triticum spp.), barley (e.g., Hordeum vulgare), rye (i.e., Secale cereale), oat (i.e., Avena sativa), tomato (e.g., Solanum lycopersicum), potato (e.g., russet potatoes, yellow potatoes, red potatoes, Solanum tuberosum), canola (e.g., Brassica rapa, Brassica napus, Brassica juncea), or other C3 crop plants. In some embodiments, the plant is tobacco (i.e., Nicotiana tabacum, Nicotiana edwardsonii, Nicotiana plumbagnifolia, Nicotiana longijlora, Nicotiana benthamiana) or Arabidopsis (i.e., rockcress, thale cress, Arabidopsis thaliana).
[0059] A further aspect of the disclosure includes a genetically altered higher plant or part thereof including a first nucleic acid sequence encoding an EPYC1 polypeptide and a second nucleic acid sequence encoding a modified Rubisco. An additional embodiment of this aspect includes EPYC1 being the mature or truncated form of EPYC1 corresponding to SEQ ID NO: 35. A further embodiment of this aspect includes the full-length form of EPYC1 corresponding to SEQ ID NO: 34 being truncated between residues 26 (V) and 27 (A) to form the mature native form of EPYC1 corresponding to SEQ ID NO: 35. Yet another embodiment of this aspect includes the first nucleic acid sequence being introduced with a binary vector comprising two separate expression cassettes, wherein each expression cassette comprises the first nucleic acid sequence. An additional embodiment of this aspect includes the first nucleic acid sequence being operably linked to a first promoter. A further embodiment of this aspect includes the first promoter being selected from the group of a constitutive promoter, an inducible promoter, a leaf specific promoter, or a mesophyll cell specific promoter. Yet another embodiment of this aspect includes the first promoter being a constitutive promoter selected from the group of a CaMV35S promoter, a derivative of the CaMV35S promoter, a CsVMV promoter, a derivative of the CsVMV promoter, a maize ubiquitin promoter, a trefoil promoter, a vein mosaic cassava virus promoter, and an A. thaliana UBQ10 promoter. Still another embodiment of this aspect, which may be combined with any of the preceding embodiments, includes the first nucleic acid sequence being operably linked to a third nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell, and the first nucleic acid sequence not including the native EPYC1 leader sequence and not being operably linked to the native EPYC1 leader sequence. An additional embodiment of this aspect includes the chloroplastic transit peptide being a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 63. Yet another embodiment of this aspect includes the chloroplastic transit peptide being SEQ ID NO: 63. In a further embodiment of this aspect that can be combined with any of the preceding embodiments that has a native EPYC1 leader sequence, the native EPYC1 leader sequence corresponds to nucleotides 60-137 of SEQ ID NO: 65. In still another embodiment of this aspect that can be combined with any of the preceding embodiments, the first nucleic acid sequence is operably linked to one or two terminators. A further embodiment of this aspect includes the one two terminators being selected from the group of a HSP terminator, a NOS terminator, an OCS terminator, an intronless extensin terminator, a 35S terminator, a pinII terminator, a rbcS terminator, an actin terminator, or any combination thereof.
[0060] Still another embodiment of this aspect, which may be combined with any of the preceding embodiments, includes the second nucleic acid sequence being operably linked to a second promoter. In a further embodiment of this aspect, the second promoter is selected from the group of a constitutive promoter, an inducible promoter, a leaf specific promoter, or a mesophyll cell specific promoter. In an additional embodiment of this aspect, the second promoter is a constitutive promoter selected from the group of a CaMV35S promoter, a derivative of the CaMV35S promoter, a CsVMV promoter, a derivative of the CsVMV promoter, a maize ubiquitin promoter, a trefoil promoter, a vein mosaic cassava virus promoter, or an A. thaliana UBQ10 promoter. In yet another embodiment of this aspect that can be combined with any of the preceding embodiments that has a second nucleic acid sequence being operably linked to a second promoter, the second nucleic acid sequence encodes an algal Rubisco SSU polypeptide. In an additional embodiment of this aspect, the second nucleic acid sequence is operably linked to a fourth nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell and the second nucleic acid sequence does not encode the native algal SSU leader sequence and is not operably linked to a nucleic acid sequence encoding the native algal SSU leader sequence. In a further embodiment of this aspect, the chloroplastic transit peptide is a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 64. In yet another embodiment of this aspect, the chloroplastic transit peptide is SEQ ID NO: 64. In still another embodiment of this aspect that can be combined with any of the preceding embodiments that has a native algal SSU leader sequence, the native algal SSU leader sequence corresponds to amino acids 1 to 45 of SEQ ID NO: 32. In yet another embodiment of this aspect that can be combined with any of the preceding embodiments that has a native algal SSU leader sequence, the native algal SSU leader sequence corresponds to amino acids 1 to 45 of SEQ ID NO: 31. In a further embodiment of this aspect that can be combined with any of the preceding embodiments that has a second nucleic acid sequence being operably linked to a second promoter, the second nucleic acid sequence is operably linked to a terminator. In an additional embodiment of this aspect, the terminator is selected from the group of a HSP terminator, a NOS terminator, an OCS terminator, an intronless extensin terminator, a 35S terminator, a pinII terminator, a rbcS terminator, or an actin terminator. In yet another embodiment of this aspect that can be combined with any of the preceding embodiments that has a second nucleic acid sequence being operably linked to a second promoter, the second nucleic acid sequence encodes a modified higher plant Rubisco SSU polypeptide wherein at least part of the higher plant Rubisco SSU polypeptide is replaced with at least part of an algal Rubisco SSU polypeptide. A further embodiment of this aspect, which can be combined with any of the preceding embodiments, includes the EPYC1 polypeptide being the EPYC1 polypeptide of any one of the preceding embodiments. An additional embodiment of this aspect includes EPYC1 being the mature or truncated form of EPYC1 corresponding to SEQ ID NO: 35. A further embodiment of this aspect includes the full-length form of EPYC1 corresponding to SEQ ID NO: 34 being truncated between residues 26 (V) and 27 (A) to form the mature native form of EPYC1 corresponding to SEQ ID NO: 35. An additional embodiment of this aspect includes the Rubisco SSU polypeptide being the Rubisco SSU polypeptide of any one of the preceding embodiments.
[0061] Yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, includes at least one cell of the plant or part thereof including an aggregate of the Rubisco polypeptide and the EPYC1 polypeptide. A further embodiment of this aspect includes the aggregate being localized to a chloroplast stroma of at least one chloroplast of at least one plant cell. An additional embodiment of this aspect includes the plant cell being a leaf mesophyll cell. In still another embodiment of this aspect, which may be combined with any of the preceding embodiments that has a plant or part thereof including an aggregate of the Rubisco polypeptide and the EPYC1 polypeptide, the aggregate is detectable by confocal microscopy, transmission electron microscopy (TEM), cryo-electron microscopy (cryo-EM), or a liquid-liquid phase separation assay. Yet another embodiment of this aspect includes the aggregate being detectable by assaying chlorophyll autofluorescence and observing a displacement of chlorophyll autofluorescence when the aggregate is present. A preferred embodiment, which may be combined with any of the preceding embodiments, includes the aggregate being detectable by confocal microscopy in vivo. A further embodiment of this aspect includes the aggregate undergoing internal mixing. An additional embodiment of this aspect includes the aggregate displacing chloroplast thylakoid membranes. In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, the plant is selected from the group of cowpea (e.g., black-eyed pea, catjang, yardlong bean, Vigna unguiculata), soy (e.g., soybean, soya bean, Glycine max, Glycine soja), cassava (e.g., manioc, yucca, Manihot esculenta), rice (e.g., indica rice, japonica rice, aromatic rice, glutinous rice, Oryza sativa, Oryza glaberrima), wheat (e.g., common wheat, spelt, durum, einkorn, emmer, kamut, Triticum aestivum, Triticum spelta, Triticum durum, Triticum urartu, Triticum monococcum, Triticum turanicum, Triticum spp.), barley (e.g., Hordeum vulgare), rye (i.e., Secale cereale), oat (i.e., Avena sativa), tomato (e.g., Solanum lycopersicum), potato (e.g., russet potatoes, yellow potatoes, red potatoes, Solanum tuberosum), canola (e.g., Brassica rapa, Brassica napus, Brassica juncea), or other C3 crop plants. In some embodiments, the plant is tobacco (i.e., Nicotiana tabacum, Nicotiana edwardsonii, Nicotiana plumbagnifolia, Nicotiana longijlora, Nicotiana benthamiana) or Arabidopsis (i.e., rockcress, thale cress, Arabidopsis thaliana).
[0062] A further embodiment of this aspect that can be combined with any of the preceding embodiments includes a genetically altered higher plant cell produced from the plant or plant part of any one of the preceding embodiments. Yet another embodiment of this aspect that can be combined with any of the preceding embodiments with respect to plant part includes the plant part being a leaf, a stem, a root, a tuber, a flower, a seed, a kernel, a grain, a fruit, a cell, or a portion thereof and the genetically altered plant part including the one or more genetic alterations. A further embodiment of this aspect includes the plant part being a fruit, a tuber, a kernel, or a grain. Still another embodiment of this aspect that can be combined with any of the preceding embodiments with respect to pollen grain or ovules includes a genetically altered pollen grain or a genetically altered ovule of the plant of any one of the preceding embodiments, wherein the genetically altered pollen grain or the genetically altered ovule includes the one or more genetic alterations. A further embodiment of this aspect that can be combined with any of the preceding embodiments includes a genetically altered protoplast produced from the genetically altered plant of any of the preceding embodiments, wherein the genetically altered protoplast includes the one or more genetic alterations. An additional embodiment of this aspect that can be combined with any of the preceding embodiments includes a genetically altered tissue culture produced from protoplasts or cells from the genetically altered plant of any one of the preceding embodiments, wherein the cells or protoplasts are produced from a plant part selected from the group of leaf, leaf mesophyll cell, anther, pistil, stem, petiole, root, root tip, tuber, fruit, seed, kernel, grain, flower, cotyledon, hypocotyl, embryo, or meristematic cell, wherein the genetically altered tissue culture includes the one or more genetic alterations. An additional embodiment of this aspect includes a genetically altered plant regenerated from the genetically altered tissue culture that includes the one or more genetic alterations. Yet another embodiment of this aspect that can be combined with any of the preceding embodiments includes a genetically altered plant seed produced from the genetically altered plant of any one of the preceding embodiments.
Methods of Producing and Cultivating Genetically Altered Plants
[0063] Another aspect of the disclosure includes methods of producing the genetically altered higher plant of any of the preceding embodiments including a) introducing a first nucleic acid sequence encoding an EPYC1 polypeptide into a plant cell, tissue, or other explant; b) regenerating the plant cell, tissue, or other explant into a genetically altered plantlet; and c) growing the genetically altered plantlet into a genetically altered plant with the first nucleic acid encoding the EPYC1 polypeptide. An additional embodiment of this aspect includes EPYC1 being the mature or truncated form of EPYC1 corresponding to SEQ ID NO: 35. A further embodiment of this aspect includes the full-length form of EPYC1 corresponding to SEQ ID NO: 34 being truncated between residues 26 (V) and 27 (A) to form the mature native form of EPYC1 corresponding to SEQ ID NO: 35. An additional embodiment of this aspect further includes introducing a second nucleic acid sequence encoding a modified Rubisco SSU polypeptide into a plant cell, tissue, or other explant prior to step (a) or concurrently with step (a), wherein the genetically altered plant of step (c) further includes the second nucleic acid encoding the modified Rubisco SSU polypeptide. An additional embodiment of this aspect further includes identifying successful introduction of the first nucleic acid sequence and, optionally, the second nucleic acid sequence by screening or selecting the plant cell, tissue, or other explant prior to step (b); screening or selecting plantlets between step (b) and (c); or screening or selecting plants after step (c). In yet another embodiment of this aspect, which may be combined with any of the preceding embodiments, transformation is done using a transformation method selected from the group of particle bombardment (i.e., biolistics, gene gun), Agrobacterium-mediated transformation, Rhizobium-mediated transformation, or protoplast transfection or transformation.
[0064] Still another embodiment of this aspect that can be combined with any of the preceding embodiments includes the first nucleic acid sequence being introduced with a first vector, and the second nucleic acid sequence being introduced with a second vector. An additional embodiment of this aspect includes the first nucleic acid sequence being introduced with a binary vector comprising two separate expression cassettes, wherein each expression cassette comprises the first nucleic acid sequence. In a further embodiment of this aspect, the first nucleic acid sequence is operably linked to a first promoter. In an additional embodiment of this aspect, the first promoter is selected from the group of a constitutive promoter, an inducible promoter, a leaf specific promoter, or a mesophyll cell specific promoter. In yet another embodiment of this aspect, the first promoter is a constitutive promoter selected from the group of a CaMV35S promoter, a derivative of the CaMV35S promoter, a CsVMV promoter, a derivative of the CsVMV promoter, a maize ubiquitin promoter, a trefoil promoter, a vein mosaic cassava virus promoter, or an A. thaliana UBQ10 promoter. In still another embodiment of this aspect that can be combined with any of the preceding embodiments, the first nucleic acid sequence is operably linked to a third nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell and the first nucleic acid sequence does not include the native EPYC1 leader sequence and is not operably linked to the native EPYC1 leader sequence. In yet another embodiment of this aspect, the chloroplastic transit peptide is a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 63. In still another embodiment of this aspect, the endogenous chloroplastic transit peptide is SEQ ID NO: 63. Yet another embodiment of this aspect that can be combined with any of the preceding embodiments that has a native EPYC1 leader sequence includes the native EPYC1 leader sequence corresponding to nucleotides 60 to 137 of SEQ ID NO: 65. In a further embodiment of this aspect that can be combined with any of the preceding embodiments, the first nucleic acid sequence is operably linked to one or two terminators. In an additional embodiment of this aspect, the one or two terminators are selected from the group of a HSP terminator, a NOS terminator, an OCS terminator, an intronless extensin terminator, a 35S terminator, a pinII terminator, a rbcS terminator, an actin terminator, or any combination thereof.
[0065] An additional embodiment of this aspect that can be combined with any of the preceding embodiments includes the second nucleic acid sequence being operably linked to a second promoter. A further embodiment of this aspect includes the second promoter being selected from the group consisting of a constitutive promoter, an inducible promoter, a leaf specific promoter, and a mesophyll cell specific promoter. Yet another embodiment of this aspect includes the second promoter being a constitutive promoter selected from the group consisting of a CaMV35S promoter, a derivative of the CaMV35S promoter, a CsVMV promoter, a derivative of the CsVMV promoter, a maize ubiquitin promoter, a trefoil promoter, a vein mosaic cassava virus promoter, or an A. thaliana UBQ10 promoter. Still another embodiment of this aspect that can be combined with any of the preceding embodiments that has the second nucleic acid sequence being operably linked to a second promoter includes the second nucleic acid sequence encoding an algal SSU polypeptide. An additional embodiment of this aspect includes the second nucleic acid sequence being operably linked to a fourth nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell and the second nucleic acid sequence not encoding the native SSU leader sequence and not being operably linked to a nucleic acid sequence encoding the native SSU leader sequence. A further embodiment of this aspect includes the chloroplastic transit peptide being a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 64. Yet another embodiment of this aspect includes the chloroplastic transit peptide being SEQ ID NO: 64. An additional embodiment of this aspect that can be combined with any of the preceding embodiments, which has a native SSU leader sequence, includes the native SSU leader sequence corresponding to amino acids 1 to 45 of SEQ ID NO: 32. In yet another embodiment of this aspect that can be combined with any of the preceding embodiments that has a native algal SSU leader sequence, the native algal SSU leader sequence corresponds to amino acids 1 to 45 of SEQ ID NO: 31. Still another embodiment of this aspect that can be combined with any of the preceding embodiments that has the second nucleic acid sequence being operably linked to a second promoter includes the second nucleic acid sequence being operably linked to a terminator. A further embodiment of this aspect includes the terminator being selected from the group of a HSP terminator, a NOS terminator, an OCS terminator, an intronless extensin terminator, a 35S terminator, a pinII terminator, a rbcS terminator, or an actin terminator. In a further embodiment of this aspect that can be combined with any of the preceding embodiments that has the second nucleic acid sequence being operably linked to a second promoter, the second nucleic acid sequence encodes a modified higher plant Rubisco SSU polypeptide wherein at least part of the higher plant Rubisco SSU polypeptide is replaced with at least part of an algal Rubisco SSU polypeptide.
[0066] In an additional embodiment of this aspect that can be combined with any of the preceding embodiments that has a second vector, the second vector includes one or more gene editing components that target a nuclear genome sequence operably linked to a nucleic acid encoding an endogenous Rubisco SSU polypeptide. A further embodiment of this aspect includes one or more gene editing components being selected from the group of a ribonucleoprotein complex that targets the nuclear genome sequence; a vector comprising a TALEN protein encoding sequence, wherein the TALEN protein targets the nuclear genome sequence; a vector comprising a ZFN protein encoding sequence, wherein the ZFN protein targets the nuclear genome sequence; an oligonucleotide donor (ODN), wherein the ODN targets the nuclear genome sequence; or a vector comprising a CRISPR/Cas enzyme encoding sequence and a targeting sequence, wherein the targeting sequence targets the nuclear genome sequence. Yet another embodiment of this aspect that can be combined with any of the preceding embodiments that has gene editing includes the result of gene editing being at least part of the higher plant Rubisco SSU polypeptide being replaced with at least part of an algal Rubisco SSU polypeptide. A further embodiment of this aspect, which can be combined with any of the preceding embodiments, includes the EPYC1 polypeptide being the EPYC1 polypeptide of any one of the preceding embodiments. An additional embodiment of this aspect includes the Rubisco SSU polypeptide being the Rubisco SSU polypeptide of any one of the preceding embodiments.
[0067] Yet another embodiment of this aspect that can be combined with any of the preceding embodiments that has a first nucleic acid sequence being operably linked to a third nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell and the first nucleic acid sequence not comprising the native EPYC1 leader sequence and not being operably linked to the native EPYC1 leader sequence includes and that has the first nucleic acid sequence being operably linked to one or two terminators includes the first vector including a first copy of the first nucleic acid sequence wherein the first nucleic acid sequence does not include the native EPYC1 leader sequence and is not operably linked to the native EPYC1 leader sequence, wherein the first nucleic acid sequence is operably linked to the third nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell, wherein the first nucleic acid sequence is operably linked to the first promoter, and wherein the first nucleic acid sequence is operably linked to one terminator; and wherein the first vector further includes a second copy of the first nucleic acid sequence wherein the first nucleic acid sequence does not include the native EPYC1 leader sequence and is not operably linked to the native EPYC1 leader sequence, wherein the first nucleic acid sequence is operably linked to the third nucleic acid sequence encoding a chloroplastic transit peptide functional in the higher plant cell, wherein the first nucleic acid sequence is operably linked to a third promoter, and wherein the first nucleic acid sequence is operably linked to two terminators. A further embodiment of this aspect includes the first promoter being selected from the group of a constitutive promoter, an inducible promoter, a leaf specific promoter, or a mesophyll cell specific promoter; wherein the third promoter is selected from the group of a constitutive promoter, an inducible promoter, a leaf specific promoter, or a mesophyll cell specific promoter; and wherein the first and third promoters are not the same. Yet another embodiment of this aspect includes the chloroplastic transit peptide being a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 63. Still another embodiment of this aspect includes the native EPYC1 leader sequence corresponding to nucleotides 60 to 137 of SEQ ID NO: 65. An additional embodiment of this aspect includes the terminators being selected from the group of a HSP terminator, a NOS terminator, an OCS terminator, an intronless extensin terminator, a 35S terminator, a pinII terminator, a rbcS terminator, an actin terminator, or any combination thereof. A further embodiment of this aspect that can be combined with any of the preceding embodiments includes a plant or plant part produced by the method of any one of the preceding embodiments.
[0068] A further aspect of the disclosure includes methods of cultivating the genetically altered plant of any of the preceding embodiments that has a genetically altered plant, including the steps of: a) planting a genetically altered seedling, a genetically altered plantlet, a genetically altered cutting, a genetically altered tuber, a genetically altered root, or a genetically altered seed in soil to produce the genetically altered plant or grafting the genetically altered seedling, the genetically altered plantlet, or the genetically altered cutting to a root stock or a second plant grown in soil to produce the genetically altered plant; b) cultivating the plant to produce harvestable seed, harvestable leaves, harvestable roots, harvestable cuttings, harvestable wood, harvestable fruit, harvestable kernels, harvestable tubers, and/or harvestable grain; and harvesting the harvestable seed, harvestable leaves, harvestable roots, harvestable cuttings, harvestable wood, harvestable fruit, harvestable kernels, harvestable tubers, and/or harvestable grain; and c) harvesting the harvestable seed, harvestable leaves, harvestable roots, harvestable cuttings, harvestable wood, harvestable fruit, harvestable kernels, harvestable tubers, and/or harvestable grain. An additional embodiment of this aspect includes a plant growth rate and/or photosynthetic efficiency of the genetically altered plant of any of the preceding embodiments being comparable to the plant growth rate and/or photosynthetic efficiency of a WT plant. Yet another embodiment of this aspect includes a plant growth rate and/or photosynthetic efficiency of the genetically altered plant of any of the preceding embodiments being improved as compared to the plant growth rate and/or photosynthetic efficiency of a WT plant. Still another embodiment of this aspect includes a yield of the genetically altered plant of any of the preceding embodiments being improved as compared to the yield of a WT plant. A further embodiment of this aspect includes the yield being improved by at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 100%.
Molecular Biological Methods to Produce Genetically Altered Plants and Plant Cells
[0069] One embodiment of the present invention provides a genetically altered plant or plant cell containing a modified Rubisco and an Essential Pyrenoid Component 1 (EPYC1) for formation of an aggregate or condensate of modified Rubisco and EPYC1 polypeptides. For example, the present disclosure provides plants with a first nucleic acid sequence encoding an EPYC1 polypeptide and a second nucleic acid sequence encoding a modified Rubisco. In addition, the present disclosure provides plants with algal EPYC1 polypeptides, modified EPYC1 polypeptides, algal Rubisco small subunit (SSU) polypeptides, and modified Rubisco SSU polypeptides.
[0070] Certain aspects of the present invention relate to the C. reinhardtii protein EPYC1 (C. reinhardtii EPYC1 genomic sequence=SEQ ID NO: 66; C. reinhardtii EPYC1 transcript sequence=SEQ ID NO: 65; C. reinhardtii EPYC1 full length protein=SEQ ID NO: 34; C. reinhardtii mature EPYC1 protein=SEQ ID NO: 35). EPYC1 is a modular protein consisting of four highly similar repeat regions flanked by shorter terminal regions (
[0071] At the N-terminus of the native C. reinhardtii protein EPYC1, a cleavage site at amino acid 26 in SEQ ID NO: 34 (indicated by a black arrow in
[0072] A modified EPYC1 polypeptide of the present invention includes tandem copies of the first EPYC1 repeat domain. A further embodiment of this aspect includes the modified EPYC1 polypeptides including one or more, two or more, four or more, or eight tandem copies of a first algal EPYC1 repeat region. An additional embodiment of this aspect includes the modified EPYC1 polypeptides including four tandem copies or eight tandem copies of the first algal EPYC1 repeat region. Exemplary modified EPYC1 sequences are shown in
[0073] For correct targeting of EPYC1 in a higher plant, a higher plant chloroplast targeting sequence is attached to the EPYC1 sequence. In some embodiments, this chloroplast targeting sequence is the 1A.sub.At chloroplastic transit peptide. In further embodiments, the chloroplast targeting sequence is the 1B.sub.At chloroplastic transit peptide (SEQ ID NO: 18), 2B.sub.At chloroplastic transit peptide (SEQ ID NO: 19), or the 3B.sub.At chloroplastic transit peptide (SEQ ID NO: 20). In additional embodiments, the chloroplast targeting sequence is obtained from chlorophyll a/b-binding protein, Rubisco activase, ferredoxin, or starch synthase proteins. In additional embodiments, the chloroplast transit sequence is a truncated chloroplast transit sequence (e.g., 55 residues of the 1A.sub.At chloroplastic transit peptide). A further embodiment of this aspect includes the chloroplastic transit peptide being a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 64. Yet another embodiment of this aspect includes the chloroplastic transit peptide being SEQ ID NO: 64. Exemplary gene expression cassettes containing the 55 residue 1A.sub.At chloroplastic transit peptide attached to EPYC1 sequences (mature EPYC1 and modified EPYC1) are shown in
[0074] Additional aspects of the present invention relate to an algal Rubisco SSU protein. In some embodiments, the algal Rubisco SSU proteins is a C. reinhardtii Rubisco SSU protein, S1.sub.Cr (SEQ ID NO: 30) or S2.sub.Cr (SEQ ID NO: 2) (
[0075] A modified Rubisco SSU of the present invention includes a higher plant Rubisco SSU modified by substituting one or more higher plant Rubisco SSU α-helices with one or more algal Rubisco SSU α-helices; substituting one or more higher plant Rubisco SSU β-strands with one or more algal Rubisco SSU β-strands; and/or substituting a higher plant Rubisco SSU βA-βB loop with an algal Rubisco SSU βA-βB loop. In some embodiments, the higher plant Rubisco SSU polypeptide is modified by substituting two higher plant Rubisco SSU α-helices with two algal Rubisco SSU α-helices. In additional embodiments, the higher plant Rubisco SSU polypeptide is further modified by substituting four higher plant Rubisco SSU β-strands with four algal Rubisco SSU β-strands, and by substituting a higher plant Rubisco SSU βA-βB loop with an algal Rubisco SSU βA-βB loop. Higher plant Rubisco SSU polypeptides of the present invention include polypeptides having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, or SEQ ID NO: 156. Algal Rubisco SSU polypeptides of the present invention include polypeptides having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 2, SEQ ID NO: 30, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, or SEQ ID NO: 164. In an additional embodiment of this aspect, the algal Rubisco SSU polypeptide is SEQ ID NO: 2, SEQ ID NO: 30, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, or SEQ ID NO: 164. A further embodiment of this aspect includes the two higher plant Rubisco SSU α-helices corresponding to amino acids 23-35 (i.e., SEQ ID NO: 3) and amino acids 80-93 (i.e., SEQ ID NO: 4) in SEQ ID NO: 1 and the two algal Rubisco SSU α-helices corresponding to amino acids 23-35 (i.e., SEQ ID NO: 10) and amino acids 86-99 (i.e., SEQ ID NO: 12) in SEQ ID NO: 2. Yet another embodiment of this aspect that can be combined with any of the preceding embodiments that has two higher plant Rubisco SSU α-helices being substituted with two algal Rubisco SSU α-helices, the higher plant Rubisco SSU polypeptide being further modified by substituting four higher plant Rubisco SSU β-strands with four algal Rubisco SSU β-strands, and by substituting a higher plant Rubisco SSU βA-βB loop with an algal Rubisco SSU βA-βB loop. An additional embodiment of this aspect includes the four higher plant Rubisco SSU β-strands corresponding to amino acids 39-45 (i.e., SEQ ID NO: 5), amino acids 68-70 (i.e., SEQ ID NO: 6), amino acids 98-105 (i.e., SEQ ID NO: 7), and amino acids 110-118 (i.e., SEQ ID NO: 8) in SEQ ID NO: 1, the four algal Rubisco SSU β-strands corresponding to amino acids 39-45 (i.e., SEQ ID NO: 11), amino acids 74-76 (i.e., SEQ ID NO: 6), amino acids 104-111 (i.e., SEQ ID NO: 13), and amino acids 116-124 (i.e., SEQ ID NO: 14) in SEQ ID NO: 2, the higher plant Rubisco SSU βA-βB loop corresponding to amino acids 46-67 (i.e., SEQ ID NO: 9) in SEQ ID NO: 1, and the algal Rubisco SSU βA-βB loop corresponding to amino acids 46-73 (i.e., SEQ ID NO: 15) in SEQ ID NO: 2. In further embodiments, the algal Rubisco SSU βA-βB loop corresponds to SEQ ID NO: 16.
[0076] A higher plant chloroplast targeting sequence is attached to the algal Rubisco SSU or the modified Rubisco SSU. In some embodiments, this chloroplast targeting sequence is the 1A.sub.At chloroplastic transit peptide. In further embodiments, the chloroplast targeting sequence is the 1B.sub.At chloroplastic transit peptide (SEQ ID NO: 18), 2B.sub.At chloroplastic transit peptide (SEQ ID NO: 19), or the 3B.sub.At chloroplastic transit peptide (SEQ ID NO: 20). In additional embodiments, the chloroplast targeting sequence is obtained from chlorophyll a/b-binding protein, Rubisco activase, ferredoxin, or starch synthase proteins. In additional embodiments, the chloroplast transit sequence is a truncated chloroplast transit sequence (e.g., 57 residues of the 1A.sub.At chloroplastic transit peptide). A further embodiment of this aspect includes the chloroplastic transit peptide being a polypeptide having at least 70% sequence identity, at least 71% sequence identity, at least 72% sequence identity, at least 73% sequence identity, at least 74% sequence identity, at least 75% sequence identity, at least 76% sequence identity, at least 77% sequence identity, at least 78% sequence identity, at least 79% sequence identity, at least 80% sequence identity, at least 81% sequence identity, at least 82% sequence identity, at least 83% sequence identity, at least 84% sequence identity, at least 85% sequence identity, at least 86% sequence identity, at least 87% sequence identity, at least 88% sequence identity, at least 89% sequence identity, at least 90% sequence identity, at least 91% sequence identity, at least 92% sequence identity, at least 93% sequence identity, at least 94% sequence identity, at least 95% sequence identity, at least 96% sequence identity, at least 97% sequence identity, at least 98% sequence identity, or at least 99% sequence identity to SEQ ID NO: 63. Yet another embodiment of this aspect includes the chloroplastic transit peptide being SEQ ID NO: 63. Exemplary sequences containing the 57 residue 1A.sub.At chloroplastic transit peptide attached to SSU sequences (S2.sub.Cr with 1A.sub.At-TP (SEQ ID NO: 22) and 1A.sub.A1MOD with 1A.sub.At-TP (SEQ ID NO: 33)) are shown in
[0077] Transformation and generation of genetically altered monocotyledonous and dicotyledonous plant cells is well known in the art. See, e.g., Weising, et al., Ann. Rev. Genet. 22:421-477 (1988); U.S. Pat. No. 5,679,558; Agrobacterium Protocols, ed: Gartland, Humana Press Inc. (1995); and Wang, et al. Acta Hort. 461:401-408 (1998). The choice of method varies with the type of plant to be transformed, the particular application and/or the desired result. The appropriate transformation technique is readily chosen by the skilled practitioner.
[0078] Any methodology known in the art to delete, insert or otherwise modify the cellular DNA (e.g., genomic DNA and organelle DNA) can be used in practicing the inventions disclosed herein. For example, a disarmed Ti plasmid, containing a genetic construct for deletion or insertion of a target gene, in Agrobacterium tumefaciens can be used to transform a plant cell, and thereafter, a transformed plant can be regenerated from the transformed plant cell using procedures described in the art, for example, in EP 0116718, EP 0270822, PCT publication WO 84/02913 and published European Patent application (“EP”) 0242246. Ti-plasmid vectors each contain the gene between the border sequences, or at least located to the left of the right border sequence, of the T-DNA of the Ti-plasmid. Of course, other types of vectors can be used to transform the plant cell, using procedures such as direct gene transfer (as described, for example in EP 0233247), pollen mediated transformation (as described, for example in EP 0270356, PCT publication WO 85/01856, and U.S. Pat. No. 4,684,611), plant RNA virus-mediated transformation (as described, for example in EP 0 067 553 and U.S. Pat. No. 4,407,956), liposome-mediated transformation (as described, for example in U.S. Pat. No. 4,536,475), and other methods such as the methods for transforming certain lines of corn (e.g., U.S. Pat. No. 6,140,553; Fromm et al., Bio/Technology (1990) 8, 833-839); Gordon-Kamm et al., The Plant Cell, (1990) 2, 603-618) and rice (Shimamoto et al., Nature, (1989) 338, 274-276; Datta et al., Bio/Technology, (1990) 8, 736-740) and the method for transforming monocots generally (PCT publication WO 92/09696). For cotton transformation, the method described in PCT patent publication WO 00/71733 can be used. For soybean transformation, reference is made to methods known in the art, e.g., Hinchee et al. (Bio/Technology, (1988) 6, 915) and Christou et al. (Trends Biotech, (1990) 8, 145) or the method of WO 00/42207.
[0079] Genetically altered plants of the present invention can be used in a conventional plant breeding scheme to produce more genetically altered plants with the same characteristics, or to introduce the genetic alteration(s) in other varieties of the same or related plant species. Seeds, which are obtained from the altered plants, preferably contain the genetic alteration(s) as a stable insert in nuclear DNA or as modifications to an endogenous gene or promoter. Plants comprising the genetic alteration(s) in accordance with the invention include plants comprising, or derived from, root stocks of plants comprising the genetic alteration(s) of the invention, e.g., fruit trees or ornamental plants. Hence, any non-transgenic grafted plant parts inserted on a transformed plant or plant part are included in the invention.
[0080] Introduced genetic elements, whether in an expression vector or expression cassette, which result in the expression of an introduced gene, will typically utilize a plant-expressible promoter. A ‘plant-expressible promoter’ as used herein refers to a promoter that ensures expression of the genetic alteration(s) of the invention in a plant cell. Examples of promoters directing constitutive expression in plants are known in the art and include: the strong constitutive 35S promoters (the “35S promoters”) of the cauliflower mosaic virus (CaMV), e.g., of isolates CM 1841 (Gardner et al., Nucleic Acids Res, (1981) 9, 2871-2887), CabbB S (Franck et al., Cell (1980) 21, 285-294; Kay et al., Science, (1987) 236, 4805) and CabbB JI (Hull and Howell, Virology, (1987) 86, 482-493); cassava vein mosaic virus promoter (CsVMV); promoters from the ubiquitin family (e.g., the maize ubiquitin promoter of Christensen et al., Plant Mol Biol, (1992) 18, 675-689, or the A. thaliana UBQ10 promoter of Norris et al. Plant Mol. Biol. (1993) 21, 895-906), the gos2 promoter (de Pater et al., The Plant J (1992) 2, 834-844), the emu promoter (Last et al., Theor Appl Genet, (1990) 81, 581-588), actin promoters such as the promoter described by An et al. (The Plant J, (1996) 10, 107), the rice actin promoter described by Zhang et al. (The Plant Cell, (1991) 3, 1155-1165); promoters of the Cassava vein mosaic virus (WO 97/48819, Verdaguer et al. (Plant Mol Biol, (1998) 37, 1055-1067), the pPLEX series of promoters from Subterranean Clover Stunt Virus (WO 96/06932, particularly the S4 or S7 promoter), an alcohol dehydrogenase promoter, e.g., pAdh1S (GenBank accession numbers X04049, X00581), and the TR1′ promoter and the TR2′ promoter (the “TR1′ promoter” and “TR2′ promoter”, respectively) which drive the expression of the 1′ and 2′ genes, respectively, of the T DNA (Velten et al., EMBO J, (1984) 3, 2723 2730).
[0081] Alternatively, a plant-expressible promoter can be a tissue-specific promoter, i.e., a promoter directing a higher level of expression in some cells or tissues of the plant, e.g., in leaf mesophyll cells. In preferred embodiments, leaf mesophyll specific promoters or leaf guard cell specific promoters will be used. Non-limiting examples include the leaf specific Rbcs1A promoter (A. thaliana RuBisCO small subunit 1A (AT1G67090) promoter), GAPA-1 promoter (A. thaliana Glyceraldehyde 3-phosphate dehydrogenase A subunit 1 (AT3G26650) promoter), and FBA2 promoter (A. thaliana Fructose-bisphosphate aldolase 2 317 (AT4G38970) promoter) (Kromdijk et al., Science, 2016). Further non-limiting examples include the leaf mesophyll specific FBPase promoter (Peleget al., Plant J, 2007), the maize or rice rbcS promoter (Nomura et al., Plant Mol Biol, 2000), the leaf guard cell specific A. thaliana KAT1 promoter (Nakamura et al., Plant Phys, 1995), the A. thaliana Myrosinase-Thioglucoside glucohydrolase 1 (TGG1) promoter (Husebye et al., Plant Phys, 2002), the A. thaliana rha1 promoter (Terryn et al., Plant Cell, 1993), the A. thaliana AtCHX20 promoter (Padmanaban et al., Plant Phys, 2007), the A. thaliana HIC (High carbon dioxide) promoter (Gray et al., Nature, 2000), the A. thaliana CYTOCHROME P450 86A2 (CYP86A2) mono-oxygenase promoter (pCYP) (Francia et al., Plant Signal & Behav, 2008; Galbiati et al., The Plant Journal, 2008), the potato ADP-glucose pyrophosphorylase (AGPase) promoter (Muller-Rober et al., The Plant Cell 1994), the grape R2R3 MYB60 transcription factor promoter (Galbiati et al., BMC Plant Bio, 2011), the A. thaliana AtMYB60 promoter (Cominelli et al., Current Bio, 2005; Cominelli et al., BMC Plant Bio, 2011), the A. thaliana At1g22690-promoter (pGC1) (Yang et al., Plant Methods, 2008), and the A. thaliana AtMYB 61 promoter (Liang et al., Curr Biol, 2005). These plant promoters can be combined with enhancer elements, they can be combined with minimal promoter elements, or can comprise repeated elements to ensure the expression profile desired.
[0082] In some embodiments, genetic elements to increase expression in plant cells can be utilized. For example, an intron at the 5′ end or 3′ end of an introduced gene, or in the coding sequence of the introduced gene, e.g., the hsp70 intron. Other such genetic elements can include, but are not limited to, promoter enhancer elements, duplicated or triplicated promoter regions, 5′ leader sequences different from another transgene or different from an endogenous (plant host) gene leader sequence, 3′ trailer sequences different from another transgene used in the same plant or different from an endogenous (plant host) trailer sequence.
[0083] An introduced gene of the present invention can be inserted in host cell DNA so that the inserted gene part is upstream (i.e., 5′) of suitable 3′ end transcription regulation signals (e.g., transcript formation and polyadenylation signals). This is preferably accomplished by inserting the gene in the plant cell genome (nuclear or chloroplast). Preferred polyadenylation and transcript formation signals include those of the A. tumefaciens nopaline synthase gene (Nos terminator; Depicker et al., J. Molec Appl Gen, (1982) 1, 561-573), the octopine synthase gene (OCS terminator; Gielen et al., EMBO J, (1984) 3:835 845), the A. thaliana heat shock protein terminator (HSP terminator); the SCSV or the Malic enzyme terminators (Schunmann et al., Plant Funct Biol, (2003) 30:453-460), and the T DNA gene 7 (Velten and Schell, Nucleic Acids Res, (1985) 13, 6981 6998), which act as 3′ untranslated DNA sequences in transformed plant cells. In some embodiments, one or more of the introduced genes are stably integrated into the nuclear genome. Stable integration is present when the nucleic acid sequence remains integrated into the nuclear genome and continues to be expressed (e.g., detectable mRNA transcript or protein is produced) throughout subsequent plant generations. Stable integration into and/or editing of the nuclear genome can be accomplished by any known method in the art (e.g., microparticle bombardment, Agrobacterium-mediated transformation, CRISPR/Cas9, electroporation of protoplasts, microinjection, etc.).
[0084] The term recombinant or modified nucleic acids refers to polynucleotides which are made by the combination of two otherwise separated segments of sequence accomplished by the artificial manipulation of isolated segments of polynucleotides by genetic engineering techniques or by chemical synthesis. In so doing one may join together polynucleotide segments of desired functions to generate a desired combination of functions.
[0085] As used herein, the terms “overexpression” and “upregulation” refer to increased expression (e.g., of mRNA, polypeptides, etc.) relative to expression in a wild type organism (e.g., plant) as a result of genetic modification. In some embodiments, the increase in expression is a slight increase of about 10% more than expression in wild type. In some embodiments, the increase in expression is an increase of 50% or more (e.g., 60%, 70%, 80%, 100%, etc.) relative to expression in wild type. In some embodiments, an endogenous gene is overexpressed. In some embodiments, an exogenous gene is overexpressed by virtue of being expressed. Overexpression of a gene in plants can be achieved through any known method in the art, including but not limited to, the use of constitutive promoters, inducible promoters, high expression promoters, enhancers, transcriptional and/or translational regulatory sequences, codon optimization, modified transcription factors, and/or mutant or modified genes that control expression of the gene to be overexpressed.
[0086] Where a recombinant nucleic acid is intended for expression, cloning, or replication of a particular sequence, DNA constructs prepared for introduction into a host cell will typically comprise a replication system (e.g. vector) recognized by the host, including the intended DNA fragment encoding a desired polypeptide, and can also include transcription and translational initiation regulatory sequences operably linked to the polypeptide-encoding segment. Additionally, such constructs can include cellular localization signals (e.g., plasma membrane localization signals). In preferred embodiments, such DNA constructs are introduced into a host cell's genomic DNA, chloroplast DNA or mitochondrial DNA.
[0087] In some embodiments, a non-integrated expression system can be used to induce expression of one or more introduced genes. Expression systems (expression vectors) can include, for example, an origin of replication or autonomously replicating sequence (ARS) and expression control sequences, a promoter, an enhancer and necessary processing information sites, such as ribosome-binding sites, RNA splice sites, polyadenylation sites, transcriptional terminator sequences, and mRNA stabilizing sequences. Signal peptides can also be included where appropriate from secreted polypeptides of the same or related species, which allow the protein to cross and/or lodge in cell membranes, cell wall, or be secreted from the cell.
[0088] Selectable markers useful in practicing the methodologies of the invention disclosed herein can be positive selectable markers. Typically, positive selection refers to the case in which a genetically altered cell can survive in the presence of a toxic substance only if the recombinant polynucleotide of interest is present within the cell. Negative selectable markers and screenable markers are also well known in the art and are contemplated by the present invention. One of skill in the art will recognize that any relevant markers available can be utilized in practicing the inventions disclosed herein.
[0089] Screening and molecular analysis of recombinant strains of the present invention can be performed utilizing nucleic acid hybridization techniques. Hybridization procedures are useful for identifying polynucleotides, such as those modified using the techniques described herein, with sufficient homology to the subject regulatory sequences to be useful as taught herein. The particular hybridization techniques are not essential to the subject invention. As improvements are made in hybridization techniques, they can be readily applied by one of skill in the art. Hybridization probes can be labeled with any appropriate label known to those of skill in the art. Hybridization conditions and washing conditions, for example temperature and salt concentration, can be altered to change the stringency of the detection threshold. See, e.g., Sambrook et al. (1989) vide infra or Ausubel et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, NY, N.Y., for further guidance on hybridization conditions.
[0090] Additionally, screening and molecular analysis of genetically altered strains, as well as creation of desired isolated nucleic acids can be performed using Polymerase Chain Reaction (PCR). PCR is a repetitive, enzymatic, primed synthesis of a nucleic acid sequence. This procedure is well known and commonly used by those skilled in this art (see Mullis, U.S. Pat. Nos. 4,683,195, 4,683,202, and 4,800,159; Saiki et al. (1985) Science 230:1350-1354). PCR is based on the enzymatic amplification of a DNA fragment of interest that is flanked by two oligonucleotide primers that hybridize to opposite strands of the target sequence. The primers are oriented with the 3′ ends pointing towards each other. Repeated cycles of heat denaturation of the template, annealing of the primers to their complementary sequences, and extension of the annealed primers with a DNA polymerase result in the amplification of the segment defined by the 5′ ends of the PCR primers. Because the extension product of each primer can serve as a template for the other primer, each cycle essentially doubles the amount of DNA template produced in the previous cycle. This results in the exponential accumulation of the specific target fragment, up to several million-fold in a few hours. By using a thermostable DNA polymerase such as the Taq polymerase, which is isolated from the thermophilic bacterium Thermus aquaticus, the amplification process can be completely automated. Other enzymes which can be used are known to those skilled in the art.
[0091] Nucleic acids and proteins of the present invention can also encompass homologues of the specifically disclosed sequences. Homology (e.g., sequence identity) can be 50%-100%. In some instances, such homology is greater than 80%, greater than 85%, greater than 90%, or greater than 95%. The degree of homology or identity needed for any intended use of the sequence(s) is readily identified by one of skill in the art. As used herein percent sequence identity of two nucleic acids is determined using an algorithm known in the art, such as that disclosed by Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is incorporated into the BLASTN, BLASTP, and BLASTX, programs of Altschul et al. (1990) J. Mol. Biol. 215:402-410. BLAST nucleotide searches are performed with the BLASTN program, score=100, wordlength=12, to obtain nucleotide sequences with the desired percent sequence identity. To obtain gapped alignments for comparison purposes, Gapped BLAST is used as described in Altschul et al. (1997) Nucl. Acids. Res. 25:3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (BLASTN and BLASTX) are used. See www.ncbi.nih.gov. One of skill in the art can readily determine in a sequence of interest where a position corresponding to amino acid or nucleic acid in a reference sequence occurs by aligning the sequence of interest with the reference sequence using the suitable BLAST program with the default settings (e.g., for BLASTP: Gap opening penalty: 11, Gap extension penalty: 1, Expectation value: 10, Word size: 3, Max scores: 25, Max alignments: 15, and Matrix: blosum62; and for BLASTN: Gap opening penalty: 5, Gap extension penalty:2, Nucleic match: 1, Nucleic mismatch −3, Expectation value: 10, Word size: 11, Max scores: 25, and Max alignments: 15).
[0092] Preferred host cells are plant cells. Recombinant host cells, in the present context, are those which have been genetically modified to contain an isolated nucleic molecule, contain one or more deleted or otherwise non-functional genes normally present and functional in the host cell, or contain one or more genes to produce at least one recombinant protein. The nucleic acid(s) encoding the protein(s) of the present invention can be introduced by any means known to the art which is appropriate for the particular type of cell, including without limitation, transformation, lipofection, electroporation or any other methodology known by those skilled in the art.
Plant Breeding Methods
[0093] Plant breeding begins with the analysis of the current germplasm, the definition of problems and weaknesses of the current germplasm, the establishment of program goals, and the definition of specific breeding objectives. The next step is the selection of germplasm that possess the traits to meet the program goals. The selected germplasm is crossed in order to recombine the desired traits and through selection, varieties or parent lines are developed. The goal is to combine in a single variety or hybrid an improved combination of desirable traits from the parental germplasm. These important traits may include higher yield, field performance, improved fruit and agronomic quality, resistance to biological stresses, such as diseases and pests, and tolerance to environmental stresses, such as drought and heat.
[0094] Each breeding program should include a periodic, objective evaluation of the efficiency of the breeding procedure. Evaluation criteria vary depending on the goal and objectives, but should include gain from selection per year based on comparisons to an appropriate standard, overall value of the advanced breeding lines, and number of successful cultivars produced per unit of input (e.g., per year, per dollar expended, etc.). Promising advanced breeding lines are thoroughly tested and compared to appropriate standards in environments representative of the commercial target area(s) for three years at least. The best lines are candidates for new commercial cultivars; those still deficient in a few traits are used as parents to produce new populations for further selection. These processes, which lead to the final step of marketing and distribution, usually take five to ten years from the time the first cross or selection is made.
[0095] The choice of breeding or selection methods depends on the mode of plant reproduction, the heritability of the trait(s) being improved, and the type of cultivar used commercially (e.g., F.sub.1 hybrid cultivar, inbred cultivar, etc.). For highly heritable traits, a choice of superior individual plants evaluated at a single location will be effective, whereas for traits with low heritability, selection should be based on mean values obtained from replicated evaluations of families of related plants. The complexity of inheritance also influences the choice of the breeding method. Backcross breeding is used to transfer one or a few genes for a highly heritable trait into a desirable cultivar (e.g., for breeding disease-resistant cultivars), while recurrent selection techniques are used for quantitatively inherited traits controlled by numerous genes, various recurrent selection techniques are used. Commonly used selection methods include pedigree selection, modified pedigree selection, mass selection, and recurrent selection.
[0096] Pedigree selection is generally used for the improvement of self-pollinating crops or inbred lines of cross-pollinating crops. Two parents which possess favorable, complementary traits are crossed to produce an F.sub.1. An F2 population is produced by selfing one or several F.sub.1s or by intercrossing two F.sub.1s (sib mating). Selection of the best individuals is usually begun in the F.sub.2 population; then, beginning in the F.sub.3, the best individuals in the best families are selected. Replicated testing of families, or hybrid combinations involving individuals of these families, often follows in the F.sub.4 generation to improve the effectiveness of selection for traits with low heritability. At an advanced stage of inbreeding (i.e., F.sub.6 and F.sub.7), the best lines or mixtures of phenotypically similar lines are tested for potential release as new cultivars.
[0097] Mass and recurrent selections can be used to improve populations of either self- or cross-pollinating crops. A genetically variable population of heterozygous individuals is either identified or created by intercrossing several different parents. The best plants are selected based on individual superiority, outstanding progeny, or excellent combining ability. The selected plants are intercrossed to produce a new population in which further cycles of selection are continued.
[0098] Backcross breeding (i.e., recurrent selection) may be used to transfer genes for a simply inherited, highly heritable trait into a desirable homozygous cultivar or line that is the recurrent parent. The source of the trait to be transferred is called the donor parent. The resulting plant is expected to have the attributes of the recurrent parent (e.g., cultivar) and the desirable trait transferred from the donor parent. After the initial cross, individuals possessing the phenotype of the donor parent are selected and repeatedly crossed (backcrossed) to the recurrent parent. The resulting plant is expected to have the attributes of the recurrent parent (e.g., cultivar) and the desirable trait transferred from the donor parent.
[0099] The single-seed descent procedure in the strict sense refers to planting a segregating population, harvesting a sample of one seed per plant, and using the one-seed sample to plant the next generation. When the population has been advanced from the F.sub.2 to the desired level of inbreeding, the plants from which lines are derived will each trace to different F.sub.2 individuals. The number of plants in a population declines each generation due to failure of some seeds to germinate or some plants to produce at least one seed. As a result, not all of the F.sub.2 plants originally sampled in the population will be represented by a progeny when generation advance is completed.
[0100] In addition to phenotypic observations, the genotype of a plant can also be examined. There are many laboratory-based techniques available for the analysis, comparison and characterization of plant genotype; among these are Isozyme Electrophoresis, Restriction Fragment Length Polymorphisms (RFLPs), Randomly Amplified Polymorphic DNAs (RAPDs), Arbitrarily Primed Polymerase Chain Reaction (AP-PCR), DNA Amplification Fingerprinting (DAF), Sequence Characterized Amplified Regions (SCARs), Amplified Fragment Length polymorphisms (AFLPs), Simple Sequence Repeats (SSRs—which are also referred to as Microsatellites), and Single Nucleotide Polymorphisms (SNPs).
[0101] Molecular markers, or “markers”, can also be used during the breeding process for the selection of qualitative traits. For example, markers closely linked to alleles or markers containing sequences within the actual alleles of interest can be used to select plants that contain the alleles of interest. The use of markers in the selection process is often called genetic marker enhanced selection or marker-assisted selection. Methods of performing marker analysis are generally known to those of skill in the art.
[0102] Mutation breeding may also be used to introduce new traits into plant varieties. Mutations that occur spontaneously or are artificially induced can be useful sources of variability for a plant breeder. The goal of artificial mutagenesis is to increase the rate of mutation for a desired characteristic. Mutation rates can be increased by many different means including temperature, long-term seed storage, tissue culture conditions, radiation (such as X-rays, Gamma rays, neutrons, Beta radiation, or ultraviolet radiation), chemical mutagens (such as base analogs like 5-bromo-uracil), antibiotics, alkylating agents (such as sulfur mustards, nitrogen mustards, epoxides, ethyleneamines, sulfates, sulfonates, sulfones, or lactones), azide, hydroxylamine, nitrous acid or acridines. Once a desired trait is observed through mutagenesis the trait may then be incorporated into existing germplasm by traditional breeding techniques. Details of mutation breeding can be found in Principles of Cultivar Development: Theory and Technique, Walter Fehr (1991), Agronomy Books, 1 (https://lib.dr.iastate.edu/agron_books/1).
[0103] The production of double haploids can also be used for the development of homozygous lines in a breeding program. Double haploids are produced by the doubling of a set of chromosomes from a heterozygous plant to produce a completely homozygous individual. For example, see Wan, et al., Theor. Appl. Genet., 77:889-892, 1989.
[0104] Additional non-limiting examples of breeding methods that may be used include, without limitation, those found in Principles of Plant Breeding, John Wiley and Son, pp. 115-161 (1960); Principles of Cultivar Development: Theory and Technique, Walter Fehr (1991), Agronomy Books, 1 (https://lib.dr.iastate.edu/agron_books/1), which are herewith incorporated by reference.
[0105] Having generally described this invention, the same will be better understood by reference to certain specific examples, which are included herein to further illustrate the invention and are not intended to limit the scope of the invention as defined by the claims.
EXAMPLES
[0106] The present disclosure is described in further detail in the following examples which are not in any way intended to limit the scope of the disclosure as claimed. The attached figures are meant to be considered as integral parts of the specification and description of the disclosure. The following examples are offered to illustrate, but not to limit the claimed disclosure.
Example 1: Rubisco and EPYC1 Interact and can be Engineered to Increase their Interaction Strength
[0107] The following example describes the development and engineering of different variants of EPYC1 and different variants of the Rubisco Small Subunit (SSU). The example also describes yeast two-hybrid experiments testing the interactions between EPYC1 variants and Rubisco SSU variants.
Materials and Methods
[0108] Chlamydomonas reinhardtii and Arabidopsis thaliana Rubisco Small Subunits (SSUs) and the C. Reinhardtii Protein Essential Pyrenoid Component 1 (EPYC1)
[0109] C. reinhardtii has two similar Rubisco SSU homologs, S1.sub.Cr (SEQ ID NO: 30) and S2.sub.Cr (SEQ ID NO: 2), which are the same size and have identical α-helices and β-sheets. S1.sub.Cr and S2.sub.Cr share a 97.1% identity at the protein level, and differ in amino acid sequence by only four residues (indicated in bold in
[0110] The C. reinhardtii protein EPYC1 is a modular protein consisting of four highly similar repeat regions flanked by shorter terminal regions (
Yeast Two-Hybrid (Y2H)
[0111] The yeast two-hybrid plasmid vectors pGBKT7 (binding domain vector) and pGADT7 (activation domain vector) were used to detect interactions between proteins of interest. Genes were amplified using Q5 DNA polymerase (NEB) and the primers listed in Table 1. Both S1.sub.Cr and S2.sub.Cr were used in initial yeast two-hybrid testing, and then S2.sub.Cr was used in later experiments due to being more highly expressed in C. reinhardtii. The coding sequence of EPYC1 was codon optimized for expression in higher plants using an online tool (www.idtdna.com/CodonOpt). All variants of EPYC1 were synthesized as Gblock fragments (IDT), and amplified using the primers listed in Table 1. Amplified genes were then cloned into each vector using the multiple cloning site, thus creating fusions with either the GAL4 DNA binding or activation domain, respectively.
TABLE-US-00001 TABLE 1 List of primers used for producing the vectors used in the yeast two-hybrid assays. Primer name Primer sequence Vector EPYC.1 BD&AD Fw TTTTGAATTCATGGCTACGATCAGTT pGBKT7_EPYC1 CTATGAGAGT (SEQ ID NO: 72) pGADT7_EPYC1 EPYC.1 BD&AD Rev ATAGGATCCTCAAAGGCCCTTTCTC CAGTCTG (SEQ ID NO: 73) RbcS1 mature BD&AD AAAAGAATTCGTGTGGACACCGGTG pGADT7_S1.sub.Cr Fw AACAACAAG (SEQ ID NO: 74) pGBKT7_S1.sub.Cr RbcS1 BD&AD Rev ATACCCGGGACGTTTGTTGGCTGGT TGGAAATC (SEQ ID NO: 75) matRbcS2 Fw AD AAAAGAATTCGTGTGGACACCGGTG pGADT7_S2.sub.Cr AACAACAAG (SEQ ID NO: 74) pGBKT7_S2.sub.Cr matRbcS2 Rev AD TATCCCGGGACGTTTGTTGGCTGGTT GC (SEQ ID NO: 76) matRbcS1A (&mod) AAACCCGGGCATGCAGGTGTGGCCT pGADT7_1A.sub.At Fw AD CCG (SEQ ID NO: 77) pGADT7_1AAtMOD matRbcS1A (&mod) AAAGGATCCTTAACCGGTGAAGCTT pGADT7_1AAtMOD(β-sheets) Rev AD GGTGGC (SEQ ID NO: 78) pGADT7_1AAtMOD(loop) pGADT7_1AAtMOD(β- sheets+loop) pGADT7_1AAtMOD(α- helices+β-sheets) pGADT7_1AAtMOD(α- he1ices+β-sheets+loop) RbcL BD&AD Fw ATATGAATTCATGGTTCCACAAACA pGADT7_LSUCr GAAACTAAAGCA (SEQ ID NO: 79) pGBKT7_LSUCr RbcL BD&AD Rev CCCGGATCCTTAAAGTTTGTCAATA GTATCAAATTCGA (SEQ ID NO: 80) Ctr1EPYC.1/LCI5 Rev TTTGGATCCTCTGTTCGTTGCACTAC pGBKT7_N-ter EPYC1 BD TAGCTCTT (SEQ ID NO: 81) Ctr2EPYC.1/LCI5 Rev TTTGGATCCGGCCTTCTTTGAAGCTG pGBKT7_N-ter+1rep EPYC1 BD AGCTACTT (SEQ ID NO: 82) Ctr3EPYC.1/LCI5 Rev AATGGATCCGGCCTTCTTGCTGGAA pGBKT7_N-ter+2reps EPYC1 BD GAACTCCTA (SEQ ID NO: 83) Ctr4EPYC.1/LCI5 Rev TTTGGATCCTGCTTTTTTGCTCGCCG pGBKT7_N-ter+3reps EPYC1 BD ATGAGCTACG (SEQ ID NO: 84) Ctr5EPYC.1/LCI5 Rev ATAGGATCCGGCTTTGTCAGCGGAG pGBKT7_N-ter+4reps EPYC1 BD GAACTAGATGAC (SEQ ID NO: 85) Ntr5EPYC.1/LCI5 Fw TTTTGAATTCGTGAGCCCAACAAGA pGBKT7_4reps+C-ter EPYC1 AGCGTTCTC (SEQ ID NO: 86) Ntr4EPYC.1/LCI5 Fw TTTTGAATTCGTTACTCCTTCAAGAA pGBKT7_3reps+C-ter EPYC1 GTGCCTTGC (SEQ ID NO: 87) Ntr3EPYC.1/LCI5 Fw TTTTGAATTCGTCACTCCGTCTCGTT pGBKT7_2reps+C-ter EPYC1 CAGCTC (SEQ ID NO: 88) Ntr2EPYC.1/LCI5 Fw TTTTGAATTCGTCACCCCTAGTAGAT pGBKT7_1rep1+C-ter EPYC1 CGGCC (SEQ ID NO: 89) Ntr1EPYC.1/LCI5 Fw AAAAGAATTCGGAACTAATCCTTGG pGBKT7_C-ter EPYC1 ACAGGTAAAAGC (SEQ ID NO: 90) EPYC rep1 A for ACGTACCGGTCTCCACATCCCGGGG All pGBKT7_synthEPYC GTGAGCCCAACAAGAAGCG (SEQ ID vectors NO: 91) EPYC rep1 T rev ACGTACCGGTCTCCACAAGGATCCG GCCTTCTTTGAAGCTGAG (SEQ ID NO: 92) EPYC rep1 B for ACGTACCGGTCTCCTGTAAGCCCAA pGBKT7_synthEPYC1 2reps CAAGAAGCGTTC (SEQ ID NO: 93) pGBKT7_synthEPYC1 4reps EPYC rep1 B rev ACGTACCGGTCTCCTACAGCCTTCTT pGBKT7_synthEPYC1 8reps TGAAGCTGAG (SEQ ID NO: 94) EPYC rep1 C for ACGTACCGGTCTCCGGTTAGCCCAA pGBKT7_synthEPYC1 4reps CAAGAAGCGTTC (SEQ ID NO: 95) pGBKT7_synthEPYC1 2α- EPYC rep1 C rev ACGTACCGGTCTCCAACCGCCTTCTT helices 4reps TGAAGCTGAG (SEQ ID NO: 96) EPYC rep1 D for ACGTACCGGTCTCCCGTCAGCCCAA CAAGAAGCGTTC (SEQ ID NO: 97) EPYC rep1 D rev ACGTACCGGTCTCCGACGGCCTTCT TTGAAGCTGAG (SEQ ID NO: 98) EPYC rep1 A2 for ACGTACCGGTCTCCACATCCCGGGG pGBKT7_synthEPYC1 8reps GTGAG (SEQ ID NO: 99) EPYC rep1 T2 rev GCCACTTGGTCTCGACAAGGATCCG GCCTTC (SEQ ID NO: 100) EPYC rep1 E for CTCTGTGAAGACAGGTCTCGAGTGA GCCCAAC (SEQ ID NO: 101) EPYC rep1 E rev CTTCGTGAAGGGTCTCACACTGCCT TCTTTG (SEQ ID NO: 102) synthEPYC J for TTGAATCACTCAGAAATAATTGGAG pGBKT7_synthEPYC1 2α- GCAAGAACTTG (SEQ ID NO: 103) helices lrep synthEPYC J rev CAAGTTCTTGCCTCCAATTATTTCTG AGTGATTCAA (SEQ ID NO: 104) EPYC rep1 H for ACGTACCGGTCTCATCAGAACGGCA pGBKT7_synthEPYC1 GCTCGTCG (SEQ ID NO: 105) modified α-helix lrep EPYC rep1 H rev ACGTACCGGTCTCTCTGATTTCTGAG TGATTCAAGTTC (SEQ ID NO: 106) EPYC rep1 G for ACGTACCGGTCTCCGTAGAAATGGT pGBKT7_synthEPYC1 α- AACGGCAGC (SEQ ID NO: 107) helix knockout 1 EPYC rep1 G rev ACGTACCGGTCTCCCTACGTGATTC AAGTTCTTG (SEQ ID NO: 108) synthEPYC I for ACGTACCGGTCTCATGGCTTGAATC pGBKT7_synthEPYC1 α- ACTCAGAAATG (SEQ ID NO: 109 helix knockout 2 synthEPYC I rev ACGTACCGGTCTCAGCCATTGCCTC CAATTAGCTG (SEQ ID NO: 110) matLCIB Fw AD ATACATATGCAAGCAGCATCAACAG pGADT7_LCIB CGGTTGC (SEQ ID NO: 111) matLCIB Rev AD ATACCCGGGGTTTTTTGGTGCTTCAA ATGACGGGTG (SEQ ID NO: 112) matLCIC Fw AD TATCCCGGGTAGTCAAGCTCTCACT pGADT7_LCIC GTTAGCCAA (SEQ ID NO: 113) matLCIC Rev AD TATGGATCCGTTCATATTAGCTAGCT CGGGAGA (SEQ ID NO: 114) CAH3 BD&AD Fw ATTTGAATTCCGAAGCGCAGTTCTT pGADT7_CAH3 CAGAGAG (SEQ ID NO: 115) CAH3 BD&AD Rev TTAGGATCCTCAGAGCTCATACTCC ACAAGTCTA (SEQ ID NO: 116) CP12 Fw AD TTTTGAATTCGGTCCGGTCCATTTGA pGADT7_CP12 ACAATTCG (SEQ ID NO: 117) CP12 Rrev AD TTTCCCGGGGCACTCGTTGGTCTCA GGATTGTC (SEQ ID NO: 118)
[0112] Competent yeast cells (Y2H Gold, Clontech) were prepared from a 50 ml culture grown in YPDA medium supplemented with kanamycin (50 μg ml.sup.−1). Cells were washed with ddH2O and a lithium acetate/TE solution (100 mM LiAc, 10 mM Tris-HCl [pH 7.5], 1 mM EDTA) before re-suspending in lithium acetate/TE solution. Cells were then co-transformed with binding and activation domain vectors by mixing 50 μl of competent cells with 1 μg of each plasmid vector and a PEG solution (100 mM LiAc, 10 mM Tris-HCl [pH 7.5], 1 mM EDTA, 40% [v/v] PEG 4000). Cells were incubated at 30° C. for 30 min, then subjected to a heat shock of 42° C. for 20 min. The cells were centrifuged, re-suspended in 500 μl YPDA and incubated at 30° C. for ca 90 min, then centrifuged and washed in TE (10 mM Tris-HCl [pH 7.5], 1 mM EDTA). The pellet was re-suspended in 200 μl TE, spread onto SD-L-W (standard dextrose medium (minimal yeast medium) lacking leucine and tryptophan, Anachem) and grown for 3 days at 30° C. Ten to fifteen of the resulting colonies were pooled per co-transformation and grown in a single culture for 24 hrs. The following day 1 ml of culture was harvested, cell density (OD.sub.600) measured, centrifuged and then diluted in TE to give a final OD.sub.600 of 0.5 or 0.1.
[0113] Yeast cultures were then plated onto SD-L-W (yeast synthetic minimal media lacking leucine (L) and tryptophan (W)) and SD-L-W-H (yeast synthetic minimal media lacking L, W, and histidine(H)) (Anachem). Yeast expressing both binding and activation domain constructs was grown on SD-L-W to confirm presence of both plasmids. To assess interaction strength, yeast was plated onto SD-L-W-H with differing concentrations of the HIS3 inhibitor 3-aminotriazole (3-AT). These plates were then incubated for 3 days before assessing for presence or absence of growth, to perform a semi-quantitative yeast two-hybrid assay as in van Nues and Beggs (van Nues and Beggs, Genetics (2000) 157: 1451-1467). The same yeast transformation was used for each interaction study. Different colonies on the same yeast transformation plate were considered independent biological replicates (as for E. coli). Two biological replicates (top and bottom row for each interaction) were spotted from different liquid culture concentrations (0.5 and 0.1 OD). Each interaction experiment was performed at least twice. Summary figures of the yeast interaction studies are shown in
[0114] Table 2 provides descriptions of the vectors that were used in the yeast two-hybrid assays.
TABLE-US-00002 TABLE 2 Vectors used for yeast two-hybrid assays. Vector Description pGBKT7_EPYC1 Full-length codon-optimized EPYC1 in yeast two-hybrid (Y2H) binding domain vector pGADT7_EPYC1 Full-length codon-optimized EPYC1 in Y2H Activation domain vector pGADT7_S1.sub.Cr C. reinhardtii Rubisco small subunit (SSU) RbcS1 in Y2H activation domain vector pGADT7_S2.sub.Cr C. reinhardtii SSU RbcS2 in Y2H activation domain vector pGADT7_1A.sub.At A. thaliana SSU RbcS1A in Y2H activation domain vector pGADT7_1A.sub.AtMOD(α-helices) A. thaliana SSU RbcS1A with modified alpha-helices in Y2H activation domain vector pGADT7_LSU.sub.Cr C. reinhardtii Rubisco large subunit in Y2H activation domain vector pGADT7_1A.sub.AtMOD(β-sheets) A. thaliana SSU RbcS1A with modified β-sheets in Y2H activation domain vector pGADT7_1A.sub.AtMOD(loop) A. thaliana SSU RbcS1A with modified loop in Y2H activation domain vector pGADT7_1A.sub.AtMOD(β- A. thaliana SSU RbcS1A with modified β-sheets and loop in Y2H sheets + loop) activation domain vector pGADT7_1A.sub.AtMOD(α- A. thaliana SSU RbcS1A with modified α-helices and β-sheets in helices + β-sheets) Y2H activation domain vector pGADT7_1A.sub.AtMOD(α- A. thaliana SSU RbcS1A with modified α-helices, β-sheets and helices + β-sheets + loop) loop in Y2H activation domain vector pGBKT7_N-ter EPYC1 N-terminus of EPYC1 in Y2H binding domain vector pGBKT7_N-ter + 1rep EPYC1 N-terminus and first repeat of EPYC1 in Y2H binding domain vector pGBKT7_N-ter + 2reps EPYC1 N-terminus and first two repeats of EPYC1 in Y2H binding domain vector pGBKT7_N-ter + 3reps EPYC1 N-terminus and first three repeats of EPYC1 in Y2H binding domain vector pGBKT7_N-ter + 4reps EPYC1 N-terminus and all four repeats of EPYC1 in Y2H binding domain vector pGBKT7_4reps + C-ter EPYC1 All four repeats plus C-terminus of EPYC1 in Y2H binding domain vector pGBKT7_3reps + C-ter EPYC1 First three repeats plus C-terminus of EPYC1 in Y2H binding domain vector pGBKT7_2reps + C-ter EPYC1 First two repeats plus C-terminus of EPYC1 in Y2H binding domain vector pGBKT7_1rep1 + C-ter EPYC1 First repeat plus C-terminus of EPYC1 in Y2H binding domain vector pGBKT7_C-ter EPYC1 C-terminus of EPYC1 in Y2H binding domain vector pGBKT7_mEPYC1 Mature EPYC (minus C-terminus) in Y2H binding domain vector pGBKT7_mEPYC1-α1 Mature EPYC with 1 α-helix mutation in Y2H binding domain vector pGBKT7_mEPYC1-α1,2 Mature EPYC with 1,2 α-helix mutations in Y2H binding domain vector pGBKT7_mEPYC1-α1,2,3 Mature EPYC with 1,2,3 α-helix mutations in Y2H binding domain vector pGBKT7_mEPYC1-α1,2,3,4 Mature EPYC with 1,2,3,4 α-helix mutations in Y2H binding domain vector pGBKT7_mEPYC1-α3,4 Mature EPYC with 3,4 α-helix mutations in Y2H binding domain vector pGBKT7_mEPYC1-α4 Mature EPYC with 4 α-helix mutation in Y2H binding domain vector pGBKT7_synthEPYC1 1rep Repeat 1 of EPYC1 in Y2H binding domain vector pGBKT7_synthEPYC1 2reps Two times repeat 1 of EPYC1 in Y2H binding domain vector pGBKT7_synthEPYC1 4reps Four times repeat 1 of EPYC1 in Y2H binding domain vector pGBKT7_synthEPYC1 8reps Eight times repeat 1 of EPYC1 in Y2H binding domain vector pGBKT7_synthEPYC1 2α- Four times repeat 1 of EPYC1 with double alpha helix in Y2H helices 4reps binding domain vector pGBKT7_synthEPYC1 2α- Repeat 1 of EPYC1 with double α-helix in Y2H binding domain helices 1rep vector pGBKT7_synthEPYC1 Repeat 1 of EPYC1 with modified α-helix in Y2H binding domain modified α-helix 1rep vector pGBKT7_synthEPYC1 α-helix Repeat 1 of EPYC1 with α-helix knockout version 1 in Y2H knockout 1 binding domain vector pGBKT7_synthEPYC1 α-helix Repeat 1 of EPYC1 with α-helix knockout version 2 in Y2H knockout 2 binding domain vector pGBKT7_LSU.sub.Cr C. reinhardtii Rubisco large subunit in Y2H binding domain vector pGBKT7_S1.sub.Cr C. reinhardtii SSU RbcS1 in Y2H binding domain vector pGADT7_EPYC1 Full-length EPYC1 in Y2H activation domain vector pGADT7_LCIB C. reinhardtii LCIB in Y2H activation domain vector pGADT7_LCIC C. reinhardtii LCIC in Y2H activation domain vector pGADT7_CAH3 C. reinhardtii CAH3 in Y2H activation domain vector pGADT7_CP12 A. thaliana CP12 in Y2H activation domain vector pGBKT7_1A.sub.AtMOD(α-helices) A. thaliana SSU RbcS1A with modified alpha-helices in Y2H binding domain vector pGBKT7_LSU.sub.At A. thaliana Rubisco large subunit in Y2H binding domain vector pGADT7_LSU.sub.At A. thaliana Rubisco large subunit in Y2H Activation domain vector
[0115] Protein extraction was carried out by re-suspending yeast cells to an OD.sub.600 of 1 from an overnight liquid culture in a lysis buffer (50 mM Tris HCl [pH 233 6], 4% [v/v] SDS, 8 M urea, 30% [v/v] glycerol, 0.1 M DTT, 0.005% [w/v] Bromophenol blue), incubating 65° C. for 30 min, and loading directly onto a 10% (w/v) Bis-Tris protein gel (Expedeon). In the immunoblot shown in
Liquid Chromatography-Mass Spectrometry (LC-MS)
[0116] Cell lysate was prepared from C. reinhardtii cells according to Mackinder et al. (Mackinder, et al., PNAS (2016) 113: 5958-5963). Following membrane solubilization with 2% (w/v) digitonin, the clarified lysate was applied to 150 μl Protein A Dynabeads that had been incubated with 20 μg anti-EPYC1 antibody. The Dynabead-cell lysate was incubated for 1.5 hours with rotation at 4° C. The beads were then washed four times with IP buffer (50 mM HEPES, 50 mM KOAc, 2 mM Mg(OAc).sub.2.4H.sub.2O, 1 mM CaCl.sub.2), 200 mM sorbitol, 1 mM NaF, 0.3 mM NA.sub.3VO.sub.4, Roche cOmplete EDTA-free protease inhibitor) containing 0.1% (w/v) digitonin. EPYC1 was eluted from the beads by incubating for 10 minutes in elution buffer (50 mM Tris-HCl, 0.2 M glycine [pH 2.6]), and the eluate was immediately neutralized with 1:10 (v/v) Tris-HCl (pH 8.5). A small amount of the eluate was run on an SDS-PAGE gel and stained with coomassie (
[0117] Intact protein LC-MS experiments were performed on a Synapt G2 Q-ToF instrument equipped with electrospray ionization (i.e., electrospray ionization mass spectrometry (ESI-MS); Waters Corp., Manchester, UK). LC separation was achieved using an Acquity UPLC equipped with a reverse phase C4 Aeris Widepore 50×2.1 mm HPLC column (Phenomenex, Calif., USA) and a gradient of 5-95% acetonitrile (0.1% formic acid) over 10 minutes was employed. Data analysis was performed using MassLynx v4.1 and deconvolution was performed using MaxEnt.
PCOILS Analysis of EPYC1
[0118] PCOILS is an online tool (https://toolkit.tuebingen.mpg.de/#/tools/pcoils) that predicts the probability (from 0-1) of the presence of coiled-coil domains in a submitted protein sequence. The direct output following submission is shown in
Results
[0119] EPYC1 Interacts with C. reinhardtii SSUs and Modified A. thaliana SSUs in Y2H Assays
[0120] The two α-helices of the C. reinhardtii SSU (
[0121] The Y2H assays further showed that EPYC1 did not interact with itself (
Higher Plant Rubisco SSUs can be Engineered for Increased Affinity to EPYC1
[0122] Next, key domains on the C. reinhardtii SSU required for interaction with EPYC1 were identified. To isolate the structural components of the SSU, a total of six different chimeric versions of 1A.sub.At bearing residues from S1.sub.Cr associated with the three distinct β-sheets (βA, βC and βD), the βA-βB loop, and the two α-helices (αA and αB) (Spreitzer, Arch. Biochem. Biophys, (2003) 414: 141-149) were generated (
[0123] When tested in Y2H assays, as before, EPYC1 did not interact with 1A.sub.At (
EPYC1 can be Engineered for Increased Interaction Strength with the Rubisco SSU
[0124] A variety of truncated EPYC1 variants were generated to characterize the key regions of EPYC1 required for interaction with the Rubisco SSU. Because EPYC1 is a modular protein consisting of four highly similar repeat sequences flanked by shorter terminal regions at the N- and C-terminus, truncations were made to eliminate each region sequentially from either the N- or the C-terminus direction (
[0125] It was hypothesized that the interaction between EPYC1 and the SSU could be mediated through the predicted conserved α-helix in each of the four repeats, which together would allow EPYC1 to bind at least four Rubisco complexes (Mackinder, et al., PNAS (2016) 113: 5958-5963; Freeman Rosenzweig, et al., Cell (2017) 171: 148-162). The relative contribution of each of the four domains was analyzed by eliminating the predicted α-helical structure through mutation of the residues “RQELESL” (SEQ ID NO: 119) in the first repeat and “KQELESL” (SEQ ID NO: 120) in the subsequent three repeats into seven alanines (
[0126] Analysis of EPYC1 with PCOILS suggested that the putative α-helices of EPYC1 might behave like coiled-coil domains, with the first repeat showing the highest predicted value (
[0127] Using the single copy variant (synthetic EPYC1 1 rep), modifications of the α-helix region based on predictions from the PCOILS tool (NWRQELESLRNGNGSS (SEQ ID NO: 121)) or a G-Q substitution near the α-helix (WRQELESLRNQ (SEQ ID NO: 122)) predicted an increased probability of coiled-coil behavior (
The N-Terminus of EPYC1 Contains a Cleavage Site
[0128] Removal of the N-terminus also increased the interaction strength, which was consistent with the predicted role of the N-terminus as a chloroplastic transit peptide that would be cleaved during import into the chloroplast (Mackinder, et al., PNAS (2016) 113: 5958-5963). Prediction tools ChloroP and PredAlgo suggested cleavage at residues 78 and 170, respectively (Emanuelsson, et al., Nat. Protoc. (2007) 2: 953-971). However, both predictions were unconvincing as they would result in cleavage within the repeat regions required for EPYC1 function. To identify the potential cleavage site, EPYC1 from C. reinhardtii was immunoprecipitated and analyzed using electrospray ionization mass spectrometry (ESI-MS). Intact protein ESI-MS analysis revealed several proteoforms of mature EPYC1 ranging from 29622-30621 Da (
Example 2: EPYC1 can be Targeted to Chloroplasts in Higher Plants and EPYC1 Interacts with Rubisco in Planta
[0129] The following example describes the engineering of an EPYC1 construct that was able to successfully target EPYC1 expression to higher plant chloroplasts (e.g., N. benthamiana and A. thaliana). When expressed in higher plant chloroplasts, EPYC1 was shown to interact with Rubisco in planta.
Materials and Methods
Plant Material and Growth Conditions
[0130] Arabidopsis (Arabidopsis thaliana, Col-0) seeds were sown on compost, stratified for 3 days at 4° C. and grown at 20° C., ambient CO.sub.2, 70% relative humidity and 150 μmol photons m.sup.−2s.sup.−1 in 12 hours (h) light, 12 h dark conditions. For comparisons of different genotypes, plants were grown from seeds of the same age and storage history, and harvested from plants grown in the same environmental conditions. N. benthamiana was grown at 20° C. with 150 μmol photons m.sup.−2s.sup.−1 in 12 h light, 12 h dark conditions.
Construct Design and Transformation
[0131] The coding sequence of EPYC1 was codon optimized for expression in higher plants using an online tool (www.idtdna.com/CodonOpt). All variants of EPYC1 were synthesized as Gblock fragments (IDT) and cloned directly into level 0 acceptor vectors pAGM1299 and pICH41264 of the Plant MoClo system (Engler, et al., ACS Synth. Bio. (2014) 3: 839-843) or pB7WG2,0 vectors containing C- or N-terminal YFP. Table 3 provides descriptions of the vectors that were used for plant transformation.
TABLE-US-00003 TABLE 3 Vectors used for plant transformation. Vector Description pICH47742_EPYC1::GFP Full-length codon-optimized EPYC1 with GFP in Golden Gate (GG) Level 1 expression vector pICH47742_1A.sub.AtTP::EPYC1::GFP Full-length codon-optimized EPYC1 with A. thaliana RbcS1A transit peptide and GFP in GG Level 1 expression vector pAGM8031_1A.sub.AtTP::EPYC1_pFast Full-length codon-optimized EPYC1 with A. thaliana RbcS1A transit peptide in GG Level M expression vector with pFast red selection marker pAGM8031_1A.sub.AtTP::EPYC1::GFP_pFast Full-length codon-optimized EPYC1 with A. thaliana RbcS1A transit peptide and GFP in GG Level M expression vector with pFast red selection marker pAGM8031_EPYC1::GFP_pFast Full-length codon-optimized EPYC1 with GFP in GG Level M expression vector with pFast red selection marker pB7_S2.sub.Cr::YFP.sup.N C. reinhardtii SSU RbcS2 fused to N terminus of YFP in pB7WG2,0 expression vector pB7_S2.sub.Cr::YFP.sup.C C. reinhardtii SSU RbcS2 fused to C terminus of YFP in pB7WG2,0 expression vector pB7_1A.sub.AtTP::EPYC1::YFP.sup.N EPYC1 fused to N terminus of YFP in pB7WG2,0 expression vector pB7_1A.sub.AtTP::EPYC1::YFP.sup.C EPYC1 fused to C terminus of YFP in pB7WG2,0 expression vector pB7_1A.sub.AtMOD::YFP.sup.N A. thaliana SSU RbcS1A with modified alpha-helices fused to N terminus of YFP in pB7WG2,0 expression vector pB7_1A.sub.AtMOD::YFP.sup.C A. thaliana SSU RbcS1A with modified alpha-helices fused to C terminus of YFP in pB7WG2,0 expression vector pB7_1A.sub.At::YFP.sup.N A. thaliana SSU RbcS1A fused to N terminus of YFP in pB7WG2,0 expression vector pB7_1A.sub.At::YFP.sup.C A. thaliana SSU RbcS1A fused to C terminus of YFP in pB7WG2,0 expression vector pICH47732_CP12.sub.At::YFP.sup.C A. thaliana CP12 fused to N terminus of YFP in Level 1 Golden Gate expression vector pICH47732_CP12.sub.At::YFP.sup.N A. thaliana CP12 fused to C terminus of YFP in Level 1 Golden Gate expression vector pB7_S2.sub.Cr::YFP.sup.N C. reinhardtii SSU RbcS2 fused to N terminus of YFP in pB7WG2,0 expression vector
[0132] To generate fusion proteins, gene expression constructs were assembled into binary level M acceptor vectors. Level M vectors were transformed into Agrobacterium tumefaciens (AGL1) for transient gene expression in N. benthamiana (Schöb, et al., Mol. and Gen. Genetics (1997) 256: 581-585) or stable insertion in A. thaliana plants by floral dipping (Clough and Bent, Plant J. (1998) 16: 735-743). Homozygous insertion lines were identified in the T3 generation using the pFAST-R selection cassette (Shimada, et al., Plant J. (2010) 61: 519-528).
DNA and Leaf Protein Analyses
[0133] PCR reactions were performed as in McCormick and Kruger (McCormick and Kruger, Plant J. (2015) 81: 570-683) using the gene-specific primers listed in Table 4.
TABLE-US-00004 TABLE 4 List of primers used for producing the vectors used for plant transformation. Primer name Primer sequence Vector LCI5 full 1F TACGGTCGAAGACGAAGGTATGGCTA pICH47742_EPYC1::GFP CGATCAGTTCTATG (SEQ ID NO: 125) pICH47742_1A.sub.AtTP::EPYC1::GFP LCI5 full 1R TACGGTCGAAGACGAGATGACTCTCTC pAGM8031_1A.sub.AtTP::EPYC1_pFast CAAGATCCTCT (SEQ ID NO: 126) pAGM8031_1A.sub.AtTP::EPYC1::GFP_pFast LCI5 full 2F ACGTACCGAAGACCACATCTACTGCTA pAGM8031_EPYC1::GFP_pFast CAGTTCAAGC (SEQ ID NO: 127) L0 CDS1 ACGTACCGAAGACCATGACCTAGCTGG LCI5+SP-1 R TGCTGGCG (SEQ ID NO: 128) L0 CDS1 ACGTACCGAAGACAGGTCATCCTCAGC LCI5+SP-2 F TAGTTGGAG (SEQ ID NO: 129) L0 CDS1 ACGTACCGAAGACAGAAGCTCAAAGG LCI5+SP-2 R CCCTTTCTCCA (SEQ ID NO: 130) L0 SP SP1A_F TGCACTCGAAGACAGAATGGCTTCCTC pICH47742_1A.sub.AtTP::EPYC1::GFP TATGCTC (SEQ ID NO: 131) pAGM8031_1A.sub.AtTP::EPYC1_pFast L0 SP SP1A_R TGCACTCGAAGACAGACCTTCGGAATC pAGM8031_1A.sub.AtTP::EPYC1::GFP_pFast GGTAAG (SEQ ID NO: 132) L0 CDS1 ACGTACCGAAGACAGAAGCTCAAAGG pAGM8031_1A.sub.AtTP::EPYC1_pFast LCI5+SP-2 R CCCTTTCTCCA (SEQ ID NO: 130) AT1G67090_TP CAACTTTGTACAAAAAAGCAGGCTCCG pB7_S2.sub.Cr::YFP.sup.C (+TOPO)_for AATTCGCCCTTATGGCTTCCTCTATG pB7_1A.sub.AtMOD::YFP.sup.C (SEQ ID NO: 133) pB7_1A.sub.At::YFP.sup.C pB7_S2.sub.Cr::YFP.sup.N pB7_1A.sub.AtMOD::YFP.sup.N pB7_1A.sub.At::YFP.sup.N pB7_1A.sub.AtTP::EPYC1::YFP.sup.N pB7_1A.sub.AtTP::EPYC1::YFP.sup.C RbcS1A(+YFPc155) AGCGTAATCTGGAACATCGTATGGGTA pB7_1A.sub.At::YFP.sup.C rev CATACCGGTGAAGCTTGGTGGCTTG pB7_1A.sub.AtMOD::YFP.sup.C (SEQ ID NO: 134) RbcS1A(+YFPn173) ATCCTCCTCAGAAATCAACTTTTGCTC pB7_1A.sub.At::YFP.sup.N rev CATACCGGTGAAGCTTGGTGGCTTG pB7_1A.sub.AtMOD::YFP.sup.N (SEQ ID NO: 135) RbcS1(+YFPc155) AGCGTAATCTGGAACATCGTATGGGTA pB7_S2.sub.Cr::YFP.sup.C rev CATAACACTACGTTTGTTGGCTGG (SEQ ID NO: 136) RbcS1(+YFPn173) GATCCTCCTCAGAAATCAACTTTTGCT pB7_S2.sub.Cr::YFP.sup.N rev CCATAACACTACGTTTGTTGGCTGG (SEQ ID NO: 137) LCI5(+YFPc155) AGCGTAATCTGGAACATCGTATGGGTA pB7_1A.sub.AtTP::EPYC1::YFP.sup.C rev CATAAGGCCCTTTCTCCAGTCTG (SEQ ID NO: 138) LCI5(+YFPn173) AAGATCCTCCTCAGAAATCAACTTTTG pB7_1A.sub.AtTP::EPYC1::YFP.sup.N rev CTCCATAAGGCCCTTTCTCCAGTCTG (SEQ ID NO: 139)
[0134] Soluble protein was extracted from frozen leaf material of 21-d-old plants (sixth and seventh leaf) in 5× Bolt LDS sample buffer (ThermoFisher Scientific) with 200 mM DTT at 70° C. for 15 min. Extracts were centrifuged and the supernatants subjected to SDS-PAGE on a 4-12% (w/v) polyacrylamide gel and transferred to a nitrocellulose membrane. Membranes were probed with rabbit serum raised against wheat Rubisco at 1:10,000 dilution (Howe, et al., PNAS (1982) 79: 6903-6907) or against EPYC1 at 1:2,000 dilution (Mackinder, et al., PNAS (2016) 113: 5958-5963), followed by HRP-linked goat anti-rabbit IgG (Abcam) at 1:10,000 dilution, and visualized using Pierce ECL Western Blotting Substrate (Life Technologies).
Growth Analysis and Photosynthetic Measurements
[0135] A. thaliana plant lines expressing EPYC1 fused with the 1A.sub.AtTP (1A.sub.At-TP::EPYC1) in either WT, S2.sub.Cr or the 1A.sub.AtMOD background were tested. Three independently transformed T3 lines (Line 1, Line 2, and Line 3) per background (WT, S2.sub.Cr or the 1A.sub.AtMOD) were measured, and compared to their corresponding segregant lines (Line 1 Seg, Line 2 Seg, and Line 3 Seg) lacking EPYC1.
[0136] For growth analysis, plants were harvested at 31 days and the fresh (FW) and dry weights (DW) were measured. The values in
[0137] For photosynthetic measurements, the same plants used in growth analysis were measured on day 31 (before harvest). Means±SE of measurements made on a single leaf from each of 12 plants are shown in Table 5, below. Maximum quantum yield of photosystem II (PSII) (dark-adapted leaf fluorescence; F.sub.v/F.sub.m) was measured using a Hansatech Handy PEA continuous excitation chlorophyll fluorimeter (Hansatech Instruments Ltd.) (Maxwell and Johnson, J. of Exp. Bot. (2000) 51: 659-668).
Co-Immunoprecipitation and Immunoblotting
[0138] Rosettes of 35-d-old A. thaliana plants expressing EPYC1 in a complemented Rubisco mutant background (S2.sub.Cr, 1A.sub.AtMOD or 1A.sub.At) were snap frozen and ground in liquid N.sub.2. An equal volume of IP extraction buffer (100 mM HEPES [pH 7.5], 150 mM NaCl, 4 mM EDTA, 5 mM DTT, 0.4 mM PMSF, 10% [v/v] glycerol, 0.1% [v/v] Triton-X-100 and one Roche cOmplete EDTA-free protease inhibitor tablet per 10 ml) was added, samples were rotated at 4° C. for 15 min, centrifuged at 4° C. and filtered through two layers of Miracloth (Merck). Each extract (2 ml) was pre-cleared by incubating with 50 μl Protein A Dynabeads (ThermoFisher Scientific) pre-equilibrated in IP buffer for 1 hr at 4° C., before discarding the beads. Antibody-coated beads were generated by applying 3.5 μg anti-EPYC1 antibody to 50 μl Protein A Dynabeads, which were then rotated at 4° C. for 30 min. The antibody was crosslinked to the beads using Pierce BS3 cross-linking agent (Thermo Scientific). Each protein extract was incubated with the antibody-coated beads and rotated at 4° C. for 2 hrs. Unbound sample (flow-through) was discarded and the beads washed four times with washing buffer (20 mM Tris-HCl [pH 8], 150 147 mM NaCl, 0.1% [w/v] SDS, 1% [v/v] Triton-X-100, 2 mM EDTA). Immunocomplexes were eluted by adding 50 μl elution buffer (2× LDS sample buffer, 200 mM DTT) and heating for 15 min at 70° C., before discarding beads.
[0139] The eluted immunocomplexes were subjected to SDS-PAGE and immunoblotting. The 1A.sub.At-TP::EPYC1 antibody serum targets the C-terminus of EPYC1 (Emanuelsson, et al., Nat. Protoc. (2007) 2: 953-971). For immunoblotting, two antibodies were used: anti-EPYC1 from Mackinder, et al., PNAS (2016) 171: 133-147, and anti-Rubisco (Rubisco antibody as used in Mackinder 2016 and first published in Howe, et al., PNAS (1982) 79: 6903-6907). In
Bimolecular Fluorescence Complementation Analysis (BiFC)
[0140] Bimolecular fluorescence complementation analysis (BiFC) was carried out to provide additional information about the EPYC1-Rubisco interaction in vivo. Three Rubisco SSUs (1A.sub.At, S2.sub.Cr and 1A.sub.AtMOD) and EPYC1, each fused at the C-terminus to either YFP.sup.N or YFP.sup.C were transiently co-expressed in N. benthamiana (Walter, et al., Plant J. (2004) 40: 428-438).
Confocal Laser Scanning Microscopy
[0141] Leaves were imaged with a Leica TCS SP2 laser scanning confocal microscope or a Leica TCS SP8 laser scanning confocal microscope as in Atkinson et al. (Atkinson, et al., Plant Biotech. J. (2016) 14: 1302-1315).
Results
EPYC1 can be Targeted to Higher Plant Chloroplasts
[0142] EPYC1 was codon-optimized for nuclear expression in higher plants (
EPYC1 Expression in Plant Chloroplasts does not Hinder Plant Growth or Photosynthetic Efficiency
[0143] Wild-type A. thaliana plants and two Rubisco small subunit (1a3b) mutant lines complemented with S2.sub.Cr or 1A.sub.AtMOD, previously made by Atkinson et al. (Atkinson, et al., New Phytol. (2017) 214: 655-667) (
[0144] Growth analyses showed a slightly reduced growth phenotype (i.e. area, FW and DW) for some plants expressing 1A.sub.At-TP::EPYC1 compared to their corresponding segregants, but the observed decrease was not consistently significant (
[0145] Table 5 shows the maximum quantum yield of PSII (Fv/Fm) measurements for EPYC1 expressing A. thaliana plants. For each of the three genetic backgrounds (WT, S2.sub.Cr, and 1A.sub.AtMOD), three independently transformed T3 lines (Line 1, Line 2, and Line 3) were measured, and compared to their corresponding segregants lacking EPYC1 (Line 1 Seg, Line 2 Seg, and Line 3 Seg). Regardless of genetic background, the addition of 1A.sub.At-TP::EPYC1 did not affect photosynthetic efficiency as measured by dark-adapted leaf fluorescence; Fv/Fm).
TABLE-US-00005 TABLE 5 Maximum quantum yield of PSII (Fv/Fm) measurements for 1A.sub.At-TP::EPYC1 expressing A. thaliana plants from three genetic backgrounds. Genetic background Line 1 Line 1 Seg Line 2 Line 2 Seg Line 3 Line 3 Seg WT 0.856 ± 0.856 ± 0.856 ± 0.856 ± 0.856 ± 0.856 ± 0.002 0.002 0.002 0.002 0.002 0.002 S2.sub.Cr 0.856 ± 0.856 ± 0.856 ± 0.856 ± 0.856 ± 0.856 ± 0.002 0.002 0.002 0.002 0.002 0.002 1A.sub.AtMOD 0.859 ± 0.859 ± 0.859 ± 0.859 ± 0.859 ± 0.859 ± 0.001 0.001 0.001 0.001 0.001 0.001
[0146] Immunoblots against 1A.sub.At-TP::EPYC1 in A. thaliana produced a dominant band of approximately 34 kDa (slightly smaller than the mature native C. reinhardtii isoform [35 kDa]) which suggested cleavage of both 1A.sub.At-TP and a portion of the N-terminal region of EPYC1 (the antibody serum targeted the C-terminus of EPYC1) (Emanuelsson, et al., Nat. Protoc. (2007) 2:953-971) (
[0147] The above results showed that constitutive expression of EPYC1 in the chloroplast did not impact plant growth under the conditions tested. Further, the constitutive expression of EPYC1 in the chloroplast did not impact plant photosynthetic efficiency, as measured by Fv/Fm.
EPYC1 Interacts with Rubisco in Higher Plants
[0148] Having shown that specific SSUs can interact with EPYC1 in a yeast two-hybrid system, it was next investigated whether the interactions with Rubisco would occur in planta. Multiple A. thaliana plant lines were evaluated, specifically two complemented 1a3b mutant lines and one wild-type line expressing EPYC1 (S2.sub.Cr_EPYC1_1, 1A.sub.AtMOD_EPYC1_1 and EPYC1_1, respectively). EPYC1 was immunoprecipitated from each of these lines using anti-EPYC1 antibody attached to Protein A coated beads, and the elutes were analyzed by immunoblot using antibodies against EPYC1 or Rubisco (
[0149] Consistent with the immunoprecipitation results shown in
Example 3: EPYC1 can be Engineered to Exhibit Liquid-Like Aggregate in Heterologous Systems and Expression of TobiEPYC1 Constructs Results in Spherical Aggregates in Higher Plant Chloroplasts
[0150] The following example describes the detection of liquid-like aggregate of EPYC1, using an in vitro system. Further, the following example describes the detection of spherical aggregates of the TobiEPYC1::GFP construct in higher plant chloroplasts.
Materials and Methods
Protein Production, Droplet Sedimentation Assay and Microscopy
[0151] Rubisco was purified from 25- to 30-day-old A. thaliana rosettes (wild-type plants and S2.sub.Cr lines) using a combination of ammonium sulfate precipitation, ion-exchange chromatography, and gel filtration (Shivhare and Mueller-Cajar, Plant Phys. (2017) 1505-1516). The hybrid Rubisco complexes in S2.sub.Cr lines consisted of the A. thaliana LSU and a mixed population of A. thaliana SSUs and S2.sub.Cr (roughly 1:1) (Atkinson, et al., New Phytol. (2017) 214: 655-667). Rubisco was also purified from C. reinhardtii cells (CC-2677). EPYC1 and EPYC1::GFP were produced in E. coli and purified as described in Wunder et al. (Wunder, et al., Nature Commun. (2018) 9: 5076).
[0152] EPYC1-Rubisco droplets were reconstituted at room temperature in 10 μl reactions for 5 min in buffer A (20 mM Tris-HCl [pH 8.0], and 50 mM NaCl), and were separated at 4° C. from the bulk solution by centrifugation for 4 min at 21,100×g. Liquid-liquid phase separation with EPYC1 was tested using an in vitro assay developed by Wunder et al. (Wunder, et al., Nature Commun. (2018) 9: 5076). Pellet (droplet) and supernatant (bulk solution) fractions were subjected to SDS-PAGE and Coomassie staining.
[0153] For light and fluorescence microscopy, reaction solutions (5 μl) were imaged after 3-5 min with a Nikon Eclipse Ti Inverted Microscope using the settings for differential interference contrast and epifluorescence microscopy (using fluorescein isothiocyanate filter settings) with a ×100 oil-immersion objective focusing on the coverslip surface. The coverslips used were 22×22 mm (Superior Marienfeld, Germany) and fixed in one-well Chamlide CMS chamber for 22×22 coverslip (Live Cell Instrument, South Korea). ImageJ was used to pseudocolor all images.
Immunogold Labelling and Electron Microscopy
[0154] Leaf samples were taken from 21-d-old S2.sub.Cr and S2.sub.Cr EPYC1 plants and fixed with 4% (v/v) paraformaldehyde, 0.5% (v/v) glutaraldehyde and 0.05 M sodium cacodylate [pH 7.2]. Leaf strips (1 mm wide) were vacuum infiltrated with fixative three times for 15 min, then rotated overnight at 4° C. Samples were rinsed three times with PBS then dehydrated sequentially by vacuum infiltrating with 50%, 70%, 80% and 90% ethanol (v/v) for 1 hr each, then three times with 100% ethanol. Samples were infiltrated with increasing concentrations of LR White Resin (30%, 50%, 70% [w/v]) mixed with ethanol for 1 hr each, then 100% resin three times. The resin was polymerized in capsules at 50° C. overnight. Sections (1 μm thick) were cut on a Leica Ultracut ultramicrotome, stained with Toluidine Blue, and viewed in a light microscope to select suitable areas for investigation. Ultrathin sections (60 nm thick) were cut from selected areas and mounted onto plastic-coated copper grids. Grids were blocked with 1% (w/v) BSA in TBSTT (Tris-buffered saline with 0.05% [v/v] Triton X-100 and 0.05% [v/v] Tween 20), incubated overnight with anti-Rubisco antibody in TBSTT at 1:250 dilution, and washed twice each with TBSTT and water. Incubation with 15 nm gold particle-conjugated goat anti-rabbit secondary antibody (Abcam) in TBSTT was carried out for 1 hr at 1:200 dilution, before washing as before. Grids were stained in 2% (w/v) uranyl acetate then viewed in a JEOL JEM-1400 Plus TEM. Images were collected on a GATAN OneView camera.
TobiEPYC1 Construct Design and Plant Transformation and Aggregate Data
[0155] TobiEPYC1 gene expression cassettes are shown in
[0156] Binary plasmid constructs were assembled by Golden Gate MoClo system (Engler, et al., ACS Synth. Bio. (2014) 3: 839-843). The plasmids contained two TobiEPYC1 expression cassettes, as shown in
TABLE-US-00006 TABLE 6 TobiEPYC1 vectors used for plant transformation. Vector Description pAGM4723_TobiEPYC1 Full-length codon-optimized TobiEYPC1 in Golden Gate (GG) Level 2 expression vector pAGM4723_TobiEPYC1::GFP Full-length codon-optimized TobiEYPC1 and GFP in GG Level 2 expression vector pAGM4723_4_reps_TobiEPYC1 Full-length codon-optimized 4 reps TobiEYPC1 in Golden Gate (GG) Level 2 expression vector pAGM4723_4_reps_TobiEPYC1::GFP Full-length codon-optimized 4 reps TobiEYPC1 and GFP in GG Level 2 expression vector pAGM4723_8_reps_TobiEPYC1 Full-length codon-optimized 8 reps TobiEYPC1 in Golden Gate (GG) Level 2 expression vector pAGM4723_8_reps_TobiEPYC1::GFP Full-length codon-optimized 8 reps TobiEYPC1 and GFP in GG Level 2 expression vector
[0157] Transformation of the vectors into A. thaliana was done using the floral dipping method as described in Example 2. At least three separate plant lines were generated for each of the vectors in Table 6.
Detection of Aggregate in TobiEPYC1::GFP Plant Lines
[0158] Tissue from TobiEPYC1::GFP transgenic plant lines was imaged using confocal microscopy, as described in Example 2. Confocal images were from intact leaf tissue (
[0159] Aggregate characteristics were analyzed by fluorescence recovery after photobleaching (FRAP). FRAP was carried out using a Leica SP8 confocal microscope and a 63× water immersion objective, with a PMT detector. GFP fluorescence was imaged by excitation at 488 nm and emission between 504-532 nm. For the pre- and post-bleach images, laser power was set to 2%, whilst the bleach itself was carried out at 56% laser power. Pre-bleach images were captured at 189 ms intervals (6 in total), and post-bleach images were captured at 400 ms intervals (150 in total). Photo-bleaching was carried out on leaf samples by directing the laser to a small area of one of the TobiEPYC1::GFP aggregates within one chloroplast. Recovery time after photo-bleaching was calculated by comparing GFP expression in the bleached versus an un-bleached region.
[0160] The presence of EPYC1 and the C. reinhardtii Rubisco SSU was confirmed by immunoblot, as described in Example 2.
Results
[0161] Hybrid Rubisco Containing Higher Plant Large Subunits (LSUs) and Mixed Populations of Higher Plant and C. reinhardtii SSUs Phase Separates with EPYC1
[0162] Current models of pyrenoid formation are based on specific weak multivalent interactions that promote liquid-like phase separation (Hyman, et al., Annu. Rev. Cell Biol. (2014) 30: 39-58; Freeman Rosenzweig, et al., Cell (2017) 171: 148-162). To observe if such interactions could occur with hybrid plant-derived Rubisco, it was examined whether Rubisco from A. thaliana 1a3b mutants complemented with S2.sub.Cr was able to facilitate liquid-liquid phase separation with EPYC1 using an in vitro assay developed by Wunder et al. (Wunder, et al., Nature Commun. (2018) 9: 5076). Similarly to C. reinhardtii Rubisco, hybrid plant Rubisco (from the S2.sub.Cr lines) was able to demix with EPYC1 and formed liquid-like droplets of comparable size, albeit at slightly higher ratios of EPYC1: Rubisco (
EPYC1 can be Engineered to Form Aggregates in Higher Plant Chloroplasts
[0163] To investigate the effect of EPYC1 on Rubisco aggregate in planta, the localization of Rubisco in the chloroplast of S2.sub.Cr complemented A. thaliana 1a3b mutants expressing the highest levels of EPYC1 (S2.sub.Cr_EPYC1_1) was examined. Immunogold labelling of Rubisco revealed an even distribution of gold particles throughout the chloroplast when visualized by TEM, which was similar to the S2.sub.Cr control not expressing EPYC1 (
Spherical Aggregate is Observed in Higher Plant Chloroplasts of Plants Transformed with TobiEPYC1
[0164] Initially, two versions of EPYC1 were tested for expression in plants. The first of these was EPYC1 truncated by 78 residues at the N-terminus (the predicted chloroplast transit peptide based on the ChloroP online tool) and fused to a long version of the chloroplast signal peptide for A. thaliana Rubisco SSU 1A (80 residues, MASSMLSSATMVASPAQATMVAPFNGLKSSAAFPATRKANNDITSITSNGGRVNCMQV WPPIGKKKFETLSYLPDLTDSE (SEQ ID NO: 62)). The second of these was the full length EPYC1 (317 residues; SEQ ID NO: 34) fused to a long version of the chloroplast signal peptide for A. thaliana Rubisco SSU 1A (80 residues; SEQ ID NO: 62). Neither of these two versions produced evidence of aggregate in either wild-type plants or in the stable transgenic A. thaliana line expressing C. reinhardtii SSU.
[0165] Compared to these two previous versions, the TobiEPYC1 constructs were optimized in three ways (TobiEPYC1 gene expression cassettes are shown in
[0166] Third, two copies of the EPYC1 expression cassette were included on the binary plasmid with the aim to increase expression levels. Further, one copy had two terminators (see
[0167]
[0168]
Example 4: Increased Expression of a Truncated, Mature Form of EPYC1 Stably Aggregates Rubisco into Phase-Separated, Liquid-Like Condensate Structures in Higher Plant Chloroplasts
[0169] The following example describes molecular and cellular characterization of EPYC1-Rubisco chloroplastic condensates in Arabidopsis thaliana plant lines expressing high levels of a truncated, mature form of EPYC1 from a binary expression vector, alongside a plant-algal hybrid Rubisco. Further, it describes the impact of the condensates on plant metabolism, when plants are grown under different light levels.
[0170] This Example uses the same construct shown in
Materials and Methods
Plant Material and Growth Conditions
[0171] Arabidopsis (Arabidopsis thaliana, Col-0 background) seeds were sown on compost, stratified for 3 d at 4° C. and grown at 20° C., ambient CO.sub.2 and 70% relative humidity under either 200 or 900 μmol photons m.sup.−2 s.sup.−1 supplied by cool white LED lights (Percival SE-41AR3cLED, CLF PlantClimatics GmbH, Wertingen, Germany) in 12 h light, 12 h dark. For comparisons of different genotypes, plants were grown from seeds of the same age and storage history, harvested from plants grown in the same environmental conditions.
[0172] The S2.sub.Cr A. thaliana background line (1a3b Rubisco mutant complemented with an SSU from C. reinhardtii) is described in Atkinson et al. (New Phytol 214, 655-667, doi:10.1111/nph.14414 (2017)). The 1A.sub.AtMOD A. thaliana background line is described in Meyer et al. (PNAS, 109, 19474-19479, doi:10.1073/pnas.1210993109 (2012)) and Atkinson et al. (New Phytol 214, 655-667, doi:10.1111/nph.14414 (2017)).
Construct Design and Transformation
[0173] The coding sequence of EPYC1 was codon-optimized for expression in higher plants as in Atkinson et al. (J. Exp. Bot. 70, 5271-5285, doi:10.1093/jxb/erz275 (2019)). Truncated mature EPYC1 was cloned directly into the level 0 acceptor vector pAGM1299 of the Plant MoClo system (Engler, C. et al. A Golden Gate Modular Cloning Toolbox for Plants. Acs Synth Biol 3, 839-843, doi:10.1021/sb4001504 (2014)). To generate fusion proteins, gene expression constructs were assembled into binary level 2 acceptor vectors. Level 2 vectors were transformed into Agrobacterium tumefaciens (AGL1) for stable insertion in A. thaliana plants by floral dipping as described in Example 2. Homozygous transgenic and azygous lines were identified in the T2 generation using the pFAST-R selection cassette (Shimada, et al., Plant J. (2010) 61: 519-528).
[0174] A schematic representation of the binary vector for dual GFP expression (EPYC1-dGFP) is shown in
Protein Analyses
[0175] Soluble protein was extracted from frozen leaf material of 21-d-old plants (sixth and seventh leaf) in protein extraction buffer (50 mM HEPES-KOH pH 7.5 with 17.4% glycerol, 2% Triton X-100 and cOmplete Mini EDTA-free Protease Inhibitor Cocktail (Roche, Basel, Switzerland). Samples were heated at 70° C. for 15 min with 1× Bolt LDS sample buffer (ThermoFisher Scientific, UK) and 200 mM DTT. Extracts were centrifuged and the supernatants subjected to SDS-PAGE on a 12% (w/v) polyacrylamide gel and transferred to a nitrocellulose membrane.
[0176] Membranes were probed with: rabbit serum raised against wheat Rubisco at 1:10,000 dilution (Howe, et al., PNAS (1982) 79: 6903-6907), rabbit serum raised against the SSU RbcS2 from C. reinhardtii (CrRbcS2) (raised to the C-terminal region of the SSU (KSARDWQPANKRSV (SEQ ID NO: 172)) by Eurogentec, 205 Southampton, UK) at 1:1,000 dilution, anti-Actin antibody (beta Actin Antibody 60008-1-Ig from Proteintech, UK) at 1:1000 dilution, and/or an anti-EPYC1 antibody at 1:2,000 dilution (Mackinder, et al., PNAS (2016) 113: 5958-5963 doi:10.1073/pnas.1522866113), followed by IRDye 800CW goat anti-rabbit IgG (LI-COR Biotechnology, Cambridge, UK) at 1:10,000 dilution, and visualized using the Odyssey CLx imaging system (LI-COR Biotechnology).
Condensate Extraction
[0177] Soluble protein was extracted as described above in the “Protein analyses” section, then filtered through Miracloth (Merck Millipore, Burlington, Mass., USA), and centrifuged at 500 g for 3 min at 4° C., as in Mackinder et al. (PNAS 113: 5958-5963 (2016)). The pellet was discarded, and the extract centrifuged again for 12 min. The resulting pellet was washed once in protein extraction buffer, then re-suspended in a small volume of buffer and centrifuged again for 5 min. Finally, the pellet was re-suspended in 25 μl of extraction buffer and used in confocal analysis or SDS-PAGE electrophoresis as described below.
Growth Analysis and Photosynthetic Measurements
[0178] Rosette growth rates were quantified using the imaging system described in Dobrescu et al. (Plant methods 13, 95 (2017)). Maximum quantum yield of photosystem II (PSII) (F.sub.v/F.sub.m) was measured on 32-day-old plants using a Hansatech Handy PEA continuous excitation chlorophyll fluorimeter (Hansatech Instruments Ltd, King's 222 Lynn, UK) (Maxwell and Johnson, J Exp Bot 412 51, 659-668 (2000)).Gas exchange and chlorophyll fluorescence were determined using a LI-COR LI-6400 (LI-COR, Lincoln, Nebr., USA) portable infra-red gas analyzer with a 6400-40 leaf chamber on either the sixth or seventh leaf of 35- to 45-day-old non-flowering rosettes grown in large pots under 200 μmol photons m.sup.−2 s.sup.−1 to generate leaf area sufficient for gas exchange measurements as in Flexas et al. (New Phytologist 175, 501-511, doi:10.1111/j.1469-8137.2007.02111.x (2007)). The response of net CO.sub.2 assimilation (A) to the intercellular CO.sub.2 concentration (C.sub.i) was measured at 50, 100, 150, 200, 250, 300, 350, 400, 600, 800, 1000, and 1200 μmol mol.sup.−1 CO.sub.2 under saturating light (1,500 μmol photons m.sup.−2 s.sup.−1). For all gas exchange experiments, the flow rate was kept at 200 μmol mol.sup.−1, leaf temperature was controlled at 25° C. and approximately 70% relative humidity was maintained inside the chamber. Measurements were performed after net assimilation and stomatal conductance had reached steady state. Gas exchange data were corrected for CO.sub.2 diffusion from the measuring chamber as in Bellasio et al (Plant Cell Environ 39, 1180-1197, doi:10.1111/pce.12560 (2015)). The means±standard error of the mean (SEM) shown in Table 7, below, are from measurements made on seven 35- to 45-day-old rosettes for gas exchange variables, or on twelve 32-day-old rosettes for F.sub.v/F.sub.m. The F.sub.v/F.sub.m values shown in Table 7, below, are for attached leaves that had been dark-adapted for 45 minutes prior to fluorescence measurements.
[0179] To estimate the maximum rate of Rubisco carboxylation (V.sub.max), the maximum electron transport rate (J.sub.max), the net CO.sub.2 assimilation rate at ambient concentrations of CO.sub.2 normalized to Rubisco (A.sub.Rubisco), the CO.sub.2 compensation point (F), and the mesophyll conductance to CO.sub.2 (conductance of CO.sub.2 across the pathway from intercellular airspace to chloroplast stroma; g.sub.m), the A/C.sub.i data were fitted to the C.sub.3 photosynthesis model as in Ethier and Livingston (Plant Cell Environ 27, 137-153, doi:10.1111/j.1365-3040.2004.01140.x (2004)) using the catalytic parameters K.sub.c.sup.air and affinity for O.sub.2 (KO values for wild-type A. thaliana Rubisco at 25° C. and the Rubisco content of WT and S2.sub.Cr lines (Atkinson, N. et al. New Phytol 214, 655-667, doi:10.1111/nph.14414 (2017)). g.sub.m was measured as in Ethier and Livingston (Plant Cell Environ 27, 137-153, doi:10.1111/j.1365-3040.2004.01140.x (2004)) and Diamos, et al. (Plant Biotech J 16, 1971-1982, doi:10.1111/pbi.12931 (2018)).
Confocal Laser Scanning and Super-Resolution Image Microscopy
[0180] Leaves were imaged with a Leica TCS SP8 laser scanning confocal microscope (Leica Microsystems, Milton Keynes, UK) as in Atkinson et al. (Plant Biotech J 14, 1302-1315, doi:10.1111/pbi.12497 (2016)). Image processing was done with Leica LAS AF Lite software. Condensate and chloroplast dimensions were measured from confocal images using Fiji (ImageJ, v1.52n) (Schindelin et al., Nature Methods 9, 676-682, doi:10.1038/nmeth.2019 (2012)). Condensate volume was calculated as a sphere. Chloroplast volume was calculated as an ellipsoid in which depth was estimated as 25% of the measured width. Chloroplast volumes varied between 24-102 μm.sup.3, which was within the expected size range and distribution for A. thaliana chloroplasts (Crumpton-Taylor et al., Plant Phys 158, 905-916, doi:10.1104/pp. 111.186957 (2012)). Comparative pyrenoid area measurements were performed using Fiji on TEM cross-section images of WT C. reinhardtii cells (cMJ030) as described in Itakura et al. (PNAS 116, 18445-18454, doi:10.1073/pnas.1904587116 (2019)).
[0181] Super-resolution images were acquired using structured illumination microscopy. Samples were prepared on high precision cover-glass (Zeiss, Jena, Germany). 3D SIM images were acquired on an N-SIM (Nikon Instruments, UK) using a 100×1.49NA lens and refractive index matched immersion oil (Nikon Instruments). Samples were imaged using a Nikon Plan Apo TIRF objective (NA 1.49, oil immersion) and an Andor DU-897X-5254 camera using a 488 nm laser line. Z-step size for z stacks was set to 0.120 μm as required by manufacturer's software. For each focal plane, 15 images (5 phases, 3 angles) were captured with the NIS-Elements software. SIM image processing, reconstruction, and analyses were carried out using the N-SIM module of the NIS-Element Advanced Research software. Images were checked for artefacts using the SIMcheck software (http://www.micron.ox.ac.uk/software/SIMCheck.php). Images were reconstructed using NiS Elements software v4.6 (Nikon Instruments) from a z stack comprising of no less than 1 μm of optical sections. In all SIM image reconstructions, the Wiener and Apodization filter parameters were kept constant.
Immunogold Labelling and Electron Microscopy
[0182] Leaf samples were taken from 21-day-old S2.sub.Cr plants and S2.sub.Cr transgenic lines expressing EPYC1-dGFP, and fixed, prepared, and sectioned as described in Example 3 above. Blocked grids were incubated overnight with anti-Rubisco antibody in TBSTT at 1:250 dilution or anti-CrRbcS2 antibody at 1:50 dilution, and washed twice each with TBSTT and water. Incubation with 15 nm gold particle-conjugated goat anti-rabbit secondary antibody (Abcam, Cambridge, UK) in TBSTT was carried out for 1 hr at 1:200 dilution for Rubisco labelling or 1:10 for CrRbcS2 labelling, before washing as described above in Example 3. Staining, viewing, and image collection were performed as described above in Example 3.
Statistical Analyses
[0183] Results were subjected to analysis of variance (ANOVA) to determine the significance of the difference between sample groups. When ANOVA was performed, Tukey's honestly significant difference (HSD) post-hoc tests were conducted to determine the differences between the individual treatments (IBM SPSS Statistics Ver. 26.0, Chicago, Ill., USA).
Results
[0184] Dual-GFP-Tagged Truncated EPYC1 Expressed in S2.sub.Cr Transgenic A. thaliana Plants Underwent Less Proteolytic Degradation
[0185] EPYC1 was truncated according to the predicted transit peptide cleavage site between residues 26 (V) and 27 (A) (Atkinson et al., J Exp Bot 70, 5271-5285, doi:10.1093/jxb/erz275 (2019)). A dual GFP expression system (
[0186] The dual GFP construct (EPYC1-dGFP) was transformed into WT plants or into the A. thaliana 1a3b Rubisco mutant complemented with a Rubisco SSU from C. reinhardtii (S2.sub.Cr). The resulting transgenic plants (three lines, termed Ep1, Ep2, and Ep3, respectively) expressed both EPYC1::eGFP and EPYC1::tGFP, of which the latter was generally more highly expressed (
[0187] In Example 2 above and in Atkinson et al. (J Exp Bot 70, 5271-5285, doi:10.1093/jxb/erz275 (2019)), immunoblots against full length EPYC1 expressed using other constructs in S2.sub.Cr or WT plants showed additional lower molecular weight bands indicative of proteolytic degradation (
EPYC1-dGFP Expression in S2.sub.Cr and 1A.sub.At/MOD A. thaliana Backgrounds Caused Condensate Formation in the Chloroplast Stroma
[0188] The fluorescence signal for EPYC1-dGFP in WT plants was distributed evenly throughout the chloroplast (
[0189] The average diameter of the condensates was 1.6±0.1 μm (n=126; 42 each from three individual S2.sub.Cr transgenic lines) (
[0190] Condensates were also observed when EPYC1-dGFP was expressed in the A. thaliana 1a3b Rubisco mutant complemented with a native A. thaliana SSU modified to contain the two α-helices necessary for pyrenoid formation from the Rubisco small subunit from C. reinhardtii (1A.sub.AtMOD) (
[0191] Furthermore, visible condensates formed when either EPYC1::tGFP or EPYC1::eGFP expression cassettes were individually transformed into the S2n-A. thaliana background (
[0192] In Example 2 above, expression of a full length (i.e., non-truncated) variant of EPYC1-dGFP in A. thaliana chloroplasts did not result in phase separation (
The Condensates Exhibit Liquid-Like Characteristics
[0193] Fluorescence recovery after photobleaching (FRAP) assays were conducted on condensates in live S2.sub.Cr-A. thaliana leaf cells expressing EPYC1-dGFP to test for the presence of internal mixing characteristics consistent with the liquid-like behavior of pyrenoids. Condensates recovered full fluorescence 20-40 seconds after photobleaching (
[0194] Further, condensates that were extracted from S2.sub.Cr A. thaliana plants expressing EPYC1-dGFP and then resuspended in vitro coalesced into larger droplets (
Condensates in A. thaliana Chloroplasts Expressing EPYC1-dGFP are Enriched in EPYC1-dGFP and Rubisco
[0195] To test for the presence of Rubisco, condensates were extracted from A. thaliana leaf tissue by gentle centrifugation and examined by immunoblot. Isolated condensates (pellet fraction) from S2.sub.Cr A. thaliana plants expressing EPYC1-dGFP were shown to be enriched in EPYC1-dGFP and both the large and small subunits of Rubisco (
[0196] Regarding the Rubisco SSU, the Western shown in
[0197] Consistent with the Coomasie staining, immunogold analysis of TEM images of chloroplasts from S2.sub.Cr expressing EPYC1-dGFP (
EPYC1-dGFP Expression in A. thaliana does not Impair Growth
[0198] Growth comparisons were conducted on three separate T2 EPYC1-dGFP S2Cr transgenic lines (Ep1-3), which had been screened for the presence of condensates, and their respective T2 azygous segregant S2Cr lines (Az1-3). Growth was assessed after cultivation under two different light levels: those typical for A. thaliana growth (200 μmol photons m.sup.−2 s.sup.−1) (
[0199] Regardless of the growth conditions, rosette expansion rates or biomass accumulation were not distinguishable between S2.sub.Cr transformants and their segregant controls (
EPYC1-dGFP Expression in A. thaliana does not Impair Photosynthesis
[0200] Photosynthetic parameters derived from response curves of CO.sub.2 assimilation rate to the intercellular CO.sub.2 concentration under saturating light were similar between respective EPYC1-dGFP-expressing and azygous segregant lines (
[0201] Table 7 shows photosynthetic parameters derived from gas exchange and fluorescence measurements for S2.sub.Cr and WT transgenic lines of A. thaliana. The mean and standard error of the mean (SEM) are shown for seven 35- to 45-day-old rosettes for gas exchange variables, and for twelve 32-day-old rosettes for the maximum potential quantum efficiency of photosystem II (F.sub.v/F.sub.m). F.sub.v/F.sub.m is shown for attached leaves dark-adapted for 45 minutes prior to fluorescence measurements. Letters after the SEM indicate significant difference within the data in the same row (P<0.05) as determined by ANOVA followed by Tukey's HSD tests. Values followed by the same letter within a row are not statistically significantly different from each other. Terms are abbreviated as follows: V.sub.cmax is the maximum rate of Rubisco carboxylation, measured in μmol CO.sub.2 m.sup.−2s.sup.−1; J.sub.max is the maximum electron transport rate, measured in μmol e.sup.−m.sup.−2 s.sup.−1); F is the CO.sub.2 compensation point, measured in μmol CO.sub.2 m-2 s-1 and calculated as C.sub.i−A; g.sub.s is stomatal conductance to water vapor, measured in mol H.sub.2O m.sup.−2s.sup.−1; g.sub.m is mesophyll conductance to CO.sub.2 (i.e., the conductance of CO.sub.2 across the pathway from intercellular airspace to the chloroplast stroma), measured in mol CO.sub.2 m.sup.−2s.sup.−1; F.sub.v/F.sub.m is the maximum potential quantum efficiency of photosystem II; ML denotes measurements taken under medium light (200 μmol photons m.sup.−2s.sup.−1); HL denotes measurements taken under high light (900 μmol photons m.sup.−2s.sup.−1); Ep1, Ep2, and Ep3 are the same three T2 EPYC1-dGFP S2.sub.Cr transgenic lines shown in the other Figures in this Example; Az1, Az2, Az3 are the respective azygous segregants of Ep1-3; EpWT is an EPYC1-dGFP WT transformant; AzWT is an azygous segregant of EpWT.
TABLE-US-00007 TABLE 7 Photosynthetic parameters for S2.sub.Cr and WT A. thaliana lines expressing EPYC1-dGFP and azygous segregants thereof. Parameter Ep1 Az1 Ep2 Az2 Ep3 Az3 EpWt AzWt V.sub.cmax 35.6 ± 36.4 ± 32.2 ± 33.6 ± 33.1 ± 33.8 ± 44.9 ± 43.3 ± 1.5 2.0 1.9 1.6 1.9 2.2 1.6 1.7 a a a a a a b b J.sub.max 59.2 ± 61.9 ± 57.2 ± 56.1 ± 52.9 ± 58.6 ± 76.4 ± 74.9 ± 2.3 6.3 2.6 3.5 4.4 5.2 2.4 7.5 a a a a a a b b Γ 63 ± 53 ± 52 ± 54 ± 53 ± 56 ± 51 ± 64 ± 8 5 6 7 7 8 7 12 a a a a a a a a g.sub.s 0.249 ± 0.279 ± 0.233 ± 0.251 ± 0.233 ± 0.236 ± 0.287 ± 0.306 ± 0.031 0.051 0.017 0.015 0.021 0.016 0.018 0.011 a a a a a a a a g.sub.m 0.034 ± 0.035 ± 0.032 ± 0.033 ± 0.034 ± 0.032 ± 0.045 ± 0.046 ± 0.001 0.003 0.002 0.002 0.003 0.002 0.002 0.003 b b b b b b a a F.sub.v/F.sub.m 0.848 ± 0.849 ± 0.848 ± 0.847 ± 0.847 ± 0.845 ± 0.851 ± 0.850 ± (ML) 0.002 0.002 0.001 0.001 0.002 0.002 0.002 0.001 a a a a a a a a F.sub.v/F.sub.m 0.852 ± 0.845 ± 0.850 ± 0.855 ± 0.846 ± 0.849 ± 0.850 ± 0.852 ± (HL) 0.002 0.002 0.001 0.004 0.002 0.001 0.003 0.002 a a a a a a a a
[0202] Notably, the CO.sub.2 assimilation rates at ambient concentrations of CO.sub.2 for EPYC1-dGFP-expressing and azygous segregant lines were comparable to WT lines when normalized for Rubisco content (A.sub.Rubisco;
[0203] Mesophyll conductance (g.sub.m) levels were also reduced in all S2.sub.Cr lines compared to WT plants (Table 7), which was consistent with the impact of reduced Rubisco content on g.sub.m observed in transplastomic tobacco (Galmes et al., Photosynth Res 115, 153-166, doi:10.1007/s11120-013-9848-8 (2013)).
[0204] Measurements of the maximum electron transport rate (J.sub.max) and the maximum potential quantum efficiency of photosystem II (F.sub.v/F.sub.m) were also indistinguishable between transformant and segregant lines (Table 7). Thus, the apparent displacement of the thylakoid membrane matrix by the condensates (
[0205] The results described in this Example show that EPYC1 and specific residues on the SSU were sufficient to aggregate Rubisco into a single proto-pyrenoid condensate, and that this condensate had no apparent negative impact on plant growth. The overall photosynthetic performances of S2.sub.Cr transgenic lines appeared unaffected by the condensate, which suggested that conditions inside higher plant chloroplasts were highly compatible with the presence of pyrenoid-type bodies. This data provides a platform for adding additional components of the algal biophysical carbon concentrating mechanism (CCM) to higher plants in order to create a “fully assembled” biophysical CCM. The data presented here is arguably the key step for the assembly of a pyrenoid-based CCM into plants that could increase crop yield potentials by >60% (McGrath and Long, Plant Phys 164, 2247-2261, doi:10.1104/pp. 113.232611 (2014); Long et al. in Sustaining Global Food Security: The Nexus of Science and Policy. (ed R. S. Zeigler) Ch. 9, (CSIRO Publishing, 2019); Price et al., Plant Phys 155, 20-26, doi:10.1104/pp. 110.164681 (2011)). Previously described approaches for engineering the cyanobacterial carboxysome-based CCM required engineering of the chloroplast-encoded Rubisco large subunit, an approach that is not currently feasible in major grain crops such as wheat and rice (Long et al., Nat Commun 9, doi:Artn 3570 10.1038/S41467-018-06044-0 (2018)). The results of this Example demonstrated that condensation of Rubisco was achievable through modification of the nuclear-encoded SSU, which is significantly more amenable to genetic modification.
Example 5: TobiEPYC1 Will Stably Aggregate Rubisco into Pyrenoid-Like Structures in N. benthamiana Chloroplasts
[0206] The following example describes characterization of the molecular properties of the chloroplastic EPYC1 aggregates in TobiEPYC1 N. benthamiana lines. Further, it describes the impact of the EPYC1 aggregates on plant metabolism, when plants are grown under different light levels.
Materials and Methods
[0207] Materials and Methods for Characterizing TobiEPYC1 N. benthamiana Lines
[0208] The materials and methods described in Examples 2, 3, and 4 are used to characterize TobiEPYC1 N. benthamiana lines.
[0209] The EPYC1 aggregates in the TobiEPYC1 N. benthamiana lines are characterized. In particular, the type of Rubisco present in the aggregate (i.e., the ratio of C. reinhardtii SSUs to native SSUs) is characterized. Further, the liquid-liquid like behavior of the aggregate is characterized (e.g., using FRAP analysis). In addition, the physical properties of the aggregate (e.g., shape/architecture/density) are characterized (e.g., by TEM/CryoEM). Moreover, the aggregates are isolated, and in the isolated aggregates, EPYC1 is characterized for cleavage/degradation and Rubisco content and activity are measured. The BiFC experiments described in Example 2 are also used to characterize the TobiEPYC1 lines. Instead of the BiFC system used in Example 2, a more stringent system based on tri-partite GFP (Liu et al., 2018 Plant Journal) is used.
[0210] The impact of the EPYC1 aggregates is characterized in plants of the TobiEPYC1 N. benthamiana lines grown under medium light levels and high (i.e., Rubisco-limiting) light levels. In particular, the leaf area, fresh weight, and dry weight is measured. Further, chlorophyll content, protein content, and total Rubisco content are measured. In addition, photosynthetic parameters are measured using fluorescence (e.g., Fv/Fm) and gas exchange analyses (e.g., A:Ci curves). Gas exchange and fluorescence are done with a LICOR 6400.
Results
[0211] Immunogold and/or fluorescence co-localization data will show the presence of Rubisco in the EPYC1 chloroplast aggregates.
[0212] Immunogold and/or fluorescence co-localization data will estimating the relative distribution of Rubisco aggregates in chloroplasts vs. Rubisco aggregates throughout the stroma, and will show that there are more Rubisco aggregates in chloroplasts.
[0213] Fluorescence localization data will show that aggregates form when TobiEPYC1 is expressed in higher plants carrying different permutations of the Rubisco SSU (e.g., an A. thaliana SSU mutant background complemented with: the whole C. reinhardtii RbcS2; modified A. thaliana SSUs carrying the C. reinhardtii α-helices; modified A. thaliana SSUs carrying the C. reinhardtii α-helices and β-sheets; modified A. thaliana SSUs carrying the C. reinhardtii α-helices, β-sheets, and βA-βB loop; etc.).
[0214] Immunoblot data will show that TobiEPYC1 and TobiEPYC1::GFP are stable when expressed in higher plants.
[0215] Fluorescence recovery after photobleaching (FRAP) data will show that fluorescently-tagged EPYC1 and Rubisco exhibit liquid-like mixing in the aggregates in higher plant chloroplasts.
[0216] Plant growth data (e.g., fresh weight, dry weight, rosette area, etc.) will show that growth of plants with aggregated Rubisco will be comparable to untransformed plants. Chlorophyll content, protein content, and total Rubisco content will also be comparable to untransformed plants.
[0217] Photosynthetic measurements (e.g., F.sub.v/F.sub.m, A: Ci curves, etc.) will show that plants with aggregated Rubisco perform photosynthesis at similar efficiencies compared to untransformed plants.
[0218] Biochemical data (e.g., from isolated aggregates) will show that aggregated Rubisco is catalytically active. In addition, biochemical data will demonstrate that EPYC1 is present in the aggregate, and will characterize the EPYC1 in the aggregate for cleavage/degradation.
[0219] TEM/cryo-EM data will demonstrate the presence of the EPYC1 aggregate, and will characterize the physical properties of the EPYC1 aggregate.
Example 6: A Variety of Other Higher Plants Will be Engineered to Express Pyrenoid-Like EPYC1-Rubisco Aggregates in the Chloroplast Stroma
[0220] The following example describes characterization of the molecular properties of the chloroplastic EPYC1 aggregates in TobiEPYC1 cowpea, soybean, cassava, rice, wheat, and tobacco lines. In addition, the following example describes characterization of the molecular properties of the chloroplastic EPYC1 aggregates in TobiEPYC1 cowpea, soybean, cassava, rice, wheat, and tobacco lines.
Materials and Methods
[0221] Materials and Methods Relevant for Engineering Crop Plants with EPYC1-Rubisco Aggregates
[0222] The most promising constructs from Examples 3, 4, and 5 are used to design constructs for expression of EPYC1 in cowpea, soybean, cassava, rice, wheat, and tobacco (N. tabacum, Petite Havana). Species-specific optimization of the chloroplast signal peptide is done as needed. In addition, endogenous SSUs in cowpea, soybean, cassava, rice, wheat, and tobacco are reduced (e.g., using a CRISPR knockout approach). A C. reinhardtii SSU or a modified endogenous SSU having C. reinhardtii SSU motifs is introduced. Plants are transformed using nuclear transformation approaches.
[0223] The transformed plant lines are characterized as described in Examples 3-4.
Results
[0224] Transformation of TobiEPYC1 into cowpea, soybean, cassava, rice, wheat, and tobacco and subsequent immunoblot data will show that the generated lines can stably express EPYC1.
[0225] Immunogold microscopy/other aggregate detection method of the above lines will show that they form EPYC1 and Rubisco aggregates in the chloroplast stroma.
[0226] Plant growth data (e.g., fresh weight, dry weight, yield, etc.) will show that growth of plants with aggregated Rubisco will be comparable to untransformed plants. Chlorophyll content, protein content, and total Rubisco content will also be comparable to untransformed plants.
[0227] Photosynthetic measurements (e.g., F.sub.v/F.sub.m, A:Ci curves, etc.) will show that plants with aggregated Rubisco perform photosynthesis at similar efficiencies compared to untransformed plants.