Methods and compositions for processing biomass with elevated levels of starch
09598700 ยท 2017-03-21
Assignee
Inventors
- Philip A. Lessard (Framingham, MA)
- Michael Lanahan (Cary, NC)
- Vladimir Samoylov (Sudbury, MA)
- Oleg Bougri (Boise, ID)
- Jonas Emery (Eugene, OR)
- R. Michael Raab (Arlington, MA)
- Dongcheng Zhang (Newton, MA)
Cpc classification
Y02P20/52
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
C12N9/1294
CHEMISTRY; METALLURGY
C12N15/8245
CHEMISTRY; METALLURGY
C12N15/8218
CHEMISTRY; METALLURGY
C12N15/8246
CHEMISTRY; METALLURGY
Y02P60/87
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
A23K40/10
HUMAN NECESSITIES
A23K10/30
HUMAN NECESSITIES
C12P19/14
CHEMISTRY; METALLURGY
A23K10/12
HUMAN NECESSITIES
International classification
C12N15/82
CHEMISTRY; METALLURGY
C12P19/14
CHEMISTRY; METALLURGY
Abstract
Genetically engineered plants having altered levels of one or more starch regulation enzymes and a polysaccharide degrading enzyme are provided. Methods of genetically engineering plants to express products altering expression of one or more starch regulation enzymes and polysaccharide degrading enzymes, and genetic constructs are provided. Methods of agricultural processing and animal feed using the genetically engineered plants are described.
Claims
1. A genetically engineered plant comprising: a first isolated nucleic acid comprising a sequence of SEQ ID NO: 50 that encodes a ribonucleic acid product that inactivates or inhibits expression of at least one gene encoding a protein involved in mobilization of starch in a plant, and a second isolated nucleic acid that encodes an endo-1,4-beta-xylanase from Thermomyces lanuginosus, a cellobiohydrolase B and an endo-beta 1,4-endoglucanase from Nasutitermes takasagoensis; wherein the second isolated nucleic acid comprises a sequence of: SEQ ID NO: 86, SEQ ID NO: 87, and SEQ ID NO: 90.
2. The genetically engineered plant of claim 1 further comprising a promoter operably linked to at least one of the first isolated nucleic acid or the second isolated nucleic acid.
3. A genetic construct comprising: a first isolated nucleic acid comprising a sequence of SEQ ID NO: 50 that encodes a ribonucleic acid product that inactivates or inhibits expression of at least one gene encoding a protein involved in mobilization of starch in a plant, and a second isolated nucleic acid that encodes an endo-1,4-beta-xylanase from Thermomyces lanuginosus, a cellobiohydrolase B and an endo-beta 1,4-endoglucanase from Nasutitermes takasagoensis; wherein the second isolated nucleic acid comprises a sequence of: SEQ ID NO: 86, SEQ ID NO: 87, and SEQ ID NO: 90.
4. The genetic construct of claim 3 further comprising a promoter operably linked to at least one of the first isolated nucleic acid or the second isolated nucleic acid.
5. A method of agricultural processing or preparing animal feed comprising pretreating a genetically engineered plant with a chemical formulation to form a mixture, wherein the genetically engineered plant comprises: a first isolated nucleic acid comprising a sequence of SEQ ID NO: 50 that encodes a ribonucleic acid product that inactivates or inhibits expression of at least one gene encoding a protein involved in mobilization of starch in a plant, and a second isolated nucleic acid that encodes an endo-1,4-beta-xylanase from Thermomyces lanuginosus, a cellobiohydrolase B and an endo-beta 1,4-endoglucanase from Nasutitermes takasagoensis; wherein the second isolated nucleic acid comprises a sequence of: SEQ ID NO: 86, SEQ ID NO: 87, and SEQ ID NO: 90.
6. The method of claim 5, wherein the genetically engineered plant further comprises a promoter operably linked to at least the first isolated nucleic acid or the second isolated nucleic acid.
7. The method of claim 5, wherein the chemical formulation includes at least one moiety comprising an ion selected from the group consisting of: sulfite, bisulfite, sulfate, carbonate, hydroxide and oxide.
8. The method of claim 7, wherein the at least one moiety further includes a counter ion selected from the group consisting of: ammonium, sodium, magnesium and calcium.
9. The method of claim 5, wherein the chemical formulation includes a compound selected from the group consisting of: a sulfuric acid, a base, ammonium bisulfite and ammonium carbonate.
10. The method of claim 5, wherein the mixture has a liquid to solid ratio of 8 to 10.
11. The method of claim 5, wherein the pretreating includes incubating the mixture for a period from 1 hour to 16 hours.
12. The method of claim 5, wherein the pretreating includes a mixture temperature of 40 C. to 120 C.
13. The method of claim 5, wherein the pretreating includes a mixture pH ranging from 1.0 to 12.0.
14. The method of claim 13 further comprising hydrolyzing the mixture.
15. The method of claim 14, wherein the step of hydrolyzing includes incubating the mixture for a period from one hour to 144 hours.
16. The method of claim 14, wherein the step of hydrolyzing includes incubating the mixture at a temperature of 40 C. to 95 C.
17. The method of claim 16 further comprising hydrolyzing the genetically engineered plant with one or more exogenous enzymes.
18. The method of claim 17, further comprising exposing the genetically engineered plant to a fermenting organism under fermenting conditions to produce a chemical product.
19. The method of claim 17, wherein the one or more exogenous enzymes is selected from the group consisting of: a glycosidase, an endoglucanase, a cellobiohydrolase, a glycosidase, a -xylosidase, a cellulase, a xylanase, an amylase, an invertase, a protease, a phytase, a hydrolytic enzyme, a glucanase, a hemicellulase, an esterase, a laccase, a mannanase, and a peroxidase.
20. The method of claim 5, wherein the genetically engineered plant is made using Agrobacterium-mediated transformation prior to the step of pretreatment.
21. The method of claim 5, wherein the genetically engineered plant is selected from the group consisting of: corn, soybean, rice, sugar cane, sugar beet, sorghum, switchgrass, miscanthus, eucalyptus, willow and poplar.
22. A method of preparing animal feed comprising at least one procedure selected from the group consisting of: drying a genetically engineered plant, pelletizing a genetically engineered plant into feed pellets, ensiling a genetically engineered plant to make silage, and combining the genetically engineered plant with distillers grains or with a source of edible fiber, wherein the genetically engineered plant comprises a first isolated nucleic acid comprising a sequence of SEQ ID NO: 50 that encodes a ribonucleic acid product that inactivates or inhibits expression of at least one gene encoding a protein involved in mobilization of starch in a plant, and a second isolated nucleic acid that encodes an endo-1,4-beta-xylanase from Thermomyces lanuginosus, a cellobiohydrolase B and an endo-beta 1,4-endoglucanase from Nasutitermes takasagoensis; wherein the second isolated nucleic acid comprises a sequence of: SEQ ID NO: 86, SEQ ID NO: 87, and SEQ ID NO: 90.
23. The method of claim 22, wherein the genetically engineered plant further comprises a promoter operably linked to at least the first isolated nucleic acid or the second isolated nucleic acid.
24. The method of claim 22, wherein the source of edible fiber is selected from the group of plants consisting of: corn, sorghum, wheat, rye, soybeans, corn grain, sorghum grain, wheat grain, rye grain, soybean meal, corn meal, corn oil, and corn germ.
25. A method of agricultural processing comprising feeding an animal with an animal feed comprising a genetically engineered plant to promote animal growth, wherein prior to the step of feeding the animal feed is prepared by at least one procedure selected from the group consisting of: drying a genetically engineered plant, pelletizing a genetically engineered plant into feed pellets, ensiling a genetically engineered plant to make silage, and combining the genetically engineered plant with distillers grains or with a source of edible fiber, and the genetically engineered plant comprises a first isolated nucleic acid comprising a sequence of SEQ ID NO: 50 that encodes a ribonucleic acid product that inactivates or inhibits expression of at least one gene encoding a protein involved in mobilization of starch in a plant, and a second isolated nucleic acid that encodes an endo-1,4-beta-xylanase from Thermomyces lanuginosus, a cellobiohydrolase B and an endo-beta 1,4-endoglucanase from Nasutitermes takasagoensis; wherein the second isolated nucleic acid comprises a sequence of: SEQ ID NO: 86, SEQ ID NO: 87, and SEQ ID NO: 90.
26. The method of claim 25, wherein the animal is selected from the group consisting of: chicken, swine, and cattle.
27. The genetically engineered plant of claim 1 comprising the nucleic acid sequence of SEQ ID NO: 139.
28. The genetic construct of claim 3 comprising the nucleic acid sequence of SEQ ID NO: 139.
29. The method of any one of claim 5, 22 or 25, wherein the genetically engineered plant comprises the isolated nucleic acid sequence of SEQ ID NO: 139.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The following detailed description of the preferred embodiments of the present invention will be better understood when read in conjunction with the appended drawings. For the purpose of illustrating the invention, there are shown in the drawings particular embodiments. It is understood, however, that the invention is not limited to the precise arrangements and instrumentalities shown. In the drawings:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
(26)
(27)
(28)
(29)
(30)
(31)
(32)
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
(33) Certain terminology is used in the following description for convenience only and is not limiting.
(34) Isolated nucleic acid, isolated polynucleotide, isolated oligonucleotide, isolated DNA, or isolated RNA as used herein refers to a nucleic acid, polynucleotide, oligonucleotide, DNA, or RNA separated from the organism from which it originates or from the naturally occurring genome, location, or molecules with which it is normally associated, or is a nucleic acid that was made through a synthetic process.
(35) Isolated protein, isolated polypeptide, isolated oligopeptide, or isolated peptide as used herein refers to a protein, polypeptide, oligopeptide or peptide separated from the organism from which it originates or from the naturally occurring location, or molecules with which it is normally associated.
(36) As used herein, variant refers to a molecule that retains a biological activity that is the same or substantially similar to that of the original sequence. The variant may be from the same or different species or be a synthetic sequence based on a natural or prior molecule.
(37) Nucleic acids, nucleotide sequences, proteins or amino acid sequences referred to herein can be isolated, purified, synthesized chemically, or produced through recombinant DNA technology. All of these methods are well known in the art.
(38) As used herein, operably linked refers to the association of two or more biomolecules in a configuration relative to one another such that the normal function of the biomolecules can be performed. In relation to nucleotide sequences, operably linked refers to the association of two or more nucleic acid sequences in a configuration relative to one another such that the normal function of the sequences can be performed. For example, the nucleotide sequence encoding a presequence or secretory leader is operably linked to a nucleotide sequence for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the coding sequence; and a nucleic acid ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate binding of the ribosome to the nucleic acid.
(39) The words a and one, as used in the claims and in the corresponding portions of the specification, are defined as including one or more of the referenced item unless specifically stated otherwise. This terminology includes the words above specifically mentioned, derivatives thereof, and words of similar import. The phrase at least one followed by a list of two or more items, such as A, B, or C, means any individual one of A, B or C as well as any combination thereof.
(40) Increasing the starch content of biomass can increase the energy content (calories) in animal feed or improve glucose extraction from biomass for the production of ethanol or other biochemicals.
(41) A strategy for increasing glucose availability in plant derived biomass that is to be used as animal feed or chemical feedstock would be to cause the plants to accumulate additional starch. Such additional starch would both augment the total amount of glucose present in the biomass and make a greater portion of that glucose easily-extracted. To increase the amount of starch that accumulates in biomass, particularly in vegetative (non-seed) parts of the plant, the normal processes by which plants synthesize and turn over vegetative starch may be modulated.
(42) Plants typically synthesize starch in vegetative tissues during the daytime, while at night they degrade the starch to mobilize the resulting sugar in order to support the energy needs of the plant. Vegetative plant cells express a series of enzymes to initiate mobilization of transitory starch during the nighttime (Stitt & Samuel C Zeeman 2012). Among these are Glucan Water Dikinase (GWD), which phosphorylates the starch polymer, and Phosphoglucan Water Dikinase (PWD), which further phosphorylates the starch. These steps in starch turnover make the starch polymer accessible to subsequent starch degrading enzymes. A number of starch degrading enzymes can then bind to the phosphorylated polysaccharide, but depolymerization (hydrolysis) of the starch granule does not progress efficiently until the starch polymer is dephosphorylated through the action of enzymes such as the dual-specificity protein phosphatase (DSP) or other phosphoglucan phosphatases. Subsequently, enzymes such as beta amylases (BAM), isoamylases (such as ISA3), alpha amylases (AMY), debranching enzymes (DBE), disproportionating enzymes (DPE) and limit dextrinases, convert the starch polymer to linear glucans, short oligosaccharides such as maltose, and glucose. To develop biomass that has more utility and higher value as a feed product (e.g., when formulated as a feed ration or when used in silage) or as an industrial feedstock (e.g., to provide glucose for the fermentative production of ethanol or other biochemicals), an increased starch accumulation was sought in vegetative tissues by reducing the expression or activity of key enzymes involved in the turnover of transitory starch.
(43) An embodiment provides a method for alteration in the amount of starch that accumulates in vegetative tissues of plants by inhibiting the activity of enzymes that are normally responsible for mobilizing vegetative starch (hereinafter referred to as Green Starch or vegetative starch) during day/night cycles. Isolated nucleic acids are provided for alteration in the amount of starch that accumulates in vegetative tissues of plants by inhibiting the activity of enzymes that are normally responsible for mobilizing Green Starch. Genetically engineered plants are provided, which include nucleic acids for alteration in the amount of starch that accumulates in vegetative tissues of plants by inhibiting the activity of enzymes that are normally responsible for mobilizing Green Starch. Any plant can be provided as the genetically engineered plant.
(44) In an embodiment, plants may be also genetically engineered to express polysaccharide degrading enzymes. Genetically engineered plants having elevated levels of vegetative starch may also express one or more polysaccharide degrading enzymes. Polysaccharide degrading enzymes may be starch degrading enzymes. Starch degrading enzymes may be but are not limited to amylases (BAM), isoamylases (such as ISA3), alpha amylases (AMY), debranching enzymes (DBE), disproportionating enzymes (DPE), limit dextrinases, glucoamylases, glucotransferases, glucosidases or invertases. Polysaccharide degrading enzymes may be cell wall degrading enzymes (CWDEs) or modified CWDEs. The modified forms may be intein modified CWD proteins. Intein modified enzymes and conditions for inducing splicing of the inteins, which could be used as activation conditions, were described in U.S. application Ser. No. 10/886,393 filed Jul. 7, 2004 and PCT/US10/55746 filed Nov. 5, 2010, and PCT/US10/55669 filed Nov. 5, 2010 and PCT/US10/55751 filed Nov. 5, 2010, which are incorporated herein by reference as if fully set forth. One or more polysaccharide degrading enzyme may be but is not limited to an enzyme selected from XynA: Beta-1,4-xylanase 229B from Dictyoglomus thermophilum (Uniprot accession P77853); XynB: Endo-1,4-beta-xylanase from Thermomyces lanuginosus (Uniprot accession O43097); EGA: Endo-beta 1,4-endoglucanase from Nasutitermes takasagoensis (Uniprot accession O77044); EGB: Endo-beta 1,4-endoglucanase from Acidothermus cellulolyticus (Uniprot accession P54583); AccA: Feruloyl esterase A from Apergillus niger (Uniprot accession O42807); AccB: Feruloyl esterase B from Aspergillus niger (Uniprot accession number Q8WZI8); AccA/B: Feruloyl esterase A and Feruloyl esterase B from Aspergillus niger; EGC: Endo-beta 1,4-endoglucanase from Rhodothermus marinus (Uniprot accession O33897); P40942: Beta-1,4-xylanase from Clostridium stercorarium F9 (Uniprot accession number P40942); P40943: Beta-1,4-xylanase from Geobacillus stearothermophilus T-6 (Bacillus stearothermophilus; Uniprot accession number P40943); O30700: Beta-1,4-xylanase from Bacillus sp. NG-27 (Uniprot accession number O30700); CBHA: cellobiohydrolase A from Clostridium thermocellum (Uniprot accession number O68438); CBHB: cellobiohydrolase B (SYT BD22308); or XynE: xylanase (EU591743).
(45) Embodiments herein provide for harvesting plants having elevated levels of starch and/or in planta polysaccharide degrading enzymes for use as a feedstock in agricultural processing. Genetically engineered plant biomass expressing polysaccharide degrading enzymes may not require harsh pretreatments to improve cellulose cell wall accessibility to exogenous enzymes. Methods and compositions for consolidated pretreatment and hydrolysis of plant biomass expressing cell wall degrading enzymes were described in U.S. patent application Ser. No. 13/414,627, filed Mar. 7, 2012; and International Patent Application No. PCT/US2012/028132, filed Mar. 7, 2012, which are incorporated herein by reference as if fully set forth.
(46) In an embodiment, animal feed applications including increased levels of starch in vegetative tissues are provided. Easily-fermentable sugars available in a fermentation process may be provided by embodiments herein. Production of biofuels may be enhanced by providing easily-fermentable sugars. Methods of providing easily fermentable sugars and methods of enhancing production of biofuels are provided as embodiments herein.
(47) Crops with elevated levels of vegetative starch have a variety of uses and utilities. In an embodiment, biomass from plants that accumulate elevated levels of vegetative starch relative to wild type plants are provided. These plants may have added value as feedstocks for fermentation processes or animal feed applications. For example, in a typical cellulosic process, polysaccharides such as cellulose and hemicelluloses that are present in the biomass are hydrolyzed to simple sugars, which may then be fermented to ethanol, butanol, isobutanol, fatty acids, or other hydrocarbons by microorganisms. Because of the recalcitrance of the biomass, the release of the simple sugars from polymers such as cellulose and hemicelluloses often requires the use of harsh pretreatment conditions and hydrolysis with relatively expensive mixtures of enzymes. In contrast, any starch that is present in the biomass represents an additional source of simple sugars (namely, glucose), which can be released very easily and much less expensively with either dilute acid treatments or hydrolysis by amylases, which are currently available and much less expensive than the enzymes required for the digestion of cellulose and hemicelluloses. As a result, any increase in the amount of starch present in the biomass will simultaneously increase the amount of fermentable sugar that can be recovered (and therefore the amount of ethanol, butanol, etc. that can be made) with only a disproportionately small increase in process costs (i.e. addition of an inexpensive amylase or acid pretreatment). Similarly, biomass that contains elevated levels of starch may have greater value in forage applications, where the plant material is fed to livestock. Again, the excess starch present in this material is more easily digested by most animals than is the cellulosic material, providing more energy per unit biomass than biomass with ordinary levels of starch. Embodiments include utilizing a transgenic plant as set forth herein for any of these methods.
(48) Methods herein, including those in the previous paragraph, may include modifying plants to create genetically engineered plants, growing the genetically engineered plants, harvesting the plants and either processing them for animal feed applications as one would other forage crops, or dry them and treat them for use in fermentation processes similar to the manner of treatment that is used in cellulosic processes but with the addition of a treatment such as acid hydrolysis or amylase digestion to hydrolyze the starch to its component sugars. Any one step, set of steps, or all the steps set forth in this paragraph may be provided in a method herein.
(49) Genes to target for Green Starch alteration were identified. Any enzyme, protein or nucleic acid involved in starch metabolism may be targeted for alteration of Green Starch levels. In an embodiment, alteration is accomplished by suppression of gene expression of genes related to Green Starch. In an embodiment, alteration is an increase in the amount of Green Starch. Particular enzymes that may be targets include but are not limited to Glucan Water Dikinase (also known as GWD, R1, sex1); Phosphoglucan Water Dikinase (also known as PWD); Dual Specificity Protein Phosphatase (also known as DSP, sex4); -amylase (BAM), isoamylase (also known as ISA3), limit dextrinases (also known as LDA); disproportionating enzyme; and other debranching enzymes. GWD phosphorylates starch, which is then susceptible to starch degrading enzymes. PWD phosphorylates starch, and may be dependent upon prior action by GWD by episatsis. DSP is regulatory, and may activate starch degrading enzymes. DSP may also phosphorylate starch. Also, DSP is suspected of having endo-amylase activity, which may be synergistic with -amylase and isoamylase starch mobilization. BAM (but not -amylase) and ISA3 are involved in mobilizing vegetative starch. BAM activity depends on GWD, and ISA3 activity depends on BAM.
(50) A number of strategies are available for interfering with the expression or accumulation of enzyme activities in plants. Among these are antisense RNA, co-suppression, and RNA interference (Frizzi & Huang 2010 and Chi-Ham et al. 2010), mutagenesis and screening strategies such as TILLING (Sikora et al. 2011), T-DNA insertion and transposon-based mutagenesis (An et al. 2005), genome editing strategies involving nucleases such as zinc-finger nucleases (Wright et al. 2005), TAL effector nucleases (Christian et al. 2010), or intein-derived meganucleases (Wehrkamp-Richter et al. 2009). The references cited are incorporated herein by reference as if fully set forth.
(51) A number of genome editing strategies have been developed to introduce stable changes that either completely or partially (that is, silence or attenuate) block expression of genes encoding enzymes that are involved in mobilization of transitory starch. Among these are deleting critical regions in the promoters, coding sequences, or terminator sequences of the genes or changing or deleting key amino acid residues in enzyme catalytic domain to reduce the activity of the enzyme that is expressed. Changes can be introduced by introducing a transgene into transgenic plants that expresses a recombinant nuclease that is designed to cleave the target gene as described previously (Puchta et al. 1993; Wright et al. 2005 and Wehrkamp-Richter et al. 2009, which are incorporated herein by reference as if fully set forth). The target gene can itself be a separate transgene or an endogenous gene that is native to the plant. Once expressed, the nuclease will introduce double stranded DNA breaks in the target sequence, for example, deleting a short segment that then may be partially repaired by the cell's DNA repair mechanisms, leaving a lesion within the target gene. Once the target gene has been cleaved, the nuclease gene is no longer needed. If the nuclease gene was itself introduced as part of a separate stable transgene, then the nuclease transgene may have integrated into a different site within the genome of the host organism. In such cases, the nuclease gene can be eliminated from the plants via genetic segregation and selection. Alternatively, the nuclease gene can be introduced as a transient expression system, for example as part of the genome of a non-integrating virus as described by Vainstein et al. 2011, which is incorporated herein by reference as if fully set forth. In either case, if the alterations (deletions) to the target gene are carried by cells in the germline, progeny of the modified plants will carry the altered target genes, and they will not express fully functional enzyme from that target gene.
(52) In an embodiment, targets may be suppressed using RNAi suppression of gene expression. RNAi constructs are provided to suppress gene expression of target proteins. The target proteins may be enzymes. The target enzyme may be selected from an enzyme involved in Green Starch mobilization. RNAi constructs suppressing at least one of GWD, PWD, DSP, BAM, isoamylase, LDA, disproportionating enzyme and other debranching enzymes are provided.
(53) A number of strategies have been developed for expressing RNAi in transgenic plants. See, for example, Horiguchi G., RNA silencing in plants: a shortcut to functional analysis (2004) Differentiation 72(2-3): 65-73, which is incorporated by reference herein as if fully set forth. See also Smith N A, Singh S P, Wang B, Stoutjesdijk P A, Green A G, Waterhouse P M, Total silencing by intron-spliced hairpin RNAs (2000) Nature 407:319-20; Stoutjesdijk P A, Singh S P, Liu Q, Hurlstone C J, Waterhouse P A, Green A G hpRNA-mediated targeting of the Arabidopsis FAD2 gene gives highly efficient and stable silencing (2002) Plant Physiol. 129(4): 1723-31, which are incorporated by reference herein as if fully set forth. Referring to
(54) In an embodiment, isolated nucleic acids are provided having a sequence as set forth in any one of the nucleic acids listed herein or the complement thereof. In an embodiment, isolated nucleic acids having a sequence that hybridizes to a nucleic acid having the sequence of any nucleic acid listed herein or the complement thereof are provided. In an embodiment, the hybridization conditions are low stringency conditions. In an embodiment, the hybridization conditions are moderate stringency conditions. In an embodiment, the hybridization conditions are high stringency conditions. Examples of hybridization protocols and methods for optimization of hybridization protocols are described in the following books: Molecular Cloning, T. Maniatis, E. F. Fritsch, and J. Sambrook, Cold Spring Harbor Laboratory, 1982; and, Current Protocols in Molecular Biology, F. M. Ausubel, R. Brent, R. E. Kingston, D. D. Moore, J. G. Seidman, J. A. Smith, K. Struhl, Volume 1, John Wiley & Sons, 2000, which are incorporated by reference in their entirety as if fully set forth. Moderate conditions may be as follows: filters loaded with DNA samples are pretreated for 2-4 hours at 68 C. in a solution containing 6 citrate buffered saline (SSC; Amresco, Inc., Solon, Ohio), 0.5% sodium dodecyl sulfate (SDS; Amresco, Inc., Solon, Ohio), 5Denhardt's solution (Amresco, Inc., Solon, Ohio), and denatured salmon sperm (Invitrogen Life Technologies, Inc. Carlsbad, Calif.). Hybridization is carried in the same solution with the following modifications: 0.01 M EDTA (Amresco, Inc., Solon, Ohio), 100 g/ml salmon sperm DNA, and 52010.sup.6 cpm .sup.32P-labeled or fluorescently labeled probes. Filters are incubated in hybridization mixture for 16-20 hours and then washed for 15 minutes in a solution containing 2SSC and 0.1% SDS. The wash solution is replaced for a second wash with a solution containing 0.1SSC and 0.5% SDS and incubated an additional 2 hours at 20 C. to 29 C. below Tm (melting temperature in C.). Tm=81.5+16.61 Log.sub.10([Na+]/(1.0+0.7[Na+]))+0.41(%[G+C])(500/n)PF. [Na+]=Molar concentration of sodium ions. %[G+C]=percent of G+C bases in DNA sequence. N=length of DNA sequence in bases. P=a temperature correction for % mismatched base pairs (1 C. per 1% mismatch). F=correction for formamide concentration (=0.63 C. per 1% formamide). Filters are exposed for development in an imager or by autoradiography. Low stringency conditions refers to hybridization conditions at low temperatures, for example, between 37 C. and 60 C., and the second wash with higher [Na+] (up to 0.825M) and at a temperature 40 C. to 48 C. below Tm. High stringency refers to hybridization conditions at high temperatures, for example, over 68 C., and the second wash with [Na+]=0.0165 to 0.0330M at a temperature 5 C. to 10 C. below Tm.
(55) In an embodiment, isolated nucleic acids having a sequence that has at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% identity along its length to a contiguous portion of a nucleic acid having any one of the sequences set forth herein or the complements thereof are provided. The contiguous portion may be the entire length of a sequence set forth herein or the complement thereof.
(56) Determining percent identity of two amino acid sequences or two nucleic acid sequences may include aligning and comparing the amino acid residues or nucleotides at corresponding positions in the two sequences. If all positions in two sequences are occupied by identical amino acid residues or nucleotides then the sequences are said to be 100% identical. Percent identity may be measured by the Smith Waterman algorithm (Smith T F, Waterman M S 1981 Identification of Common Molecular Subsequences, J Mol Biol 147: 195-197, which is incorporated herein by reference as if fully set forth).
(57) In an embodiment, isolated nucleic acids, polynucleotides, or oligonucleotides are provided having a portion of the sequence as set forth in any one of the nucleic acids listed herein or the complement thereof. These isolated nucleic acids, polynucleotides, or oligonucleotides are not limited to but may have a length in the range from 10 to full length, 10 to 3000, 10 to 2900, 10 to 2800, 10 to 2700, 10 to 2600, to 2500, 10 to 2400, 10 to 2300, 10 to 2200, 10 to 2100, 10 to 2000, 10 to 1900, 10 to 1800, 10 to 1700, 10 to 1600, 10 to 1500, 10 to 1400, 10 to 1300, 10 to 1200, 10 to 1100, to 1000, 10 to 900, 10 to 800, 10 to 10 to 600, 10 to 500, 10 to 400, 10 to 300, 10 to 200, 10 to 100, 10 to 90, 10 to 80, 10 to 70, 10 to 60, 10 to 50, 10 to 40, 10 to 35, 10 to 30, 10 to 25, 10 to 20, 10 to 15, or 20 to 30 nucleotides or 10, 15, 20 or 25 nucleotides. An isolated nucleic acid, polynucleotide, or oligonucleotide having a length within one of the above ranges may have any specific length within the range recited, endpoints inclusive. The recited length of nucleotides may start at any single position within a reference sequence (i.e., any one of the nucleic acids herein) where enough nucleotides follow the single position to accommodate the recited length. In an embodiment, a hybridization probe or primer is 85 to 100%, 90 to 100%, 91 to 100%, 92 to 100%, 93 to 100%, 94 to 100%, 95 to 100%, 96 to 100%, 97 to 100%, 98 to 100%, 99 to 100%, or 100% complementary to a nucleic acid with the same length as the probe or primer and having a sequence chosen from a length of nucleotides corresponding to the probe or primer length within a portion of a sequence as set forth in any one of the nucleic acids listed herein. In an embodiment, a hybridization probe or primer hybridizes along its length to a corresponding length of a nucleic acid having the sequence as set forth in any one of the nucleic acids listed herein. In an embodiment, the hybridization conditions are low stringency. In an embodiment, the hybridization conditions are moderate stringency. In an embodiment, the hybridization conditions are high stringency.
(58) Any of the isolated nucleic acids herein may be provided in a kit. The kit may be used to make an RNAi construct, produce transgenic plants, test a plant for the presence of a gene of interest, test a plant for the presence of an RNAi construct as described herein, or any other method or purpose described herein. A kit may include one or more vector herein or one or more probe or primer herein.
(59) In an embodiment, a genetically engineered plant is provided. The genetically engineered plant may be derived from any plant. The genetically engineered plant may be derived from an energy crop plant, a forage crop plant or a food crop plant. The energy crop plant may be but is not limited to a corn plant, a switchgrass plant, a poplar plant or a miscanthus plant. The forage crop plant may be but is not limited to a sorghum plant. The food crop plant may be but is not limited to a corn plant or a tomato plant. The genetically engineered plant may be a transgenic plant. The transgenic plant may include an RNAi construct. The transgenic plant may include a genetic construct that inactivates or inhibits expression of at least one gene encoding a protein envolved in mobilization of starch in a plant. The transgenic plant may also include a nucleic acid encoding a polysaccharide degrading enzyme. The plant may be a rice plant, a switchgrass plant, a sorghum plant, a corn plant or a tomato plant.
(60) A genetically engineered plant refers to a transgenic plant or a mutant plant, progeny of a transgenic plant or a genetically engineered plant, a descendant of a transgenic plant or a genetically engineered plant, or a part of any of the foregoing. A transgenic plant may include a genetic construct. The genetic construct may include a first nucleic acid that encodes a product that may inactivate or inhibit the expression of at least one gene encoding a protein involved in mobilization of starch in a plant. The genetic construct may also include a second isolated nucleic acid that encodes at least one polysaccharide degrading enzyme, which does not occur naturally in the plant. Upon the expression of the first nucleic acid and the second nucleic acid, the genetically engineered plant may have an altered level of vegetative starch compared to the level of vegetative starch in a non-genetically engineered plant of the same genetic background as the genetically engineered plant but lacking the genetic construct. The genetically engineered plant may express at least one polysaccharide degrading enzyme. The genetically engineered plant may include more than one genetic construct. The genetically engineered plant may include a first construct comprising a first isolated nucleic acid that encodes a product that may inactivate or inhibit the expression of at least one gene encoding a protein involved in mobilization of starch in a plant. The genetically engineered plant may also include a second genetic construct comprising a second nucleic acid that encodes at least one polysaccharide degrading enzyme. The genetic construct(s) may be integrated into a genome of the plant. The genetic construct(s) may be transiently expressed in the plant. As used herein, genetic background is defined as a plurality of all genes in a plant. Thus, plants of the same species or variety may be referred to as plants having the same genes or the same genetic background. A genetically engineered plant may include the genetic construct or constructs described herein and, otherwise, may have the same genes as non-genetically engineered plant of the same genetic background.
(61) An embodiment includes a genetically engineered plant. The genetically engineered plant may be any one described herein. The genetically engineered plant may be a transgenic plant having an altered level of vegetative starch, or any part of the transgenic plant. The genetically engineered plant may be a transgenic plant expressing a polysaccharide degrading enzyme, or any part of the transgenic plant. The genetically engineered plant may be any transgenic plant having an altered level of vegetative starch and/or expressing a polysaccharide degrading enzyme, or any part of the transgenic plant. The genetically engineered plant may include a first isolated nucleic acid that encodes a product that inactivates or inhibits expression of at least one gene encoding a protein involved in mobilization of starch in a plant, and a second isolated nucleic acid that encodes at least one polysaccharide degrading enzyme. The first isolated nucleic acid may be as described below. The second isolated nucleic acid may be as described below. Upon the expression of the first nucleic acid, the genetically engineered plant may have an altered level of vegetative starch compared to the level of vegetative starch in a non-genetically engineered plant having the same genetic background as the genetically engineered plant but lacking the first isolated nucleic acid. The altered level may be an increased level.
(62) A genetically engineered plant may be a conventional mutant having one or more mutations in a nucleic acid sequence of a gene resulted in inhibiting expression of the gene encoding an enzyme involved in mobilization of starch in a plant. The mutations may be deletions, insertions or substitutions of nucleic acids in a sequence of target genes. The conventional mutant may have an altered level of vegetative starch compared to a non-mutant plant of the same genetic background. The conventional mutant may be further genetically engineered to include a nucleic acid encoding a polysaccharide degrading enzyme. The conventional mutant having an altered level of vegetative starch may be also a transgenic plant expressing a polysaccharide degrading enzyme.
(63) The genetically engineered plant may be of any type of plant. The genetically engineered plant may be but is not limited to corn, soybean, rice, sugar cane, sugar beet, sorghum, switchgrass, miscanthus, eucalyptus, willow, or poplar. A genetically engineered plant may be a whole transgenic plant or a mutant plant or parts of the plant. The parts may be but are not limited to leaves, stems, flowers, buds, petals, ovaries, fruits, or seeds. A genetically engineered plant may be callus from a transgenic plant or a mutant plant. A genetically engineered plant may be regenerated from parts of a transgenic plant or a mutant plant or plants. A genetically engineered plant may be a product of sexual crossing of a first transgenic plant and a second transgenic plant or a non-transgenic plant where the product plant retains a polynucleotide sequence introduced to the first transgenic plant. A genetically engineered plant may be a product of sexual crossing of a first mutant plant and a second non-mutant plant where the product plant retains a mutation introduced to the first mutant plant. The transgenic plant or the mutant plant may be any one of the transgenic plants or mutant plants provided herein.
(64) An embodiment provides a genetic construct designed to implement a strategy for modifying levels of vegetative starch in plants. The genetic construct may include a first isolated nucleic acid that encodes a product that inactivates or inhibits expression of one or more genes encoding a protein involved in mobilization of starch in a plant. The product may be but is not limited to an RNAi construct, an hpRNA, an miRNA, a restricting enzyme, or a meganuclease. The protein may be but is not limited to Glucan Water Dikinase (GWD), Phosphoglucan Water Dikinase (PWD), Dual Specificity Protein Phosphatase (DSP), limit dextrinase (LDA), a disproportionating enzyme, a debranching enzyme, -amylase (BAM) and isoamylase. The genetic construct may include a second isolated nucleic acid that encodes one or more polysaccharide degrading enzymes.
(65) The genetic construct may further include an inverted complement of the first isolated nucleic acid, and a spacer contiguous with and between the first isolated nucleic acid and the inverted complement. The term inverted complement refers to a sequence complimentary to another sequence on the same strand of a nucleic acid. For example, an inverted complement of the first isolated nucleic acid is on the same strand as the first isolated nucleic acid. As a further example, an inverted complement of a nucleic acid sequence of OsDSP1:
(66) TATGGTTGACAAGCTTGTGCAGTTTGCTAATCACAGCAGGAAGCCTCAATCGCAAATCAAAAGCATCAAAATCCCTAATTTCGGCACGAC AGTGCTCTATATCTTTACATTGTAGACAATATTCTTGAATGGCACAGATGTCAACTCCAAAATATTCAAGGTCTGGATCTTGCTGCAGGC AGAATACTGTTTTTACACCAATGTCCCTAAGTTTATCAACATCAAGTGGGCTCTGTAAGCAGGAGCCCACGATCAAGTCTGGGCGTATGA AATTGTAGTTCATTCCAAGCTCATGTCTATACGTCAACACTGCTCCCATAGCTTGCGTCATGTTGGTGCTGTACGTATCGGATTTCTCCG TGCCCGCCTCCACTGCGCCACTCTGGGCGCTAGAAGTAGACGCCCCGGATGCGGTTTTGACAGTGTTTGATCGGCGACTCCCGCCACGAA CCATCGTCAGATTGAGCGGCGAGGGCCGCCTCATGGACCTGGATCCCACGATTGGAGGCTCCTTGAGCAGGTTCTGGAGGCAGTTCAT
(SEQ ID NO: 40) is the following sequence:
(67) TABLE-US-00001 (SEQIDNO:172) ATGAACTGCCTCCAGAACCTGCTCAAGGAGCCTCCAATCGTGGGATCCAG GTCCATGAGGCGGCCCTCGCCGCTCAATCTGACGATGGTTCGTGGCGGGA GTCGCCGATCAAACACTGTCAAAACCGCATCCGGGGCGTCTACTTCTAGC GCCCAGAGTGGCGCAGTGGAGGCGGGCACGGAGAAATCCGATACGTACAG CACCAACATGACGCAAGCTATGGGAGCAGTGTTGACGTATAGACATGAGC TTGGAATGAACTACAATTTCATACGCCCAGACTTGATCGTGGGCTCCTGC TTACAGAGCCCACTTGATGTTGATAAACTTAGGGACATTGGTGTAAAAAC AGTATTCTGCCTGCAGCAAGATCCAGACCTTGAATATTTTGGAGTTGACA TCTGTGCCATTCAAGAATATTGTCTACAATGTAAAGATATAGAGCACTGT CGTGCCGAAATTAGGGATTTTGATGCTTTTGATTTGCGATTGAGGCTTCC TGCTGTGATTAGCAAACTGCACAAGCTTGTCAACCATA.
(68) A sequence of the inverted complement may be capable of hybridizing to a sequence of the first isolated nucleic acid. The sequence of the inverted complement may be capable of hybridizing to the sequence of the first isolated nucleic acid under in situ conditions in a genetically engineered plant. The sequence of the inverted complement may be capable of hybridizing to the sequence of the first nucleic acid sequence under conditions of one of low, moderate, or high stringency.
(69) The genetic construct may also include a spacer contiguous with and between the first isolated nucleic acid and the inverted complement. The spacer may be operably linked to the first isolated nucleic acid and the inverted complement and may provide a connection between the first isolated nucleic acid and the inverted complement such that the RNA sequences transcribed from the first isolated nucleic acid and the inverted complement can hybridize with one another. The first isolated nucleic acid may be upstream from and contiguous with the spacer, and the spacer may be upstream from and contiguous with the inverted complement. The inverted complement may be upstream from and contiguous with the spacer, and the spacer may be upstream from and contiguous with the first nucleic acid. An operably linked spacer may be an intron. The intron may splice the sequences of the first isolated nucleic and the inverted complement.
(70) The genetic construct may include a promoter operably linked to the first isolated nucleic acid, the inverted complement and the spacer. The operably linked promoter may allow transcription of the first isolated nucleic acid, the inverted complement and the spacer. Transcription of the first isolated nucleic acid, the inverted complement and the spacer may be referred to as expression of the first isolated nucleic acid, the inverted complement and the spacer. Upon expression of the first isolated nucleic acid, the inverted complement and the spacer, the RNA sequence transcribed from the first isolated nucleic acid and the RNA sequence transcribed from the inverted complement may be capable of hybridizing with each other. The hybridized RNA transcripts of the first isolated nucleic acid and the inverted complement may be capable of inhibiting expression of the gene. A transgenic plant may include more than one kind of RNAi construct. Each different kind of RNAi construct may be directed to inhibiting a different gene expressing a different target protein. The target gene may be selected from one or more gene encoding a protein involved in mobilization of starch in a plant. The protein may be but is not limited to Glucan Water Dikinase (GWD), Phosphoglucan Water Dikinase (PWD), Dual Specificity Protein Phosphatase (DSP), limit dextrinase (LDA), a disproportionating enzyme, a debranching enzyme, -amylase (BAM), or isoamylase.
(71) In an embodiment, a first isolated nucleic acid and an inverted repeat may encode a product that may be an hpRNA. The hpRNA may be homologous to a portion of a messenger RNA encoding a protein involved in mobilization of starch in a plant. The hpRNA may be homologous to a portion of a messenger RNA encoding the 5 UTR, or the 3 UTR sequences for a protein involved in mobilization of starch in a plant. The sequences that are complementary to a messenger RNA are called driver sequences.
(72) In an embodiment, a first isolated nucleic acid may encode a product that may be an RNAi construct. The RNAi construct may be designed to implement any RNAi strategy, including but not limited to those illustrated in
(73) The first isolated nucleic acid may include a portion having at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to a sequence of the target gene. Identity may be measured along the entire length of the sequence of the target gene. The target gene may be selected from the one or more gene encoding a protein involved in mobilization of vegetative starch in a plant. The target gene may be any gene involved in mobilization of vegetative starch in a plant. There may be more than one target gene. The length of the portion of the first isolated nucleic acid may be equal to the length of the target gene. The length of the portion may be but is not limited to being 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, 255, 260, 265, 270, 275, 280, 285, 290, 295, 300, 305, 310, 315, 320, 325, 330, 335, 340, 345, 350, 355, 360, 365, 370, 375, 380, 385, 390, 395, 400, 405, 410, 415, 420, 425, 430, 435, 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, 490, 495, 500, 505, 510, 515, 520, 525, 530, 535, 540, 545, 550, 555, 560, 565, 570, 575, 580, 585, 590, 595, 600, 605, 610, 615, 620, 625, 630, 635, 640, 645, 650, 655, 660, 665, 670, 675, 680, 685, 690, 695, or 700 nucleotides in length, or any length within a range between any two of the foregoing lengths.
(74) In an embodiment, a first isolated nucleic acid may encode the product that may be an miRNA capable of targeting a messenger RNA transcribed from the target gene. The first isolated nucleic acid may be 18 to 25 nucleotides in length. The first isolated nucleic acid may be 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, 255, 260, 265, 270, 275, 280, 285, 290, 295, or 300 nucleotides in length, or any length within a range between any two of the foregoing lengths.
(75) In an embodiment, a first isolated nucleic acid may encode a product that may be a restricting enzyme capable of cutting a sequence of the target gene. The restricting enzyme may be but is not limited to a meganuclease, a zinc-finger nuclease, or a TAL effector nuclease. The meganuaclease may be I-CreI, I-DmoI, I-SceI, E-Dmel or DmoCre. Other known meganucleases may be used.
(76) The genetic construct may include a first isolated nucleic acid encoding a product that has any suitable sequence to affect expression of a gene coding for a target protein. The sequence may be suitable to affect RNAi of a gene coding for a target protein. The first isolated nucleic acid may include a sequence with at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to a reference sequence selected from the group consisting of: SEQ ID NO: 38 [OsGWD amiRNA1wmd3], SEQ ID NO: 39 [OsGWD osa-MR809aM1 micro RNA], SEQ ID NO: 40 [OsDSP1], SEQ ID NO: 41 [OsDSP2], SEQ ID NO: 42 [OsGWD1], SEQ ID NO: 43 [OsGWD2], SEQ ID NO: 44[OsPWD1], SEQ ID NO: 45 [OsPWD2], SEQ ID NO: 46[SbGWD-SbGWDko2b-flanking seqs], SEQ ID NO: 47 [SbGWD1], SEQ ID NO: 48 [SbGWD2], SEQ ID NO: 49 [ZmGWD1], SEQ ID NO: 50 [ZmGWD2], SEQ ID NO: 179 [GWD1], SEQ ID NO: 180 [GWD2], SEQ ID NO: 183 [DSP1], SEQ ID NO: 184 [ISA3], SEQ ID NO: 209 [PvGWDko2], and SEQ ID NO: 216 [SbGWDko2a-flanking seqs]. Identity may be measured along the entire length of the reference sequence. The length of the first isolated nucleic acid may be equal to the length of the reference sequence.
(77) The RNAi construct may include a first isolated nucleic acid capable of hybridizing to a nucleic acid comprising, consisting essentially of or consisting of a reference sequence selected from the group consisting of: SEQ ID NO: 38 [OsGWD amiRNA1wmd3], SEQ ID NO: 39 [OsGWD osa-MR809aM1 micro RNA], SEQ ID NO: 40 [OsDSP1], SEQ ID NO: 41 [OsDSP2], SEQ ID NO: 42 [OsGWD1], SEQ ID NO: 43 [OsGWD2], SEQ ID NO: 44[OsPWD1], SEQ ID NO: 45 [OsPWD2], SEQ ID NO: 46[SbGWD-SbGWDko2b-flanking seqs], SEQ ID NO: 47 [SbGWD1], SEQ ID NO: 48 [SbGWD2], SEQ ID NO: 49 [ZmGWD1], SEQ ID NO: 50 [ZmGWD2], SEQ ID NO: 179 [GWD1], SEQ ID NO: 180 [GWD2], SEQ ID NO: 183 [DSP1], SEQ ID NO: 184 [ISA3], SEQ ID NO: 209 [PvGWDko2], and SEQ ID NO: 216 [SbGWDko2a-flanking seqs], or the complement thereof under conditions of one of low, moderate or high stringency.
(78) The RNAi construct may include the sequence of an inverted complement capable of hybridizing with the sequence of the first isolated nucleic acid under in situ conditions in the genetically engineered plant. The RNAi construct may include the sequence of the inverted complement capable of hybridizing with the sequence of the first isolated nucleic acid under conditions of one of low, moderate or high stringency.
(79) The RNAi construct may include an inverted complement having at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% identity to the inverted complement of a reference sequence selected from the group consisting of: SEQ ID NO: 40 [OsDSP1], SEQ ID NO: 41 [OsDSP2], SEQ ID NO: 42 [OsGWD1], SEQ ID NO: 43 [OsGWD2], SEQ ID NO: 44[OsPWD1], SEQ ID NO: 45 [OsPWD2], SEQ ID NO: 46[SbGWD-SbGWDko2b-flanking seqs], SEQ ID NO: 47 [SbGWD1], SEQ ID NO: 48 [SbGWD2], SEQ ID NO: 49 [ZmGWD1], SEQ ID NO: 50 [ZmGWD2], SEQ ID NO: 179 [GWD1], SEQ ID NO: 180 [GWD2], SEQ ID NO: 183 [DSP1], SEQ ID NO: 184 [ISA3], SEQ ID NO: 209 [PvGWDko2], and SEQ ID NO: 216 [SbGWDko2a-flanking seqs]. Identity may be measured along the length of the inverted complement of the reference sequence. The length of the inverted complement may be equal to the length of the inverted complement of the reference sequence.
(80) The spacer may be any sequence. The spacer may be an intron. The intron may be any intron. The intron may be the OsUbi intron. The sequence of the OsUbi intron may be found in the sequence of pAL409 having SEQ ID NO: 185 with reference to
(81) A protein involved in mobilization of starch in a plant may be a target protein. The target protein may be any protein involved with regulation of Green Starch. For example, the target protein may be one of Glucan Water Dikinase, Phosphoglucan Water Dikinase, Dual Specificity Protein Phosphatase, -amylase, isoamylase, limit dextrinase, disproportionating enzyme, or a debranching enzyme. The target gene may encode the target protein. The target gene may be selected from the one or more genes encoding a protein involved in mobilization of starch in a plant. The target gene encoding the target protein may have a sequence with at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to a reference sequence selected from the group consisting of: SEQ ID NO: 5 [SbGWD coding seq], SEQ ID NO: 6 [ZmGWD coding seq], SEQ ID NO: 7 [OsGWD coding sequence], SEQ ID NO: 8 [SbGWD gene], SEQ ID NO: 9 [ZMGWD gene], SEQ ID NO: 10 [SbGWD gene 5UTR and promoter], SEQ ID 11 [ZmGWD gene 5UTR and promoter], SEQ ID NO: 12 [SbGWD gene 3UTR], SEQ ID NO: 13 [ZmGWD gene 3UTR], SEQ ID NO: 17 [SbPWD coding seq], SEQ ID NO: 18 [ZmPWD coding seq], SEQ ID NO: 19 [OsPWD coding seq], SEQ ID NO: 20 [SbPWD gene], SEQ ID NO: 21 [ZmPWD gene]. SEQ ID NO: 22 [SbPWD gene 5UTR and promoter], SEQ ID NO: 23 [SbPWD gene 3UTR], SEQ ID NO: 24 [ZmPWD gene 3UTR], SEQ ID NO: 29 [ZmDSP coding sequence], SEQ ID NO: 30 [SbDSP coding sequence], SEQ ID NO: 31 [OsDSP coding sequence], SEQ ID NO: 32 [ZmDSP gene], SEQ ID NO: 33 [SbDSP gene], SEQ ID NO: 34 [ZmDSP gene 5UTR and promoter], SEQ ID NO: 35 [SbDSP gene 5UTR and promoter], SEQ ID NO: 36 [ZmDSP 3UTR], SEQ ID NO: 37 [SbDSP gene 3UTR], SEQ ID NO: 173 [OsGWD gene], SEQ ID NO: 174 [DSP gene], SEQ ID NO: 175 [ISA3 gene], SEQ ID NO: 176 [OsGWD coding sequence], SEQ ID NO: 177 [DSP coding sequence], SEQ ID NO: 178 [ISA3 coding sequence], SEQ ID NO: 182 [portion of SlGWD gene], SEQ ID NO: 191 [SbGWD gene-1], SEQ ID NO: 192 [SbGDW gene-2], SEQ ID NO: 206 [PvGWD-2], SEQ ID NO: 207 [PvGWD-5], SEQ ID NO: 208 [PvGWD-1], and SEQ ID NO: 215 [switchgrass amalgamated sequence]. Identity may be measured along the entire length of the reference sequence. The length of the sequence of the target gene may be equal to the length of the reference sequence.
(82) The target gene encoding the target protein may have a sequence that hybridizes to a reference sequence selected from the group consisting of: SEQ ID NO: 5 [SbGWD coding seq], SEQ ID NO: 6 [ZmGWD coding se], SEQ ID NO: 7 [OsGWD coding sequence], SEQ ID NO: 8 [SbGWD gene], SEQ ID NO: 9 [ZMGWD gene], SEQ ID NO: 10 [SbGWD gene 5UTR and promoter], SEQ ID 11 [ZmGWD gene 5UTR and promoter], SEQ ID NO: 12 [SbGWD gene 3UTR], SEQ ID NO: 13 [ZmGWD gene 3UTR], SEQ ID NO: 17 [SbPWD coding seq], SEQ ID NO: 18 [ZmPWD coding seq], SEQ ID NO: 19 [OsPWD coding seq], SEQ ID NO: 20 [SbPWD gene], SEQ ID NO: 21 [ZmPWD gene]. SEQ ID NO: 22 [SbPWD gene 5UTR and promoter], SEQ ID NO: 23 [SbPWD gene 3UTR], SEQ ID NO: 24 [ZmPWD gene 3UTR], SEQ ID NO: 29 [ZmDSP coding sequence], SEQ ID NO: 30 [SbDSP coding sequence], SEQ ID NO: 31 [OsDSP coding sequence], SEQ ID NO: 32 [ZmDSP gene], SEQ ID NO: 33 [SbDSP gene], SEQ ID NO: 34 [ZmDSP gene 5UTR and promoter], SEQ ID NO: 35 [SbDSP gene 5UTR and promoter], SEQ ID NO: 36 [ZmDSP 3UTR], SEQ ID NO: 37 [SbDSP gene 3UTR], SEQ ID NO: 173 [OsGWD gene], SEQ ID NO: 174 [DSP gene], SEQ ID NO: 175 [ISA3 gene], SEQ ID NO: 176 [OsGWD coding sequence], SEQ ID NO: 177 [DSP coding sequence], SEQ ID NO: 178 [ISA3 coding sequence], SEQ ID NO: 182 [portion of SlGWD gene], SEQ ID NO: 191 [SbGWD gene-1], SEQ ID NO: 192 [SbGDW gene-2], SEQ ID NO: 206 [PvGWD-2], SEQ ID NO: 207 [PvGWD-5], SEQ ID NO: 208 [PvGWD-1], and SEQ ID NO: 215 [switchgrass amalgamated sequence], or the complement thereof under conditions of one of low, moderate or high stringency.
(83) In an embodiment, the genetic construct may include the first isolated nucleic acid that encodes a product that inactivates or inhibits expression of one or more genes encoding a protein involved in mobilization of starch in a plant, and the second isolated nucleic acid that encodes one or more polysaccharide degrading enzymes. A transgenic plant may include more than one kind of genetic construct. Each different kind of genetic construct may be directed to inhibiting a different gene expressing a different target protein and expressing a different polysaccharide degrading enzyme. One genetic construct may include the first isolated nucleic acid. Another genetic construct may include the second isolated nucleic acid. The target gene may be selected from one or more gene encoding a protein involved in mobilization of starch in a plant. The at least one polysaccharide degrading enzyme may be but is not limited to a xylanase, an endoglucanase, an exoglucanase, an amylase, an intein-modified xylanase, an intein-modified endoglucanase, an intein-modified exoglucanase, and an intein-modified amylase.
(84) In an embodiment, the second nucleic acid may have at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to a reference sequence selected from the group consisting of: SEQ ID NO: 86 [O43097], SEQ ID NO: 87 [BD22308], SEQ ID NO: 88 [BD25243], SEQ ID NO: 89 [EU591743], SEQ ID NO: 90 [NtEGm], SEQ ID NO: 91 [P0C2S1], SEQ ID NO: 92 [P77853], SEQ ID NO: 93 [O68438], SEQ ID NO: 94 [O33897], SEQ ID NO:164 [amylase 19862], SEQ ID NO: 165 [glucoamylase 20082], SEQ ID NO: 166 [glucoamylase 20707], SEQ ID NO: 167 [amylase 21853], SEQ ID NO: 168 [AmyS], SEQ ID NO: 170 [GlaA], SEQ ID NO: 104 [EU591743:AS146-7], SEQ ID NO: 105 [P77853:S158-30-108-35], and SEQ ID NO: 106 [P77853:T134-101-100]. Identity may be measured along the entire length of the reference sequence. The length of the sequence of the second nucleic acid may be equal to the length of the reference sequence.
(85) In an embodiment, the second isolated nucleic acid may encode a variant of a polysaccharide degrading enzyme. The amino acid sequence of a variant of a polysaccharide degrading enzyme may differ by deletions, additions, substitutions of amino acid sequences, or other modifications of the polysaccharide degrading enzyme. A variant of a polysaccharide degrading enzyme may maintain the biological activity of the polysaccharide degrading enzyme. To maintain biological activity as used herein means that the variant has at least 60% of the activity of the polysaccharide degrading enzyme from which it is derived Activity of a xylanase and an a endoglucanase may be assessed in an assay using Xylazyme AX substrate and Cellazyme substrate, respectively, as described in U.S. application Ser. No. 10/886,393 filed Jul. 7, 2004 and PCT/US10/55746 filed Nov. 5, 2010, and PCT/US10/55669 filed Nov. 5, 2010 and PCT/US10/55751 filed Nov. 5, 2010, which are incorporated herein by reference as if fully set forth. Activity of a exoglucanase may be assessed by using fluorescent 4-methylumbelliferyl-b-D-lactopyranoside (4-MU) as described in Harrison M D et al. 2011 Accumulation of recombinant cellobiohydrolase and endoglucanase in the leaves of mature transgenic sugar cane, Plant Biotechnology Journal 9: 884-896 and incorporated here by reference as if fully set forth. Activity of a feruloyl esterase may be assessed using an assay using pNP labeled ferulate as a substrate (as described in Hegde S. et al. 2009 Single-step synthesis of 4-nitrophenyl ferulate for spectrophotometric assay of feruloyl esterases, Analytical Biochemistry 387(1): 128-129). The foregoing tests for activity of a xylanase, endoglucanase, exoglucanase, or feruloyl esterase may be utilized to determine whether a sequence with less than 100% identity to a polysaccharide degrading enzyme sequence herein is a variant of the polysaccharide degrading enzyme. Variants of a polysaccharide degrading enzyme herein may be modified in amino acid sequence versus the polysaccharide degrading enzyme based on similarity in hydrophobicity, hydrophilicity, solubility, polarity of amino acid residues. Variants of a polysaccharide degrading enzyme herein may differ following post-translational modifications. The differing post-translational modification may be but are not limited to glycosylations, acetylations, or phosphorylations. A variant may be developed by any means. A variant may be developed through site-directed mutagenesis or non-targeted mutagenesis. Error-prone PCR may be used to create mutants of a polysaccharide degrading enzyme herein, and any of the assays above may be used to assess whether the mutant is a variant.
(86) Embodiments include at least one of the polysaccharide degrading enzymes, or variants thereof, fused to variants of at least one of a targeting peptide, or a carboxy targeting peptide. Variants of a targeting peptide or a carboxy targeting peptide will target the protein it is fused with to the same location as the reference sequence for the targeting peptide or carboxy targeting peptide.
(87) Variants of an intein may be provided in a sequence of the polysaccharide degrading enzyme. An intein variant may splice from the protein in which it is fused.
(88) In an embodiment, a genetic construct may include a sequence with at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to a reference sequence selected from the group consisting of SEQ ID NO: 59 [OsUbi3P:OsGWD amiRNA1wmd3], SEQ ID NO: 60 [OsUbi3P: OsGWD osa-MIR809aM1], SEQ ID NO: 61 [OsUbi3P:OsDSP1 hpRNA], SEQ ID NO: 62 [OsUbi3P:OsDSP2 hpRNA], SEQ ID NO: 63 [OsUbi3P:OsGWD1 hpRNA], SEQ ID NO:64 [OsUbi3P:OsGWD2 hpRNA], SEQ ID NO: 65 [OsUbi3P: OsPWD1 hpRNA], SEQ ID NO: 66 [OsUbi3P:OsPWD2 hpRNA], SEQ ID NO: 67 [OsUbi3P:SbGWD RNAi], SEQ ID NO: 68 [ZmPepCP:SbGWD1 RNAi], SEQ ID NO: 69 [ZmPepCP:SbGWD2 RNAi], SEQ ID NO: 70 [ZmPepCP:ZmGWD1 RNAi], SEQ ID NO: 71 [ZmPepCP:ZmGWD2 RNAi], SEQ ID NO: 72 [OsDSP1 and OsGWD2], SEQ ID NO: 73 [OsPWD2 and OsGWD1], SEQ ID NO: 74 [OsDSP2 and OsPWD1], SEQ ID NO: 119 [ZmUbi1P:xGZein27ss:BD22308:xHvVSD], SEQ ID NO: 120 [ZmPepCP:xGZein27ss:BD25243:SEKDEL], SEQ ID NO: 121 [OsUbi3P:EU591743], SEQ ID NO: 122 [ZmUbi1P:EU591743: AS146-7:SEKDEL], SEQ ID NO: 123 [ZmUbilp:HvAle:NtEGm:SEKDEL], SEQ ID NO: 124 [ZmPepCP:HvAle:NtEGm:SEKDEL], SEQ ID NO: 125 [OsUbi3P:HvAle:NtEGm:SEKDEL], SEQ ID NO: 126 [OsUbi3P:BAASS: O33897], SEQ ID NO: 127 [ZmPepCP:BAASS:O43097:SEKDEL], SEQ ID NO: 128 [OsUbi3P:O68438], SEQ ID NO: 129 [OsUbi3P:P0C2S1], SEQ ID NO: 130 [ZmUbi1P:ZmUBQm:BAASS:P77853:S158-30-108-35], SEQ ID NO: 131 [ZmUbi1P:BAASS:P77853:T134-100-101:SEKDEL], SEQ ID NO: 132 [2379 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 133 [2380 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 134 [4106 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 135 [4107 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 136 [4108 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 137 [4109 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 138 [4110 cassette-3 CWDE and 1 hpRNA], SEQ ID NO:139 [4111 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 140 [4112 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 141 [4113 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 142 [4114 cassette 3 CWDE and 1 hpRNA], SEQ ID NO: 143 [4115 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 144 [4116 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 145 [4117 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 146 [4120 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 147 [4121 cassette-3 CWDE and 1 hpRNA], SEQ ID NO: 148 [4124 cassette-2 CWDE and 1 hpRNA], SEQ ID NO: 149 [4125 cassette-2 CWDE and 1 hpRNA], SEQ ID NO:150 [4514 cassette-3 CWDE and 1 hpRNA], and SEQ ID NO: 151 [4515 cassette-3 CWDE and 1 hpRNA]. Identity may be measured along the entire length of the reference sequence. The length of the sequence may be equal to the length of the reference sequence.
(89) The genetic construct may further include a regulatory sequence (also referred to as a regulatory element) operably connected to a first isolated nucleic acid or a second isolated nucleic acid. In this context, operably connected means that the regulatory sequence imparts it function to the nucleic acid or the polynucleotide sequence. In the case of a regulatory sequence that is a promoter, the promoter is capable of controlling expression of the nucleic acid or the polynucleotide sequence when they are operably connected. The promoter may be operably linked to the first isolated nucleic acid. The promoter may be operably linked to the second isolated nucleic acid. The operably linked promoter may be any kind of promoter. The operably linked promoter may be an inducible promoter. The operably linked promoter may be a constitutive promoter. The promoter may be an inducible promoter, which initiates transcription of the polynucleotide sequences only when exposed to a particular chemical or environmental stimulus. Examples of inducible promoters include but are not limited to those that are an alcohol inducible promoter, a tetracycline inducible promoter, a steroid inducible promoter, or a hormone inducible promoter. The promoter may be a constitutive promoter. The promoter may be a constitutive promoter, which provides transcription of the nucleic acids or polynucleotide sequences throughout the plant in most cells, tissues and organs and during many but not necessarily all stages of development. The promoter may be specific to a particular developmental stage, organ, or tissue. A tissue specific promoter may be capable of initiating transcription in a particular plant tissue. Plant tissue that may be targeted by a tissue specific promoter may be but is not limited to a stem, leaves, trichomes, anthers, or seed. A constitutive promoter herein may be the rice Ubiquitin 3 promoter (OsUbi3P) or maize the phosphoenolpyruvate carboxylase promoter (ZmPepCP). Other known constitutive promoters may be used, and include but are not limited to Cauliflower Mosaic Virus (CAMV) 35S promoter, the Cestrum Yellow Leaf Curling Virus promoter (CMP) or the CMP short version (CMPS), the Rubisco small subunit promoter, the rice acting promoter (OsAct1P) and the maize ubiquitin promoter (ZmUbi1P). The tissue specific promoter may include the seed-specific promoter. The seed specific promoter may be but is not limited to the rice GluB4 promoter or the maize zein promoter. The promoter may be the P-OsUbi promoter. The sequence of the P-OsUbi promoter may be found in the sequence of pAL409 having SEQ ID NO: 185 with reference to
(90) In the case of a regulatory element that is a terminator, the terminator is capable of terminating transcription of the nucleic acid or the polynucleotide sequence. A terminator sequence may be included at the 3 end of a transcriptional unit of the expression cassette. The terminator may be derived from a variety of plant genes. The terminator may be a terminator sequence from the nopaline synthase (NOS) or octopine synthase (OCS) genes of Agrobacterium tumefaciens. The terminator sequence may be the CaMV 35S terminator from CaMV, or any of the 3UTR sequences shown to terminate the transgene transcription in plants. For example, the maize PepC terminator (3UTR) can be used.
(91) In an embodiment, a first isolated nucleic acid or the second isolated nucleic acid may further include a targeting polynucleotide sequence encoding a targeting peptide. A targeting peptide may be fused to one or more polysaccharide degrading enzymes. When a second isolated nucleic acid encodes more than one polysaccharide degrading enzyme, a targeting peptide may be independently selected for each of the polysaccharide degrading enzymes. A targeting peptide may be selected from but is not limited to an amyloplast targeting signal, a cell wall targeting peptide, a mitochondrial targeting peptide, a cytosol localization signal, a chloroplast targeting signal, a nuclear targeting peptide, and a vacuole targeting peptide. A targeting polynucleotide may be upstream of the first isolated nucleic acid or the second isolated nucleic acid. A targeting peptide may have at least 70, 72, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identity to one of BAASS (SEQ ID NO: 107), SEKDEL signal peptide (SEQ ID NO: 108), xHvVSD targeting signal (SEQ ID NO: 109), the ZmUBQm translational fusion (SEQ ID NO: 110), the xGZein27ss (SEQ ID NO: 111), or the HvAle signal (SEQ ID NO: 112).
(92) In an embodiment, a genetic construct may be inserted in a vector appropriate for genetically engineering a plant. Vectors incorporating a genetic construct herein may also include additional genetic elements such as multiple cloning sites to facilitate molecular cloning and selection markers to facilitate selection. A selectable marker that may be included in a vector may be a phosphomannose isomerase (PMI) gene from Escherichia coli, which confers to the transformed cell the ability to utilize mannose for growth. Selectable markers that may be included in a vector include but are not limited to a neomycin phosphotransferase (npt) gene, conferring resistance to kanamycin, a hygromycin phosphotransferase (hpt) gene, conferring resistance to hygromycin, and an enolpyruvylshikimate-3-phosphate synthase gene, conferring resistance to glyphosate.
(93) An embodiment includes a vector having any genetic construct herein. The vector may be an intermediate vector. The vector may be a transformation vector. The genetic construct in the vector may have a any isolated nucleic acid or polynucleotide described herein.
(94) A vector herein may be configured for expression in a host having the gene targeted by the genetic construct. The genetic construct may be an RNAi construct. Upon expression, an RNA sequence transcribed from the first isolated nucleic acid and an RNA sequence transcribed from the inverted complement may be capable of hybridizing with each other and causing inhibition of expression of the gene in the host.
(95) A vector herein may have a sequence comprising, consisting essentially of or consisting of a sequence having at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% identity to a reference sequence selected from the group consisting of SEQ ID NO: 187 [pAG2100], SEQ ID NO: 188 [pAG2101], SEQ ID NO: 189 [pAG2102], SEQ ID NO: 190 [pAG2103], SEQ ID NO: 195 [pAG2106] and SEQ ID NO: 218 [pAL409jSbGWDko2].
(96) A vector herein may have a sequence comprising, consisting essentially of or consisting of a sequence having at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% identity to a reference sequence selected from the group consisting of SEQ ID NO: 185 [pAL409] and SEQ ID NO: 186 [pAG2004].
(97) An embodiment provides a method of making a genetically engineered plant. A genetically engineered plant may be constructed by any method of genetic engineering. A genetically engineered plant may be a transgenic plant. A transgenic plant may be transformed by any known method of transformation. Agrobacterium mediated transformation may be utilized. The transgenic plant may be created by other methods for transforming a plant, for example, particle bombardment or direct DNA uptake. The plant may be any kind of plant. The plant may be an energy crop plant, a food crop plant or a forage crop plant. The plant may be a rice plant, a switchgrass plant, a sorghum plant, a corn plant or a tomato plant. The transformation may be done with any suitable vector including or consisting of any one or more genetic construct herein. The transgenic plant may be created by Agrobacterium-mediated transformation using a vector that includes a first nucleic acid encoding a product that inactivates or inhibits one or more gene encoding a protein involved in mobilization of starch in a plant, and a second isolated nucleic acid that encodes at least one polysaccharide degrading enzyme. Agrobacterium mediated transformation may utilize any suitable transformation vector harboring any one or more RNAi construct herein. Agrobacterium mediated transformation may be done with a vector having a sequence with at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% identity to a sequence selected from the group consisting of SEQ ID NO: 187 [pAG2100], SEQ ID NO: 188 [pAG2101], SEQ ID NO: 189 [pAG2102], SEQ ID NO: 190 [pAG2103], SEQ ID NO: 195 [pAG2106] and SEQ ID NO: 218 [pAL409jSbGWDko2]. The transgenic plant may include any isolated nucleic acid, amino acid sequence, genetic construct, or vector described herein.
(98) The mutant plant may be created by mutagenizing plant seeds; e.g., by chemical mutagenesis (EMS) or radiation, and selecting the mutants by PCR amplification and sequencing the mutant PCR product. The mutant plant may be created by using mutagenesis and screening strategies such as Targeted Induced Local Lesions In Genomics (TILLING; Sikora et al. 2011), T-DNA insertion and transposon-based mutagenesis (An et al. 2005), genome editing strategies involving nucleases such as zinc-finger nucleases (Wrigyt et al. 2005), TAL effector nucleases (Christian et al. 2010), or intein-derived meganucleases (Wehrkamp-Richter et al. 2009). The references cited are incorporated herein as if fully set forth.
(99) In an embodiment, the genetically engineered plant may include any isolated nucleic acid sequence, amino acid sequence, one or more genetic construct, or one or more vectors described herein.
(100) An embodiment includes a method of altering vegetative starch levels in a plant. The method may include expressing an isolated nucleic acid in the plant. Expression of the isolated nucleic acid in the plant may alter the activity of at least one enzyme related to starch metabolism in the plant. The plant may be any genetically engineered plant herein. The genetically engineered plant may include any one or more genetic construct described herein.
(101) Any genetically engineered plant herein may be provided in a method of agricultural processing or animal feed applications. The genetically engineered plant may include any one or more genetic construct described herein. A first isolated nucleic acid that encodes a product that inactivates or inhibits expression of at least one gene encoding a protein involved in mobilization of starch in a plant, and a second isolated nucleic acid that encodes at least one polysaccharide degrading enzyme in the genetically engineered plant may be expressed at any point in the method. The first isolated nucleic acid and the second isolated nucleic acid may be expressed prior to the step of processing the plant. The first isolated sequence and the second isolated sequence may be expressed during the step of processing the plant. The expression may be induced. Upon the expression of the first and the second nucleic acids, the genetically engineered plant may have an altered level of vegetative starch compared to the level of starch in a non-genetically engineered plant of the same genetic background but lacking the one or more genetic construct.
(102) A step of providing the genetically engineered plant may include obtaining it from another party that produced it. A step of providing may include making the transgenic plant. A step of providing may include making the mutant plant. The step of providing may include genetically engineering the plant by contacting the plant with any one of the genetic constructs herein. The step of providing may include stable transformation of the plant by any of the methods described herein, or known methods. The step of providing may include genetically engineering the plant by cleaving a gene encoding a protein involved in starch metabolism at a cleavage site recognized by a restricting enzyme transiently expressed in the plant after contacting the plant with a genetic construct comprising a first nucleic acid encoding the restricting enzyme. The step of providing may also include regenerating the genetically engineered plant from a tissue of the transgenic or mutant plant having an altered level of vegetative starch. The step of providing may include obtaining a progeny of genetically engineered plant resulted from self-pollination or cross-pollination between the genetically engineered plant and non-genetically engineered plant. The genetically engineered plant may be used in a variety of subsequent methods or uses. The step of providing may include procuring the genetically engineered plant. The step of providing may include making the genetically engineered plant available for further processing steps.
(103) The genetically engineered plant may be provided in a method of agricultural processing as a feedstock engineered with elevated levels of starch and/or expressing one or more polysaccharide degrading enzyme. The feedstock may include any genetically engineered plant herein alone or in combination with other components. The other components may include other plant material. Agricultural processing is the manipulation or conversion of any agricultural feedstock for a particular product or use. Agricultural processing may include drying the genetically engineered plant. Agricultural processing may include fermenting the genetically engineered plant. Agricultural processing may include hydrolyzing the genetically engineered plant by one or more an exogenous enzymes to obtain a biochemical product.
(104) The genetically engineered plant may be provided in a method of preparing animal feed. Preparing animal feed may include combining the genetically engineered plant with distillers grains. Preparing animal feed may include pelletizing the genetically engineered plant into feed pellets. Preparing animal feed may include ensiling the genetically engineered plant to make silage.
(105) Preparing animal feed may include combing the genetically engineered plant with a source of edible fiber.
(106) Agricultural processing or preparing animal feed may also include at least one of the operations of harvesting, baling, grinding, milling, chopping, size reducing, crushing, extracting a component from the feedstock, purifying a component or portion of the feedstock, extracting or purifying starch, hydrolyzing polysaccharides into oligosaccharides or monosaccharides, chemical conversion, or chemical catalysis of the feedstock.
(107) In an embodiment of the method of agricultural processing, the genetically engineered plant may be used for producing a chemical product. The method may include pretreating a genetically engineered plant with a chemical formulation to form a mixture. The chemical formulation may include one or more moiety including but not limited to an ion of sulfite, bisulfite, sulfate, carbonate, hydroxide or oxide. The chemical formulation may further include one or more counter ion including but not limited to ammonium, sodium, magnesium or calcium.
(108) In an embodiment, the chemical formulation may include but is not limited to one compound selected from a sulfuric acid, a base, ammonium bisulfite and ammonium carbonate. The ammonium bisulfite may be at a concentration of 0.02 M to 0.35 M. The ammonium carbonate may be at a concentration of 0.025 M to 0.25 M. The sulfuric acid may be at concentration of 0.25 M. The base may be 7.5% ammonium hydroxide.
(109) In an embodiment, the chemical formulation and the genetically engineered plant may be admixed with other plant material at an optimal liquid-to-solid ratio in a mixture. The mixture may have a liquid to solid ratio selected from the value of less than or equal to one of 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1, or any value in a range between any two of the foregoing (endpoints inclusive).
(110) Pretreating may include incubating the mixture for any period of time. Pretreating may include incubating the mixture for up to 16 hours. Incubating may occur for longer or shorter periods may be performed. Pretreating may include incubating the mixture for a period of less or equal to one of 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 hour(s).
(111) Pretreating may include providing a mixture temperature of 40 C. to 150 C. A mixture temperature of 40 C. to 95 C. may allow breakage or removal of portions of lignin within the lignocellulosic material in the mixture without deactivating hydrolytic enzymes. Pretreating may include providing a mixture temperature of 40 C., 55 C., 65 C., 75 C., 95 C., 150 C., less than 55 C., less than 65 C., less than 75 C., less than 95 C., less than 150 C., 40 C. to 55 C., 40 C. to 65 C., 40 C. to 75 C., 40 C. to 95 C., 40 C. to less than 150 C., 55 C. to 65 C., 55 C. to 75 C., 55 C. to 95 C., 55 C. to less than 150 C., 65 C. to 75 C., 65 C. to 95 C., 65 C. to less than 150 C., 75 C. to 95 C., 75 C. to less than 150 C., or 95 C. to less than 150 C.
(112) Pretreating may include providing a mixture pH ranging from 1.0 to 12.0. Pretreating may include providing a mixture pH within a range of 6.5 to 8.5. The mixture pH provided may be 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 7.0, 7.5, 8.0, 9.0, 9.5, 10, 10.5, 11.0, 11.5 or 12.0, or a pH within a range between any two of the foregoing pH values (endpoints inclusive). The pH of the mixture during pretreating may depend on the type of chemical used and/or type of plant material used. Providing a mixture pH may include adding a pH modifying chemical. A pH modifying chemical may be an acid or an alkali.
(113) In an embodiment, the method of agricultural processing may further include hydrolyzing the mixture. Hydrolyzing may include incubating the mixture for a period 144 hours, 140 hours, 130 hours, 120 hours, 110 hours, 100 hours, 90 hours, 80 hours, 70 hours, 60 hours, 50 hours, 40 hours, 30 hours, 20 hours, 10 hours, 9 hours, 8 hours, 7 hours, 6 hours, 5 hours, 4 hours, 3 hours, 2 hours, 1 hour, less than 144 hours, less than 140 hours, less than 130 hours, less than 120 hours, less than 110 hours, less than 100 hours, less than 90 hours, less than 80 hours, less than 70 hours, less than 60 hours, less than 50 hours, less than 40 hours, less than 30 hours, less than 20 hours, less than 10 hours, less than 9 hours, less than 8 hours, less than 7 hours, less than 6 hours, less than 5 hours, less than 4 hours, less than 3 hours, less than 2 hours, or less than 1 hour.
(114) In an embodiment, the step of hydrolyzing may include providing a mixture temperature of 100 C. or less, 65 C. or less, 50 C. or less, 48 C. to 50 C., 48 C. to 65 C., 48 C. to less that 100 C., or 48 C. to 100 C. The step of hydrolyzing may include providing a pH ranging from 4.8 to 5.0, a pH of 4.8, a pH of 4.9, or a pH of 5.0. At least one of the temperature, pH, or time of treatment, may be selected based on the specific activity of a polysaccharide degrading enzyme in the genetically engineered plant.
(115) If the genetically engineered plant includes multiple polysaccharide degrading enzymes, the step of hydrolyzing may sequentially provide conditions optimal for at least one of expression, pretreating, or hydrolysis by each of the multiple polysaccharide degrading enzymes. The step of hydrolyzing may include providing a pH optimal for activity of one enzyme, followed by a different pH optimal for activity of another enzyme. The step of hydrolyzing may include adjusting temperatures at different periods of time for optimal activity of each enzyme. For example, a cellobiohydrolase may require a different temperature or pH than a xylanase.
(116) The method of agricultural processing may include adding one or more exogenous enzymes to at least one of the genetically engineered plant, other plant material, or the mixture. The exogenous enzymes may be added before, during, or after pretreating. The exogenous enzymes may be added before, during, or after the step of hydrolyzing. One or more exogenous enzymes may be provided in an enzyme cocktail. An enzyme cocktail may include one or more polysaccharide degrading enzymes. A polysaccharide degrading enzyme provided in an embodiment herein may be but is not limited to a lignin degrading enzyme, a cellulose degrading enzyme, or a hemicellulose degrading enzyme. A polysaccharide degrading enzyme provided in an embodiment herein may be but is not limited to one selected from glycosidases, xylanases, cellulases, endoglucanases, exoglucanases, cellobiohydrolases, -xylosidases, feruloyl esterases, -glucosidases, and amylases. An enzyme cocktail may include a cellulase isolated from Trichoderma reesii. An enzyme cocktail may be purchased from a vendor. An enzyme cocktail may be, but is not limited to, Accellerase 1000, Accellerase 1500, Accelerase TRIO, and AccelleraseXY available from Genencor International (Rochester, N.Y.). An enzyme cocktail may be Cellic, CTEC, HTEC available from Novozymes (Denmark). An enzyme cocktail may include different classes of polysaccharide degrading enzymes. An enzyme cocktail may include starch degrading enzymes. An enzyme cocktail may include an amylase or an invertase. Optimal conditions for different classes of polysaccharide degrading enzymes in a cocktail may be provided. For example, the temperature, pH and time of treatment for hydrolysis may be adjusted during the method to provide optimal conditions for different enzymes in the cocktail.
(117) The method of agricultural processing may further include contacting the mixture and/or products of hydrolysis with a fermenting organism to produce a chemical product. After enzymatic hydrolysis, soluble sugars may be recovered and used for production of a chemical product. The chemical product may be glucose. Alternatively, simultaneous saccharification and fermentation of soluble sugars into a chemical product may be performed in the method. A chemical product may be but is not limited to butane, butanediol, butadiene, butanol, isobutanol, propane, propanediol, propylene, propanol, isopropanol, methane, methanol, ethanol, phenol, glycerol, ethylene, toluene, ethyl, benzene, styrene, xylene, ethylene glycol, ethylene oxide, formic acid, carbone dioxide, formaldehyde, acetaldehyde, acetone, a vitamin, ethane, pentane, hexane, heptane, octane, benzene, acetic acid, sorbitol, arabinitol, succinic acid, fumaric acid, malic acid, furan dicarboxylic acid, aspartic acid, glucaric acid, glutamic acid, itaconic acid, levulinic acid, hydroxybutyrolactone, glycerol, sorbitol, xylitol, arabinitol, gluconic acid, lactic acid, malonic acid, propionic acid, citric acid, aconitic acid, xylonic acid, furfural, levoglucosan, alanine, proline, lysine, serine, or threonine (See T. Werpy and G. Petersen, Top Value Added Chemicals From Biomass, Volume 1, Results of Screening for Potential Candidates from Sugars and Synthesis Gas, August 2004, Report, PNNL & NREL, which is incorporated herein by reference as if fully set forth).
(118) The conversion of sugars into a chemical product may be performed by any suitable fermenting organism. The fermenting organism may be selected based on the desired chemical product. The fermenting organism may be yeast. The yeast may be but is not limited to one of Saccharomyces, Kluyveromyces, Pichia, Yarrowia, Spathaspora or Scheffersomyces ssp. The fermenting organism may be a bacterium. A bacterium may be but is not limited to a Zymomonas, Escherichia, Bacillus, Lactobacillus, or Clostridium spp. The fermenting organism may be a wild type organism or a genetically engineered recombinant organism. The fermenting organism may be a collection of organisms isolated from a ruminant animal. The fermenting organism may be an acetogen.
(119) The method of agricultural processing may include simultaneous saccharification and fermentation of soluble sugars to produce ethanol. Simultaneous saccharification and fermentation to produce ethanol may include providing Saccharomyces cerevisiae D5A before, during or after pretreating or providing hydrolysis conditions.
(120) An embodiment provides a method of preparing the genetically engineered plant to be used as an animal feed. The method may include a step of contacting the genetically engineered plant with liquid to form a mixture and incubating the mixture at a temperature of 40 C. to 100 C. for a time sufficient to produce soluble sugars from lignocellulosic material in the mixture. The liquid may be water. A mixture temperature of 40 C. to 95 C. may allow breakage or removal of lignin within the lignocellulosic material in the mixture without deactivating one or more CWDEs included in the genetically engineered plant.
(121) In an embodiment, the method of preparing the animal feed may further include contacting the mixture with an alkaline chemical. The alkaline chemical may be but is not limited to calcium oxide, calcium hydroxide, potassium hydroxide, sodium hydroxide, hypochlorite, hydrogen peroxide and ammonia. The step of contacting may occur in a vessel and may include rotating and mechanical grinding the mixture in the vessel.
(122) In an embodiment, the method of preparing the animal feed may further include adding to the mixture one or more enzymes for enzymatic hydrolysis of lignocellulosic material. The added enzymes may be but are not limited to one selected from amylases, proteases, phytases, hydrolytic enzymes, cellulases, glucanases, hemicellulases, xylanases, amylases, esterases, laccases, mannanases, and peroxidases.
(123) In an embodiment, the method of preparing the animal feed may include combining the mixture with a source of edible fiber. The source of edible fiber may be but is not limited to corn, sorghum, wheat, rye, soybeans, switchgrass, grasses, corn grain, sorghum grain, wheat grain, wheat straw, rye grain, corn fiber, corn stove, corn husks, soybean meal, corn meal, corn oil, wheat germ, corn germ, or combination thereof.
(124) In an embodiment, the method of preparing the animal feed may include combining the mixture with distillers grains. Distillers grains may be created in distilleries as byproducts in breweries and ethanol producing plants. Distillers grains may be used as fodder for livestock.
(125) In an embodiment, the method of preparing the animal feed may include pelletizing the genetically engineered plants into feed pellets. The pelletizing may be performed by any known methods. In an embodiment, the genetically engineered plants may be shredded in traditional shredders used in pellet manufacturing. The shredded material may be further ground to produce particles ranging in size from 0.5 inches to 6 inches. The particles may be used for producing pellets.
(126) In an embodiment, the method of preparing the animal feed may include ensiling the genetically engineered plants to make silage. As used herein, silage is fermented, high moisture fodder that can be fed to ruminant animals, in particular to dairy cattle, sheep and horses. Silage may be made by placing green parts of plants in a silo and by piling it in a heap covered by a plastic film. Silage retains more nutrients than dried plants, hay, or stover. Silage goes through a bacterial fermentation process resulted in production of volatile fatty acids and improved digestability for ruminant animals. The method may also include the addition of ensiling agents to improve stability or digestability of the ensiled genetically engineered plant. The ensiling agent may be but is not limited to sugars, lactic acid or inculants.
(127) In an embodiment of the method of preparing the animal feed, the genetically engineered plants may be used to obtain a digestable feedstock. The method may also include feeding an animal with a digestable feedstock comprising the genetically engineered plants to promote animal growth. The animal may be but is not limited to chicken, swine, or cattle.
(128) Further embodiments herein may be formed by supplementing an embodiment with one or more element from any one or more other embodiment herein, and/or substituting one or more element from one embodiment with one or more element from one or more other embodiment herein.
EXAMPLES
(129) The following non-limiting examples are provided to illustrate particular embodiments. The embodiments throughout may be supplemented with one or more detail from one or more example below, and/or one or more element from an embodiment may be substituted with one or more detail from one or more example below.
Example 1
Identifying Target Genes
(130) T-DNA insertion libraries from different organisms may be researched to locate genes in those organisms related to starch regulation. Based on the discovery of such genes, a search may be conducted to find similar genes in a plant of interest. The genes of interest may be used in constructs herein to affect alteration in starch regulation.
(131) A number of other methods have been developed to generate or identify null alleles among genes. Among these are TILLING (Till B J, Cooper J, Tai T H, Colowit P, Greene E A, Henikoff S, Comai L Discovery of chemically induced mutations in rice by TILLING (2007) BMC Plant Biol. 7:19), and gene tagging with Tos17 retrotranspsons or engineered maize (Zea mays) Ac and Ds/dSpm transposons (Krishnan A, Guiderdoni E, An G, Hsing Y I, Han C D, Lee M C, Yu S M, Upadhyaya N, Ramachandran S, Zhang Q, Sundaresan V, Hirochika H, Leung H, Pereira A. 2009. Mutant resources in rice for functional genomics of the grasses. Plant Physiol. 149:165-70 and references therein), which are incorporated herein by reference as if fully set forth. These methods may be used to generate or identify null alleles among genes related to starch regulation.
(132) Once genes that encode key enzymes involved in the turnover of transitory starch in example crops were identified, candidate genes in rice, maize, and sorghum that encode enzymes involved in starch turnover were identified by their homology to genes that encode the corresponding enzymes in species that have been better studied. For example, CLUSTAL amino acid alignments between hypothetical glucan water dikinase (GWD) sequences that have been inferred from draft genome sequences from sorghum (SbGWD; SEQ ID NO: 1), maize (ZmGWD; SEQ ID NO: 2) and rice (OsGWD; SEQ ID NO: 3) and the known GWD enzyme sequence from potato (StGWD; SEQ ID NO: 4) show extensive homology among these polypeptides:
(133) TABLE-US-00002 CLUSTAL2.1multiplesequencealignment SbGWD----------MTGFSAAASAAAAAERCALAIRARPAASSPAKRQQQSASLRRSGGQRRPT50 ZmGWD------------------------------------------------------------ OsGWD-------------------------------------------------------PAATT5 StGWDMSNSLGNNLLYQGFLTSTVLEHKSRISPPCVGGNSLFQQQVISKSPLSTEFRGNRLKVQK60 SbGWDTLAASRRSPVVVPRAIATSADRASHDLVGKFTLDSNSELLVAVNPAPQGLVSVIGLEVTN110 ZmGWD------------------------------------------------------------ OsGWDTLAVSRRS-LLAPRAIAASTGRASPGLVGRFTLDANSELKVTLNPAPQGSVVEINLEATN64 StGWDKKIPMEKKRAFSSSPHAVLTTDTSSELAEKFSLGGNIELQVDVRPPTSGDVSFVDFQVTN120 SbGWDTSGSLILHWGVLRPDKRDWILPSRQPDGTTVYKNRALRTPFVKSGDNSTLRIEIDDPAVQ170 ZmGWD-------------------------PDGTTVYKNRALRTPFVKSGDNSTLRIEIDDPGVH35 OsGWDTSGSLILHWGALRPDRGEWLLPSRKPDGTTVYKNRALRTPFIKSGDNSTLKIEIDDPAVQ124 StGWDGSDKLFLHWGAVKFGKETWSLPNDRPDGTKVYKNKALRTPFVKSGSNSILRLEIRDTAIE180 ****.****:******:***.***::***..:. SbGWDAIEFLIFGETQNKWFKNNGQNFQIQLQSSRHQGNGASGASSSATSTLVPEDLVQIQAYLR230 ZmGWDAIEFLIFDETQNKWFKNNGQNFQVQFQSSRHQGTGASGASSSATSTLVPEDLVQIQAYLR95 OsGWDAIEFLIFDEARNNWYKNNGQNFQIQLQASQYQGQGTSTATS---STVVPEDLVQIQSYLR181 StGWDAIEFLIYDEAHDKWIKNNGGNFRVKLSRKEIRGP----------DVSVPEELVQIQSYLR230 ******:.*::::*******::::...:*..***:*****:*** SbGWDWERKGKQSYTPEQEKEEYEAARAELIEELNRGVSLEKLRAKLTKTPEAPESDERKSPASR290 ZmGWDWERRGKQSYTPEQEKEEYEAARAELIEEVNRGVSLEKLRAKLTKAPEAPESDESKSSASR155 OsGWDWERKGKQSYTPEQEKEEYEAARTELIEELNKGVSLEKLRAKLTKTPEATDSNAPASEST-240 StGWDWERKGKQNYPPEKEKEEYEAARTVLQEEIARGASIQDIRARLTKTNDKSQSKEEPLHVT-289 ***:***.*.**:*********:***::*.*::.:**:***::.:*.: SbGWDMPVDKLPEDLVQVQAYIRWEKAGKPNYPPEKQLVELEEARKELQAEVDKGISIDQLRQKI350 ZmGWDVPIGKLPEDLVQVQAYIRWEQAGKPNYPPEKQLVEFEEARKELQAEVDKGISIDQLRQKI215 OsGWD-VTTKVPEELVQVQAYIRWEKAGKPNYAPEKQLVEFEEARKELQSELDKGTSVEQLRNKI299 StGWD--KSDIPDDLAQAQAYIRWEKAGKPNYPPEKQIEELEEARRELQLELEKGITLDELRKTI347 .:*::*.*.*******:******.****:*:****:****::**::::**:.* SbGWDLKGNIESKVSKQLKNKKYFSVERIQRKKRDIMQLLSKHKHT--VMEEKVEVAPKQPTVLD408 ZmGWDLKGNIESKVSKQLKNKKYFSVERIQRKKRDITQLLSKHKHT--VMEDKVEVVPKQPTVLD273 OsGWDLKGNIETKVSKQLKDKKYFSVERIQRKKRDIVQLLKKHKPT--VMEAQVET-PKQPTVLD356 StGWDTKGEIKTKVEKHLK-RSSFAVERIQRKKRDFGHLINKYTSSPAVQVQKVLEEPPALSKIK406 **:*::**.*:**:.*:**********::*:.*:.:*:**::. SbGWDLFTKSLHEKDGCEVLSRKLFKFGDKEILAISTKVQNKTEVHLATNHTEPLILHWSLAKKA468 ZmGWDLFTKSLHEKDGCEVLSRKLFKFGDKEILAISTKVQNKTEVHLATNHTDPLILHWSLAKNA333 OsGWDLFTKSLQEQDNCEVLSRKLFKFGDKEILGITTVALGKTKVHLATNYMEPLILHWALSKEN416 StGWDLYAKEKEEQIDDPILNKKIFKVDDGELLVLVAKSSGKTKVHLATDLNQPITLHWALSKSP466 *::*..*:.:*.:*:**..**:*::.**:*****::*:***:*:*. SbGWDGEWKAPPSNILPSGSKLLDMACETEFTRSELDGLC--YQVVEIELDDGGYKGMPFVLRSG526 ZmGWDGEWKAPSPNILPSGSTLLDKACETEFTKSELDGLH--YQVVEIELDDGGYKGMPFVLRSG391 OsGWDGEWQAPPSSILPSGSSLLDKACETSFSEYELNGLH--CQVVEIELDDGGYKRMPFVLRSG474 StGWDGEWMVPPSSILPPGSIILDKAAETPFSASSSDGLTSKVQSLDIVIEDGNFVGMPFVLLSG526 ***.*...***.**:***.***:.:***::*::**.:******* SbGWDETWIKNNGSDFFLDFSTRDTRNIK--LKDNGDAGKGTAKALLERIADLEEDAQRSLMHRF584 ZmGWDETWIKNNGSDFFLDFSTHDVRNIKAILKDNGDAGKGTSKALLERIADLEEDAQRSLMHRF451 OsGWDETWMKNNGSDFYLDFSTKVAKNTK----DTGDAGKGTAKALLERIADLEEDAQRSLMHRF530 StGWDEKWIKNQGSDFYVGFSAASKLALK-----AAGDGSGTAKSLLDKIADMESEAQKSFMHRF581 *.*:**:****::.**:*..*.**:*:**::***:*.:**:*:**** SbGWDNIAADLADEARDAGLLGIVGLEVWIRFMATRQLTWNKNYNVKPREISKAQDRFTDDLENM644 ZmGWDNIAADPADQARDAGLLGIVGLFVWIRFMATRQLTWNKNYNVKPREISKAQDRFTDDLENM511 OsGWDNIAADLVDQARDNGLLGIIGIFVWIRFMATRQLIWNKNYNVKPREISKAQDRFTDDLENM590 StGWDNIAADLIEDATSAGELGFAGILVWMRFMATRQLIWNKNYNVKPREISKAQDRLTDLLQNA641 *****::*.***:*::**:**************************:***:* SbGWDYRTYPQYREILRMIMAAVGRGGEGDVGQRIRDEILVIQRNNDCKGGMMEEWHQKLHNNTS704 ZmGWDYKTYPQYREILRMIMAAVGRGGEGDVGQRIRDEILVIQRNNDCKGGMMEEWHQKLHNNTS571 OsGWDYRTYPQYQEILRMIMSAVGRGGEGDVGQRIRDEILVIQRNNDCKGGMMEEWHQKLHNNTS650 StGWDFTSHPQYREILRMIMSTVGRGGEGDVGQRIRDEILVIQRNNDCKGGMMQEWHQKLHNNTS701 :::***:*******::*******************************:*********** SbGWDPDDVVICQALIDYIKNDFDISVYWDTLNKNGITKERLLSYDRAIHSEPNFRSEQKEGLLR764 ZmGWDPDDVVICQALIDYIKSDFDISVYWDTLNKNGITKERLLSYDRAIHSEPNFRSEQKAGLLR631 OsGWDPDDVVICQALLDYIKSDFDIGVYWDTLKKDGITKERLLSYDRPIHSEPNFRSEQKDGLLR710 StGWDPDDVVICQALIDYIKSDFDLGVYWKTLNENGITKERLLSYDRAIHSEPNFRGDQKGGLLR761 **********:****.***:.***.**:::************.********.:****** SbGWDDLGNYMRSLKAVHSGADLESAIATCMGYKSEGEGFMVGVQINPVKGLPSGFPELLEFVLD824 ZmGWDDLGNYMRSLKAVHSGADLESAIASCMGYKSEGEGFMVGVQINPVKGLPSGFPELLEFVLE691 OsGWDDLGNYMRSLKAVHSGADLESAIATCMGYKSEGEGFMVGVQINPVKGLPSGFPKLLEFVLD770 StGWDDLGHYMRTLKAVHSGADLESAIANCMGYKTEGEGFMVGVQINPVSGLPSGFQDLLHFVLD821 ***:***:***************.*****:**************.******.**.***: SbGWDHVEDKSAEPLLEGLLEARVDLRPLLLDSPERMKDLIFLDIALDSTFRTAIERSYEELNDA884 ZmGWDHVEDKSAEPLPEGLLEARVELRPLLLDSRERMKDLIFLDIALDSTFRTAIERSYEELNDA751 OsGWDHVEDKSAEPLLEGLLEARAELHPLLLGSPERMKDLIFLDIALDSTFRTAVERSYEELNNV830 StGWDHVEDKNVETLLERLLEAREELRPLLLKPNNRLKDLLFLDIALDSTVRTAVERGYEELNNA881 *****..*.*******:*:****.:*:***:*********.***:**.*****:. SbGWDAPEKIMYFISLVLENLAFSIDDNEDILYCLKGWNQALEMAKQKDDQWALYAKAFLDRIRL944 ZmGWDAPEKIMYFISLVLENLALSIDDNEDILYCLKGWNQALEMAKQKDDQWALYAKAFLDRNRL811 OsGWDEPEKIMYFISLVLENLALSTDDNEDILYCLKGWNQALEMAKQKNNQWALYAKAFLDRTRL890 StGWDNPEKIMYFISLVLENLALSVDDNEDLVYCLKGWNQALSMSNGGDNHWALFAKAVLDRTRL941 ****************:******::**********.*:::::***:***.***** SbGWDALASKGEQYHNMMQPSAEYLGSLLSIDKWAVNIFTEEIIRGGSAATLSALLNRFDPVLRN1004 ZmGWDALASKGEQYHNMMQPSAEYLGSLLSIDQWAVNIFTEEIIRGGSAATLSALLNRFDPVLRN871 OsGWDALASKGEQYYNLMQPSAEYLGSLLNIDQWAVNIFTEEIIRGGSAATLSALLNRIDPVLRN950 StGWDALASKAEWYHHLLQPSAEYLGSILGVDQWALNIFTEEIIRAGSAASLSSLLNRLDPVLRK1001 *****.**::::*********:*.:*:**:*********.****:**:****:*****: SbGWDVANLGSWQVISPVEVSGYVVVVDELLAVQNKSYDKPTILVAKSVKGEEEIPDGVVGVITP1064 ZmGWDVAHLGSWQVISPVEVSGYVVVVDELLAVQNKSYDKPTILVAKSVKGEEEIPDGVVGVITP931 OsGWDVAQLGSWQVISPVEVSGYIVVVDELLAVQNKSYDKPTILVAKSVKGEEEIPDGVVGVITP1010 StGWDTANLGSWQIISPVEAVGYVVVVDELLSVQNEIYEKPTILVAKSVKGEEEIPDGAVALITP1061 .*:*****:*****.**:*******:***:*:*******************.*.:*** SbGWDDMPDVLSHVSVRARNSKVLFATCFDHTTLSELEGYDQKLLSFKPTSADITYREITESELQ1124 ZmGWDDMPDVLSHVSVRARNSKVLFATCFDHTTLSELEGYDQKLFSFKPTSADITYREITESELQ991 OsGWDDMPDVLSHVSVRARNCKVLFATCFDPNTLSELQGHDGKVFSFKPTSADITYREIPESELQ1070 StGWDDMPDVLSHVSVRARNGKVCFATCFDPNILADLQAKEGRILLLKPTPSDIIYSEVNEIELQ1121 ***********************.*::*:.:::::***.:****:**** SbGWDQSSSPNAEVGHAVPSISLAKKKFLGKYAISAEEFTEEMVGAKSRNIAYLKGKVPSWVGVP1184 ZmGWDQSSSPNAEVGHAVPSISLAKKKFLGKYAISAEEFSEEMVGAKSRNIAYLKGKVPSWVGVP1051 OsGWD-SGSLNAEAGQAVPSVSLVKKKFLGKYAISAEEFSEEMVGAKSRNVAYLKGKVPSWVGVP1129 StGWD--SSSNLVEAETSATLRLVKKQFGGCYAISADEFTSEMVGAKSRNIAYLKGKVPSSVGIP1179 .**..:.::*.**:*******:**:.*********:***********:* SbGWDTSVAIPFGTFEKVLSDGLNKEVAQTIEKLKIRLAQEDFSALGEIRKAVLNLTAPMQLVNE1244 ZmGWDTSVAIPFGTFEKVLSDGLNKEVAQSIEKLKIRLAQEDFSALGEIRKVVLNLTAPMQLVNE1111 OsGWDTSVAIPFGTFEKVLSDEINKEVAQTIQMLKGKLAQDDFSALGEIRKTVLNLTAPTQLIKE1189 StGWDTSVALPFGVFEKVLSDDINQGVAKELQILMKKLSEGDFSALGEIRTTVLDLSAPAQLVKE1239 ****:***.*******:*:**:::*:*::*********..**:*:****::* SbGWDLKERMLGSGMPWPGDEGNRRWEQAWMAIKKVWASKWNERAYFSTRKVKLNHEYLSMAVLV1304 ZmGWDLKERMLGSGMPWPGDEGDKRWEQAWMAIKKVWASKWNERAYFSTRKVKLDHEYLSMAVLV1171 OsGWDLKEKMLGSGMPWPGDEGDQRWEQAWMAIKKVWASKWNERAYFSTRKVKLDHDYLSMAVLV1249 StGWDLKEKMQGSGMPWPGDEGPKRWEQAWMAIKKVWASKWNERAYFSTRKVKLDHDYLCMAVLV1299 ***:************:******************************:*:**.***** SbGWDQEVVNADYAFVIHTTNPSSGDSSEIYAEVVKGLGETLVGAYPGRAMSFVCKKDDLDSPKL1364 ZmGWDQEVVNADYAFVIHTTNPSSGDSSEIYAEVVKGLGETLVGAYPGRAMSFVCKKDDLDSPKL1231 OsGWDQEIVNADYAFVIHTTNPSSGDSSEIYAEVVKGLGETLVGAYPGRAMSFVCKKNDLDSPKV1309 StGWDQEIINADYAFVIHTTNPSSGDDSEIYAEVVRGLGETLVGAYPGRALSFICKKKDLNSPQV1359 **::*****************.********:**************:**:***.**:**:: SbGWDLGYPSKPIGLFIRRSIIFRSDSNGEDLEGYAGAGLYDSVPMDEEDEVVLDYTTDPLIVDR1424 ZmGWDLGYPSKPIGLFIRQSIIFRSDSNGEDLEGYAGAGLYDSVPMDEEDEVVLDYTTDPLIVDR1291 OsGWDLGFPSKPIGLFIKRSIIFRSDSNGEDLEGYAGAGLYDSVPMDEEDEVILDYTTDPLITDQ1369 StGWDLGYPSKPIGLFIKRSIIFRSDSNGEDLEGYAGAGLYDSVPMDEEEKVVIDYSSDPLITDG1419 **:*********::******************************::*::**::****.* SbGWDGFRNSILSSIARAGHAIEELYGSPQDVEGVVKDGKIYVVQTRPQM1469(SEQIDNO:1) ZmGWDGFRSSILSSIARAGHAIEELYGSPQDVEGVVKDGKIYVVQTRPQM1336(SEQIDNO:2) OsGWDGFQKSILSSIARAGHAIEELYGSPQDVEGAVKEGKLYVVQTRPQM1414(SEQIDNO:3) StGWDNFRQTILSNIARAGHAIEELYGSPQDIEGVVRDGKIYVVQTRPQM1464(SEQIDNO:4) .*:.:***.*****************:**.*::**:*********
(134) Based on this homology, it is possible to identify gene sequences (including coding sequences which had been previously unannotated in public databases) that correspond the GWD enzymes from the genomes of the respective species. These sequences include putative GWD cDNA (coding) sequences from sorghum (SEQ ID NO: 5), maize (SEQ ID NO: 6), and rice (SEQ ID NO:7), the corresponding genes from which the GWD mRNAs are transcribed for GWD from sorghum (SEQ ID NO: 8) and maize (SEQ ID NO: 9). Furthermore, from this information, one can also infer the sequences of the 5 untranslated regions (UTRs) and promoters of the respective GWD genes in sorghum (SEQ ID NO: 10) and maize (SEQ ID NO: 11), as well as the 3 UTRs of the respective GWD genes in sorghum (SEQ ID NO: 12) and maize (SEQ ID NO: 13). The SbGWD gene is located on sorghum chromosome 10 of sorghum and consists of a 12128 bp sequence (ATG to Stop), while the 11693 bp ZmGWD gene is located on chromosome 6 of maize. The overall sequence identity between the two genes is 78.3%, while the inferred cDNA sequences are 95.6% identical. Each gene is composed of 32 exons of identical length interrupted by 31 introns, which differ not considerably in size. The deduced 1471 amino acid (AA) ZmGWD and 1469AA SbGWD proteins share 94.4% sequence identity at the amino acid level. Both proteins are characterized by the presence of the PPDK_N domain (Pyruvate phosphate dikinase, PEP/pyruvate binding) in the 1267-1470AA region of ZmGWD and 1265-1468AA of SbGWD. The other notable characteristic of the two proteins is the presence of a conserved His residue at 1074AA in ZmGWD and at 1072AA in SbGWD, which is required for phosphorylation activity.
(135) Similar strategies can be used to identify gene sequences for other target enzymes in sorghum and maize. CLUSTAL alignment as follows was used to identify candidate PWD target proteins from sorghum (SbPWD; SEQ ID NO: 14) and maize (ZmPWD; SEQ ID NO: 15) based on their similarity to the known PWD protein from Arabidopsis thaliana (AtPWD; SEQ ID NO: 16).
(136) TABLE-US-00003 CLUSTAL2.1multiplesequencealignment SbPWDMASLRPFDPSLAARPGPPPPARPAARRPVPAPPLAAPFASALIFPVRPRIPGRTRGTGVA60 ZmPWD------------------------------------------------------------ AtPWD-------MESIGSHCCSSPFTFITRNSSSSLPRLVN--ITHRVNLSHQSHRLRNSNSRLT51 SbPWDASTKHITRTKEEKQTDPSKQDIVRLHVCLDHQVMFGEHVGIIGSAKELGSWKSPVEMDWT120 ZmPWD------------------------------------------------------------ AtPWDRTATSSSTIEEQRKKKDGSGTKVRLNVRLDHQVNFGDHVAMFGSAKEIGSWKKKSPLNWS111 SbPWDPNGWVCQLDLPGETLLEFKFVVFLNRGKDKIWEDGDNRVVNLPKNGSFDMACHWNKTKEP180 ZmPWD------------------------------------------------------------ AtPWDENGWVCELELDGGQVLECKFVIVKNDG-SLSWESGDNRVLKVPNSGNFSVVCHWDATRET170 SbPWDLNLLGTS-EIKLSGDTEKEKDEDAKLSRNIALEEMGNISNAGDGDLTPKLESSTLGGLWQ239 ZmPWD------------------------------------------------------------ AtPWDLDLPQEVGNDDDVGDGGHERDNHDVGDDRVVGSENG-----------AQLQKSTLGGQWQ219 SbPWDGSDTVFMRSNEHRNNESDRKWDMTGLDAVSLKLVEGDKASRNWWRKLELVRGLVSEYVHD299 ZmPWD------------------------------------------------------------ AtPWDGKDASFMRSNDHGNREVGRNWDTSGLEGTALKMVEGDRNSKNWWRKLEMVREVIVGSVER279 SbPWDQSHLEALTYSAIYLKWIYTGQIPCFEDGGHHRPNKHAEISRQIFREIERIYYGENTSAQD359 ZmPWD------------------------------------------------------------ AtPWDEERLKALIYSAIYLKWINTGQIPCFEDGGHHRPNRHAEISRLIFRELEHICSKKDATPEE339 SbPWDLLVIRKIHPCLPSFKSEFTASVPLTRIRDIAHRNDIPHDLKQEIKHTIQNKLHRNAGPED419 ZmPWD------------------------------------------------------------ AtPWDVLVARKIHPCLPSFKAEFTAAVPLTRIRDIAHRNDIPHDLKQEIKHTIQNKLHRNAGPED399 SbPWDLIATEAMLARITKTPGEYSEAFVEQFKTFYSELKDFFNAGSLLEQVQSIEQSLDESGLEA479 ZmPWD-----------------------------------------LLEQLESIEQSLNESGLEA19 AtPWDLIATEAMLQRITETPGKYSGDFVEQFKIFHNELKDFFNAGSLTEQLDSMKISMDDRGLSA459 ***::*::*:::**.* SbPWDLSSFLKTKKNLDQLEDAKDLDENGGVQVLLKTLLSLSYLRSILMKGLESGLRNDAPDSAI539 ZmPWDLSSFLKTKKNLDQLEDAKDLDENGGVHVLLKTLLSLSYLRSILMKGLESGLRNDAPDSAI79 AtPWDLNLFFECKKRLDTSG------ESSNVLELIKTMHSLASLRETIIKELNSGLRNDAPDTAI513 *.*::**.***...**:**:**:**.::**:*********:** SbPWDAMRQKWRLCEIGLEDYSFVLLSRYINALEALGGSASLAEGLPT-NTSLWDDALDALVIGI598 ZmPWDAMRQKWRLCEIGLEDYSFVLLSRYINALEALGGSASLAEGLPT-NTSLWDDALDALIIGI138 AtPWDAMRQKWRLCEIGLEDYFFVLLSRFLNALETMGGADQLAKDVGSRNVASWNDPLDALVLGV573 **********************::****::**:.**:.::*.:*:*.****::*: SbPWDNQVSFSGWKPNECTAIVNELLSWKQKGLSEFEGSEDGKYIWALRLKATLDRSRALTEEYS658 ZmPWDNQVCFSGWKPNECSAIVNELLSWKQKGLSEFEGNEDGKYIWALRLKATLDRTGRLTEEYS198 AtPWDHQVGLSGWKQEECLAIGNELLAWRERDLLEKEGEEDGKTIWAMRLKATLDRARRLTAEYS633 :**:****:********:*:::.****.*******:********:****** SbPWDEALLSIFPEKVKVLGKALGIPENSVRTYTEAEIRAGVIFQVSKLCTVLLKATRAVLGSSV718 ZmPWDEALLSIFPEKVKVLGKALGIPENSVRTYTEAEIRASVIFQVSKLCTVLLKATRAVLGSSV258 AtPWDDLLLQIFPPNVEILGKALGIPENSVKTYTEAEIRAGIIFQISKLCTVLLKAVRNSLGSEG693 :**.***:*::************:*********.:***:**********.****. SbPWDWDVLVPGVAHGALIQVERIAPGSLPSSIKEPVVLVVNKADGDEEVKAAGDNIVGVILLQE778 ZmPWDWDVLVPGVAHGALIQVERIAPGSLPSSMKEPVVLVVNKADGDEEVKAAGDNIVGVVLLQE318 AtPWDWDVVVPGSTSGTLVQVESIVPGSLPATSGGPIILLVNKADGDEEVSAANGNIAGVMLLQE753 ***:***:*:*:****.*****::*::*:**********.**..**.**:**** SbPWDLPHLSHLGVRARQEKVVFVTCEDDDTIKNTRLLEGKYVRLGASSNNVDLSVVSNKDECAA838 ZmPWDLPHLSHLGVRARQEKVVFVTCEDDDTIKNMRLLEGKHVRLGASSNNVDLSVVSNKDDCAA378 AtPWDLPHLSHLGVRARQEKIVFVTCDDDDKVADIRRLVGKFVRLEASPSHVNLILST---EGRS810 ***************:*****:***.::****.*****..:*:*:::: SbPWDMSSELSSGGNLFAQQFSLPLTTDKKLELSEQRS-----------YTSGANIMSGVLELSE887 ZmPWDMSSEPSAGGDLFAQQFSL-LTTDKKLELSEQKS-----------YTSVANGMSGVLELSE426 AtPWDRTSKSSATKKTDKNSLSKKKTDKKSLSIDDEESKPGSSSSNSLLYSSKDIPSGGIIALAD870 :*:*:.:.:**.*.*.:.::.**:*.*::*:: SbPWDASIESSGAKAAACGTLSVLSSVSNKVYNDQGTPAAFRVPAGAVIPFGSMEDAFKKSGSLK947 ZmPWDASIESSGAKAAACGTLSVLSSMSNKVYNDQGTPAAFRVPAGAVIPFGSMEDALKKSGSLK486 AtPWDADVPTSGSKSAACGLLASLAEASSKVHSEHGVPASFKVPTGVVIPFGSMELALKQNNSEE930 *.::**:*:*****:*:.*.**:.::*.**:*:**:*.*********:*:..*: SbPWDSYTNLLERIETAQIENGELDSLSAELQATVSLLSPSEEIIESLKRIFDQNVRLIVRSTAN1007 ZmPWDSYTNLLERIETAQIENGELDSLSSKLQATVSLLSPSEEIIESLKKTFDQNVRLIVRSTAN546 AtPWDKFASLLEKLETARPEGGELDDICDQIHEVMKTLQVPKETINSISKAFLKDARLIVRSSAN990 .::.***::***:*.****.:.:::.:.*..:**:*:.:*::.******:** SbPWDVEDLAGMSAAGLYESIPNVSLSDPSSFCAAVGQVWASLYTRRAILSRRAAGVPQRDAKMA1067 ZmPWDVEDLAGMSAAGLYESIPNVSLSDPRSFGAAVGQVWASLYTRRAILSRRAAGVPQRDAKMA606 AtPWDVEDLAGMSAAGLYESIPNVSPSDPLVFSDSVCQVWASLYTRRAVLSRRAAGVSQREASMA1050 ************************:************:********.**:*.** SbPWDVLVQEMLQPDLSFVLHTVSPVDHDPKLVEAEVAPGLGETLASGTRGTPWRLSCHKFDGKV1127 ZmPWDVLVQEMLQPDLSFVLHTISPVDHDPKLVEAEVAPGLGETLASGTRGTPWRLSCHKLDGKV666 AtPWDVLVQEMLSPDLSFVLHTVSPADPDSNLVEAEIAPGLGETLASGTRGTPWRLASGKLDGIV1110 *******.*********:**.**.:*****:*******************:.*:*** SbPWDTTLAFANFSEEMVVLNSGPTDGEVTRRTVDYSKKPLSVDATFRGQFGQRLAAIGQYLEQK1187 ZmPWDTTLAFANFSEELMVLNSGPTDGEMSRRTVDYSKKPLSVDATFREQFGQRLAAIGQYLEQK726 AtPWDQTLAFANFSEELLVSGTGPADGKYVRLTVDYSKKRLTVDSVFRQQLGQRLGSVGFFLERN1170 **********::*.:**:**:*********:**:.***:****.::*:**:: SbPWDFGSAQDVEGCLVGQDIFIVQSRPQP-1212(SEQIDNO:14) ZmPWDFGSAQDVEGCLVGPDIFIVQSRPQPQ752(SEQIDNO:15) AtPWDFGCAQDVEGCLVGEDVYIVQSRPQPL1196(SEQIDNO:16) **.***********::********
(137) Based on this homology, it was possible to identify gene sequences (including coding sequences that had been previously unannotated in public databases) that correspond the PWD enzymes from the genomes of the respective species. These sequences include putative PWD cDNA (coding) sequences from sorghum (SEQ ID NO: 17), maize (SEQ ID NO: 18), and rice (SEQ ID NO: 19), and the corresponding genes from which the PWD mRNAs are transcribed for PWD from sorghum (SEQ ID NO: 20) and maize (SEQ ID NO: 21). Furthermore, from this information, one can also infer the sequences of the 5 untranslated regions (UTRs) and promoters of the PWD gene in sorghum (SEQ ID NO: 22), as well as the 3 UTRs of the respective PWD genes in sorghum (SEQ ID NO: 23) and maize (SEQ ID NO: 24). The 24.3 kb SbPWD gene is located on sorghum chromosome 4. The deduced protein sequence of the SbPWD (1212AA) has 57% sequence identity in its entire length to the functionally characterized Arabidopsis 1196AA PWD protein (AY747068). However, only the C-terminal part of the SbPWD protein (374AA) has sequence homology to maize genomic sequences localized on chromosome 10. This partial protein sequence of maize ZmPWD has 58% sequence identity to AtPWD and 93% sequence identity to SbPWD. Sequence alignment of the compiled sequences for the last 13 exons of the SbPWD and ZmPWD genes demonstrated 95.2% sequence identity on the nucleotide level between coding regions. However, when the entire genomic sequences for this part of the maize and sorghum PWD are aligned, only 31% sequence identity is present. The differences in intronic sequences between the two genes could explain this situation. For example, intron #11 in SbPWD is 5.3 kb and intron #13 is 0.8 kb, while their counterparts in ZmPWD are 17 kb and 5 kb long. The opposite situation was found for intron #16, which is 3.6 kb in SbPWD and only 248 bp in ZmPWD. Sequence analysis of the 252 bp 3 UTR of SbPWD and ZmPWD demonstrated 78% sequence identity interrupted by numerous breaks in homology.
(138) CLUSTAL sequence alignment as follows was used to identify candidate DSP target proteins from maize (ZmDSP; SEQ ID NO: 25) and sorghum (SbDSP; SEQ ID NO: 26) based on their similarity to the known DSP protein from Arabidopsis thaliana (AtDSP; SEQ ID NO: 27). Similarly, it was possible to identify a candidate rice DSP protein (OsDSP; SEQ ID NO: 28) among lists of hypothetical proteins from rice.
(139) TABLE-US-00004 CLUSTAL2.1multiplesequencealignmentZmDSP,SbDSPandAtDSP ZmDSPMNCLQNLLKE--PPIVGSRSMRR---PSPLNLAMVRGGSRRSNTVKT---LQAPGASTSG52 SbDSPMNCLQNLLKE--PPIVGSRSMRR---PSPLNLAMVRGGSRRSNTVKT---LQAPGASTSG52 AtDSPMNCLQNLPRCSVSPLLGFGCIQRDHSSSSSSLKMLISPPIKANDPKSRLVLHAVSESKSS60 *******:.*::*.::*.*..**:..::**:*:*.*.*. ZmDSPAESSAVEMGTEKSEVYSTNMTQAMGAALTYRHELGMNYNFIRPDLIVGSCLQSPLDVDKL112 SbDSPAESSAVEMGTEKSEVYSTNMTHAMGAALTYRHELGMNYNFIRPDLIVGSCLQSPLDVDKL112 AtDSPSEMSGVAKDEEKSDEYSQDMTQAMGAVLTYRHELGMNYNFIRPDLIVGSCLQTPEDVDKL120 :**.*.***:**:**:****.*************************:****** ZmDSPRKIGVKTVFCLQQDSDLEYFGVDIRAIQDYSLQFKDIVHCRAEIRDFDAFDLRLRLPAVV172 SbDSPRKIGVKTVFCLQQDSDLEYFGVDIGAIQDYSLQFKDIMHCRAEIRDFDAFDLRLRLPAVV172 AtDSPRKIGVKTIFCLQQDPDLEYFGVDISSIQAYAKKYSDIQHIRCEIRDFDAFDLRMRLPAVV180 *******:******.*********:***:::.****.***********:****** ZmDSPSKLHKLINCNGGVTYIHCTAGLGRAPAVALAYMFWILGYSLNEGHRLLQSKRACFPKLEA232 SbDSPSKLHKLVNCNGGVTYIHCTAGLGRAPAVALAYMFWILGYSLNEGHQLLQSKRACFPKLEA232 AtDSPGTLYKAVKRNGGVTYVHCTAGMGRAPAVALTYMFWVQGYKLMEAHKLLMSKRSCFPKLDA240 ..*:*::******:*****:********:****:**.**.*:*****:*****:* ZmDSPIKLATADILTGLSKNTITLKWEADGSSSVEISGLDIGWGQRIPLTYDEEKGAWFLEKELP292 SbDSPIKLATADILTGLSKNTITLKWEDDGSSSVEISGLDIGWGQRIPLTYDEERGAWFLEKELP292 AtDSPIRNATIDILTGLKRKTVTLTLKDKGFSRVEISGLDIGWGQRIPLTLDKGTGFWILKRELP300 *:********.::*:**.:.********************:**:*::*** ZmDSPEGRYEYKYVVDGKWLCNEHELITKPNADGHVNNYVQVSRDGTSDEEKELRERLTGPDPDL352 SbDSPEGRYEYKYIVDGKWLCNEHEMLTKPNADGHVNNYVQVSRDGTSDEEKELRERLTGPDPVL352 AtDSPEGQFEYKYIIDGEWTHNEAEPFIGPNKDGHTNNYAKVVDDPTS-VDGTTRERLSSEDPEL359 **::****::**:****:*****.***.:****:****:.*** ZmDSPTDQERLMIREYLEQYADAAER373(SEQIDNO:25) SbDSPTDEERLMIREYLEQYADAGER373(SEQIDNO:26) AtDSPLEEERSKLIQFLETCSEAEV-379(SEQIDNO:27) ::**:::**::*
(140) Based on this homology, it was possible to identify gene sequences (including coding sequences which had been previously unannotated in public databases) that correspond the DSP enzymes from the genomes of the respective species. These sequences include putative DSP cDNA (coding) sequences from maize (SEQ ID NO: 29), sorghum (SEQ ID NO: 30), and rice (SEQ ID NO: 31), the corresponding genes from which the DSP mRNAs are transcribed for DSP from maize (SEQ ID NO: 32) and sorghum (SEQ ID NO: 33). Furthermore, from this information, one can also infer the sequences of the 5 untranslated regions (UTRs) and promoters of the DSP gene in maize (SEQ ID NO: 34) and sorghum (SEQ ID NO: 35), as well as the 3 UTRs of the respective DSP genes in maize (SEQ ID NO: 36) and sorghum (SEQ ID NO: 37). The 5 kb and 6 kb DSP genes are localized on chromosomes 1 of sorghum and maize genomes, respectively. The DSP gene structure is very similar in both maize and sorghum, with each gene containing 14 exons of identical length and 13 slightly differently sized introns. The deduced protein sequence alignment between Arabidopsis, maize, and sorghum demonstrates 58.8% identity, while between SbDSP and ZmDSP proteins there is 96.5% sequence identity on the amino acid level. The sequence identity between the compiled exons of SbDSP and ZmDSP genes is 95.5%, while when the introns are included, the identity between the two genes drops to 65.3%. There is 50-60% sequence identity in 513 UTR regions. Both SbDSP and ZmDSP proteins contain the DSPc domain (Dual specificity phosphatase, catalytic domain) at 130-227AA with the active site at the residue C190.
Example 2
Suppressing Expression, Accumulation, or Activity of the Target Enzymes in Plants
(141) Once the genes encoding each of the target enzymes have been identified in a given species, the next objective was to suppress the expression, accumulation, or activity of that enzyme in planta.
(142) The target genes were inhibited using an RNA interference strategy. One approach to gene suppression via RNA interference was to modify a naturally-occurring microRNA sequence such that its native targeting sequences are altered so that they will recognize a different target sequence, and then to express this modified microRNA according to Wartmann et al. 2008, which is incorporated herein by reference as if fully set forth.
(143) An example of naturally occurring microRNA from rice is illustrated in
(144) Alternatively, the expression of a target enzyme was suppressed by expressing an appropriately-designed hairpin RNA (hpRNA) that includes inverted copies of an RNA sequence that is homologous to a portion of the mRNA, the 5 UTR, or the 3 UTR for a target enzyme, according to Horiguchi 2004, which is incorporated herein by reference as if fully set forth. The sequences homologous to the target mRNA are called driver sequences. DNA sequences encoding individual RNA driver sequences that were used to create hpRNAs include OsDSP1 (SEQ ID NO: 40), OsDSP2 (SEQ ID NO: 41): OsGWD1 (SEQ ID NO: 42), OsGWD2 (SEQ ID NO: 43), OsPWD1 (SEQ ID NO: 44), OsPWD2 (SEQ ID NO: 45), SbGWD (SEQ ID NO: 46), SbGWD1 (SEQ ID NO: 47), SbGWD2 (SEQ ID NO: 48), ZmGWD1 (SEQ ID NO: 49) and ZmGWD2 (SEQ ID NO: 50).
(145) The strategy for expressing inverted driver sequences was to transcribe them such that the inverted copies were separated by a spacer sequence. This spacer itself corresponded to an intron. Introns that were used included the Zea mays alcohol dehydrogenase intron (ZmAdh1i6; SEQ ID NO: 51), the Oryza sativa alcohol dehydrogenase intron (OsAdh1i; SEQ ID NO: 52), Sorghum bicolor alcohol dehydrogenase intron (SbAdh1-2i: SEQ ID NO: 53) and Sorghum bicolor glucan water dikinase intron (SbGWDi; SEQ ID NO: 54). In an expression cassette, to direct transcription of the microRNA or hpRNA a driver sequence was linked to the Z. mays ubiquitin promoter (ZmUbi1P; SEQ ID NO: 55), the Z. mays phosphoenolpyruvate carboxylase promoter (ZmPepCP: SEQ ID NO: 56) or the O. sativa ubiquitin promoter (OsUbi3P; SEQ ID NO: 57). A polyadenylation signal (NOS terminator; SEQ ID NO: 58) was used as the transcription terminator.
(146)
(147) Multiple cassettes for different hpRNAs can be linked in a single construct to be introduced into transgenic plants such that two hpRNAs will be expressed simultaneously.
(148) Examples of constructs with multiple hpRNA expression cassettes are provided as SEQ ID NO: 72 [OsDSP1 and OsGWD2], SEQ ID NO: 73[OsPWD2 and OsGWD1] and SEQ ID NO: 74 [OsDSP2 and OsPWD1].
Example 3
Vectors
(149) Sequences from any gene related to starch regulation may be provided in an intermediate RNAi vector, a transformation vector, or in a transgenic plant herein.
(150) RNAi Vector pAL409
(151) An example of an intermediate RNAi vector is pAL409, which is illustrated in
(152) TABLE-US-00005 >pAL409sequence [SEQIDNO:185] TCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCC GGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCC CGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACT ATGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTGA AATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCCATTCG CCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTT CGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAG TTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACGACGGCC AGTGAATTCGGGCGGTTAATTAACTAATCGACTCTAGTAACGGCCGCCA GTGTGCTGGAATTAATTCGGCTTGTCGACCACCCAACCCCATATCGACA GAGGATGTGAAGAACAGGTAAATCACGCAGAAGAACCCATCTCTGATAG CAGCTATCGATTAGAACAACGAATCCATATTGGGTCCGTGGGAAATACT TACTGCACAGGAAGGGGGCGATCTGACGAGGCCCCGCCACCGGCCTCGA CCCGAGGCCGAGGCCGACGAAGCGCCGGCGAGTACGGCGCCGCGGCGGC CTCTGCCCGTGCCCTCTGCGCGTGGGAGGGAGAGGCCGCGGTGGTGGGG GCGCGCGCGCGCGCGCGCGCAGCTGGTGCGGCGGCGCGGGGGTCAGCCG CCGAGCCGGCGGCGACGGAGGAGCAGGGCGGCGTGGACGCGAACTTCCG ATCGGTTGGTCAGAGTGCGCGAGTTGGGCTTAGCCAATTAGGTCTCAAC AATCTATTGGGCCGTAAAATTCATGGGCCCTGGTTTGTCTAGGCCCAAT ATCCCGTTCATTTCAGCCCACAAATATTTCCCCAGAGGATTATTAAGGC CCACACGCAGCTTATAGCAGATCAAGTACGATGTTTCCTGATCGTTGGA TCGGAAACGTACGGTCTTGATCAGGCATGCCGACTTCGTCAAAGAGAGG CGGCATGACCTGACGCGGAGTTGGTTCCGGGCACCGTCTGGATGGTCGT ACCGGGACCGGACACGTGTCGCGCCTCCAACTACATGGACACGTGTGGT GCTGCCATTGGGCCGTACGCGTGGCGGTGACCGCACCGGATGCTGCCTC GCACCGCCTTGCCCACGCTTTATATAGAGAGGTTTTCTCTCCATTAATC GCATAGCGAGTCGAATCGACCGAAGGGGAGGGGGAGCGAGAGCTTTGCG TTCTCTAATCGCCTCGTCAAGCCTAGGTGTGTGTCCGGAGTCAAGGTAA CTAATCAATCACCTCGTCCTAATCCTCGAATCTCTCGTGGTGCCCGTCT AATCTCGCGATTTTGATGCTCGTGGTGGAAAGCGTAGGAGGATCCCGTG CGAGTTAGTCTCAATCTCTCAGGGTTTCGTGCGATTTTAGGGTGATCCA CCTCTTAATCGAGTTACGGTTTCGTGCGATTTTAGGGTAATCCTCTTAA TCTCTCATTGATTTAGGGTTTCGTGAGAATCGAGGTAGGGATCTGTGTT ATTTATATCGATCTAATAGATGGATTGGTTTTGAGATTGTTCTGTCAGA TGGGGATTGTTTCGATATATTACCCTAATGATGTGTCAGATGGGGATTG TTTCGATATATTACCCTAATGATGTGTCAGATGGGGATTGTTTCGATAT ATTACCCTAATGATGGATAATAAGAGTAGTTCACAGTTATGTTTTGATC CTGCCACATAGTTTGAGTTTTGTGATCAGATTTAGTTTTACTTATTTGT GCTTAGTTCGGATGGGATTGTTCTGATATTGTTCCAATAGATGAATAGC TCGTTAGGTTAAAATCTTTAGGTTGAGTTAGGCGACACATAGTTTATTT CCTCTGGATTTGGATTGGAATTGTGTTCTTAGTTTTTTTCCCCTGGATT TGGATTGGAATTGTGTGGAGCTGGGTTAGAGAATTACATCTGTATCGTG TACACCTACTTGAACTGTAGAGCTTGGGTTCTAAGGTCAATTTAATCTG TATTGTATCTGGCTCTTTGCCTAGTTGAACTGTAGTGCTGATGTTGTAC TGTGTTTTTTTACCCGTTTTATTTGCTTTACTCGTGCAAATCAAATCTG TCAGATGCTAGAACTAGGTGGCTTTATTCTGTGTTCTTACATAGATCTG TTGTCCTGTAGTTACTTATGTCAGTTTTGTTATTATCTGAAGATATTTT TGGTTGTTGCTTGTTGATGTGGTGTGAGCTGTGAGCAGCGCTCTTATGA TTAATGATGCTGTCCAATTGTAGTGTAGTATGATGTGATTGATATGTTC ATCTATTTTGAGCTGACAGTACCGATATCGTAGGATCTGGTGCCAACTT ATTCTCCAGCTGCTTTTTTTTACCTATGTTAATTCCAATCCTTTCTTGC CTCTTCCAGGGATCCACCGGTCCGATCGAGCTTACTGAAAAAATTAACA TCTCTTGCTAAGCTGGGAGCGCTAGCTCCCCGAATTTCCCCGATCGTTC AAACATTTGGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCCGGTCTT GCGATGATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAATAA TTAACATGTAATGCATGACGTTATTTATGAGATGGGTTTTTATGATTAG AGTCCCGCAATTATACATTTAATACGCGATAGAAAACAAAATATAGCGC GCAAACTAGGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAGATC GGGAATTGGCGAGCTCGCCCGGGCGGGCGAAGCTTGGCGTAATCATGGT CATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAA CATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTG AGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGG GAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAG AGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCG CTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGG CGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACAT GTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTG CTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATC GACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCA GGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTG CCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGC TTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCG CTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGC GCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACT TATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTA TGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTAC ACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCT TCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGG TAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAA GGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGT GGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAG GATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATC TAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCA GTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGC CTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCT GGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAG ATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGG TCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAA GCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCA TTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATT CAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTG TGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTA AGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTC TCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTAC TCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTT GCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAA AGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATC TTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACT GATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAAC AGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGT TGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGG GTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAA ACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTC TAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCA CGAGGCCCTTTCGTC
(153) Embodiments herein provide intermediate RNAi vectors that replicate to high copy in E. coli, have low complexity, and several convenient restriction sites. pAL409 has these characteristics. Vectors with such characteristics would be useful for assembling RNAi expression cassettes that can then be transferred to an Agrobacterium transformation vector.
(154) Transformation Vectors
(155) An exemplary transformation vector, pAG2004 is illustrated in
(156) Plant transformation vectors were assembled by inserting the expression cassettes or constructs described herein between the Agrobacterium T-DNA right border and left border sequences of a suitable plasmid such as pAG2005 or pAG4003 described below. Following are descriptions of several pAG2005, pAG4003 and derivative recombinant plasmid vectors that can be used to generate transgenic plants via Agrobacterium-mediated transformation:
(157) pAG2005 (SEQ ID NO: 75) is a plasmid that carries a spectinomycin resistance marker, a bacterial origin of replication, an Agrobacterium T-DNA right border (RB), and an Agrobacterium T-DNA left border (LB). Between the RB and LB is a multicloning site (MCS) and a plant selectable marker comprised of a rice Ubi3 promoter (OsUbi3P), the phosphomannose isomerase coding sequence, and NOS terminator; this plasmid also carries an added rice Ubi3 promoter (OsUbi3P) and NOS terminator in the MCS, between which additional coding sequences, microRNA, or hpRNA transcription units may be added.
(158) pAG2107 is pAG2005 with an OsGWD amiRNA1wmd3 microRNA (SEQ ID NO: 38) between the rice Ubi3 promoter (OsUbi3P) and NOS terminator.
(159) pAG2108 is pAG2005 with an OsGWD osa-MIR809aM1 microRNA (SEQ ID NO: 39) between the rice Ubi3 promoter (OsUbi3P) and NOS terminator.
(160) pAG2109 is pAG2005 with an OsDSP1 hpRNA silencing cassette-1 (SEQ ID NO: 40) between the rice Ubi3 promoter (OsUbi3P) and NOS terminator.
(161) pAG2110 is pAG2005 with an OsGWD1 hpRNA silencing cassette-1 (SEQ ID NO: 42) between the rice Ubi3 promoter (OsUbi3P) and NOS terminator.
(162) pAG2111 is pAG2005 with an OsGWD2 hpRNA silencing cassette-2 (SEQ ID NO: 43) between the rice Ubi3 promoter (OsUbi3P) and NOS terminator.
(163) pAG2112 is pAG2005 with an OsPWD2 hpRNA silencing cassette-2 (SEQ ID NO: 45) between the rice Ubi3 promoter (OsUbi3P) and NOS terminator.
(164) pAG2113 is pAG2005 with an OsDSP2 hpRNA silencing cassette-2 (SEQ ID NO: 41) between the rice Ubi3 promoter (OsUbi3P) and NOS terminator.
(165) pAG2114 is pAG2005 with an OsPWD1 hpRNA silencing cassette-1 (SEQ ID NO: 44) between the rice Ubi3 promoter (OsUbi3P) and NOS terminator.
(166) pAG2115 is pAG2111 with an additional OsDSP1 hpRNA silencing cassette-1; the tandem cassettes for expressing OsGWD2 and OsDSP1 hpRNAs is provided as SEQ ID NO: 72.
(167) pAG2116 is pAG2110 with an additional OsPWD2 hpRNA silencing cassette-2; the tandem cassettes for expressing OsGWD1 and OsPWD2 hpRNAs is provided as SEQ ID NO: 73.
(168) pAG2117 is pAG2114 with an additional OsDSP2 hpRNA silencing cassette-2; the tandem cassettes for expressing OsPWD1 and OsDSP2 hpRNAs is provided as SEQ ID NO: 74;
(169) pAG4003 (SEQ ID NO: 76) is a plasmid that carries a spectinomycin resistance marker, a bacterial origin of replication, an Agrobacterium T-DNA right border (RB), and an Agrobacterium T-DNA left border (LB). Between the RB and LB is a multicloning site (MCS) and a plant selectable marker comprised of a maize ubiquitin promoter (ZmUbi1P), the phosphomannose isomerase coding sequence, and NOS terminator.
(170) pAG4101 is pAG4003 with an SbGWD RNAi between the rice Ubi3 promoter (OsUbi3P) and NOS terminator (SEQ ID NO: 67).
(171) pAG4102 is pAG4003 with an ZmGWD1 RNAi between the maize ZmPepC promoter (ZmPepCP) and NOS terminator (SEQ ID NO: 70).
(172) pAG4103 is pAG4003 with an ZmGWD2 RNAi between the maize ZmPepC promoter and NOS terminator (SEQ ID NO: 71).
(173) pAG4104 is pAG4003 with an SbGWD1 RNAi between the maize ZmPepC promoter and NOS terminator (SEQ ID NO: 68).
(174) pAG4105 is pAG4003 with an SbGWD2 RNAi between the maize ZmPepC promoter and NOS terminator (SEQ ID NO: 69).
(175) Sequences of the target proteins, genes, elements of the expression cassettes and vectors used herein are listed in Table 1.
(176) TABLE-US-00006 TABLE 1 Description of Sequences. SEQ ID NO Sequence Name Sequence Description 1 SbGWD protein amino acid sequence of hypothetical protein from Genbank accession XM_002438374 2 ZmGWD protein amino acid sequence of GWD target protein from maize 3 OsGWD protein amino acid sequence of GWD target protein from rice 4 StGWD protein amino acid sequence of GWD target protein from potato 5 SbGWD coding nucleotide sequence encoding hypothetical Sequence protein from Genbank accession XM_002438374 6 ZmGWD coding nucleotide sequence (cDNA) encoding sequence unannotated protein from maizegdb.org accession GRMZM2G412611 7 OsGWD coding nucleotide sequence encoding GWD target sequence protein from rice 8 SbGWD gene genomic sequence containing unannotated GWD-like sequence at plantgdb.org/SbGDB (37970015-37957888) 9 ZmGWD gene genomic sequence containing unannotated GWD-like sequence GRMZM2G412611 at maizegdb.org (110695290-110706982) 10 SbGWD gene genomic sequence upstream of GWD-like 5UTR and promoter sequence at plantgdb.org/SbGDB (37973102-37970016) 11 ZmGWD gene genomic sequence upstream of GWD-like 5UTR and promoter sequence GRMZM2G412611 at maizegdb.org (110692290-110695289) 12 SbGWD gene genomic sequence downstream of GWD- 3UTR like sequence at plantgdb.org/SbGDB (37957887-37956500) 13 ZmGWD gene genomic sequence downstream of GWD- 3UTR like sequence GRMZM2G412611 at maizegdb.org (110706983-110708982) 14 SbPWD protein amino acid sequence of possible PWD target protein from sorghum 15 ZmPWD protein partial amino acid sequence of possible PWD target protein from maize 16 AtPWD protein amino acid sequence of functionally characterized PWD protein from Arabidopsis (Genebank accession AY747068) 17 SbPWD coding nucleotide sequence (cDNA) encoding sequence hypothetical protein from Genbank accession XM_002453614 18 ZmPWD coding nucleotide sequence (cDNA) encoding sequence PWD-like protein compiled from genomic sequence on maize Chromosome 10 (20383422-20409795) 19 OsPWD coding nucleotide sequence (cDNA) encoding sequence PWD-like protein compiled from genomic sequence from rice 20 SbPWD gene SbPWD gene genomic sequence containing unannotated PWD-like sequence at plantgdb.org/SbGDB (13092874-13068592) 21 ZmPWD gene ZmPWD gene genomic sequence containing unannotated partial PWD-like sequence at maizegdb.org (20383406-20409795) 22 SbPWD gene SbPWD gene 5UTR and promoter genomic 5UTR and promoter sequence upstream of PWD-like sequence at plantgdb.org/SbGDB (13095874-13092875) 23 SbPWD gene SbPWD gene 3UTR genomic sequence 3UTR downstream of PWD-like sequence at plantgdb.org/SbGDB (13068591-13066592) 24 ZmPWD gene ZmPWD gene 3UTR genomic sequence 3UTR downstream of PWD-like sequence at maizegdb.org (20409796-20411795) 25 ZmDSP protein ZmDSP putative protein amino acid sequence of target protein from maize 26 SbDSP protein SbDSP putative protein amino acid sequence of target protein from sorghum 27 AtDSP protein AtDSP protein amino acid sequence of functionally characterized DSP protein from Arabidopsis (Genebank accession Q9FEB5) 28 OsDSP protein OsDSP putative protein tyrosine phosphatase from rice, Genebank accession ABF93554.1 29 ZmDSP coding nucleotide sequence (cDNA) encoding sequence uncharacterized protein LOC100216768 from Genbank accession NM_001143167 30 SbDSP coding nucleotide sequence (cDNA) encoding sequence unannotated DSP-like sequence at plantgdb.org/SbGDB (73074312-73079318) 31 OsDSP coding nucleotide sequence (cDNA) encoding DSP- sequence like protein compiled from genomic sequence from rice 32 ZmDSP gene ZmDSP gene genomic sequence at maizegdb.org (2544510-2538556) containing uncharacterized protein LOC100216768 from Genbank accession NM_001143167 33 SbDSP gene SbDSP gene genomic sequence containing unannotated DSP-like sequence at plantgdb.org/SbGDB (73074312-73079318) 34 ZmDSP gene ZmDSP gene 5UTR and promoter genomic 5UTR and promoter sequence at maizegdb.org (2538555- 2536556) upstream of uncharacterized protein LOC100216768 from Genbank accession NM_001143167 35 SbDSP gene SbDSP gene 5UTR and promoter genomic 5UTR and promoter sequence upstream of unannotated DSP-like sequence at plantgdb.org/SbGDB (73071312-73074311) 36 ZmDSP gene ZmDSP gene 3UTR genomic sequence at 3UTR maizegdb.org (2538555-2536556) downstream of uncharacterized protein LOC100216768 from Genbank accession NM_001143167 37 SbDSP gene SbDSP gene 3UTR genomic sequence 3UTR downstream of unannotated DSP-like sequence at plantgdb.org/SbGDB (73079319-73081318) 38 OsGWD DNA sequence encoding the OsGWD amiRNA1wmd3 amiRNA1wmd3 micro RNA 39 OsGWD osa- DNA sequence encoding the OsGWD osa- MIR809aM1 MIR809aM1 micro RNA 40 OsDSP1 OsDSP1 hairpin RNA driver sequence 41 OsDSP2 OsDSP2 hairpin RNA driver sequence 42 OsGWD1 OsGWD1 hairpin RNA driver sequence 43 OsGWD2 OsGWD2 hairpin RNA driver sequence 44 OsPWD1 OsPWD1 hairpin RNA driver sequence 45 OsPWD2 OsPWD2 hairpin RNA driver sequence 46 SbGWD SbGWD hairpin RNA driver sequence 47 SbGWD1 SbGWD1 hairpin RNA driver sequence 48 SbGWD2 SbGWD2 hairpin RNA driver sequence 49 ZmGWD1 ZmGWD1 hairpin RNA driver sequence 50 ZmGWD2 ZmGWD2 hairpin RNA driver sequence 51 ZmAdh1i6 DNA sequence corresponding to the ZmAdh1i6 intron 52 OsAdh1i DNA sequence corresponding to the OsAdh1 intron 53 SbAdh1-2i DNA sequence corresponding to the SbAdh1-2i intron 54 SbGWDi DNA sequence corresponding to the SbGWDi intron 55 ZmUbi1P maize Ubi1 promoter with intron 56 ZmPepCP ZmPepC promoter 57 OsUbi3P rice Ubi3 promoter with intron 58 NOS terminator NOS transcriptional terminator and polyadenylation signal 59 OsUbi3P:OsGWD OsUbi3P:OsGWD amiRNA1wmd3 amiRNA1wmd3 expression cassette for one micro RNA 60 OsUbi3P:OsGWD OsUbi3P:OsGWD osa-MIR809aM1 osa-MIR809aM1 expression cassette for one micro RNA 61 OsUbi3P:OsDSP1 OsUbi3P:OsDSP1 hpRNA expression hpRNA cassette for one hairpin RNA 62 OsUbi3P:OsDSP2 OsUbi3P:OsDSP2 hpRNA expression hpRNA cassette for one hairpin RNA 63 OsUbi3P:OsGWD1 OsUbi3P:OsGWD1 hpRNA expression hpRNA cassette for one hairpin RNA 64 OsUbi3P:OsGWD2 OsUbi3P:OsGWD2 hpRNA expression hpRNA cassette for one hairpin RNA 65 OsUbi3P:OsPWD1 OsUbi3P:OsPWD1 hpRNA expression hpRNA cassette for one hairpin RNA 66 OsUbi3P:OsPWD2 OsUbi3P:OsPWD2 hpRNA expression hpRNA cassette for one hairpin RNA 67 OsUbi3P:SbGWD OsUbi3P:SbGWD RNAi expression cassette RNAi for one hairpin RNA 68 ZmPepCP:SbGWD1 ZmPepCP:SbGWD1 RNAi expression RNAi cassette for one hairpin RNA 69 ZmPepCP:SbGWD2 ZmPepCP:SbGWD2 RNAi expression RNAi cassette for one hairpin RNA 70 ZmPepCP:ZmGWD1 ZmPepCP:ZmGWD1 RNAi expression RNAi cassette for one hairpin RNA 71 ZmPepCP:ZmGWD2 ZmPepCP:ZmGWD2 RNAi expression RNAi cassette for one hairpin RNA 72 OsDSP1 and OsDSP1 and OsGWD2 construct containing OsGWD2 two hpRNA expression cassettes 73 OsPWD2 and OsPWD2 and OsGWD1 construct OsGWD1 containing two hpRNA expression cassettes 74 OsDSP2 and OsDSP2 and OsPWD1 construct containing OsPWD1 two hpRNA expression cassettes 75 pAG2005 pAG2005 transformation vector with phsophomannose isomerase plant selectable marker 76 pAG4003 pAG4003 transformation vector with phsophomannose isomerase plant selectable marker 77 O43097 protein O43097 enzyme amino acid sequence 78 BD22308 protein BD22308 enzyme amino acid sequence 79 BD25243 protein BD25243 enzyme amino acid sequence 80 EU591743 protein EU591743 enzyme amino acid sequence 81 NtEGm protein NtEGm enzyme amino acid sequence 82 P0C2S1 protein P0C2S1 enzyme amino acid sequence 83 P77853 protein P77853 enzyme amino acid sequence 84 O68438 protein O68438 enzyme amino acid sequence 85 O33897 protein O33897 enzyme amino acid sequence 86 O43097 coding O43097 enzyme coding sequence sequence 87 BD22308 coding BD22308 enzyme coding sequence sequence 88 BD25243 coding BD25243 enzyme coding sequence sequence 89 EU591743 coding EU591743 enzyme coding sequence sequence 90 NtEGm coding NtEGm enzyme coding sequence sequence 91 P0C2S1 coding P0C2S1 enzyme coding sequence sequence 92 P77853 coding P77853 enzyme coding sequence sequence 93 O68438 coding O68438 enzyme coding sequence sequence 94 O33897 coding O33897 enzyme coding sequence sequence 95 AS146-7 intein AS146-7 intein amino acid sequence polypeptide 96 S158-30-108-35 S158-30-108-35 intein amino acid sequence intein polypeptide 97 T134-100-101 T134-100-101 intein amino acid sequence intein polypeptide 98 AS146-7 coding AS146-7 DNA sequence encoding an intein sequence 99 S158-30-108-35 S158-30-108-35 DNA sequence encoding coding sequence an intein 100 T134-100-101 T134-100-101 DNA sequence encoding an coding sequence intein 101 EU591743:AS146-7 EU591743:AS146-7 intein-modified intein modified enzyme amino acid sequence enzyme 102 P77853:S158-30- P77853:S158-30-108-35 intein-modified 108-35 intein enzyme amino acid sequence modified enzyme 103 P77853:T134-100- P77853:T134-100-101 intein-modified 101 intein enzyme amino acid sequence modified enzyme 104 EU591743:AS146-7 EU591743:AS146-7 intein-modified coding sequence enzyme coding sequence 105 P77853:S158-30-108- P77853:S158-30-108-35 intein-modified 35 coding sequence enzyme coding sequence 106 P77853:T134-100- P77853:T134-100-101 intein-modified 101 coding sequence enzyme coding sequence 107 BAASS signal peptide BAASS protein targeting signal 108 SEKDEL signal SEKDEL protein targeting signal amino peptide acid sequence 109 xHvVSD xHvVSD protein targeting signal amino acid targeting signal sequence 110 ZmUBQm ZmUBQm translational fusion amino acid translational fusion sequence 111 xGZein27ss xGZein27ss protein targeting signal amino acid sequence 112 HvAle signal HvAle protein targeting signal amino acid sequence 113 BAASS coding BAASS DNA sequence encoding a protein sequence targeting signal 114 SEKDEL coding SEKDEL DNA sequence encoding a protein sequence targeting signal 115 xHvVSD coding xHvVSD DNA sequence encoding a protein sequence targeting signal 116 ZmUBQm coding ZmUBQm DNA sequence encoding a sequence translational fusion 117 xGZein27ss xGZein27ss DNA sequence encoding a coding sequence protein targeting signal 118 HvAle coding HvAle DNA sequence encoding a protein sequence targeting signal 119 ZmUbi1P:xGZein27ss: ZmUbi1P:xGZein27ss:BD22308:xHvVSD BD22308:xHvVSD expression for cassette for one enzyme 120 ZmPepCP:xGZein27ss: ZmPepCP:xGZein27ss:BD25243:SEKDEL BD25243:SEKDEL expression cassette for one enzyme 121 OsUbi3P:EU591743 OsUbi3P:EU591743 expression cassette for one enzyme 122 ZmUbi1P:EU591743: ZmUbi1P:BAASS:EU591743:AS146- AS146-7:SEKDEL 7:SEKDEL expression cassette for one intein-modified enzyme 123 ZmUbi1P:HvAle: ZmUbi1P:HvAle:NtEGm:SEKDEL NtEGm:SEKDEL expression cassette for one enzyme 124 ZmPepCP:HvAle: ZmPepCP:HvAle:NtEGm:SEKDEL NtEGm:SEKDEL expression cassette for one enzyme 125 OsUbi3P:HvAle: OsUbi3P:HvAle:NtEGm:SEKDEL NtEGm:SEKDEL expression cassette for one enzyme 126 OsUbi3P:BAASS: OsUbi3P:BAASS:O33897 expression O33897 cassette for one enzyme 127 ZmPepCP:BAASS: ZmPepCP:BAASS:O43097:SEKDEL O43097:SEKDEL expression cassette for one enzyme 128 OsUbi3P:O68438 OsUbi3P:O68438 expression cassette for one enzyme 129 OsUbi3P:P0C2S1 OsUbi3P:P0C2S1 expression for cassette for one enzyme 130 ZmUbi1P: ZmUbi1P:ZmUBQm:BAASS:P77853: ZmUBQm:BAASS: S158-30-108-35 expression cassette for one P77853:S158-30- intein-modified enzyme 108-35 131 ZmUbi1P:BAASS: ZmUbi1P:BAASS:P77853:T134-100- P77853:T134-100- 101:SEKDEL expression cassette for one 101:SEKDEL intein-modified enzyme 132 2379 2379 construct containing expression cassettes for three CWDEs and one hpRNA 133 2380 2380 construct containing expression cassettes for three CWDEs and one hpRNA 134 4106 4106 construct containing expression cassettes for three CWDEs and one hpRNA 135 4107 4107 construct containing expression cassettes for three CWDEs and one hpRNA 136 4108 4108 construct containing expression cassettes for three CWDEs and one hpRNA 137 4109 4109 construct containing expression cassettes for three CWDEs and one hpRNA 138 4110 4110 construct containing expression cassettes for three CWDEs and one hpRNA 139 4111 4111 construct containing expression cassettes for three CWDEs and one hpRNA 140 4112 4112 construct containing expression cassettes for three CWDEs and one hpRNA 141 4113 4113 construct containing expression cassettes for three CWDEs and one hpRNA 142 4114 4114 construct containing expression cassettes for three CWDEs and one hpRNA 143 4115 4115 construct containing expression cassettes for three CWDEs and one hpRNA 144 4116 4116 construct containing expression cassettes for three CWDEs and one hpRNA 145 4117 4117 construct containing expression cassettes for three CWDEs and one hpRNA 146 4120 4120 construct containing expression cassettes for three CWDEs and one hpRNA 147 4121 4121 construct containing expression cassettes for three CWDEs and one hpRNA 148 4124 4124 construct containing expression cassettes for two CWDEs and one hpRNA 149 4125 4125 construct containing expression cassettes for two CWDEs and one hpRNA 150 4514 4514 construct containing expression cassettes for three CWDEs and one hpRNA 151 4515 4515 construct containing expression cassettes for three CWDEs and one hpRNA 152 Synthetic construct, Nucleic acid of ZmGWD1 intron-splicing ZmGWD1 intron- site splicing site 153 Synthetic construct, ob1659, nucleic acid of forward primer GWD forward primer 154 Synthetic construct, ob1660, nucleic acid of reverse primer GWD reverse primer 155 Synthetic construct, ob1555, nucleic acid of forward primer beta-actin forward primer 156 Synthetic construct, ob1556, nucleic acid of reverse primer beta-actin reverse primer 157 Synthetic construct, ob1567, nucleic acid of forward primer GADPH forward primer 158 Synthetic construct, ob1568, nucleic acid of reverse primer GADHP reverse primer 159 osa-MIR528 RNA sequence of osa-MIR528 included in FIG. 1 160 Amylase 19862 Amylase 19862 amino acid sequence protein 161 Glucoamylase Glucoamylase 20082 amino acid sequence 20082 protein 162 Glucoamylase Glucoamylase 20707 amino acid sequence 20707 protein 163 Amylase 21853 Amylase 21853 amino acid sequence protein 164 Amylase 19862 Amylase 19862 coding sequence coding sequence 165 Glucoamylase 20082 Glucoamylase 20082 coding sequence coding sequence 166 Glucoamylase 20707 Glucoamylase 20707 coding sequence coding sequence 167 Amylase 21853 Amylase 21853 coding sequence coding sequence 168 AmyS coding sequence AmyS amylase coding sequence 169 AmyS protein AmyS amylase amino acid sequence 170 GlaA coding sequence GlaA coding sequence 171 GlaA protein GlaA amino acid sequence 172 An inverted A sequence complimentary to OsDSP1 complement of (SEQ ID NO: 40) with a reverse order of OsDSP1 nucleotides 173 GWD gene genomic sequence GWD gene Os06g30310 Os06g30310 174 DSP gene genomic sequence DSP gene Os03g01750 Os03g01750 175 ISA3 gene genomic sequence ISA3 gene Os09g29404 Os09g29404 176 GWD coding nucleotide sequence encoding GWD target sequence -1 protein from rice [similar to SEQ ID NO: 7 plus 147 additional nucleotides at the 5 end and 12 additional nucleotides at the 3 end] 177 DSP coding DSP coding sequence from rice [similar to sequence -1 SEQ ID NO: 31 plus 18 additional nucleotides at 5 end and 12 additional nucleotides at the 3 end] 178 ISA3 coding sequence ISA3 coding sequence 179 Synthetic construct, GWD1 driver sequence GWD1 driver sequence 180 Synthetic construct, GWD2 driver sequence construct, GWD2 driver sequence 181 Portion of GWD2 Portion of GWD2 in FIG. 4 in FIG. 4 182 Portion of S. lyco. Portion of S. lyco. GWD gene in FIG. 4 GWD gene in FIG. 4 183 DSP1 driver sequence DSP1 driver sequence 184 ISA3 driver sequence ISA3 driver sequence 185 Synthetic pAL409 construct, pAL409 186 Synthetic construct, pAG2004 pAG2004 187 Synthetic construct, pAG2100 pAG2100 188 Synthetic construct, pAG2101 pAG2101 189 Synthetic construct, pAG2102 pAG2102 190 Synthetic construct, pAG2103 pAG2103 191 Sorghum bicolor Putative Sorghum bicolor GWD gene GWD gene-1 192 Portion of Portion of putative Sorghum bicolor GWD putative sorghum gene corresponding to GWD2 region of rice bicolor GWD gene gene 193 Synthetic construct, sbGWDk02a sbGWDk02a 194 Synthetic construct, sbGWDk2b sbGWDk2b 195 Synthetic construct, pAG2106 pAG2106 196 OsGWD excerpt 1 OsGWD excerpt 1 197 sbGWD excerpt 1 sbGWD excerpt 1 198 ZmGWD excerpt ZmGWD excerpt 1 1, n, is any nucleotide 199 SlGWD excerpt 1 SlGWD excerpt 1 200 OsGWD excerpt 2 OsGWD excerpt 2 201 SbGWD excerpt 2 SbGWD excerpt 2 202 ZmGWD excerpt 2 ZmGWD excerpt 2 203 SlGWD excerpt 2 SlGWD excerpt 2 204 Synthetic construct, dgGWDup2 (PCR Primer) dgGWDup2 (PCR Primer) 205 Synthetic construct, dgGWD down 2 (PCR Primer) dgGWD down 2 (PCR Primer) 206 PvGWD-2 PvGWD-2 207 PvGWD-5 PvGWD-5 208 PvGWD-1 PvGWD-1 209 Synthetic construct, PvGWDko2 RNAi driver sequence PvGWDko2 RNAi driver sequence 210 Synthetic construct, pAG2104 pAG2104 211 PvGWDi-1 PvGWDi-1 212 PvGWDi-2 PvGWDi-2 213 PvGWDi-3 PvGWDi-3 214 PvGWDi-4 PvGWDi-4 215 Switchgrass Switchgrass amalgamated sequence amalgamated sequence; n, is any nucleotide 216 Synthetic sequence, Synthetic sequence, sbGWDko2a minus sbGWDko2a flanking sequence minus flanking sequence 217 Synthetic sequence, Synthetic sequence, antisense sbGWDko2b antisense sbGWDko2b 218 Synthetic sequence, Synthetic sequence, pAL409jSbGWDko2 pAL409jSbGWDko2 219 Synthetic 4206 construct containing expression construct, 4206 cassettes for three CWDEs
Example 4
Rice RNAi Constructs
(177) Three exemplary genes to target for RNA interference in rice are GWD, DSP, and ISA3. SEQ ID NOS: 173 [GWD gene os06g30310]-175 [ISA3 geneos09g29404] list the sequences for the rice GWD, DSP and ISA3 genes, respectively. SEQ ID NOS: 176 [GWD coding sequence]-178 [ISA3 coding sequence] list the predicted coding sequences for the GWD, DSP and ISA3 genes, respectively. The GWD, DSP, and ISA3 gene sequences are from the RiceGE database: accession Nos. Os06g30310 (GWD); Os03g01750 (DSP); and Os09g29404 (ISA3).
(178) Based on the coding sequences in SEQ ID NOS: 176 [GWD coding sequence]-178 [ISA3 coding sequence], artificial cDNAs were synthesized and provided a resource for expressing the corresponding proteins in heterologous systems (e.g., E. coli or yeasts), which in turn would make it possible to raise antibodies for use in analyzing the planned transgenic plants.
(179) Plasmid DNAs carrying the entire coding sequences of SEQ ID NOS: SEQ ID NOS: 176 [GWD coding sequence]-178 [ISA3 coding sequence] were used as templates in PCR reactions to prepare driver sequences to be used in the RNAi constructs. For the GWD gene, two separate driver sequences were prepared.
(180) TABLE-US-00007 >GWD1driversequence(onecopy) [SEQIDNO:179] TAGCGCTAAGGAAGGGAGAGATATCCATCCGGATCCCGGAAGCCGAATCC ATCCATCCATCCATCCCATACTGCCCTTACGATCGAGCTGTTTGATATTC GTGCAGATGAGCGGATTCTCCGCGGCAGCTGCTGCGGCCGAGCGCTTGTC GGAAGGTTCACCCTGGATGCCAACTCCGAGCTTAAGGTGACATTGAACCC AGCACCGCAGGGTTCGGTGGTGGAGATCAATCTAGAGGCAACTAACACCA GCGGCTCCCTGATACTGCATTGGGGCGCCCTTCGCCCGGATAGAGGAGAA TGGCTCCTACCAT >GWD2driversequence(onecopy) [SEQIDNO:180] AGCAGATCTAGTTGACCAAGCAAGAGATAATGGATTATTGGGTATTATTG GAATTTTTGTTTGGATTAGGTTCATGGCTACAAGGCAACTAATATGGAAC AAGAACTACAATGTGAAGCCACGTGAGATAAGCAAAGCACAAGATAGGTT TACAGATGATCTTGAGAATATGTACAGAACTTACCCACAATATCAGGAGA TCTTAAGAATGATAATGTCTGCTGTTGGTCGGGGAGGTGAAGGTGATGTT GGTCAACGCATTCGTGATGAGATATTAGTAATCCAGAGAAATAATGACTG CAAAGGTGGAATGATGGAGGAGTGGCACCAGAAACTGCACAACAATACAA GCCCAGATGATGTAGTGATCTGCCAGGCCCTACTTGATTATATCAAGAGT GATTTTGATATTGGTGTTTACTGGGACACCTTGAAAAAAGATGGTATAAC AAAAGAGCGTCTATTGAGCTATGATCGACCGATTCATTCAGAGCCAAATT TCAGGAGTGAACAGAAAGATGGCTTACTCCGTGACTTGGGCAATTATATG AGAAGCCTCAAGATGGAGGGTACCC
(181) GWD1 is derived from a region near the 5 end of the GWD coding sequence. The second GWD driver sequence, GWD2 is derived from a region closer to the middle of the GWD coding sequence, which corresponds to a region of relatively higher sequence conservation among GWD genes from divergent species. See
(182) Portions of the DSP and ISA3 genes from rice were also selected to serve as driver sequences.
(183) TABLE-US-00008 >DSP1driversequence [SEQIDNO:183] CTCCAATCGTGGGATCCAGGTCCATGAGGCGGCCCTCGCCGCTCAATCTG ACGATGGTTCGTGGCGGGAGTCGCCGATCAAACACTGTCAAAACCGCATC CGGGGCGTCTACTTCTAGCGCCGAGAGTGGCGCAGTGGAGGCGGGCACGG AGAAATCCGATACGTACAGCACCAACATGACGCAAGCTATGGGAGCAGTG TTGACGTATAGACATGAGCTTGGAATGAACTACAATTTCATACGCCCAGA CTTGATCGTGGGCTCCTGCTTACAGAGCCCACTTGATGTTGATAAACTTA GGGACATTGGTGTAAAAACAGTATTCTGCCTGCAGCAAGATCCAGACCTT GAATATTTTGGAGTTGACATCTGTGCCATT >ISA3driversequence [SEQIDNO:184] CTAGCGAATACACTGAACTGCAACCATCCTGTTGTCAAGGAGCTCATTCT TGACAGCTTGAGACACTGGGTTGAGGAGTATCACATAGATGGATTTCGAT TTGACCTTGCAAGTGTTCTTTGTCGTGGACCAGATGGTTGTCCTCTTGAT GCACCTCCACTCATCAAGGAAATTGCCAAAGATGCTGTATTATCTAGATG TAAGATCATTGCTGAACCTTGGGATTGCGGCGGCCTTTATCTCGTAGGGC GTTTCCCTAACTGGGACAGGTGGGCTGAATGGAACGGCAAATACAGAGAT GATCTTCGAAGATTTATTAAGGGTGACCCTGGTATGAAGGGGGTGTTTGC GACTCGTGTGTCTGGATCTGCTGATCTCTATCAGGTGAACGAGCGGAAGC CTTACCATGGTGTAAATTTTGTGATTGCACATGATGGATTTACTTTATGT GACCTTGTTTCTTACAACTTAAAGCACAATGATGCTAATGGAGAAGGTGG CTGTGATGGATC
(184) GWD1, GWD2, DSP1 and ISA3 driver sequences were each amplified by PCR such that each was flanked with restriction enzyme recognition sites (e.g., NheI and XmaI). The fragments were first ligated into pCRBlunt II TOPO (Invitrogen), confirmed via multiple restriction enzyme digests and sequencing, then excised (using restriction enzymes that cleave the introduced flanking sites) and ligated first into the BspEI and AvrII sites and then the NheI and AgeI sites of pAL409 (
(185) Still referring to
Example 5
Sorghum RNAi Construct
(186) A draft of the genomic sequence corresponding to the putative GWD gene from Sorghum bicolor [SEQ ID NO: 191] was obtained through the Joint Genome Institute (JGI) Sorghum bicolor Home Page (http://genome.jgi-psf.org/Sorbi1/Sorbi1.home.html). From this sequence, a region corresponding roughly to the GWD2 region of the rice gene [SEQ ID NO: 192] was identified. In sorghum, the coding sequences in this region are interrupted by one or more introns, as identified by JGI, and the introns are at approximately nucleotides 140-342, nucleotides 507-628 and nucleotides 723-795 in [SEQ ID NO: 192]. A native intron derived from the sorghum genome was utilized in assembling an RNAi cassette for knocking down the GWD gene from sorghum. A portion of the sorghum GWD gene was amplified. The portion amplified included one full exon (based on the JGI prediction) in the highly conserved middle region (described earlier, see
(187) TABLE-US-00009 >SbGWDko2a(withflankingrestrictionsites) [SEQIDNO:193] GGTTCAATAACCCGGGAGTGAGATAAGCAAAGCACAAGATAGGTTTACAG ATGATCTTGAGAATATGTACAGAACTTATCCTCAGTACAGAGAGATACTA AGAATGATAATGGCTGCTGTTGGTCGTGGAGGTGAAGGTGACGTTGGTCA ACGCATTCGTGATGAGATATTAGTAATACAGGTAAAACTGATGGTCCTTG GTGAATATACAGTTATTTTCGTTCATTGCTCTGCTGAATTGAGCAGTTGG TAGTGCTCATCCAAAACGTAGACATTGTCAACAATAAAATGTTTGGTGTG TTACAGAGAAATACCGGTGCAAAGCTAGCATGATGGAAGAATGG
(188) A second PCR product (SbGWDko2b), corresponding to only the first exon mentioned above, was also amplified by PCR with flanking NheI and XmaI sites introduced at the 5 and 3 ends (relative to the direction of transcription), and ligated into pCRBluntII TOPO. The composition of this fragment was also confirmed via multiple restriction enzyme digests and sequencing.
(189) TABLE-US-00010 >SbGWDko2b(withflankingrestrictionsites) [SEQIDNO:194] GGTTCAATAAGCTAGCAGTGAGATAAGCAAAGCACAAGATAGGTTTACAG ATGATCTTGAGAATATGTACAGAACTTATCCTCAGTACAGAGAGATACTA AGAATGATAATGGCTGCTGTTGGTCGTGGAGGTGAAGGTGACGTTGGTCA ACGCATTCGTGATGAGATATTAGTAATACAGCCCGGGCTGATGGTCC
(190) Next SbGWDko2b was excised from pCRBlunt II as an NheI-XmaI fragment, and ligated into the NheI and AgeI sites of the plasmid carrying SbGWDko2a, positioning SbGWDko2b downstream of the intron and in the opposite orientation of SbGWDko2a. In this orientation, sequences in the sbGWDko2b portion of the plasmid are presented as an inverted complement of sequences within the sbGWDko2a portion. Referring to
(191) The entire RNAi cassette from pAL409j SbGWDko2 was then excised as a PacI-XmaI fragment and ligated into the Pad and XmaI sites of pAG2004, producing the Agrobacterium transformation vector pAG2106 [SEQ ID NO: 195] in a manner similar to that described in reference to
Example 6
Sequencing of the Switchgrass GWD Gene(s)
(192) Homologues for GWD and ISA3 were detected in the switchgrass genome and the number of homologues that are present for each were estimated using a Southern blotting strategy. Results with the Southern blot using the rice ISA3 probe are shown in
(193) A portion of the switchgrass GWD gene was identified and clones using a degenerate PCR approach. Degenerate PCR employs oligonucleotide primers with one or more ambiguous bases that allow the primers to anneal to template sequences for which only approximate sequence information is available. That is, in regions of strong sequence conservation between genes of widely divergent species, one can infer the range of possible sequences that might be present in the corresponding gene from an under-characterized species such as switchgrass. One can then design degenerate primers that will anneal to the predicted sequences, permitting PCR amplification and cloning of a portion of the gene in question.
(194) Pursuing the degenerate PCR strategy, portions of the GWD genes derived from rice, sorghum, maize, and tomato were aligned. The strongest alignments occurred in the region of the GWD genes that was described in
(195) The same primers were then used in PCR reactions that used switchgrass (ecotype Alamo) genomic DNA as a template. These reactions produced discrete PCR products of approximately 1100 bp. These products were ligated into pCRBluntII TOPO and five of the resulting plasmids were sequenced. From these five sequences, it was determined that: Each cloned PCR product was derived from a gene with very strong homology to the rice GWD gene Among the five sequenced products, there were clearly three classes of (highly homologous) sequences, suggesting that the clones were derived from three different GWD homologues within the switchgrass genome. This observation agrees with the data from Southern blots that suggested multiple GWD genes reside within the switchgrass genome. The main differences in the sizes of the products that arose from degenerate PCR of sorghum and switchgrass can be attributed to differences in the lengths of the putative introns in each of the respective genes.
(196) Referring to
(197) As shown below, an alignment of the sequences from three of the switchgrass-derived degenerate PCR products, demonstrates that relatively few single nucleotide changes and two somewhat lengthier insertions/deletions distinguish these three GWD homologues in this region. These three products are PvGWD-2 [SEQ ID NO: 206], PvGWD-5 [SEQ ID NO: 207] and PvGWD-1 [SEQ ID NO: 208].
(198) TABLE-US-00011 CLUSTAL2.0.10multiplesequencealignment PvGWD-2 TGGAATTCTTGTTTGGATGAGATTCATGGCTACCAGGCAACTAACATGGAATAAGAACTA 60 PvGWD-5 TGGAATTCTTGTTTGGATTAGGTTCATGGCTACCAGGCAACTAACATGGAATAAGAACTA 60 PvGWD-1 TGGAATTTTTGTTTGGATGAGATTCATGGCTACAAGACAACTGACATGGAATAAGAACTA 60 ****************************************************** PvGWD-2 TAATGTGAAGCCCCGGTATATACCTGTCTTTATCATTTACTTCAGTGATGTTTACTCTCT 120 PvGWD-5 TAATGTGAAGCCCCGGTATATACCTGTCTTTATCATTTACTTCAGTGATGTTTACTCTCT 120 PvGWD-1 TAATGTGAAGCCACGGTATATACCTGTCTTTATTATTTACTTCAGTAATGTTTACTCTCT 120 ********************************************************* PvGWD-2 GCTTAAAAATTTAAAGAATCTGAAGCTGTCCTTTTCTTTTGTGCGGGAACATAATTGAGA 180 PvGWD-5 GCTTAAAAATTTAAAGAATCTGAAGCTGTCCTTTTCTTTTGTGCGGGAACATATTTGAGA 180 PvGWD-1 GCTTTAAAAGTTAAAGAATCAGAAGTTGTCCCTTTCTTTTGTGCGGGAACATAATTGAAA 180 ***************************************************** PvGWD-2 AATTGGTGTTTTTGCCACTACTTCATGATGCAATTGTAATTTTTCCCTCATTTTTTTCAA 240 PvGWD-5 AATTGGTGTTTTTGCCACTACTTCATGATGCAATTGTAATTTTTCCCTCATTTTTTTCAA 240 PvGWD-1 AGTTGGTGTTCTTGCCACTAC--------------------------------------- 201 ******************* PvGWD-2 CTTTGTGATTTTGCCCTTTACTATTCACAAGTCAACGCAATTTTGCTCCTGTTTTGACCG 300 PvGWD-5 CTTTGTGATTTTGCCCTTTACTATTCACAAGTCAACGCAATTTTGCTCCTGTTTTGACCG 300 PvGWD-1 ----------------------------AAGTCAACGCGATTTTACCCCT-CGTCAACGG 232 *********************** PvGWD-2 TTGACTGAG-GGAAAAATCGCGTTAACTTGTGAATAGTAAGTGCAAAATTGCAAAGTTGA 359 PvGWD-5 TTGACTGAG-GGAAAAATCGCGTTAACTTGTGAATAGTAAGTGCAAAATTGCAAAGTTGA 359 PvGWD-1 TCAAAACAGTAGCAAAATCGCGTTGACTTGTGAATAGTAAGGGCAAA-TCACAAAGTTGG 291 ********************************************** PvGWD-2 AAAAAACAAGGACAAAATCACAATTGCACTGCAAAGTAGGGGTGGAAACACAAATGCCCC 419 PvGWD-5 AAAAAACAAGGACAAAATCACAATTGCACTGCAAAGTAGGGGTGGAAACACAAATGCCCC 419 PvGWD-1 AAAAAACAAGGACAAAATCACAATTGCACTGCAAAGTAGTCGCGGAAACACAAATGCCCC 351 ********************************************************* PvGWD-2 AAAATAATTTGGCTGTTTGTCCTGATAGAAAACAATACAATTCAGTACTCAGAGAATATT 479 PvGWD-5 AAAATAATTTGGCTGTTTGTCCTGATAGAAAACAATACAATTCAGTACTAAGAGAATATT 479 PvGWD-1 AAAATAATTTGGCTGTTTGTCCTGATAAAAAACAATACAATTCAGTACTCAGAGAATATT 411 ********************************************************** PvGWD-2 ATATTTCTATAAATGAAAAACATAACTCATGTCACATTCTTT--------GGCATCTCAT 531 PvGWD-5 ATATTTCTATAAATGAAAAACATAACTCATGTCACATTCTTT--------GGCATCTCAT 531 PvGWD-1 ATATTTCTATAAATGAAAAACATAACTCATGTCGCATTCTTTCATTCTTTGGCATCTCAT 471 *************************************************** PvGWD-2 ATCGATCAATAACTATGCAGTGAGATAAGCAAAGCACAAGATAGGTTTACAGATGATCTT 591 PvGWD-5 ATCGATCAATAACTATGCAGTGAGATAAGCAAAGCACAAGATAGGTTTACAGATGATCTT 591 PvGWD-1 ATTGATTAATAACTACGCAGTGAGATAAGCAAAGCACAAGATAGGTTTACAGATGATCTT 531 ********************************************************* PvGWD-2 GAGAACATGTACAAAGCTTATCCTCAGTGCAGAGAGATATTAAGAATGATAATGGCTGCT 651 PvGWD-5 GAGAACATGTACAAAGCTTATCCTCAGTGCAGAGAGATATTAAGAATGATAATGGCTGCT 651 PvGWD-1 GAGAACATGTACAAAGCTTATCCTCAGTACAGAGAGATATTAAGAATGATAATGGCTGCT 591 *********************************************************** PvGWD-2 GTTGGTCGTGGAGGTGAAGGTGATGTTGGTCAACGTATTCGTGATGAGATATTAGTAATA 711 PvGWD-5 GTTGGTCGTGGAGGTGAAGGTGATGTTGGTCAACGTATTCGAGATGAGATATTAGTAATA 711 PvGWD-1 GTTGGTCGTGGAGGTGAAGGTGATGTTGGTCAACGTATTCGTGATGAGATATTAGTAATA 651 *********************************************************** PvGWD-2 CAGGTAAAATTAATGGTCCTAGGTGAATATACACTTACTTTTATTCATTGCTTCACCGAA 771 PvGWD-5 CAGGTAAAATTAATGGTCCTAGGTGAATATACACTTACTTTTATTCATTGCTTCACTGAA 771 PvGWD-1 CAGGTAAAATTAATGGTCCTAGGTGAATATACACCTACTTTTATTCATTGCTTCACTGAA 711 ********************************************************** PvGWD-2 TTATACGGTTGGTAGTTCTCATCCAAAAGATAGACATTGTGAATAATAATAAAATGCTTG 831 PvGWD-5 TTATACGGTTGGTAGTTCTCATCCAAAAGATAGACATTGTGAATAATAATAAAATGCTTG 831 PvGWD-1 TTATACGGTTGGTAGTTCTGATCCAAAAGATAGACATTGTGAATAATAATAAAATGCTTG 771 *********************************************************** PvGWD-2 CTGCTTTAATAGAGAAATAATGACTGCAAAGGTGGAATGATGGAAGAATGGCACCAGAAA 891 PvGWD-5 CTGCTTTAATAGAGAAATAATGACTGCAAAGGTGGAATGATGGAAGAATGGCACCAGAAA 891 PvGWD-1 CTGCTTTTATAGAGAAATAATGACTGCAAAGGTGGAATGATGGAAGAATGGCACCAGAAA 831 *********************************************************** PvGWD-2 TTGCACAACAATACAAGCCCAGATGATGTAGTGATATGCCAGGTAATGGATATTTTGAAT 951 PvGWD-5 TTGCACAACAATACAAGCCCAGATGATGTAGTGATATGCCAGGTAATGGATATTTTGAAT 951 PvGWD-1 TTGCACAACAATACAAGCCCAGATGATGTAGTGATATGCCAGGTATTGGATATTTTGAAT 891 *********************************************************** PvGWD-2 TCTTAATACAGTAAGTATTTAAGCATTGAGGTTTTCATGGTTATGTCTCTCCTTGGGCAG 1011 PvGWD-5 TCTTAATACAGTAAGTATTTAAGCATTGAGGTTTTCATGGTTATGTCTCTCCTTGGGCAG 1011 PvGWD-1 TCTTAATACTGTAAGTATTTAAGCATTGAGGTTTTTATGGTTATGTCTCTCCTTGGGCAG 951 ********************************************************** PvGWD-2 GCACTAATTGATTATATCAAGAGTGATTTTGATATAAGTGTTTACTGGGACACCTTGAAC 1071 PvGWD-5 GCACTAATTGATTATATCAAGAGTGATTTTGATATAAGTGTTTACTGGGACACCTTGAAC 1071 PvGWD-1 GCATTAATTGATTATATCAAGAGTGATTTTGATATAAGTGTTTACTGGGACACCTTGAAC 1011 *********************************************************** PvGWD-2 AAAAATGGCATAACCAAAGAGCGTCTCTTGAGCTATGATCGAG-CTATCCATTCAGAACC 1130 PvGWD-5 AAAAATGGCATAACCAAAGAGCGTCTATTGAGTTATGACCGTC-CGATCCATTCCAGACC 1130 PvGWD-1 AAAAATGGCATAACCAAAGAGCGTCTTTTGAGCTATGATCGTTGCTATCCATTCAGAACC 1071 **************************************************
(199) Sequences of the exons from the switchgrass GWD gene(s) were inferred from the above information. The inferred sequences were used to (1) develop an RNAi construct that would target this central region of one or all of the switchgrass GWD genes, and (2) determine more of the genomic sequence for each of these (at least three) GWD homologues in switchgrass.
(200) To develop an RNAi construct, PCR was used to amplify portions from two of the exons encompassed in the degenerate PCR products described above. These two products were then fused by SOE PCR (Horton R. M., Hunt H. D., Ho S, N., Pullen J. K., Pease L. R., Engineering hybrid genes without the use of restriction enzymes: gene splicing by overlap extension (1989) Gene 77(1):61-8), which is incorporated herein by reference as if fully set forth). The fused products included a contiguous sequence that was expected to more closely match one or more of the switchgrass GWD mRNAs. NheI and XmaI sites were incorporated into the termini of the fused product to enable subsequent cloning into pAL409. The sequence of this product (called PvGWDko2 along with the flanking restriction sites) is depicted below.
(201) TABLE-US-00012 >PvGWDko2RNAidriversequence [SEQIDNO:209] GGCTAGCGAGATAAGCAAAGCACAAGATAGGTTTACAGATGATCTTGAGA ACATGTACAAAGCTTATCCTCAGTACAGAGAGATATTAAGAATGATAATG GCTGCTGTTGGTCGTGGAGGTGAAGGTGATGTTGGTCAACGTATTCGTGA TGAGATATTAGTAATACAGGAGAAATAATGACTGCAAAGGTGGAATGATG GAAGAATGGCACCAGAAATTGCACAACAATACAAGCCCAGATGATGTAGT GATATGCCCGGGAGG
(202) One copy of this element was ligated into the AvrII and BspEI sites of pAL409, then a second copy was ligated into the NheI and AgeI sites of the resulting plasmid, producing the RNAi cassette pAL409 PvGWDko2, which had the elements arranged in opposite orientations, separated by the OsUbi3 intron, as described in reference to
(203) By learning the complete genomic sequences of each of the GWD genes in switchgrass identification of the potentially unique sequences (5 and 3 untranslated regions) that flank each of these genes may be possible. With this information, it may be able to design RNAi constructs that specifically target one or the other of these genes.
(204) To identify more of the sequences associated with each of the GWD homologues, a strategy was pursued that employed inverse PCR (iPCR) as well as degenerate PCR. Genomic DNA from switchgrass was digested with either EcoRI, HindIII, or Bgl II. These were then subjected to self-ligation, diluted approximately 100-fold, and used as templates in inverse PCR reactions. The sequences of the first primers used in iPCR reactions are summarized in Table 2.
(205) TABLE-US-00013 TABLE2 SequencesofprimersusedforinversePCR PvGWDi-1 CCGTGGCTTCACATTATAGTTCTTATTCCA SEQID NO:211 PvGWDi-2 GAGATAAGCAAAGCACAAGATAGGT SEQID NO:212 PvGWDi-3 GCCTGCCCAAGGAGAGACATAACCA SEQID NO:213 PvGWDi-4 GATATAAGTGTTTACTGGGACACCT SEQID NO:214
(206) Inverse PCR reactions with either primers PvGWDi-1 and PvGWDi-2 or primers PvGWDi-3 and PvGWDi-4 were carried out using the EcoRI- or HindIII-digested (and self-ligated) templates. These reactions gave rise to a small number of clear products, which were purified from agarose gels and ligated into pCRBluntII-TOPO. Sequence analysis of the resulting plasmids allowed extending the known sequence from switchgrass GWD genes at both the 5 and 3 ends to a total of 3.4 kb. Again, the sequences from individual clones differed by about 1-2%, consistent with the idea that the cloned PCR products were derived from separate but very similar GWD homologues in the switchgrass genome. This exercise was repeated with newly designed primers, incorporating both inverse PCR and degenerate PCR to extend the known sequence further. Approximately 7 kb of switchgrass GWD sequence was identified.
(207) An amalgamated sequence is provided representing the switchgrass GWD gene sequences discovered herein. The sequence presented does not include all of the variations identified among the homologues. Thus, the sequence could be viewed as a chimera of these homologues. This sequence straddles a segment of approximately 1-2 kb for which there is no sequence data. This segment is represented as a string of Ns. Referring to
(208) TABLE-US-00014 >switchgrassGWDhomologues [SEQIDNO:215] GGAACGACAGTGTACAAGAACAGGGCTCTTCGGACGCCTTTTCTAAAGGT CAGTCTTGTTACATTATGGATCTCTTTGTTACCACAGAACAGTCTGGTTA GCAGTAATGTCCATAACTGTGCAGTCAGGAGGTGATAACTCCACGCTTAG AATTGAGATAGATGATCCTGCGGTGCAAGCTATTGAATTTCTCATCTTTG ATGAGACACAGAACAAATGGTAACCCAGCTGTTTTCGTTACCATGTAGCA CTGTTTGTTTGTTTGAATGCAAAAGGTATATAAACTATGCAAAACTCTAC ATTGCACAGGTTTAAAAATAATGGCCAGAATTTTCAAATTCAGCTCCAAT CGAGCCACCATCATGGTAGTGGCGCATCTGGTGCCTCATCTTCTGCTACT TCTGCCTTGGTGCCAGAGGATCTTGTGCAGATCCAAGCTTACCTACGGTG GGAAAGAAATGGAAAGCAGTCATACACACCGGAGCAAGAAAAGGAAAGCT TTTAGTTGTTTTTTTTTATCTTCAGTCTGGAAGGAACTCAATGTACTAAG TTGATTAAAAATAAGAGGTGGTGTATTTTTTCTCCAGGAGGAGTATGAAG CTGCACGAGCTGAGTTAATAGAAGAATTAAATAGAGGTGTTTCTTTGGAG AAGCTTCGAGCTAAATTGACAAAAGCACCTGAAGTGCCCGACTCAGATGA AAATGATTCTCCTGCATCTCAAATTACTGTTGATAAAATTCCAGAGGACC TTGTACAAGTCCAGGCTTATATAAGGTGGGAGAAAGCAGGCAAGCCAAAC TATCCTCCTGAGAAGCAACTGGTAATGCATTGATTCAATAGCGTAAAATA CCTTGTTGGCTTTACACTTTATGGAGGTTCTTATCTCACAATTCGCTAGG TCGAGTTTGAGGAAGCAAGGAAGGAACTGCAGGCTGAGGTGGACAAGGGA ATCTCGATTGATCAGTTGAGGAAGAAGATTTTGAAAGGAAACATTGAGAG TAAAGTTTCGAAGCAGCTGAAGAATAAGAAGTACTTCTCTGTAGAAAGGA TTCAGCGCAAAAAGAGAGATATCATGCAGATTCTTAGTAAACATAAGCAT ACTGTCATAGAAGAGCAAGCAGAGGTTGCACCAAAACAACTAACTGTTCT TGATCTCTTCACCAATTCATTACAGAAGGATGGCTTTGAAGTTCTAAGCA AAAAACTGTTCAAGTTCGGTGATAAACAGATCCTGGTTAGGATCCTTAAG ATATTCTTTGTATCTCCAGATCTTTTTCTACCATGCTAATTAAGCTTCTC TCTTCTTAAGGCAATCTCCACCAAGGTTCTAAACAAATCAAAAGTTTACT TGGCAACAAATCATACGGAGCCACTTATCCTTCACTGGTCACTAGCGAAA AAGGCTGGAGAGTGGAAGGTTAAATTTCAAAATTGTTTCCAGTAGTTAAA GCCACAAACTCAGCAGCTTTTTTAAACACTGCTATCAGTACCAATGCGGT GTTATTTAACTGTGCAGGCACCTCCTTCAAACATATTGCCATCTGGTTCA AAATTGTTAGACATGGCATGCGAAACTGAATTTACTAAGTCTGAATTGGA TGGTTTGCATTATCAGGTGGAAATAACATCTTCAACCTGTTATTTTATTC TTATTTTTATTAGCCCTCCTGCTATCTCAAGGCTCTTAATTTCCAGGTTG TTGAGATAGAGCTTGATGATGGAGGATATAAAGGGATGCCATTCGTTCTT CGGTCTGGTGAAATGTGGATAAAAAATAATGGCTCTGATTTTTACCTTGA TCTCAGCACCCGTGATACCAGAAATATTAAGGCAAGTGTTTCTGTCCATT TTACCTTTCAAACTTTAAACTATTGTCTTTGTTTTGTCTATGCAACTAGT CGCTAAATTGTGAAGTAACCGATCTGTTCTTAATTGAAGGACACTGGTGA TGCTGGTAAAGGTACTGCTAAGGCATTGCTGGAAAGAATAGCAGAGCTGG AGGAAGATGCCCAGCGATCTCTTATGCACAGGTCAGGCACTAAAATATCC ATAATAATATGACTGAATTTTACATGGAAAATTCTCCTAAACTACTTCTA CTCCTTGACAGATTCAACATTGCAGCAGATCTAGTTGACCAAGCCAGAGA TGCTGGACTATTGGGTATTGTTGGACTTTTTGTTTGGATTAGATTCATGG CTACAAGACAACTGACATGGAATAAGAACTATAATGTGAAGCCACGGTAT ATACCTGTCTTTATTATTTACTTCAGTAATGTTTACTCTCTGCTTTAAAA GTTAAAGAATCAGAAGTTGTCCCTTTCTTTTGTGCGGGAACATAATTGAA AAGTTGGTGTTCTTGCCACTACAAGTCAACGCGATTTTACCCCTCGTCAA CGGTCAAAACAGTAGCAAAATCGCGTTGACTTGTGAATAGTAAGGGCAAA TCACAAAGTTGGAAAAAACAAGGACAAAATCACAATTGCACTGCAAAGTA GTCGCGGAAACACAAATGCCCCAAAATAATTTGGCTGTTTGTCCTGATAA AAAACAATACAATTCAGTACTCAGAGAATATTATATTTCTATAAATGAAA AACATAACTCATGTCGCATTCTTTCATTCTTTGGCATCTCATATTGATTA ATAACTACGCAGTGAGATAAGCAAAGCACAAGATAGGTTTACAGATGATC TTGAGAACATGTACAAAGCTTATCCTCAGTACAGAGAGATATTAAGAATG ATAATGGCTGCTGTTGGTCGTGGAGGTGAAGGTGATGTTGGTCAACGTAT TCGTGATGAGATATTAGTAATACAGGTAAAATTAATGGTCCTAGGTGAAT ATACACCTACTTTTATTCATTGCTTCACTGAATTATACGGTTGGTAGTTC TGATCCAAAAGATAGACATTGTGAATAATAATAAAATGCTTGCTGCTTTT ATAGAGAAATAATGACTGCAAAGGTGGAATGATGGAAGAATGGCACCAGA AATTGCACAACAATACAAGCCCAGATGATGTAGTGATATGCCAGGTATTG GATATTTTGAATTCTTAATACTGTAAGTATTTAAGCATTGAGGTTTTTAT GGTTATGTCTCTCCTTGGGCAGGCATTAATTGATTATATCAAGAGTGATT TTGATATAAGTGTTTACTGGGACACCTTGAACAAAAATGGCATAACCAAA GAGCATCTCTTGAGCTATGATCGTGCGATTCATTCAGAACCAAATTTCAG AAGTGAACAGAAGGAGGGTTTACTCCATGACCTGGGTAATTACATGAGAA GCCTGAAGGTATGTAAAACACTTAATATGGATATAAAAAAAGGCATGCAA AAAAATCTGTGCATTATCTTTGAAATTGAGTATGGTATTTTCTAAAGAAA ACATAGAAAAACACATATTGCCCTTTCAGTTCCGGAAAAAAATGATCTGC CATAAAGAGCATACAGTCAACTCATGTATTAGCACTCGCCTTTTCTGCTA ATGGTATGTTGTGTTGTGTTCTGTTCTATTCAATATATGCTTTCAGTAAT AATATTCTAGTGTTGACAACATCATTGCTCACAACATACAGAAACTGTAG TATGCCCGGTACAGTATGAACTTGTCCTTGAGTCTCCTCATTTTTTCCTT ATTCACGTCACAGCTTTATATCCTTCCAATGAATAATGATCAACTTGGAA ATCATTGGCATCTACAGTGAACCGTCCATTGTATTCTGATTTTGAACAAC TTTTTTTCCCCTCAGAACACACAGTAATAGCCAAGTATAACGACCTTACA TGGCCAAAACAACAACCTTACATGGCCAAAATAGCCAGGTAAGGGACAGA AGAAGAGAGAGGGTTGCCCTGCGGCAGATGTGGACAATGACTGATGATGT GGCTGTCCCAGTTATCAAAACAGGCAAATCCACTGTTCATGTGGCCNAAG CCAGTAATGAGCTGGTTTTGGGAAACCCTGGGGGATTGAGTAAACAATTA GAGGGTTATGTGGATTTGGTCATAGTTGGGGGTAGGAATTTGGAAATTTC CCTTTTGCTTGATAATTATGTTAGTCAAGAGATTAGACAAGTATTGTTAG GAGTTTGTTTCAGCTGGTTGAGATTGGATTTGGTTTCTTAGGTGATTGGT TAGTGCTACCCTTGCTCTATAATTGGGGATTTGCTTTTAATAAAGAAAGC AGAAATAAACCCAATCCTTCTCCGGTTCTCCCTCTTTTGTCCGATGTTTG CAGATGCGGCCACTGATAAGGTCCAGGTCCATGTCCTCCCATCAACCACA CACACATACAGCCTAAGATCTAATTCACCCCAGGACACCCAAGCTCGTGA AAATATACCATGTCATCCCACTATTCATACTTTTTTTAAAAAAATCCCAC TAATCCTGCAAATGTCCTAATATAAGAACAACATCATTTTCAGTCATGTT GTACCTTTTCTTGGTGACAAAAAGAAGACATCCATTTCATCTCTTTTTAA GGGGCATTTTCTCATCGTTTCTGCAATTGAATATTCTTTCCCTGATGTAA TCTTTGAATGAATGCTATTGTGATTTGCTCATTCTGTTAGGCTGTGCATT CTGGTGCTGATCTTGAGTCTGCTATAGCAACTTGCATGGGATACAAATCG GAGGTATCATTCTCATTCCTTTTCATTCCGCTAGAATTCTTTAGATACCT GTGCTCATATCTAATGAACTAACTTTTGGGTACAGGGTGAAGGTTTCATG GTCGGTGTTCAGATCAATCCAGTGAAGGGTTTGCCATCTGGATTTCCTGT AAAAATCCCTCACCTTCTTTTCTCAACACATGTACTTTCTAAGTTTCTTA TACTTGTGACATTTACCTTTATAGGAATTGCTCAAATTTGTGCTTGACCA TGTTGAGGACAAGTCAGCGGAACCACTTCTTGAGGTCAGTGATATAATCG AAGTTCCTGTTTGTAATAAAACGAAGAGAAGAAGCTGGGTTTTTCATCAC AACTCAAATAATCAGATCTCACATAGCTGATTGAATTTTTAAACCACCAT TTTTTGCGGNTACTATGNGAATCACTTGTTGCTAACAAAATGCTACCTTG NAGGGGNNGGTGGAAGCTCNAGTTGAACTCCNCCCTNNNCTTCNTGNTTC ACCNGNACGCATGAAAGAANNTATTTTTTTGGNCATTGCNCCTGATTCNA CTTTTANGACAGCTATNGAAAGGNCATATGANGAGCTCCNCCATGGANNC CCCGANGNTGGGCNCCCNAATATTGNCCCCATGATNNGNNNNANGNNNAG NNCCNNNANNNNNNNCNNNNNNNNNNTNNNNNANANNNNGNNTNNNNNAN NNNNNNNNNNNGNNNNNNNNCNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNAANNNNNNNNNNNNNNANNNCCNNNNNNNNNNNNNNNNANNNNNNNNA NNNNNNNNNNNNNNNNNNCNNNNNNAANNNNNNNNNATNCNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNANNNNNNNNNNNNNNCNNCNNNNNNNNNNN NNNTNNNNNNNNANNNNCAACNNNNNNNNNNNNNNNNNNNCCNNNNNNNN NGNNNNNNNNNNNNNANNNNNNNNNNNNNNNNNANNNNNNNNANNNNNNN NNNNNNNNNNNNNANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNCCNNNNNNNNNNNNNNNNNNNNNNNNANNNNNNNNNNNNN GNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCNNNNNNNNNNNNNNNNCN NNNNNNNNNNNNNNANNNNNNNNNNNNNNANNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNANNNNNNNNNNNNNTNNANN NCNNNNANTANNCNNNNNNNNNTNNNNGNNNNNNNNNNNNNNNNNNNNNN NNNNGANNNNNNNNNNNNNNNNAGNNNTNNNANNNNNCNNNNNNNNNNGN NNTNNNNNNNNNNNNNGNNNNNNNNNNNNNNNNNNNNNNNTNNNNNNNNN NNNNNNNNGNNNNNNTNNNNNNNNGNNNNNNNNCNNNNNNNNNNNNNNNN NNNNNNNNNNTNACNNNNNNNNNNNNNTCNCNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNGGCNTNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNANNNNNCNNNNNNNNNNCNNNNNNNNNNNNNTNNNNNNCNNN NNCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGNNNNNNCNN NNNNNNNNNNCANNNCNNNNGNAGATCTCGGAGAGTGAACTTCAGCAATC AAGTTCTCCGGATGCAGAAGCTGGCCATGCAGTACCATCTATTTCATTGG TCAATAAGAAGTTTCTTGGAAAATATGCAATATCAGCTGAAGAATTCTCT GAGGAAATGGTTAGTAATATAAAATTTTGCATTAGGAAATCTGCCATTCG TAAGGAAGTCTTGATGAAACCAATTGTTATTATGCTGGTTTCCTTTTCTT TTGGCCTTGTGCTTCTAGTACTCACTTTTATGTTTTCAGGTTGGGGCTAA GTCTCGGAATATAGCATACCTCAAAGGAAAAGTACCTTCGTGGGTTGGTG TCCCAACATCAGTTGCGATACCATTTGGCACTTTTGAGAAGGTTTTATCA GATGGGCTTAATAAGGTTGGTTGGTGGTTTATTTTGATGTATATACTTGA ATAATAGAACTGCATGGTTCTTGGAGAAGTCAGATTCTTTAACATGTTTG AAATACACTACTGGGAAGGTAACAACGTGCAATTTAATGTCCACCAATAT CTAAACAGCCATTTTTGGCATTCAATTCACTATATATTTTATTTCATGAG CCTGCTCTATAAGTAGCGTCTTCAGTAGTTGTAGCTCATAGCTTCATAGT CTCATTCTACCATGAACTAATTTTGCTAACTTACATCTACTCTTGAAATA AGTAATACTTGTATATTATTATCTTTGATTGTAAAAGAACTTCCCTTGCT CGTTTGTCAAGGTGTCTTTTAGACAGGAGATGGAATTGACTGTTATCAAA GCAAATGATAACAAGAAACCTCTTGTTGATTGGTTGAGCAGTTTCAACTA ATCCATTTTTTTTTCTTTTTGGCATGTGATCTTTGTATTATTGGCCCAAA TGAAATTCTATTTCTCCCATTAACCACCCACAATGGCAGGTTTGGGTACA TATAGGCCAACCATGGGTAGGTGGCTTAAAAGTTGAGTAAAGCATAATTG GGGATAAGGTGCACATAGGCACGGACCACCCACAGACAAAGTGCTTGCAG GCACTACTAATACATTATTCTATCACCATCAGGATTCAATTCTAACATGT ACTGTTTCTTCTTTTTCTTCTTTGTACAGTTCCTGTATAGACCCTTTTGT ACAGTTTCCTAACAAATGAAAAAGATCAGTAGGAGACCCTCTTCTCCTGT TCCACAAAAAATGTTAAAATGGTCTTTCTAATATTTGATTGTTCTTTCTT TTATGGCAGGAAGAGCGCAAAACATAGAAAAGCTT
Example 7
Cloning ZmGWD1RNAi and ZmGWD2RNAi Silencing Cassettes
(209) The gene-specific inverted repeats within silencing cassettes ZmGWD1 and ZmGWD2 in the initial vectors pAG4102 and pAG4103 were selected in two independent regions of the maize ZmGWD cDNA (Maize GDB, Accession No. GRMZM2G412611). The 497 bp ZmGWD1 repeat sequence, which was used in construction of the pAG4102 vector, corresponds to position 389-885 nt at 5 end of the predicted maize GWD cDNA. The ZmGWD1 has two nucleotide mismatches (positions 449 and 483) comparing to the ZmGWD cDNA, which derived from the expressed sequence tag (EST) assembly TC458260 (The Gene Index Database) representing maize GWD expressed sequence.
(210) TABLE-US-00015 ZmGWD1 (SEQIDNO:49) TCCTCCCGTCCAGAAAACCTGATGGAACGACAGTGTACAAGAACAGGGCTCTCAGGACAC 60 CTTTTGTAAAGTCAGGTGATAACTCCACTCTAAGGATTGAGATAGATGATCCTGGGGTGC 120 ACGCCATTGAGTTCCTCATCTTTGACGAGACACAGAACAAATGGTTTAAAAACAATGGCC 180 AGAATTTTCAGGTTCAGTTCCAGTCGAGCCGCCATCAGGGTACTGGTGCATCTGGTGCCT 240 CCTCTTCTGCTACTTCTACCTTGGTGCCAGAGGATCTTGTGCAGATCCAAGCTTACCTTC 300 GGTGGGAAAGAAGGGGAAAGCAGTCATACACACCAGAGCAAGAAAAGGAGGAGTATGAAG 360 CTGCACGAGCTGAGTTAATAGAGGAAGTAAACAGAGGTGTTTCTTTAGAGAAGCTTCGAG 420 CTAAATTGACAAAAGCACCTGAAGCACCCGAGTCGGATGAAAGTAAATCTTCTGCATCTC 480 GAGTGCCCATCGGTAAA 497
(211) The 540 bp ZmGWD2 repeat sequence, which was used in construction of the pAG4103 vector, corresponds to position 2986-3525 nt of the predicted maize GWD cDNA. The ZmGWD2 sequence spans over a nucleotide triplet encoding Hys1072 that has been proposed to be a crucial amino acid residue in phosphorylation activity of the GWD protein (Mikkelsen, 2004).
(212) TABLE-US-00016 ZmGWD2 (SEQIDNO:50) CTTCTGAACCGATTTGATCCTGTTTTAAGGAATGTTGCTCACCTCGGAAGTTGGCAGGTT 60 ATAAGCCCGGTTGAAGTATCAGGTTATGTGGTTGTGGTTGATGAGTTACTTGCTGTCCAG 120 AACAAATCTTATGATAAACCAACCATCCTTGTGGCAAAGAGTGTCAAGGGAGAGGAAGAA 180 ATACCAGATGGAGTAGTTGGTGTAATTACACCTGATATGCCAGATGTTCTGTCTCATGTG 240 TCAGTCCGAGCAAGGAATAGCAAGGTACTGTTTGCGACCTGTTTTGACCACACCACTCTA 300 TCTGAACTTGAAGGATATGATCAGAAACTGTTTTCCTTCAAGCCTACTTCTGCAGATATA 360 ACCTATAGGGAGATCACAGAGAGTGAACTTCAGCAATCAAGTTCTCCAAATGCAGAAGTT 420 GGCCATGCAGTACCATCTATTTCATTGGCCAAGAAGAAATTTCTTGGAAAATATGCAATA 480 TCAGCCGAAGAATTCTCTGAGGAAATGGTTGGGGCCAAGTCTCGGAATATAGCATACCTC 540
(213) Prior to pAG4102 and pAG4103 vector construction, both ZmGWD1 and ZmGWD2 sequences were checked for potential intron splicing sites with only one such site identified in ZmGWD1 (AAAGGAGGAGT: SEQ ID NO: 152). This sequence was left intact for pAG4102 vector construction.
(214) The spacer region ZmAdh1i6 in ZmGWD1RNAi and ZmGWD2RNAi silencing cassettes (corresponding vectors pAG4102 and pAG4103) represents 342 bp sequence of the predicted intron 6 of the maize Adh1 gene (Gene Bank Accession No. X04049). This intronic sequence was cloned into silencing cassettes with its native gene coding flanking sequences consisting of either of 11 bp at 5 end or 10 bp at 3 end of the intron. The flanking sequences were included to ensure efficient post-transcriptional intron splicing. The flanking sequences are shown in capital letters in ZmAdh1i6.
(215) TABLE-US-00017 ZmAdhli6 (SEQIDNO:51) ATTCGAAGAAGgtacagtacacacacatgtatatatgtatgatgtatcccttcgatcgaa 60 ggcatgccttggtataatcactgagtagtcattttattactttgttttgacaagtcagta 120 gttcatccatttgtcccattttttcagcttggaagtttggttgcactggcacttggtcta 180 ataactgagtagtcattttattacgttgtttcgacaagtcagtagctcatccatctgtcc 240 cattttttcagctaggaagtttggttgcactggccttggactaataactgattagtcatt 300 ttattacattgtttcgacaagtcagtagctcatccatctgtcccatttttcagCTAGGAA 360 GTT 363
(216) The parts of the ZmGWD1RNAi and ZmGWD2RNAi silencing cassettes containing inverted repeats with the spacer sequence ZmAdh1i6, were synthesized as BamHI-AvrII fragments by the GenScript CRO. The synthesized fragments were cloned between maize PepC promoter and Agrobacterium tumefaciens nopaline synthase (NOS) gene terminator sequences to produce pAG4102 and pAG4103 silencing vectors.
(217) Both ZmGWD1RNAi and ZmGWD2RNAi cassettes were subsequently used for generating more complex stacked vectors by cloning silencing cassettes into T-DNA regions of the vectors that already had various expression cassettes for production of the cell wall degrading enzymes (CWDEs).
Example 8
Generation of Transgenic Plants
(218) E. coli strains carrying pAG2104, pAG2105, pAG2100, pAG2101, pAG2102, pAG2103, pAG2104, pAG2106, pAG4003, pAG4102, pAG4103 or plasmid-based transformation vectors carrying constructs 2379, 2380, 4106, 4107, 4108, 4109, 4110, 4111, 4112, 4113, 4114, 4115, 4116, 4117, 4120, 4121, 4124, 4125, 4514 or 4515 were used for conjugation with Agrobacterium and subsequent transformation of rice, maize, sorghum and switchgrass.
Example 9
Suppression of mRNAs from Genes Involved in Mobilizing Vegetative Starch
(219) To determine whether the RNAi vectors described above were exerting an effect on targeted mRNAs in transgenic plants, RNA was isolated from several control and transgenic plants, and real time reverse transcriptase PCR (real time RT-PCR, also known as real time quantitative PCR, RT-qPCR) was used to measure the relative abundances of mRNA species (
(220) Referring to
(221) The efficiency of GWD gene silencing in transgenic maize plants transformed with either pAG4102 or pAG4103 silencing constructs was similarly assessed by RT-qPCR. Green leaf material of transgenic and untransformed maize plants was sampled at the dusk, when the highest expression levels of the GWD gene is observed in monocots (Agrivida unpublished data). Collected leaf material was immediately frozen in liquid nitrogen and transferred to a 80 C. freezer for the storage prior to RNA isolation. The total RNA isolation was performed from 0.1-0.2 g of the frozen maize leaf tissues using TRIZol reagent (Invitrogen) according to the instructions supplied by the manufacturer with minor modifications. In order to remove any residual amounts of the maize genomic DNA in RNA preparations, 10 micrograms of the total RNA that was extracted with TRIZol from each sample were subsequently digested with DNase using the TURBO DNA-free Kit (Applied Biosystems/Ambion). The DNase-treated RNA samples were further purified with the RNeasy MinElute Cleanup Kit (QIAGEN) and 1 microgram of the purified RNA was subjected to cDNA synthesis using iScript Reverse Transcriptase (Bio-Rad) as described in the protocol provided by the manufacturer. All cDNA samples were diluted 1:50 with nuclease-free water. One microliter of the diluted cDNA sample (equivalent to 1 ng of the total RNA that was used for cDNA synthesis) was subjected to the qPCR assay with iQ SYBR Green Supermix (Bio-Rad) as specified in the supplied protocol. The GWD gene expression in plants that were evaluated was normalized against expression of two internal maize reference genes such as beta-Actin (GenBank Accession No. U60508) and glyceraldehyde-3-phosphate-dehydrogenase (GADPH, GenBank Accession No. X07156.1). The primers utilized in RT-qPCR analysis were designed with Primer3 software that is available online. The PCR primers are listed in Table 3:
(222) TABLE-US-00018 TABLE3 ListoftheprimersusedinRT-qPCRforanalysisofmaizeGWD expression Annealing Product Target temperature size, gene Forwardprimer Reverseprimer ( C.) bp GWD ob1659 ob1660 55 104 (SEQIDNO:153): (SEQIDNO:154): ATGAAGGAGACAAGCGTTGG AAGTTTCACCTTGCGTGTGC Beta- ob1555 ob1556 55 119 Actin (SEQIDNO:155): (SEQIDNO:156): CAACTGCCCAGCAATGTATG CGTAGATAGGGACGGTGTGG GADPH ob1567 ob1568 55 88 (SEQIDNO:157): (SEQIDNO:158): CGCTGAGTATGTCGTGGAGT AACAACCTTCTTGGCACCAC
(223) All PCR primers for GWD expression assays had been previously validated through regular RT-PCR followed by the agarose gel electrophoresis in order to check primer specificity. Furthermore, selected primers were validated by the Standard Curve Calibration and Melt Point analyses to ensure reproducible amplification results. Real-time PCR amplification reactions were performed in 12.5 microliter volume in 96-well plates in a CFX-96 instrument (Bio-Rad). A control cDNA representing untransformed plant #37 (wild type maize line AB) was run with each experimental plate to ensure availability of a cross-reference between different qPCR runs. A no-template control (NTC) was also included with each run for every gene. Each experimental sample was run in triplicates. The thermal profile of the PCR reaction was 95 C. for 3 min activation and denaturation, followed by 40 cycles of 95 C. for 10 sec, 55 C. for 20 sec, and 72 C. for 30 sec. The quantification cycle value (Cq) for each reaction was calculated automatically by the CFX Manager Software (Version 2.1). The observed expression of the GWD gene in untransformed control plant #37 was set to 1 and GWD expression in each analyzed transgenic plant was compared to this value in order to generate relative gene expression data.
(224) Results of the GWD Expression Analysis in Transgenic Maize
(225) The silencing cassettes ZmGWD1RNAi and ZmGWD2RNAi provided significant levels of suppression of the GWD gene expression reaching more than 13 fold transcript level reduction in individual transgenic plants, when compared to the untransformed maize plant #37. The GWD gene expression was not suppressed in only one plant (4102.64) among all analyzed transgenic plants. The observed GWD silencing efficiency was 95.8-100%, with more than 3 fold transcript level reduction in each analyzed transgenic plant relative to the GWD level in untransformed maize (Table 4 and
(226) TABLE-US-00019 TABLE 4 Efficiency of GWD silencing in transgenic maize by ZmGWD1RNAi and ZmGWD2RNAi cassettes Number of Number of plants The observed analyzed with suppressed efficiency of Silencing transgenic GWD transcript GWD silencing (% cassette plants levels of silenced plants) ZmGWD1RNAi 24 23 95.8 (pAG4102) ZmGWD2RNAi 12 12 100 (pAG4103)
Example 10
Modified Plants Accumulate Elevated Levels of Starch and Higher Overall Glucose
(227) Tissues were collected from control plants as well as rice and switchgrass plants that carry integrated copies of the RNAi transgenes described above. These tissues were then dried and milled to a fine powder. The starch content of these tissues was then determined by standard methods (Smith A M and Zeeman S C, Quantification of starch in plant tissues (2006) Nat. Protocols 1:1342-1345, which is incorporated herein by reference as if fully set forth). Referring to
(228) Referring to
(229) To generate additional transgenic plants that express the above-described microRNAs or hpRNAs, constructs were introduced into plasmid vectors for Agrobacterium-mediated plant transformation pAG2005 (SEQ ID NO: 75), and pAG4003 (SEQ ID NO: 76).
(230) Transgenic rice plants were generated, and grown in greenhouses to maturity, after which seed and dried straw were harvested. The starch content of individual plants was measured via a total starch assay (Megazyme, Bray, County Wicklow, Ireland).
(231) Milled samples of the straw were collected from each of several transgenic plants produced from pAG2110, pAG2115, and pAG2116 that accumulate >8% (>80 mg/g) starch, and these were pooled to prepare a transgenic mix. This transgenic mix had an average total starch content of approximately 9.2% (92 mg starch per gram dry weight). Similarly, samples of straw from Nipponbarre plants were pooled to prepare a NB control mix, which had an approximate starch content of 0.8% (8 mg starch per gram dry weight). Samples were then taken from these mixes, and total acid hydrolysis was used to examine their total carbohydrate composition to determine whether starch accumulation affected overall carbohydrate content. About 0.20 g milled tissue from each mix was treated with 72% sulfuric acid at 30 C. for 1 hr. The acid was then diluted to 4% concentration, and samples were autoclaved at 121 C. for 1 hr. After cooling, the pH of the samples was adjusted to pH 5-8. Analysis was carried out via HPLC, and sugar concentrations were calculated relative to pure sugar standards. Results from this analysis are shown in Table 5.
(232) TABLE-US-00020 TABLE 5 Carbohydrate composition of control and transgenic rice straw. NB control mix Transgenic mix Carbohydrate Ave Stdev Ave Stdev Glucose (g/100 g biomass) 36.32 0.07 43.39 0.39 Xylose (g/100 g biomass) 14.64 0.09 13.84 0.06 Galactose (g/100 g biomass) 1.26 0.09 0.91 0.53 Arabinose(g/100 g biomass) 0.25 0.02 0.07 0.24 Mannose (g/100 g biomass) 2.16 0.06 2.01 0.45
(233) These results confirm that the additional starch content of the transgenic biomass results in an increase in the total glucose content of the material and does not come at the expense of, for example, cellulose-derived glucose.
Example 11
Transgenic Plants that Accumulate Vegetative Starch have Greater Above-Ground Biomass
(234) To determine whether the starch-accumulation phenotype affected the biomass yield among transgenic plants, 20 progeny from a single rice plant carrying construct 2116 and 9 progeny from wild type (NB) control plants were grown in a greenhouse to maturity, seed was collected, and total above ground biomass (excluding seeds and panicles) was harvested, dried and weighed (
Example 12
Glucose Recovery Following Enzymatic Hydrolysis
(235) Rather than using a total acid hydrolysis to extract glucose from lignocellulosic biomass, a dilute acid pretreatment is often used to pretreat biomass in order to disrupt the crystallinity of the carbohydrate polymers that are present in the plant material, and then cocktails of hydrolytic enzymes (cellulases, hemicellulases, etc.) are added to hydrolyze the polymers to their component sugars. To determine whether high-starch biomass would allow more glucose to be recovered during such an enzymatic hydrolysis procedure, 20 mg of milled tissue from each of the mixed samples (NB control mix and Transgenic mix) was pretreated with 0.25M sulfuric acid (pH 1.0) at 95 C. for 4 or 16 hrs, or at 120 C. for 1 hr. After adjusting the pH of the pretreated samples, enzymatic hydrolysis was carried out with a commercially available enzyme cocktail, Accellerase (Genencor, Palo Alto Calif.). Hydrolysis with a full cocktail (FCt) Accellerase mixture involves 200 L, Accellerase 1500 and 100 L Accellerase XY per gram biomass at 50 C., pH 5.0, for 72 hrs. Following hydrolysis, glucose yield was measured using HPLC (
(236) To determine whether increased glucose could be recovered from the transgenic straw when alternative methods of biomass pretreatment were employed, samples of the NB control mix and the Transgenic mix biomass were subjected to three different pretreatment strategies. 20 mg milled tissue was pretreated with a dilute acid pretreatment (0.25M sulfuric acid at pH 1.0), a base pretreatment (7.5% NH.sub.4OH at pH 12.0), or a bisulfite pretreatment (0.175M NH.sub.4HSO.sub.3+0.18M (NH.sub.4).sub.2CO.sub.3; bisulfite at pH 8.1). Following pretreatment, samples were hydrolyzed with an enzyme cocktail as described above. The amounts of glucose and xylose that were released following each pretreatment and hydrolysis were quantified by HPLC (
(237) These observations indicate that increasing starch content of the biomass enables the recovery of more glucose per unit of biomass. Furthermore, comparison of the glucose yield from acid pretreated (95 C. 16 h) NB control mix sample with the glucose yield from the bisulfate pretreated (95 C. 16 h) transgenic mix sample as shown in
(238) In a similar experiment, samples from two individual transgenic rice plants 2110_13 and 2110_21 and the NB control mix were pretreated under four different conditions, then subjected to enzymatic hydrolysis (
Example 13
Simultaneous Saccharification and Fermentation of Biomass
(239) Ethanol, organic acids, or other biochemicals can be produced from lignocellulosic biomass through simultaneous saccharification and fermentation (SSF). This process utilizes biomass pretreatment, as described above, followed by the hydrolysis of the biomass with enzyme cocktails in the presence of yeasts or other microorganisms, which will metabolize the resulting sugars to produce ethanol, organic acids, and/or additional biochemicals.
(240) To determine whether biomass from the high-starch transgenic rice would support the production of more ethanol relative to control biomass via SSF, 3 g of biomass from either the NB control mix or the transgenic mix was pretreated with 0.25 M H.sub.2SO.sub.4 pH 1.0 at 120 C. for 1 hr in a pressure cooker. Hydrolysis was carried out at pH 4.9 at 50 C. with the full Accellerase cocktail (FCt). FCt contains 200 L Accellerase 1500 per 1 g biomass and 100 L Accellerase XY per 1 g biomass. YP medium (10 g/L yeast extract, 20 g/L peptone) was added to the hydrolysate to aid in yeast growth, and then an inoculum of D5A yeast cells was added from a frozen stock. A glucose control sample was also prepared that employed pure glucose (approximately 20-22 g/L) in the medium in place of the hydrolyzed biomass.
(241)
(242)
(243)
(244) Based upon the initial amounts of glucose that were present in each culture as glucan polymers in the biomass samples, it is possible to calculate the effective ethanol yield on glucose, using Equation 1:
(245)
where the [EtOH].sub.f is the final ethanol concentration (g/L), [EtOH].sub.0 is the initial ethanol concentration (g/L), 0.51 is the conversion factor for glucose to ethanol based on the stoichiometric biochemistry of yeast, f is the glucan (glucose polysaccharide) fraction of the dry biomass, and [Biomass] is the starting concentration (g/L) of the dry biomass in the sample. Using this equation, there was a higher ethanol conversion rate from the transgenic biomass than from the control biomass, as shown in Table 6.
(246) TABLE-US-00021 TABLE 6 Ethanol conversion rates of control and transgenic biomass. % EtOH Conversion % EtOH yield Strain NB control mix Transgenic mix Glucose Control % Conversion 81.83 84.12 89.45
(247) It was observed that the ethanol conversion rate on the transgenic biomass was closer that observed on pure glucose, indicating that ethanol fermentation can proceed more efficiently from transgenic, high-starch biomass than from control biomass.
Example 14
Combining Starch Accumulation with the Expression of Polysaccharide Degrading Enzymes
(248) While increased starch accumulation in the transgenic biomass enables the release of more glucose from biomass during hydrolysis or SSF, it may be possible to convert even more of the available glucan into glucose by producing polysaccharide degrading enzymes in the biomass while plants are growing at the same time that starch is accumulating. Plant cell walls are composed of numerous polysaccharides, including glucan polymers such as cellulose, -glucan, and xyloglucans, as well as other complex polysaccharides such as hemicelluloses (heteroxylans), pectins, etc. Expressing enzymes that degrade these polysaccharides will make it possible to decompose these polysaccharides into their component sugars more readily, thereby providing an additional benefit to high-starch biomass for animal feed or fermentation applications, in which fermentable or digestible sugars are more efficiently recovered from both vegetative starch and structural polysaccharides.
(249) A number of polysaccharide degrading enzymes can be employed for this objective. Examples of these polysaccharide degrading enzymes are provided as SEQ ID NO: 77 (O43097 xylanase), SEQ ID NO: 78 (BD22308 cellobiohydrolase), SEQ ID NO: 79 (BD25243 endoglucanase), SEQ ID NO: 80 (EU591743 xylanase), SEQ ID NO: 81 (NtEGm endoglucanase), SEQ ID NO: 82 (P0C2S1 cellobiohydrolase), SEQ ID NO: 83 (P77853 xylanase), SEQ ID NO: 84 (O68438 cellobiohydrolase), SEQ ID NO: 85 (O33897 endoglucanase), SEQ ID NO: 160 (amylase 19862), SEQ ID NO: 161 (glucoamylase 20082), SEQ ID NO: 162 (glucoamylase 20707), SEQ ID NO: 163 (amylase 21853), SEQ ID NO: 169 (AmyS), and SEQ ID NO: 171 (GlaA). These enzymes can be encoded by DNA sequences SEQ ID NO: 86 (O43097 xylanase), SEQ ID NO: 87 (BD22308 cellobiohydrolase), SEQ ID NO: 88 (BD25243 endoglucanase), SEQ ID NO: 89 (EU591743 xylanase), SEQ ID NO: 90 (NtEGm endoglucanase), SEQ ID NO: 91 (P0C2S1 cellobiohydrolase), SEQ ID NO: 92 (P77853 xylanase), SEQ ID NO: 93 (O68438 cellobiohydrolase), SEQ ID NO: 94 (O33897 endoglucanase), SEQ ID NO: 164 (amylase 19862), SEQ ID NO: 165 (glucoamylase 20082), SEQ ID NO: 166 (glucoamylase 20707), SEQ ID NO: 167 (amylase 21853), SEQ ID NO: 168 (AmyS), and SEQ ID NO: 170 (GlaA).
(250) Because the activity of polysaccharide degrading enzymes in living plant tissues may interfere with plant growth or development, it may be helpful to express such enzymes as inactive precursors, for example, as intein-modified pro-enzymes that can be activated after the biomass has been harvested by regulating the conditions under which inteins can be activated (See the following references for intein-modified pro-enzymes that may be utilized herein. PCT patent application PCT/US03/00432 filed Jan. 7, 2003, PCT/US2010/055669 filed Nov. 5, 2010, PCT/US2010/055751 filed Nov. 5, 2010, PCT/US2010/055746 filed Nov. 5, 2010, U.S. Pat. No. 8,247,647 issued Aug. 21, 2012, and U.S. patent application Ser. No. 12/590,444 filed Nov. 6, 2009, all of which are incorporated by reference herein as if fully set forth). Examples of inteins that can be employed for this purpose are provided as the polypeptides SEQ ID NO: 95 (AS146-7 intein polypeptide), SEQ ID NO: 96 (S158-30-108-35), and SEQ ID NO: 97 (T134-100-101), which can be encoded by the DNA sequences SEQ ID NO: 98 (AS146-7), SEQ ID NO: 99 (S158-30-108-35), and SEQ ID NO: 100 (T134-100-101). Examples of enzymes that incorporate such inteins are provided as SEQ ID NO: 101 (EU591743:AS146-7), SEQ ID NO: 102 (P77853:S158-30-108-35), and SEQ ID NO: 103 (P77853:T134-100-101), which can be encoded by the DNA sequences provided as SEQ ID NO: 104 (EU591743:AS146-7), SEQ ID NO: 105 (P77853:S158-30-108-35), and SEQ ID NO; 106 (P77853:T134-100-101).
(251) Expression cassettes can be constructed such that transcription of the CWDE is directed by a suitable promoter. Examples of such promoters are provided as SEQ ID NO: 55 (ZmUbi1P), SEQ ID NO: 56 (ZmPepCP) and SEQ ID NO: 57 (OsUbi3P). Transcription from this cassette can be terminated by employing a suitable polyadenylation signal. An example of such a signal is provided as SEQ ID NO: 58 (NOS terminator). Furthermore, accumulation, stability, and/or subcellular targeting of a given polysaccharide degrading enzyme can be modified by expressing them as fusions with suitably chosen N-terminal or C-terminal polypeptides. Examples of such polypeptides are provided as SEQ ID NO: 107 (BAASS signal peptide), SEQ ID NO: 108 (SEKDEL signal peptide), SEQ ID NO: 109 (xHvVSD targeting signal), SEQ ID NO: 110 (ZmUBQm translational fusion), SEQ ID NO: 111 (xGZein27ss), and SEQ ID NO: 112 (HvAle signal), which can be encoded by DNA sequences provided as SEQ ID NO: 113 (BAASS), SEQ ID NO: 114 (SEKDEL), SEQ ID NO: 115 (xHvVSD), SEQ ID NO: 116 (ZmUBQm), SEQ ID NO: 117 (xGZein27ss), and SEQ ID NO: 118 (HvAle).
(252)
(253) Examples of other CWDE expression cassettes are provided as SEQ ID NO: 119 (ZmUbi1P:xGZein27ss:BD22308:xHvVSD, SEQ ID NO: 120 (ZmPepCP:xGZein27ss:BD25243:SEKDEL), SEQ ID NO: 121 (OsUbi3P:EU591743), SEQ ID NO: 122 (ZmUbi1P:EU591743:AS146-7:SEKDEL), SEQ ID NO: 123 (ZmUbi1P:HvAle:NtEGm:SEKDEL), SEQ ID NO: 124 (ZmPepCP:HvAle:NtEGm:SEKDEL), SEQ ID NO: 125 (OsUbi3P:HvAle:NtEGm:SEKDEL), SEQ ID NO: 126 (OsUbi3P:BAASS:O33897), SEQ ID NO: 127 (ZmPepCP: BAASS:O43097:SEKDEL), SEQ ID NO: 128 (OsUbi3P:O68438), SEQ ID NO: 129 (OsUbi3P:P0C2S1), SEQ ID NO: 130 (ZmUbi1P:ZmUBQm:BAASS:P77853:S158-30-108-35 and SEQ ID NO: 131 (ZmUbi1P:BAASS:P77853:T134-100-101:SEKDEL).
(254) Multiple CWDEs and hpRNAs can be expressed simultaneously in transgenic plants by constructing a transformation vector that has multiple expression cassettes linked in tandem.
(255)
(256) Following are descriptions of vectors that can be used to introduce the above-described constructs into plant cells via Agrobacterium-mediated transformation.
(257) pAG4003 (SEQ ID NO: 76) is a plasmid that carries a spectinomycin resistance marker, a bacterial origin of replication, an Agrobacterium T-DNA right border (RB), and an Agrobacterium T-DNA left border (LB). Between the RB and LB is a plant selectable marker comprised of a maize ubiquitin promoter (ZmUbi1P), the phosphomannose isomerase coding sequence, and NOS terminator
(258) pAG4106 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 134) with ZmPepCP:ZmGWD1 RNAi+ZmUbi1P:xGZein27ss:BD22308:xHvVSD+OsUbi3P:HvAle:NtEGm:SEKDEL+ZmUbi1P:ZmUBQm:BAASS:P77853:S 158-30-108-35
(259) pAG4107 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 135) with ZmPepCP:ZmGWD2 RNAi+ZmUbi1P:xGZein27ss:BD22308:xHvVSD+OsUbi3P:HvAle:NtEGm:SEKDEL+ZmUbi1P:ZmUBQm:BAASS:P77853:S158-30-108-35
(260) pAG4108 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 136) with ZmPepCP:SbGWD1 RNAi+ZmUbi1P:xGZein27ss:BD22308:xHvVSD+OsUbi3P:HvAle:NtEGm:SEKDEL+ZmUbi1:ZmUBQm:BAASS:P77853:S158-30-108-35
(261) pAG4109 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 137) with ZmPepCP:SbGWD2 RNAi+ZmUbi1P:xGZein27ss:BD22308:xHvVSD+OsUbi3P:HvAle:NtEGm:SEKDEL+ZmUbi1P:ZmUBQm:BAASS:P77853:S 158-30-108-35
(262) pAG4110 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 138) with ZmPepCP:ZmGWD1 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+OsUbi3P:HvAle:NtEGm:SEKDEL:NOST+ZmPEPCP:BAASS:O43097:SEKDEL
(263) pAG4111 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 139) with ZmPepCP:ZmGWD2 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+OsUbi3P:HvAle:NtEGm:SEKDEL:NOST+ZmPepCP:BAASS:O43097:SEKDEL
(264) pAG4112 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 140) with ZmPepCP:ZmGWD1 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+ZmUbi1P:HvAle:NtEGm:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
(265) pAG4113 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 141) with ZmPepCP:ZmGWD2 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+ZmUbi1P:HvAle:NtEGm:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
(266) pAG4114 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 142) with ZmPepCP:ZmGWD1 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+ZmPepCP:xGZein27ss:BD25243:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
(267) pAG4115 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 143) with ZmPepCP:ZmGWD2 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+ZmPepCP:xGZein27ss:BD25243:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
(268) pAG4116 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 144) with ZmPepCP:ZmGWD1 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+OsUbi3P:HvAle:NtEGm:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
(269) pAG4117 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 145) with ZmPepCP:ZmGWD2 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+OsUbi3P:HvAle:NtEGm:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
(270) pAG4120 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 146) with ZmPepCP:ZmGWD1 RNAi+OsUbi3P:P0C2S1+OsUbi3P:HvAleSp:NtEGm:SEKDEL+ZmUbi1P:ZmUBQm:BAASS:P77853-T134-100-101:SEKDEL
(271) pAG4121 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 147) with ZmPEPCP:ZmGWD2 RNAi+OsUbi3P:P0C2S1+OsUbi3P:HvAleSp:NtEGm:SEKDEL+ZmUbi1P:ZmUBQm:BAASS:P77853-T134-100-101:SEKDEL
(272) pAG4124 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a triple stack construct (SEQ ID NO: 148) with ZmPepCP:ZmGWD1 RNAi+ZmPepCP:xGZein27ss:BD25243:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
(273) pAG4125 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a triple stack construct (SEQ ID NO: 149) with ZmPepCP:ZmGWD2 RNAi+ZmPepCP:xGZein27ss:BD25243:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
(274) pAG4206 is a derivative of pAG4003 carrying a triple stack construct (SEQ ID NO: 219) with ZmUbi1P:xGZein27:BD22308:xHvVSD+OsUbi3P:HvAle:NtEGm:SEKDEL:NosT+ZmPepCP:BAASS:O43097:SEKDEL
(275) pAG4514 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 150) with ZmPepCP:ZmGWD1 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+ZmPepCP:HvAle:NtEGm:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
(276) pAG4515 is a derivative of pAG4003 (SEQ ID NO: 76) carrying a quadruple stack construct (SEQ ID NO: 151) with ZmPepCP:ZmGWD2 RNAi+ZmUbi1P:xGZein27:BD22308:xHvVSD+ZmPepCP:HvAle:NtEGm:SEKDEL+ZmUbi1P:BAASS:EU591743:AS146-7:SEKDEL
Example 15
More Glucose can be Recovered from Transgenic Plants that Accumulate Starch and Express Cell Wall Degrading Enzymes
(277) Individual control and transgenic maize plants carrying either construct 4110 or 4111 were grown to maturity in a greenhouse. Leaves and stalks were harvested from the mature, senescent plants, dried, and milled. Samples from each were assayed for starch content, xylanase activity, and endoglucanase activity using (respectively) the Total Starch Assay Kit, Xylazyme, and Cellazyme chromogenic substrates from Megazyme (Bray, County Wicklow, Ireland). As shown in
(278) It is understood, therefore, that this invention is not limited to the particular embodiments disclosed, but is intended to cover all modifications which are within the spirit and scope of the invention as defined by the appended claims; the above description; and/or shown in the attached drawings.
REFERENCES
(279) Alvira, P et al., 2010. Pretreatment technologies for an efficient bioethanol production process based on enzymatic hydrolysis: A review. Bioresource technology, 101(13), pp. 4851-61. Available at: http://www.ncbi.nlm.nih.gov/pubmed/20042329 [Accessed Jul. 18, 2012]. An, G et al., 2005. Reverse genetic approaches for functional genomics of rice. Plant molecular biology, 59(1), pp. 111-23. Available at: http://www.ncbi.nlm.nih.gov/pubmed/16217606 [Accessed Jul. 25, 2012]. Ausubel F M, Brent R, Kingston R E, Moore D D, Seidman J G, Smith J A, Struhl K, 2000. Current Protocols in Molecular Biology Volume 1, John Wiley & Sons. Charles Abbas, Wuli Bao, Kyle Beery, Mike Cecava, Perry H. Doane, James L. Dunn, D. P. H., 2008. Abbas 2008 US Patent Application US20080220125.pdf., pp. 1-17. Chi-Ham C L et al., 2010. The intellectual property landscape for gene suppression technologies in plants. Nature Biotechnology, 28 (1):32-36. Christian, M et al., 2010. Targeting DNA double-strand breaks with TAL effector nucleases. Genetics, 186(2), pp. 757-61. Available at: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2942870&tool=pmcentrez&rendertype=abstract [Accessed Jul. 14, 2012]. Frizzi A & Huang S, 2010. Tapping RNA silencing pathways for plant biotechnology. Plant biotechnology journal, 8(6), pp. 655-77. Available at: http://www.ncbi.nlm.nih.gov/pubmed/20331529 [Accessed Jul. 24, 2012]. Horiguchi G, 2004. RNA silencing in plants: a shortcut to functional analysis. Differentiation; research in biological diversity, 72(2-3), pp. 65-73. Available at: http://www.ncbi.nlm.nih.gov/pubmed/15066186. Horton R M, Hunt H D, Ho S N, Pullen J and Pease L R, 1989. Engineering hybrid genes without the use of restriction enzymes: gene splicing by overlap extension. Gene 77(1):61-68. Kavakli I H et al., 2002. Generation, characterization, and heterologous expression of wild-type and up-regulated forms of Arabidopsis thaliana leaf ADP-glucose pyrophosphorylase. Planta, 215(3), pp. 430-9. Available at: http://www.ncbi.nlm.nih.gov/pubmed/12111225 [Accessed Jul. 25, 2012]. Kttin, O et al., 2009. STARCH-EXCESS4 is a laforin-like Phosphoglucan phosphatase required for starch degradation in Arabidopsis thaliana. The Plant cell, 21(1), pp. 334-46. Available at: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2648081&tool=pmcentrez&rendertype=abstract [Accessed Jul. 25, 2012]. Krishnan A, Guiderdoni E, An G, Hsing Y I, Han C D, Lee M C, Yu S M, Upadhyaya N, Ramachandran S, Zhang Q, Sundaresan V, Hirochika H, Leung H and Pereira A, 2009. Mutant resources in rice for functional genomics of the grasses. Plant Physiol. 149:165-70. Maniatis T, Fritsch E F and J. Sambrook J, 1982. Molecular Cloning Cold Spring Harbor Laboratory. Obana Y et al., 2006. Enhanced turnover of transitory starch by expression of up-regulated ADP-glucose pyrophosphorylases in Arabidopsis thaliana. Plant Science, 170(1), pp. 1-11. Available at: http://linkinghub.elsevier.com/retrieve/pii/S0168945205002670 [Accessed Jul. 25, 2012]. Puchta H, Dujon B and Hohn B, 1993. Homologous recombination in plant cells is enhanced by in vivo induction of double strand breaks into DNA by a site-specific endonuclease. Nucleic acids research, 21(22), pp. 5034-40. Available at: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=310614&tool=pmcentrez&rendertype=abstract. Sikora P et al., 2011. Mutagenesis as a tool in plant genetics, functional genomics, and breeding. International journal of plant genomics, 2011, p. 314829. Available at: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=3270407&tool=pmcentrez&rendertype=abstract [Accessed Jul. 25, 2012]. Smith A M and Zeeman S C, 2006. Quantification of starch in plant tissues. Nat. Protocols 1:1342-1345 Smith N A, Singh S P, Wang M B, Stoutjesdijk P A, Green A G and Waterhouse P M, 2000. Total silencing by intron-spliced hairpin RNA, Nature 407:319-20. Smith T F, Waterman M S, 1981. Identification of Common Molecular Subsequences. J Mol Biol 147: 195-197. Stitt M & Zeeman, Samuel C, 2012. Starch turnover: pathways, regulation and role in growth. Current opinion in plant biology, 15(3), pp. 282-92. Available at: http://www.ncbi.nlm.nih.gov/pubmed/22541711 [Accessed Jul. 16, 2012]. Stoutjesdijk P A, Singh S P, Liu Q, Hurlstone C J, Waterhouse P A and Green A G, 2002. hpRNA-mediated targeting of the Arabidopsis FAD2 gene gives highly efficient and stable silencing. Plant Physiol. 129(4): 1723-31. Till B J, Cooper J, Tai T H, Colowit P, Greene E A, Henikoff S, Comai L, 2007 Discovery of chemically induced mutations in rice by TILLING. BMC Plant Biol. 7:19. Vainstein A et al., 2011. Permanent genome modifications in plant cells by transient viral vectors. Trends in biotechnology, 29(8), pp. 363-9. Available at: http://www.ncbi.nlm.nih.gov/pubmed/21536337 [Accessed Jul. 30, 2012]. Warthmann N et al., 2008. Highly specific gene silencing by artificial miRNAs in rice. PloS one, 3(3), p.e1829. Available at: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2262943&tool=pmcentrez&rendertype=abstract [Accessed Jul. 25, 2012]. Wehrkamp-Richter S et al., 2009. Characterisation of a new reporter system allowing high throughput in planta screening for recombination events before and after controlled DNA double strand break induction. Plant physiology and biochemistry: PPB/Socit franaise de physiologie vgtale, 47(4), pp. 248-55. Available at: http://www.ncbi.nlm.nih.gov/pubmed/19136269 [Accessed Jul. 17, 2012]. Wright D et al., 2005. High-frequency homologous recombination in plants mediated by zinc-finger nucleases. The Plant journal
: for cell and molecular biology, 44(4), pp. 693-705. Available at: http://www.ncbi.nlm.nih.gov/pubmed/16262717 [Accessed Jul. 19, 2012]. Yu T S et al., 2001. The Arabidopsis sex1 mutant is defective in the R1 protein, a general regulator of starch degradation in plants, and not in the chloroplast hexose transporter. The Plant cell, 13(8), pp. 1907-18. Available at: =abstract. Zhang Z, Schwartz S, Wagner L, and Miller W, 2000. A greedy algorithm for aligning DNA sequences. J Comput Biol 2000; 7(1-2):203-14
(280) The references cited throughout this application, are incorporated for all purposes apparent herein and in the references themselves as if each reference was fully set forth. For the sake of presentation, specific ones of these references are cited at particular locations herein. A citation of a reference at a particular location indicates a manner(s) in which the teachings of the reference are incorporated. However, a citation of a reference at a particular location does not limit the manner in which all of the teachings of the cited reference are incorporated for all purposes.
(281) It is understood, therefore, that this invention is not limited to the particular embodiments disclosed, but is intended to cover all modifications which are within the spirit and scope of the invention as defined by the appended claims; the above description; and/or shown in the attached drawings.