METHODS, SYSTEMS, AND COMPOSITIONS FOR LEGUME-BASED PRODUCTION OF THERAPEUTIC PROTEINS AND THERAPEUTIC MEDICAL MATERIALS
20200115718 ยท 2020-04-16
Inventors
Cpc classification
A61K47/46
HUMAN NECESSITIES
A23L33/105
HUMAN NECESSITIES
C12N15/8257
CHEMISTRY; METALLURGY
A23C11/103
HUMAN NECESSITIES
C07K14/485
CHEMISTRY; METALLURGY
A61K9/0014
HUMAN NECESSITIES
International classification
C12N15/82
CHEMISTRY; METALLURGY
A61K9/00
HUMAN NECESSITIES
A23C11/10
HUMAN NECESSITIES
Abstract
Methods and compositions for producing proteins such as growth factors, antibodies, or other therapeutic proteins in legumes. The present invention also features medical materials comprising legume material from a transgenic legume and a recombinant protein produced by the transgenic legume. The material may be a bandage, gauze, an injectable composition, or the like. The material may further comprise other elements such as non-active plant elements, synthetic elements, or medications. Soybean plants may be non-allergenic soybean plants.
Claims
1. A transgenic legume expressing a protein, the protein being an animal or human protein.
2. The transgenic legume of claim 1, wherein the transgenic legume is a soybean, a lentil, a bean, a pea, or a peanut.
3. The transgenic legume of claim 1, wherein the expressed protein is a therapeutic protein, a bioregulatory protein, or an antibody.
4. The transgenic legume of claim 3, wherein the protein is a growth factor, wherein the growth factor includes EGF, FGF, PDGF, VEGF, IGF, HSF, TGF-alpha, TGF-beta, TNF-alpha, IL-1, Interferons, or a combination thereof.
5. The transgenic legume of claim 4, wherein the EGF protein is encoded by nucleic acid sequence according to SEQ ID NO: 1 or SEQ ID NO: 26 or a polynucleotide at least 90% identical thereto, wherein the polynucleotide encodes a protein having hEGF activity, or a functional fragment thereof.
6. The transgenic legume of claim 5, wherein the nucleotide sequence encodes a protein of SEQ ID NO: 12 or a polynucleotide sequence at least 90% identical thereto encoding a protein having hEGF activity, or a functional fragment thereof.
7. The transgenic legume of claim 3, wherein the bioregulatory or therapeutic protein, comprises insulin, fibronectin, or HIF-1 alpha.
8. The transgenic legume of claim 1, wherein the transgenic legume is a non-allergenic legume, wherein the transgenic legume is a non-allergenic soybean.
9. A composition comprising the animal or human protein according to any of claims 1-8, wherein the composition comprises soymilk.
10. A method of harvesting a recombinant protein expressed in a transgenic legume according to any of claims 1-9, said method comprising processing an entire plant of the transgenic legume.
11. The method of claim 10, wherein processing the entire plant comprises grinding. micro pulverizing, particulating, or morselating.
12. A medical material comprising an animal or human protein derived from a transgenic legume according to any of claims 1-11 and at least a portion of the transgenic legume that produced said animal protein.
13. The medical material of claim 10, wherein the material is for epidermal or dermal application, wherein the material comprises a partially vacuous, discontinuous or holey construct, wherein the medical material is fabricated as a gauze, mesh, sheet, film, fibrous construct, or a bandage, wherein the material is constructed by spin-coating, drop casting, spin casting, extrusion, electrospinning, film formation spraying, spray drying, drop casting, spin casting, extrusion, electrospinning, low-temperature thermoforming, micro-particle formation, nano-particle formation, micro-capsule formation, nano-capsule formation, or a combination thereof.
14. The medical material of claim 13 further comprising a non-active plant element, a synthetic element, excipient, or adjuvant.
15. The medical material of claim 14, comprises hydrogel, aerogel or organogels or a combination thereof, wherein the excipient or adjuvant comprises a colloidal binder, a gelatin, polyethylene glycol (PEG), PEG-lactide, Plutronics, Tetronics, Carbopol, Eudragits, Agar, Pectin, Guar gum, alginates, PVA, carboxymethycellulose, Hyaluronic acid, or a combination thereof.
16. The medical material of claim 15, wherein the element is hygroscopic or hydrophobic.
17. The medical material of claim 13 further comprising a marker or sensor or means of detection, wherein the sensor is for providing feedback information as to status of the topical condition and subepidermal or subdermal condition under or adjacent to the applied location.
18. The medical material of claim 17, wherein the marker is a pH indicator, wherein the marker is for detecting infection.
19. The medical material of claim 13 further comprising a medication, wherein the medication is an anti-inflammatory medication, an anti-bacterial medication, an anti-microbial medication, an antifungal medication, an anesthetic medication, or a combination thereof.
20. The medical material of claim 13 further comprising a perfumant, wherein the medical material comprises a compound for reducing odor.
21. The medical material of claim 13 further comprising non-allergenic soy, a cell, or cell product, wherein the material comprises or delivers a cell product or constituent such as platelet-rich plasma (prp) or extract, a viral vector, gene, plasmid, episome or bacteriophage, siRNA, aptamer, genetic material, bacteriophage, or a combination thereof, wherein the cell or cell product delivers live, dead or attenuated epithelial cells, platelets or white blood cells, or a combination thereof.
22. A method of treating a topical condition, said method comprising applying to the topical condition a medical material according to any of claims 11-20, wherein the medical material reduces inflammation.
23. A system of treatment involving application of the therapeutic construct, monitoring its status via the contained sensors/indicators and then removing and/or re-application pending sensor readout.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0023] The features and advantages of the present invention will become apparent from a consideration of the following detailed description presented in connection with the accompanying drawings in which:
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
DETAILED DESCRIPTION OF THE INVENTION
[0036] Referring now to
[0037] The present invention shows the accumulation of human EGF (hEGF) in genetically engineered soybean seeds. Further, the present invention shows that the recombinant EGF is indistinguishable from authentic human EGF and is bioactive at stimulating EGF receptor (EGFR) activity. Briefly, the present invention utilizes transgenic soybean seeds expressing a seed-specific codon optimized gene encoding of the human EGF protein with an added ER signal tag at the N terminal. Seven independent lines were grown to homozygous and found to accumulate a range of 6.7+/3.1 to 129.0+/36.7 ug EGF/g of dry soybean seed. Proteomic and immunoblot analysis indicate that the inserted EGF is the same as the human EGF protein. Phosphorylation and immunohistochemical assays on the EGF receptor in HeLa cells indicate the EGF protein produced in soybean seed is bioactive and comparable to commercially available human EGF.
[0038] To produce hEGF in soybean, a strong soybean seed-specific promoter and terminator was used to regulate gene expression of a synthetic soybean codon optimized hEGF (ShEGF) gene that included an N-terminal 60 nucleotide ER-signal sequence (
[0039] The hEGF soybeans and non-transgenic soybeans were evaluated to determine the biochemical authenticity of the soybean-produced EGF protein. Using 1D SDS/PAGE and parallel immunoblots probed with anti-EGF, the soluble low molecular weight (<10 kDa) seed proteins and the Mr of the soybean-produced hEGF was evaluated. The total protein polypeptide of the hEGF expressing lines appeared to be identical to the standard parental control (See
Soybean-Milk is Compatible with EGF Bioactivity
[0040] To evaluate the potential of EGF activity in soymilk delivery, commercial recombinant human EGF (rhEGF) was added as a supplement to soymilk and the intrinsic activity of the EGF was tested with a HeLa cell assay.
Soybean-Synthesized hEGF is Bioactive
[0041] To assess the bioactivity of soybean-produced hEGF, samples were prepared from both ShEGF transgenic soybean lines and non-transgenic controls that were used to stimulate HeLa cells to induce EGFR internalization, degradation and phosphorylation. As shown in
Synthesis of hEGF does not Affect Overt Soybean Seed Composition
[0042] To test for potential collateral composition in the hEGF-producing soybeans, the ShEGF transgenic and non-transgenic control soybeans were analyzed by non-targeted proteomics and metabolomics. Among the significant proteins identified include various well-documented allergens and anti-metabolite proteins. A comparison of standard soybeans with hEGF-producing soybean lines showed that there was no significant difference (p=0.01) between non-transgenic control and ShEGF transgenic soybeans aside from the targeted production of hEGF for any other proteins of concern. This data is available in PRIDE partner repository with the dataset identifier PXD003326 and 10.6019/PXD003326.
[0043] Non-targeted small molecule metabolomics was used to conduct a parallel analysis of the non-transgenic and hEGF soybeans. Again there were insignificant differences between non-transgenic soybean seeds and the ShEGF transgenic seeds (see
EXAMPLE 1
[0044] The following Example describes non-limiting methods associated with the present invention.
Transgenic EGF Soybean Seeds
[0045] Epidermal growth factor protein from humans was produced in soybean seeds by constructing a plant gene expression cassette that involved a synthetic codon optimized EGF nucleotide sequence (protein sequence from Genbank accession CCQ43157). This 162 bp open reading frame was placed in-frame behind a 20-amino acid endoplasmic reticulum (ER) signal sequence from the Arabidopsis chitinase gene [30,31]. The ER-directed EGF encoding open reading frame was developmentally regulated by the strong seed-specific storage protein glycinin regulatory elements [31]. The entire seed specific cassette to direct EGF production was placed in a vector containing the hygromycin resistance gene under the strong constitutive expression of the potato ubiquitin 3 regulatory elements as previously described (Schmidt M A, Herman E M. The Collateral Protein Compensation Mechanism Can Be Exploited To Enhance Foreign Protein Accumulation In Soybean Seeds. Plant Biotechnol J. 2008; 6: 832-842; Schmidt M A, Herman E M. A RNAi knockdown of soybean 24 kda oleosin results in the formation of micro-oil bodies that aggregate to form large complexes of oil bodies and ER containing caleosin. Mol Plant. 2008; 1: 910-924; Schmidt M A, Parrott W A, Hildebrand D F, Berg R H, Cooksey A, Pendarvis K, et al. Transgenic soybean seeds accumulating -carotene exhibit the collateral enhancements of high oleate and high protein content traits. Plant Biotechnol J. 2015; 13: 590-600). The result plasmid pGLY::ShEGF was sequenced using a glycinin promoter primer (5 TCATTCAC CTTCCTCTCTTC 3) to ensure the EGF open reading frame was placed correctly between the regulatory elements. Somatic soybean (Glycine max L. Merrill cv Jack (wild type)) embryos were transformed via biolistics using 30 mg/L hygromycin B selection and regenerated as previously described (Schmidt M A, Tucker D M, Cahoon E B, Parrott W A. Towards normalization of soybean somatic embryo maturation. Plant Cell Rep. 2004; 24: 383-391). Embryos from resistant lines were analyzed by genomic PCR to confirm the presence of inserted hygromycin cassette using primers specific to the hygromycin gene (HygF 5CTCACTATTCCTTTGCCCTC3 and HygR 5CTGACCTATTGCATCTCCCG3), cetyl trimethyl ammonium bromide (CTAB) extraction genomic DNA isolation and the following amplification conditions: 150 ng genomic DNA in 25 l total reaction containing 200 nM primers and 3 U Taq polymerase (NEB) and the following cycling parameters (initial 95 C. 4 min then 45 cycles of 95 C. 30 s, 55 C. 45 s, 72 C. 90 s; followed by a final extension of 72 C. 7 min). Dry seeds from two successive generations of PCR positive plants were analyzed by ELISA for the expression of EGF protein until all 7 lines were confirmed to be homozygous. EGF transgenic soybean plants along with non-transgenic control wild type cultivar plants were grown side by side in a greenhouse at 25 C. under 16 h daylight with 1000 m-2/s.
[0046] As previously discussed, the present invention features compositions comprising nucleic acid sequence, SEQ ID NO: 1 of Table 1 below. The vector of SEQ ID NO: 1 comprises a modified hEGF gene (the sequence within SEQ ID NO: 1 that encodes hEGF is outlined). The optimized hEGF nucleic acid sequence is not limited to SEQ ID NO: 1 and comprises a nucleic acid that encodes a peptide of interest.
[0047] In some embodiments, the nucleic acid is at least about 90% identical to SEQ ID NO: 1. In some embodiments, the nucleic acid is at least about 93% identical to SEQ ID NO: 1. In some embodiments, the nucleic acid sequence is at least about 95% identical to SEQ ID NO: 1. In some embodiments, the nucleic acid sequence is at least about 98% identical to SEQ ID NO: 1. In some embodiments, the nucleic acid sequence is at least about 99% identical to SEQ ID NO: 1. Non-limiting examples of such nucleic acid sequences can be found in Table 1 below. For example, SEQ ID NO: 2 and SEQ ID NO: 7 are sequences for a modified hEGF that is about 99% identical to SEQ ID NO: 1. SEQ ID NO: 3 and SEQ ID NO: 8 are sequences for a modified EGF that is about 98% identical to SEQ ID NO: 1; SEQ ID NO: 4 and SEQ ID NO: 9 are sequences for a modified EGF that is about 95% identical to SEQ ID NO: 1 (note that the bold letters in Table 1 are nucleotide substitutions as compared to SEQ ID NO: 1, and the codon underlined).
TABLE-US-00001 TABLE1 ExamplesofNucleicAcidSequenceIdentity90%toSEQIDNO:1 % Seq Alignment ID toSEQID NO Description NucleicAcidSequence NO:1 1 OptimizedEGFsequencefor tctctttcttcagccgaaaattccgatagtgagtgtc 100 soybeantransformation cactctcccatgatggctattgtttgcacgacgga gtttgcatgtatattgaagctttggataagtacgcat gtaactgcgttgtgggatatatcggtgaaagatgc caatacagggacctcaaatggtgggagctgag ataa 2 OptimizedEGFsequencefor tctctttcttcagccgaaacttccgatagtgagtgtc 99 soybeantransformationwith2 cactctcccatgatggctattgtttgcacgacgga basesubstitutionfor99% gttcgcatgtatattgaagctttggataagtacgca sequenceidentitytoSeqID1 tgtaactgcgttgtgggatatatcggtgaaagatg ccaatacagggacctcaaatggtgggagctga gataa 3 OptimizedEGFsequencefor tctctttcttcagccgaaaactccgctagtgagtgt 98 soybeantransformationwith4 ccactctcccatgatggctattgtttgcacgacgg basesubstitutionfor98% agttcgcatgtatattgaagctttggataagtacgc sequenceidentitytoSeqID1 atataactgcgttgtgggatatatcggtgaaagat gccaatacagggacctcaaatggtgggagctg agataa 4 OptimizedEGFsequencefor tctctttcttcagccgaaaactccgctagtgagtgt 95 soybeantransformationwith9 tcactctcccatgatggcgattgtttgcacgacgg basesubstitutionfor95% agttcgcatgtatattgaagctttggataagtacgc sequenceidentitytoSeqID1 atataactgcgttgtggaatatcggtgaaaga ggccaatacagggacctcaaacggtgggagct gagataa 5 OptimizedEGFsequencefor tctctttcttcagccgaaaactccgctattgagtgtt 93 soybeantransformationwith13 cactctcccctgatggcgattgtttgcacgacgga basesubstitutionfor93% gttcgcatgtatattgaagctttgtataagtacgcat sequenceidentitytoSeqID1 ataactgcgttgtggaatatatcggtgaaagagg ccaatacaggaacctcaaacggtgggagctga gataa 6 OptimizedEGFsequencefor tctctttcttcagccgaaaactccgctattgagtgtt 90 soybeantransformationwith18 cactctcccctgatggcgattgtttgcaagacgta basesubstitutionfor90% gttcgcatgtatagtgaagctttgtataagtacgc sequenceidentitytoSeqID1 atataactgcgttgtggaatatctcggtgaaaga ggccaatacaggaacctcaaacggtggaagct gagataa 7 OptimizedEGFsequencefor tctctttcttcagccgaaaatcccgatagtgagtgt 99 soybeantransformationwith2 ccactctgccatgctggctattgttcgcacgacgg basesubstitutionfor99% agtttgcatgtatattgtagctgtggataagtacgc sequenceidentitytoSeqID1 atgtaactgcgctgtgggatatatcggtgcaagat gccaatacagcgacctcaaatggtgggacccg agataa 8 OptimizedEGFsequencefor tctctttcttcagccgaaaatcccgatcgtgagtgt 98 soybeantransformationwith4 ccactctgccatgctggctattgttcgcacgacgg basesubstitutionfor98% agtttgcatgtatattgtagctgtggataagtacgc sequenceidentitytoSeqID1 atgtaactgcgctgtgggatatatcggtgcaagat gccaatacagcgacctcaaatggtgggacccg agataa 9 OptimizedEGFsequencefor tctctttcttcagccgaaaatcccgatcgtgagtgt 95 soybeantransformationwith9 ccactctgccatgctggctattgttcgcacgacgg basesubstitutionfor95% agtttgcatgtatattgtagctgtggataagtacgc sequenceidentitytoSeqID1 atgtaactgctgtgggatatatcggtgcaagat gccaatacagcgacctcaaatggtgggacccg agataa 10 OptimizedEGFsequencefor tctctttcttcagccgaaaatcccgatcgtgcgtgt 93 soybeantransformationwith13 ccactctgccatgctgtctattgtcgcacgacgg basesubstitutionfor93% agtttgcatgtatattgtagctgtggataagtacgc sequenceidentitytoSeqID1 atgtaactgcgctgtgggatatatcggtgcaagat gccaatacagcgacctcaaatggtgggacccg agataa 11 OptimizedEGFsequencefor tctctttcttcagccgaaaatcccgatcgtgcgtgt 90 soybeantransformationwith18 ctactctgccatgctgtctattgttcgcacgacag basesubstitutionfor90% agtttgcatgtatattgtagctgtggataagtactc sequenceidentitytoSeqID1 atgtaactgcgctgtgggatgtatcggtgcaaga tgccaatacagcgacctcaattggtgggagccg agataa Bold letters are nucleotide substitutions within a codon; the respective codon is underlined
[0048] The vector comprises a nucleic acid that encodes a peptide of interest. In some embodiments, the nucleic acid sequence is at least about 90% identical to SEQ ID NO: 1. In some embodiments, the nucleic acid sequence is at least about 93% identical to SEQ ID NO: 1. In some embodiments, the nucleic acid sequence is at least about 95% identical to SEQ ID NO: 1. In some embodiments, the nucleic acid sequence is at least about 98% identical to SEQ ID NO: 1. In some embodiments, the nucleic acid sequence is at least about 99% identical to SEQ ID NO: 1. Non-limiting examples of resulting amino acid sequences encoded by such nucleic acid sequences can be found in Table 2 below. For example, SEQ ID NO: 12 and SEQ ID NO: 18 are amino acid sequences encoded by modified hEGF polynucleotide sequences of Seq ID NO: 2 and SEQ ID NO: 6, respectively, that are about 99% identical to SEQ ID NO: 1 (note that the bold letters in Table 2 are amino acid substitutions as compared to SEQ ID NO: 12).
TABLE-US-00002 TABLE2 ExamplesofAminoAcidSequencewithNucleicAcidIdentity90% Seq %Alignment ID toSeqID NO Description AminoAcidSequence NO:1 12 OptimizedEGF SLSSAENSDSECPLSHDGYCLHDGVCMY 100 sequenceforsoybean IEALDKYACNCVVGYIGERCQYRDLKWW transformation ELR 13 OptimizedEGF SLSSAETSDSECPLSHDGYCLHDGVRMY 99 sequenceforsoybean IEALDKYACNCVVGYIGERCQYRDLKWW transformationwith2 ELR basesubstitutionfor99% sequenceidentitytoSeq ID1 14 OptimizedEGF SLSSAETSASECPLSHDGYCLHDGVRMY 98 sequenceforsoybean IEALDKYAYNCVVGYIGERCQYRDLKWW transformationwith4 ELR basesubstitutionfor98% sequenceidentitytoSeq ID1 15 OptimizedEGF SLSSAETSASECSLSHDGDCLHDGVRMY 95 sequenceforsoybean IEALDKYAYNCVVEYIGERGQYRDLKRWE transformationwith9 LR basesubstitutionfor95% sequenceidentitytoSeq ID1 16 OptimizedEGF SLSSAETSAIECSLSPDGDCLHDGVRMYI 93 sequenceforsoybean EALYKYAYNCVVEYIGERGQYRNLKRWE transformationwith13 LR basesubstitutionfor93% sequenceidentitytoSeq ID1 17 OptimizedEGF SLSSAETSAIECSLSPDGDCLQDVVRMYS 90 sequenceforsoybean EALYKYAYNCVVEYLGERGQYRNLKRW transformationwith18 KLR basesubstitutionfor90% sequenceidentitytoSeq ID1 18 OptimizedEGF SLSSAENADSECPLSHDGYCLHDGVCMY 99 sequenceforsoybean IVALDKYACNCVVGYIGERCQYRDLKVWV transformationwith2 ELR basesubstitutionfor99% sequenceidentitytoSeq ID1 19 OptimizedEGF SLSSAENADRECPLSHAGYCLHDGVCMY 98 sequenceforsoybean IVALDKYACNCVVGYIGERCQYRDLKVWV transformationwith4 ELR basesubstitutionfor98% sequenceidentitytoSeq ID1 20 OptimizedEGF SLSSAENADRECPLCHAGYCSHDGVCM 95 sequenceforsoybean YIVALDKYACNCAVGYIGERCQYSDLKW transformationwith9 WEPR basesubstitutionfor95% sequenceidentitytoSeq ID1 21 OptimizedEGF SLSSAENADRACPLCHAVYCSHDGVCM 93 sequenceforsoybean YIVAVDKYACNCAVGYIGARCQYSDLKW transformationwith13 WEPR basesubstitutionfor93% sequenceidentitytoSeq ID1 22 OptimizedEGF SLSSAENADRACLLCHAVYCSHDRVCMY 90 sequenceforsoybean IVAVDKYSCNCAVGCIGARCQYSDLNWW transformationwith18 EPR basesubstitutionfor93% sequenceidentitytoSeq ID1 Bold letters are nucleotide substitutions within a codon; the respective codon is underlined.
[0049] The present invention also features compositions comprising nucleic acid SEQ ID NO: 26 of Table 3 below. The vector of SEQ ID NO: 1 comprises a modified hEGF gene comprising a modified polynucleotide for the protein-coding region of hEGF, SEQ ID NO: 26 (the sequence within SEQ ID NO: 1 that encodes hEGF is outlined). The optimized hEGF nucleic acid protein-coding sequence is not limited to SEQ ID NO: 26 and comprises a nucleic acid that encodes a peptide of interest.
[0050] In some embodiments, the hEGF protein-coding nucleotide sequence is at least 90% identical to SEQ ID NO: 26. In some embodiments, the nucleic acid is at least 93% identical to SEQ ID NO: 26. In some embodiments, the nucleic acid is at least 95% identical to SEQ ID NO: 26. In some embodiments, the nucleic acid is at least 98% identical to SEQ ID NO: 26. In some embodiments, the nucleic acid is at least 99% identical to SEQ ID NO: 26. Non-limiting examples of such nucleic acid sequences can be found in Table 3 below. For example, SEQ ID NO: 27 is a sequence for a modified hEGF that is about 99% identical to SEQ ID NO: 26. SEQ ID NO: 28 is a sequence for a modified EGF that is about 98% identical to SEQ ID NO: 26; SEQ ID NO: 29 is a sequence for a modified EGF that is about 95% identical to SEQ ID NO: 26 (note that the bold letters in Table 3 are nucleotide substitutions as compared to SEQ ID NO: 26, and the codon underlined).
TABLE-US-00003 TABLE3 ExamplesofNucleicAcidSequenceIdentity90%toCodingRegionofSEQID NO:26 % Alignment Seq toSEQID ID Description NucleicAcidSequence NO:26 26 CodingRegionofSEQID:1 aattccgatagtgagtgtccactctcccatgatgg 100 ctattgtttgcacgacggagtttgcatgtatattgaa gctttggataagtacgcatgtaactgcgttgtggg atatatcggtgaaagatgccaatacagggacct caaatggtgggagctgagataa 27 CodingRegionofSEQID:1with acttccgatagtgagtgtccactctcccatgatgg 99 2basesubstitutionfor99% ctattgtttgcacgacggagttcgcatgtatattga sequenceidentitytoSeqID26 agctttggataagtacgcatgtaactgcgttgtgg gatatatcggtgaaagatgccaatacagggacc tcaaatggtgggagctgagataa 28 CodingRegionofSEQID:1with aactccgctagtgagtgtccactctcccatgatgg 98 4basesubstitutionfor98% ctattgtttgcacgacggagttcgcatgtatattga sequenceidentitytoSeqID26 agctttggataagtacgcatataactgcgttgtgg gatatatcggtgaaagatgccaatacagggacc tcaaatggtgggagctgagataa 29 CodingRegionofSEQID:1with aactccgctagtgagtgttcactctcccatgatgg 95 9basesubstitutionfor95% cgattgtttgcacgacggagttcgcatgtatattga sequenceidentitytoSeqID26 agatttggataagtacgcatataactgcgttgtgg aatatatcggtgaaagaggccaatacagggac ctcaaacggtgggagctgagataa 30 CodingRegionofSEQID:1with aactccgctattgagtgttcactctcccctgatgg 93 12basesubstitutionfor93% cgattgtttgcacgacggagttcgcatgtatattga sequenceidentitytoSeqID26 agctttgtataagtacgcatataactgcgttgtgga atatatcggtgaaagaggccaatacaggaacct caaacggtgggagctgagataa 31 CodingRegionofSEQID:1with aactccgctattgagtgttcactctcccctgatgg 90 17basesubstitutionfor90% cgattgtttgcaagacgtagttcgcatgtatagtg sequenceidentitytoSeqID26 aagctttgtataagtacgcatataactgcgttgtgg aatatctcggtgaaagaggccaatacaggaac ctcaaacggtggaagctgagataa Bold letters are nucleotide substitutions within a codon; the respective codon is underlined
[0051] The present invention also features compositions comprising nucleic acid sequence, SEQ ID NO: 32 of Table 4 below. The vector of SEQ ID NO: 1 comprises a modified hEGF gene comprising a polynucleotide for the non-hEGF protein coding region, SEQ ID NO: 32. The non-hEGF protein coding sequence of the optimized hEGF nucleotide is not limited to SEQ ID NO: 32. In some embodiments, the 3 end of SEQ ID NO: 32 is operatively coupled to the 5 end of SEQ ID NO: 26.
[0052] In some embodiments, the non-hEGF protein coding nucleotide sequence is at least 90% identical to SEQ ID NO: 32. Non-limiting examples of such nucleic acid sequences can be found in Table 4 below. For example, SEQ ID NO: 33 is a sequence that is at least 90% (<100%) identical to SEQ ID NO: 32 (note that the bold letters in Table 4 are nucleotide substitutions as compared to SEQ ID NO: 26, and the codon underlined).
TABLE-US-00004 TABLE4 ExamplesofNucleicAcidSequenceIdentity90%toNon-hEGF ProteinCodingRegionofSEQIDNO:32 Seq %Alignmentto ID Description NucleicAcidSequence SEQIDNO:32 32 Optimizednon-hEGFproteincoding tctctttcttcagccgaa 100 regionnucleicacidsequence 33 Optimizednon-hEGFproteincoding tctttttcttcagccgaa 95<100 regionsequencewith1base substitutionforatleast90% sequenceidentitytoSeqID32 34 Optimizednon-hEGFproteincoding tctttttcttaagccgaa 90<95 regionsequencewith2base substitutionforatleast90% sequenceidentitytoSeqID32 Bold letters are nucleotide substitutions within a codon; the respective codon is underlined
EGF Detection Via Immunoblot
[0053] Total soluble protein was extracted from dry seeds of two homozygous EGF lines and a non-transgenic control by repeated acetone washes followed by acetone precipitation with the protein pellet dissolved in water. Proteins with molecular weight 10 kDa and under were isolated by separately passing each extract through an Amicon Ultra centrifugal filter (Merck, Kenilworth N.J.). The samples were each suspended in sample buffer (50 mM Tris HCL, pH6.8 2% SDS (w/v), 0.7 M -mercaptoethanol, 0.1% (w/v) bromphenol blue and 10% (v/v) glycerol) and then denaturated 5 min 95 C. Protein content was determined by Bradford assay. A 15% SDS-PAGE gel was used to separate 30 g protein for each of the three samples: negative control wild type, Lines 4 and 5 of EGF transgenic soybean dry seeds. Commercially available human EGF (Gibco, Life Technologies, United Kingdom) was used at 0.5 g as positive control. Gel was electroblotted onto Immobilon P transfer membrane (Millipore, Bedford Mass.) and blocked with 3% milk solution in TBS for at least 1 hr. Primary antibody was a commercially available anti-EGF (Calbiochem, San Diego Calif.) and was used in a 1:100 ratio in 3% BSA-TBS buffer overnight at room temperature. After 3 washes of 15 mins each with TBS buffer, the blot was incubated with a 1:10,000 ratio in TBS of secondary antibody anti-rabbit IgG Fabspecific alkaline phosphatase conjugate (Sigma, St. Louis Mo.). After 3 washes, the presence of the EGF protein was detected by using a color substrate (BCIP/NBT: final concentrations 0.02% (w/v) 5-bromo-4-chloro-3-indoyl phosphate and 0.03% (w/v) nitro blue tetrazolium in 70% (v/v) demenjkoplthylformadmide) (KPL, Gaithersburg Mass.).
EGF Quantification
[0054] Total soluble protein was extracted from dry soybean seeds as described previously (Schmidt M A, Herman E M. The Collateral Protein Compensation Mechanism Can Be Exploited To Enhance Foreign Protein Accumulation In Soybean Seeds. Plant Biotechnol J. 2008; 6: 832-842; Schmidt M A, Herman E M. A RNAi knockdown of soybean 24 kda oleosin results in the formation of micro-oil bodies that aggregate to form large complexes of oil bodies and ER containing caleosin. Mol Plant. 2008; 1: 910-924) from all 7 lines of pGLY::ShEGF transgenic plants along with non-transgenic seeds as a negative control. EGF was quantitated by commercially available human EGF ELISA assay (Quantikine ELISA kit from R&D systems, Minneapolis Minn.) according to the manufacturer's instructions. The provided positive control was used to create a standard curve in order to determine the amount of EGF in each soybean protein extract. Each homozygote EGF transgenic line was assayed with three biological replicates and results displayed as mean +/ standard error.
Seed Proteome Composition Analysis
[0055] Total soluble proteins were extracted, quantitated and suspended in sample loading buffer as previously described (Schmidt M A, Herman E M. The Collateral Protein Compensation Mechanism Can Be Exploited To Enhance Foreign Protein Accumulation In Soybean Seeds. Plant Biotechnol J. 2008; 6: 832-842; Schmidt M A, Herman E M. A RNAi knockdown of soybean 24 kda oleosin results in the formation of micro-oil bodies that aggregate to form large complexes of oil bodies and ER containing caleosin. Mol Plant. 2008; 1: 910-924). Approximately 30 g of protein extract from dry seeds of 4 homozygous EGF lines were separated on a 4-20% gradient SDS-PAGE gel (BioRad, Hercules Calif.) along with extract from a non-transgenic seed. The gel was subsequently stained with 0.1% (w/v) Coomassie Brilliant Blue R250 in 40% (v/v) methanol, 10% (v/v) acetic acid overnight and then destained for approximately 3 hrs in 40% methanol, 10% acetic acid with frequent solution changes.
Mass Spectrometry Analysis to Detect EGF in Soybean Samples
[0056] Total soluble protein was extracted from 3 biological EGF transgenic soybean dry seed samples, lines 4, 5 and 6. As described above, proteins with molecular weights lowers than 10 kDa were concentrated using an Amicon Ultra centrifugal filter (Merck, Kenilworth N.J.). Non-transgenic seeds were used as a negative control and 5 g commercially available EGF (as above in immunoblot section) was the positive control. Protein was precipitated by adjusting the solution to 20% (v/v) trichloroacetic acid and allowed to sit at 4 C. overnight. Precipitated proteins were pelleted using centrifugation, washed twice with acetone and then dried using vacuum centrifugation. The commercial EGF was not filtered or precipitated, only dried. Dried pellets were rehydrated with the addition of 10 l 100 mM dithiothreitol in 100 mM ammonium bicarbonate and placed at 85 C. for 5 minutes to reduce disulphide bonds. Samples were then alkylated with addition of 10 l iodacetamide in 100 mM ammonium bromide and placed at room temperature in the dark for 30 minutes. Two g trypsin in 200 l 100 mM ammonium bromide was added to each samples and placed in 37 C. overnight for enzymatic digestion. Post trypsin digest samples were desalted using a peptide reverse phase microtrap (Michrom BioResources, Auburn Calif.), dried and ultimately resuspended in 2 l of 2% (v/v) acetonitrile, 0.1% (v/v) for-mic acid. Separation of peptides was performed using a Dionex U3000 splitless nanoflow HPLC system operated at 333 nl minute using a gradient from 2-50% acetonitrile over 60 minutes, followed by a 15 minute wash with 95% acetonitrile and a 15 minute equilibration with 2% acetonitrile. The C18 column, an in-house prepared 75 m by 15 cm reverse phase column packed with Halo 2.7 m, 90 C18 material (MAC-MOD Analytical, Chadds Ford Pa.) was located in the ion source just before a silica emitter. A potential of 2100 volts was applied using a liquid junction between the column and emitter. A Thermo LTQ Velos Pro mass spectrometer using a nanospray Flex ion source was used to analyze the eluate from the U3000. Scan parameters for the LTQ Velos Pro were one MS scan followed by 10 MS/MS scans of the 5 most intense peaks. MS/MS scans were performed in pairs, a CID fragmentation scan followed a HCD fragmentation scan of the same precursor m/z. Dynamic exclusion was enabled with a mass exclusion time of 3 min and a repeat count of 1 within 30 sec of initial m/z measurement. Spectra were collected over the entirety of each 90 minute chromatography run. Raw mass spectra were converted to MGF format using MSConvert, part of the ProteoWizard software library (Kessner D, Chambers M, Burke R, Agus D, Mallick P. ProteoWizard: open source software for rapid proteomics tools development. Bioinformatics. 2008; 24: 2534-2536) X!tandem 2013.09.01.1 (Craig R, Beavis R C. TANDEM: matching proteins with tandem mass spectra. Bioinformatics. 2004; 20: 1466-1467) and OMSSA (Geer L Y, Markey S P, Kowalak J A, Wagner L, Xu M, Maynard D M, et al. Open mass spectrometry search algorithm. J Proteome Res. 2004; 3: 958-964) algorithms were employed via the University of Arizona High Performance Computing Center to perform spectrum matching. Precursor and fragment mass tolerance were set to 0.2 Daltons for both OMSSA and X!tandem. Trypsin cleavage rules were used for both algorithms with up to 2 missed cleavages. Amino acid modifications search consisted of single and double oxidation of methionine, oxidation of proline, N-terminal acetylation, carbamidomethylation of cysteine, deamidation of asparagine and glutamine and phosphorylation of serine, threonine, and tyrosine. X!tandem xml and OMSSA xml results were filtered using Perl to remove any peptide matches with an E-value>0.05 as well as proteins identified by a single peptide sequence. The protein fasta database for Glycine max was downloaded on Aug. 5, 2015 from NCBI RefSeq with the addition of the EGF amino acid sequence. A randomized version of the Glycine max fasta was concatenated to the original as a way to assess dataset quality. The mass spectrometry proteo-mics data have been deposited to the ProteomeXchange Constortium (http://proteomecentral. proteomexchange.org) via the PRIDE partner repository (Guo J, Longshore S, Nair R, Warner B W. Retinoblastoma protein (pRb), but not p107 or p130, is required for maintenance of enterocyte quiescence and differentiation in small intestine. J Biol Chem. 2009; 284:134-40) with the dataset identifier PXD003326 and 10.6019/PXD003326.
Cell Culture, Western Blotting and Immunocytochemistry
[0057] Hela cells (obtained from American Tissue Culture Collection) were cultured in Minimum Essential Media (MEM) complemented with 10% Fetal Bovine Serum (FBS), 100 units/ml penicillin, and 100 g/ml streptomycin. For western blotting assay, cells grown in 6-well plate were kept in serum free MEM media for 24 hours. Cells were then either kept in serum free medium (control) or stimulated with soymilk alone, soy EGF or commercial recombined human EGF for different time period as indicated. Cells were lysed by directly adding 1SDS sample buffer (50 mM Tris-HCl, pH 6.8, 10% glycerol, 2% SDS and 5% -ME) to the cells after washing 3 times with 1PBS. EGF bio-activity was determined via EGFR phosphorylation and down-stream AKT phosphorylation. Total EGFR was also measured since EGFR is known to undergo internalization when stimulated with EGF. Antibodies used in western blot are anti-p-EGFR (Tyr1068) (#2234, Cell Signaling Technology), anti-total EGFR (#06-847, Millipore), anti-p-AKT (#4060, Cell Signaling Technology) and anti-Lamin B1 (#13435, Cell Signaling Technology) [40]. For immunocytochemistry assay, cells were grown on coverslip in 6-well plate and kept in serum free media for 24 hours before stimulation, cells were then either kept in serum free media (control) or stimulated with human or soy EGF for 6 hours. Cells were washed with PBS and fixed with 4% formalin. EGFR was labeled using anti-EGFR antibody (#4267, Cell Signaling Technology) and detected with Alexa Fluor 594 Goat anti-rabbit IgG (# A11012, Life Technology). The cell nuclei were shown using mounting medium with DAPI (# H-1200, Vectorshield).
[0058] Various modifications of the invention, in addition to those described herein, will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims. Each reference cited in the present application is incorporated herein by reference in its entirety.
[0059] Although there has been shown and described the preferred embodiment of the present invention, it will be readily apparent to those skilled in the art that modifications may be made thereto which do not exceed the scope of the appended claims. Therefore, the scope of the invention is only to be limited by the following claims. Reference numbers recited in the claims are exemplary and only for ease of review by the patent office and are not limiting in any way. In some embodiments, the figures presented in this patent application are drawn to scale, including the angles, ratios of dimensions, etc. In some embodiments, the figures are representative only and the claims are not limited by the dimensions of the figures. In some embodiments, descriptions of the inventions described herein using the phrase comprising includes embodiments that could be described as consisting of, and as such the written description requirement for claiming one or more embodiments of the present invention using the phrase consisting of is met.
[0060] The reference numbers recited in the below claims are solely for ease of examination of this patent application, and are exemplary, and are not intended in any way to limit the scope of the claims to the particular features having the corresponding reference numbers in the drawings.