RNA virus-derived plant expression system
10287602 ยท 2019-05-14
Assignee
Inventors
- Sylvestre Marillonnet (Halle, DE)
- Carola Engler (Halle, DE)
- Victor Klimyuk (Leipzig, DE)
- Yuri Gleba (Berlin, DE)
Cpc classification
C12N15/8257
CHEMISTRY; METALLURGY
International classification
Abstract
A process of expressing a sequence of interest in a plant, plant part, or plant cell culture, comprising: (a) providing a plant, plant part, or plant cell culture containing in cell nuclei a heterologous DNA having a sequence encoding an RNA replicon operably linked or linkable to a transcription promoter, wherein said sequence encoding an RNA replicon contains (i) sequences for replicon function of said RNA replicon, said sequences being derived from a sequence of a plant RNA virus, (ii) a sequence of interest, whereby said sequences for replicon function exhibit at selected localities of said sequences of said plant RNA virus function-conservative differences from said sequence of said plant RNA virus, said differences causing an increased frequency of replicon formation compared to an RNA replicon not exhibiting said differences; and (b) causing expression of said sequence of interest.
Claims
1. A process of expressing a sequence of interest in a plant, plant part, or plant cell culture, comprising: transforming a plant, plant part, or plant cell culture with a suspension of Agrobacteria, said Agrobacteria containing in T-DNA a heterologous DNA having a sequence encoding a replicon operably linked or linkable to a transcription promoter, wherein said sequence encoding a replicon contains (i) sequences that have a replicon function of a plant RNA viral replicon, and (ii) a sequence of interest, wherein said suspension of Agrobacteria has a concentration of cells of said Agrobacteria corresponding to a calculated optical density at 600 nm of at most 0.004, wherein said calculated optical density is defined by an at least 250-fold dilution of a suspension of said Agrobacteria of an OD at 600 nm of 1.0.
2. The process of claim 1, wherein said suspension of Agrobacteria has a concentration of cells of said Agrobacteria corresponding to a calculated optical density at 600 nm of at most 0.001, wherein said calculated optical density is defined by an at least 1000-fold dilution of a suspension of said Agrobacteria of an OD at 600 nm of 1.0.
3. The process of claim 1, which is a process of transiently expressing said sequence of interest and comprises transient transformation of said plant, plant part, or plant cell culture with a nucleic acid molecule containing said heterologous DNA.
4. The process of claim 1, wherein said transforming is done by infiltrating said plant or said plant part with said suspension of Agrobacteria.
Description
BRIEF DESCRIPTION OF THE FIGURES
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12) Ptranscription promoter; Ttranscription termination region; SMselectable marker gene; Ac2promoter of Arabidopsis ACTIN2 gene; RdRP viral RNA-dependent RNA polymerase; MPviral movement protein; NTRviral 3 non-translated region.
(13)
(14)
(15)
(16) Ptranscription promoter; Ttranscription termination region; SMselectable marker gene; Ac2promoter of Arabidopsis ACTIN2 gene; RdRP viral RNA-dependent RNA polymerase; MPviral movement protein; NTRviral 3 non-translated region.
(17)
(18)
DETAILED DESCRIPTION OF THE INVENTION
(19) We have surprisingly found that the incorporation of plant introns into certain regions of plant viral RNA vectors and removal or replacement of cryptic introns within sequences for replicon function can dramatically increase (at least 10.sup.2 folds) the efficiency of the appearance of said RNA replicons in the cytoplasm of host plants. Such increase in efficiency was reflected in at least one easily measurable parameter: relative proportion of cells showing replication of said vector, e.g. in increased frequency of replicon formation. Such optimisation of initiation of RNA replicon formation led to the ability of synchronized switching on of expression of a sequence of interest in a whole plant, resulting in dramatically increased yield of recombinant protein of interest in shorter time than for a non-modified vector.
(20) Despite of publications concerning the increase of nuclear transgene expression by incorporation of introns in coding regions of recombinant DNA (Mascarenhas et al., 1990, Plant Mol. Biol., 15, 913-920; Bourdon et al., 2001, EMBO Reports, 2, 394-398; Rose, A B., 2002, RNA, 8, 1444-1453; U.S. Pat. No. 5,955,330), there is no hint in the prior art showing that incorporation of introns into viral RNA replicons would have any positive effect on the frequency of viral replicon formation and subsequently, on the level of expression of a sequence of interest provided by said replicon. This effect is surprising considering that nuclear mRNA transcription and viral RNA replication take place in different sub-cellular compartments. Even if the cDNA copy of a viral replicon is placed in the nucleus, only the first copy of the viral replicon precursor is produced in nucleus and then amplified in the cytoplasm under conditions different from those in the nucleus. In the prior art, the use of introns for preventing the cytotoxic effect of leaky expression of viral genes in E. coli during cloning with wild type virus cDNAs was described (Johansen, I. E. 1996, Proc. Natl. Acad. Sci. USA, 93, 12400-12405; Yang et al., 1998, Arch. Virol., 143, 2443-2451; Lopez-Moya & Garcia, 2000, Virus Res., 68, 99-107). There is no hint that intron inclusion can increase the frequency of replicon formation from a viral cDNA clone. The results obtained for wild type RNA viruses and their cDNA copies cannot be compared with virus-derived expression vectors designed for the expression of a heterologous sequence of interest in plants, predominantly at the expense of other properties of wild type viruses like high infectivity and stability of said viruses. Infectivity is not an issue in the present invention. Notably, infectivity is not an issue in a process of expressing a sequence of interest in a stably transformed plant. Infectivity of a viral DNA vector or its transcript is also not an issue when a plant is transformed with Agrobacteria containing the DNA vector in T-DNA.
(21) The present invention provides a method for increasing fundamentally the frequency of RNA virus-derived replicon formation, said replicons are derived upon transcription of DNA precursor and designed for the expression of sequences of interest. This method overcomes the limitations of existing viral vector-based expression systems, such as size limitation for heterologous sequences to be expressed and high instability of said vectors. Further, said method offers better biosafety characteristics, allows to design leakage-proof control over transgene expression (zero expression level in non-induced state), as such design can be an integrated part of the strategy for said RNA virus-derived replicon design. By providing high frequency of RNA virus-derived replicon formation, the approach described herein allows for a rapid initiation of the expression of a sequence of interest in a whole plant, part of plant or plant cell culture containing in cell nuclei a heterologous DNA encoding said RNA replicon. By practicing the invention, the performance of practically any plant RNA virus-derived replicon designed for the expression of a heterologous sequence of interest can be improved significantly through dramatic increase of the frequency of replicon formation.
(22) RNA viruses belonging to different taxonomic groups are suitable for constructing RNA replicons according to this invention. A list of RNA viruses to which this invention can be applied, is presented below. Taxa names in quotes (and not in italic script) indicate that this taxon does not have an ICTV international approved name. Species (vernacular) names are given in regular script. Viruses with no formal assignment to genus or family are indicated):
(23) RNA Viruses:
(24) ssRNA Viruses: Family: Bromoviridae, Genus: Alfamovirus, Type species: alfalfa mosaic virus, Genus: Ilarvirus, Type species: tobacco streak virus, Genus: Bromovirus, Type species: brome mosaic virus, Genus: Cucumovirus, Type species: cucumber mosaic virus;
(25) Family: Closteroviridae, Genus: Closterovirus, Type species: beet yellows virus, Genus: Crinivirus, Type species: Lettuce infectious yellows virus, Family: Comoviridae, Genus: Comovirus, Type species: cowpea mosaic virus, Genus: Fabavirus, Type species: broad bean wilt virus 1, Genus: Nepovirus, Type species: tobacco ringspot virus;
(26) Family: Potyviridae, Genus: Potyvirus, Type species: potato virus Y, Genus: Rymovirus, Type species: ryegrass mosaic virus, Genus: Bymovirus, Type species: barley yellow mosaic virus;
(27) Family: Sequiviridae, Genus: Sequivirus, Type species: parsnip yellow fleck virus, Genus: Waikavirus, Type species: rice tungro spherical virus; Family: Tombusviridae, Genus: Carmovirus, Type species: carnation mottle virus, Genus: Dianthovirus, Type species: carnation ringspot virus, Genus: Machiomovirus, Type species: maize chlorotic mottle virus, Genus: Necrovirus, Type species: tobacco necrosis virus, Genus: Tombusvirus, Type species: tomato bushy stunt virus, Unassigned Genera of ssRNA viruses, Genus: Capillovirus, Type species: apple stem grooving virus;
(28) Genus: Carlavirus, Type species: carnation latent virus; Genus: Enamovirus, Type species: pea enation mosaic virus,
(29) Genus: Furovirus, Type species: soil-borne wheat mosaic virus, Genus: Hordeivirus, Type species: barley stripe mosaic virus, Genus: Idaeovirus, Type species: raspberry bushy dwarf virus;
(30) Genus: Luteovirus, Type species: barley yellow dwarf virus; Genus: Marafivirus, Type species: maize rayado fino virus; Genus: Potexvirus, Type species: potato virus X; Genus: Sobemovirus, Type species: Southern bean mosaic virus, Genus: Tenuivirus, Type species: rice stripe virus,
(31) Genus: Tobamovirus, Type species: tobacco mosaic virus,
(32) Genus: Tobravirus, Type species: tobacco rattle virus,
(33) Genus: Trichovirus, Type species: apple chlorotic leaf spot virus; Genus: Tymovirus, Type species: turnip yellow mosaic virus; Genus: Umbravirus, Type species: carrot mottle virus; Negative ssRNA Viruses: Order: Mononegavirales, Family: Rhabdoviridae, Genus: Cytorhabdovirus, Type Species: lettuce necrotic yellows virus, Genus: Nucleorhabdovirus, Type species: potato yellow dwarf virus;
(34) Negative ssRNA Viruses: Family: Bunyaviridae, Genus: Tospovirus, Type species: tomato spotted wilt virus;
(35) dsRNA Viruses: Family: Partitiviridae, Genus: Aiphacryptovirus, Type species: white clover cryptic virus 1, Genus: Betacryptovirus, Type species: white clover cryptic virus 2, Family: Reoviridae, Genus: Fijivirus, Type species: Fiji disease virus, Genus: Phytoreovirus, Type species: wound tumor virus, Genus: Oryzavirus, Type species: rice ragged stunt virus;
(36) Unassigned Viruses:
(37) Genome: ssRNA, Species Garlic viruses A, B, C, D, Species grapevine fleck virus, Species maize white line mosaic virus, Species olive latent virus 2, Species: ourmia melon virus, Species Pelargonium zonate spot virus.
(38) The general principle of the invention is shown in
(39) Surprisingly, our first attempt to find evidence that potentially problematic regions do exist, was successful and even more surprisingly, we obtained experimental confirmation by finding unexpectedly an improvement of orders of magnitude. An analysis of the sequence derived from the RNA virus of expression vector pICH8543 (EXAMPLE 1,
(40) Considering that all existing programs are not ideal and are subject to mistakes, the potential problematic regions can also be determined experimentally. This can be done by analyzing the transcripts derived from a DNA vector under test in a nuclear environment with the help of such a routine technique as RT-PCR (Frohman, M A., 1989, Methods Enzymol., 218, 340-356) or its more advanced version suitable for precise quantification of the concentration of different transcripts called real-time PCR (Gibson et al., 1996, Genome Res., 6, 995-1001), preferably followed by sequencing of the PCR-amplified products. The function-conservative differences of the invention change dramatically the RNA profile, for example by replacing intron-like sequences with exon-like ones, e.g. by introducing silent mutations with replacement of A/U-rich regions (intron-like) with G/C-rich regions (exon-like) (see
(41) Such a high efficiency of replicon formation opens the door for expressing two or more different genes from two different RNA replicons within the same plant cell, e.g. co-expressing different genes by using plant RNA virus based vectors. The achievement of synchronized release of two or more replicons at same time in the same cell is crucial for such co-expression, as the principle first come, first served is especially true for viral vectors. Systemic or cell-to-cell movement does not help, as different viral vectors do normally not overlap in their areas of spread or such overlap is insignificant. Simple calculations demonstrate the importance of the technology described in this invention for achieving co-expression of two sequences of interest in the same plant cell from two replicons. In the case of a non-optimised viral vector with a frequency of functional replicon formation of only 0.2% of all cells, the proportion of cells co-expressing two genes from two different RNA replicons will be 0.20.2=0.04%, while for the construct with increased frequency of functional RNA replicon formation (50% or of all cells), said proportion of cells will be 0.50.5=0.25 or 25%, e.g. about 625-fold higher. With some of the best performing vectors (e.g. pICH16191,
(42) It is worth to note that function-conservative differences in heterologous sequences of interest to be expressed from said RNA replicon might also be used to increase the frequency of RNA replicon formation, notably in combination with differences in sequences for replicon function. For example, modifications within said sequences of interest can be introduced that are necessary for formation and/or processing of said replicon.
(43) In an important embodiment of this invention, the frequency of replicon formation is improved by inserting nuclear introns in said sequences for replicon function (EXAMPLE 4). The incorporation of introns into the coding region of viral RNA-dependent RNA polymerase (RdRP) (EXAMPLES 4 and 8) resulted in a significant (at least 50-fold) increase in the frequency of replicon formation from (
(44) Many nuclear introns can be used to practice this invention. Examples of such introns include but are not limited to the introns from rice tpi Act1, and salT genes (Rethmeier et al., 1997, Plant J., 12, 895-899; Xu et al., 1994, Plant Physiol., 100, 459-467; McElroy et al., 1990, Plant Cell, 2, 163-171); from the maize Adh1, GapA1, actin and Bz1 genes (Callis et al., 1987, Genes Dev., 1, 1183-11200; Donath et al., 1995, Plant Mol. Biol., 28, 667-676; Maas et al., 1991, Plant Mol. Biol., 16, 199-207; Sinibaldi & Mettler, 1992, in W E Cohn, K Moldave, eds, Progress in Nucleic Acids Research and Molecular Biology, vol 42, Academic Press, New York, pp 229-257), from petunia rubisco gene SSU301 (Dean et al., 1989, Plant Cell, 1, 201-208), Arabidopsis A1 EF1, UBQ10, UBQ3, PAT1 genes (Curie et al., 1993, Mol. Gen. Genet. 228, 428-436; Norris et al., 1993, Plant Mol. Biol., 21, 895-906; Rose & Last, 1997, Plant J., 11, 455-464) and many others. Synthetic introns can also be used for this invention. The smallest usable introns or their parts may be limited to splice donor and acceptor sites which usually flank the internal intron sequences. Preferably, the introns should have a size of at least 50 nt., more preferably a size of 100 to 200 nt., but actually there are no limitations regarding the size of the introns. However, the size of the construct should be kept suitable for manipulations. The origin of the intron, its structure and size may be selected individually depending on the nature of the vector. Transient expression experiments may be used for testing the efficiency of a chosen intron or the corresponding intron parts.
(45) The modifications described above have a cumulative effect, e.g. if intron insertion(s) are combined with a modification of the MP subgenomic promoter, the increase in frequency of replicon formation can be approx. 300-fold (EXAMPLE 5). The preferred regions for intron insertions in order to have an increase in the frequency of RNA replicon formation are called selected localities herein. Such localities may contain intron-like structures. This is confirmed by the insertion of introns in MP, actually in close proximity to such a problematic region as the MP subgenomic promoter (EXAMPLE 7). A 100-fold increase in frequency of replicon formation was observed. Insertion of introns into exon-like regions does not have such a pronounced effect as insertion in said intron-like regions (EXAMPLE 6).
(46) The experiments discussed above were done with transient expression systems based on Agrobacterium-mediated DNA precursor delivery into plant cells. However, the most useful application of this invention will be for transgenic plants with a DNA precursor of said RNA replicon stably incorporated into a plant nuclear chromosome. This allows to overcome many limitations of plant viral vector-based systems, such as the restrictions to the maximal size of heterologous sequences viral vectors can tolerate. As the DNA precursor will be present in each cell of the transgenic plant, there is no absolute requirement for systemic movement or for cell to cell movement of the RNA replicon (replicon spreading). This can be compensated by the high efficiency of formation and transport of the RNA replicons of the invention into the cytoplasm. However, the ability of the vector for cell-to-cell movement can be of an additional value, as RNA replicon formation does not always occur in all cells.
(47) Different methods may be used for providing a plant cell with said heterologous DNA. Said vectors may be transformed into plant cells by a Ti-plasmid vector carried by Agrobacterium (U.S. Pat. Nos. 5,591,616; 4,940,838; 5,464,763) or particle or microprojectile bombardment (U.S. Pat. No. 5,100,792; EP 00444882B1; EP 00434616B1). Other plant transformation methods can also be used like microinjection (WO 09209696; WO 09400583A1; EP 175966B1), electroporation (EP00564595B1; EP00290395B1; WO 08706614A1) or PEG-mediated transformation of protoplasts etc. The choice of the method for vector delivery may depend on the plant species to be transformed. For example, microprojectile bombardment is generally preferred for monocot transformation, while for dicots, Agrobacterium-mediated transformation gives better results in general.
(48) In the examples described below, we used Agrobacterium-mediated delivery of vectors (said heterologous DNA) into Nicotiana cells. However, said vectors may be introduced into the plants in accordance with any of the standard techniques suitable for stable or transient transformation of the plant species of interest. Transformation techniques for dicotyledonous are well known in the art and include Agrobacterium-based techniques and techniques which do not require Agrobacterium. Non-Agrobacterium techniques involve the uptake of exogenous genetic material directly by protoplasts or cells. These techniques include PEG or electroporation mediated uptake, particle bombardment-mediated delivery and microinjection. Examples of these techniques are described in Paszkowski et al., EMBO J 3, 2717-2722 (1984), Potrykus et al., Mol. Gen. Genet. 199, 169-177 (1985), Reich et al., Biotechnology 4:1001-1004 (1986), and Klein et al., Nature 327, 70-73 (1987). In each case, the transformed cells are regenerated to whole plants using standard techniques.
(49) Agrobacterium-mediated transformation is a preferred technique for the transformation of dicotyledons because of its high transformation efficiency and its broad utility with many different species. The many crop species which may be routinely transformed by Agrobacterium include tobacco, tomato, sunflower, cotton, oilseed rape, potato, soybean, alfalfa and poplar (EP 0 317 511 (cotton), EP 0 249 432 (tomato), WO 87/07299 (Brassica), U.S. Pat. No. 4,795,855 (poplar)).
(50) Agrobacterium transformation typically involves the transfer of the binary vector carrying the foreign DNA of interest into an appropriate Agrobacterium strain which may depend on the complement of vir genes carried by the host Agrobacterium strain either on a co-resident plasmid or chromosomally (Uknes et al., Plant Cell 5:159-169 (1993). The transfer of the recombinant binary vector to Agrobacterium may be accomplished by a triparental mating procedure using E. coli carrying the recombinant binary vector, a helper E. coli strain which carries a plasmid such as pRK2013, which is able to mobilize the recombinant binary vector to the target Agrobacterium strain. Alternatively, the recombinant binary vector may be transferred to Agrobacterium by DNA transformation (Hofgen & Willmitzer, Nucl. Acids Res. 16, 9877 (1988)).
(51) Transformation of the target plant species by recombinant Agrobacterium usually involves co-cultivation of the Agrobacterium with explants from the plant following protocols known in the art. Transformed tissue carrying an antibiotic or herbicide resistance marker present between the binary plasmid T-DNA borders may be regenerated on selectable medium. This allows the generation of transgenic plants stably transformed on a nuclear chromosome with in T-DNA containing said heterologous DNA of the invention.
(52) In the examples of this invention, in parallel with stable agro-transformation we used agro-inoculation, a method of Agrobacterium-mediated delivery of T-DNA for transient expression of gene(s) of interest (Vaquero et al., 1999, Proc. Natl. Acad. Sci. USA, 96, 11128-11133). Agro-inoculation is an extremely useful tool not only for small-to-middle scale recombinant protein production systems, but as an element of a vector optimisation system, allowing to obtain fast results with different variants of constructs.
(53) The invention can also be used for large-scale/industrial production of recombinant proteins. Overnight cultures of Agrobacterium were used in our experiments. The overnight culture was prepared for agro-infiltration, as described in the prior art (Marillonnet et al., 2004, Proc. Natl. Acad. Sci. USA., 101, 6853-6857). Usually, an overnight culture reaches an optical density (O.D.) of 3-3.5 units at a wavelength 600 nm and is diluted 3-5 times before agro-infiltration, yielding in general 5-910.sup.9 colony forming units (Turpen et al., 1993, J. Virol. Methods, 42, 227-240). We have found that a 10.sup.2, preferably a 10.sup.3 and more preferably a 10.sup.4 fold dilution of auch an overnight culture works very efficiently, especially in combination with sequences for replicon function having said function-conservative differences as described herein. Surprisingly, the vectors in infiltrated tobacco leaves further improved their performance giving better yield of GFP with increasing dilutions of the transforming Agrobacteria. For example, a 10.sup.3-fold dilution gave better result than a 10.sup.2-fold dilution. A 10.sup.2-fold dilution provides better GFP yield than a 10-fold dilution. A possible explanation for this phenomenon is the negative effect of highly concentrated Agrobacterium suspension on the function of a viral vector, e.g. on cell-to-cell movement, possibly as the result of a plant response to high concentrations of pathogenic bacteria. This phenomenon is of special value for large-scale industrial protein expression processes, as it allows to reduce the amount of agrobacteria required for recombinant protein production via agro-infiltration by at least one order of magnitude compared to prior art processes.
(54) In EXAMPLE 9 of this invention, a DNA precursor of an inactivated viral RNA-based replicon is stably incorporated into chromosomal DNA. Said replicon is optimised according to the invention. In addition, the replicon contains a structure preventing expression of the sequence of interest. Expression as well as formation of the functional RNA replicon can be triggered by flipping one part of the construct with the help of site-specific recombination. Said flipping can lead to the formation of two introns as well as to the assembly of a functional sequence of interest. The system described in EXAMPLE 9 shows not only the optimisation of a viral vector but also the solution for avoiding leakiness of constructs stably integrated into chromosomal DNA, including the leaky expression of the gene of interest from said construct. In many applications, it is crucial to have zero level expression in the uninduced state, especially for cytotoxic proteins or for achieving high biosafety standards with plant expression systems for expressing technical or pharmaceutical proteins.
(55) Transcription of the heterologous DNA and/or of said recombinase can be under the control of an inducible or any other regulated (e.g. developmentally regulated) promoter. Inducible promoters can be divided into two categories according to their induction conditions: those induced by abiotic factors (temperature, light, chemical substances) and those that can be induced by biotic factors, for example, pathogen or pest attack. Examples of the first category are heat-inducible (U.S. Pat. No. 5,187,287) and cold-inducible (U.S. Pat. No. 5,847,102) promoters, a copper-inducible system (Mett et al., 1993, Proc. Natl. Acad. Sci., 90, 4567-4571), steroid-inducible systems (Aoyama & Chua, 1997, Plant J., 11, 605-612; McNellis et al., 1998, Plant J., 14, 247-257; U.S. Pat. No. 6,063,985), an ethanol-inducible system (Caddick et al., 1997, Nature Biotech., 16, 177-180; WO09321334), and a tetracycline-inducible system (Weinmann et al., 1994, Plant J., 5, 559-569). One of the latest developments in the area of chemically inducible systems for plants is a chimaeric promoter that can be switched on by glucocorticoid dexamethasone and switched off by tetracycline (Bohner et al., 1999, Plant J., 19, 87-95). For a review on chemically inducible systems see: Zuo & Chua, (2000, Current Opin. Biotechnol., 11, 146-151) and Padidam, M (2003, Curr. Opin. Plant Biol., 6, 169-177). Other examples of inducible promoters are promoters which control the expression of patogenesis-related (PR) genes in plants. These promoters can be induced by treatment of a plant with salicylic acid, an important component of plant signaling pathways in response to pathogen attack, or other chemical compounds (benzo-1,2,3-thiadiazole or isonicotinic acid) which are capable of triggering PR gene expression (U.S. Pat. No. 5,942,662).
(56) This invention is not limited to TMV-based vectors described in examples 1-9, but can be extended to replicons based on other plant RNA viruses. The analysis of other plant viral RNA sequences (EXAMPLE 10,
(57) The present invention is preferably carried out with higher multi-cellular plants, parts thereof, or cell cultures thereof. Plants for the use in this invention include any plant species with preference given to agronomically and horticulturally important species. Common crop plants for the use in present invention include alfalfa, barley, beans, canola, cowpeas, cotton, corn, clover, lotus, lentils, lupine, millet, oats, peas, peanuts, rice, rye, sweet clover, sunflower, sweetpea, soybean, sorghum triticale, yam beans, velvet beans, vetch, wheat, wisteria, and nut plants. The plant species preferred for practicing this invention include, but not restricted to, representatives of Gramineae, Compositeae, Solanaceae and Rosaceae.
(58) Further preferred species for the use in this invention are plants from the following genera: Arabidopsis, Agrostis, Allium, Antirrhinum, Apium, Arachis, Asparagus, Atropa, Avena, Bambusa, Brassica, Bromus, Browaalia, Camellia, Cannabis, Capsicum, Cicer, Chenopodium, Chichorium, Citrus, Coffea, Coix, Cucumis, Curcubita, Cynodon, Dactylis, Datura, Daucus, Digitalis, Dioscorea, Elaeis, Eleusine, Festuca, Fragaria, Geranium, Glycine, Helianthus, Heterocallis, Hevea, Hordeum, Hyoscyamus, Ipomoea, Lactuca, Lens, Lilium, Linum, Lolium, Lotus, Lycopersicon, Majorana, Malus, Mangifera, Manihot, Medicago, Nemesia, Nicotiana, Onobrychis, Oryza, Panicum, Pelargonium, Pennisetum, Petunia, Pisum, Phaseolus, Phleum, Poa, Prunus, Ranunculus, Raphanus, Ribes, Ricinus, Rubus, Saccharum, Salpiglossis, Secale, Senecio, Setaria, Sinapis, Solanum, Sorghum, Stenotaphrum, Theobroma, Trifolium, Trigonella, Triticum, Vicia, Vigna, Vitis, Zea, and the Olyreae, the Pharoideae and many others.
(59) Most preferred plants for this invention are plants that do not enter the animal or human food chain like Nicotiana species, e.g. Nicotiana benthamiana and Nicotiana tabacum.
(60) Proteins of interest, their fragments (functional or non-functional) and their artificial derivatives that can be expressed in plants or plants cells using the present invention include, but are not limited to: starch modifying enzymes (starch synthase, starch phosphorylation enzyme, debranching enzyme, starch branching enzyme, starch branching enzyme II, granule bound starch synthase), sucrose phosphate synthase, sucrose phosphorylase, polygalacturonase, polyfructan sucrase, ADP glucose pyrophosphorylase, cyclodextrin glycosyltransferase, fructosyl transferase, glycogen synthase, pectin esterase, aprotinin, avidin, bacterial levansucrase, E. coli glgA protein, MAPK4 and orthologues, nitrogen assimilation/methabolism enzyme, glutamine synthase, plant osmotin, 2S albumin, thaumatin, site-specific recombinase/integrase (FLP, Cre, R recombinase, Int, SSVI Integrase R, Integrase phiC31, or an active fragment or variant thereof), oil modifying enzymes (like fatty acids desaturases, elongases etc), isopentenyl transferase, Sca M5 (soybean calmodulin), coleopteran type toxin or an insecticidally active fragment, ubiquitin conjugating enzyme (E2) fusion proteins, enzymes that metabolise lipids, amino acids, sugars, nucleic acids and polysaccharides, superoxide dismutase, inactive proenzyme form of a protease, plant protein toxins, traits altering fiber in fiber producing plants, Coleopteran active toxin from Bacillus thuringiensis (Bt2 toxin, insecticidal crystal protein (ICP), CryIC toxin, delta endotoxin, polyopeptide toxin, protoxin etc.), insect specific toxin AaIT, cellulose degrading enzymes, E1 cellulase from Acidothermus celluloticus, lignin modifying enzymes, cinnamoyl alcohol dehydrogenase, trehalose-6-phosphate synthase, enzymes of cytokinin metabolic pathway, HMG-CoA reductase, E. coli inorganic pyrophosphatase, seed storage protein, Erwinia herbicola lycopen synthase, ACC oxidase, pTOM36 encoded protein, phytase, ketohydrolase, acetoacetyl CoA reductase, PHB (polyhydroxybutanoate) synthase, enzymes involved in the synthesis of polyhydroxylalkanoates (PHA), acyl carrier protein, napin, EA9, non-higher plant phytoene synthase, pTOM5 encoded protein, ETR (ethylene receptor), plastidic pyruvate phosphate dikinase, nematode-inducible transmembrane pore protein, trait enhancing photosynthetic or plastid function of the plant cell, stilbene synthase, an enzyme capable of hydroxylating phenols, catechol dioxygenase, catechol 2,3-dioxygenase, chloromuconate cycloisomerase, anthranilate synthase, Brassica AGL15 protein, fructose 1,6-biphosphatase (FBPase), AMV RNA3, PVY replicase, PLRV replicase, potyvirus coat protein, CMV coat protein, TMV coat protein, luteovirus replicase, MDMV messenger RNA, mutant geminiviral replicase, Umbellularia californica C12:0 preferring acyl-ACP thioesterase, plant C10 or C12:0 preferring acyl-ACP thioesterase, C14:0 preferring acyl-ACP thioesterase (luxD), plant synthase factor A, plant synthase factor B, D6-desaturase, proteins having an enzymatic activity in fatty acids biosynthesis and modifications, e.g. the peroxysomal -oxidation of fatty acids in plant cells, acyl-CoA oxidase, 3-ketoacyl-CoA thiolase, lipase, maize acetyl-CoA-carboxylase, etc.; 5-enolpyruvylshikimate-3-phosphate synthase (EPSP), phosphinothricin acetyl transferase (BAR, PAT), CP4 protein, ACC deaminase, protein having posttranslational cleavage site, DHPS gene conferring sulfonamide resistance, bacterial nitrilase, 2,4-D monooxygenase, acetolactate synthase or acetohydroxyacid synthase (ALS, AHAS), polygalacturonase, Taq polymerase, bacterial nitrilase, many other enzymes of bacterial or phage origin including restriction endonucleases, methylases, DNA and RNA ligases, DNA and RNA polymerases, reverse transcriptases, nucleases (DNases and RNases), phosphatases, transferases etc.
(61) The present invention can be used for the purpose of molecular farming and purification of commercially valuable and pharmaceutically important proteins including industrial enzymes (cellulases, lipases, proteases, phytases etc.) and fibrous proteins (collagen, spider silk protein, etc.). Human or animal health protein may be expressed and purified using described in our invention approach. Examples of such proteins of interest include inter alia immune response proteins (monoclonal antibodies, single chain antibodies, T cell receptors etc.), antigens including those derived from pathogenic microorganisms, colony stimulating factors, relaxins, polypeptide hormones including somatotropin (HGH) and proinsulin, cytokines and their receptors, interferons, growth factors and coagulation factors, enzymatically active lysosomal enzyme, fibrinolytic polypeptides, blood clotting factors, trypsin, trypsinogen, al-antitrypsin (AAT), human serum albumin, glucocerebrosidases, native cholera toxin B, thrombin, human gastric lipase, granulocyte-macrophage colony stimulating factor (GM-CMF), serpin, lactoferrin, lisozyme, oleosin, prothrombin, alpha-galactosidase, as well as function-conservative proteins like fusions, mutant versions and synthetic derivatives of the above proteins.
(62) The content of International patent application PCT/EP03/12530 and European patent application 04016012.9 are incorporated herein by reference in their entireties.
EXAMPLES
(63) The following examples are presented to illustrate the present invention. Modifications and variations may be made without departing from the spirit and scope of the invention.
Example 1
(64) Construction of a TMV-Based RNA Vector
(65) Cloned cDNAs of the crucifer-infecting tobamovirus (cr-TMV; Dorokhov et al., 1994, FEBS Lett. 350, 5-8) and of the turnip vein-clearing virus (TVCV; Lartey et al., 1994, Arch. Virol. 138, 287-298) were obtained from Prof. Atabekov from Moscow University, Russia. A viral vector containing a green fluorescence protein (GFP) gene was made in several cloning steps. The resulting construct, pICH8543 (
(66) To confirm this observation, we made a viral vector construct containing a mutation in the MP. This construct, called pICH14833, is similar to pICH8543 but differs by a deletion of 389 bp in the MP gene, upstream of the EcoRI site present in the MP. The sequence of the NcoI to EcoRI fragment that includes this deletion is given in the annex as SEQ ID No. 1. The entire viral construct (from the ACT2 promoter to the Nos terminator) was cloned between the T-DNA left and right borders of pICBV49, a pBIN19-derived Kan.sup.R binary vector. Due to the deletion in the MP, replicons produced from this construct cannot move from cell to cell but are able to replicate autonomously within a cell. Cell to cell movement can be restored when MP is provided in trans, e.g. from a constitutive promoter such as the cauliflower mosaic virus 35S promoter.
(67) To make an MP expression construct, the TVCV MP gene was amplified by PCR from cloned TVCV cDNA (GenBank accession Z29370, by 4802 to 5628) and subcloned in a binary vector under control of the 35S promoter. The resulting construct, called pICH10745 (not shown), and pICH14833 were transformed into Agrobacterium strain GV3101 and various dilutions of an overnight culture were infiltrated in Nicotiana benthamiana leaves as described by English and colleagues (1997, Plant J., 12, 597-603), except that the infiltration media lacked acetosyringone. Infiltration of pICH14833 alone led to the appearance of a few GFP expressing cells within the infiltrated area. By counting protoplasts prepared from the infiltrated area, we found that only one to three protoplasts expressed GFP from a total of 500 protoplasts (0.2 to 0.6%). Coinfiltration of pICH14833 and pICH10745 led to the formation of GFP-expressing foci that grew from each initial GFP-expressing cell. Ultimately, due to cell-to-cell movement, a large proportion of cells in the infiltrated area expressed GFP (
(68) RNA viruses such as tobamoviruses replicate in the cytoplasm and never enter the nucleus. Therefore, they have evolved in an environment where they are not exposed to the nuclear pre-mRNA processing machinery. As a result, it is not surprising that RNA replicon transcripts generated in the nucleus from artificial viral constructs may not be recognized and processed properly by the RNA processing machinery. Moreover, RNA replicons from viral vectors are very large: approximately 7,000 nt in the case of the replicon based on TMV. Very few plant genes have such a large size and the majority of such genes contains introns that facilitate processing of the pre-mRNAs, export from the nucleus, and that improve the stability of the processed transcripts. We therefore hypothesized that modifications of the pre-mRNAs that would increase the efficiency of accurate processing and of export of correctly processed transcripts from the nucleus to the cytosol would lead to an increase of the number of cells that would initiate viral replication. It turned out that there are two approaches can be used to make RNA virus-based vectors that can more efficiently initiate viral replication after DNA delivery to the nucleus: (1) one approach is the removal of sequence features that might induce unwanted processing events (such as alternative splicing events using cryptic splice sites, or premature termination events); (2) a second approach is the addition of introns to increase the amount of properly processed transcripts, to improve export of the RNA from the nucleus to the cytoplasm, and/or to improve stability of the transcripts.
Example 2
(69) Removal of Intron-Like Sequences Increases the Frequency of Viral RNA Replicon Formation in the Cytoplasm
(70) We analyzed the sequence of the RNA replicon from pICH4351 using the Netgenell server program (http://www.cbs.dtu.dk/services/NetGene2/) and noticed several intron-like sequence features that might induce alternative splicing events. One such feature is a 0.6 kb uridine-rich region (corresponding to nt 827 to 1462 in GenBank accession BRU03387) at the beginning of the RdRP (
Example 3
(71) Removal of Intron-Like Sequences in the MP Subgenomic Promoter
(72) A second potentially problematic region corresponds to the MP subgenomic promoter (
Example 4
(73) Addition of Introns Improves the Frequency of Formation of Functional RNA Rep/Icons in the Cytoplasm
(74) We tested whether the addition of introns into viral pro-replicon sequences would increase the frequency of initiation of replication. Two constructs were made, pICH15025 and pICH15034 (
(75) Both constructs were separately agroinfiltrated into N. benthamiana leaves and compared to pICH14833. Both constructs significantly increased the number of cells initiating viral replication (
(76) A third clone was made, pICH15499, which contained all 6 introns (
Example 5
(77) Addition of Introns and Removal of Intron-Like Sequences Increases the Frequency of the Formation of Functional RNA Replicons in the Cytoplasm
(78) Removing intron-like features and adding additional introns in one construct showed that both types of modifications can contribute to improve initiation of viral replication. We subcloned the 6 introns of pICH15499 into pICH15900, which contains the mutagenized MP subgenomic promoter region. The resulting clone pICH15860 (
Example 6
(79) Not all Intron Additions Increase the Frequency of Appearance of Functional RNA Replicons in the Cytoplasm
(80) We inserted two different Arabidopsis introns at the beginning of the RdRP, resulting in clone pICH15477 (the sequence of this region is shown as SEQ ID No. 6 in the annex). The sequence in this region already looks very exon-like (e.g. GC-rich without cryptic splice sites) before the addition of introns. No improvement on replication of viral initiation was seen with this construct. Therefore, not any addition of an intron will result in an improvement of the viral vector. It appears that the position chosen for intron insertion or mutagenesis is an important parameter. For example, all intron insertions or nucleotide substitutions that were made in regions near problematic structures such as the MP subgenomic promoter resulted in large improvements, while insertions of introns into sequences that are already exon-like did not.
Example 7
(81) Insertion of Introns in MP Sequences Increase the Frequency of Viral Replicon Formation
(82) We first made a frameshift in the MP by digestion with the restriction enzyme AvrII, filling and religation. We then inserted two introns in the MP. The resulting clone pICH16422 (
Example 8
(83) Insertion of Introns into a MP Containing Vector Improves the Frequency of Initiation of Viral Replication of Autonomous Functional Clones
(84) A Kpn1 EcoRI fragment was subcloned from pICH15499 into pICH8543. The resulting clone, 16700 (
Example 9
(85) Activation of an Inactive Replicon Stably Integrated on a Chromosome
(86) It is also possible to stably transform intron-containing viral vector constructs in transgenic plants. To avoid deleterious viral replication that would inhibit plant growth, an inactive clone (pro-replicon) can be made by having a part of the vector present in antisense orientation (
(87) The flipped part can be located at the 3 end of the vector (as shown in
Example 10
(88) Plant Viral RNA Sequences Contain Potentially Unstable Regions
(89) The analysis of RNA profile of selected plant RNA viruses as well as one well characterised plant gene (AtDMC1) was performed by using the Netgenell server program (http://www.cbs.dtu.dk/services/NetGene2/). The RNA profile shown in
Example 11
(90) Optimized Vectors Work in Other Species
(91) A fully optimized construct containing the mutagenized region (described in pICH15466) and 16 introns (including the six introns of pICH15860, the two introns of pICH16422 and eight additional introns) was made. In summary this construct contains introns inserted at the following positions (given relative to TVCV sequence, GenBank accession BRU03387): nt 209, nt 828, nt 1169, nt 1378, nt 1622, nt 1844, nt 2228, nt 2589, nt 2944, nt 3143, nt 3381, nt 3672, nt 3850, nt 4299, nt 5287, nt 5444.
(92) This construct was tested for expression in Beta vulgaris. Infiltration of the entire plant was performed as described next. Agrobacteria carrying pICH18711 were inoculated to 300 ml of LB containing 50 g/ml Rifampicin and 50 g/ml Kanamycin (selection for the binary vector) and grown until saturation. The bacteria were pelleted at 4800 g for 10 min and resuspended in 3 l of infiltration buffer (10 mM MES pH 5.5, 10 mM MgSO.sub.4) in order get a 10-fold dilution relative to the saturated Agrobacterium culture. A beaker containing the infiltration solution was placed in an exsiccator (30 mm diameter), with the aerial parts of a plant dipped in the solution. A vacuum was applied for two minutes using a Type PM 16763-860.3 pump from KNF Neuberger (Freiburg, Germany), reaching from 0.5 to 0.9 bar. The plants were returned to the greenhouse under standard conditions.
(93) GFP expression was high in leaves of the plants infiltrated with pICH18711 (
(94) TABLE-US-00001 ANNEX SEQIDNo.1(NcoI-EcoRIfragmentofpICH14833): ccatggacaaagtgataaaggcagctttttgtggagacgatagcctgatttacattcctaaaggtttagacttgcctgatattcaggcggg cgcgaacctcatgtggaacttcgaggccaaactcttcaggaagaagtatggttacttctgtggtcgttatgttattcaccatgatagagga gccattgtgtattacgatccgcttaaactaatatctaagttaggttgtaaacatattagagatgttgttcacttagaagagttacgcgagtctt tgtgtgatgtagctagtaacttaaataattgtgcgtatttttcacagttagatgaggccgttgccgaggttcataagaccgcggtaggcggt tcgtttgctttttgtagtataattaagtatttgtcagataagagattgtttagagatttgttctttgtttgataatgtcgatagtctcgtacgaaccta aggtgagtgatttcctcaatctttcgaagaaggaagagatcttgccgaaggctctaacgaggttagaattc SEQIDNo.2(partofpICH15466): ggagataacctgagcttcttcttccataatgagagcactctcaattacacccacagcttcagcaacatcatcaagtacgtgtgcaagac gttcttccctgctagtcaacgcttcgtgtaccacaaggagttcctggtcactagagtcaacacttggtactgcaagttcacgagagtggat acgttcactctgttccgtggtgtgtaccacaacaatgtggattgcgaagagttttacaaggctatggacgatgcgtggcactacaaaaa gacgttagcaatgcttaatgccgagaggaccatcttcaaggataacgctgcgttaaacttttggttcccgaaagtgagagacatggttat cgtccctctctttgacgcttctatcacaactggtaggatgtctaggagagaggttatggtgaacaaggacttcgtctacacggtcctaaat cacatcaagacctatcaagctaaggcactgacgtacgcaaacgtgctgagcttcgtggagtctattaggtctagagtcataattaacgg tgtcactgccaggtctgaatgggacacagacaaggcaattctaggtccattagcaatgacattcttcctgatcacgaagctgggtcatgt gcaagat SEQIDNo.3(partofpICH15900): gcggacgatacgtgatccaccatgatagaggagccattgtgtattacgatccgcttaaactaatatctaagctcggctgcaagcacatc agagacgtcgtgcacttagaagagttacgcgagtctttgtgcgacgtagctagtaacttgaacaactgcgcctacttctcacagttagat gaggccgttgctgaggtccacaagactgcggtcggaggctccttcgcgttctgtagcatcatcaaatacttgtcagacaagaggctgtt cagggacctgttcttcgtctgagttgacg SEQIDNo.4(partofpICH15025):(contains3Intronsshownunderlinedinitalics) Cccgagctatactgtaccttcgccgaccgattggtactacagtacaagaaggcggaggagttccaatcgtgtgatctttccaaacctct agaagagtcagagaagtactacaacgcattatccgagctatcagtgcttgagaatctcgactcttttgacttagaggcgtttaagacttta tgtcagcagaagaatgtggacccggatatggcagcaaag gtaaatcctggtccacacttttacgataaaaacacaagattttaaactatgaactgatcaataatcattcctaaaagaccacacttttgtttt gtttctaaagtaatttttactgttataacag gtggtcgtagcaatcatgaagtcagaattgacgttgcctttcaagaaacctacagaagaggaaatctcggagtcgctaaaaccagga gaggggtcgtgtgcagagcataaggaagtgttgagcttacaaaatgatgctccgttcccgtgtgtgaaaaatctagttgaaggttccgt gccggcgtatggaatgtgtcctaagggtggtggtttcgacaaattggatgtggacattgctgatttccatctcaagagtgtagatgcagtt aaaaagggaactatgatgtctgcggtgtacacagggtctatcaaagttcaacaaatgaagaactacatagattacttaagtgcgtcgct ggcagctacagtctcaaacctctgcaag gtaagaggtcaaaaggtttccgcaatgatccctctttttttgtttctctagtttcaagaatttgggtatatgactaacttctgagtgttccttgatg catatttgtgatgagacaaatgtttgttctatgttttag gtgcttagagatgttcacggcgttgacccagagtcacaggagaaatctggagtgtgggatgttaggagaggacgttggttacttaaac ctaatgcgaaaagtcacgcgtggggtgtggcagaagacgccaaccacaagttggttattgtgttactcaactgggatgacggaaagc cggtttgtgatgagacatggttcagggtggcggtgtcaagcgattccttgatatattcggatatgggaaaacttaagacgctcacgtcttg cagtccaaatggtgagccaccggagcctaacgccaaagtaattttggtcgatggtgttcccggttgtggaaaaacgaaggagattatc gaaaag gtaagttctgcatttggttatgctccttgcattttaggtgttcgtcgctcttccatttccatgaatagctaagattttttttctctgcattcattctt cttgcctcagttctaactgtttgtggtatttttgttttaattattgctacaggtaaacttctctgaagacttgattttagtccctgggaaggaagctt SEQIDNo.5(partofpICH15034):(contains3Intronsshownunderlinedinitalics) ctgcag gtaaaatattggatgccagacgatattctttcttttgatttgtaactttttcctgtcaaggtcgataaattttattttttttggtaaaaggtcgataatt tttttttggagccattatgtaattttcctaattaactgaaccaaaattatacaaaccag gtttgctggaaaatttggttgcaatgatcaaaagaaacatgaatgcgccggatttgacagggacaattgacattgaggatactgcatct ctggtggttgaaaagttttgggattcgtatgttgacaaggaatttagtggaacgaacgaaatgaccatgacaagggagagcttctccag gtaaggacttctcatgaatattagtggcagattagtgttgttaaagtctttggttagataatcgatgcctcctaattgtccatgttttactggtttt ctacaattaaag gtggctttcgaaacaagagtcatctacagttggtcagttagcggactttaactttgtggatttgccggcagtagatgagtacaagcatatg atcaagagtcaaccaaagcaaaagttagacttgagtattcaagacgaatatcctgcattgcagacgatagtctaccattcgaaaaag atcaatgcgattttcggtccaatgttttcagaacttacgaggatgttactcgaaaggattgactcttcgaagtttctgttctacaccagaaag acacctgcacaaatagaggacttcttttctgacctagactcaacccaggcgatggaaattctggaactcgacatttcgaagtacgataa gtcacaaaacgagttccattgtgctgtagagtacaagatctgggaaaagttaggaattgatgagtggctagctgaggtctggaaacaa g gtgagttcctaagttccatttttttgtaatccttcaatgttattttaacttttcagatcaacatcaaaattaggttcaattttcatcaaccaaataat atttttcatgtatatatag gtcacagaaaaacgaccttgaaagattatacggccggaatcaaaacatgtctttggtatcaaaggaaaagtggtgatgtgacaacctt tattggtaataccatcatcattgccgcatgtttgagctcaatgatccccatgg SEQIDNo.6(fragmentofpICH15477,containing1Intronshowninunderlineditalics) Gttttagttttattgcaacaacaacaacaaattacaataacaacaaacaaaatacaaacaacaacaacatggcacaatttcaacaa acaattgacatgcaaactctccaagccgctgcgggacgcaacagcttggtgaatgatttggcatctcgtcgcgtttacgataatgcagt cgaggagctgaatgctcgttccagacgtcccaag gtaaaacaacatttcattcacatatatgaatacttttgtcattgagtacgaagaagacacttactacttgttgatgaaagtttccgcctttata cttatctatatcattttcatcatttcaaactagtatgaaattaggtgatgtttatatgatatcatggaacattaatctatagggaaactgttttgag ttagttttgtataatattittccctgtttgatgttag gttcatttctccaaggcagtgtctacggaacagacactgattgcaacaaacgcatatccggagttcgagatttcctttactcatacgcaat ccgctgtgcactccttggccggaggccttcggtcacttgagttggagtatctcatgatgcaagttccgttcggctctctgacctacgacatc ggcggaaacttctccgcgcacctcttcaaaggtaattttctttctctactcaattttctccaagatccaatatttgaagactgatctatagttaa aattaatctctactccattcttgttacctcaggtcgcgattacgttcactgctgcatgc: gttttagttttattgcaacaacaacaacaaattacaataacaacaaacaaaatacaaacaacaacaacatggcacaatttcaacaaa caattgacatgcaaactctccaagccgctgcgggacgcaacagcttggtgaatgatttggcatctcgtcgcgtttacgataatgcagtc gaggagctgaatgctcgttccagacgtcccaaggtaaaacaacatttcattcacatatatgaatacttttgtcattgagtacgaagaaga cacttactacttgttgatgaaagtttccgcctttatacttatctatatcattttcatcatttcaaactagtatgaaattaggtgatgtttatatgatat catggaacattaatctatagggaaactgttttgagttagttttgtataatatttttccctgtttgatgttaggttcatttctccaaggcagtgtctac ggaacagacactgattgcaacaaacgcatatccggagttcgagatttcctttactcatacgcaatccgctgtgcactccttggccggag gccttcggtcacttgagttggagtatctcatgatgcaagttccgttcggctctctgacctacgacatcggcggaaacttctccgcgcacct cttcaaaggtaattttctttctctactcaattttctccaagatccaatatttgaagactgatctatagttaaaattaatctctactccattcttgttac ctcaggtcgcgattacgttcactgctgcatgc SEQIDNo.7:T-DNAregionofpICH12691,whereinsequencesegmentshavethefollowing function: Nucleotides1to25:Leftborder(oppositestrand), Nucleotides86to1484:Nospromoter-NPTIIcodingsequence-Nosterminator(ontheopposite strand), Nucleotides1506to1552:AttPrecombinationsite(oppositestrand), Nucleotides1553to1599:intron5 part(oppositestrand), Nucleotides1600to2022:TVCVRdRP5 end(oppositestrand), Nucleotides2023to2809:Arabidopsisactin2promoter(oppositestrand), Nucleotides2836to2903:AttBrecombinationsite, Nucleotides2904to2959:intron3 part, Nucleotides2960to7991:TVCVRdRP3 part-MP5 part, Nucleotides7992to8168:cr-TMVMP3 end. Nucleotides8248to8967:GFPcodingsequence Nucleotides8961to9215:cr-TMV3 untranslatedregion, Nucleotides9234to9497:Nosterminator, Nucleotides9549to9473:T-DNArightborder(oppositestrand): tggcaggatatattgtggtgtaaacaaattgacgcttagacaacttaataacacattgcggacgtttttaatgtactggggtggatgcagg tcgatctagtaacatagatgacaccgcgcgcgataatttatcctagtttgcgcgctatattttgttttctatcgcgtattaaatgtataattgcg ggactctaatcataaaaacccatctcataaataacgtcatgcattacatgttaattattacatgcttaacgtaattcaacagaaattatatg ataatcatcgcaagaccggcaacaggattcaatcttaagaaactttattgccaaatgtttgaacgatctgcttgactctagatccagagtc ccgctcagaagaactcgtcaagaaggcgatagaaggcgatgcgctgcgaatcgggagcggcgataccgtaaagcacgaggaa gcggtcagcccattcgccgccaagctcttcagcaatatcacgggtagccaacgctatgtcctgatagcggtccgccacacccagccg gccacagtcgatgaatccagaaaagcggccatttttccaccatgatattcggcaagcaggcatcgccatgagtcacgacgagatcctc gccgtcgggcatacgcgccttgagcctggcgaacagttcggctggcgcgagcccctgatgctcttcgtccagatcatcctgatcgaca agaccggcttccatccgagtacgtgctcgctcgatgcgatgtttcgcttggtggtcgaatgggcaggtagccggatcaagcgtatgcag ccgccgcattgcatcagccatgatggatactttctcggcaggagcaaggtgagatgacaggagatcctgccccggcacttcgcccaa tagcagccagtcccttcccgcttcagtgacaacgtcgagcacagctgcgcaaggaacgcccgtcgtggccagccacgatagccgc gctgcctcgtcctggagttcattcagggcaccggacaggtcggtcttgacaaaaagaaccgggcgcccctgcgctgacagccggaa cacggcggcatcagagcagccgattgtctgttgtgcccagtcatagccgaatagcctctccacccaagcggccggagaacctgcgt gcaatccatcttgttcaatcatgcgaaacgatccagatccggtgcagattatttggattgagagtgaatatgagactctaattggataccg aggggaatttatggaacgtcagtggagcatttttgacaagaaatatttgctagctgatagtgaccttaggcgacttttgaacgcgcaata atggtttctgacgtatgtgcttagctcattaaactccagaaacccgcggctgagtggctccttcaacgttgcggttctgtcagttccaaacg taaaacggcttgtcccgcgtcatcggcgggggtcataacgtgactcccttaattctccgctcatggtaccagcttctcgagcgaccctac gcccccaactgagagaactcaaaggttaccccagttggggcacaacaaaaatcaaatctaaatttgtgtaattatgaaaatgaaactt acctttgaagaggtgcgcggagaagtttccgccgatgtcgtaggtcagagagccgaacggaacttgcatcatgagatactccaactc aagtgaccgaaggcctccggccaaggagtgcacagcggattgcgtatgagtaaaggaaatctcgaactccggatatgcgtttgttgc aatcagtgtctgttccgtagacactgccttggagaaatgaaccttgggacgtctggaacgagcattcagctcctcgactgcattatcgta aacgcgacgagatgccaaatcattcaccaagctgttgcgtcccgcagcggcttggagagtttgcatgtcaattgtttgttgaaattgtgcc atgttgttgttgtttgtattttgtttgttgttattgtaatttgttgttgttgttgcaataaaactaaaacttcaaagcggagaggaaaatatatgaattt atataggcgggtttatctcttacaactttattttcggcctttcaaaaaaataattaaaatcgacagacacgaatcatttcgaccacaggtaa agataacgtgacctggctgtcagacagccttttccctcgtgttaactaatttttaaactaattaatcatctcagcccttggattagttcttttgctt tgatggcttcatgactgtgacctgctcgatccgcgtgttacatgacagctccgtttttttagtggttaacttaaaccgagtcaatccaggcaa cgttagtcgtcgtcgtggttggcttgttcaattagatttcatacaattcaacgtaatttaattcgttttctattagaattgtatcataattattcag accgtgaaagaaagtgtctttcatgatgtgtttatggatatttatacaataagatacaatgtttcatcatattcactattcacgattagtatgta cattaaataatggctactactacatccgaactcgtcaaaacgattctgaatcaattatacatatgctgactcttgcatacataaaaaatag ttgtttaaattttgtctaactaatgtttggtataagtataatgttgagttgagataccaattacatcgagtctagccattttgtcgtgccatattcgt caaaactttcttacataatgataacctagatctagatgagatatgtatcaatgtatttgagatcataattaagttcgttctaaattttgtcgaaa cgcgtggtacgctgcagaattgctcgaagccgcggtgcgggtgccagggcgtgcccttgggctccccgggcgcgtactccacctcac ccatcttttattacatgtttgaacttcaacaatttatgactttttgttcttattgttgcaggtcgcgattacgttcactgctgcatgcctaatctggat gtacgtgacattgctcgccatgaaggacacaaggaagctatttacagttatgtgaatcgtttgaaaaggcagcagcgtcctgtgcctga ataccagagggcagctttcaacaactacgctgagaacccgcacttcgtccattgcgacaaacctttccaacagtgtgaattgacgaca gcgtatggcactgacacctacgctgtagctctccatagcatttatgatatccctgttgaggagttcggttctgcgctactcaggaagaatgt gaaaacttgtttcgcggcctttcatttccatgagaatatgcttctagattgtgatacagtcacactcgatgagattggagctacttttcagaa gtccggtgataatttaagttttttctttcataatgagagcactctcaattacacccacagttttagtaatataattaagtatgtgtgtaaaacgtt ctttcctgctagtcaacggtttgtgtatcataaggagtttttagttactagagtcaacacttggtactgtaagtttacgagagtggatacttttac tcttttccgtggtgtgtaccataataatgtggattgcgaagagttttacaaggctatggacgatgcgtggcactacaaaaagacgttagca atgcttaatgccgagaggaccatcttcaaggataacgctgcgttaaacttttggttcccgaaagtgagagacatggttatcgtccctctctt tgacgcttctatcacaactggtaggatgtctaggagagagattatggtgaacaaggatttcgtttatacggtcctaaatcacataaaaac gtatcaagctaaggctttaacttacgcaaatgttctgtcctttgtggagtctattaggtctagagtgataattaacggtgtcactgccaggtct gaatgggacacagacaaggcaattctaggtccattagcaatgacatttttccttataacaaagttgggtcatgtgcaggatgaaataatc ctgaaaaagttccagaagttcgacagaaccaccaatgagctgatttggacaagtctctgcgatgccctgatgggggttattccctcggt caaggagacgcttgtgcgcggtggttttgtgaaagtagcagaacaagccttagagataaaggttcccgagctatactgtacctttgccg acagattggtactacagtacaagaaggcggaggagttccaatcgtgtgatctttccaaacctctagaagagtcagagaagtactaca acgcattatccgagctatcagtgcttgagaatctcgactcttttgacttagaggcgtttaagactttatgtcagcagaagaatgtggaccc ggatatggcagcaaaggtggtcgtagcaatcatgaagtcagaattgacgttgcctttcaagaaacctacagaagaggaaatctcgga gtcgctaaaaccaggagaggggtcgtgtgcagagcataaggaagtgttgagcttacaaaatgatgctccgttcccgtgtgtgaaaaat ctagttgaaggttccgtgccggcgtatggaatgtgtcctaagggtggtgtttcgacaaattggatgtggacattgctgatttccatctcaa gagtgtagatgcagttaaaaagggaactatgatgtctgcggtgtacacagggtctatcaaagttcaacaaatgaagaactacatagat tacttaagtgcgtcgctggcagctacagtctcaaacctctgcaaggtgcttagagatgttcacggcgttgacccagagtcacaggaga aatctggagtgtgggatgttaggagaggacgttggttacttaaacctaatgcgaaaagtcacgcgtggggtgtggcagaagacgcca accacaagttggttattgtgttactcaactgggatgacggaaagccggtttgtgatgagacatggttcagggtggcggtgtcaagcgatt ccttgatatattcggatatgggaaaacttaagacgctcacgtcttgcagtccaaatggtgagccaccggagcctaacgccaaagtaatt ttggtcgatggtgttcccggttgtggaaaaacgaaggagattatcgaaaaggtaaacttctctgaagacttgattttagtccctgggaag gaagcttctaagatgatcatccggagggccaaccaagctggtgtgataagagcggataagagcaatgttagaacggtggattccttct tgatgcatccttctagaagggtgtttaagaggttgtttatcgatgaaggactaatgctgcatacaggttgtgtaaatttcctactgctgctatct caatgtgacgtcgcatatgtgtatggggacacaaagcaaattccgttcatttgcagagtcgcgaactttccgtatccagcgcattttgcaa aactcgtcgctgatgagaaggaggttagaagagttacgctcaggtgcccggctgatgttacgtatttccttaacaagaagtatgacggg gcggtgatgtgtaccagcgcggtagagagatccgtgaaggcagaagtggtgagaggaaagggtgcattgaacccaataaccttac cgttggagggtaaaattttgaccttcacacaagctgacaagttcgagttactggagaagggttacaaggatgtgaacactgtgcacga ggtgcaaggggagacgtacgagaagactgctattgtgcgcttgacatcaactccgttagagatcatatcgagtgcgtcacctcatgtttt ggtggcgctgacaagacacacaacgtgttgtaaatattacaccgttgtgttggacccgatggtgaatgtgatttcagaaatggagaagt tgtccaatttccttcttgacatgtatagagttgagcgggggtccaatagcaattacagatcgatgcagtattcagggacagcaacttgttt gttcagacgcccaagtcaggagattggcgagatatgcaattttactatgacgctcttcttcccggaaacagtactattctcaatgaatttga tgctgttacgatgaatttgagggatatttccttaaacgtcaaagattgcagaatcgacttctccaaatccgtgcaacttcctaaagaacaa cctattttcctcaagcctaaaataagaactgcggcagaaatgccgagaactgcaggtttgctggaaaatttggttgcaatgatcaaaag aaacatgaatgcgccggatttgacagggacaattgacattgaggatactgcatctctggtggttgaaaagttttgggattcgtatgttgac aaggaatttagtggaacgaacgaaatgaccatgacaagggaaagtttttctagatggctttcgaaacaagagtcatctacagttggtc agttagcggactttaactttgtggatttgccggcagtagatgagtacaagcatatgatcaagagtcaaccaaagcaaaagttagacttg agtattcaagacgaatatcctgcattgcagacgatagtctaccattcgaaaaagatcaatgcgattttcggtccaatgttttcagaactta cgaggatgttactcgaaaggattgactcttcgaagtttctgttctacaccagaaagacacctgcacaaatagaggacttcttttctgacct agactcaacccaggcgatggaaattctggaactcgacatttcgaagtacgataagtcacaaaacgagttccattgtgctgtagagtac aagatctgggaaaagttaggaattgatgagtggctagctgaggtatggaaacaaggacacagaaaaacgaccttgaaagattatac ggccggagtcaaaacatgtctttggtatcaaaggaaaagtggtgatgtgacaacctttattggtaataccatcatcattgcagcctgtttg agctcaatgatccccatggacaaagtgataaaggcagctttttgtggagacgatagcctgatttacattcctaaaggtttagacttgcctg atattcaggcgggcgcgaacctcatgtggaacttcgaggccaaactcttcaggaagaagtatggttacttctgtggtcgttatgttattca ccatgatagaggagccattgtgtattacgatccgcttaaactaatatctaagttaggttgtaaacatattagagatgttgttcacttagaag agttacgcgagtctttgtgtgatgtagctagtaacttaaataattgtgcgtatttttcacagttagatgaggccgttgccgaggttcataaga ccgcggtaggcggttcgtttgctttttgtagtataattaagtatttgtcagataagagattgtttagagatttgttctttgtttgataatgtcgatag tctcgtacgaacctaaggtgagtgatttcctcaatctttcgaagaaggaagagatcttgccgaaggctctaacgaggttaaaaaccgtg tctattagtactaaagatattatatctgtcaaggagtcggagactttgtgtgatatagatttgttaatcaatgtgccattagataagtatagat atgtgggtatcctaggagcgtgtttttaccggagagtggctagtgccagacttcgttaaaggtggagtgacgataagtgtgatagataagc gtctggtgaactcaaaggagtgcgtgattggtacgtacagagccgcagccaagagtaagaggttccagttcaaattgttccaaatta ctttgtgtccaccgtggacgcaaagaggaagccgtggcaggttcatgttcgtatacaagacttgaagattgaggcgggttggcagccg ttagctctggaagtagtttcagttgctatggtcaccaataacgttgtcatgaagggtttgagggaaaaggtcgtcgcaataaatgatccg gacgtcgaaggtttcgaaggtgtggttgacgaatcgtcgattcggttgcagcatttaaagcggttgacaactttaaaagaaggaaaaa gaaggttgaagaaaagggtgtagtaagtaagtataagtacagaccggagaagtacgccggtcctgattcgtttaatttgaaagaaga aaacgtcttacaacattacaaacccgaataatcgataactcgagtatttttacaacaattaccaacaacaacaaacaacaaacaaca ttacaattacatttacaattatcatggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggacggcgac gtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagttcatctgcacca ccggcaagctgcccgtgccctggcccaccctcgtgaccaccttcagctacggcgtgcagtgcttcagccgctaccccgaccacatga agcagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaag acccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaa catcctggggcacaagctggagtacaactacaacagccacaacgtctatatcatggccgacaagcagaagaacggcvatcaaggt gaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaacacccccatcggcgacg gccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatgg tcctgctggagttcgtgaccgccgccgggatcactcacggcatggacgagctgtacaagtaaagcggcccctagagcgtggtgcgc acgatagcgcatagtgtttttctctccacttgaatcgaagagatagacttacggtgtaaatccgtaggggtggcgtaaaccaaattacgc aatgttttgggttccatttaaatcgaaaccccttatttcctggatcacctgttaacgcacgtttgacgtgtattacagtgggaataagtaaaa gtgagaggttcgaatcctccctaaccccgggtaggggcccagcggccgctctagctagagtcaagcagatcgttcaaacatttggca ataaagtttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgttaagcatgtaataattaacatgta atgcatgacgttatttatgagatgggtttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgca aactaggataaattatcgcgcgcggtgtcatctatgttactagatcgaccagcttagatcagattgtcgtttcccgccttcagtttaaactat cagtgtttgacaggatatattggcgggtaaac